Most Cited 2025 "reward modeling" Papers
Dataset Distillation for Pre-Trained Self-Supervised Vision Models
George Cazenavette, Antonio Torralba, Vincent Sitzmann
How Does Label Noise Gradient Descent Improve Generalization in the Low SNR Regime?
Wei Huang, Andi Han, Yujin Song et al.
EraseFlow: Learning Concept Erasure Policies via GFlowNet-Driven Alignment
Naga Sai Abhiram Kusumba, Maitreya Patel, Kyle Min et al.
CompSlider: Compositional Slider for Disentangled Multiple-Attribute Image Generation
Zixin Zhu, Kevin Duarte, Mamshad Nayeem Rizve et al.
MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing
Langyu Wang, Yingying Chen et al.
IAP: Invisible Adversarial Patch Attack through Perceptibility-Aware Localization and Perturbation Optimization
Subrat Kishore Dutta, Xiao Zhang
Learning Yourself: Class-Incremental Semantic Segmentation with Language-Inspired Bootstrapped Disentanglement
Ruitao Wu, Yifan Zhao, Jia Li
GraphBridge: Towards Arbitrary Transfer Learning in GNNs
Li Ju, Xingyi Yang, Qi Li et al.
Strategic Cost Selection in Participatory Budgeting
Piotr Faliszewski, Łukasz Janeczko, Andrzej Kaczmarczyk et al.
Linear Differential Vision Transformer: Learning Visual Contrasts via Pairwise Differentials
Yifan Pu, Jixuan Ying, Qixiu Li et al.
Local Loss Optimization in the Infinite Width: Stable Parameterization of Predictive Coding Networks and Target Propagation
Satoki Ishikawa, Rio Yokota, Ryo Karakida
Towards Privacy-preserved Pre-training of Remote Sensing Foundation Models with Federated Mutual-guidance Learning
Jieyi Tan, Chengwei Zhang, Bo Dang et al.
VL-SAE: Interpreting and Enhancing Vision-Language Alignment with a Unified Concept Set
Shufan Shen, Junshu Sun, Qingming Huang et al.
From Synapses to Dynamics: Obtaining Function from Structure in a Connectome Constrained Model of the Head Direction Circuit
Sunny Duan, Ling L. Dong, Ila Fiete
Human-in-the-Loop Local Corrections of 3D Scene Layouts via Infilling
Christopher Xie, Armen Avetisyan, Henry Howard-Jenkins et al.
ProGait: A Multi-Purpose Video Dataset and Benchmark for Transfemoral Prosthesis Users
Xiangyu Yin, Boyuan Yang, Weichen Liu et al.
Regret-Optimal Q-Learning with Low Cost for Single-Agent and Federated Reinforcement Learning
Haochen Zhang, Zhong Zheng, Lingzhou Xue
Computational Efficiency under Covariate Shift in Kernel Ridge Regression
Andrea Della Vecchia, Arnaud Mavakala Watusadisi, Ernesto De Vito et al.
Learning to See in the Extremely Dark
Hai Jiang, Binhao Guan, Zhen Liu et al.
EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models
Jialiang Cheng, Ning Gao, Yun Yue et al.
ReMindRAG: Low-Cost LLM-Guided Knowledge Graph Traversal for Efficient RAG
Yikuan Hu, Jifeng Zhu, Lanrui Tang et al.
Equipping Vision Foundation Model with Mixture of Experts for Out-of-Distribution Detection
Shizhen Zhao, Jiahui Liu, Xin Wen et al.
Evaluating Program Semantics Reasoning with Type Inference in System $F$
Yifeng He, Luning Yang, Christopher Gonzalo et al.
MFogHub: Bridging Multi-Regional and Multi-Satellite Data for Global Marine Fog Detection and Forecasting
Mengqiu XU, Kaixin Chen, Heng Guo et al.
Optimal kernel regression bounds under energy-bounded noise
Amon Lahr, Johannes Köhler, Anna Scampicchio et al.
Beyond Low-Rank Tuning: Model Prior-Guided Rank Allocation for Effective Transfer in Low-Data and Large-Gap Regimes
Chuyan Zhang, Kefan Wang, Yun Gu
Deep Value Benchmark: Measuring Whether Models Generalize Deep Values or Shallow Preferences
Joshua Ashkinaze, Hua Shen, Saipranav Avula et al.
Towards Interpretable and Efficient Attention: Compressing All by Contracting a Few
Qishuai Wen, Zhiyuan Huang, Chun-Guang Li
Pruning Spurious Subgraphs for Graph Out-of-Distribution Generalization
Tianjun Yao, Haoxuan Li, Yongqiang Chen et al.
Learning on the Go: A Meta-learning Object Navigation Model
Xiaorong Qin, Xinhang Song, Sixian Zhang et al.
Agnostic Learning under Targeted Poisoning: Optimal Rates and the Role of Randomness
Bogdan Chornomaz, Yonatan Koren, Shay Moran et al.
Disentangling Superpositions: Interpretable Brain Encoding Model with Sparse Concept Atoms
Alicia Zeng, Jack Gallant
Domain Generalizable Portrait Style Transfer
Xinbo Wang, Wenju Xu, Qing Zhang et al.
Hybrid-TTA: Continual Test-time Adaptation via Dynamic Domain Shift Detection
Hyewon Park, Hyejin Park, Jueun Ko et al.
Data-free Universal Adversarial Perturbation with Pseudo-semantic Prior
Chanhui Lee, Yeonghwan Song, Jeany Son
Feature Distillation is the Better Choice for Model-Heterogeneous Federated Learning
Yichen Li, Xiuying Wang, Wenchao Xu et al.
Language-Assisted Debiasing and Smoothing for Foundation Model-Based Semi-Supervised Learning
Na Zheng, Xuemeng Song, Xue Dong et al.
A Unified Framework for Provably Efficient Algorithms to Estimate Shapley Values
Tyler Chen, Akshay Seshadri, Mattia Jacopo Villani et al.
Preference Learning with Response Time: Robust Losses and Guarantees
Ayush Sawarni, Sahasrajit Sarmasarkar, Vasilis Syrgkanis
Neural Collapse under Gradient Flow on Shallow ReLU Networks for Orthogonally Separable Data
Hancheng Min, Zhihui Zhu, Rene Vidal
SpiLiFormer: Enhancing Spiking Transformers with Lateral Inhibition
Zeqi Zheng, Yanchen Huang, Yingchao Yu et al.
Depth AnyEvent: A Cross-Modal Distillation Paradigm for Event-Based Monocular Depth Estimation
Luca Bartolomei, Enrico Mannocci, Fabio Tosi et al.
Embodied Navigation with Auxiliary Task of Action Description Prediction
Haru Kondoh, Asako Kanezaki
3D Visual Illusion Depth Estimation
Chengtang Yao, Zhidan Liu, Jiaxi Zeng et al.
Heterogeneous Adversarial Play in Interactive Environments
Manjie Xu, Xinyi Yang, Jiayu Zhan et al.
D-Attn: Decomposed Attention for Large Vision-and-Language Model
Chia-Wen Kuo, Sijie Zhu, Fan Chen et al.
Spurious-Aware Prototype Refinement for Reliable Out-of-Distribution Detection
Reihaneh Zohrabi, Hosein Hasani, Mahdieh Soleymani et al.
Multi-modal Topology-embedded Graph Learning for Spatially Resolved Genes Prediction from Pathology Images with Prior Gene Similarity Information
Hang Shi, Chi Changxi, Peng Wan et al.
VaMP: Variational Multi-Modal Prompt Learning for Vision-Language Models
Silin Cheng, Kai Han
Boosting Adversarial Transferability with Spatial Adversarial Alignment
Zhaoyu Chen, HaiJing Guo, Kaixun Jiang et al.
Mamba Goes HoME: Hierarchical Soft Mixture-of-Experts for 3D Medical Image Segmentation
Szymon Płotka, Gizem Mert, Maciej Chrabaszcz et al.
Separation Power of Equivariant Neural Networks
Marco Pacini, Xiaowen Dong, Bruno Lepri et al.
ClearSight: Human Vision-Inspired Solutions for Event-Based Motion Deblurring
Xiaopeng LIN, Yulong Huang, Hongwei Ren et al.
OmniSegmentor: A Flexible Multi-Modal Learning Framework for Semantic Segmentation
Bo-Wen Yin, Jiao-Long Cao, Xuying Zhang et al.
Clique Number Estimation via Differentiable Functions of Adjacency Matrix Permutations
Indradyumna Roy, Eeshaan Jain, Soumen Chakrabarti et al.
Cue3D: Quantifying the Role of Image Cues in Single-Image 3D Generation
Xiang Li, Zirui Wang, Zixuan Huang et al.
Spiking Neural Networks Need High-Frequency Information
Yuetong Fang, Deming Zhou, Ziqing Wang et al.
GeneFlow: Translation of Single-cell Gene Expression to Histopathological Images via Rectified Flow
Mengbo Wang, Shourya Verma, Aditya Malusare et al.
DoDo-Code: an Efficient Levenshtein Distance Embedding-based Code for 4-ary IDS Channel
Alan J.X. Guo, Sihan Sun, Xiang Wei et al.
VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion
Zhiwei Lin, Yongtao Wang
Less Is More, but Where? Dynamic Token Compression via LLM-Guided Keyframe Prior
Yulin Li, Haokun GUI, Ziyang Fan et al.
Hyper-GoalNet: Goal-Conditioned Manipulation Policy Learning with HyperNetworks
Pei Zhou, Wanting Yao, Qian Luo et al.
ResidualViT for Efficient Temporally Dense Video Encoding
Mattia Soldan, Fabian Caba Heilbron, Bernard Ghanem et al.
Stability and Sharper Risk Bounds with Convergence Rate $\tilde{O}(1/n^2)$
Bowei Zhu, Shaojie Li, Mingyang Yi et al.
PAVE: Patching and Adapting Video Large Language Models
Zhuoming Liu, Yiquan Li, Khoi D Nguyen et al.
Revisiting Point Cloud Completion: Are We Ready For The Real-World?
Stuti Pathak, Prashant Kumar, Dheeraj Baiju et al.
Scalable Valuation of Human Feedback through Provably Robust Model Alignment
Masahiro Fujisawa, Masaki Adachi, Michael A Osborne
AutoOpt: A Dataset and a Unified Framework for Automating Optimization Problem Solving
Ankur Sinha, Shobhit Arora, Dhaval Pujara
Stabilizing LTI Systems under Partial Observability: Sample Complexity and Fundamental Limits
Ziyi Zhang, Yorie Nakahira, Guannan Qu
HyperPose: Hypernetwork-Infused Camera Pose Localization and an Extended Cambridge Landmarks Dataset
Ron Ferens, Yosi Keller
Contextual Dynamic Pricing with Heterogeneous Buyers
Thodoris Lykouris, Sloan Nietert, Princewill Okoroafor et al.
Continuous Domain Generalization
Zekun CAI, Yiheng YAO, Guangji Bai et al.
Doubly-Robust Estimation of Counterfactual Policy Mean Embeddings
Houssam Zenati, Bariscan Bozkurt, Arthur Gretton
PLMP - Point-Line Minimal Problems for Projective SfM
Kim Kiehn, Albin Ahlbäck, Kathlén Kohn
Online Dense Point Tracking with Streaming Memory
Qiaole Dong, Yanwei Fu
Jamais Vu: Exposing the Generalization Gap in Supervised Semantic Correspondence
Octave Mariotti, Zhipeng Du, Yash Bhalgat et al.
CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching
Chen Chen, Pengsheng Guo, Liangchen Song et al.
Securing the Language of Life: Inheritable Watermarks from DNA Language Models to Proteins
ZAIXI ZHANG, Ruofan Jin, Le Cong et al.
Identity-preserving Distillation Sampling by Fixed-Point Iterator
SeonHwa Kim, Jiwon Kim, Soobin Park et al.
Prioritizing Perception-Guided Self-Supervision: A New Paradigm for Causal Modeling in End-to-End Autonomous Driving
Yi Huang, Zhan Qu, Lihui Jiang et al.
Incremental Object Keypoint Learning
Mingfu Liang, Jiahuan Zhou, Xu Zou et al.
Stop Learning it all to Mitigate Visual Hallucination, Focus on the Hallucination Target
Dokyoon Yoon, Youngsook Song, Woomyoung Park
GeoProg3D: Compositional Visual Reasoning for City-Scale 3D Language Fields
Shunsuke Yasuki, Taiki Miyanishi, Nakamasa Inoue et al.
REMI: Reconstructing Episodic Memory During Internally Driven Path Planning
Zhaoze Wang, Genela Morris, Dori Derdikman et al.
Physics-informed Value Learner for Offline Goal-Conditioned Reinforcement Learning
Vittorio Giammarino, Ruiqi Ni, Ahmed Qureshi
Can Large Multimodal Models Understand Agricultural Scenes? Benchmarking with AgroMind
Qingmei Li, Yang Zhang, Zurong Mai et al.
Reinforcement Learning with Imperfect Transition Predictions: A Bellman-Jensen Approach
Chenbei Lu, Zaiwei Chen, Tongxin Li et al.
Simple Distillation for One-Step Diffusion Models
Huaisheng Zhu, Teng Xiao, Shijie Zhou et al.
Semantix: An Energy-guided Sampler for Semantic Style Transfer
Huiang He, Minghui HU, Chuanxia Zheng et al.
FineRS: Fine-grained Reasoning and Segmentation of Small Objects with Reinforcement Learning
Lu Zhang, Jiazuo Yu, Haomiao Xiong et al.
Continual Optimization with Symmetry Teleportation for Multi-Task Learning
Zhipeng Zhou, Ziqiao Meng, Pengcheng Wu et al.
AMBER: Adaptive Mesh Generation by Iterative Mesh Resolution Prediction
Niklas Freymuth, Tobias Würth, Nicolas Schreiber et al.
ALMGuard: Safety Shortcuts and Where to Find Them as Guardrails for Audio–Language Models
Weifei Jin, Yuxin Cao, Junjie Su et al.
FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens
Yiming Zhong, Yumeng Liu, Chuyang Xiao et al.
DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization
YUANTIAN SHAO, Yuanteng Chen, Peisong Wang et al.
Masked Diffusion Models as Energy Minimization
Sitong Chen, Shen Nie, Jiacheng Sun et al.
Matching Markets Meet LLMs: Algorithmic Reasoning with Ranked Preferences
Hadi Hosseini, Samarth Khanna, Ronak Singh
RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Chest X-ray with Zero-Shot Multi-Task Capability
Jonggwon Park, Byungmu Yoon, Soobum Kim et al.
Learning Latent Variable Models via Jarzynski-adjusted Langevin Algorithm
James Cuin, Davide Carbone, O. Deniz Akyildiz
Robust Unfolding Network for HDR Imaging with Modulo Cameras
Zhile Chen, Hui Ji
Mechanism Design via the Interim Relaxation
Kshipra Bhawalkar, Marios Mertzanidis, Divyarthi Mohan et al.
Mind the GAP! The Challenges of Scale in Pixel-based Deep Reinforcement Learning
Ghada Sokar, Pablo Samuel Castro
Discrete Spatial Diffusion: Intensity-Preserving Diffusion Modeling
Javier E. Santos, Agnese Marcato, Roman Colman et al.
LiveStar: Live Streaming Assistant for Real-World Online Video Understanding
Zhenyu Yang, Kairui Zhang, Yuhang Hu et al.
SimWorld: An Open-ended Simulator for Agents in Physical and Social Worlds
Xiaokang Ye, Jiawei Ren, Yan Zhuang et al.
CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction
Yuan Zhou, Qingshan Xu, Jiequan Cui et al.
Taught Well Learned Ill: Towards Distillation-conditional Backdoor Attack
Yukun Chen, Boheng Li, Yu Yuan et al.
Knowledge Graph Enhanced Generative Multi-modal Models for Class-Incremental Learning
Xusheng Cao, Haori Lu, Linlan Huang et al.
DialNav: Multi-turn Dialog Navigation with a Remote Guide
Leekyeung Han, Hyunji Min, Gyeom Hwangbo et al.
Neural Compression for 3D Geometry Sets
Siyu Ren, Junhui Hou, Weiyao Lin et al.
UNEM: UNrolled Generalized EM for Transductive Few-Shot Learning
Long Zhou, Fereshteh Shakeri, Aymen Sadraoui et al.
GraphTOP: Graph Topology-Oriented Prompting for Graph Neural Networks
Xingbo Fu, Zhenyu Lei, Zihan Chen et al.
Self-Calibrating BCIs: Ranking and Recovery of Mental Targets Without Labels
Jonathan Grizou, Carlos De la Torre-Ortiz, Tuukka Ruotsalo
HoliTracer: Holistic Vectorization of Geographic Objects from Large-Size Remote Sensing Imagery
Yu Wang, Bo Dang, Wanchun Li et al.
InstructRestore: Region-Customized Image Restoration with Human Instructions
Shuaizheng Liu, Jianqi Ma, Lingchen Sun et al.
Feature Spectrum Learning for Remote Sensing Change Detection
Qi Zang, Dong Zhao, Shuang Wang et al.
Divide-and-Conquer for Enhancing Unlabeled Learning, Stability, and Plasticity in Semi-supervised Continual Learning
Yue Duan, Taicai Chen, Lei Qi et al.
TRACE: Contrastive learning for multi-trial time series data in neuroscience
Lisa Schmors, Dominic Gonschorek, Jan Niklas Böhm et al.
The Burden of Interactive Alignment with Inconsistent Preferences
Ali Shirali
Latent Space Imaging
Matheus Souza, Yidan Zheng, Kaizhang Kang et al.
Sparse-Dense Side-Tuner for efficient Video Temporal Grounding
David Pujol-Perich, Sergio Escalera, Albert Clapés
Hawk: Leveraging Spatial Context for Faster Autoregressive Text-to-Image Generation
Zhi-Kai Chen, Jun-Peng Jiang, Han-Jia Ye et al.
Can We Ignore Labels in Out of Distribution Detection?
Hong Yang, Qi Yu, Travis Desell
Towards Efficient General Feature Prediction in Masked Skeleton Modeling
Shengkai Sun, Zefan Zhang, Jianfeng Dong et al.
GUI-Rise: Structured Reasoning and History Summarization for GUI Navigation
Tao Liu, Chongyu Wang, Rongjie Li et al.
Align-A-Video: Deterministic Reward Tuning of Image Diffusion Models for Consistent Video Editing
Shengzhi Wang, Yingkang Zhong, Jiangchuan Mu et al.
Transferring Linear Features Across Language Models With Model Stitching
Alan Chen, Jack Merullo, Alessandro Stolfo et al.
T2Bs: Text-to-Character Blendshapes via Video Generation
Jiahao Luo, Chaoyang Wang, Michael Vasilkovsky et al.
Fish2Mesh Transformer: 3D Human Mesh Recovery from Egocentric Vision
Tianma Shen, Aditya Shrish Puranik, James Vong et al.
FED-PsyAU: Privacy-Preserving Micro-Expression Recognition via Psychological AU Coordination and Dynamic Facial Motion Modeling
Jingting Li, Yu Qian, Lin Zhao et al.
Visual Relation Diffusion for Human-Object Interaction Detection
Ping Cao, Yepeng Tang, Chunjie Zhang et al.
CA-I2P: Channel-Adaptive Registration Network with Global Optimal Selection
Zhixin Cheng, Jiacheng Deng, Xinjun Li et al.
ZEUS: Zero-shot Embeddings for Unsupervised Separation of Tabular Data
Patryk Marszałek, Tomasz Kuśmierczyk, Witold Wydmański et al.
The Cost of Compression: Tight Quadratic Black-Box Attacks on Sketches for $\ell_2$ Norm Estimation
Sara Ahmadian, Edith Cohen, Uri Stemmer
Image as a World: Generating Interactive World from Single Image via Panoramic Video Generation
Dongnan Gui, Xun Guo, Wengang Zhou et al.
Acceleration via silver step-size on Riemannian manifolds with applications to Wasserstein space
Jiyoung Park, Abhishek Roy, Jonathan W. Siegel et al.
CTSketch: Compositional Tensor Sketching for Scalable Neurosymbolic Learning
Seewon Choi, Alaia Solko-Breslin, Rajeev Alur et al.
Latent Swap Joint Diffusion for 2D Long-Form Latent Generation
Yusheng Dai, Chenxi Wang, Chang Li et al.
Uncoupled and Convergent Learning in Monotone Games under Bandit Feedback
Jing Dong, Baoxiang Wang, Yaoliang Yu
ConcreTizer: Model Inversion Attack via Occupancy Classification and Dispersion Control for 3D Point Cloud Restoration
Youngseok Kim, Sunwook Hwang, Hyung-Sin Kim et al.
MaskHand: Generative Masked Modeling for Robust Hand Mesh Reconstruction in the Wild
Muhammad Usama Saleem, Ekkasit Pinyoanuntapong, Mayur Patel et al.
RAGD: Regional-Aware Diffusion Model for Text-to-Image Generation
Chen Zhennan, Yajie Li, Haofan Wang et al.
Protein Inverse Folding From Structure Feedback
Junde Xu, Zijun Gao, Xinyi Zhou et al.
WildCAT3D: Appearance-Aware Multi-View Diffusion in the Wild
Morris Alper, David Novotny, Filippos Kokkinos et al.
SALAD -- Semantics-Aware Logical Anomaly Detection
Matic Fučka, Vitjan Zavrtanik, Danijel Skocaj
ALTER: All-in-One Layer Pruning and Temporal Expert Routing for Efficient Diffusion Generation
Xiaomeng Yang, LEI LU, Qihui Fan et al.
PixPerfect: Seamless Latent Diffusion Local Editing with Discriminative Pixel-Space Refinement
Haitian Zheng, Yuan Yao, Yongsheng Yu et al.
Dataset Distillation as Data Compression: A Rate-Utility Perspective
Youneng Bao, Yiping Liu, Zhuo Chen et al.
GTPBD: A Fine-Grained Global Terraced Parcel and Boundary Dataset
Zhiwei Zhang, Zi Ye, Yibin Wen et al.
Graph Your Own Prompt
Xi Ding, Lei Wang, Piotr Koniusz et al.
Degrees of Freedom for Linear Attention: Distilling Softmax Attention with Optimal Feature Efficiency
Naoki Nishikawa, Rei Higuchi, Taiji Suzuki
CALM: Culturally Self-Aware Language Models
Lingzhi Shen, Xiaohao Cai, Yunfei Long et al.
Missing Data Imputation by Reducing Mutual Information with Rectified Flows
Jiahao Yu, Qizhen Ying, Leyang Wang et al.
Variance-Aware Feel-Good Thompson Sampling for Contextual Bandits
Xuheng Li, Quanquan Gu
Redefining Experts: Interpretable Decomposition of Language Models for Toxicity Mitigation
Zuhair Hasan Shaik, Abdullah Mazhar, Aseem Srivastava et al.
Soft Self-labeling and Potts Relaxations for Weakly-supervised Segmentation
Zhongwen Zhang, Yuri Boykov
Synchronizing Task Behavior: Aligning Multiple Tasks during Test-Time Training
Wooseong Jeong, Jegyeong Cho, Youngho Yoon et al.
Plug-in Feedback Self-adaptive Attention in CLIP for Training-free Open-Vocabulary Segmentation
Zhixiang Chi, Yanan Wu, Li Gu et al.
Conditional Latent Diffusion Models for Zero-Shot Instance Segmentation
Maximilian Ulmer, Wout Boerdijk, Rudolph Triebel et al.
Perspective-aware 3D Gaussian Inpainting with Multi-view Consistency
Yuxin CHENG, Binxiao Huang, Taiqiang Wu et al.
Breaking Rectangular Shackles: Cross-View Object Segmentation for Fine-Grained Object Geo-Localization
Qingwang Zhang, Yingying Zhu
Multimodal Negative Learning
Baoquan Gong, Xiyuan Gao, Pengfei Zhu et al.
Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs
Zitian Wang, Yue Liao, RONG KANG et al.
SCSA: A Plug-and-Play Semantic Continuous-Sparse Attention for Arbitrary Semantic Style Transfer
Chunnan Shang, Zhizhong Wang, Hongwei Wang et al.
Towards Explicit Geometry-Reflectance Collaboration for Generalized LiDAR Segmentation in Adverse Weather
Longyu Yang, Ping Hu, Shangbo Yuan et al.
OpenBox: Annotate Any Bounding Boxes in 3D
In-Jae Lee, Mungyeom Kim, Kwonyoung Ryu et al.
Touch2Shape: Touch-Conditioned 3D Diffusion for Shape Exploration and Reconstruction
Yuanbo Wang, Zhaoxuan Zhang, Jiajin Qiu et al.
Rethinking Approximate Gaussian Inference in Classification
Bálint Mucsányi, Nathaël Da Costa, Philipp Hennig
Black-Box Membership Inference Attack for LVLMs via Prior Knowledge-Calibrated Memory Probing
Jinhua Yin, Peiru Yang, Chen Yang et al.
MAGE: Single Image to Material-Aware 3D via the Multi-View G-Buffer Estimation Model
Haoyuan Wang, Zhenwei Wang, Xiaoxiao Long et al.
Controllable Human-centric Keyframe Interpolation with Generative Prior
Zujin Guo, Size Wu, Zhongang Cai et al.
OmniDiff: A Comprehensive Benchmark for Fine-grained Image Difference Captioning
Yuan Liu, Saihui Hou, Saijie Hou et al.
Smooth Sailing: Lipschitz-Driven Uncertainty Quantification for Spatial Associations
David Burt, Renato Berlinghieri, Stephen Bates et al.
Hierarchical Visual Prompt Learning for Continual Video Instance Segmentation
Jiahua Dong, Hui Yin, Wenqi Liang et al.
Blind Video Super-Resolution based on Implicit Kernels
Qiang Zhu, Yuxuan Jiang, Shuyuan Zhu et al.
Pan-LUT: Efficient Pan-sharpening via Learnable Look-Up Tables
Zhongnan Cai, Yingying Wang, Hui Zheng et al.
PersPose: 3D Human Pose Estimation with Perspective Encoding and Perspective Rotation
Xiaoyang Hao, Han Li
Audits Under Resource, Data, and Access Constraints: Scaling Laws For Less Discriminatory Alternatives
Sarah Cen, Salil Goyal, Zaynah Javed et al.
A Unified Framework for Variable Selection in Model-Based Clustering with Missing Not at Random
Binh Ho, Long Nguyen-Chi, TrungTin Nguyen et al.
Diffusing DeBias: Synthetic Bias Amplification for Model Debiasing
Massimiliano Ciranni, Vito Paolo Pastore, Roberto Di Via et al.
Learning Dynamics of RNNs in Closed-Loop Environments
Yoav Ger, Omri Barak
Puzzles: Unbounded Video-Depth Augmentation for Scalable End-to-End 3D Reconstruction
Jiahao Ma, Lei Wang, Miaomiao Liu et al.
Video Color Grading via Look-Up Table Generation
Seunghyun Shin, Dongmin Shin, Jisu Shin et al.
Recover and Match: Open-Vocabulary Multi-Label Recognition through Knowledge-Constrained Optimal Transport
Hao Tan, Zichang Tan, Jun Li et al.
RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration
Chong Cheng, Yu Hu, Sicheng Yu et al.
Membership Inference Attacks with False Discovery Rate Control
Chenxu Zhao, Wei Qian, Aobo Chen et al.
Dimension-free Score Matching and Time Bootstrapping for Diffusion Models
Syamantak Kumar, Dheeraj Nagaraj, Purnamrita Sarkar
Active Measurement: Efficient Estimation at Scale
Max Hamilton, Jinlin Lai, Wenlong Zhao et al.
Perturbation Bounds for Low-Rank Inverse Approximations under Noise
Phuc Tran, Nisheeth K. Vishnoi
LEDiT: Your Length-Extrapolatable Diffusion Transformer without Positional Encoding
Shen Zhang, Siyuan Liang, Yaning Tan et al.
Efficient Large Language Model Inference with Neural Block Linearization
Mete Erdogan, Francesco Tonin, Volkan Cevher
Generalizable Reasoning through Compositional Energy Minimization
Alexandru Oarga, Yilun Du
Hipandas: Hyperspectral Image Joint Denoising and Super-Resolution by Image Fusion with the Panchromatic Image
Shuang Xu, Zixiang Zhao, Haowen Bai et al.
Adaptive Divergence Regularized Policy Optimization for Fine-tuning Generative Models
Jiajun Fan, Tong Wei, Chaoran Cheng et al.
FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models
Yuxuan Wang, Tianwei Cao, Huayu Zhang et al.
Flex-Judge: Text-Only Reasoning Unleashes Zero-Shot Multimodal Evaluators
Jongwoo Ko, Sungnyun Kim, Sungwoo Cho et al.
A Controllable Examination for Long-Context Language Models
Yijun Yang, Zeyu Huang, Wenhao Zhu et al.
Training-Free Class Purification for Open-Vocabulary Semantic Segmentation
Qi Chen, Lingxiao Yang, Yun Chen et al.
Identifying interactions across brain areas while accounting for individual-neuron dynamics with a Transformer-based variational autoencoder
Qi Xin, Robert E Kass
GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation Scaling
Tianhao Chen, Xin Xu, Zijing Liu et al.