Most Cited 2025 "key-value state reuse" Papers

NEURIPS 2025 arXiv:2502.02494

#15002

Analyzing Similarity Metrics for Data Selection for Language Model Pretraining

Dylan Sam, Ayan Chakrabarti, Afshin Rostamizadeh et al.

ICCV 2025 arXiv:2508.15256

#15003

Normal and Abnormal Pathology Knowledge-Augmented Vision-Language Model for Anomaly Detection in Pathology Images

Jinsol Song, Jiamu Wang, Anh Nguyen et al.

NEURIPS 2025 arXiv:2507.11690

#15004

The Impact of Coreset Selection on Spurious Correlations and Group Robustness

Amaya Dharmasiri, William Yang, Polina Kirichenko et al.

#15005

UMFN: Unified Multi-Domain Face Normalization for Joint Cross-domain Prototype Learning and Heterogeneous Face Recognition

Meng Pang, Wenjun Zhang, Nanrun Zhou et al.

NEURIPS 2025 oral arXiv:2505.14905

#15006

Concept Incongruence: An Exploration of Time and Death in Role Playing

Xiaoyan Bai, Ike Peng, Aditya Singh et al.

#15007

OpticalNet: An Optical Imaging Dataset and Benchmark Beyond the Diffraction Limit

Benquan Wang, Ruyi An, Jin-Kyu So et al.

CVPR 2025 highlight

NEURIPS 2025 arXiv:2511.00457

#15008

GraphChain: Large Language Models for Large-scale Graph Analysis via Tool Chaining

Chunyu Wei, Wenji Hu, Xingjia Hao et al.

NEURIPS 2025 arXiv:2505.06535

#15009

Online Feedback Efficient Active Target Discovery in Partially Observable Environments

Anindya Sarkar, Binglin Ji, Yevgeniy Vorobeychik

ICCV 2025 arXiv:2508.20265

#15010

Plug-in Feedback Self-adaptive Attention in CLIP for Training-free Open-Vocabulary Segmentation

Zhixiang Chi, Yanan Wu, Li Gu et al.

NEURIPS 2025 arXiv:2510.17526

#15011

How Does Label Noise Gradient Descent Improve Generalization in the Low SNR Regime?

Wei Huang, Andi Han, Yujin Song et al.

NEURIPS 2025 arXiv:2510.18768

#15012

Improving the Generation and Evaluation of Synthetic Data for Downstream Medical Causal Inference

Harry Amad, Zhaozhi Qian, Dennis Frauen et al.

NEURIPS 2025 arXiv:2412.04752

#15013

Graph Neural Network Based Action Ranking for Planning

Rajesh Mangannavar, Stefan Lee, Alan Fern et al.

#15014

Self-Supervised Learning for Color Spike Camera Reconstruction

Yanchen Dong, Ruiqin Xiong, Xiaopeng Fan et al.

NEURIPS 2025 arXiv:2502.04670

#15015

CCS: Controllable and Constrained Sampling with Diffusion Models via Initial Noise Perturbation

Bowen Song, Zecheng Zhang, Zhaoxu Luo et al.

NEURIPS 2025 arXiv:2505.10311

#15016

Whitened Score Diffusion: A Structured Prior for Imaging Inverse Problems

Jeffrey Alido, Tongyu Li, Yu Sun et al.

NEURIPS 2025 arXiv:2506.12945

#15017

Metropolis-Hastings Sampling for 3D Gaussian Reconstruction

Hyunjin Kim, Haebeom Jung, Jaesik Park

ICCV 2025 arXiv:2509.16970

#15018

LLM-Assisted Semantic Guidance for Sparsely Annotated Remote Sensing Object Detection

Wei Liao, Chunyan Xu, Chenxu Wang et al.

ICCV 2025 arXiv:2504.07955

#15019

BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation

Yuanhong Yu, Xingyi He, Chen Zhao et al.

NEURIPS 2025 oral arXiv:2510.21167

#15020

Blockwise Flow Matching: Improving Flow Matching Models For Efficient High-Quality Generation

Dogyun Park, Taehoon Lee, Minseok Joo et al.

ICCV 2025 arXiv:2504.14371

#15021

Efficient Spiking Point Mamba for Point Cloud Analysis

Peixi Wu, Bosong Chai, Menghua Zheng et al.

NEURIPS 2025 arXiv:2504.09184

#15022

Parameterized Synthetic Text Generation with SimpleStories

Lennart Finke, Chandan Sreedhara, Thomas Dooms et al.

NEURIPS 2025 arXiv:2508.15071

#15023

Enhancing Optimizer Stability: Momentum Adaptation of The NGN Step-size

Rustem Islamov, Niccolò Ajroldi, Antonio Orvieto et al.

ICCV 2025 arXiv:2506.23152

#15024

DexH2R: A Benchmark for Dynamic Dexterous Grasping in Human-to-Robot Handover

Youzhuo Wang, jiayi ye, Chuyang Xiao et al.

ICCV 2025 arXiv:2508.05123

#15025

Latent Expression Generation for Referring Image Segmentation and Grounding

Seonghoon Yu, Junbeom Hong, Joonseok Lee et al.

NEURIPS 2025 arXiv:2510.13193

#15026

ReMindRAG: Low-Cost LLM-Guided Knowledge Graph Traversal for Efficient RAG

Yikuan Hu, Jifeng Zhu, Lanrui Tang et al.

#15027

Intermediate Connectors and Geometric Priors for Language-Guided Affordance Segmentation on Unseen Object Categories

Yicong Li, Yiyang Chen, Zhenyuan Ma et al.

ICCV 2025 arXiv:2508.06895

#15028

BASIC: Boosting Visual Alignment with Intrinsic Refined Embeddings in Multimodal Large Language Models

Jianting Tang, Yubo Wang, Haoyu Cao et al.

#15029

STEPS: Sequential Probability Tensor Estimation for Text-to-Image Hard Prompt Search

Yuning Qiu, Andong Wang, Chao Li et al.

NEURIPS 2025 spotlight arXiv:2506.03075

#15030

Agnostic Learning under Targeted Poisoning: Optimal Rates and the Role of Randomness

Bogdan Chornomaz, Yonatan Koren, Shay Moran et al.

ICCV 2025 arXiv:2508.04090

#15031

Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework

Yi-Ting Chen, Ting-Hsuan Liao, Pengsheng Guo et al.

NEURIPS 2025 arXiv:2512.23853

#15032

Flow Matching Neural Processes

Hussen Abu Hamad, Dan Rosenbaum

#15033

Keep the Balance: A Parameter-Efficient Symmetrical Framework for RGB+X Semantic Segmentation

Jiaxin Cai, Jingze Su, Qi Li et al.

NEURIPS 2025 arXiv:2506.05216

#15034

A Unified Framework for Provably Efficient Algorithms to Estimate Shapley Values

Tyler Chen, Akshay Seshadri, Mattia Jacopo Villani et al.

NEURIPS 2025 oral arXiv:2505.22820

#15035

Preference Learning with Response Time: Robust Losses and Guarantees

Ayush Sawarni, Sahasrajit Sarmasarkar, Vasilis Syrgkanis

NEURIPS 2025 arXiv:2506.02793

#15036

Doubly-Robust Estimation of Counterfactual Policy Mean Embeddings

Houssam Zenati, Bariscan Bozkurt, Arthur Gretton

NEURIPS 2025 arXiv:2509.18207

#15037

Securing the Language of Life: Inheritable Watermarks from DNA Language Models to Proteins

ZAIXI ZHANG, Ruofan Jin, Le Cong et al.

NEURIPS 2025 arXiv:2510.18407

#15038

Heterogeneous Adversarial Play in Interactive Environments

Manjie Xu, Xinyi Yang, Jiayu Zhan et al.

ICCV 2025 highlight arXiv:2507.04263

#15039

SRefiner: Soft-Braid Attention for Multi-Agent Trajectory Refinement

Liwen Xiao, Zhiyu Pan, Zhicheng Wang et al.

#15040

SimWorld: An Open-ended Simulator for Agents in Physical and Social Worlds

Xiaokang Ye, Jiawei Ren, Yan Zhuang et al.

NEURIPS 2025 spotlight

NEURIPS 2025 arXiv:2503.18403

#15041

Knowledge Graph Enhanced Generative Multi-modal Models for Class-Incremental Learning

Xusheng Cao, Haori Lu, Linlan Huang et al.

ICCV 2025 highlight arXiv:2502.20760

#15042

VRM: Knowledge Distillation via Virtual Relation Matching

Weijia Zhang, Fei Xie, Weidong Cai et al.

ICCV 2025 highlight arXiv:2503.06453

#15043

Efficient Input-level Backdoor Defense on Text-to-Image Synthesis via Neuron Activation Variation

Shengfang ZHAI, Jiajun Li, Yue Liu et al.

NEURIPS 2025 arXiv:2509.15096

#15044

OmniSegmentor: A Flexible Multi-Modal Learning Framework for Semantic Segmentation

Bo-Wen Yin, Jiao-Long Cao, Xuying Zhang et al.

ICCV 2025 arXiv:2507.09207

#15045

Visual Surface Wave Elastography: Revealing Subsurface Physical Properties via Visible Surface Waves

Alexander Ogren, Berthy Feng, Jihoon Ahn et al.

NEURIPS 2025 arXiv:2510.19819

#15046

Is This Tracker On? A Benchmark Protocol for Dynamic Tracking

Ilona Demler, Saumya Chauhan, Georgia Gkioxari

ICCV 2025 arXiv:2510.10793

#15047

ImHead: A Large-scale Implicit Morphable Model for Localized Head Modeling

Rolandos Alexandros Potamias, Stathis Galanakis, Jiankang Deng et al.

NEURIPS 2025 arXiv:2505.18986

#15048

VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion

Zhiwei Lin, Yongtao Wang

ICCV 2025 arXiv:2508.20080

#15049

Seam360GS: Seamless 360° Gaussian Splatting from Real-World Omnidirectional Images

Changha Shin, Woong Oh Cho, Seon Joo Kim

NEURIPS 2025 arXiv:2502.06067

#15050

Smooth Sailing: Lipschitz-Driven Uncertainty Quantification for Spatial Associations

David Burt, Renato Berlinghieri, Stephen Bates et al.

#15051

Stabilizing LTI Systems under Partial Observability: Sample Complexity and Fundamental Limits

Ziyi Zhang, Yorie Nakahira, Guannan Qu

NEURIPS 2025

NEURIPS 2025 arXiv:2510.18053

#15052

Adaptive Divergence Regularized Policy Optimization for Fine-tuning Generative Models

Jiajun Fan, Tong Wei, Chaoran Cheng et al.

#15053

MaDCoW: Marginal Distortion Correction for Wide-Angle Photography with Arbitrary Objects

Kevin Zhang, Jia-Bin Huang, Jose Echevarria et al.

#15054

Incorporating Dense Knowledge Alignment into Unified Multimodal Representation Models

Yuhao Cui, Xinxing Zu, Wenhua Zhang et al.

NEURIPS 2025 arXiv:2505.13519

#15055

Continuous Domain Generalization

Zekun CAI, Yiheng YAO, Guangji Bai et al.

NEURIPS 2025 arXiv:2505.20583

#15056

Balancing Performance and Costs in Best Arm Identification

Michael Harding, Kirthevasan Kandasamy

#15057

Polarized Color Screen Matting

Kenji Enomoto, Scott Cohen, Brian Price et al.

CVPR 2025 highlight

NEURIPS 2025 arXiv:2501.19224

#15058

Fast exact recovery of noisy matrix from few entries: the infinity norm approach

BaoLinh Tran, Van Vu

ICCV 2025 arXiv:2502.01906

#15059

D-Attn: Decomposed Attention for Large Vision-and-Language Model

Chia-Wen Kuo, Sijie Zhu, Fan Chen et al.

NEURIPS 2025 arXiv:2510.04770

#15060

Beyond the Seen: Bounded Distribution Estimation for Open-Vocabulary Learning

Xiaomeng Fan, Yuchuan Mao, Zhi Gao et al.

NEURIPS 2025 arXiv:2511.20446

#15061

Learning to Generate Human-Human-Object Interactions from Textual Descriptions

Jeonghyeon Na, Sangwon Baik, Inhee Lee et al.

ICCV 2025 arXiv:2507.21924

#15062

MMAT-1M: A Large Reasoning Dataset for Multimodal Agent Tuning

Tianhong Gao, Yannian Fu, Weiqun Wu et al.

ICCV 2025 highlight arXiv:2507.17268

#15063

PolarAnything: Diffusion-based Polarimetric Image Synthesis

Kailong Zhang, Youwei Lyu, Heng Guo et al.

NEURIPS 2025 arXiv:2510.10292

#15064

From Programs to Poses: Factored Real-World Scene Generation via Learned Program Libraries

Joy Hsu, Emily Jin, Jiajun Wu et al.

CVPR 2025 arXiv:2412.09723

#15065

MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction

Xiaohao Xu, Feng Xue, Shibo Zhao et al.

NEURIPS 2025 arXiv:2510.09008

#15066

On Epistemic Uncertainty of Visual Tokens for Object Hallucinations in Large Vision-Language Models

Hoigi Seo, Dong Un Kang, Hyunjin Cho et al.

ICCV 2025 arXiv:2505.12911

#15067

HiERO: Understanding the Hierarchy of Human Behavior Enhances Reasoning on Egocentric Videos

Simone Alberto Peirone, Francesca Pistilli, Giuseppe Averta

NEURIPS 2025 arXiv:2507.02064

#15068

REMI: Reconstructing Episodic Memory During Internally Driven Path Planning

Zhaoze Wang, Genela Morris, Dori Derdikman et al.

CVPR 2025 highlight arXiv:2503.23094

#15069

FRAME: Floor-aligned Representation for Avatar Motion from Egocentric Video

Andrea Boscolo Camiletto, Jian Wang, Eduardo Alvarado et al.

NEURIPS 2025 oral arXiv:2509.06782

#15070

Physics-informed Value Learner for Offline Goal-Conditioned Reinforcement Learning

Vittorio Giammarino, Ruiqi Ni, Ahmed Qureshi

ICCV 2025 arXiv:2508.03102

#15071

Causal Disentanglement and Cross-Modal Alignment for Enhanced Few-Shot Learning

Tianjiao Jiang, Zhen Zhang, Yuhang Liu et al.

NEURIPS 2025 spotlight arXiv:2510.24038

#15072

Enhancing CLIP Robustness via Cross-Modality Alignment

Xingyu Zhu, Beier Zhu, Shuo Wang et al.

ICCV 2025 arXiv:2501.14484

#15073

SpikePack: Enhanced Information Flow in Spiking Neural Networks with High Hardware Compatibility

Guobin Shen, Jindong Li, Tenglong Li et al.

#15074

Breaking Rectangular Shackles: Cross-View Object Segmentation for Fine-Grained Object Geo-Localization

Qingwang Zhang, Yingying Zhu

CVPR 2025 arXiv:2502.21048

#15075

Data-free Universal Adversarial Perturbation with Pseudo-semantic Prior

Chanhui Lee, Yeonghwan Song, Jeany Son

ICCV 2025 arXiv:2403.08512

#15076

MergeOcc: Bridge the Domain Gap between Different LiDARs for Robust Occupancy Prediction

Zikun Xu, Shaobing Xu

NEURIPS 2025 arXiv:2511.04063

#15077

DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization

YUANTIAN SHAO, Yuanteng Chen, Peisong Wang et al.

NEURIPS 2025 arXiv:2505.21258

#15078

Plenodium: Underwater 3D Scene Reconstruction with Plenoptic Medium Representation

Changguang WU, Jiangxin Dong, Chengjian Li et al.

NEURIPS 2025 arXiv:2506.21757

#15079

TADA: Improved Diffusion Sampling with Training-free Augmented DynAmics

Tianrong Chen, Huangjie Zheng, David Berthelot et al.

CVPR 2025 arXiv:2503.13241

#15080

Sampling Innovation-Based Adaptive Compressive Sensing

Zhifu Tian, Tao Hu, Chaoyang Niu et al.

NEURIPS 2025 arXiv:2505.18427

#15081

Learning Latent Variable Models via Jarzynski-adjusted Langevin Algorithm

James Cuin, Davide Carbone, O. Deniz Akyildiz

NEURIPS 2025 arXiv:2407.12699

#15082

Mechanism Design via the Interim Relaxation

Kshipra Bhawalkar, Marios Mertzanidis, Divyarthi Mohan et al.

CVPR 2025 arXiv:2503.17752

#15083

HiLoTs: High-Low Temporal Sensitive Representation Learning for Semi-Supervised LiDAR Segmentation in Autonomous Driving

R.D. Lin, Pengcheng Weng, Yinqiao Wang et al.

NEURIPS 2025 arXiv:2505.17749

#15084

Mind the GAP! The Challenges of Scale in Pixel-based Deep Reinforcement Learning

Ghada Sokar, Pablo Samuel Castro

NEURIPS 2025 spotlight arXiv:2505.01917

#15085

Discrete Spatial Diffusion: Intensity-Preserving Diffusion Modeling

Javier E. Santos, Agnese Marcato, Roman Colman et al.

CVPR 2025 highlight arXiv:2504.02199

#15086

ESC: Erasing Space Concept for Knowledge Deletion

Tae-Young Lee, Sundong Park, Minwoo Jeon et al.

NEURIPS 2025 arXiv:2510.14623

#15087

LeapFactual: Reliable Visual Counterfactual Explanation Using Conditional Flow Matching

Zhuo Cao, Xuan Zhao, Lena Krieger et al.

ICCV 2025 arXiv:2509.02101

#15088

SALAD -- Semantics-Aware Logical Anomaly Detection

Matic Fučka, Vitjan Zavrtanik, Danijel Skocaj

NEURIPS 2025 arXiv:2510.17299

#15089

Exploring Structural Degradation in Dense Representations for Self-supervised Learning

Siran Dai, Qianqian Xu, Peisong Wen et al.

NEURIPS 2025 spotlight arXiv:2509.20211

#15090

Practical do-Shapley Explanations with Estimand-Agnostic Causal Inference

Álvaro Parafita, Tomas Garriga, Axel Brando et al.

CVPR 2025 arXiv:2503.18637

#15091

Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks

Nina Shvetsova, Arsha Nagrani, Bernt Schiele et al.

CVPR 2025 arXiv:2503.21854

#15092

Foveated Instance Segmentation

Hongyi Zeng, Wenxuan Liu, Tianhua Xia et al.

#15093

Active Event-based Stereo Vision

Jianing Li, Yunjian Zhang, Haiqian Han et al.

NEURIPS 2025 arXiv:2510.22451

#15094

GraphTOP: Graph Topology-Oriented Prompting for Graph Neural Networks

Xingbo Fu, Zhenyu Lei, Zihan Chen et al.

ICCV 2025 arXiv:2507.04511

#15095

FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection

Xinhua Lu, Runhe Lai, Yanqi Wu et al.

NEURIPS 2025 arXiv:2412.12772

#15096

Optimize the Unseen - Fast NeRF Cleanup with Free Space Prior

Leo Segre, Shai Avidan

NEURIPS 2025 oral arXiv:2510.14427

#15097

Deep Compositional Phase Diffusion for Long Motion Sequence Generation

Ho Yin Au, Jie Chen, Junkun Jiang et al.

ICCV 2025 arXiv:2503.20663

#15098

ARMO: Autoregressive Rigging for Multi-Category Objects

mingze sun, Shiwei Mao, Keyi Chen et al.

NEURIPS 2025 arXiv:2510.25739

#15099

Hawk: Leveraging Spatial Context for Faster Autoregressive Text-to-Image Generation

Zhi-Kai Chen, Jun-Peng Jiang, Han-Jia Ye et al.

NEURIPS 2025 oral arXiv:2505.17459

#15100

Sparse Diffusion Autoencoder for Test-time Adapting Prediction of Complex Systems

Jingwen Cheng, Ruikun Li, Huandong Wang et al.

CVPR 2025 arXiv:2503.05333

#15101

PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations?

Martin Spitznagel, Jan Vaillant, Janis Keuper

NEURIPS 2025 arXiv:2507.00322

#15102

Failure by Interference: Language Models Make Balanced Parentheses Errors When Faulty Mechanisms Overshadow Sound Ones

Daking Rai, Samuel Miller, Kevin Moran et al.

ICCV 2025 arXiv:2506.21364

#15103

CA-I2P: Channel-Adaptive Registration Network with Global Optimal Selection

Zhixin Cheng, Jiacheng Deng, Xinjun Li et al.

CVPR 2025 arXiv:2505.02071

#15104

Hierarchical Compact Clustering Attention (COCA) for Unsupervised Object-Centric Learning

Can Küçüksözen, Yucel Yemez

CVPR 2025 highlight arXiv:2411.16788

#15105

TIDE: Training Locally Interpretable Domain Generalization Models Enables Test-time Correction

Aishwarya Agarwal, Srikrishna Karanam, Vineet Gandhi

NEURIPS 2025 arXiv:2504.12397

#15106

Activated LoRA: Fine-tuned LLMs for Intrinsics

Kristjan Greenewald, Luis Lastras, Thomas Parnell et al.

NEURIPS 2025 arXiv:2405.14741

#15107

Subsampled Ensemble Can Improve Generalization Tail Exponentially

Huajie Qian, Donghao Ying, Henry Lam et al.

NEURIPS 2025 arXiv:2509.19360

#15108

Semantic Representation Attack against Aligned Large Language Models

Jiawei Lian, Jianhong Pan, Lefan Wang et al.

#15109

Visual Relation Diffusion for Human-Object Interaction Detection

Ping Cao, Yepeng Tang, Chunjie Zhang et al.

NEURIPS 2025 oral arXiv:2505.22573

#15110

FNOPE: Simulation-based inference on function spaces with Fourier Neural Operators

Guy Moss, Leah Muhle, Reinhard Drews et al.

NEURIPS 2025 arXiv:2507.16345

#15111

The Cost of Compression: Tight Quadratic Black-Box Attacks on Sketches for $\ell_2$ Norm Estimation

Sara Ahmadian, Edith Cohen, Uri Stemmer

NEURIPS 2025 arXiv:2503.05323

#15112

Graph Alignment via Birkhoff Relaxation

Sushil Varma, Irène Waldspurger, Laurent Massoulié

NEURIPS 2025 arXiv:2506.06160

#15113

Acceleration via silver step-size on Riemannian manifolds with applications to Wasserstein space

Jiyoung Park, Abhishek Roy, Jonathan W. Siegel et al.

CVPR 2025 arXiv:2502.08646

#15114

Poly-Autoregressive Prediction for Modeling Interactions

Neerja Thakkar, Tara Sadjadpour, Jathushan Rajasegaran et al.

NEURIPS 2025 arXiv:2408.08395

#15115

Uncoupled and Convergent Learning in Monotone Games under Bandit Feedback

Jing Dong, Baoxiang Wang, Yaoliang Yu

#15116

Pseudo Visible Feature Fine-Grained Fusion for Thermal Object Detection

Ting Li, Mao Ye, Tianwen Wu et al.

NEURIPS 2025 oral arXiv:2505.19408

#15117

Future Link Prediction Without Memory or Aggregation

Lu Yi, Runlin Lei, Fengran Mo et al.

#15118

MAGE: Single Image to Material-Aware 3D via the Multi-View G-Buffer Estimation Model

Haoyuan Wang, Zhenwei Wang, Xiaoxiao Long et al.

NEURIPS 2025 arXiv:2506.13030

#15119

WildCAT3D: Appearance-Aware Multi-View Diffusion in the Wild

Morris Alper, David Novotny, Filippos Kokkinos et al.

CVPR 2025 arXiv:2505.09413

#15120

Sparse Point Cloud Patches Rendering via Splitting 2D Gaussians

Changfeng Ma, Ran Bi, Jie Guo et al.

ICCV 2025 arXiv:2507.00659

#15121

LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment

Juelin Zhu, Shuaibang Peng, Long Wang et al.

ICCV 2025 arXiv:2504.11521

#15122

LANGTRAJ: Diffusion Model and Dataset for Language-Conditioned Trajectory Simulation

WEI-JER Chang, Masayoshi Tomizuka, Wei Zhan et al.

NEURIPS 2025 oral arXiv:2503.14698

#15123

Learning Efficient Fuse-and-Refine for Feed-Forward 3D Gaussian Splatting

Yiming Wang, Lucy Chai, Xuan Luo et al.

CVPR 2025 highlight arXiv:2505.00502

#15124

Towards Scalable Human-aligned Benchmark for Text-guided Image Editing

Suho Ryu, Kihyun Kim, Eugene Baek et al.

ICCV 2025 arXiv:2505.04813

#15125

WIR3D: Visually-Informed and Geometry-Aware 3D Shape Abstraction

Richard Liu, Daniel Fu, Noah Tan et al.

CVPR 2025 arXiv:2412.04456

#15126

HeatFormer: A Neural Optimizer for Multiview Human Mesh Recovery

Yuto Matsubara, Ko Nishino

NEURIPS 2025 arXiv:2507.02974

#15127

InvisibleInk: High-Utility and Low-Cost Text Generation with Differential Privacy

Vishnu Vinod, Krishna Pillutla, Abhradeep Guha Thakurta

ICCV 2025 arXiv:2503.06136

#15128

GSV3D: Gaussian Splatting-based Geometric Distillation with Stable Video Diffusion for Single-Image 3D Object Generation

Ye Tao, jiawei zhang, Yahao Shi et al.

NEURIPS 2025 arXiv:2507.05362

#15129

On the Bias of Next-Token Predictors Toward Systematically Inefficient Reasoning: A Shortest-Path Case Study

Riccardo Alberghi, Elizaveta Demyanenko, Luca Biggio et al.

NEURIPS 2025 oral arXiv:2510.12160

#15130

State Space Prompting via Gathering and Spreading Spatio-Temporal Information for Video Understanding

Jiahuan Zhou, Kai Zhu, Zhenyu Cui et al.

NEURIPS 2025 arXiv:2510.20877

#15131

Multimodal Negative Learning

Baoquan Gong, Xiyuan Gao, Pengfei Zhu et al.

ICCV 2025 arXiv:2510.13317

#15132

Removing Cost Volumes from Optical Flow Estimators

Simon Kiefhaber, Stefan Roth, Simone Schaub-Meyer

NEURIPS 2025 arXiv:2403.00957

#15133

Resolution of Simpson's paradox via the common cause principle

Arshak Hovhannisyan, Armen Allahverdyan

CVPR 2025 arXiv:2503.01130

#15134

AirRoom: Objects Matter in Room Reidentification

Runmao Yao, Yi Du, Zhuoqun Chen et al.

NEURIPS 2025 arXiv:2506.02881

#15135

Simulation-Based Inference for Adaptive Experiments

Brian Cho, Aurelien Bibaut, Nathan Kallus

NEURIPS 2025 arXiv:2509.06938

#15136

From Noise to Narrative: Tracing the Origins of Hallucinations in Transformers

Praneet Suresh, Jack Stanley, Sonia Joseph et al.

ICCV 2025 arXiv:2510.11605

#15137

ACE-G: Improving Generalization of Scene Coordinate Regression Through Query Pre-Training

Leonard Bruns, Axel Barroso-Laguna, Tommaso Cavallari et al.

NEURIPS 2025 arXiv:2510.25512

#15138

FaCT: Faithful Concept Traces for Explaining Neural Network Decisions

Amin Parchami-Araghi, Sukrut Rao, Jonas Fischer et al.

NEURIPS 2025 arXiv:2506.03119

#15139

Controllable Human-centric Keyframe Interpolation with Generative Prior

Zujin Guo, Size Wu, Zhongang Cai et al.

NEURIPS 2025 oral arXiv:2503.23793

#15140

Pan-LUT: Efficient Pan-sharpening via Learnable Look-Up Tables

Zhongnan Cai, Yingying Wang, Hui Zheng et al.

NEURIPS 2025 arXiv:2510.22095

#15141

Embracing Trustworthy Brain-Agent Collaboration as Paradigm Extension for Intelligent Assistive Technologies

Yankai Chen, Xinni Zhang, Yifei Zhang et al.

ICCV 2025 highlight arXiv:2504.00139

#15142

SuperEvent: Cross-Modal Learning of Event-based Keypoint Detection for SLAM

Yannick Burkhardt, Simon Schaefer, Stefan Leutenegger

NEURIPS 2025 arXiv:2510.07575

#15143

Position: Benchmarking is Broken - Don't Let AI be Its Own Judge

Zerui Cheng, Stella Wohnig, Ruchika Gupta et al.

NEURIPS 2025 arXiv:2412.06646

#15144

The Narrow Gate: Localized Image-Text Communication in Native Multimodal Models

Alessandro Serra, Francesco Ortu, Emanuele Panizon et al.

NEURIPS 2025 arXiv:2505.13567

#15145

Learning Dynamics of RNNs in Closed-Loop Environments

Yoav Ger, Omri Barak

ICCV 2025 arXiv:2508.17239

#15146

PersPose: 3D Human Pose Estimation with Perspective Encoding and Perspective Rotation

Xiaoyang Hao, Han Li

NEURIPS 2025 arXiv:2509.16691

#15147

InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention

Qiang Xiang, Shuang Sun, Binglei Li et al.

NEURIPS 2025 arXiv:2510.23640

#15148

Structure-Aware Fusion with Progressive Injection for Multimodal Molecular Representation Learning

Zihao Jing, Yan Sun, Yan Yi Li et al.

CVPR 2025 highlight arXiv:2503.06012

#15149

End-to-End HOI Reconstruction Transformer with Graph-based Encoding

Zhenrong Wang, Qi Zheng, Sihan Ma et al.

ICCV 2025 arXiv:2507.20842

#15150

METEOR: Multi-Encoder Collaborative Token Pruning for Efficient Vision Language Models

Yuchen Liu, Yaoming Wang, Bowen Shi et al.

#15151

LoKi: Low-dimensional KAN for Efficient Fine-tuning Image Models

Xuan Cai, Renjie Pan, Hua Yang

NEURIPS 2025 spotlight arXiv:2509.00614

#15152

RoFt-Mol: Benchmarking Robust Fine-tuning with Molecular Graph Foundation Models

Shikun Liu, Deyu Zou, Nima Shoghi et al.

CVPR 2025 highlight arXiv:2503.10149

#15153

Unlocking Generalization Power in LiDAR Point Cloud Registration

Zhenxuan Zeng, Qiao Wu, Xiyu Zhang et al.

NEURIPS 2025 arXiv:2505.12919

#15154

RGNMR: A Gauss-Newton method for robust matrix completion with theoretical guarantees

Eilon Vaknin Laufer, Boaz Nadler

NEURIPS 2025 arXiv:2507.05193

#15155

RAM-W600: A Multi-Task Wrist Dataset and Benchmark for Rheumatoid Arthritis

YANG SONGXIAO, Haolin Wang, Yao Fu et al.

ICCV 2025 arXiv:2412.00138

#15156

Adversarial Exploitation of Data Diversity Improves Visual Localization

Sihang Li, Siqi Tan, Bowen Chang et al.

#15157

DyGS-SLAM: Real-Time Accurate Localization and Gaussian Reconstruction for Dynamic Scenes

Xinggang Hu, Chenyangguang Zhang, Mingyuan Zhao et al.

#15158

F^3OCUS - Federated Finetuning of Vision-Language Foundation Models with Optimal Client Layer Updating Strategy via Multi-objective Meta-Heuristics

Pramit Saha, Felix Wagner, Divyanshu Mishra et al.

CVPR 2025 highlight

NEURIPS 2025 arXiv:2509.23666

#15159

Beyond Greedy Exits: Improved Early Exit Decisions for Risk Control and Reliability

Divya Jyoti Bajpai, Manjesh Kumar Hanawal

CVPR 2025 arXiv:2505.06580

#15160

TAROT: Towards Essentially Domain-Invariant Robustness with Theoretical Justification

Dongyoon Yang, Jihu Lee, Yongdai Kim

ICCV 2025 arXiv:2411.16167

#15161

Mind the Cost of Scaffold! Benign Clients May Even Become Accomplices of Backdoor Attack

Xingshuo Han, Xuanye Zhang, Xiang Lan et al.

NEURIPS 2025 arXiv:2512.14677

#15162

VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single Image

Sicheng Xu, Guojun Chen, Jiaolong Yang et al.

CVPR 2025 arXiv:2504.03006

#15163

DiSRT-In-Bed: Diffusion-Based Sim-to-Real Transfer Framework for In-Bed Human Mesh Recovery

Jing Gao, Ce Zheng, Laszlo Jeni et al.

CVPR 2025 arXiv:2412.05279

#15164

Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories

Susung Hong, Johanna Suvi Karras, Ricardo Martin et al.

NEURIPS 2025 arXiv:2508.15593

#15165

Inductive Domain Transfer In Misspecified Simulation-Based Inference

Ortal Senouf, Antoine Wehenkel, Cédric Vincent-Cuaz et al.

NEURIPS 2025 arXiv:2506.14271

#15166

Leader360V: A Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment

WEIMING ZHANG, Dingwen Xiao, Aobotao DAI et al.

NEURIPS 2025 arXiv:2507.12318

#15167

Compositional Discrete Latent Code for High Fidelity, Productive Diffusion Models

Samuel Lavoie, Michael Noukhovitch, Aaron Courville

ICCV 2025 arXiv:2503.21581

#15168

AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion

Liuyue Xie, Jiancong Guo, Ozan Cakmakci et al.

NEURIPS 2025 arXiv:2406.04772

#15169

REP: Resource-Efficient Prompting for Rehearsal-Free Continual Learning

Sungho Jeon, Xinyue Ma, Kwang In Kim et al.

ICCV 2025 arXiv:2409.17981

#15170

BlinkTrack: Feature Tracking over 80 FPS via Events and Images

Yichen Shen, Yijin Li, Shuo Chen et al.

#15171

DICE: Staleness-Centric Optimizations for Parallel Diffusion MoE Inference

Jiajun Luo, Lizhuo Luo, Jianru Xu et al.

NEURIPS 2025 arXiv:2510.21292

#15172

Additive Models Explained: A Computational Complexity Approach

Shahaf Bassan, Michal Moshkovitz, Guy Katz

NEURIPS 2025 arXiv:2511.11593

#15173

Sound Logical Explanations for Mean Aggregation Graph Neural Networks

Matthew Morris, Ian Horrocks

NEURIPS 2025 oral arXiv:2510.21110

#15174

Confounding Robust Deep Reinforcement Learning: A Causal Approach

Mingxuan Li, Junzhe Zhang, Elias Bareinboim

NEURIPS 2025 spotlight arXiv:2505.11309

#15175

Decomposing stimulus-specific sensory neural information via diffusion models

Steeve Laquitaine, Simone Azeglio, Carlo Paris et al.

CVPR 2025 highlight arXiv:2503.04119

#15176

SCSA: A Plug-and-Play Semantic Continuous-Sparse Attention for Arbitrary Semantic Style Transfer

Chunnan Shang, Zhizhong Wang, Hongwei Wang et al.

NEURIPS 2025 oral arXiv:2505.11930

#15177

The Logical Expressiveness of Temporal GNNs via Two-Dimensional Product Logics

Marco Sälzer, Przemyslaw Walega, Martin Lange

NEURIPS 2025 arXiv:2511.01169

#15178

Web-Scale Collection of Video Data for 4D Animal Reconstruction

Brian Nlong Zhao, Jiajun Wu, Shangzhe Wu

NEURIPS 2025 arXiv:2502.18475

#15179

Least squares variational inference

Yvann Le Fay, Nicolas Chopin, Simon Barthelmé

#15180

Diff2I2P: Differentiable Image-to-Point Cloud Registration with Diffusion Prior

Juncheng Mu, Chengwei REN, Weixiang Zhang et al.

ICCV 2025 arXiv:2411.15235

#15181

CODE-CL: Conceptor-Based Gradient Projection for Deep Continual Learning

Marco P. Apolinario, Sakshi Choudhary, Kaushik Roy

NEURIPS 2025 spotlight arXiv:2511.00090

#15182

LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation

Huanlin Gao, Ping Chen, Fuyuan Shi et al.

NEURIPS 2025 oral arXiv:2410.16136

#15183

Modeling Dynamic Neural Activity by combining Naturalistic Video Stimuli and Stimulus-independent Latent Factors

Finn Schmidt, Polina Turishcheva, Suhas Shrinivasan et al.

NEURIPS 2025 arXiv:2505.10118

#15184

Why 1 + 1 < 1 in Visual Token Pruning: Beyond Naive Integration via Multi-Objective Balanced Covering

Yangfu Li, Hongjian Zhan, Tianyi Chen et al.

NEURIPS 2025 oral arXiv:2507.21244

#15185

Bubbleformer: Forecasting Boiling with Transformers

Sheikh Md Shakeel Hassan, Xianwei Zou, Akash Dhruv et al.

CVPR 2025 arXiv:2503.14129

#15186

SketchFusion: Learning Universal Sketch Features through Fusing Foundation Models

Subhadeep Koley, Tapas Kumar Dutta, Aneeshan Sain et al.

NEURIPS 2025 arXiv:2507.02377

#15187

Sparse Gaussian Processes: Structured Approximations and Power-EP Revisited

Thang Bui, Michalis Titsias

NEURIPS 2025 arXiv:2506.19839

#15188

Improving Progressive Generation with Decomposable Flow Matching

Moayed Haji-Ali, Willi Menapace, Ivan Skorokhodov et al.

#15189

PixelStitch: Structure-Preserving Pixel-Wise Bidirectional Warps for Unsupervised Image Stitching

Hengzhe Jin, Lang Nie, Chunyu Lin et al.

CVPR 2025 highlight arXiv:2411.18159

#15190

Type-R: Automatically Retouching Typos for Text-to-Image Generation

Wataru Shimoda, Naoto Inoue, Daichi Haraguchi et al.

NEURIPS 2025 arXiv:2510.25657

#15191

Subgraph Federated Learning via Spectral Methods

Javad Aliakbari, Johan Oestman, Ashkan Panahi et al.

NEURIPS 2025 arXiv:2502.18710

#15192

Bridging Critical Gaps in Convergent Learning: How Representational Alignment Evolves Across Layers, Training, and Distribution Shifts

Chaitanya Kapoor, Sudhanshu Srivastava, Meenakshi Khosla

NEURIPS 2025 oral arXiv:2512.07599

#15193

Online Segment Any 3D Thing as Instance Tracking

Hanshi Wang, Cai Zijian, Jin Gao et al.

NEURIPS 2025 spotlight arXiv:2601.09825

#15194

Eluder dimension: localise it!

Alireza Bakhtiari, Alex Ayoub, Samuel Robertson et al.

NEURIPS 2025 oral arXiv:2502.09767

#15195

Non-Markovian Discrete Diffusion with Causal Language Models

Yangtian Zhang, Sizhuang He, Daniel Levine et al.

ICCV 2025 arXiv:2508.06160

#15196

Fewer Denoising Steps or Cheaper Per-Step Inference: Towards Compute-Optimal Diffusion Model Deployment

Zhenbang Du, Yonggan Fu, Lifu Wang et al.

ICCV 2025 arXiv:2507.09896

#15197

Measuring the Impact of Rotation Equivariance on Aerial Object Detection

Xiuyu Wu, Xinhao Wang, Xiubin Zhu et al.

#15198

Black Hole-Driven Identity Absorbing in Diffusion Models

Muhammad Shaheryar, Jong Taek Lee, Soon Ki Jung

NEURIPS 2025 arXiv:2505.23395

#15199

Point or Line? Using Line-based Representation for Panoptic Symbol Spotting in CAD Drawings

Xingguang Wei, Haomin Wang, Shenglong Ye et al.

CVPR 2025 arXiv:2410.21629

#15200

OFER: Occluded Face Expression Reconstruction

Pratheba Selvaraju, Victoria Abrevaya, Timo Bolkart et al.