NeurIPS 2025 Papers

5,858 papers found • Page 115 of 118

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Haozhe Wang, Chao Qu, Zuming Huang et al.

NeurIPS 2025spotlight
169
citations

VL-SAE: Interpreting and Enhancing Vision-Language Alignment with a Unified Concept Set

Shufan Shen, Junshu Sun, Qingming Huang et al.

NeurIPS 2025posterarXiv:2510.21323
1
citations

VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion

Zhiwei Lin, Yongtao Wang

NeurIPS 2025poster
1
citations

VMDT: Decoding the Trustworthiness of Video Foundation Models

Yujin Potter, Zhun Wang, Nicholas Crispino et al.

NeurIPS 2025poster

Vocabulary-Guided Gait Recognition

Panjian Huang, Saihui Hou, Chunshui Cao et al.

NeurIPS 2025poster

Vocabulary In-Context Learning in Transformers: Benefits of Positional Encoding

Qian Ma, Ruoxiang Xu, Yongqiang Cai

NeurIPS 2025poster

VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play

Zelai Xu, Ruize Zhang, Chao Yu et al.

NeurIPS 2025poster

Volume Transmission Implements Context Factorization to Target Online Credit Assignment and Enable Compositional Generalization

Matthew Bull, Po-Chen Kuo, Andrew Smith et al.

NeurIPS 2025poster

VORTA: Efficient Video Diffusion via Routing Sparse Attention

Wenhao Sun, Rong-Cheng Tu, Yifu Ding et al.

NeurIPS 2025posterarXiv:2505.18809
7
citations

VoxDet: Rethinking 3D Semantic Scene Completion as Dense Object Detection

Wuyang Li, Zhu Yu, Alexandre Alahi

NeurIPS 2025spotlight

VPO: Reasoning Preferences Optimization Based on $\mathcal{V}$-Usable Information

Zecheng Wang, Chunshan Li, Yupeng Zhang et al.

NeurIPS 2025spotlight

VQ-Seg: Vector-Quantized Token Perturbation for Semi-Supervised Medical Image Segmentation

Sicheng Yang, Zhaohu Xing, Lei Zhu

NeurIPS 2025poster

VQToken: Neural Discrete Token Representation Learning for Extreme Token Reduction in Video Large Language Models

Haichao Zhang, Yun Fu

NeurIPS 2025oralarXiv:2503.16980
3
citations

VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning

Qiuchen Wang, Ruixue Ding, Yu Zeng et al.

NeurIPS 2025poster

VR-Drive: Viewpoint-Robust End-to-End Driving with Feed-Forward 3D Gaussian Splatting

Hoonhee Cho, Jae-Young Kang, Giwon Lee et al.

NeurIPS 2025oral

VT-FSL: Bridging Vision and Text with LLMs for Few-Shot Learning

Wenhao Li, Qiangchang Wang, Xianjing Meng et al.

NeurIPS 2025poster
2
citations

VTON-VLLM: Aligning Virtual Try-On Models with Human Preferences

Siqi Wan, Jingwen Chen, Qi Cai et al.

NeurIPS 2025poster

Vulnerable Data-Aware Adversarial Training

Yuqi Feng, Jiahao Fan, Yanan Sun

NeurIPS 2025poster

Walking the Schrödinger Bridge: A Direct Trajectory for Text-to-3D Generation

Ziying Li, Xuequan Lu, Xinkui Zhao et al.

NeurIPS 2025poster
1
citations

Walking the Tightrope: Autonomous Disentangling Beneficial and Detrimental Drifts in Non-Stationary Custom-Tuning

Xiaoyu Yang, Jie Lu, En Yu

NeurIPS 2025oral

WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents

Siyu Zhou, Tianyi Zhou, Yijun Yang et al.

NeurIPS 2025poster
4
citations

WaLRUS: Wavelets for Long range Representation Using State Space Methods

Hossein Babaei, Mel White, Sina Alemohammad et al.

NeurIPS 2025poster

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Ruihang Chu, Yefei He, Zhekai Chen et al.

NeurIPS 2025oral
3
citations

WarpGAN: Warping-Guided 3D GAN Inversion with Style-Based Novel View Inpainting

Kaitao Huang, Yan Yan, Jing-Hao Xue et al.

NeurIPS 2025poster

WASP: Benchmarking Web Agent Security Against Prompt Injection Attacks

Ivan Evtimov, Arman Zharmagambetov, Aaron Grattafiori et al.

NeurIPS 2025poster

Wasserstein Convergence of Critically Damped Langevin Diffusions

Stanislas Strasman, Sobihan Surendran, Claire Boyer et al.

NeurIPS 2025poster

Wasserstein Transfer Learning

Kaicheng Zhang, Sinian Zhang, Doudou Zhou et al.

NeurIPS 2025poster

Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM

Zinuo Li, Xian Zhang, Yongxin Guo et al.

NeurIPS 2025oralarXiv:2505.18110
3
citations

Watermarking Autoregressive Image Generation

Nikola Jovanović, Ismail Labiad, Tomas Soucek et al.

NeurIPS 2025poster

WaveAR: Wavelet-Aware Continuous Autoregressive Diffusion for Accurate Human Motion Prediction

shengchuan gao, Shuo Wang, Yabiao Wang et al.

NeurIPS 2025oral

Wavelet Canonical Coherence for Nonstationary Signals

Haibo Wu, Marina Knight, Keiland Cooper et al.

NeurIPS 2025oral
1
citations

Wavy Transformer

Satoshi Noguchi, Yoshinobu Kawahara

NeurIPS 2025poster

Weak-shot Keypoint Estimation via Keyness and Correspondence Transfer

Junjie Chen, Zeyu Luo, Zezheng Liu et al.

NeurIPS 2025poster

Weak-to-Strong Generalization under Distribution Shifts

Myeongho Jeon, Jan Sobotka, Suhwan Choi et al.

NeurIPS 2025poster

WearVQA: A Visual Question Answering Benchmark for Wearables in Egocentric Authentic Real-world scenarios

Eun Chang, Zhuangqun Huang, Yiwei Liao et al.

NeurIPS 2025posterarXiv:2511.22154

WeatherPrompt: Multi-modality Representation Learning for All-Weather Drone Visual Geo-Localization

Jiahao Wen, Hang Yu, Zhedong Zheng

NeurIPS 2025poster

Weaver: Shrinking the Generation-Verification Gap by Scaling Compute for Verification

Jon Saad-Falcon, Estefany Kelly Buchanan, Mayee Chen et al.

NeurIPS 2025poster

WebDancer: Towards Autonomous Information Seeking Agency

Jialong Wu, Baixuan Li, Runnan Fang et al.

NeurIPS 2025posterarXiv:2505.22648
81
citations

WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch

Zimu Lu, Yunqiao Yang, Houxing Ren et al.

NeurIPS 2025oralarXiv:2505.03733
16
citations

Web-Scale Collection of Video Data for 4D Animal Reconstruction

Brian Nlong Zhao, Jiajun Wu, Shangzhe Wu

NeurIPS 2025poster
1
citations

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

Hyungjoo Chae, Seonghwan Kim, Junhee Cho et al.

NeurIPS 2025spotlight

WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Xiaoxi Li, Jiajie Jin, Guanting Dong et al.

NeurIPS 2025poster

We Should Chart an Atlas of All the World's Models

Eliahu Horwitz, Nitzan Kurer, Jonathan Kahana et al.

NeurIPS 2025poster
5
citations

What are you sinking? A geometric approach on attention sink

Valeria Ruscio, Umberto Nanni, Fabrizio Silvestri

NeurIPS 2025spotlightarXiv:2508.02546
2
citations

What Can RL Bring to VLA Generalization? An Empirical Study

Jijia Liu, Feng Gao, Bingwen Wei et al.

NeurIPS 2025poster

What Data Enables Optimal Decisions? An Exact Characterization for Linear Optimization

Omar Bennouna, Amine Bennouna, Saurabh Amin et al.

NeurIPS 2025poster
1
citations

What Does It Take to Build a Performant Selective Classifier?

Stephan Rabanser, Nicolas Papernot

NeurIPS 2025poster

What Do Latent Action Models Actually Learn?

Chuheng Zhang, Tim Pearce, Pushi Zhang et al.

NeurIPS 2025posterarXiv:2506.15691
7
citations

What do you know? Bayesian knowledge inference for navigating agents

Matthias Schultheis, Jana-Sophie Schönfeld, Constantin Rothkopf et al.

NeurIPS 2025oral

What Expressivity Theory Misses: Message Passing Complexity for GNNs

Niklas Kemper, Tom Wollschläger, Stephan Günnemann

NeurIPS 2025spotlight