2025 Papers

UVE: Are MLLMs Unified Evaluators for AI-Generated Videos?

Yuanxin Liu, Rui Zhu, Shuhuai Ren et al.

NEURIPS 2025 • poster • arXiv:2503.09949 • 2 citations

UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping

Aashish Rai, Dilin Wang, Mihir Jain et al.

CVPR 2025 • poster • arXiv:2502.01846

U-ViLAR: Uncertainty-Aware Visual Localization for Autonomous Driving via Differentiable Association and Registration

Xiaofan Li, Zhihao Xu, Chenming Wu et al.

ICCV 2025 • poster • arXiv:2507.04503

UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing

Yung-Hsuan Lai, Janek Ebbers, Yu-Chiang Frank Wang et al.

CVPR 2025 • poster • arXiv:2505.09615 • 1 citation

V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer

Hangzhou He, Lei Zhu, Xinliang Zhang et al.

AAAI 2025 • paper • arXiv:2501.04975 • 8 citations

V^2Dial: Unification of Video and Visual Dialog via Multimodal Experts

Adnen Abdessaied, Anna Rohrbach, Marcus Rohrbach et al.

CVPR 2025 • poster

V2M4: 4D Mesh Animation Reconstruction from a Single Monocular Video

Jianqi Chen, Biao Zhang, Xiangjun Tang et al.

ICCV 2025 • poster • arXiv:2503.09631 • 15 citations

V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding

Junqi Ge, Ziyi Chen, Jintao Lin et al.

ICCV 2025 • poster • arXiv:2412.09616 • 17 citations

V2V3D: View-to-View Denoised 3D Reconstruction for Light Field Microscopy

Jiayin Zhao, Zhenqi Fu, Tao Yu et al.

CVPR 2025 • poster • arXiv:2504.07853

V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation

Hanyue Lou, Jinxiu Liang, Minggui Teng et al.

NEURIPS 2025 • oral • arXiv:2505.16797 • 2 citations

V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction

Zewei Zhou, Hao Xiang, Zhaoliang Zheng et al.

ICCV 2025 • poster • arXiv:2412.01812 • 20 citations

V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception

Lei Yang, Xinyu Zhang, Jun Li et al.

NEURIPS 2025 • spotlight • arXiv:2411.10962 • 20 citations

V2X-R: Cooperative LiDAR-4D Radar Fusion with Denoising Diffusion for 3D Object Detection

Xun Huang, Jinlong Wang, Qiming Xia et al.

CVPR 2025 • poster • arXiv:2411.08402 • 10 citations

V2XScenes: A Multiple Challenging Traffic Conditions Dataset for Large-Range Vehicle-Infrastructure Collaborative Perception

Bowen Wang, Yafei Wang, Wei Gong et al.

ICCV 2025 • poster

V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning

Hang Hua, Yunlong Tang, Chenliang Xu et al.

AAAI 2025 • paper • arXiv:2404.12353 • 48 citations

VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window Attention

Jiangning Wei, Lixiong Qin, Bo Yu et al.

AAAI 2025 • paper • arXiv:2503.11004 • 5 citations

VACE: All-in-One Video Creation and Editing

Zeyinzi Jiang, Zhen Han, Chaojie Mao et al.

ICCV 2025 • poster • arXiv:2503.07598 • 169 citations

VADB: A Large-Scale Video Aesthetic Database with Professional and Multi-Dimensional Annotations

Qianqian Qiao, DanDan Zheng, Yihang Bo et al.

NEURIPS 2025 • oral • arXiv:2510.25238 • 1 citation

Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought

Chao Huang, Benfeng Wang, Wei Wang et al.

NEURIPS 2025 • poster • arXiv:2505.19877

VADTree: Explainable Training-Free Video Anomaly Detection via Hierarchical Granularity-Aware Tree

Wenlong Li, Yifei Xu, Yuan Rao et al.

NEURIPS 2025 • oral • arXiv:2510.22693 • 1 citation

VAE-Var: Variational Autoencoder-Enhanced Variational Methods for Data Assimilation in Meteorology

Yi Xiao, Qilong Jia, Kun Chen et al.

ICLR 2025 • poster • 6 citations

VAFlow: Video-to-Audio Generation with Cross-Modality Flow Matching

Xihua Wang, Xin Cheng, Yuyue Wang et al.

ICCV 2025 • poster • 3 citations

VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents

Kangrui Wang, Pingyue Zhang, Zihan Wang et al.

NEURIPS 2025 • poster • arXiv:2510.16907 • 8 citations

VA-GS: Enhancing the Geometric Representation of Gaussian Splatting via View Alignment

Qing Li, Huifang Feng, Xun Gong et al.

NEURIPS 2025 • poster • arXiv:2510.11473

VAGUE: Visual Contexts Clarify Ambiguous Expressions

Heejeong Nam, Jinwoo Ahn, Keummin Ka et al.

ICCV 2025 • poster • arXiv:2411.14137

Validating LLM-as-a-Judge Systems under Rating Indeterminacy

Luke Guerdan, Solon Barocas, Kenneth Holstein et al.

NEURIPS 2025 • poster • arXiv:2503.05965 • 6 citations

Validating Mechanistic Interpretations: An Axiomatic Approach

Nils Palumbo, Ravi Mangal, Zifan Wang et al.

ICML 2025 • poster • arXiv:2407.13594 • 1 citation

Valid Conformal Prediction for Dynamic GNNs

Ed Davis, Ian Gallagher, Daniel Lawson et al.

ICLR 2025 • poster • arXiv:2405.19230 • 7 citations

Valid Inference with Imperfect Synthetic Data

Yewon Byun, Shantanu Gupta, Zachary Lipton et al.

NEURIPS 2025 • poster • arXiv:2508.06635

Valid Selection among Conformal Sets

Mahmoud Hegazy, Liviu Aolaritei, Michael Jordan et al.

NEURIPS 2025 • poster • arXiv:2506.20173

VALLR: Visual ASR Language Model for Lip Reading

Marshall Thomas, Edward Fish, Richard Bowden

ICCV 2025 • poster • arXiv:2503.21408 • 6 citations

Value-aligned Behavior Cloning for Offline Reinforcement Learning via Bi-level Optimization

Xingyu Jiang, Ning Gao, Xiuhui Zhang et al.

ICLR 2025 • poster

Value-Based Deep RL Scales Predictably

Oleh Rybkin, Michal Nauman, Preston Fu et al.

ICML 2025 • poster • arXiv:2502.04327

Value Diffusion Reinforcement Learning

Xiaoliang Hu, Fuyun Wang, Tong Zhang et al.

NEURIPS 2025 • poster

Value Gradient Guidance for Flow Matching Alignment

Zhen Liu, Tim Xiao, Carles Domingo i Enrich et al.

NEURIPS 2025 • poster • arXiv:2512.05116

Value-Guided Decision Transformer: A Unified Reinforcement Learning Framework for Online and Offline Settings

Hongling Zheng, Li Shen, Yong Luo et al.

NEURIPS 2025 • poster

Value-Guided KV Compression for LLMs via Approximated CUR Decomposition

Ayan Sengupta, Siddhant Chaudhary, Tanmoy Chakraborty

NEURIPS 2025 • poster • arXiv:2509.15038 • 1 citation

Value-Guided Search for Efficient Chain-of-Thought Reasoning

Kaiwen Wang, Jin Zhou, Jonathan Chang et al.

NEURIPS 2025 • poster • arXiv:2505.17373 • 7 citations

Value Improved Actor Critic Algorithms

Yaniv Oren, Moritz Zanger, Pascal van der Vaart et al.

NEURIPS 2025 • poster • arXiv:2406.01423

Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF

Shicong Cen, Jincheng Mei, Katayoon Goshvadi et al.

ICLR 2025 • poster • arXiv:2405.19320

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

Weiming Ren, Wentao Ma, Huan Yang et al.

ICCV 2025 • poster • arXiv:2503.11579

VA-MoE: Variables-Adaptive Mixture of Experts for Incremental Weather Forecasting

Hao Chen, Tao Han, Song Guo et al.

ICCV 2025 • poster • arXiv:2412.02503 • 3 citations

VaMP: Variational Multi-Modal Prompt Learning for Vision-Language Models

Silin Cheng, Kai Han

NEURIPS 2025 • poster • arXiv:2511.22664 • 1 citation

Vanish into Thin Air: Cross-prompt Universal Adversarial Attacks for SAM2

Ziqi Zhou, Yifan Hu, Yufei Song et al.

NEURIPS 2025 • spotlight • arXiv:2510.24195 • 7 citations

VaporTok: RL-Driven Adaptive Video Tokenizer with Prior & Task Awareness

Minghao Yang, Zechen Bai, Jing Lin et al.

NEURIPS 2025 • poster

VarCMP: Adapting Cross-Modal Pre-Training Models for Video Anomaly Retrieval

Peng Wu, Wanshun Su, Xiangteng He et al.

AAAI 2025 • paper

VarDrop: Enhancing Training Efficiency by Reducing Variate Redundancy in Periodic Time Series Forecasting

Junhyeok Kang, Yooju Shin, Jae-Gil Lee

AAAI 2025 • paper • arXiv:2501.14183 • 3 citations

\(\varepsilon\)-Optimally Solving Two-Player Zero-Sum POSGs

Erwan Escudie, Matthia Sabatelli, Olivier Buffet et al.

NEURIPS 2025 • poster

VarFlow: Proper Scoring-Rule Diffusion Distillation via Energy Matching

Huiyang Shao, Xin Xia, Yuxi Ren et al.

NEURIPS 2025 • poster

Variance as a Catalyst: Efficient and Transferable Semantic Erasure Adversarial Attack for Customized Diffusion Models

Jiachen Yang, Yusong Wang, Yanmei Fang et al.

ICML 2025 • poster