NeurIPS Papers
5,858 papers found
Unleashing Hour-Scale Video Training for Long Video-Language Understanding
Jingyang Lin, Jialian Wu, Ximeng Sun et al.
Unleashing the Potential of Multimodal LLMs for Zero-Shot Spatio-Temporal Video Grounding
Zaiquan Yang, Yuhao LIU, Gerhard Hancke et al.
Unleashing the Power of One-Step Diffusion based Image Super-Resolution via a Large-Scale Diffusion Discriminator
Jianze Li, Jiezhang Cao, Zichen Zou et al.
Unlocker: Disentangle the Deadlock of Learning between Label-noisy and Long-tailed Data
shu chen, HongJun Xu, Ruichi Zhang et al.
Unlocking Dataset Distillation with Diffusion Models
Brian Moser, Federico Raue, Sebastian Palacio et al.
Unlocking hidden biomolecular conformational landscapes in diffusion models at inference time
Daniel D. Richman, Jessica Karaguesian, Carl-Mikael Suomivuori et al.
Unlocking Multimodal Mathematical Reasoning via Process Reward Model
Ruilin Luo, Zhuofan Zheng, Lei Wang et al.
Unlocking SLM Potential for Data Analysis Code Generation via Non-Parametric Knowledge Distillation
Jinyang Li, Jack Williams, Nick McKenna et al.
Unmasking Puppeteers: Leveraging Biometric Leakage to Expose Impersonation in AI-Based Videoconferencing
Danial Samadi Vahdati, Tai Nguyen, Ekta Prashnani et al.
Unraveling Metameric Dilemma for Spectral Reconstruction: A High-Fidelity Approach via Semi-Supervised Learning
Xingxing Yang, Jie Chen, Zaifeng Yang
Unsupervised Federated Graph Learning
Lele Fu, Tianchi Liao, Sheng Huang et al.
Unsupervised Learning for Optimal Transport plan prediction between unbalanced graphs
Sonia Mazelet, Rémi Flamary, Bertrand Thirion
Unsupervised Trajectory Optimization for 3D Registration in Serial Section Electron Microscopy using Neural ODEs
Zhenbang Zhang, Jingtong Feng, Hongjia Li et al.
Unveiling Chain of Step Reasoning for Vision-Language Models with Fine-grained Rewards
Honghao Chen, Xingzhou Lou, Xiaokun Feng et al.
Unveiling Concept Attribution in Diffusion Models
Nguyen Hung-Quang, Hoang Phan, Khoa D Doan
Unveiling Environmental Sensitivity of Individual Gains in Influence Maximization
Xinyan Su, Zhiheng Zhang, Jiyan Qiu et al.
Unveiling Extraneous Sampling Bias with Data Missing-Not-At-Random
Chunyuan Zheng, Haocheng Yang, Haoxuan Li et al.
Unveiling m-Sharpness Through the Structure of Stochastic Gradient Noise
Haocheng Luo, Mehrtash Harandi, Dinh Phung et al.
Unveiling the Compositional Ability Gap in Vision-Language Reasoning Model
Tianle Li, Jihai Zhang, Yongming Rao et al.
Unveiling the Learning Mind of Language Models: A Cognitive Framework and Empirical Study
Zhengyu Hu, Jianxun Lian, Zheyuan Xiao et al.
Unveiling the Power of Multiple Gossip Steps: A Stability-Based Generalization Analysis in Decentralized Training
Unveiling the Spatial-temporal Effective Receptive Fields of Spiking Neural Networks
Jieyuan (Eric) Zhang, Xiaolong Zhou, Shuai Wang et al.
Unveiling the Uncertainty in Embodied and Operational Carbon of Large AI Models through a Probabilistic Carbon Accounting Model
Xiaoyang Zhang, He Fang, Yang Deng et al.
Unveiling Transformer Perception by Exploring Input Manifolds
Alessandro Benfenati, Alfio Ferrara, Alessio Marta et al.
UrbanIng-V2X: A Large-Scale Multi-Vehicle, Multi-Infrastructure Dataset Across Multiple Intersections for Cooperative Perception
Karthikeyan Chandra Sekaran, Markus Geisler, Dominik Rößle et al.
URB - Urban Routing Benchmark for RL-equipped Connected Autonomous Vehicles
Ahmet Onur Akman, Anastasia Psarou, Michał Hoffmann et al.
URDF-Anything: Constructing Articulated Objects with 3D Multimodal Language Model
Zhe Li, Xiang Bai, Jieyu Zhang et al.
U-REPA: Aligning Diffusion U-Nets to ViTs
Yuchuan Tian, Hanting Chen, Mengyu Zheng et al.
URLs Help, Topics Guide: Understanding Metadata Utility in LLM Training
Dongyang Fan, Vinko Sabolčec, Martin Jaggi
User-Instructed Disparity-aware Defocus Control
Yudong Han, Yan Yang, Hao Yang et al.
UtilGen: Utility-Centric Generative Data Augmentation with Dual-Level Task Adaptation
Jiyu Guo, Shuo Yang, Yiming Huang et al.
Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs
Mantas Mazeika, Xuwang Yin, Rishub Tamirisa et al.
UVE: Are MLLMs Unified Evaluators for AI-Generated Videos?
Yuanxin Liu, Rui Zhu, Shuhuai Ren et al.
V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation
Hanyue Lou, Jinxiu Liang, Minggui Teng et al.
V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception
Lei Yang, Xinyu Zhang, Jun Li et al.
VADB: A Large-Scale Video Aesthetic Database with Professional and Multi-Dimensional Annotations
Qianqian Qiao, DanDan Zheng, Yihang Bo et al.
Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought
Chao Huang, Benfeng Wang, Wei Wang et al.
VADTree: Explainable Training-Free Video Anomaly Detection via Hierarchical Granularity-Aware Tree
Wenlong Li, Yifei Xu, Yuan Rao et al.
VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents
Kangrui Wang, Pingyue Zhang, Zihan Wang et al.
VA-GS: Enhancing the Geometric Representation of Gaussian Splatting via View Alignment
Qing Li, Huifang Feng, Xun Gong et al.
Validating LLM-as-a-Judge Systems under Rating Indeterminacy
Luke Guerdan, Solon Barocas, Kenneth Holstein et al.
Valid Inference with Imperfect Synthetic Data
Yewon Byun, Shantanu Gupta, Zachary Lipton et al.
Valid Selection among Conformal Sets
Mahmoud Hegazy, Liviu Aolaritei, Michael Jordan et al.
Value Diffusion Reinforcement Learning
Xiaoliang Hu, Fuyun Wang, Tong Zhang et al.
Value Gradient Guidance for Flow Matching Alignment
Zhen Liu, Tim Xiao, Carles Domingo i Enrich et al.
Value-Guided Decision Transformer: A Unified Reinforcement Learning Framework for Online and Offline Settings
Hongling Zheng, Li Shen, Yong Luo et al.
Value-Guided KV Compression for LLMs via Approximated CUR Decomposition
Ayan Sengupta, Siddhant Chaudhary, Tanmoy Chakraborty
Value-Guided Search for Efficient Chain-of-Thought Reasoning
Kaiwen Wang, Jin Zhou, Jonathan Chang et al.
Value Improved Actor Critic Algorithms
Yaniv Oren, Moritz Zanger, Pascal van der Vaart et al.
VaMP: Variational Multi-Modal Prompt Learning for Vision-Language Models
Silin Cheng, Kai Han