NeurIPS Papers
Vanish into Thin Air: Cross-prompt Universal Adversarial Attacks for SAM2
Ziqi Zhou, Yifan Hu, Yufei Song et al.
VaporTok: RL-Driven Adaptive Video Tokenizer with Prior & Task Awareness
Minghao Yang, Zechen Bai, Jing Lin et al.
\(\varepsilon\)-Optimally Solving Two-Player Zero-Sum POSGs
Erwan Escudie, Matthia Sabatelli, Olivier Buffet et al.
VarFlow: Proper Scoring-Rule Diffusion Distillation via Energy Matching
Huiyang Shao, Xin Xia, Yuxi Ren et al.
Variance-Aware Feel-Good Thompson Sampling for Contextual Bandits
Xuheng Li, Quanquan Gu
Variance-Reduced Long-Term Rehearsal Learning with Quadratic Programming Reformulation
Wen-Bo Du, Tian Qin, Tian-Zuo Wang et al.
Variational Inference with Mixtures of Isotropic Gaussians
Marguerite Petit-Talamon, Marc Lambert, Anna Korba
Variational Learning Finds Flatter Solutions at the Edge of Stability
Avrajit Ghosh, Bai Cong, Rio Yokota et al.
Variational Polya Tree
Lu Xu, Tsai Hor Chan, Lequan Yu et al.
Variational Regularized Unbalanced Optimal Transport: Single Network, Least Action
Yuhao Sun, Zhenyi Zhang, Zihan Wang et al.
Variational Supervised Contrastive Learning
Ziwen Wang, Jiajun Fan, Thao Nguyen et al.
Variational Task Vector Composition
Boyuan Zhang, Yingjun Du, Xiantong Zhen et al.
Variational Uncertainty Decomposition for In-Context Learning
I. Shavindra Jayasekera, Jacob Si, Filippo Valdettaro et al.
VASA-3D: Lifelike Audio-Driven Gaussian Head Avatars from a Single Image
Sicheng Xu, Guojun Chen, Jiaolong Yang et al.
V-CECE: Visual Counterfactual Explanations via Conceptual Edits
Nikolaos Spanos, Maria Lymperaiou, Giorgos Filandrianos et al.
VCM: Vision Concept Modeling with Adaptive Vision Token Compression via Instruction Fine-Tuning
Run Luo, Renke Shan, Longze Chen et al.
Vector Database Watermarking
Zhiwen Ren, Wei Fan, Qiyi Yao et al.
Vector Quantization in the Brain: Grid-like Codes in World Models
Xiangyuan Peng, Xingsi Dong, Si Wu
Venus-MAXWELL: Efficient Learning of Protein-Mutation Stability Landscapes using Protein Language Models
Yuanxi Yu, Fan Jiang, Xinzhu Ma et al.
VERA: Variational Inference Framework for Jailbreaking Large Language Models
Anamika Lochab, Lu Yan, Patrick Pynadath et al.
VeriLoC: Line-of-Code Level Prediction of Hardware Design Quality from Verilog Code
Raghu Vamshi Hemadri, Jitendra Bhandari, Andre Nakkab et al.
VeriThinker: Learning to Verify Makes Reasoning Model Efficient
Zigeng Chen, Xinyin Ma, Gongfan Fang et al.
VeriThoughts: Enabling Automated Verilog Code Generation using Reasoning and Formal Verification
Patrick Yubeaton, Andre Nakkab, Weihua Xiao et al.
Versatile differentially private learning for general loss functions
Qilong Lu, Songxi Chen, Yumou Qiu
Versatile Transferable Unlearnable Example Generator
Zhihao Li, Jiale Cai, Gezheng Xu et al.
Vertical Federated Feature Screening
Huajun Yin, Liyuan Wang, Yingqiu Zhu et al.
VESSA: Video-based objEct-centric Self-Supervised Adaptation for Visual Foundation Models
Jesimon Barreto, Carlos Caetano, Andre Araujo et al.
VETA-DiT: Variance-Equalized and Temporally Adaptive Quantization for Efficient 4-bit Diffusion Transformers
Qinkai Xu, Yijin Liu, Yang Chen et al.
VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption
Tianxiong Zhong, Xingye Tian, Boyuan Jiang et al.
Vgent: Graph-based Retrieval-Reasoning-Augmented Generation For Long Video Understanding
Xiaoqian Shen, Wenxuan Zhang, Jun Chen et al.
VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold
Dominic Maggio, Hyungtae Lim, Luca Carlone
vHector and HeisenVec: Scalable Vector Graphics Generation Through Large Language Models
Leonardo Zini, Elia Frigieri, Sebastiano Aloscari et al.
VIBE: Annotation-Free Video-to-Text Information Bottleneck Evaluation for TL;DR
Shenghui Chen, Po-han Li, Sandeep Chinchali et al.
Vicinal Label Supervision for Reliable Aleatoric and Epistemic Uncertainty Estimation
Linye Li, Yufei Chen, Xiaodong Yue
Vicinity-Guided Discriminative Latent Diffusion for Privacy-Preserving Domain Adaptation
Jing Wang, Wonho Bae, Jiahong Chen et al.
ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs
Xiyao Wang, Zhengyuan Yang, Chao Feng et al.
ViDAR: Video Diffusion-Aware 4D Reconstruction From Monocular Inputs
Michal Nazarczuk, Sibi Catley-Chandar, Thomas Tanay et al.
VidEmo: Affective-Tree Reasoning for Emotion-Centric Video Foundation Models
Zhicheng Zhang, Weicheng Wang, Yongjie Zhu et al.
VideoCAD: A Dataset and Model for Learning Long‑Horizon 3D CAD UI Interactions from Video
King Yiu Brandon Man, Ghadi Nehme, Md Ferdous Alam et al.
VideoChat-R1.5: Visual Test-Time Scaling to Reinforce Multimodal Reasoning by Iterative Perception
Ziang Yan, Yinan He, Xinhao Li et al.
Video Diffusion Models Excel at Tracking Similar-Looking Objects Without Supervision
Chenshuang Zhang, Kang Zhang, Joon Son Chung et al.
VideoGameQA-Bench: Evaluating Vision-Language Models for Video Game Quality Assurance
Mohammad Reza Taesiri, Abhijay Ghildyal, Saman Zadtootaghaj et al.
VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video Understanding
Zongxia Li, Xiyang Wu, Guangyao Shi et al.
VideoLucy: Deep Memory Backtracking for Long Video Understanding
Jialong Zuo, Yongtai Deng, Lingdong Kong et al.
VideoMAR: Autoregressive Video Generation with Continuous Tokens
Hu Yu, Biao Gong, Hangjie Yuan et al.
Video Perception Models for 3D Scene Synthesis
Rui Huang, Guangyao Zhai, Zuria Bauer et al.
Video-R1: Reinforcing Video Reasoning in MLLMs
Kaituo Feng, Kaixiong Gong, Bohao Li et al.
Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
Yongdong Luo, Xiawu Zheng, Guilin Li et al.
VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models
Xiangdong Zhang, Jiaqi Liao, Shaofeng Zhang et al.
VideoRFT: Incentivizing Video Reasoning Capability in MLLMs via Reinforced Fine-Tuning
Qi Wang, Yanrui Yu, Ye Yuan et al.