All Papers

34,598 papers found • Page 687 of 692

VMINer: Versatile Multi-view Inverse Rendering with Near- and Far-field Light Sources

Fan Fei, Jiajun Tang, Ping Tan et al.

CVPR 2024highlight

VMT-Adapter: Parameter-Efficient Transfer Learning for Multi-Task Dense Scene Understanding

Yi Xin, Junlong Du, Qiang Wang et al.

AAAI 2024paperarXiv:2312.08733
82
citations

VNN: Verification-Friendly Neural Networks with Hard Robustness Guarantees

Anahita Baninajjar, Ahmed Rezine, Amir Aminifar

ICML 2024posterarXiv:2312.09748

Vocabulary for Universal Approximation: A Linguistic Perspective of Mapping Compositions

Yongqiang Cai

ICML 2024spotlightarXiv:2305.12205

VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis

Linshan Wu, Jia-Xin Zhuang, Hao Chen

CVPR 2024posterarXiv:2402.17300
76
citations

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Hubert Siuzdak

ICLR 2024posterarXiv:2306.00814

Volumetric Environment Representation for Vision-Language Navigation

Liu, Wenguan Wang, Yi Yang

CVPR 2024highlightarXiv:2403.14158

Volumetric Rendering with Baked Quadrature Fields

Gopal Sharma, Daniel Rebain, Kwang Moo Yi et al.

ECCV 2024posterarXiv:2312.02202
10
citations

VONet: Unsupervised Video Object Learning With Parallel U-Net Attention and Object-wise Sequential VAE

Haonan Yu, Wei Xu

ICLR 2024oralarXiv:2401.11110

VOODOO 3D: Volumetric Portrait Disentanglement For One-Shot 3D Head Reenactment

Phong Tran, Egor Zakharov, Long Nhat Ho et al.

CVPR 2024posterarXiv:2312.04651
29
citations

VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language Model

Pengying Wu, Yao Mu, Bingxian Wu et al.

ICML 2024posterarXiv:2401.02695

Voxel or Pillar: Exploring Efficient Point Cloud Representation for 3D Object Detection

Yuhao Huang, Sanping Zhou, Junjie Zhang et al.

AAAI 2024paperarXiv:2304.02867

VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation

Yang Chen, Yingwei Pan, haibo yang et al.

CVPR 2024posterarXiv:2403.17001
30
citations

VPDETR: End-to-End Vanishing Point DEtection TRansformers

Taiyan Chen, Xianghua Ying, Jinfa Yang et al.

AAAI 2024paper
4
citations

VP-SAM: Taming Segment Anything Model for Video Polyp Segmentation via Disentanglement and Spatio-temporal Side Network

Zhixue Fang, Yuzhi Liu, Huisi Wu et al.

ECCV 2024poster
2
citations

VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving

Yibo Liu, Zheyuan Yang, Guile Wu et al.

ECCV 2024posterarXiv:2407.06516
10
citations

VQAttack: Transferable Adversarial Attacks on Visual Question Answering via Pre-trained Models

Ziyi Yin, Muchao Ye, Tianrong Zhang et al.

AAAI 2024paperarXiv:2402.11083

VQCNIR: Clearer Night Image Restoration with Vector-Quantized Codebook

Wenbin Zou, Hongxia Gao, Tian Ye et al.

AAAI 2024paperarXiv:2312.08606

VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling

Siyuan Li, Zedong Wang, Zicheng Liu et al.

ICML 2024posterarXiv:2405.10812

VQGraph: Rethinking Graph Representation Space for Bridging GNNs and MLPs

Ling Yang, Ye Tian, Minkai Xu et al.

ICLR 2024posterarXiv:2308.02117

VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space

Guénolé Fiche, Simon Leglaive, Xavier Alameda-Pineda et al.

ECCV 2024posterarXiv:2312.08291
10
citations

VQ-TR: Vector Quantized Attention for Time Series Forecasting

Kashif Rasul, Andrew Bennett, Pablo Vicente et al.

ICLR 2024poster

VRetouchEr: Learning Cross-frame Feature Interdependence with Imperfection Flow for Face Retouching in Videos

Wen Xue, Le Jiang, Lianxin Xie et al.

CVPR 2024poster
1
citations

VRP-SAM: SAM with Visual Reference Prompt

Yanpeng Sun, Jiahui Chen, Shan Zhang et al.

CVPR 2024posterarXiv:2402.17726

VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning

Ziyang Luo, Nian Liu, Wangbo Zhao et al.

CVPR 2024posterarXiv:2311.15011

VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning

Tangfei Liao, Xiaoqin Zhang, Li Zhao et al.

AAAI 2024paperarXiv:2312.08774
15
citations

VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection

Zihua Liu, Hiroki Sakuma, Masatoshi Okutomi

CVPR 2024posterarXiv:2404.00149

VS: Reconstructing Clothed 3D Human from Single Image via Vertex Shift

Leyuan Liu, Yuhan Li, Yunqi Gao et al.

CVPR 2024poster

VSViG: Real-time Video-based Seizure Detection via Skeleton-based Spatiotemporal ViG

Yankun Xu, Junzhe Wang, Yun-Hsuan Chen et al.

ECCV 2024posterarXiv:2311.14775
5
citations

VTimeLLM: Empower LLM to Grasp Video Moments

Bin Huang, Xin Wang, Hong Chen et al.

CVPR 2024highlightarXiv:2311.18445
244
citations

VTQA: Visual Text Question Answering via Entity Alignment and Cross-Media Reasoning

Kang Chen, Xiangqian Wu

CVPR 2024posterarXiv:2303.02635
19
citations

V-Trans4Style: Visual Transition Recommendation for Video Production Style Adaptation

Pooja Guhan, Tsung-Wei Huang, Guan-Ming Su et al.

ECCV 2024posterarXiv:2501.07983

VVS: Video-to-Video Retrieval with Irrelevant Frame Suppression

Won Jo, Geuntaek Lim, Gwangjin Lee et al.

AAAI 2024paperarXiv:2303.08906
10
citations

W2P: Switching from Weak Supervision to Partial Supervision for Semantic Segmentation

Fangyuan Zhang, Tianxiang Pan, Junhai Yong et al.

AAAI 2024paper

Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Object Appearance Graphs

Mattia Segu, Luigi Piccinelli, Siyuan Li et al.

ECCV 2024poster
3
citations

WALT3D: Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects Under Occlusion

Khiem Vuong, N. Dinesh Reddy, Robert Tamburo et al.

CVPR 2024posterarXiv:2403.19022
3
citations

WANDR: Intention-guided Human Motion Generation

Markos Diomataris, Nikos Athanasiou, Omid Taheri et al.

CVPR 2024posterarXiv:2404.15383

WARM: On the Benefits of Weight Averaged Reward Models

Alexandre Rame, Nino Vieillard, Léonard Hussenot et al.

ICML 2024poster

WAS: Dataset and Methods for Artistic Text Segmentation

Xudong Xie, Yuzhe Li, Yang Liu et al.

ECCV 2024posterarXiv:2408.00106
3
citations

Wasserstein Differential Privacy

Chengyi Yang, Jiayin Qi, Aimin Zhou

AAAI 2024paperarXiv:2401.12436

Wasserstein Wormhole: Scalable Optimal Transport Distance with Transformer

Doron Haviv, Russell Kunes, Thomas Dougherty et al.

ICML 2024posterarXiv:2404.09411

WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians

Dmytro Kotovenko, Olga Grebenkova, Nikolaos Sarafianos et al.

ECCV 2024posterarXiv:2409.17917

Watching it in Dark: A Target-aware Representation Learning Framework for High-Level Vision Tasks in Low Illumination

Yunan LI, Yihao Zhang, Shoude Li et al.

ECCV 2024poster
6
citations

Watch Your Head: Assembling Projection Heads to Save the Reliability of Federated Models

Authors: Jinqian Chen, Jihua Zhu, Qinghai Zheng et al.

AAAI 2024paperarXiv:2402.16255
3
citations

Watch Your Steps: Local Image and Scene Editing by Text Instructions

Ashkan Mirzaei, Tristan T Aumentado-Armstrong, Marcus A Brubaker et al.

ECCV 2024posterarXiv:2308.08947

WateRF: Robust Watermarks in Radiance Fields for Protection of Copyrights

Youngdong Jang, Dong In Lee, MinHyuk Jang et al.

CVPR 2024posterarXiv:2405.02066
25
citations

Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models

Peifei Zhu, Tsubasa Takahashi, Hirokatsu Kataoka

CVPR 2024posterarXiv:2404.09401
34
citations

Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy

Yu Fu, Deyi Xiong, Yue Dong

AAAI 2024paperarXiv:2307.13808
55
citations

Watermarks in the Sand: Impossibility of Strong Watermarking for Language Models

Hanlin Zhang, Benjamin Edelman, Danilo Francati et al.

ICML 2024poster

Watermark Stealing in Large Language Models

Nikola Jovanović, Robin Staab, Martin Vechev

ICML 2024posterarXiv:2402.19361