ECCV Papers

2,387 papers found • Page 47 of 48

ViG-Bias: Visually Grounded Bias Discovery and Mitigation

Badr-Eddine Marani, Mohamed HANINI, Nihitha Malayarukil et al.

ECCV 2024poster
2
citations

ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling

Siming Yan, Min Bai, Weifeng Chen et al.

ECCV 2024poster

ViLA: Efficient Video-Language Alignment for Video Question Answering

Xijun Wang, Junbang Liang, Chun-Kai Wang et al.

ECCV 2024poster
22
citations

ViPer: Visual Personalization of Generative Models via Individual Preference Learning

Sogand Salehi, Mahdi Shafiei, Roman Bachmann et al.

ECCV 2024posterarXiv:2407.17365
10
citations

V-IRL: Grounding Virtual Intelligence in Real Life

Jihan YANG, Runyu Ding, Ellis L Brown et al.

ECCV 2024posterarXiv:2402.03310
35
citations

VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement

Hanjung Kim, Jaehyun Kang, Miran Heo et al.

ECCV 2024posterarXiv:2312.04885
7
citations

VISA: Reasoning Video Object Segmentation via Large Language Model

Cilin Yan, haochen wang, Shilin Yan et al.

ECCV 2024poster
95
citations

VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding

Ofir Abramovich, Niv Nayman, Sharon Fogel et al.

ECCV 2024poster
6
citations

Visible and Clear: Finding Tiny Objects in Difference Map

Bing Cao, Haiyu Yao, Pengfei Zhu et al.

ECCV 2024posterarXiv:2405.11276
19
citations

Vision-Language Action Knowledge Learning for Semantic-Aware Action Quality Assessment

Huangbiao Xu, Xiao Ke, Yuezhou Li et al.

ECCV 2024poster
14
citations

Vision-Language Dual-Pattern Matching for Out-of-Distribution Detection

Zihan Zhang, Zhuo Xu, Xiang Xiang

ECCV 2024poster

VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks

Xiangxiang Chu, Jianlin Su, Bo Zhang et al.

ECCV 2024poster

VisionTrap: Vision-Augmented Trajectory Prediction Guided by Textual Descriptions

Seokha Moon, Hyun Woo, Hongbeen Park et al.

ECCV 2024poster

Vista3D: unravel the 3d darkside of a single image

Qiuhong Shen, Xingyi Yang, Michael Bi Mi et al.

ECCV 2024poster
3
citations

Visual Alignment Pre-training for Sign Language Translation

Peiqi Jiao, Yuecong Min, Xilin CHEN

ECCV 2024poster

Visual Grounding for Object-Level Generalization in Reinforcement Learning

Haobin Jiang, Zongqing Lu

ECCV 2024posterarXiv:2408.01942
3
citations

Visual Prompting via Partial Optimal Transport

MENGYU ZHENG, Zhiwei Hao, Yehui Tang et al.

ECCV 2024poster

Visual Relationship Transformation

Xiaoyu Xu, Jiayan Qiu, Baosheng Yu et al.

ECCV 2024poster

Visual Text Generation in the Wild

Yuanzhi Zhu, Jiawei Liu, Feiyu Gao et al.

ECCV 2024poster

VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models

Shicheng Li, Lei Li, Yi Liu et al.

ECCV 2024poster
49
citations

VividDreamer: Invariant Score Distillation for Hyper-Realistic Text-to-3D Generation

Wenjie Zhuo, Fan Ma, Hehe Fan et al.

ECCV 2024poster
16
citations

VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition

Ahmad Khaliq, Ming Xu, Stephen Hausler et al.

ECCV 2024poster
6
citations

Volumetric Rendering with Baked Quadrature Fields

Gopal Sharma, Daniel Rebain, Kwang Moo Yi et al.

ECCV 2024posterarXiv:2312.02202
10
citations

VP-SAM: Taming Segment Anything Model for Video Polyp Segmentation via Disentanglement and Spatio-temporal Side Network

Zhixue Fang, Yuzhi Liu, Huisi Wu et al.

ECCV 2024poster
2
citations

VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving

Yibo Liu, Zheyuan Yang, Guile Wu et al.

ECCV 2024poster

VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space

Guénolé Fiche, Simon Leglaive, Xavier Alameda-Pineda et al.

ECCV 2024posterarXiv:2312.08291
10
citations

VSViG: Real-time Video-based Seizure Detection via Skeleton-based Spatiotemporal ViG

Yankun Xu, Junzhe Wang, Yun-Hsuan Chen et al.

ECCV 2024posterarXiv:2311.14775
5
citations

V-Trans4Style: Visual Transition Recommendation for Video Production Style Adaptation

Pooja Guhan, Tsung-Wei Huang, Guan-Ming Su et al.

ECCV 2024posterarXiv:2501.07983

Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Object Appearance Graphs

Mattia Segu, Luigi Piccinelli, Siyuan Li et al.

ECCV 2024poster
3
citations

WAS: Dataset and Methods for Artistic Text Segmentation

Xudong Xie, Yuzhe Li, Yang Liu et al.

ECCV 2024poster
3
citations

WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians

Dmytro Kotovenko, Olga Grebenkova, Nikolaos Sarafianos et al.

ECCV 2024poster

Watching it in Dark: A Target-aware Representation Learning Framework for High-Level Vision Tasks in Low Illumination

Yunan LI, Yihao Zhang, Shoude Li et al.

ECCV 2024poster
6
citations

Watch Your Steps: Local Image and Scene Editing by Text Instructions

Ashkan Mirzaei, Tristan T Aumentado-Armstrong, Marcus A Brubaker et al.

ECCV 2024poster

Wavelength-Embedding-guided Filter-Array Transformer for Spectral Demosaicing

haijin zeng, Hiep Luong, Wilfried Philips

ECCV 2024poster
1
citations

Wavelet Convolutions for Large Receptive Fields

Shahaf Finder, Roy Amoyal, Eran Treister et al.

ECCV 2024poster

WAVE: Warping DDIM Inversion Features for Zero-shot Text-to-Video Editing

Yutang Feng, Sicheng Gao, Yuxiang Bao et al.

ECCV 2024poster

WBP: Training-time Backdoor Attacks through Hardware-based Weight Bit Poisoning

Kunbei Cai, Zhenkai Zhang, Qian Lou et al.

ECCV 2024poster

Weakly-Supervised 3D Hand Reconstruction with Knowledge Prior and Uncertainty Guidance

Yufei Zhang, Jeffrey Kephart, Qiang Ji

ECCV 2024posterarXiv:2407.12307
3
citations

Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance

Kuan-Chih Huang, Yi-Hsuan Tsai, Ming-Hsuan Yang

ECCV 2024poster

Weakly-supervised Camera Localization by Ground-to-satellite Image Registration

Yujiao Shi, HONGDONG LI, Akhil Perincherry et al.

ECCV 2024poster

Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation

Xinyu Yang, Hossein Rahmani, Sue Black et al.

ECCV 2024posterarXiv:2402.17891
5
citations

Weakly-Supervised Spatio-Temporal Video Grounding with Variational Cross-Modal Alignment

Yang Jin, Yadong Mu

ECCV 2024poster

Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection

Kwanyong Park, Kuniaki Saito, Donghyun Kim

ECCV 2024poster

Wear-Any-Way: Manipulable Virtual Try-on via Sparse Correspondence Alignment

Mengting Chen, Xi Chen, Zhonghua Zhai et al.

ECCV 2024posterarXiv:2403.12965
38
citations

WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation

Zirui Shao, Feiyu Gao, Hangdi Xing et al.

ECCV 2024posterarXiv:2407.15502
5
citations

WeConvene: Learned Image Compression with Wavelet-Domain Convolution and Entropy Model

Haisheng Fu, Jie Liang, Zhenman Fang et al.

ECCV 2024poster
30
citations

WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text Spotting

Jingjing Wu, Zhengyao Fang, Pengyuan Lyu et al.

ECCV 2024poster
4
citations

Weight Conditioning for Smooth Optimization of Neural Networks

Hemanth Saratchandran, Thomas X Wang, Simon Lucey

ECCV 2024poster
7
citations

Weighted Ensemble Models Are Strong Continual Learners

Imad Eddine Marouf, Subhankar Roy, Enzo Tartaglione et al.

ECCV 2024poster

Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation

Prantik Howlader, Hieu Le, Dimitris Samaras

ECCV 2024posterarXiv:2407.12630
2
citations