Most Cited ECCV "pseudo-supervision generation" Papers

2,387 papers found • Page 5 of 12

Filters:Most Cited ECCV pseudo-supervision generation Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#801

Event Camera Data Dense Pre-training

Yan Yang, Liyuan Pan, Liu liu

ECCV 2024arXiv:2311.11533

citations

#802

Just a Hint: Point-Supervised Camouflaged Object Detection

Huafeng Chen, Dian SHAO, Guangqian Guo et al.

ECCV 2024arXiv:2408.10777

citations

#803

PLOT: Text-based Person Search with Part Slot Attention for Corresponding Part Discovery

Jicheol Park, Dongwon Kim, Boseung Jeong et al.

ECCV 2024arXiv:2409.13475

citations

#804

EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking Head

Qianyun He, Xinya Ji, Yicheng Gong et al.

ECCV 2024arXiv:2408.00297

citations

#805

Visual Text Generation in the Wild

Yuanzhi Zhu, Jiawei Liu, Feiyu Gao et al.

ECCV 2024arXiv:2407.14138

citations

#806

Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance

Reyhane Askari Hemmat, Melissa Hall, Alicia Yi Sun et al.

ECCV 2024arXiv:2406.04551

citations

#807

HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts

Wonjae Kim, Sanghyuk Chun, Taekyung Kim et al.

ECCV 2024arXiv:2404.17507

citations

#808

Open Panoramic Segmentation

Junwei Zheng, Ruiping Liu, Yufan Chen et al.

ECCV 2024arXiv:2407.02685

citations

#809

FRDiff : Feature Reuse for Universal Training-free Acceleration of Diffusion Models

Junhyuk So, Jungwon Lee, Eunhyeok Park

ECCV 2024arXiv:2312.03517

citations

#810

Animal Avatars: Reconstructing Animatable 3D Animals from Casual Videos

Remy Sabathier, David Novotny, Niloy Mitra

ECCV 2024arXiv:2403.17103

citations

#811

Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Models

Hao Cheng, Erjia Xiao, Jindong Gu et al.

ECCV 2024arXiv:2402.19150

citations

#812

MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models

Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu, Vishal Patel

ECCV 2024arXiv:2404.09977

citations

#813

Learning Camouflaged Object Detection from Noisy Pseudo Label

Jin Zhang, Ruiheng Zhang, Yanjiao Shi et al.

ECCV 2024arXiv:2407.13157

citations

#814

SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

Lingchen Meng, Shiyi Lan, Hengduo Li et al.

ECCV 2024arXiv:2311.14671

citations

#815

UniProcessor: A Text-induced Unified Low-level Image Processor

Huiyu Duan, Xiongkuo Min, Sijing Wu et al.

ECCV 2024arXiv:2407.20928

citations

#816

Reinforcement Learning Friendly Vision-Language Model for Minecraft

Haobin Jiang, Junpeng Yue, Hao Luo et al.

ECCV 2024arXiv:2303.10571

citations

#817

Temporal Event Stereo via Joint Learning with Stereoscopic Flow

Hoonhee Cho, Jae-young Kang, Kuk-Jin Yoon

ECCV 2024arXiv:2407.10831

citations

#818

Enhancing Source-Free Domain Adaptive Object Detection with Low-confidence Pseudo Label Distillation

Ilhoon Yoon, Hyeongjun Kwon, Jin Kim et al.

ECCV 2024arXiv:2407.13524

citations

#819

The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?

Qinyu Zhao, Ming Xu, Kartik Gupta et al.

ECCV 2024arXiv:2403.09037

citations

#820

Dynamic Retraining-Updating Mean Teacher for Source-Free Object Detection

BA KHANH TRINH LE, Huy-Hung Nguyen, Long Hoang Pham et al.

ECCV 2024arXiv:2407.16497

citations

#821

Instant 3D Human Avatar Generation using Image Diffusion Models

Nikos Kolotouros, Thiemo Alldieck, Enric Corona et al.

ECCV 2024arXiv:2406.07516

citations

#822

NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation

Ruikai Cui, Weizhe Liu, Weixuan Sun et al.

ECCV 2024arXiv:2403.18241

citations

#823

General and Task-Oriented Video Segmentation

Mu Chen, Liulei Li, Wenguan Wang et al.

ECCV 2024arXiv:2407.06540

citations

#824

Generalizable Facial Expression Recognition

Yuhang Zhang, Xiuqi Zheng, Chenyi Liang et al.

ECCV 2024arXiv:2408.10614

citations

#825

EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification

Suorong Yang, Furao Shen, Jian Zhao

ECCV 2024arXiv:2409.06290

citations

#826

CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection

Wuyang Li, Xinyu Liu, Jiayi Ma et al.

ECCV 2024

citations

#827

Controlling the World by Sleight of Hand

Sruthi Sudhakar, Ruoshi Liu, Basile Van Hoorick et al.

ECCV 2024arXiv:2408.07147

citations

#828

EvSign: Sign Language Recognition and Translation with Streaming Events

Pengyu Zhang, Hao Yin, Zeren Wang et al.

ECCV 2024arXiv:2407.12593

citations

#829

InstaStyle: Inversion Noise of a Stylized Image is Secretly a Style Adviser

Xing Cui, Zekun Li, Peipei Li et al.

ECCV 2024arXiv:2311.15040

citations

#830

ScanTalk: 3D Talking Heads from Unregistered Scans

Federico Nocentini, Thomas Besnier, Claudio Ferrari et al.

ECCV 2024arXiv:2403.10942

citations

#831

MoEAD: A Parameter-efficient Model for Multi-class Anomaly Detection

Shiyuan Meng, Wenchao Meng, Qihang Zhou et al.

ECCV 2024

citations

#832

Track Everything Everywhere Fast and Robustly

Yunzhou Song, Jiahui Lei, Ziyun Wang et al.

ECCV 2024arXiv:2403.17931

citations

#833

TetraDiffusion: Tetrahedral Diffusion Models for 3D Shape Generation

Nikolai Kalischek, Torben Peters, Jan Dirk Wegner et al.

ECCV 2024arXiv:2211.13220

citations

#834

GRIDS: Grouped Multiple-Degradation Restoration with Image Degradation Similarity

Shuo Cao, Yihao Liu, Wenlong Zhang et al.

ECCV 2024arXiv:2407.12273

citations

#835

Multi-Label Cluster Discrimination for Visual Representation Learning

Xiang An, Kaicheng Yang, Xiangzi Dai et al.

ECCV 2024arXiv:2407.17331

citations

#836

Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation

Jinfeng Liu, Lingtong Kong, Bo Li et al.

ECCV 2024arXiv:2407.14126

citations

#837

DGD: Dynamic 3D Gaussians Distillation

Isaac Labe, Noam Issachar, Itai Lang et al.

ECCV 2024arXiv:2405.19321

citations

#838

Kalman-Inspired Feature Propagation for Video Face Super-Resolution

Ruicheng Feng, Chongyi Li, Chen Change Loy

ECCV 2024arXiv:2408.05205

citations

#839

UniCode : Learning a Unified Codebook for Multimodal Large Language Models

Sipeng Zheng, Bohan Zhou, Yicheng Feng et al.

ECCV 2024arXiv:2403.09072

citations

#840

Foster Adaptivity and Balance in Learning with Noisy Labels

Mengmeng Sheng, Zeren Sun, Tao Chen et al.

ECCV 2024arXiv:2407.02778

citations

#841

Long-term Temporal Context Gathering for Neural Video Compression

Linfeng Qi, Zhaoyang Jia, Jiahao Li et al.

ECCV 2024

citations

#842

Non-Exemplar Domain Incremental Learning via Cross-Domain Concept Integration

Qiang Wang, Yuhang He, Songlin Dong et al.

ECCV 2024

citations

#843

Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression

Yuan Tian, Guo Lu, Guangtao Zhai

ECCV 2024arXiv:2409.11718

citations

#844

SIGMA: Sinkhorn-Guided Masked Video Modeling

Mohammadreza Salehi, Michael Dorkenwald, Fida Mohammad Thoker et al.

ECCV 2024arXiv:2407.15447

citations

#845

X-Pose: Detecting Any Keypoints

Jie Yang, AILING ZENG, Ruimao Zhang et al.

ECCV 2024arXiv:2310.08530

citations

#846

Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling

Noam Elata, Tomer Michaeli, Michael Elad

ECCV 2024arXiv:2407.08256

citations

#847

FineMatch: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction

Hang Hua, Jing Shi, Kushal Kafle et al.

ECCV 2024arXiv:2404.14715

citations

#848

AttnZero: Efficient Attention Discovery for Vision Transformers

Lujun Li, Zimian Wei, Peijie Dong et al.

ECCV 2024

citations

#849

Referring Atomic Video Action Recognition

Kunyu Peng, Jia Fu, Kailun Yang et al.

ECCV 2024arXiv:2407.01872

citations

#850

Where am I? Scene Retrieval with Language

Jiaqi Chen, Daniel Barath, Iro Armeni et al.

ECCV 2024arXiv:2404.14565

citations

#851

MoVideo: Motion-Aware Video Generation with Diffusion Models

Jingyun Liang, Yuchen Fan, Kai Zhang et al.

ECCV 2024arXiv:2311.11325

citations

#852

Finding Visual Task Vectors

Alberto Hojel, Yutong Bai, Trevor Darrell et al.

ECCV 2024arXiv:2404.05729

citations

#853

Vision-Language Action Knowledge Learning for Semantic-Aware Action Quality Assessment

Huangbiao Xu, Xiao Ke, Yuezhou Li et al.

ECCV 2024

citations

#854

Parrot Captions Teach CLIP to Spot Text

Yiqi Lin, Conghui He, Alex Jinpeng Wang et al.

ECCV 2024arXiv:2312.14232

citations

#855

Free-Editor: Zero-shot Text-driven 3D Scene Editing

Md Nazmul Karim, Hasan Iqbal, Umar Khalid et al.

ECCV 2024arXiv:2312.13663

citations

#856

Robust Multimodal Learning via Representation Decoupling

Shicai Wei, Yang Luo, Yuji Wang et al.

ECCV 2024arXiv:2407.04458

citations

#857

Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors

Jae Joong Lee, Bosheng Li, Sara Beery et al.

ECCV 2024arXiv:2407.10330

citations

#858

Reinforcement Learning Meets Visual Odometry

Nico Messikommer, Giovanni Cioffi, Mathias Gehrig et al.

ECCV 2024arXiv:2407.15626

citations

#859

Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models

Xiao Liu, Xiaoliu Guan, Yu Wu et al.

ECCV 2024arXiv:2407.15328

citations

#860

Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments

Djamahl Etchegaray, Zi Helen Huang, Tatsuya Harada et al.

ECCV 2024arXiv:2403.13556

citations

#861

3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation

Zihao Xiao, Longlong Jing, Shangxuan Wu et al.

ECCV 2024arXiv:2401.02402

citations

#862

ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments

Taewoong Kim, Cheolhong Min, Byeonghwi Kim et al.

ECCV 2024arXiv:2407.18550

citations

#863

IMMA: Immunizing text-to-image Models against Malicious Adaptation

Amber Yijia Zheng, Raymond Yeh

ECCV 2024arXiv:2311.18815

citations

#864

LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation

Archana Swaminathan, Anubhav Anubhav, Kamal Gupta et al.

ECCV 2024arXiv:2409.06703

citations

#865

ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction

Shaozhe Hao, Kai Han, Zhengyao Lv et al.

ECCV 2024arXiv:2407.07077

citations

#866

LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer

Ning Yu, Chia-Chih Chen, Zeyuan Chen et al.

ECCV 2024arXiv:2212.09877

citations

#867

ControlCap: Controllable Region-level Captioning

Yuzhong Zhao, Liu Yue, Zonghao Guo et al.

ECCV 2024arXiv:2401.17910

citations

#868

HUMOS: Human Motion Model Conditioned on Body Shape

Shashank Tripathi, Omid Taheri, Christoph Lassner et al.

ECCV 2024arXiv:2409.03944

citations

#869

Rate-Distortion-Cognition Controllable Versatile Neural Image Compression

Jinming Liu, Ruoyu Feng, Yunpeng Qi et al.

ECCV 2024arXiv:2407.11700

citations

#870

Norface: Improving Facial Expression Analysis by Identity Normalization

Hanwei Liu, Rudong An, Zhimeng Zhang et al.

ECCV 2024arXiv:2407.15617

citations

#871

RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes

Thang-Anh-Quan Nguyen, Luis G Roldao Jimenez, Nathan Piasco et al.

ECCV 2024arXiv:2403.09419

citations

#872

PointNeRF++: A multi-scale, point-based Neural Radiance Field

Weiwei Sun, Eduard Trulls, Yang-Che Tseng et al.

ECCV 2024arXiv:2312.02362

citations

#873

Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL

Fangwei Zhong, Kui Wu, Hai Ci et al.

ECCV 2024arXiv:2404.09857

citations

#874

Neural Volumetric World Models for Autonomous Driving

Zanming Huang, Jimuyang Zhang, Eshed Ohn-Bar

ECCV 2024

citations

#875

Event-Adapted Video Super-Resolution

Zeyu Xiao, Dachun Kai, Yueyi Zhang et al.

ECCV 2024

citations

#876

Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection

QIJIE MO, Yipeng Gao, Shenghao Fu et al.

ECCV 2024arXiv:2407.11499

citations

#877

SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders

Sheng-Wei Li, Zi-Xiang Wei, Wei-Jie Jack Chen et al.

ECCV 2024arXiv:2407.13460

citations

#878

Grounding Language Models for Visual Entity Recognition

Zilin Xiao, Ming Gong, Paola Cascante-Bonilla et al.

ECCV 2024arXiv:2402.18695

citations

#879

PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects

Junyi Li, Junfeng Wu, Weizhi Zhao et al.

ECCV 2024arXiv:2407.16696

citations

#880

Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures

Jiaxing Huang, Yanfeng Zhou, Yaoru Luo et al.

ECCV 2024arXiv:2407.14754

citations

#881

ChEX: Interactive Localization and Region Description in Chest X-rays

Philip Müller, Georgios Kaissis, Daniel Rueckert

ECCV 2024arXiv:2404.15770

citations

#882

DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion

Liao Shen, Tianqi Liu, Huiqiang Sun et al.

ECCV 2024arXiv:2409.09605

citations

#883

ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders

Jefferson Hernandez, Ruben Villegas, Vicente Ordonez

ECCV 2024arXiv:2303.12001

citations

#884

Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models

Chen Ju, Haicheng Wang, Haozhe Cheng et al.

ECCV 2024arXiv:2407.11717

citations

#885

Learning Representations of Satellite Images From Metadata Supervision

Jules Bourcier, Gohar Dashyan, Karteek Alahari et al.

ECCV 2024

citations

#886

Rethinking Features-Fused-Pyramid-Neck for Object Detection

Hulin Li

ECCV 2024arXiv:2505.12820

citations

#887

How to Train the Teacher Model for Effective Knowledge Distillation

Shayan Mohajer Hamidi, Xizhen Deng, Renhao Tan et al.

ECCV 2024arXiv:2407.18041

citations

#888

MagicEraser: Erasing Any Objects via Semantics-Aware Control

FAN LI, Zixiao Zhang, Yi Huang et al.

ECCV 2024arXiv:2410.10207

citations

#889

Improving Feature Stability during Upsampling -- Spectral Artifacts and the Importance of Spatial Context

Shashank Agnihotri, Julia Grabinski, Margret Keuper

ECCV 2024arXiv:2311.17524

citations

#890

Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation

Zeyang Zhao, Qilong Xue, Yifan Bai et al.

ECCV 2024arXiv:2407.08489

citations

#891

On the Utility of 3D Hand Poses for Action Recognition

Md Salman Shamil, Dibyadip Chatterjee, Fadime Sener et al.

ECCV 2024arXiv:2403.09805

citations

#892

Semi-supervised Segmentation of Histopathology Images with Noise-Aware Topological Consistency

Meilong Xu, Xiaoling Hu, Saumya Gupta et al.

ECCV 2024arXiv:2311.16447

citations

#893

Teddy: Efficient Large-Scale Dataset Distillation via Taylor-Approximated Matching

Ruonan Yu, Songhua Liu, Jingwen Ye et al.

ECCV 2024arXiv:2410.07579

citations

#894

3iGS: Factorised Tensorial Illumination for 3D Gaussian Splatting

Zhe Jun Tang, Tat-Jen Cham

ECCV 2024arXiv:2408.03753

citations

#895

Editable Image Elements for Controllable Synthesis

Jiteng Mu, Michael Gharbi, Richard Zhang et al.

ECCV 2024arXiv:2404.16029

citations

#896

Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation

Zhengyuan Xie, Haiquan Lu, Jia-wen Xiao et al.

ECCV 2024arXiv:2407.14142

citations

#897

Few-shot Defect Image Generation based on Consistency Modeling

Qingfeng Shi, Jing Wei, Fei Shen et al.

ECCV 2024arXiv:2408.00372

citations

#898

OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation

Kwanyoung Kim, Yujin Oh, Jong Chul Ye

ECCV 2024arXiv:2403.14183

citations

#899

Boosting 3D Single Object Tracking with 2D Matching Distillation and 3D Pre-training

qiangqiang wu, Yan Xia, Jia Wan et al.

ECCV 2024

citations

#900

BKDSNN: Enhancing the Performance of Learning-based Spiking Neural Networks Training with Blurred Knowledge Distillation

Zekai Xu, Kang You, Qinghai Guo et al.

ECCV 2024arXiv:2407.09083

citations

#901

InstructGIE: Towards Generalizable Image Editing

Zichong Meng, Changdi Yang, Jun Liu et al.

ECCV 2024arXiv:2403.05018

citations

#902

Brain-ID: Learning Contrast-agnostic Anatomical Representations for Brain Imaging

Peirong Liu, Oula Puonti, Xiaoling Hu et al.

ECCV 2024arXiv:2311.16914

citations

#903

Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation

Peng Jin, Hao Li, Zesen Cheng et al.

ECCV 2024arXiv:2407.10528

citations

#904

Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis

Qi Sun, Hang Zhou, Wengang Zhou et al.

ECCV 2024arXiv:2407.05388

citations

#905

Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image

Pengkun Jiao, Na Zhao, Jingjing Chen et al.

ECCV 2024arXiv:2407.05256

citations

#906

Fairness-aware Vision Transformer via Debiased Self-Attention

Yao Qiang, Chengyin Li, Prashant Khanduri et al.

ECCV 2024arXiv:2301.13803

citations

#907

Open-Vocabulary Camouflaged Object Segmentation

Youwei Pang, Xiaoqi Zhao, JiaMing Zuo et al.

ECCV 2024arXiv:2311.11241

citations

#908

Geospecific View Generation - Geometry-Context Aware High-resolution Ground View Inference from Satellite Views

Ningli Xu, Rongjun Qin

ECCV 2024arXiv:2407.08061

citations

#909

STAMP: Outlier-Aware Test-Time Adaptation with Stable Memory Replay

Yu Yongcan, Lijun Sheng, Ran He et al.

ECCV 2024arXiv:2407.15773

citations

#910

WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural Language

Zhenxiang Lin, Xidong Peng, peishan cong et al.

ECCV 2024arXiv:2304.05645

citations

#911

Towards Image Ambient Lighting Normalization

Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei Wu et al.

ECCV 2024arXiv:2403.18730

citations

#912

Harnessing Text-to-Image Diffusion Models for Category-Agnostic Pose Estimation

Duo Peng, Zhengbo Zhang, Ping Hu et al.

ECCV 2024

citations

#913

ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer

Jiazhi Guan, Zhiliang Xu, Hang Zhou et al.

ECCV 2024arXiv:2408.03284

citations

#914

GENIXER: Empowering Multimodal Large Language Models as a Powerful Data Generator

Hengyuan Zhao, Pan Zhou, Mike Zheng Shou

ECCV 2024arXiv:2312.06731

citations

#915

BAFFLE: A Baseline of Backpropagation-Free Federated Learning

Haozhe Feng, Tianyu Pang, Chao Du et al.

ECCV 2024arXiv:2301.12195

citations

#916

UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding

Bowen Shi, Peisen Zhao, Zichen Wang et al.

ECCV 2024arXiv:2401.06397

citations

#917

TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks

Jinjie Mai, Wenxuan Zhu, Sara Rojas Martinez et al.

ECCV 2024arXiv:2408.10739

citations

#918

Can OOD Object Detectors Learn from Foundation Models?

Jiahui Liu, Xin Wen, Shizhen Zhao et al.

ECCV 2024arXiv:2409.05162

citations

#919

3DGazeNet: Generalizing Gaze Estimation with Weak Supervision from Synthetic Views

Evangelos Ververas, Polydefkis Gkagkos, Jiankang Deng et al.

ECCV 2024arXiv:2212.02997

citations

#920

Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather

Junsung Park, Kyungmin Kim, Hyunjung Shim

ECCV 2024arXiv:2407.02286

citations

#921

MultiDelete for Multimodal Machine Unlearning

Jiali Cheng, Hadi Amiri

ECCV 2024arXiv:2311.12047

citations

#922

DoughNet: A Visual Predictive Model for Topological Manipulation of Deformable Objects

Dominik Bauer, Zhenjia Xu, Shuran Song

ECCV 2024arXiv:2404.12524

citations

#923

Missing Modality Prediction for Unpaired Multimodal Learning via Joint Embedding of Unimodal Models

Taesup Kim, Donggeun Kim

ECCV 2024arXiv:2407.12616

citations

#924

Can Textual Semantics Mitigate Sounding Object Segmentation Preference?

Yaoting Wang, Peiwen Sun, Yuanchao Li et al.

ECCV 2024arXiv:2407.10947

citations

#925

Audio-driven Talking Face Generation with Stabilized Synchronization Loss

Dogucan Yaman, Fevziye Irem Eyiokur Yaman, Leonard Bärmann et al.

ECCV 2024arXiv:2307.09368

citations

#926

Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing

Vadim Titov, Madina Khalmatova, Alexandra Ivanova et al.

ECCV 2024arXiv:2409.01322

citations

#927

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

Zijian Zhou, Zheng Zhu, Holger Caesar et al.

ECCV 2024arXiv:2407.11213

citations

#928

DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation

Xiaobin Hu, Xu Peng, Donghao Luo et al.

ECCV 2024arXiv:2403.06168

citations

#929

Every Pixel Has its Moments: Ultra-High-Resolution Unpaired Image-to-Image Translation via Dense Normalization

Ming-Yang Ho, Che-Ming Wu, Min-Sheng Wu et al.

ECCV 2024arXiv:2407.04245

citations

#930

SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather

Edoardo Palladin, Roland Dietze, Praveen Narayanan et al.

ECCV 2024arXiv:2508.16408

citations

#931

DualBEV: Unifying Dual View Transformation with Probabilistic Correspondences

Peidong Li, Wancheng Shen, Qihao Huang et al.

ECCV 2024arXiv:2403.05402

citations

#932

Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Akshay Paruchuri, Samuel Ehrenstein, Shuxian Wang et al.

ECCV 2024arXiv:2403.17915

citations

#933

ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention

Chenhang He, Ruihuang Li, Guowen Zhang et al.

ECCV 2024arXiv:2401.00912

citations

#934

CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner

Tingbing Yan, Wenzheng Zeng, Yang Xiao et al.

ECCV 2024arXiv:2403.10082

citations

#935

Self-Guided Generation of Minority Samples Using Diffusion Models

Soobin Um, Jong Chul Ye

ECCV 2024arXiv:2407.11555

citations

#936

BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues

Sara Sarto, Marcella Cornia, Lorenzo Baraldi et al.

ECCV 2024arXiv:2407.20341

citations

#937

Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation

Hyunwoo Yu, Yubin Cho, Beoungwoo Kang et al.

ECCV 2024arXiv:2407.17261

citations

#938

Training-Free Model Merging for Multi-target Domain Adaptation

Wenyi Li, Huan-ang Gao, Mingju Gao et al.

ECCV 2024arXiv:2407.13771

citations

#939

Adapting Fine-Grained Cross-View Localization to Areas without Fine Ground Truth

Zimin Xia, Yujiao Shi, HONGDONG LI et al.

ECCV 2024arXiv:2406.00474

citations

#940

SNeRV: Spectra-preserving Neural Representation for Video

Jina Kim, Jihoo Lee, Jewon Kang

ECCV 2024arXiv:2501.01681

citations

#941

Monocular Occupancy Prediction for Scalable Indoor Scenes

Hongxiao Yu, Yuqi Wang, Yuntao Chen et al.

ECCV 2024arXiv:2407.11730

citations

#942

DiffFAS: Face Anti-Spoofing via Generative Diffusion Models

Xinxu Ge, Xin Liu, Zitong Yu et al.

ECCV 2024arXiv:2409.08572

citations

#943

Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution

Fengyuan Liu, Haochen Luo, Yiming Li et al.

ECCV 2024arXiv:2404.02697

citations

#944

FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior

Zhekai Chen, Wen Wang, Zhen Yang et al.

ECCV 2024arXiv:2407.04947

citations

#945

RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos

Tanveer Hannan, Mohaiminul Islam, Thomas Seidl et al.

ECCV 2024arXiv:2312.06729

citations

#946

Global Counterfactual Directions

Bartlomiej Sobieski, Przemyslaw Biecek

ECCV 2024arXiv:2404.12488

citations

#947

Real Appearance Modeling for More General Deepfake Detection

Jiahe Tian, Yu Cai, Xi Wang et al.

ECCV 2024

citations

#948

PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance

Aoming Liu, Zhong Li, Zhang Chen et al.

ECCV 2024arXiv:2408.02157

citations

#949

Kernel Diffusion: An Alternate Approach to Blind Deconvolution

Yash Sanghvi, Yiheng Chi, Stanley Chan

ECCV 2024arXiv:2312.02319

citations

#950

C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition

Rongchang Li, Zhenhua Feng, Tianyang Xu et al.

ECCV 2024arXiv:2407.06113

citations

#951

Dense Hand-Object(HO) GraspNet with Full Grasping Taxonomy and Dynamics

Woojin Cho, Jihyun Lee, Minjae Yi et al.

ECCV 2024arXiv:2409.04033

citations

#952

UpFusion: Novel View Diffusion from Unposed Sparse View Observations

Bharath Raj Nagoor Kani, Hsin-Ying Lee, Sergey Tulyakov et al.

ECCV 2024arXiv:2312.06661

citations

#953

Deciphering the Role of Representation Disentanglement: Investigating Compositional Generalization in CLIP Models

Reza Abbasi, Mohammad Rohban, Mahdieh Soleymani Baghshah

ECCV 2024arXiv:2407.05897

citations

#954

SINDER: Repairing the Singular Defects of DINOv2

Haoqi Wang, Tong Zhang, Mathieu Salzmann

ECCV 2024arXiv:2407.16826

citations

#955

CarFormer: Self-Driving with Learned Object-Centric Representations

Shadi Hamdan, Fatma Guney

ECCV 2024arXiv:2407.15843

citations

#956

DeTra: A Unified Model for Object Detection and Trajectory Forecasting

Sergio Casas, Ben T Agro, Jiageng Mao et al.

ECCV 2024arXiv:2406.04426

citations

#957

OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection

Jinghua Hou, Tong Wang, Xiaoqing Ye et al.

ECCV 2024arXiv:2407.10753

citations

#958

Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models

Jiaqi Xu, Mengyang Wu, Xiaowei Hu et al.

ECCV 2024arXiv:2409.02101

citations

#959

Mitigating Background Shift in Class-Incremental Semantic Segmentation

gilhan Park, WonJun Moon, SuBeen Lee et al.

ECCV 2024arXiv:2407.11859

citations

#960

Learning Video Context as Interleaved Multimodal Sequences

Qinghong Lin, Pengchuan Zhang, Difei Gao et al.

ECCV 2024arXiv:2407.21757

citations

#961

Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph

Zhengcen Li, Xinle Chang, Yueran Li et al.

ECCV 2024arXiv:2407.19497

citations

#962

UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt

Xin Li, Bingchen Li, Yeying Jin et al.

ECCV 2024arXiv:2407.13108

citations

#963

∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions

Minh Quan Le, Alexandros Graikos, Srikar Yellapragada et al.

ECCV 2024arXiv:2407.14709

citations

#964

CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection

Xunfa Lai, Zhiyu Yang, Jie Hu et al.

ECCV 2024arXiv:2408.08050

citations

#965

Closed-Loop Unsupervised Representation Disentanglement with $\beta$-VAE Distillation and Diffusion Probabilistic Feedback

Xin Jin, Bohan Li, Baao Xie et al.

ECCV 2024

citations

#966

DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly

Fenggen Yu, Yiming Qian, Xu Zhang et al.

ECCV 2024arXiv:2404.00875

citations

#967

Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing

Zizheng Yang, Hu Yu, Bing Li et al.

ECCV 2024arXiv:2509.20091

citations

#968

RICA^2: Rubric-Informed, Calibrated Assessment of Actions

Abrar Majeedi, Viswanatha Reddy Gajjala, Satya Sai Srinath Namburi GNVV et al.

ECCV 2024

citations

#969

Modeling and Driving Human Body Soundfields through Acoustic Primitives

Chao Huang, Dejan Markovic, Chenliang Xu et al.

ECCV 2024arXiv:2407.13083

citations

#970

COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation

Liu He, Daniel Aliaga

ECCV 2024arXiv:2407.11294

citations

#971

Multi-modal Crowd Counting via a Broker Modality

Haoliang Meng, Xiaopeng Hong, Chenhao Wang et al.

ECCV 2024arXiv:2407.07518

citations

#972

Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models

Saman Motamed, Danda Pani Paudel, Luc Van Gool

ECCV 2024arXiv:2311.13833

citations

#973

Bridging the Gap Between Human Motion and Action Semantics via Kinematics Phrases

Xinpeng Liu, Yong-Lu Li, AILING ZENG et al.

ECCV 2024arXiv:2310.04189

citations

#974

Eliminating Warping Shakes for Unsupervised Online Video Stitching

Lang Nie, Chunyu Lin, Kang Liao et al.

ECCV 2024arXiv:2403.06378

citations

#975

MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation

Linyan Yang, Lukas Hoyer, Mark Weber et al.

ECCV 2024arXiv:2408.16478

citations

#976

Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection

Alireza Ganjdanesh, Yan Kang, Yuchen Liu et al.

ECCV 2024arXiv:2409.15557

citations

#977

Realistic Human Motion Generation with Cross-Diffusion Models

Zeping Ren, Shaoli Huang, Xiu Li

ECCV 2024arXiv:2312.10993

citations

#978

3x2: 3D Object Part Segmentation by 2D Semantic Correspondences

Anh Thai, Weiyao Wang, Hao Tang et al.

ECCV 2024arXiv:2407.09648

citations

#979

Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization

Jiajun Hu, Jian Zhang, Lei Qi et al.

ECCV 2024arXiv:2407.15085

citations

#980

CMTA: Cross-Modal Temporal Alignment for Event-guided Video Deblurring

Taewoo Kim, Hoonhee Cho, Kuk-Jin Yoon

ECCV 2024arXiv:2408.14930

citations

#981

PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation

Shilin Yan, Xiaohao Xu, Renrui Zhang et al.

ECCV 2024arXiv:2309.12303

citations

#982

Benchmarking Spurious Bias in Few-Shot Image Classifiers

Guangtao Zheng, Wenqian Ye, Aidong Zhang

ECCV 2024arXiv:2409.02882

citations

#983

Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models

Francesco Croce, Naman D. Singh, Matthias Hein

ECCV 2024arXiv:2306.12941

citations

#984

The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization

Jiafeng Mao, Xueting Wang, Kiyoharu Aizawa

ECCV 2024arXiv:2312.08872

citations

#985

Multi-Sentence Grounding for Long-term Instructional Video

Zeqian Li, QIRUI CHEN, Tengda Han et al.

ECCV 2024arXiv:2312.14055

citations

#986

Temporally Consistent Stereo Matching

Jiaxi Zeng, Chengtang Yao, Yuwei Wu et al.

ECCV 2024arXiv:2407.11950

citations

#987

Contrastive ground-level image and remote sensing pre-training improves representation learning for natural world imagery

Andy V Huynh, Lauren Gillespie, Jael Lopez-Saucedo et al.

ECCV 2024arXiv:2409.19439

citations

#988

ReLoo: Reconstructing Humans Dressed in Loose Garments from Monocular Video in the Wild

Chen Guo, Tianjian Jiang, Manuel Kaufmann et al.

ECCV 2024arXiv:2409.15269

citations

#989

FutureDepth: Learning to Predict the Future Improves Video Depth Estimation

Rajeev Yasarla, Manish Kumar Singh, Hong Cai et al.

ECCV 2024arXiv:2403.12953

citations

#990

LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection

Sanmin Kim, Youngseok Kim, Sihwan Hwang et al.

ECCV 2024arXiv:2407.10164

citations

#991

DreamDiffusion: High-Quality EEG-to-Image Generation with Temporal Masked Signal Modeling and CLIP Alignment

Yunpeng Bai, Xintao Wang, Yanpei Cao et al.

ECCV 2024

citations

#992

Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation

Seonghoon Yu, Paul Hongsuck Seo, Jeany Son

ECCV 2024arXiv:2407.07412

citations

#993

Explorative Inbetweening of Time and Space

Haiwen Feng, Zheng Ding, Zhihao Xia et al.

ECCV 2024arXiv:2403.14611

citations

#994

Physical-Based Event Camera Simulator

Haiqian Han, Jiacheng Lyu, Jianing Li et al.

ECCV 2024

citations

#995

Weakly-supervised Camera Localization by Ground-to-satellite Image Registration

Yujiao Shi, HONGDONG LI, Akhil Perincherry et al.

ECCV 2024arXiv:2409.06471

citations

#996

Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training

Hyesong Choi, Hyejin Park, Kwang Moo Yi et al.

ECCV 2024arXiv:2404.08327

citations

#997

Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models

Xiaoyu Zhu, Hao Zhou, Pengfei Xing et al.

ECCV 2024arXiv:2407.13642

citations

#998

FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models

Wei WU, Qingnan Fan, Shuai Qin et al.

ECCV 2024arXiv:2404.11895

citations

#999

MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion

Lehong Wu, Lilang Lin, Jiahang Zhang et al.

ECCV 2024arXiv:2409.10473

citations

#1000

TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning

Huabin Liu, Xiao Ma, Cheng Zhong et al.

ECCV 2024

citations

← Previous

1...3 4 5 6 7...12