Most Cited ECCV "energy efficient networks" Papers

2,387 papers found • Page 5 of 12

#801

Event Camera Data Dense Pre-training

Yan Yang, Liyuan Pan, Liu liu

ECCV 2024arXiv:2311.11533
15
citations
#802

Just a Hint: Point-Supervised Camouflaged Object Detection

Huafeng Chen, Dian SHAO, Guangqian Guo et al.

ECCV 2024arXiv:2408.10777
15
citations
#803

PLOT: Text-based Person Search with Part Slot Attention for Corresponding Part Discovery

Jicheol Park, Dongwon Kim, Boseung Jeong et al.

ECCV 2024arXiv:2409.13475
15
citations
#804

EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking Head

Qianyun He, Xinya Ji, Yicheng Gong et al.

ECCV 2024arXiv:2408.00297
15
citations
#805

Visual Text Generation in the Wild

Yuanzhi Zhu, Jiawei Liu, Feiyu Gao et al.

ECCV 2024arXiv:2407.14138
15
citations
#806

Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance

Reyhane Askari Hemmat, Melissa Hall, Alicia Yi Sun et al.

ECCV 2024arXiv:2406.04551
15
citations
#807

HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts

Wonjae Kim, Sanghyuk Chun, Taekyung Kim et al.

ECCV 2024arXiv:2404.17507
15
citations
#808

Open Panoramic Segmentation

Junwei Zheng, Ruiping Liu, Yufan Chen et al.

ECCV 2024arXiv:2407.02685
15
citations
#809

FRDiff : Feature Reuse for Universal Training-free Acceleration of Diffusion Models

Junhyuk So, Jungwon Lee, Eunhyeok Park

ECCV 2024arXiv:2312.03517
15
citations
#810

Animal Avatars: Reconstructing Animatable 3D Animals from Casual Videos

Remy Sabathier, David Novotny, Niloy Mitra

ECCV 2024arXiv:2403.17103
15
citations
#811

Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Models

Hao Cheng, Erjia Xiao, Jindong Gu et al.

ECCV 2024arXiv:2402.19150
15
citations
#812

MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models

Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu, Vishal Patel

ECCV 2024arXiv:2404.09977
15
citations
#813

Learning Camouflaged Object Detection from Noisy Pseudo Label

Jin Zhang, Ruiheng Zhang, Yanjiao Shi et al.

ECCV 2024arXiv:2407.13157
15
citations
#814

SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

Lingchen Meng, Shiyi Lan, Hengduo Li et al.

ECCV 2024arXiv:2311.14671
15
citations
#815

UniProcessor: A Text-induced Unified Low-level Image Processor

Huiyu Duan, Xiongkuo Min, Sijing Wu et al.

ECCV 2024arXiv:2407.20928
15
citations
#816

Reinforcement Learning Friendly Vision-Language Model for Minecraft

Haobin Jiang, Junpeng Yue, Hao Luo et al.

ECCV 2024arXiv:2303.10571
15
citations
#817

Temporal Event Stereo via Joint Learning with Stereoscopic Flow

Hoonhee Cho, Jae-young Kang, Kuk-Jin Yoon

ECCV 2024arXiv:2407.10831
15
citations
#818

Enhancing Source-Free Domain Adaptive Object Detection with Low-confidence Pseudo Label Distillation

Ilhoon Yoon, Hyeongjun Kwon, Jin Kim et al.

ECCV 2024arXiv:2407.13524
15
citations
#819

The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?

Qinyu Zhao, Ming Xu, Kartik Gupta et al.

ECCV 2024arXiv:2403.09037
15
citations
#820

Dynamic Retraining-Updating Mean Teacher for Source-Free Object Detection

BA KHANH TRINH LE, Huy-Hung Nguyen, Long Hoang Pham et al.

ECCV 2024arXiv:2407.16497
15
citations
#821

Instant 3D Human Avatar Generation using Image Diffusion Models

Nikos Kolotouros, Thiemo Alldieck, Enric Corona et al.

ECCV 2024arXiv:2406.07516
15
citations
#822

NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation

Ruikai Cui, Weizhe Liu, Weixuan Sun et al.

ECCV 2024arXiv:2403.18241
14
citations
#823

General and Task-Oriented Video Segmentation

Mu Chen, Liulei Li, Wenguan Wang et al.

ECCV 2024arXiv:2407.06540
14
citations
#824

Generalizable Facial Expression Recognition

Yuhang Zhang, Xiuqi Zheng, Chenyi Liang et al.

ECCV 2024arXiv:2408.10614
14
citations
#825

EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification

Suorong Yang, Furao Shen, Jian Zhao

ECCV 2024arXiv:2409.06290
14
citations
#826

CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection

Wuyang Li, Xinyu Liu, Jiayi Ma et al.

ECCV 2024
14
citations
#827

Controlling the World by Sleight of Hand

Sruthi Sudhakar, Ruoshi Liu, Basile Van Hoorick et al.

ECCV 2024arXiv:2408.07147
14
citations
#828

EvSign: Sign Language Recognition and Translation with Streaming Events

Pengyu Zhang, Hao Yin, Zeren Wang et al.

ECCV 2024arXiv:2407.12593
14
citations
#829

InstaStyle: Inversion Noise of a Stylized Image is Secretly a Style Adviser

Xing Cui, Zekun Li, Peipei Li et al.

ECCV 2024arXiv:2311.15040
14
citations
#830

ScanTalk: 3D Talking Heads from Unregistered Scans

Federico Nocentini, Thomas Besnier, Claudio Ferrari et al.

ECCV 2024arXiv:2403.10942
14
citations
#831

MoEAD: A Parameter-efficient Model for Multi-class Anomaly Detection

Shiyuan Meng, Wenchao Meng, Qihang Zhou et al.

ECCV 2024
14
citations
#832

Track Everything Everywhere Fast and Robustly

Yunzhou Song, Jiahui Lei, Ziyun Wang et al.

ECCV 2024arXiv:2403.17931
14
citations
#833

TetraDiffusion: Tetrahedral Diffusion Models for 3D Shape Generation

Nikolai Kalischek, Torben Peters, Jan Dirk Wegner et al.

ECCV 2024arXiv:2211.13220
14
citations
#834

GRIDS: Grouped Multiple-Degradation Restoration with Image Degradation Similarity

Shuo Cao, Yihao Liu, Wenlong Zhang et al.

ECCV 2024arXiv:2407.12273
14
citations
#835

Multi-Label Cluster Discrimination for Visual Representation Learning

Xiang An, Kaicheng Yang, Xiangzi Dai et al.

ECCV 2024arXiv:2407.17331
14
citations
#836

Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation

Jinfeng Liu, Lingtong Kong, Bo Li et al.

ECCV 2024arXiv:2407.14126
14
citations
#837

DGD: Dynamic 3D Gaussians Distillation

Isaac Labe, Noam Issachar, Itai Lang et al.

ECCV 2024arXiv:2405.19321
14
citations
#838

Kalman-Inspired Feature Propagation for Video Face Super-Resolution

Ruicheng Feng, Chongyi Li, Chen Change Loy

ECCV 2024arXiv:2408.05205
14
citations
#839

UniCode : Learning a Unified Codebook for Multimodal Large Language Models

Sipeng Zheng, Bohan Zhou, Yicheng Feng et al.

ECCV 2024arXiv:2403.09072
14
citations
#840

Foster Adaptivity and Balance in Learning with Noisy Labels

Mengmeng Sheng, Zeren Sun, Tao Chen et al.

ECCV 2024arXiv:2407.02778
14
citations
#841

Long-term Temporal Context Gathering for Neural Video Compression

Linfeng Qi, Zhaoyang Jia, Jiahao Li et al.

ECCV 2024
14
citations
#842

Non-Exemplar Domain Incremental Learning via Cross-Domain Concept Integration

Qiang Wang, Yuhang He, Songlin Dong et al.

ECCV 2024
14
citations
#843

Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression

Yuan Tian, Guo Lu, Guangtao Zhai

ECCV 2024arXiv:2409.11718
14
citations
#844

SIGMA: Sinkhorn-Guided Masked Video Modeling

Mohammadreza Salehi, Michael Dorkenwald, Fida Mohammad Thoker et al.

ECCV 2024arXiv:2407.15447
14
citations
#845

X-Pose: Detecting Any Keypoints

Jie Yang, AILING ZENG, Ruimao Zhang et al.

ECCV 2024arXiv:2310.08530
14
citations
#846

Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling

Noam Elata, Tomer Michaeli, Michael Elad

ECCV 2024arXiv:2407.08256
14
citations
#847

FineMatch: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction

Hang Hua, Jing Shi, Kushal Kafle et al.

ECCV 2024arXiv:2404.14715
14
citations
#848

AttnZero: Efficient Attention Discovery for Vision Transformers

Lujun Li, Zimian Wei, Peijie Dong et al.

ECCV 2024
14
citations
#849

Referring Atomic Video Action Recognition

Kunyu Peng, Jia Fu, Kailun Yang et al.

ECCV 2024arXiv:2407.01872
14
citations
#850

Where am I? Scene Retrieval with Language

Jiaqi Chen, Daniel Barath, Iro Armeni et al.

ECCV 2024arXiv:2404.14565
14
citations
#851

MoVideo: Motion-Aware Video Generation with Diffusion Models

Jingyun Liang, Yuchen Fan, Kai Zhang et al.

ECCV 2024arXiv:2311.11325
14
citations
#852

Finding Visual Task Vectors

Alberto Hojel, Yutong Bai, Trevor Darrell et al.

ECCV 2024arXiv:2404.05729
14
citations
#853

Vision-Language Action Knowledge Learning for Semantic-Aware Action Quality Assessment

Huangbiao Xu, Xiao Ke, Yuezhou Li et al.

ECCV 2024
14
citations
#854

Parrot Captions Teach CLIP to Spot Text

Yiqi Lin, Conghui He, Alex Jinpeng Wang et al.

ECCV 2024arXiv:2312.14232
14
citations
#855

Free-Editor: Zero-shot Text-driven 3D Scene Editing

Md Nazmul Karim, Hasan Iqbal, Umar Khalid et al.

ECCV 2024arXiv:2312.13663
14
citations
#856

Robust Multimodal Learning via Representation Decoupling

Shicai Wei, Yang Luo, Yuji Wang et al.

ECCV 2024arXiv:2407.04458
14
citations
#857

Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors

Jae Joong Lee, Bosheng Li, Sara Beery et al.

ECCV 2024arXiv:2407.10330
14
citations
#858

Reinforcement Learning Meets Visual Odometry

Nico Messikommer, Giovanni Cioffi, Mathias Gehrig et al.

ECCV 2024arXiv:2407.15626
14
citations
#859

Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models

Xiao Liu, Xiaoliu Guan, Yu Wu et al.

ECCV 2024arXiv:2407.15328
14
citations
#860

Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments

Djamahl Etchegaray, Zi Helen Huang, Tatsuya Harada et al.

ECCV 2024arXiv:2403.13556
14
citations
#861

3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation

Zihao Xiao, Longlong Jing, Shangxuan Wu et al.

ECCV 2024arXiv:2401.02402
14
citations
#862

ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments

Taewoong Kim, Cheolhong Min, Byeonghwi Kim et al.

ECCV 2024arXiv:2407.18550
14
citations
#863

IMMA: Immunizing text-to-image Models against Malicious Adaptation

Amber Yijia Zheng, Raymond Yeh

ECCV 2024arXiv:2311.18815
14
citations
#864

LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation

Archana Swaminathan, Anubhav Anubhav, Kamal Gupta et al.

ECCV 2024arXiv:2409.06703
14
citations
#865

ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction

Shaozhe Hao, Kai Han, Zhengyao Lv et al.

ECCV 2024arXiv:2407.07077
14
citations
#866

LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer

Ning Yu, Chia-Chih Chen, Zeyuan Chen et al.

ECCV 2024arXiv:2212.09877
14
citations
#867

ControlCap: Controllable Region-level Captioning

Yuzhong Zhao, Liu Yue, Zonghao Guo et al.

ECCV 2024arXiv:2401.17910
14
citations
#868

HUMOS: Human Motion Model Conditioned on Body Shape

Shashank Tripathi, Omid Taheri, Christoph Lassner et al.

ECCV 2024arXiv:2409.03944
14
citations
#869

Rate-Distortion-Cognition Controllable Versatile Neural Image Compression

Jinming Liu, Ruoyu Feng, Yunpeng Qi et al.

ECCV 2024arXiv:2407.11700
14
citations
#870

Norface: Improving Facial Expression Analysis by Identity Normalization

Hanwei Liu, Rudong An, Zhimeng Zhang et al.

ECCV 2024arXiv:2407.15617
14
citations
#871

RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes

Thang-Anh-Quan Nguyen, Luis G Roldao Jimenez, Nathan Piasco et al.

ECCV 2024arXiv:2403.09419
14
citations
#872

PointNeRF++: A multi-scale, point-based Neural Radiance Field

Weiwei Sun, Eduard Trulls, Yang-Che Tseng et al.

ECCV 2024arXiv:2312.02362
14
citations
#873

Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL

Fangwei Zhong, Kui Wu, Hai Ci et al.

ECCV 2024arXiv:2404.09857
14
citations
#874

Neural Volumetric World Models for Autonomous Driving

Zanming Huang, Jimuyang Zhang, Eshed Ohn-Bar

ECCV 2024
14
citations
#875

Event-Adapted Video Super-Resolution

Zeyu Xiao, Dachun Kai, Yueyi Zhang et al.

ECCV 2024
14
citations
#876

Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection

QIJIE MO, Yipeng Gao, Shenghao Fu et al.

ECCV 2024arXiv:2407.11499
14
citations
#877

SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders

Sheng-Wei Li, Zi-Xiang Wei, Wei-Jie Jack Chen et al.

ECCV 2024arXiv:2407.13460
14
citations
#878

Grounding Language Models for Visual Entity Recognition

Zilin Xiao, Ming Gong, Paola Cascante-Bonilla et al.

ECCV 2024arXiv:2402.18695
13
citations
#879

PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects

Junyi Li, Junfeng Wu, Weizhi Zhao et al.

ECCV 2024arXiv:2407.16696
13
citations
#880

Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures

Jiaxing Huang, Yanfeng Zhou, Yaoru Luo et al.

ECCV 2024arXiv:2407.14754
13
citations
#881

ChEX: Interactive Localization and Region Description in Chest X-rays

Philip Müller, Georgios Kaissis, Daniel Rueckert

ECCV 2024arXiv:2404.15770
13
citations
#882

DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion

Liao Shen, Tianqi Liu, Huiqiang Sun et al.

ECCV 2024arXiv:2409.09605
13
citations
#883

ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders

Jefferson Hernandez, Ruben Villegas, Vicente Ordonez

ECCV 2024arXiv:2303.12001
13
citations
#884

Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models

Chen Ju, Haicheng Wang, Haozhe Cheng et al.

ECCV 2024arXiv:2407.11717
13
citations
#885

Learning Representations of Satellite Images From Metadata Supervision

Jules Bourcier, Gohar Dashyan, Karteek Alahari et al.

ECCV 2024
13
citations
#886

Rethinking Features-Fused-Pyramid-Neck for Object Detection

Hulin Li

ECCV 2024arXiv:2505.12820
13
citations
#887

How to Train the Teacher Model for Effective Knowledge Distillation

Shayan Mohajer Hamidi, Xizhen Deng, Renhao Tan et al.

ECCV 2024arXiv:2407.18041
13
citations
#888

MagicEraser: Erasing Any Objects via Semantics-Aware Control

FAN LI, Zixiao Zhang, Yi Huang et al.

ECCV 2024arXiv:2410.10207
13
citations
#889

Improving Feature Stability during Upsampling -- Spectral Artifacts and the Importance of Spatial Context

Shashank Agnihotri, Julia Grabinski, Margret Keuper

ECCV 2024arXiv:2311.17524
13
citations
#890

Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation

Zeyang Zhao, Qilong Xue, Yifan Bai et al.

ECCV 2024arXiv:2407.08489
13
citations
#891

On the Utility of 3D Hand Poses for Action Recognition

Md Salman Shamil, Dibyadip Chatterjee, Fadime Sener et al.

ECCV 2024arXiv:2403.09805
13
citations
#892

Semi-supervised Segmentation of Histopathology Images with Noise-Aware Topological Consistency

Meilong Xu, Xiaoling Hu, Saumya Gupta et al.

ECCV 2024arXiv:2311.16447
13
citations
#893

Teddy: Efficient Large-Scale Dataset Distillation via Taylor-Approximated Matching

Ruonan Yu, Songhua Liu, Jingwen Ye et al.

ECCV 2024arXiv:2410.07579
13
citations
#894

3iGS: Factorised Tensorial Illumination for 3D Gaussian Splatting

Zhe Jun Tang, Tat-Jen Cham

ECCV 2024arXiv:2408.03753
13
citations
#895

Editable Image Elements for Controllable Synthesis

Jiteng Mu, Michael Gharbi, Richard Zhang et al.

ECCV 2024arXiv:2404.16029
13
citations
#896

Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation

Zhengyuan Xie, Haiquan Lu, Jia-wen Xiao et al.

ECCV 2024arXiv:2407.14142
13
citations
#897

Few-shot Defect Image Generation based on Consistency Modeling

Qingfeng Shi, Jing Wei, Fei Shen et al.

ECCV 2024arXiv:2408.00372
13
citations
#898

OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation

Kwanyoung Kim, Yujin Oh, Jong Chul Ye

ECCV 2024arXiv:2403.14183
13
citations
#899

Boosting 3D Single Object Tracking with 2D Matching Distillation and 3D Pre-training

qiangqiang wu, Yan Xia, Jia Wan et al.

ECCV 2024
13
citations
#900

BKDSNN: Enhancing the Performance of Learning-based Spiking Neural Networks Training with Blurred Knowledge Distillation

Zekai Xu, Kang You, Qinghai Guo et al.

ECCV 2024arXiv:2407.09083
13
citations
#901

InstructGIE: Towards Generalizable Image Editing

Zichong Meng, Changdi Yang, Jun Liu et al.

ECCV 2024arXiv:2403.05018
13
citations
#902

Brain-ID: Learning Contrast-agnostic Anatomical Representations for Brain Imaging

Peirong Liu, Oula Puonti, Xiaoling Hu et al.

ECCV 2024arXiv:2311.16914
13
citations
#903

Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation

Peng Jin, Hao Li, Zesen Cheng et al.

ECCV 2024arXiv:2407.10528
13
citations
#904

Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis

Qi Sun, Hang Zhou, Wengang Zhou et al.

ECCV 2024arXiv:2407.05388
13
citations
#905

Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image

Pengkun Jiao, Na Zhao, Jingjing Chen et al.

ECCV 2024arXiv:2407.05256
13
citations
#906

Fairness-aware Vision Transformer via Debiased Self-Attention

Yao Qiang, Chengyin Li, Prashant Khanduri et al.

ECCV 2024arXiv:2301.13803
13
citations
#907

Open-Vocabulary Camouflaged Object Segmentation

Youwei Pang, Xiaoqi Zhao, JiaMing Zuo et al.

ECCV 2024arXiv:2311.11241
13
citations
#908

Geospecific View Generation - Geometry-Context Aware High-resolution Ground View Inference from Satellite Views

Ningli Xu, Rongjun Qin

ECCV 2024arXiv:2407.08061
13
citations
#909

STAMP: Outlier-Aware Test-Time Adaptation with Stable Memory Replay

Yu Yongcan, Lijun Sheng, Ran He et al.

ECCV 2024arXiv:2407.15773
13
citations
#910

WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural Language

Zhenxiang Lin, Xidong Peng, peishan cong et al.

ECCV 2024arXiv:2304.05645
13
citations
#911

Towards Image Ambient Lighting Normalization

Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei Wu et al.

ECCV 2024arXiv:2403.18730
13
citations
#912

Harnessing Text-to-Image Diffusion Models for Category-Agnostic Pose Estimation

Duo Peng, Zhengbo Zhang, Ping Hu et al.

ECCV 2024
13
citations
#913

ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer

Jiazhi Guan, Zhiliang Xu, Hang Zhou et al.

ECCV 2024arXiv:2408.03284
13
citations
#914

GENIXER: Empowering Multimodal Large Language Models as a Powerful Data Generator

Hengyuan Zhao, Pan Zhou, Mike Zheng Shou

ECCV 2024arXiv:2312.06731
13
citations
#915

BAFFLE: A Baseline of Backpropagation-Free Federated Learning

Haozhe Feng, Tianyu Pang, Chao Du et al.

ECCV 2024arXiv:2301.12195
13
citations
#916

UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding

Bowen Shi, Peisen Zhao, Zichen Wang et al.

ECCV 2024arXiv:2401.06397
13
citations
#917

TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks

Jinjie Mai, Wenxuan Zhu, Sara Rojas Martinez et al.

ECCV 2024arXiv:2408.10739
13
citations
#918

Can OOD Object Detectors Learn from Foundation Models?

Jiahui Liu, Xin Wen, Shizhen Zhao et al.

ECCV 2024arXiv:2409.05162
13
citations
#919

3DGazeNet: Generalizing Gaze Estimation with Weak Supervision from Synthetic Views

Evangelos Ververas, Polydefkis Gkagkos, Jiankang Deng et al.

ECCV 2024arXiv:2212.02997
13
citations
#920

Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather

Junsung Park, Kyungmin Kim, Hyunjung Shim

ECCV 2024arXiv:2407.02286
13
citations
#921

MultiDelete for Multimodal Machine Unlearning

Jiali Cheng, Hadi Amiri

ECCV 2024arXiv:2311.12047
13
citations
#922

DoughNet: A Visual Predictive Model for Topological Manipulation of Deformable Objects

Dominik Bauer, Zhenjia Xu, Shuran Song

ECCV 2024arXiv:2404.12524
13
citations
#923

Missing Modality Prediction for Unpaired Multimodal Learning via Joint Embedding of Unimodal Models

Taesup Kim, Donggeun Kim

ECCV 2024arXiv:2407.12616
13
citations
#924

Can Textual Semantics Mitigate Sounding Object Segmentation Preference?

Yaoting Wang, Peiwen Sun, Yuanchao Li et al.

ECCV 2024arXiv:2407.10947
13
citations
#925

Audio-driven Talking Face Generation with Stabilized Synchronization Loss

Dogucan Yaman, Fevziye Irem Eyiokur Yaman, Leonard Bärmann et al.

ECCV 2024arXiv:2307.09368
13
citations
#926

Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing

Vadim Titov, Madina Khalmatova, Alexandra Ivanova et al.

ECCV 2024arXiv:2409.01322
13
citations
#927

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

Zijian Zhou, Zheng Zhu, Holger Caesar et al.

ECCV 2024arXiv:2407.11213
13
citations
#928

DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation

Xiaobin Hu, Xu Peng, Donghao Luo et al.

ECCV 2024arXiv:2403.06168
13
citations
#929

Every Pixel Has its Moments: Ultra-High-Resolution Unpaired Image-to-Image Translation via Dense Normalization

Ming-Yang Ho, Che-Ming Wu, Min-Sheng Wu et al.

ECCV 2024arXiv:2407.04245
13
citations
#930

SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather

Edoardo Palladin, Roland Dietze, Praveen Narayanan et al.

ECCV 2024arXiv:2508.16408
13
citations
#931

DualBEV: Unifying Dual View Transformation with Probabilistic Correspondences

Peidong Li, Wancheng Shen, Qihao Huang et al.

ECCV 2024arXiv:2403.05402
13
citations
#932

Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Akshay Paruchuri, Samuel Ehrenstein, Shuxian Wang et al.

ECCV 2024arXiv:2403.17915
13
citations
#933

ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention

Chenhang He, Ruihuang Li, Guowen Zhang et al.

ECCV 2024arXiv:2401.00912
13
citations
#934

CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner

Tingbing Yan, Wenzheng Zeng, Yang Xiao et al.

ECCV 2024arXiv:2403.10082
13
citations
#935

Self-Guided Generation of Minority Samples Using Diffusion Models

Soobin Um, Jong Chul Ye

ECCV 2024arXiv:2407.11555
13
citations
#936

BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues

Sara Sarto, Marcella Cornia, Lorenzo Baraldi et al.

ECCV 2024arXiv:2407.20341
12
citations
#937

Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation

Hyunwoo Yu, Yubin Cho, Beoungwoo Kang et al.

ECCV 2024arXiv:2407.17261
12
citations
#938

Training-Free Model Merging for Multi-target Domain Adaptation

Wenyi Li, Huan-ang Gao, Mingju Gao et al.

ECCV 2024arXiv:2407.13771
12
citations
#939

Adapting Fine-Grained Cross-View Localization to Areas without Fine Ground Truth

Zimin Xia, Yujiao Shi, HONGDONG LI et al.

ECCV 2024arXiv:2406.00474
12
citations
#940

SNeRV: Spectra-preserving Neural Representation for Video

Jina Kim, Jihoo Lee, Jewon Kang

ECCV 2024arXiv:2501.01681
12
citations
#941

Monocular Occupancy Prediction for Scalable Indoor Scenes

Hongxiao Yu, Yuqi Wang, Yuntao Chen et al.

ECCV 2024arXiv:2407.11730
12
citations
#942

DiffFAS: Face Anti-Spoofing via Generative Diffusion Models

Xinxu Ge, Xin Liu, Zitong Yu et al.

ECCV 2024arXiv:2409.08572
12
citations
#943

Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution

Fengyuan Liu, Haochen Luo, Yiming Li et al.

ECCV 2024arXiv:2404.02697
12
citations
#944

FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior

Zhekai Chen, Wen Wang, Zhen Yang et al.

ECCV 2024arXiv:2407.04947
12
citations
#945

RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos

Tanveer Hannan, Mohaiminul Islam, Thomas Seidl et al.

ECCV 2024arXiv:2312.06729
12
citations
#946

Global Counterfactual Directions

Bartlomiej Sobieski, Przemyslaw Biecek

ECCV 2024arXiv:2404.12488
12
citations
#947

Real Appearance Modeling for More General Deepfake Detection

Jiahe Tian, Yu Cai, Xi Wang et al.

ECCV 2024
12
citations
#948

PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance

Aoming Liu, Zhong Li, Zhang Chen et al.

ECCV 2024arXiv:2408.02157
12
citations
#949

Kernel Diffusion: An Alternate Approach to Blind Deconvolution

Yash Sanghvi, Yiheng Chi, Stanley Chan

ECCV 2024arXiv:2312.02319
12
citations
#950

C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition

Rongchang Li, Zhenhua Feng, Tianyang Xu et al.

ECCV 2024arXiv:2407.06113
12
citations
#951

Dense Hand-Object(HO) GraspNet with Full Grasping Taxonomy and Dynamics

Woojin Cho, Jihyun Lee, Minjae Yi et al.

ECCV 2024arXiv:2409.04033
12
citations
#952

UpFusion: Novel View Diffusion from Unposed Sparse View Observations

Bharath Raj Nagoor Kani, Hsin-Ying Lee, Sergey Tulyakov et al.

ECCV 2024arXiv:2312.06661
12
citations
#953

Deciphering the Role of Representation Disentanglement: Investigating Compositional Generalization in CLIP Models

Reza Abbasi, Mohammad Rohban, Mahdieh Soleymani Baghshah

ECCV 2024arXiv:2407.05897
12
citations
#954

SINDER: Repairing the Singular Defects of DINOv2

Haoqi Wang, Tong Zhang, Mathieu Salzmann

ECCV 2024arXiv:2407.16826
12
citations
#955

CarFormer: Self-Driving with Learned Object-Centric Representations

Shadi Hamdan, Fatma Guney

ECCV 2024arXiv:2407.15843
12
citations
#956

DeTra: A Unified Model for Object Detection and Trajectory Forecasting

Sergio Casas, Ben T Agro, Jiageng Mao et al.

ECCV 2024arXiv:2406.04426
12
citations
#957

OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection

Jinghua Hou, Tong Wang, Xiaoqing Ye et al.

ECCV 2024arXiv:2407.10753
12
citations
#958

Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models

Jiaqi Xu, Mengyang Wu, Xiaowei Hu et al.

ECCV 2024arXiv:2409.02101
12
citations
#959

Mitigating Background Shift in Class-Incremental Semantic Segmentation

gilhan Park, WonJun Moon, SuBeen Lee et al.

ECCV 2024arXiv:2407.11859
12
citations
#960

Learning Video Context as Interleaved Multimodal Sequences

Qinghong Lin, Pengchuan Zhang, Difei Gao et al.

ECCV 2024arXiv:2407.21757
12
citations
#961

Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph

Zhengcen Li, Xinle Chang, Yueran Li et al.

ECCV 2024arXiv:2407.19497
12
citations
#962

UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt

Xin Li, Bingchen Li, Yeying Jin et al.

ECCV 2024arXiv:2407.13108
12
citations
#963

∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions

Minh Quan Le, Alexandros Graikos, Srikar Yellapragada et al.

ECCV 2024arXiv:2407.14709
12
citations
#964

CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection

Xunfa Lai, Zhiyu Yang, Jie Hu et al.

ECCV 2024arXiv:2408.08050
12
citations
#965

Closed-Loop Unsupervised Representation Disentanglement with $\beta$-VAE Distillation and Diffusion Probabilistic Feedback

Xin Jin, Bohan Li, Baao Xie et al.

ECCV 2024
12
citations
#966

DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly

Fenggen Yu, Yiming Qian, Xu Zhang et al.

ECCV 2024arXiv:2404.00875
12
citations
#967

Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing

Zizheng Yang, Hu Yu, Bing Li et al.

ECCV 2024arXiv:2509.20091
12
citations
#968

RICA^2: Rubric-Informed, Calibrated Assessment of Actions

Abrar Majeedi, Viswanatha Reddy Gajjala, Satya Sai Srinath Namburi GNVV et al.

ECCV 2024
12
citations
#969

Modeling and Driving Human Body Soundfields through Acoustic Primitives

Chao Huang, Dejan Markovic, Chenliang Xu et al.

ECCV 2024arXiv:2407.13083
12
citations
#970

COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation

Liu He, Daniel Aliaga

ECCV 2024arXiv:2407.11294
12
citations
#971

Multi-modal Crowd Counting via a Broker Modality

Haoliang Meng, Xiaopeng Hong, Chenhao Wang et al.

ECCV 2024arXiv:2407.07518
12
citations
#972

Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models

Saman Motamed, Danda Pani Paudel, Luc Van Gool

ECCV 2024arXiv:2311.13833
12
citations
#973

Bridging the Gap Between Human Motion and Action Semantics via Kinematics Phrases

Xinpeng Liu, Yong-Lu Li, AILING ZENG et al.

ECCV 2024arXiv:2310.04189
12
citations
#974

Eliminating Warping Shakes for Unsupervised Online Video Stitching

Lang Nie, Chunyu Lin, Kang Liao et al.

ECCV 2024arXiv:2403.06378
12
citations
#975

MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation

Linyan Yang, Lukas Hoyer, Mark Weber et al.

ECCV 2024arXiv:2408.16478
12
citations
#976

Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection

Alireza Ganjdanesh, Yan Kang, Yuchen Liu et al.

ECCV 2024arXiv:2409.15557
12
citations
#977

Realistic Human Motion Generation with Cross-Diffusion Models

Zeping Ren, Shaoli Huang, Xiu Li

ECCV 2024arXiv:2312.10993
12
citations
#978

3x2: 3D Object Part Segmentation by 2D Semantic Correspondences

Anh Thai, Weiyao Wang, Hao Tang et al.

ECCV 2024arXiv:2407.09648
12
citations
#979

Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization

Jiajun Hu, Jian Zhang, Lei Qi et al.

ECCV 2024arXiv:2407.15085
12
citations
#980

CMTA: Cross-Modal Temporal Alignment for Event-guided Video Deblurring

Taewoo Kim, Hoonhee Cho, Kuk-Jin Yoon

ECCV 2024arXiv:2408.14930
12
citations
#981

PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation

Shilin Yan, Xiaohao Xu, Renrui Zhang et al.

ECCV 2024arXiv:2309.12303
12
citations
#982

Benchmarking Spurious Bias in Few-Shot Image Classifiers

Guangtao Zheng, Wenqian Ye, Aidong Zhang

ECCV 2024arXiv:2409.02882
12
citations
#983

Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models

Francesco Croce, Naman D. Singh, Matthias Hein

ECCV 2024arXiv:2306.12941
12
citations
#984

The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization

Jiafeng Mao, Xueting Wang, Kiyoharu Aizawa

ECCV 2024arXiv:2312.08872
12
citations
#985

Multi-Sentence Grounding for Long-term Instructional Video

Zeqian Li, QIRUI CHEN, Tengda Han et al.

ECCV 2024arXiv:2312.14055
12
citations
#986

Temporally Consistent Stereo Matching

Jiaxi Zeng, Chengtang Yao, Yuwei Wu et al.

ECCV 2024arXiv:2407.11950
12
citations
#987

Contrastive ground-level image and remote sensing pre-training improves representation learning for natural world imagery

Andy V Huynh, Lauren Gillespie, Jael Lopez-Saucedo et al.

ECCV 2024arXiv:2409.19439
12
citations
#988

ReLoo: Reconstructing Humans Dressed in Loose Garments from Monocular Video in the Wild

Chen Guo, Tianjian Jiang, Manuel Kaufmann et al.

ECCV 2024arXiv:2409.15269
12
citations
#989

FutureDepth: Learning to Predict the Future Improves Video Depth Estimation

Rajeev Yasarla, Manish Kumar Singh, Hong Cai et al.

ECCV 2024arXiv:2403.12953
12
citations
#990

LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection

Sanmin Kim, Youngseok Kim, Sihwan Hwang et al.

ECCV 2024arXiv:2407.10164
12
citations
#991

DreamDiffusion: High-Quality EEG-to-Image Generation with Temporal Masked Signal Modeling and CLIP Alignment

Yunpeng Bai, Xintao Wang, Yanpei Cao et al.

ECCV 2024
12
citations
#992

Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation

Seonghoon Yu, Paul Hongsuck Seo, Jeany Son

ECCV 2024arXiv:2407.07412
12
citations
#993

Explorative Inbetweening of Time and Space

Haiwen Feng, Zheng Ding, Zhihao Xia et al.

ECCV 2024arXiv:2403.14611
12
citations
#994

Physical-Based Event Camera Simulator

Haiqian Han, Jiacheng Lyu, Jianing Li et al.

ECCV 2024
12
citations
#995

Weakly-supervised Camera Localization by Ground-to-satellite Image Registration

Yujiao Shi, HONGDONG LI, Akhil Perincherry et al.

ECCV 2024arXiv:2409.06471
11
citations
#996

Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training

Hyesong Choi, Hyejin Park, Kwang Moo Yi et al.

ECCV 2024arXiv:2404.08327
11
citations
#997

Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models

Xiaoyu Zhu, Hao Zhou, Pengfei Xing et al.

ECCV 2024arXiv:2407.13642
11
citations
#998

FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models

Wei WU, Qingnan Fan, Shuai Qin et al.

ECCV 2024arXiv:2404.11895
11
citations
#999

MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion

Lehong Wu, Lilang Lin, Jiahang Zhang et al.

ECCV 2024arXiv:2409.10473
11
citations
#1000

TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning

Huabin Liu, Xiao Ma, Cheng Zhong et al.

ECCV 2024
11
citations