Most Cited ECCV "source-free adaptation" Papers

2,387 papers found • Page 4 of 12

#601

Long-term Temporal Context Gathering for Neural Video Compression

Linfeng Qi, Zhaoyang Jia, Jiahao Li et al.

ECCV 2024poster
14
citations
#602

Norface: Improving Facial Expression Analysis by Identity Normalization

Hanwei Liu, Rudong An, Zhimeng Zhang et al.

ECCV 2024posterarXiv:2407.15617
14
citations
#603

HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance

Guian Fang, Wenbiao Yan, Yuanfan Guo et al.

ECCV 2024posterarXiv:2407.06937
14
citations
#604

NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation

Ruikai Cui, Weizhe Liu, Weixuan Sun et al.

ECCV 2024posterarXiv:2403.18241
14
citations
#605

DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition

Qi Wang, Zhou Xu, Yuming Lin et al.

ECCV 2024posterarXiv:2407.05106
14
citations
#606

Improving Text-guided Object Inpainting with Semantic Pre-inpainting

Yifu Chen, Jingwen Chen, Yingwei Pan et al.

ECCV 2024posterarXiv:2409.08260
14
citations
#607

Foster Adaptivity and Balance in Learning with Noisy Labels

Mengmeng Sheng, Zeren Sun, Tao Chen et al.

ECCV 2024posterarXiv:2407.02778
14
citations
#608

X-Pose: Detecting Any Keypoints

Jie Yang, AILING ZENG, Ruimao Zhang et al.

ECCV 2024posterarXiv:2310.08530
14
citations
#609

EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking Head

Qianyun He, Xinya Ji, Yicheng Gong et al.

ECCV 2024posterarXiv:2408.00297
14
citations
#610

Rate-Distortion-Cognition Controllable Versatile Neural Image Compression

Jinming Liu, Ruoyu Feng, Yunpeng Qi et al.

ECCV 2024posterarXiv:2407.11700
14
citations
#611

AttnZero: Efficient Attention Discovery for Vision Transformers

Lujun Li, Zimian Wei, Peijie Dong et al.

ECCV 2024poster
14
citations
#612

Event-Adapted Video Super-Resolution

Zeyu Xiao, Dachun Kai, Yueyi Zhang et al.

ECCV 2024poster
14
citations
#613

Referring Atomic Video Action Recognition

Kunyu Peng, Jia Fu, Kailun Yang et al.

ECCV 2024posterarXiv:2407.01872
14
citations
#614

MoVideo: Motion-Aware Video Generation with Diffusion Models

Jingyun Liang, Yuchen Fan, Kai Zhang et al.

ECCV 2024posterarXiv:2311.11325
14
citations
#615

MoEAD: A Parameter-efficient Model for Multi-class Anomaly Detection

Shiyuan Meng, Wenchao Meng, Qihang Zhou et al.

ECCV 2024poster
14
citations
#616

UniCode : Learning a Unified Codebook for Multimodal Large Language Models

Sipeng Zheng, Bohan Zhou, Yicheng Feng et al.

ECCV 2024posterarXiv:2403.09072
14
citations
#617

Finding Visual Task Vectors

Alberto Hojel, Yutong Bai, Trevor Darrell et al.

ECCV 2024posterarXiv:2404.05729
14
citations
#618

Non-Exemplar Domain Incremental Learning via Cross-Domain Concept Integration

Qiang Wang, Yuhang He, Songlin Dong et al.

ECCV 2024poster
14
citations
#619

Vision-Language Action Knowledge Learning for Semantic-Aware Action Quality Assessment

Huangbiao Xu, Xiao Ke, Yuezhou Li et al.

ECCV 2024poster
14
citations
#620

CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection

Wuyang Li, Xinyu Liu, Jiayi Ma et al.

ECCV 2024poster
14
citations
#621

Free-Editor: Zero-shot Text-driven 3D Scene Editing

Md Nazmul Karim, Hasan Iqbal, Umar Khalid et al.

ECCV 2024posterarXiv:2312.13663
14
citations
#622

Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments

Djamahl Etchegaray, Zi Helen Huang, Tatsuya Harada et al.

ECCV 2024posterarXiv:2403.13556
14
citations
#623

TetraDiffusion: Tetrahedral Diffusion Models for 3D Shape Generation

Nikolai Kalischek, Torben Peters, Jan Dirk Wegner et al.

ECCV 2024posterarXiv:2211.13220
14
citations
#624

Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation

Shihao Zhao, Shaozhe Hao, Bojia Zi et al.

ECCV 2024posterarXiv:2403.07860
14
citations
#625

LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation

Archana Swaminathan, Anubhav Anubhav, Kamal Gupta et al.

ECCV 2024posterarXiv:2409.06703
14
citations
#626

FineMatch: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction

Hang Hua, Jing Shi, Kushal Kafle et al.

ECCV 2024posterarXiv:2404.14715
14
citations
#627

LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer

Ning Yu, Chia-Chih Chen, Zeyuan Chen et al.

ECCV 2024posterarXiv:2212.09877
14
citations
#628

Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression

Yuan Tian, Guo Lu, Guangtao Zhai

ECCV 2024posterarXiv:2409.11718
14
citations
#629

HUMOS: Human Motion Model Conditioned on Body Shape

Shashank Tripathi, Omid Taheri, Christoph Lassner et al.

ECCV 2024posterarXiv:2409.03944
14
citations
#630

Temporal Event Stereo via Joint Learning with Stereoscopic Flow

Hoonhee Cho, Jae-young Kang, Kuk-Jin Yoon

ECCV 2024posterarXiv:2407.10831
14
citations
#631

LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction

Penghui Du, Yu Wang, Yifan Sun et al.

ECCV 2024posterarXiv:2407.11335
14
citations
#632

PointNeRF++: A multi-scale, point-based Neural Radiance Field

Weiwei Sun, Eduard Trulls, Yang-Che Tseng et al.

ECCV 2024posterarXiv:2312.02362
14
citations
#633

Reinforcement Learning Meets Visual Odometry

Nico Messikommer, Giovanni Cioffi, Mathias Gehrig et al.

ECCV 2024posterarXiv:2407.15626
14
citations
#634

Neural Volumetric World Models for Autonomous Driving

Zanming Huang, Jimuyang Zhang, Eshed Ohn-Bar

ECCV 2024poster
14
citations
#635

Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection

QIJIE MO, Yipeng Gao, Shenghao Fu et al.

ECCV 2024posterarXiv:2407.11499
14
citations
#636

Grounding Language Models for Visual Entity Recognition

Zilin Xiao, Ming Gong, Paola Cascante-Bonilla et al.

ECCV 2024posterarXiv:2402.18695
13
citations
#637

Every Pixel Has its Moments: Ultra-High-Resolution Unpaired Image-to-Image Translation via Dense Normalization

Ming-Yang Ho, Che-Ming Wu, Min-Sheng Wu et al.

ECCV 2024posterarXiv:2407.04245
13
citations
#638

Editable Image Elements for Controllable Synthesis

Jiteng Mu, Michael Gharbi, Richard Zhang et al.

ECCV 2024posterarXiv:2404.16029
13
citations
#639

DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion

Liao Shen, Tianqi Liu, Huiqiang Sun et al.

ECCV 2024posterarXiv:2409.09605
13
citations
#640

InstaStyle: Inversion Noise of a Stylized Image is Secretly a Style Adviser

Xing Cui, Zekun Li, Peipei Li et al.

ECCV 2024posterarXiv:2311.15040
13
citations
#641

ScanTalk: 3D Talking Heads from Unregistered Scans

Federico Nocentini, Thomas Besnier, Claudio Ferrari et al.

ECCV 2024posterarXiv:2403.10942
13
citations
#642

Learning Representations of Satellite Images From Metadata Supervision

Jules Bourcier, Gohar Dashyan, Karteek Alahari et al.

ECCV 2024poster
13
citations
#643

MagicEraser: Erasing Any Objects via Semantics-Aware Control

FAN LI, Zixiao Zhang, Yi Huang et al.

ECCV 2024posterarXiv:2410.10207
13
citations
#644

On the Utility of 3D Hand Poses for Action Recognition

Md Salman Shamil, Dibyadip Chatterjee, Fadime Sener et al.

ECCV 2024posterarXiv:2403.09805
13
citations
#645

PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects

Junyi Li, Junfeng Wu, Weizhi Zhao et al.

ECCV 2024posterarXiv:2407.16696
13
citations
#646

3iGS: Factorised Tensorial Illumination for 3D Gaussian Splatting

Zhe Jun Tang, Tat-Jen Cham

ECCV 2024posterarXiv:2408.03753
13
citations
#647

Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models

Chen Ju, Haicheng Wang, Haozhe Cheng et al.

ECCV 2024posterarXiv:2407.11717
13
citations
#648

Kalman-Inspired Feature Propagation for Video Face Super-Resolution

Ruicheng Feng, Chongyi Li, Chen Change Loy

ECCV 2024posterarXiv:2408.05205
13
citations
#649

Teddy: Efficient Large-Scale Dataset Distillation via Taylor-Approximated Matching

Ruonan Yu, Songhua Liu, Jingwen Ye et al.

ECCV 2024posterarXiv:2410.07579
13
citations
#650

Boosting 3D Single Object Tracking with 2D Matching Distillation and 3D Pre-training

qiangqiang wu, Yan Xia, Jia Wan et al.

ECCV 2024poster
13
citations
#651

Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation

Peng Jin, Hao Li, Zesen Cheng et al.

ECCV 2024posterarXiv:2407.10528
13
citations
#652

Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures

Jiaxing Huang, Yanfeng Zhou, Yaoru Luo et al.

ECCV 2024posterarXiv:2407.14754
13
citations
#653

WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural Language

Zhenxiang Lin, Xidong Peng, peishan cong et al.

ECCV 2024posterarXiv:2304.05645
13
citations
#654

Semi-supervised Segmentation of Histopathology Images with Noise-Aware Topological Consistency

Meilong Xu, Xiaoling Hu, Saumya Gupta et al.

ECCV 2024posterarXiv:2311.16447
13
citations
#655

Harnessing Text-to-Image Diffusion Models for Category-Agnostic Pose Estimation

Duo Peng, Zhengbo Zhang, Ping Hu et al.

ECCV 2024poster
13
citations
#656

OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation

Kwanyoung Kim, Yujin Oh, Jong Chul Ye

ECCV 2024posterarXiv:2403.14183
13
citations
#657

Brain-ID: Learning Contrast-agnostic Anatomical Representations for Brain Imaging

Peirong Liu, Oula Puonti, Xiaoling Hu et al.

ECCV 2024posterarXiv:2311.16914
13
citations
#658

Where am I? Scene Retrieval with Language

Jiaqi Chen, Daniel Barath, Iro Armeni et al.

ECCV 2024posterarXiv:2404.14565
13
citations
#659

3DGazeNet: Generalizing Gaze Estimation with Weak Supervision from Synthetic Views

Evangelos Ververas, Polydefkis Gkagkos, Jiankang Deng et al.

ECCV 2024posterarXiv:2212.02997
13
citations
#660

UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding

Bowen Shi, Peisen Zhao, Zichen Wang et al.

ECCV 2024posterarXiv:2401.06397
13
citations
#661

Robust Multimodal Learning via Representation Decoupling

Shicai Wei, Yang Luo, Yuji Wang et al.

ECCV 2024posterarXiv:2407.04458
13
citations
#662

DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation

Xiaobin Hu, Xu Peng, Donghao Luo et al.

ECCV 2024posterarXiv:2403.06168
13
citations
#663

Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather

Junsung Park, Kyungmin Kim, Hyunjung Shim

ECCV 2024posterarXiv:2407.02286
13
citations
#664

ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer

Jiazhi Guan, Zhiliang Xu, Hang Zhou et al.

ECCV 2024posterarXiv:2408.03284
13
citations
#665

STAMP: Outlier-Aware Test-Time Adaptation with Stable Memory Replay

Yu Yongcan, Lijun Sheng, Ran He et al.

ECCV 2024posterarXiv:2407.15773
13
citations
#666

MultiDelete for Multimodal Machine Unlearning

Jiali Cheng, Hadi Amiri

ECCV 2024posterarXiv:2311.12047
13
citations
#667

Self-Guided Generation of Minority Samples Using Diffusion Models

Soobin Um, Jong Chul Ye

ECCV 2024posterarXiv:2407.11555
13
citations
#668

3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation

Zihao Xiao, Longlong Jing, Shangxuan Wu et al.

ECCV 2024posterarXiv:2401.02402
13
citations
#669

ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments

Taewoong Kim, Cheolhong Min, Byeonghwi Kim et al.

ECCV 2024posterarXiv:2407.18550
13
citations
#670

IMMA: Immunizing text-to-image Models against Malicious Adaptation

Amber Yijia Zheng, Raymond Yeh

ECCV 2024posterarXiv:2311.18815
13
citations
#671

InstructGIE: Towards Generalizable Image Editing

Zichong Meng, Changdi Yang, Jun Liu et al.

ECCV 2024posterarXiv:2403.05018
13
citations
#672

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

Zijian Zhou, Zheng Zhu, Holger Caesar et al.

ECCV 2024posterarXiv:2407.11213
13
citations
#673

MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models

Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu, Vishal Patel

ECCV 2024posterarXiv:2404.09977
13
citations
#674

Just a Hint: Point-Supervised Camouflaged Object Detection

Huafeng Chen, Dian SHAO, Guangqian Guo et al.

ECCV 2024posterarXiv:2408.10777
13
citations
#675

BKDSNN: Enhancing the Performance of Learning-based Spiking Neural Networks Training with Blurred Knowledge Distillation

Zekai Xu, Kang You, Qinghai Guo et al.

ECCV 2024posterarXiv:2407.09083
13
citations
#676

Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance

Reyhane Askari Hemmat, Melissa Hall, Alicia Yi Sun et al.

ECCV 2024posterarXiv:2406.04551
13
citations
#677

Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL

Fangwei Zhong, Kui Wu, Hai Ci et al.

ECCV 2024posterarXiv:2404.09857
13
citations
#678

Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image

Pengkun Jiao, Na Zhao, Jingjing Chen et al.

ECCV 2024posterarXiv:2407.05256
13
citations
#679

ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention

Chenhang He, Ruihuang Li, Guowen Zhang et al.

ECCV 2024posterarXiv:2401.00912
13
citations
#680

Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling

Noam Elata, Tomer Michaeli, Michael Elad

ECCV 2024posterarXiv:2407.08256
13
citations
#681

Physical-Based Event Camera Simulator

Haiqian Han, Jiacheng Lyu, Jianing Li et al.

ECCV 2024poster
12
citations
#682

BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues

Sara Sarto, Marcella Cornia, Lorenzo Baraldi et al.

ECCV 2024posterarXiv:2407.20341
12
citations
#683

RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos

Tanveer Hannan, Mohaiminul Islam, Thomas Seidl et al.

ECCV 2024posterarXiv:2312.06729
12
citations
#684

Multi-Label Cluster Discrimination for Visual Representation Learning

Xiang An, Kaicheng Yang, Xiangzi Dai et al.

ECCV 2024posterarXiv:2407.17331
12
citations
#685

PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance

Aoming Liu, Zhong Li, Zhang Chen et al.

ECCV 2024posterarXiv:2408.02157
12
citations
#686

SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather

Edoardo Palladin, Roland Dietze, Praveen Narayanan et al.

ECCV 2024posterarXiv:2508.16408
12
citations
#687

Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation

Zhengyuan Xie, Haiquan Lu, Jia-wen Xiao et al.

ECCV 2024posterarXiv:2407.14142
12
citations
#688

OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection

Jinghua Hou, Tong Wang, Xiaoqing Ye et al.

ECCV 2024posterarXiv:2407.10753
12
citations
#689

Improving Feature Stability during Upsampling -- Spectral Artifacts and the Importance of Spatial Context

Shashank Agnihotri, Julia Grabinski, Margret Keuper

ECCV 2024posterarXiv:2311.17524
12
citations
#690

ChEX: Interactive Localization and Region Description in Chest X-rays

Philip Müller, Georgios Kaissis, Daniel Rueckert

ECCV 2024posterarXiv:2404.15770
12
citations
#691

∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions

Minh Quan Le, Alexandros Graikos, Srikar Yellapragada et al.

ECCV 2024posterarXiv:2407.14709
12
citations
#692

CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection

Xunfa Lai, Zhiyu Yang, Jie Hu et al.

ECCV 2024posterarXiv:2408.08050
12
citations
#693

Closed-Loop Unsupervised Representation Disentanglement with $\beta$-VAE Distillation and Diffusion Probabilistic Feedback

Xin Jin, Bohan Li, Baao Xie et al.

ECCV 2024poster
12
citations
#694

UpFusion: Novel View Diffusion from Unposed Sparse View Observations

Bharath Raj Nagoor Kani, Hsin-Ying Lee, Sergey Tulyakov et al.

ECCV 2024posterarXiv:2312.06661
12
citations
#695

CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner

Tingbing Yan, Wenzheng Zeng, Yang Xiao et al.

ECCV 2024posterarXiv:2403.10082
12
citations
#696

PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation

Shilin Yan, Xiaohao Xu, Renrui Zhang et al.

ECCV 2024posterarXiv:2309.12303
12
citations
#697

DeTra: A Unified Model for Object Detection and Trajectory Forecasting

Sergio Casas, Ben T Agro, Jiageng Mao et al.

ECCV 2024posterarXiv:2406.04426
12
citations
#698

Multi-Sentence Grounding for Long-term Instructional Video

Zeqian Li, QIRUI CHEN, Tengda Han et al.

ECCV 2024posterarXiv:2312.14055
12
citations
#699

Adapting Fine-Grained Cross-View Localization to Areas without Fine Ground Truth

Zimin Xia, Yujiao Shi, HONGDONG LI et al.

ECCV 2024posterarXiv:2406.00474
12
citations
#700

Can OOD Object Detectors Learn from Foundation Models?

Jiahui Liu, Xin Wen, Shizhen Zhao et al.

ECCV 2024posterarXiv:2409.05162
12
citations
#701

Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models

Jiaqi Xu, Mengyang Wu, Xiaowei Hu et al.

ECCV 2024posterarXiv:2409.02101
12
citations
#702

Kernel Diffusion: An Alternate Approach to Blind Deconvolution

Yash Sanghvi, Yiheng Chi, Stanley Chan

ECCV 2024posterarXiv:2312.02319
12
citations
#703

Modeling and Driving Human Body Soundfields through Acoustic Primitives

Chao Huang, Dejan Markovic, Chenliang Xu et al.

ECCV 2024posterarXiv:2407.13083
12
citations
#704

COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation

Liu He, Daniel Aliaga

ECCV 2024posterarXiv:2407.11294
12
citations
#705

DoughNet: A Visual Predictive Model for Topological Manipulation of Deformable Objects

Dominik Bauer, Zhenjia Xu, Shuran Song

ECCV 2024posterarXiv:2404.12524
12
citations
#706

SINDER: Repairing the Singular Defects of DINOv2

Haoqi Wang, Tong Zhang, Mathieu Salzmann

ECCV 2024posterarXiv:2407.16826
12
citations
#707

Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection

Alireza Ganjdanesh, Yan Kang, Yuchen Liu et al.

ECCV 2024posterarXiv:2409.15557
12
citations
#708

DiffFAS: Face Anti-Spoofing via Generative Diffusion Models

Xinxu Ge, Xin Liu, Zitong Yu et al.

ECCV 2024posterarXiv:2409.08572
12
citations
#709

MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation

Linyan Yang, Lukas Hoyer, Mark Weber et al.

ECCV 2024posterarXiv:2408.16478
12
citations
#710

BAFFLE: A Baseline of Backpropagation-Free Federated Learning

Haozhe Feng, Tianyu Pang, Chao Du et al.

ECCV 2024posterarXiv:2301.12195
12
citations
#711

Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution

Fengyuan Liu, Haochen Luo, Yiming Li et al.

ECCV 2024posterarXiv:2404.02697
12
citations
#712

Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation

Seonghoon Yu, Paul Hongsuck Seo, Jeany Son

ECCV 2024posterarXiv:2407.07412
12
citations
#713

Multi-modal Crowd Counting via a Broker Modality

Haoliang Meng, Xiaopeng Hong, Chenhao Wang et al.

ECCV 2024posterarXiv:2407.07518
12
citations
#714

CMTA: Cross-Modal Temporal Alignment for Event-guided Video Deblurring

Taewoo Kim, Hoonhee Cho, Kuk-Jin Yoon

ECCV 2024posterarXiv:2408.14930
12
citations
#715

RICA^2: Rubric-Informed, Calibrated Assessment of Actions

Abrar Majeedi, Viswanatha Reddy Gajjala, Satya Sai Srinath Namburi GNVV et al.

ECCV 2024poster
12
citations
#716

Real Appearance Modeling for More General Deepfake Detection

Jiahe Tian, Yu Cai, Xi Wang et al.

ECCV 2024poster
12
citations
#717

Mitigating Background Shift in Class-Incremental Semantic Segmentation

gilhan Park, WonJun Moon, SuBeen Lee et al.

ECCV 2024posterarXiv:2407.11859
12
citations
#718

Contrastive ground-level image and remote sensing pre-training improves representation learning for natural world imagery

Andy V Huynh, Lauren Gillespie, Jael Lopez-Saucedo et al.

ECCV 2024posterarXiv:2409.19439
12
citations
#719

Eliminating Warping Shakes for Unsupervised Online Video Stitching

Lang Nie, Chunyu Lin, Kang Liao et al.

ECCV 2024posterarXiv:2403.06378
12
citations
#720

Learning Video Context as Interleaved Multimodal Sequences

Qinghong Lin, Pengchuan Zhang, Difei Gao et al.

ECCV 2024posterarXiv:2407.21757
12
citations
#721

Explorative Inbetweening of Time and Space

Haiwen Feng, Zheng Ding, Zhihao Xia et al.

ECCV 2024posterarXiv:2403.14611
12
citations
#722

FutureDepth: Learning to Predict the Future Improves Video Depth Estimation

Rajeev Yasarla, Manish Kumar Singh, Hong Cai et al.

ECCV 2024posterarXiv:2403.12953
12
citations
#723

Temporally Consistent Stereo Matching

Jiaxi Zeng, Chengtang Yao, Yuwei Wu et al.

ECCV 2024posterarXiv:2407.11950
12
citations
#724

FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior

Zhekai Chen, Wen Wang, Zhen Yang et al.

ECCV 2024posterarXiv:2407.04947
12
citations
#725

DreamDiffusion: High-Quality EEG-to-Image Generation with Temporal Masked Signal Modeling and CLIP Alignment

Yunpeng Bai, Xintao Wang, Yanpei Cao et al.

ECCV 2024poster
12
citations
#726

Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models

Francesco Croce, Naman D. Singh, Matthias Hein

ECCV 2024posterarXiv:2306.12941
12
citations
#727

UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt

Xin Li, Bingchen Li, Yeying Jin et al.

ECCV 2024posterarXiv:2407.13108
12
citations
#728

C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition

Rongchang Li, Zhenhua Feng, Tianyang Xu et al.

ECCV 2024posterarXiv:2407.06113
12
citations
#729

Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models

Xiaoyu Zhu, Hao Zhou, Pengfei Xing et al.

ECCV 2024posterarXiv:2407.13642
11
citations
#730

ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders

Jefferson Hernandez, Ruben Villegas, Vicente Ordonez

ECCV 2024posterarXiv:2303.12001
11
citations
#731

CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts

Yichao Cai, Yuhang Liu, Zhen Zhang et al.

ECCV 2024posterarXiv:2311.16445
11
citations
#732

TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning

Huabin Liu, Xiao Ma, Cheng Zhong et al.

ECCV 2024poster
11
citations
#733

IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation

Yuanhao Zhai, Kevin Lin, Linjie Li et al.

ECCV 2024posterarXiv:2407.10937
11
citations
#734

NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model

Zhongqun Zhang, Hengfei Wang, Ziwei Yu et al.

ECCV 2024posterarXiv:2407.12727
11
citations
#735

Training-Free Model Merging for Multi-target Domain Adaptation

Wenyi Li, Huan-ang Gao, Mingju Gao et al.

ECCV 2024posterarXiv:2407.13771
11
citations
#736

Deblur e-NeRF: NeRF from Motion-Blurred Events under High-speed or Low-light Conditions

Weng Fei Low, Gim Hee Lee

ECCV 2024posterarXiv:2409.17988
11
citations
#737

TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models

Jeongho Kim, Min-Jung Kim, Junsoo Lee et al.

ECCV 2024posterarXiv:2407.09012
11
citations
#738

DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation

Haibo Yang, Yang Chen, Yingwei Pan et al.

ECCV 2024posterarXiv:2409.07454
11
citations
#739

Self-Supervised Any-Point Tracking by Contrastive Random Walks

Ayush Shrivastava, Andrew Owens

ECCV 2024posterarXiv:2409.16288
11
citations
#740

Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion

Yu Cao, Shaogang Gong

ECCV 2024posterarXiv:2407.07249
11
citations
#741

DiffusionPen: Towards Controlling the Style of Handwritten Text Generation

KONSTANTINA NIKOLAIDOU, George Retsinas, Giorgos Sfikas et al.

ECCV 2024posterarXiv:2409.06065
11
citations
#742

CarFormer: Self-Driving with Learned Object-Centric Representations

Shadi Hamdan, Fatma Guney

ECCV 2024posterarXiv:2407.15843
11
citations
#743

Monocular Occupancy Prediction for Scalable Indoor Scenes

Hongxiao Yu, Yuqi Wang, Yuntao Chen et al.

ECCV 2024posterarXiv:2407.11730
11
citations
#744

Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph

Zhengcen Li, Xinle Chang, Yueran Li et al.

ECCV 2024posterarXiv:2407.19497
11
citations
#745

Unlocking Attributes' Contribution to Successful Camouflage: A Combined Textual and Visual Analysis Strategy

Hong Zhang, Yixuan Lyu, Qian Yu et al.

ECCV 2024poster
11
citations
#746

Diff-Reg: Diffusion Model in Doubly Stochastic Matrix Space for Registration Problem

Qianliang Wu, Haobo Jiang, Lei Luo et al.

ECCV 2024poster
11
citations
#747

OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework

Wanyun Li, Pinxue Guo, Xinyu Zhou et al.

ECCV 2024posterarXiv:2403.08682
11
citations
#748

TF-FAS: Twofold-Element Fine-Grained Semantic Guidance for Generalizable Face Anti-Spoofing

Xudong Wang, Ke-Yue Zhang, Taiping Yao et al.

ECCV 2024poster
11
citations
#749

EDformer: Transformer-Based Event Denoising Across Varied Noise Levels

Bin Jiang, Bo Xiong, Bohan Qu et al.

ECCV 2024poster
11
citations
#750

FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models

Wei WU, Qingnan Fan, Shuai Qin et al.

ECCV 2024posterarXiv:2404.11895
11
citations
#751

Rethinking Features-Fused-Pyramid-Neck for Object Detection

Hulin Li

ECCV 2024posterarXiv:2505.12820
11
citations
#752

Global Counterfactual Directions

Bartlomiej Sobieski, Przemyslaw Biecek

ECCV 2024posterarXiv:2404.12488
11
citations
#753

Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation

Shuangrui Ding, Rui Qian, Haohang Xu et al.

ECCV 2024posterarXiv:2311.17893
11
citations
#754

Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing

Zizheng Yang, Hu Yu, Bing Li et al.

ECCV 2024posterarXiv:2509.20091
11
citations
#755

LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection

Sanmin Kim, Youngseok Kim, Sihwan Hwang et al.

ECCV 2024posterarXiv:2407.10164
11
citations
#756

SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning

Haiwen Diao, Bo Wan, XU JIA et al.

ECCV 2024posterarXiv:2407.07523
11
citations
#757

DGD: Dynamic 3D Gaussians Distillation

Isaac Labe, Noam Issachar, Itai Lang et al.

ECCV 2024posterarXiv:2405.19321
11
citations
#758

Class-Agnostic Object Counting with Text-to-Image Diffusion Model

Xiaofei Hui, Qian Wu, Hossein Rahmani et al.

ECCV 2024poster
11
citations
#759

Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors

Ruicheng Wang, Jianfeng Xiang, Jiaolong Yang et al.

ECCV 2024posterarXiv:2403.11503
11
citations
#760

Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation

Jiawei Han, Kaiqi Liu, Wei Li et al.

ECCV 2024posterarXiv:2408.10537
11
citations
#761

MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion

Lehong Wu, Lilang Lin, Jiahang Zhang et al.

ECCV 2024posterarXiv:2409.10473
11
citations
#762

Dataset Quantization with Active Learning based Adaptive Sampling

Zhenghao Zhao, Yuzhang Shang, Junyi Wu et al.

ECCV 2024posterarXiv:2407.07268
11
citations
#763

Fairness-aware Vision Transformer via Debiased Self-Attention

Yao Qiang, Chengyin Li, Prashant Khanduri et al.

ECCV 2024posterarXiv:2301.13803
11
citations
#764

INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding

jiha jang, Hoigi Seo, Se Young Chun

ECCV 2024posterarXiv:2409.06210
11
citations
#765

KFD-NeRF: Rethinking Dynamic NeRF with Kalman Filter

Yifan Zhan, Zhuoxiao Li, Muyao Niu et al.

ECCV 2024posterarXiv:2407.13185
11
citations
#766

MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment

Anurag Das, Xinting Hu, Li Jiang et al.

ECCV 2024posterarXiv:2407.21654
11
citations
#767

BlenderAlchemy: Editing 3D Graphics with Vision-Language Models

Ian Huang, Guandao Yang, Leonidas Guibas

ECCV 2024posterarXiv:2404.17672
11
citations
#768

DreamStruct: Understanding Slides and User Interfaces via Synthetic Data Generation

Yi-Hao Peng, Faria Huq, Yue Jiang et al.

ECCV 2024posterarXiv:2410.00201
11
citations
#769

RoadPainter: Points Are Ideal Navigators for Topology transformER

Zhongxing Ma, Liang Shuang, Yongkun Wen et al.

ECCV 2024posterarXiv:2407.15349
11
citations
#770

How to Train the Teacher Model for Effective Knowledge Distillation

Shayan Mohajer Hamidi, Xizhen Deng, Renhao Tan et al.

ECCV 2024posterarXiv:2407.18041
11
citations
#771

Timestep-Aware Correction for Quantized Diffusion Models

Yuzhe YAO, Feng Tian, Jun Chen et al.

ECCV 2024posterarXiv:2407.03917
11
citations
#772

OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model

Runyi Li, Xuhan SHENG, Weiqi Li et al.

ECCV 2024posterarXiv:2404.10312
11
citations
#773

3x2: 3D Object Part Segmentation by 2D Semantic Correspondences

Anh Thai, Weiyao Wang, Hao Tang et al.

ECCV 2024posterarXiv:2407.09648
11
citations
#774

Placing Objects in Context via Inpainting for Out-of-distribution Segmentation

Pau de Jorge Aranda, Riccardo Volpi, Puneet Dokania et al.

ECCV 2024posterarXiv:2402.16392
11
citations
#775

Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation

Kihong Kim, Haneol Lee, Jihye Park et al.

ECCV 2024posterarXiv:2402.13729
11
citations
#776

The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization

Jiafeng Mao, Xueting Wang, Kiyoharu Aizawa

ECCV 2024posterarXiv:2312.08872
11
citations
#777

Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding

Ruihuang Li, Zhengqiang ZHANG, Chenhang He et al.

ECCV 2024posterarXiv:2407.09781
11
citations
#778

RT-Pose: A 4D Radar-Tensor based 3D Human Pose Estimation and Localization Benchmark

Yuan-Hao Ho, Jen-Hao Cheng, Sheng Yao Kuan et al.

ECCV 2024posterarXiv:2407.13930
11
citations
#779

Sparse Beats Dense: Rethinking Supervision in Radar-Camera Depth Completion

Huadong Li, Minhao Jing, Jin Wang et al.

ECCV 2024posterarXiv:2312.00844
11
citations
#780

Benchmarking Spurious Bias in Few-Shot Image Classifiers

Guangtao Zheng, Wenqian Ye, Aidong Zhang

ECCV 2024posterarXiv:2409.02882
11
citations
#781

SwapAnything: Enabling Arbitrary Object Swapping in Personalized Image Editing

Jing Gu, Nanxuan Zhao, Wei Xiong et al.

ECCV 2024poster
11
citations
#782

Inf-DiT: Upsampling any-resolution image with memory-efficient diffusion transformer.

Zhuoyi Yang, Heyang Jiang, Wenyi Hong et al.

ECCV 2024posterarXiv:2405.04312
11
citations
#783

Real-time Holistic Robot Pose Estimation with Unknown States

Shikun Ban, Juling Fan, Xiaoxuan Ma et al.

ECCV 2024posterarXiv:2402.05655
11
citations
#784

SAVE: Protagonist Diversification with Structure Agnostic Video Editing

Yeji Song, Wonsik Shin, Junsoo Lee et al.

ECCV 2024posterarXiv:2312.02503
11
citations
#785

Efficient Few-Shot Action Recognition via Multi-Level Post-Reasoning

Cong Wu, Xiao-Jun Wu, Linze Li et al.

ECCV 2024poster
11
citations
#786

Volumetric Rendering with Baked Quadrature Fields

Gopal Sharma, Daniel Rebain, Kwang Moo Yi et al.

ECCV 2024posterarXiv:2312.02202
10
citations
#787

BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models

Ye-Bin Moon, Nam Hyeon-Woo, Wonseok Choi et al.

ECCV 2024posterarXiv:2407.13442
10
citations
#788

Domain Shifting: A Generalized Solution for Heterogeneous Cross-Modality Person Re-Identification

Yan Jiang, Xu Cheng, Hao Yu et al.

ECCV 2024poster
10
citations
#789

Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation

Pengfei Wang, Yuxi Wang, Shuai Li et al.

ECCV 2024posterarXiv:2407.13362
10
citations
#790

Part2Object: Hierarchical Unsupervised 3D Instance Segmentation

cheng Shi, Yulin zhang, Bin Yang et al.

ECCV 2024posterarXiv:2407.10084
10
citations
#791

DiscoMatch: Fast Discrete Optimisation for Geometrically Consistent 3D Shape Matching

Paul Roetzer, Ahmed Abbas, Dongliang Cao et al.

ECCV 2024posterarXiv:2310.08230
10
citations
#792

Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities

Kaiwen Cai, ZheKai Duan, Gaowen Liu et al.

ECCV 2024posterarXiv:2403.04908
10
citations
#793

Idempotent Unsupervised Representation Learning for Skeleton-Based Action Recognition

Lilang Lin, Lehong Wu, Jiahang Zhang et al.

ECCV 2024posterarXiv:2410.20349
10
citations
#794

Motion and Structure from Event-based Normal Flow

Zhongyang Ren, Bangyan Liao, Delei Kong et al.

ECCV 2024posterarXiv:2407.12239
10
citations
#795

KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval

Xianwei Zhuang, Hongxiang Li, Xuxin Cheng et al.

ECCV 2024poster
10
citations
#796

VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving

Yibo Liu, Zheyuan Yang, Guile Wu et al.

ECCV 2024posterarXiv:2407.06516
10
citations
#797

Multiscale Sliced Wasserstein Distances as Perceptual Color Difference Measures

Jiaqi He, Zhihua Wang, Leon Wang et al.

ECCV 2024posterarXiv:2407.10181
10
citations
#798

Imaging Interiors: An Implicit Solution to Electromagnetic Inverse Scattering Problems

Ziyuan Luo, Boxin Shi, Haoliang Li et al.

ECCV 2024posterarXiv:2407.09352
10
citations
#799

Adversarially Robust Distillation by Reducing the Student-Teacher Variance Gap

Junhao Dong, Piotr Koniusz, Junxi Chen et al.

ECCV 2024poster
10
citations
#800

Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models

Minchan Kim, Minyeong Kim, Junik Bae et al.

ECCV 2024posterarXiv:2403.16167
10
citations