Most Cited ECCV "contextual appropriateness" Papers

2,387 papers found • Page 4 of 12

#601

Is Retain Set All You Need in Machine Unlearning? Restoring Performance of Unlearned Models with Out-Of-Distribution Images

Jacopo Bonato, Marco Cotogni, Luigi Sabetta

ECCV 2024arXiv:2404.12922
21
citations
#602

WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models

Zijian He, Peixin Chen, Guangrun Wang et al.

ECCV 2024arXiv:2407.10625
21
citations
#603

Protecting NeRFs' Copyright via Plug-And-Play Watermarking Base Model

Qi Song, Ziyuan Luo, Ka Chun Cheung et al.

ECCV 2024arXiv:2407.07735
21
citations
#604

When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset

Yi Zhang, Wang Zeng, Sheng Jin et al.

ECCV 2024arXiv:2407.10125
21
citations
#605

CadVLM: Bridging Language and Vision in the Generation of Parametric CAD Sketches

Sifan Wu, Amir Hosein Khasahmadi, Mor Katz et al.

ECCV 2024arXiv:2409.17457
21
citations
#606

SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions

XIAOYU LIU, Yuxiang WEI, Ming LIU et al.

ECCV 2024arXiv:2404.06451
20
citations
#607

ConGeo: Robust Cross-view Geo-localization across Ground View Variations

Li Mi, Chang Xu, Javiera Castillo Navarro et al.

ECCV 2024arXiv:2403.13965
20
citations
#608

Improving Video Segmentation via Dynamic Anchor Queries

Yikang Zhou, Tao Zhang, Xiangtai Li et al.

ECCV 2024arXiv:2404.00086
20
citations
#609

Diffusion for Natural Image Matting

Yihan Hu, Yiheng Lin, Wei Wang et al.

ECCV 2024arXiv:2312.05915
20
citations
#610

ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement

Muhammad Atif Butt, Kai Wang, Javier Vazquez-Corral et al.

ECCV 2024arXiv:2407.07197
20
citations
#611

iHuman: Instant Animatable Digital Humans From Monocular Videos

Pramish Paudel, Anubhav Khanal, Danda Pani Paudel et al.

ECCV 2024arXiv:2407.11174
20
citations
#612

CVT-Occ: Cost Volume Temporal Fusion for 3D Occupancy Prediction

Zhangchen Ye, Tao Jiang, Chenfeng Xu et al.

ECCV 2024arXiv:2409.13430
20
citations
#613

Simplifying Source-Free Domain Adaptation for Object Detection: Effective Self-Training Strategies and Performance Insights

Yan Hao, Florent Forest, Olga Fink

ECCV 2024arXiv:2407.07586
20
citations
#614

SPIRE: Semantic Prompt-Driven Image Restoration

Chenyang Qi, Zhengzhong Tu, Keren Ye et al.

ECCV 2024arXiv:2312.11595
20
citations
#615

Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers

Zhengbo Zhang, Li Xu, Duo Peng et al.

ECCV 2024arXiv:2407.08394
20
citations
#616

TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling

Dong Huo, Zixin Guo, Xinxin Zuo et al.

ECCV 2024arXiv:2408.01291
20
citations
#617

MirrorGaussian: Reflecting 3D Gaussians for Reconstructing Mirror Reflections

Jiayue Liu, Tang Xiao, Freeman Cheng et al.

ECCV 2024arXiv:2405.11921
20
citations
#618

Towards Open Domain Text-Driven Synthesis of Multi-Person Motions

Shan Mengyi, Lu Dong, Yutao Han et al.

ECCV 2024arXiv:2405.18483
20
citations
#619

Gaussian Frosting: Editable Complex Radiance Fields with Real-Time Rendering

Antoine Guedon, Vincent Lepetit

ECCV 2024
20
citations
#620

You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation

Mehdi Noroozi, Isma Hadji, Brais Martinez et al.

ECCV 2024arXiv:2401.17258
20
citations
#621

Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach

Shizhou Zhang, Wenlong Luo, De Cheng et al.

ECCV 2024arXiv:2408.07500
20
citations
#622

WordRobe: Text-Guided Generation of Textured 3D Garments

Astitva Srivastava, Pranav Manu, Amit Raj et al.

ECCV 2024arXiv:2403.17541
20
citations
#623

Revisit Human-Scene Interaction via Space Occupancy

Xinpeng Liu, Haowen Hou, Yanchao Yang et al.

ECCV 2024arXiv:2312.02700
20
citations
#624

MMVR: Millimeter-wave Multi-View Radar Dataset and Benchmark for Indoor Perception

Mohammad Mahbubur Rahman, Ryoma Yataka, Sorachi Kato et al.

ECCV 2024arXiv:2406.10708
20
citations
#625

Fast View Synthesis of Casual Videos with Soup-of-Planes

Yao-Chih Lee, Zhoutong Zhang, Kevin Blackburn-Matzen et al.

ECCV 2024arXiv:2312.02135
20
citations
#626

Beat-It: Beat-Synchronized Multi-Condition 3D Dance Generation

Zikai Huang, Xuemiao Xu, Cheng Xu et al.

ECCV 2024arXiv:2407.07554
20
citations
#627

A Graph-Based Approach for Category-Agnostic Pose Estimation

Or Hirschorn, Shai Avidan

ECCV 2024arXiv:2311.17891
20
citations
#628

Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation

Marco Mistretta, Alberto Baldrati, Marco Bertini et al.

ECCV 2024arXiv:2407.03056
20
citations
#629

OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects

Akshay Krishnan, Abhijit Kundu, Kevis Maninis et al.

ECCV 2024arXiv:2407.08711
19
citations
#630

SSL-Cleanse: Trojan Detection and Mitigation in Self-Supervised Learning

Mengxin Zheng, Jiaqi Xue, Zihao Wang et al.

ECCV 2024arXiv:2303.09079
19
citations
#631

Exemplar-free Continual Representation Learning via Learnable Drift Compensation

Alex Gomez-Villa, Dipam Goswami, Kai Wang et al.

ECCV 2024arXiv:2407.08536
19
citations
#632

Diffusion-Driven Data Replay: A Novel Approach to Combat Forgetting in Federated Class Continual Learning

Jinglin Liang, Jin Zhong, Hanlin Gu et al.

ECCV 2024arXiv:2409.01128
19
citations
#633

StructLDM: Structured Latent Diffusion for 3D Human Generation

Tao Hu, Fangzhou Hong, Ziwei Liu

ECCV 2024arXiv:2404.01241
19
citations
#634

PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery

Fernando Julio Cendra, Bingchen Zhao, Kai Han

ECCV 2024arXiv:2407.19001
19
citations
#635

StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion

Ming Tao, BINGKUN BAO, Hao Tang et al.

ECCV 2024arXiv:2404.05979
19
citations
#636

Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models

Ruibin Li, Ruihuang Li, Song Guo et al.

ECCV 2024arXiv:2403.11105
19
citations
#637

CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs

Akshat Ramachandran, Souvik Kundu, Tushar Krishna

ECCV 2024arXiv:2407.05266
19
citations
#638

Good Teachers Explain: Explanation-Enhanced Knowledge Distillation

Amin Parchami, Moritz Böhle, Sukrut Rao et al.

ECCV 2024arXiv:2402.03119
19
citations
#639

Learning to Unlearn for Robust Machine Unlearning

Mark HUANG, Lin Geng Foo, Jun Liu

ECCV 2024arXiv:2407.10494
19
citations
#640

Osmosis: RGBD Diffusion Prior for Underwater Image Restoration

Opher Bar Nathan, Deborah Steinberger-Levy, Tali Treibitz et al.

ECCV 2024arXiv:2403.14837
19
citations
#641

DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation

Jeongsol Kim, Geon Yeong Park, Jong Chul Ye

ECCV 2024arXiv:2403.11415
19
citations
#642

Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction

Alexander Timans, Christoph-Nikolas Straehle, Kaspar Sakmann et al.

ECCV 2024arXiv:2403.07263
19
citations
#643

NICP: Neural ICP for 3D Human Registration at Scale

Riccardo Marin, Enric Corona, Gerard Pons-Moll

ECCV 2024arXiv:2312.14024
19
citations
#644

FreestyleRet: Retrieving Images from Style-Diversified Queries

Hao Li, Yanhao Jia, Peng Jin et al.

ECCV 2024arXiv:2312.02428
19
citations
#645

OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation

Zhenyu Wang, Ya-Li Li, TAICHI LIU et al.

ECCV 2024arXiv:2403.19580
19
citations
#646

EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval

Thomas Hummel, Shyamgopal Karthik, Mariana-Iuliana Georgescu et al.

ECCV 2024arXiv:2407.16658
19
citations
#647

LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models

Yabin Zhang, Wenjie Zhu, Chenhang He et al.

ECCV 2024arXiv:2407.08966
19
citations
#648

Explain via Any Concept: Concept Bottleneck Model with Open Vocabulary Concepts

Andong Tan, Fengtao Zhou, Hao Chen

ECCV 2024arXiv:2408.02265
19
citations
#649

SparseCraft: Few-Shot Neural Reconstruction through Stereopsis Guided Geometric Linearization

Mae Younes, Amine Ouasfi, Adnane Boukhayma

ECCV 2024arXiv:2407.14257
19
citations
#650

Towards Real-world Event-guided Low-light Video Enhancement and Deblurring

Taewoo Kim, Jaeseok Jeong, Hoonhee Cho et al.

ECCV 2024arXiv:2408.14916
19
citations
#651

Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models

Yang Zhang, Tze Tzun Teoh, Wei Hern Lim et al.

ECCV 2024arXiv:2403.06381
19
citations
#652

GarmentCodeData: A Dataset of 3D Made-to-Measure Garments With Sewing Patterns

Maria Korosteleva, Timur Levent Kesdogan, Fabian Kemper et al.

ECCV 2024arXiv:2405.17609
19
citations
#653

CoMusion: Towards Consistent Stochastic Human Motion Prediction via Motion Diffusion

Jiarui Sun, Girish Chowdhary

ECCV 2024arXiv:2305.12554
19
citations
#654

SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs

Yang Miao, Francis Engelmann, Olga Vysotska et al.

ECCV 2024arXiv:2404.00469
19
citations
#655

Any2Point: Empowering Any-modality Transformers for Efficient 3D Understanding

YIWEN TANG, Renrui Zhang, Jiaming Liu et al.

ECCV 2024
19
citations
#656

Towards Neuro-Symbolic Video Understanding

Minkyu Choi, Harsh Goel, Mohammad Omama et al.

ECCV 2024arXiv:2403.11021
19
citations
#657

SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding

Weitai Kang, Gaowen Liu, Shah Mubarak et al.

ECCV 2024arXiv:2407.03200
19
citations
#658

HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras

Zhongyu Xia, ZhiWei Lin, Xinhao Wang et al.

ECCV 2024arXiv:2404.02517
19
citations
#659

LatentEditor: Text Driven Local Editing of 3D Scenes

Umar Khalid, Hasan Iqbal, Muhammad Tayyab et al.

ECCV 2024arXiv:2312.09313
19
citations
#660

LPViT: Low-Power Semi-structured Pruning for Vision Transformers

KAIXIN Xu, Zhe Wang, Chunyun Chen et al.

ECCV 2024arXiv:2407.02068
19
citations
#661

UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model

Xiangyu Fan, Jiaqi Li, Zhiqian Lin et al.

ECCV 2024arXiv:2408.00762
19
citations
#662

Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation

Hao Fang, Peng Wu, Yawei Li et al.

ECCV 2024arXiv:2407.07427
19
citations
#663

Contourlet Residual for Prompt Learning Enhanced Infrared Image Super-Resolution

Xingyuan Li, Jinyuan Liu, ZHIXIN CHEN et al.

ECCV 2024
19
citations
#664

Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation

Xu Zheng, Yuanhuiyi Lyu, jiazhou zhou et al.

ECCV 2024arXiv:2407.11344
19
citations
#665

ZeroI2V: Zero-Cost Adaptation of Pre-Trained Transformers from Image to Video

Xinhao Li, Yuhan Zhu, Limin Wang

ECCV 2024arXiv:2310.01324
19
citations
#666

InfMAE: A Foundation Model in The Infrared Modality

Fangcen liu, Chenqiang Gao, Yaming Zhang et al.

ECCV 2024arXiv:2402.00407
19
citations
#667

InterFusion: Text-Driven Generation of 3D Human-Object Interaction

Sisi Dai, Wenhao Li, Haowen Sun et al.

ECCV 2024arXiv:2403.15612
19
citations
#668

Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation

Bolin Lai, Fiona Ryan, Wenqi Jia et al.

ECCV 2024arXiv:2305.03907
19
citations
#669

HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes

Zhuopeng Li, Yilin Zhang, Chenming Wu et al.

ECCV 2024arXiv:2403.20032
19
citations
#670

UNIC: Universal Classification Models via Multi-teacher Distillation

Yannis Kalantidis, Larlus Diane, Mert Bulent SARIYILDIZ et al.

ECCV 2024arXiv:2408.05088
19
citations
#671

Spiking Wavelet Transformer

Yuetong Fang, Ziqing Wang, Lingfeng Zhang et al.

ECCV 2024arXiv:2403.11138
19
citations
#672

Crowd-SAM:SAM as a smart annotator for object detection in crowded scenes

Zhi Cai, Yingjie Gao, Yaoyan Zheng et al.

ECCV 2024arXiv:2407.11464
19
citations
#673

MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing

Haoyu Zhao, Tianyi Lu, Jiaxi Gu et al.

ECCV 2024arXiv:2311.17338
19
citations
#674

Caltech Aerial RGB-Thermal Dataset in the Wild

Connor Lee, Matthew Anderson, Nikhil Ranganathan et al.

ECCV 2024arXiv:2403.08997
19
citations
#675

Latent Diffusion Prior Enhanced Deep Unfolding for Snapshot Spectral Compressive Imaging

Zongliang Wu, Ruiying Lu, Ying Fu et al.

ECCV 2024arXiv:2311.14280
18
citations
#676

VividDreamer: Invariant Score Distillation for Hyper-Realistic Text-to-3D Generation

Wenjie Zhuo, Fan Ma, Hehe Fan et al.

ECCV 2024arXiv:2407.09822
18
citations
#677

Thinking Outside the BBox: Unconstrained Generative Object Compositing

Gemma Canet Tarrés, Zhe Lin, Zhifei Zhang et al.

ECCV 2024arXiv:2409.04559
18
citations
#678

Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation

Duy Tho Le, Hengcan Shi, Jianfei Cai et al.

ECCV 2024arXiv:2404.04629
18
citations
#679

Connecting Consistency Distillation to Score Distillation for Text-to-3D Generation

Zongrui Li, Minghui Hu, Qian Zheng et al.

ECCV 2024arXiv:2407.13584
18
citations
#680

Learning to Detect Multi-class Anomalies with Just One Normal Image Prompt

Bin-Bin Gao

ECCV 2024arXiv:2505.09264
18
citations
#681

AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation

Sun Yanan, Yanchen Liu, Yinhao Tang et al.

ECCV 2024arXiv:2406.18958
18
citations
#682

Dataset Enhancement with Instance-Level Augmentations

Orest Kupyn, Christian Rupprecht

ECCV 2024arXiv:2406.08249
18
citations
#683

RAW-Adapter: Adapting Pretrained Visual Model to Camera RAW Images

Ziteng Cui, Tatsuya Harada

ECCV 2024
18
citations
#684

Raindrop Clarity: A Dual-Focused Dataset for Day and Night Raindrop Removal

Yeying Jin, Xin Li, Jiadong Wang et al.

ECCV 2024arXiv:2407.16957
18
citations
#685

Multi-branch Collaborative Learning Network for 3D Visual Grounding

Zhipeng Qian, Yiwei Ma, Zhekai Lin et al.

ECCV 2024arXiv:2407.05363
18
citations
#686

CC-SAM: Enhancing SAM with Cross-feature Attention and Context for Ultrasound Image Segmentation

Shreyank Narayana Gowda, David A Clifton

ECCV 2024
18
citations
#687

Synchronization is All You Need: Exocentric-to-Egocentric Transfer for Temporal Action Segmentation with Unlabeled Synchronized Video Pairs

Camillo Quattrocchi, Antonino Furnari, Daniele Di Mauro et al.

ECCV 2024arXiv:2312.02638
18
citations
#688

Implicit Concept Removal of Diffusion Models

Zhili LIU, Kai Chen, Yifan Zhang et al.

ECCV 2024arXiv:2310.05873
18
citations
#689

GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction

Yuxuan Mu, Xinxin Zuo, Chuan Guo et al.

ECCV 2024arXiv:2407.04237
18
citations
#690

Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models

Yasi Zhang, Peiyu Yu, Ying Nian Wu

ECCV 2024arXiv:2404.07389
18
citations
#691

GRA: Detecting Oriented Objects through Group-wise Rotating and Attention

Jiangshan Wang, Yifan Pu, Yizeng Han et al.

ECCV 2024arXiv:2403.11127
18
citations
#692

Improving Virtual Try-On with Garment-focused Diffusion Models

Siqi Wan, Yehao Li, Jingwen Chen et al.

ECCV 2024arXiv:2409.08258
18
citations
#693

Unlocking the Potential of Federated Learning: The Symphony of Dataset Distillation via Deep Generative Latents

Yuqi Jia, Saeed Vahidian, Jingwei Sun et al.

ECCV 2024arXiv:2312.01537
18
citations
#694

Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification

Linhao Qu, Dingkang Yang, Dan Huang et al.

ECCV 2024arXiv:2407.10814
18
citations
#695

PILoRA: Prototype Guided Incremental LoRA for Federated Class-Incremental Learning

Haiyang Guo, Fei Zhu, Wenzhuo Liu et al.

ECCV 2024arXiv:2401.02094
18
citations
#696

Beta-Tuned Timestep Diffusion Model

Tianyi Zheng, Peng-Tao Jiang, Ben Wan et al.

ECCV 2024
18
citations
#697

SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds

Yanbo Wang, Wentao Zhao, Cao Chuan et al.

ECCV 2024arXiv:2407.11569
18
citations
#698

Keypoint Promptable Re-Identification

Vladimir Somers, Alexandre ALahi, Christophe De Vleeschouwer

ECCV 2024arXiv:2407.18112
18
citations
#699

Robust Incremental Structure-from-Motion with Hybrid Features

Shaohui Liu, Yidan Gao, Tianyi Zhang et al.

ECCV 2024arXiv:2409.19811
18
citations
#700

CoDA: Instructive Chain-of-Domain Adaptation with Severity-Aware Visual Prompt Tuning

Ziyang Gong, FuHao Li, Yupeng Deng et al.

ECCV 2024arXiv:2403.17369
18
citations
#701

Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition

Mingfang Zhang, Yifei Huang, Ruicong Liu et al.

ECCV 2024arXiv:2407.06628
18
citations
#702

CoReS: Orchestrating the Dance of Reasoning and Segmentation

Xiaoyi Bao, Siyang Sun, Shuailei Ma et al.

ECCV 2024arXiv:2404.05673
18
citations
#703

PetFace: A Large-Scale Dataset and Benchmark for Animal Identification

Risa Shinoda, Kaede Shiohara

ECCV 2024arXiv:2407.13555
18
citations
#704

InsMapper: Exploring Inner-instance Information for Vectorized HD Mapping

Zhenhua Xu, Kwan-Yee K. Wong, Hengshuang ZHAO

ECCV 2024arXiv:2308.08543
18
citations
#705

BeNeRF:Neural Radiance Fields from a Single Blurry Image and Event Stream

Wenpu Li, Pian Wan, Peng Wang et al.

ECCV 2024arXiv:2407.02174
18
citations
#706

PartSTAD: 2D-to-3D Part Segmentation Task Adaptation

Hyunjin Kim, Minhyuk Sung

ECCV 2024arXiv:2401.05906
18
citations
#707

Diffusion Model is a Good Pose Estimator from 3D RF-Vision

Junqiao Fan, Jianfei Yang, Yuecong Xu et al.

ECCV 2024arXiv:2403.16198
17
citations
#708

Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation

Yixiao Wang, Chen Tang, Lingfeng Sun et al.

ECCV 2024arXiv:2408.00766
17
citations
#709

SuperGaussian: Repurposing Video Models for 3D Super Resolution

Yuan Shen, Duygu Ceylan, Paul Guerrero et al.

ECCV 2024arXiv:2406.00609
17
citations
#710

PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts

Zewen Chen, Haina Qin, Juan Wang et al.

ECCV 2024arXiv:2403.04993
17
citations
#711

Prompt-Based Test-Time Real Image Dehazing: A Novel Pipeline

Zixuan Chen, Zewei He, Ziqian Lu et al.

ECCV 2024arXiv:2309.17389
17
citations
#712

Tackling Structural Hallucination in Image Translation with Local Diffusion

Seunghoi Kim, Chen Jin, Tom Diethe et al.

ECCV 2024arXiv:2404.05980
17
citations
#713

Text to Layer-wise 3D Clothed Human Generation

Junting Dong, Qi Fang, Zehuan Huang et al.

ECCV 2024arXiv:2404.16748
17
citations
#714

Rethinking Few-shot Class-incremental Learning: Learning from Yourself

Yu-Ming Tang, Yi-Xing Peng, Jing-Ke Meng et al.

ECCV 2024arXiv:2407.07468
17
citations
#715

Lazy Diffusion Transformer for Interactive Image Editing

Yotam Nitzan, Zongze Wu, Richard Zhang et al.

ECCV 2024arXiv:2404.12382
17
citations
#716

Visual Alignment Pre-training for Sign Language Translation

Peiqi Jiao, Yuecong Min, Xilin CHEN

ECCV 2024
17
citations
#717

CLOSER: Towards Better Representation Learning for Few-Shot Class-Incremental Learning

Junghun Oh, Sungyong Baik, Kyoung Mu Lee

ECCV 2024arXiv:2410.05627
17
citations
#718

Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders

Alexandre Eymaël, Renaud Vandeghen, Anthony Cioppa et al.

ECCV 2024arXiv:2403.17823
17
citations
#719

Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition

Masashi Hatano, Ryo Hachiuma, Ryo Fujii et al.

ECCV 2024arXiv:2405.19917
17
citations
#720

An Efficient and Effective Transformer Decoder-Based Framework for Multi-Task Visual Grounding

Wei Chen, Long Chen, Yu Wu

ECCV 2024arXiv:2408.01120
17
citations
#721

Deep Diffusion Image Prior for Efficient OOD Adaptation in 3D Inverse Problems

Hyungjin Chung, Jong Chul Ye

ECCV 2024arXiv:2407.10641
17
citations
#722

Self-Supervised Video Desmoking for Laparoscopic Surgery

Renlong Wu, Zhilu Zhang, Shuohao Zhang et al.

ECCV 2024arXiv:2403.11192
17
citations
#723

A Multimodal Benchmark Dataset and Model for Crop Disease Diagnosis

Xiang Liu, Zhaoxiang Liu, Huan Hu et al.

ECCV 2024arXiv:2503.06973
17
citations
#724

Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation

Xuelu Feng, Dongdong Chen, Junsong Yuan et al.

ECCV 2024arXiv:2403.12042
17
citations
#725

PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer

Tongkun Guan, Chengyu Lin, Wei Shen et al.

ECCV 2024arXiv:2407.07764
17
citations
#726

GaussReg: Fast 3D Registration with Gaussian Splatting

Jiahao Chang, Yinglin Xu, Yihao Li et al.

ECCV 2024arXiv:2407.05254
17
citations
#727

Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning

Yunbin Tu, Liang Li, Li Su et al.

ECCV 2024arXiv:2407.11683
17
citations
#728

Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation

Tao Chen, Xiruo Jiang, Gensheng Pei et al.

ECCV 2024arXiv:2407.02768
17
citations
#729

PartCraft: Crafting Creative Objects by Parts

Kam Woh Ng, Xiatian Zhu, Yi-Zhe Song et al.

ECCV 2024arXiv:2407.04604
17
citations
#730

One-stage Prompt-based Continual Learning

Youngeun Kim, YUHANG LI, Priyadarshini Panda

ECCV 2024arXiv:2402.16189
17
citations
#731

LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors

Saksham Suri, Matthew Walmer, Kamal Gupta et al.

ECCV 2024arXiv:2403.14625
17
citations
#732

Emergent Visual-Semantic Hierarchies in Image-Text Representations

Morris Alper, Hadar Averbuch-Elor

ECCV 2024arXiv:2407.08521
17
citations
#733

Motion-prior Contrast Maximization for Dense Continuous-Time Motion Estimation

Friedhelm Hamann, Ziyun Wang, Ioannis Asmanis et al.

ECCV 2024arXiv:2407.10802
17
citations
#734

AnatoMask: Enhancing Medical Image Segmentation with Reconstruction-guided Self-masking

Yuheng Li, Tianyu Luan, Yizhou Wu et al.

ECCV 2024arXiv:2407.06468
17
citations
#735

E.T. the Exceptional Trajectory: Text-to-camera-trajectory generation with character awareness

Robin Courant, Nicolas Dufour, Xi WANG et al.

ECCV 2024arXiv:2407.01516
17
citations
#736

UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving

Jian Zou, Tianyu Huang, Guanglei Yang et al.

ECCV 2024
17
citations
#737

Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance

Tien Toan Nguyen, Minh Nhat Nhat Vu, Baoru Huang et al.

ECCV 2024arXiv:2407.13842
17
citations
#738

M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models

Seunggeun Chi, Hyung-gun Chi, Hengbo Ma et al.

ECCV 2024arXiv:2407.14502
17
citations
#739

Context Diffusion: In-Context Aware Image Generation

Ivona Najdenkoska, Animesh Sinha, Abhimanyu Dubey et al.

ECCV 2024arXiv:2312.03584
17
citations
#740

Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment

Brian Gordon, Yonatan Bitton, Yonatan Shafir et al.

ECCV 2024arXiv:2312.03766
17
citations
#741

PARE-Net: Position-Aware Rotation-Equivariant Networks for Robust Point Cloud Registration

Runzhao Yao, Shaoyi Du, Wenting Cui et al.

ECCV 2024arXiv:2407.10142
17
citations
#742

Towards Multimodal Open-Set Domain Generalization and Adaptation through Self-supervision

Hao Dong, Eleni Chatzi, Olga Fink

ECCV 2024arXiv:2407.01518
17
citations
#743

MSD: A Benchmark Dataset for Floor Plan Generation of Building Complexes

Casper van Engelenburg, Fatemeh Mostafavi, Emanuel Kuhn et al.

ECCV 2024arXiv:2407.10121
17
citations
#744

City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the Web

Kaiwen Song, Xiaoyi Zeng, Chenqu Ren et al.

ECCV 2024arXiv:2312.16457
17
citations
#745

Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning

Yibing Wei, Abhinav Gupta, Pedro Morgado

ECCV 2024arXiv:2407.15837
16
citations
#746

Taming Latent Diffusion Model for Neural Radiance Field Inpainting

Chieh Lin, Changil Kim, Jia-Bin Huang et al.

ECCV 2024arXiv:2404.09995
16
citations
#747

TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Autonomous Driving

Cheng Zhao, su sun, Ruoyu Wang et al.

ECCV 2024arXiv:2404.02410
16
citations
#748

Towards Multi-modal Transformers in Federated Learning

Guangyu Sun, Matias Mendieta, Aritra Dutta et al.

ECCV 2024arXiv:2404.12467
16
citations
#749

Controllable Navigation Instruction Generation with Chain of Thought Prompting

Xianghao Kong, Jinyu Chen, Wenguan Wang et al.

ECCV 2024arXiv:2407.07433
16
citations
#750

Explicitly Guided Information Interaction Network for Cross-modal Point Cloud Completion

Hang Xu, Chen Long, Wenxiao Zhang et al.

ECCV 2024arXiv:2407.02887
16
citations
#751

Learning with Counterfactual Explanations for Radiology Report Generation

Mingjie Li, Haokun Lin, Liang Qiu et al.

ECCV 2024arXiv:2407.14474
16
citations
#752

Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis

Qian Chen, Shihao Shu, Xiangzhi Bai

ECCV 2024arXiv:2409.08042
16
citations
#753

Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation

Shihao Zhao, Shaozhe Hao, Bojia Zi et al.

ECCV 2024arXiv:2403.07860
16
citations
#754

Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models

Yu-Chu Yu, Chi-Pin Huang, Jr-Jen Chen et al.

ECCV 2024arXiv:2403.09296
16
citations
#755

The Hard Positive Truth about Vision-Language Compositionality

Amita Kamath, Cheng-Yu Hsieh, Kai-Wei Chang et al.

ECCV 2024arXiv:2409.17958
16
citations
#756

Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition

Sergio Izquierdo, Javier Civera

ECCV 2024arXiv:2407.02422
16
citations
#757

RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation

Luis Li, Hubert P. H. Shum, Toby P Breckon

ECCV 2024arXiv:2407.10159
16
citations
#758

PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation

Zhenyu Li, Shariq Farooq Bhat, Peter Wonka

ECCV 2024arXiv:2406.06679
16
citations
#759

EMIE-MAP: Large-Scale Road Surface Reconstruction Based on Explicit Mesh and Implicit Encoding

Wenhua Wu, Qi Wang, Guangming Wang et al.

ECCV 2024arXiv:2403.11789
16
citations
#760

LookupViT: Compressing visual information to a limited number of tokens

Rajat Koner, Gagan Jain, Sujoy Paul et al.

ECCV 2024arXiv:2407.12753
16
citations
#761

Image Demoireing in RAW and sRGB Domains

Shuning Xu, Binbin Song, Xiangyu Chen et al.

ECCV 2024arXiv:2312.09063
16
citations
#762

LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction

Penghui Du, Yu Wang, Yifan Sun et al.

ECCV 2024arXiv:2407.11335
16
citations
#763

Auto-GAS: Automated Proxy Discovery for Training-free Generative Architecture Search

Lujun Li, Haosen SUN, Shiwen Li et al.

ECCV 2024
16
citations
#764

Beyond MOT: Semantic Multi-Object Tracking

Yunhao Li, Qin Li, Hao Wang et al.

ECCV 2024arXiv:2403.05021
16
citations
#765

Diffusion Bridges for 3D Point Cloud Denoising

Mathias Vogel, Keisuke Tateno, Marc Pollefeys et al.

ECCV 2024arXiv:2408.16325
16
citations
#766

Tri^{2}-plane: Thinking Head Avatar via Feature Pyramid

Luchuan Song, Pinxin Liu, Lele Chen et al.

ECCV 2024arXiv:2401.09386
16
citations
#767

A Simple Background Augmentation Method for Object Detection with Diffusion Model

YUHANG LI, Xin Dong, Chen Chen et al.

ECCV 2024arXiv:2408.00350
16
citations
#768

TP2O: Creative Text Pair-to-Object Generation using Balance Swap-Sampling

Jun Li, Zedong Zhang, Jian Yang

ECCV 2024arXiv:2310.01819
16
citations
#769

AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation

Zanlin Ni, Yulin Wang, Renping Zhou et al.

ECCV 2024arXiv:2409.00342
16
citations
#770

MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration

Yulin Ren, Xin Li, Bingchen Li et al.

ECCV 2024arXiv:2407.10833
16
citations
#771

TimeLens-XL: Real-time Event-based Video Frame Interpolation with Large Motion

Shi Guo, Yutian Chen, Tianfan Xue et al.

ECCV 2024
16
citations
#772

DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition

Qi Wang, Zhou Xu, Yuming Lin et al.

ECCV 2024arXiv:2407.05106
16
citations
#773

Teach CLIP to Develop a Number Sense for Ordinal Regression

Yao DU, Qiang Zhai, Weihang Dai et al.

ECCV 2024arXiv:2408.03574
16
citations
#774

I Can't Believe It's Not Scene Flow!

Ishan Khatri, Kyle Vedder, Neehar Peri et al.

ECCV 2024arXiv:2403.04739
16
citations
#775

TransCAD: A Hierarchical Transformer for CAD Sequence Inference from Point Clouds

Dupont Elona, Kseniya Cherenkova, Dimitrios Mallis et al.

ECCV 2024arXiv:2407.12702
16
citations
#776

ColorMNet: A Memory-based Deep Spatial-Temporal Feature Propagation Network for Video Colorization

Yixin Yang, Jiangxin Dong, Jinhui Tang et al.

ECCV 2024arXiv:2404.06251
16
citations
#777

SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis

Hanrong Ye, Jason Wen Yong Kuen, Qing Liu et al.

ECCV 2024arXiv:2311.03355
16
citations
#778

Improving Text-guided Object Inpainting with Semantic Pre-inpainting

Yifu Chen, Jingwen Chen, Yingwei Pan et al.

ECCV 2024arXiv:2409.08260
15
citations
#779

Just a Hint: Point-Supervised Camouflaged Object Detection

Huafeng Chen, Dian SHAO, Guangqian Guo et al.

ECCV 2024arXiv:2408.10777
15
citations
#780

Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance

Reyhane Askari Hemmat, Melissa Hall, Alicia Yi Sun et al.

ECCV 2024arXiv:2406.04551
15
citations
#781

A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting

Wouter Van Gansbeke, Bert De Brabandere

ECCV 2024arXiv:2401.10227
15
citations
#782

Visual Text Generation in the Wild

Yuanzhi Zhu, Jiawei Liu, Feiyu Gao et al.

ECCV 2024arXiv:2407.14138
15
citations
#783

CLIP-Guided Generative Networks for Transferable Targeted Adversarial Attacks

Hao Fang, Jiawei Kong, Bin Chen et al.

ECCV 2024arXiv:2407.10179
15
citations
#784

Large-Scale Multi-Hypotheses Cell Tracking Using Ultrametric Contours Maps

Jordao Bragantini, Merlin Lange, Loïc A Royer

ECCV 2024arXiv:2308.04526
15
citations
#785

Learning Camouflaged Object Detection from Noisy Pseudo Label

Jin Zhang, Ruiheng Zhang, Yanjiao Shi et al.

ECCV 2024arXiv:2407.13157
15
citations
#786

FRDiff : Feature Reuse for Universal Training-free Acceleration of Diffusion Models

Junhyuk So, Jungwon Lee, Eunhyeok Park

ECCV 2024arXiv:2312.03517
15
citations
#787

Reinforcement Learning Friendly Vision-Language Model for Minecraft

Haobin Jiang, Junpeng Yue, Hao Luo et al.

ECCV 2024arXiv:2303.10571
15
citations
#788

LaPose: Laplacian Mixture Shape Modeling for RGB-Based Category-Level Object Pose Estimation

Ruida Zhang, Ziqin Huang, Gu Wang et al.

ECCV 2024arXiv:2409.15727
15
citations
#789

HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance

Guian Fang, Wenbiao Yan, Yuanfan Guo et al.

ECCV 2024arXiv:2407.06937
15
citations
#790

SUMix: Mixup with Semantic and Uncertain Information

Huafeng Qin, Xin Jin, Hongyu Zhu et al.

ECCV 2024arXiv:2407.07805
15
citations
#791

Instant 3D Human Avatar Generation using Image Diffusion Models

Nikos Kolotouros, Thiemo Alldieck, Enric Corona et al.

ECCV 2024arXiv:2406.07516
15
citations
#792

VEON: Vocabulary-Enhanced Occupancy Prediction

Jilai Zheng, Pin Tang, Zhongdao Wang et al.

ECCV 2024arXiv:2407.12294
15
citations
#793

DG-PIC: Domain Generalized Point-In-Context Learning for Point Cloud Understanding

Jincen Jiang, Qianyu Zhou, Yuhang Li et al.

ECCV 2024arXiv:2407.08801
15
citations
#794

UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection

Yingsen Zeng, Yujie Zhong, Chengjian Feng et al.

ECCV 2024arXiv:2404.04933
15
citations
#795

CardiacNet: Learning to Reconstruct Abnormalities for Cardiac Disease Assessment from Echocardiogram Videos

JIEWEN YANG, Yiqun Lin, Bin Pu et al.

ECCV 2024arXiv:2410.20769
15
citations
#796

Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized Visual Class Discovery

Haiyang Zheng, Pu Nan, Wenjing Li et al.

ECCV 2024arXiv:2403.07369
15
citations
#797

Event Camera Data Dense Pre-training

Yan Yang, Liyuan Pan, Liu liu

ECCV 2024arXiv:2311.11533
15
citations
#798

PLOT: Text-based Person Search with Part Slot Attention for Corresponding Part Discovery

Jicheol Park, Dongwon Kim, Boseung Jeong et al.

ECCV 2024arXiv:2409.13475
15
citations
#799

Animal Avatars: Reconstructing Animatable 3D Animals from Casual Videos

Remy Sabathier, David Novotny, Niloy Mitra

ECCV 2024arXiv:2403.17103
15
citations
#800

SAFNet: Selective Alignment Fusion Network for Efficient HDR Imaging

Lingtong Kong, Bo Li, Yike Xiong et al.

ECCV 2024arXiv:2407.16308
15
citations