Most Cited 2024 "conceptual generalization" Papers

12,324 papers found • Page 43 of 62

Filters:Most Cited 2024 conceptual generalization Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#8401

A Multimodal Benchmark Dataset and Model for Crop Disease Diagnosis

Xiang Liu, Zhaoxiang Liu, Huan Hu et al.

ECCV 2024arXiv:2503.06973

#8402

Missing Modality Prediction for Unpaired Multimodal Learning via Joint Embedding of Unimodal Models

Taesup Kim, Donggeun Kim

ECCV 2024arXiv:2407.12616

#8403

Fast Diffusion-Based Counterfactuals for Shortcut Removal and Generation

Nina Weng, Paraskevas Pegios, Eike Petersen et al.

ECCV 2024arXiv:2312.14223

#8404

GroCo: Ground Constraint for Metric Self-Supervised Monocular Depth

Aurélien Cecille, Stefan Duffner, Franck DAVOINE et al.

ECCV 2024arXiv:2409.14850

#8405

EMIE-MAP: Large-Scale Road Surface Reconstruction Based on Explicit Mesh and Implicit Encoding

Wenhua Wu, Qi Wang, Guangming Wang et al.

ECCV 2024arXiv:2403.11789

#8406

HyperSpaceX: Radial and Angular Exploration of HyperSpherical Dimensions

Chiranjeev Chiranjeev, Muskan Dosi, Kartik Thakral et al.

ECCV 2024arXiv:2408.02494

#8407

Common Sense Reasoning for Deep Fake Detection

Yue Zhang, Ben Colman, Xiao Guo et al.

ECCV 2024

#8408

Tight and Efficient Upper Bound on Spectral Norm of Convolutional Layers

Ekaterina Grishina, Mikhail Gorbunov, Maxim Rakhuba

ECCV 2024arXiv:2409.11859

#8409

Deciphering the Role of Representation Disentanglement: Investigating Compositional Generalization in CLIP Models

Reza Abbasi, Mohammad Rohban, Mahdieh Soleymani Baghshah

ECCV 2024arXiv:2407.05897

#8410

ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling

Siming Yan, Min Bai, Weifeng Chen et al.

ECCV 2024arXiv:2402.06118

#8411

Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training Strategy

Tao Li, Weisen Jiang, Fanghui Liu et al.

ECCV 2024arXiv:2407.03641

#8412

Deep Companion Learning: Enhancing Generalization Through Historical Consistency

Ruizhao Zhu, Venkatesh Saligrama

ECCV 2024arXiv:2407.18821

#8413

Straightforward Layer-wise Pruning for More Efficient Visual Adaptation

Ruizi Han, Jinglei Tang

ECCV 2024arXiv:2407.14330

#8414

ABC Easy as 123: A Blind Counter for Exemplar-Free Multi-Class Class-agnostic Counting

Michael A Hobley, Victor Adrian Prisacariu

ECCV 2024arXiv:2309.04820

#8415

CrossScore: A Multi-View Approach to Image Evaluation and Scoring

Zirui Wang, Wenjing Bian, Victor Adrian Prisacariu

ECCV 2024

#8416

CPM: Class-conditional Prompting Machine for Audio-visual Segmentation

Yuanhong Chen, Chong Wang, Yuyuan Liu et al.

ECCV 2024arXiv:2407.05358

#8417

DiffClass: Diffusion-Based Class Incremental Learning

Zichong Meng, Jie Zhang, Changdi Yang et al.

ECCV 2024arXiv:2403.05016

#8418

Dual-Decoupling Learning and Metric-Adaptive Thresholding for Semi-Supervised Multi-Label Learning

Jiahao Xiao, Ming-Kun Xie, Heng-Bo Fan et al.

ECCV 2024arXiv:2407.18624

#8419

SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

Lingchen Meng, Shiyi Lan, Hengduo Li et al.

ECCV 2024arXiv:2311.14671

#8420

DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing

Minghao Chen, Iro Laina, Andrea Vedaldi

ECCV 2024arXiv:2404.18929

#8421

Handling The Non-Smooth Challenge in Tensor SVD: A Multi-Objective Tensor Recovery Framework

Jingjing Zheng, Wanglong Lu, Wenzhe Wang et al.

ECCV 2024arXiv:2311.13958

#8422

3D Gaussian Parametric Head Model

Yuelang Xu, Lizhen Wang, Zerong Zheng et al.

ECCV 2024

#8423

Dynamic Neural Radiance Field From Defocused Monocular Video

Xianrui Luo, Huiqiang Sun, Juewen Peng et al.

ECCV 2024arXiv:2407.05586

#8424

4Diff: 3D-Aware Diffusion Model for Third-to-First Viewpoint Translation

Feng Cheng, Mi Luo, Huiyu Wang et al.

ECCV 2024

#8425

Realistic Human Motion Generation with Cross-Diffusion Models

Zeping Ren, Shaoli Huang, Xiu Li

ECCV 2024arXiv:2312.10993

#8426

UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model

Xiangyu Fan, Jiaqi Li, Zhiqian Lin et al.

ECCV 2024arXiv:2408.00762

#8427

PartCraft: Crafting Creative Objects by Parts

Kam Woh Ng, Xiatian Zhu, Yi-Zhe Song et al.

ECCV 2024arXiv:2407.04604

#8428

AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation

Sun Yanan, Yanchen Liu, Yinhao Tang et al.

ECCV 2024arXiv:2406.18958

#8429

MERLiN: Single-Shot Material Estimation and Relighting for Photometric Stereo

Ashish Tiwari, Satoshi Ikehata, Shanmuganathan Raman

ECCV 2024arXiv:2409.00674

#8430

Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design

Li, zhihao shu, Jie Ji et al.

ECCV 2024arXiv:2407.02813

#8431

BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation

Hee Suk Yoon, Eunseop Yoon, Joshua Tian Jin Tee et al.

ECCV 2024arXiv:2408.05926

#8432

PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation

Renjie Lu, Jing-Ke Meng, WEISHI ZHENG

ECCV 2024arXiv:2407.11487

#8433

Rethinking Few-shot Class-incremental Learning: Learning from Yourself

Yu-Ming Tang, Yi-Xing Peng, Jing-Ke Meng et al.

ECCV 2024arXiv:2407.07468

#8434

Long-CLIP: Unlocking the Long-Text Capability of CLIP

Beichen Zhang, Pan Zhang, Xiaoyi Dong et al.

ECCV 2024arXiv:2403.15378

#8435

RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF

Sibi Catley-Chandar, Richard Shaw, Greg Slabaugh et al.

ECCV 2024arXiv:2403.11909

#8436

FuseTeacher: Modality-fused Encoders are Strong Vision Supervisors

Chen-Wei Xie, Siyang Sun, Liming Zhao et al.

ECCV 2024

#8437

MVDD: Multi-View Depth Diffusion Models

Zhen Wang, Qiangeng Xu, Feitong Tan et al.

ECCV 2024arXiv:2312.04875

#8438

Learning with Counterfactual Explanations for Radiology Report Generation

Mingjie Li, Haokun Lin, Liang Qiu et al.

ECCV 2024arXiv:2407.14474

#8439

Pseudo-Embedding for Generalized Few-Shot Point Cloud Segmentation

Chih-Jung Tsai, Hwann-Tzong Chen, Tyng-Luh Liu

ECCV 2024

#8440

Wavelet Convolutions for Large Receptive Fields

Shahaf Finder, Roy Amoyal, Eran Treister et al.

ECCV 2024arXiv:2407.05848

#8441

AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer

Zhuguanyu Wu, Jiaxin Chen, Hanwen Zhong et al.

ECCV 2024arXiv:2407.12951

#8442

Gradient-based Out-of-Distribution Detection

Taha Entesari, Sina Sharifi, Bardia Safaei et al.

ECCV 2024

#8443

Veil Privacy on Visual Data: Concealing Privacy for Humans, Unveiling for DNNs

Shuchao Pang, Ruhao Ma, Bing Li et al.

ECCV 2024

#8444

Simple Unsupervised Knowledge Distillation With Space Similarity

Aditya Singh, Haohan Wang

ECCV 2024arXiv:2409.13939

#8445

Learning Natural Consistency Representation for Face Forgery Video Detection

Daichi Zhang, Zihao Xiao, Shikun Li et al.

ECCV 2024arXiv:2407.10550

#8446

View-Consistent 3D Editing with Gaussian Splatting

Yuxuan Wang, Xuanyu Yi, Zike Wu et al.

ECCV 2024arXiv:2403.11868

#8447

HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes

Zhuopeng Li, Yilin Zhang, Chenming Wu et al.

ECCV 2024arXiv:2403.20032

#8448

Generating Human Interaction Motions in Scenes with Text Control

Hongwei Yi, Justus Thies, Michael J. Black et al.

ECCV 2024arXiv:2404.10685

#8449

Optimizing Illuminant Estimation in Dual-Exposure HDR Imaging

Mahmoud Afifi, Zhenhua Hu, Liang Liang

ECCV 2024arXiv:2403.02449

#8450

Instruction Tuning-free Visual Token Complement for Multimodal LLMs

Dongsheng Wang, Jiequan Cui, Miaoge Li et al.

ECCV 2024arXiv:2408.05019

#8451

Improving Point-based Crowd Counting and Localization Based on Auxiliary Point Guidance

I-HSIANG CHEN, Wei-Ting Chen, Yu-Wei Liu et al.

ECCV 2024arXiv:2405.10589

#8452

Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts

Shuangkang Fang, Yufeng Wang, Yi-Hsuan Tsai et al.

ECCV 2024arXiv:2407.06842

#8453

HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation

Shanyan Guan, Yanhao Ge, Ying Tai et al.

ECCV 2024arXiv:2410.08192

#8454

SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer

Zijie Wu, Chaohui Yu, Yanqin Jiang et al.

ECCV 2024arXiv:2404.03736

#8455

Revisit Self-supervision with Local Structure-from-Motion

Shengjie Zhu, Xiaoming Liu

ECCV 2024

#8456

On the Viability of Monocular Depth Pre-training for Semantic Segmentation

DONG LAO, Fengyu Yang, Daniel Wang et al.

ECCV 2024arXiv:2203.13987

#8457

Weakly-supervised Camera Localization by Ground-to-satellite Image Registration

Yujiao Shi, HONGDONG LI, Akhil Perincherry et al.

ECCV 2024arXiv:2409.06471

#8458

GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection

Ziying Song, Lei Yang, Shaoqing Xu et al.

ECCV 2024arXiv:2403.11848

#8459

ProtoComp: Diverse Point Cloud Completion with Controllable Prototype

Xumin Yu, Yanbo Wang, Jie Zhou et al.

ECCV 2024

#8460

Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture

Xuanchen Li, Yuhao Cheng, Xingyu Ren et al.

ECCV 2024arXiv:2406.00440

#8461

EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis

Shuai Tan, Bin Ji, Mengxiao Bi et al.

ECCV 2024arXiv:2404.01647

#8462

Put Myself in Your Shoes: Lifting the Egocentric Perspective from Exocentric Videos

Mi Luo, Zihui Xue, Alex Dimakis et al.

ECCV 2024arXiv:2403.06351

#8463

LivePhoto: Real Image Animation with Text-guided Motion Control

Xi Chen, Zhiheng Liu, Mengting Chen et al.

ECCV 2024arXiv:2312.02928

#8464

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion

Wendi Zheng, Jiayan Teng, Zhuoyi Yang et al.

ECCV 2024arXiv:2403.05121

#8465

OAPT: Offset-Aware Partition Transformer for Double JPEG Artifacts Removal

Qiao Mo, Yukang Ding, Jinhua Hao et al.

ECCV 2024arXiv:2408.11480

#8466

Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration

shihao zhou, Jinshan Pan, Jinglei Shi et al.

ECCV 2024arXiv:2404.00288

#8467

Animate Your Motion: Turning Still Images into Dynamic Videos

Mingxiao Li, Bo Wan, Marie-Francine Moens et al.

ECCV 2024arXiv:2403.10179

#8468

Spatial-Temporal Multi-level Association for Video Object Segmentation

Deshui Miao, Xin Li, Zhenyu He et al.

ECCV 2024arXiv:2404.06265

#8469

Context-Aware Action Recognition: Introducing a Comprehensive Dataset for Behavior Contrast

Tatsuya Sasaki, Yoshiki Ito, Satoshi Kondo

ECCV 2024

#8470

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Jinbo Xing, Menghan Xia, Yong Zhang et al.

ECCV 2024arXiv:2310.12190

#8471

UniProcessor: A Text-induced Unified Low-level Image Processor

Huiyu Duan, Xiongkuo Min, Sijing Wu et al.

ECCV 2024arXiv:2407.20928

#8472

Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors

Tongkun Guan, Wei Shen, Xue Yang et al.

ECCV 2024arXiv:2312.05286

#8473

Learning Chain of Counterfactual Thought for Bias-Robust Vision-Language Reasoning

Yifeng Zhang, Ming Jiang, Qi Zhao

ECCV 2024

#8474

Let the Avatar Talk using Texts without Paired Training Data

Xiuzhe Wu, Yang-Tian Sun, Handi Chen et al.

ECCV 2024

#8475

Attention Beats Linear for Fast Implicit Neural Representation Generation

Shuyi Zhang, Ke Liu, Jingjun Gu et al.

ECCV 2024arXiv:2407.15355

#8476

Prompt-Based Test-Time Real Image Dehazing: A Novel Pipeline

Zixuan Chen, Zewei He, Ziqian Lu et al.

ECCV 2024arXiv:2309.17389

#8477

RCS-Prompt: Learning Prompt to Rearrange Class Space for Prompt-based Continual Learning

Longrong Yang, Hanbin Zhao, Yunlong Yu et al.

ECCV 2024

#8478

Dynamic Guidance Adversarial Distillation with Enhanced Teacher Knowledge

Hyejin Park, Dongbo Min

ECCV 2024arXiv:2409.01627

#8479

Gaussian Grouping: Segment and Edit Anything in 3D Scenes

Mingqiao Ye, Martin Danelljan, Fisher Yu et al.

ECCV 2024arXiv:2312.00732

#8480

3D Hand Sequence Recovery from Real Blurry Images and Event Stream

Joonkyu Park, Gyeongsik Moon, Weipeng Xu et al.

ECCV 2024

#8481

Segmentation-guided Layer-wise Image Vectorization with Gradient Fills

Hengyu Zhou, Hui Zhang, Bin Wang

ECCV 2024arXiv:2408.15741

#8482

HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting

Zhenglin Zhou, Fan Ma, Hehe Fan et al.

ECCV 2024arXiv:2402.06149

#8483

Explicitly Guided Information Interaction Network for Cross-modal Point Cloud Completion

Hang Xu, Chen Long, Wenxiao Zhang et al.

ECCV 2024arXiv:2407.02887

#8484

StructLDM: Structured Latent Diffusion for 3D Human Generation

Tao Hu, Fangzhou Hong, Ziwei Liu

ECCV 2024arXiv:2404.01241

#8485

High-Fidelity Modeling of Generalizable Wrinkle Deformation

Jingfan Guo, Jae Shin Yoon, Shunsuke Saito et al.

ECCV 2024

#8486

COMPOSE: Comprehensive Portrait Shadow Editing

Andrew Hou, Zhixin Shu, Xuaner Zhang et al.

ECCV 2024arXiv:2408.13922

#8487

EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion

Guangyao Zhai, Evin Pınar Örnek, Dave Zhenyu Chen et al.

ECCV 2024arXiv:2405.00915

#8488

Learning Representations from Foundation Models for Domain Generalized Stereo Matching

Yongjian Zhang, Longguang Wang, Kunhong Li et al.

ECCV 2024

#8489

NeRF-XL: NeRF at Any Scale with Multi-GPU

Ruilong Li, Sanja Fidler, Angjoo Kanazawa et al.

ECCV 2024

#8490

3D Hand Pose Estimation in Everyday Egocentric Images

Aditya Prakash, Ruisen Tu, Matthew Chang et al.

ECCV 2024arXiv:2312.06583

#8491

Controllable Human-Object Interaction Synthesis

Jiaman Li, Alexander Clegg, Roozbeh Mottaghi et al.

ECCV 2024arXiv:2312.03913

#8492

Nymeria: A Massive Collection of Egocentric Multi-modal Human Motion in the Wild

Lingni Ma, Yuting Ye, Rowan Postyeni et al.

ECCV 2024

#8493

Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene

Ruiyang Zhang, Hu Zhang, Hang Yu et al.

ECCV 2024arXiv:2407.08569

#8494

Six-Point Method for Multi-Camera Systems with Reduced Solution Space

Banglei Guan, Ji Zhao, Laurent Kneip

ECCV 2024arXiv:2402.18066

#8495

Tuning-Free Image Customization with Image and Text Guidance

Pengzhi Li, Qiang Nie, Ying Chen et al.

ECCV 2024arXiv:2403.12658

#8496

MegaScenes: Scene-Level View Synthesis at Scale

Joseph Tung, Gene Chou, Ruojin Cai et al.

ECCV 2024arXiv:2406.11819

#8497

Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation

Jinfeng Liu, Lingtong Kong, Bo Li et al.

ECCV 2024arXiv:2407.14126

#8498

Idea2Img: Iterative Self-Refinement with GPT-4V for Automatic Image Design and Generation

Zhengyuan Yang, Jianfeng Wang, Linjie Li et al.

ECCV 2024

#8499

Preventing Catastrophic Forgetting through Memory Networks in Continuous Detection

Gaurav Bhatt, Leonid Sigal, James Ross

ECCV 2024arXiv:2403.14797

#8500

COIN-Matting: Confounder Intervention for Image Matting

Zhaohe Liao, Jiangtong Li, Jun Lan et al.

ECCV 2024

#8501

Score Distillation Sampling with Learned Manifold Corrective

Thiemo Alldieck, Nikos Kolotouros, Cristian Sminchisescu

ECCV 2024arXiv:2401.05293

#8502

Rethinking Weakly-supervised Video Temporal Grounding From a Game Perspective

Xiang Fang, Zeyu Xiong, Wanlong Fang et al.

ECCV 2024

#8503

AdaDiffSR: Adaptive Region-aware Dynamic acceleration Diffusion Model for Real-World Image Super-Resolution

Yuanting Fan, Chengxu Liu, Nengzhong Yin et al.

ECCV 2024arXiv:2410.17752

#8504

Domesticating SAM for Breast Ultrasound Image Segmentation via Spatial-frequency Fusion and Uncertainty Correction

Wanting Zhang, Huisi Wu, Jing Qin

ECCV 2024

#8505

Interaction-centric Spatio-Temporal Context Reasoning for Multi-Person Video HOI Recognition

Yisong Wang, Nan Xi, Jingjing Meng et al.

ECCV 2024

#8506

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Yanwei Li, Chengyao Wang, Jiaya Jia

ECCV 2024arXiv:2311.17043

#8507

Agent3D-Zero: An Agent for Zero-shot 3D Understanding

Sha Zhang, Di Huang, Jiajun Deng et al.

ECCV 2024arXiv:2403.11835

#8508

Structured-NeRF: Hierarchical Scene Graph with Neural Representation

Zhide Zhong, Jiakai Cao, songen gu et al.

ECCV 2024

#8509

APL: Anchor-based Prompt Learning for One-stage Weakly Supervised Referring Expression Comprehension

Yaxin Luo, Jiayi Ji, Xiaofu Chen et al.

ECCV 2024

#8510

DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency

Xiaojing Zhong, Xinyi Huang, Xiaofeng Yang et al.

ECCV 2024arXiv:2408.07481

#8511

MeshFeat: Multi-Resolution Features for Neural Fields on Meshes

Mihir Mahajan, Florian Hofherr, Daniel Cremers

ECCV 2024arXiv:2407.13592

#8512

TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias

Sanghyun Jo, Soohyun Ryu, Sungyub Kim et al.

ECCV 2024arXiv:2404.00384

#8513

DragAPart: Learning a Part-Level Motion Prior for Articulated Objects

Ruining Li, Chuanxia Zheng, Christian Rupprecht et al.

ECCV 2024arXiv:2403.15382

#8514

Learning to Unlearn for Robust Machine Unlearning

Mark HUANG, Lin Geng Foo, Jun Liu

ECCV 2024arXiv:2407.10494

#8515

Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers

Zhengbo Zhang, Li Xu, Duo Peng et al.

ECCV 2024arXiv:2407.08394

#8516

Echoes of the Past: Boosting Long-tail Recognition via Reflective Learning

Qihao Zhao, YALUN DAI, Shen Lin et al.

ECCV 2024

#8517

Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models

Saman Motamed, Danda Pani Paudel, Luc Van Gool

ECCV 2024arXiv:2311.13833

#8518

Taming CLIP for Fine-grained and Structured Visual Understanding of Museum Exhibits

Ada-Astrid Balauca, Danda Paudel, Kristina Toutanova et al.

ECCV 2024arXiv:2409.01690

#8519

Visual Text Generation in the Wild

Yuanzhi Zhu, Jiawei Liu, Feiyu Gao et al.

ECCV 2024arXiv:2407.14138

#8520

A Unified Image Compression Method for Human Perception and Multiple Vision Tasks

Sha Guo, Sui Lin, Chen-Lin Zhang et al.

ECCV 2024

#8521

Learning Quantized Adaptive Conditions for Diffusion Models

Yuchen Liang, Yuchuan Tian, Lei Yu et al.

ECCV 2024arXiv:2409.17487

#8522

Learn to Optimize Denoising Scores: A Unified and Improved Diffusion Prior for 3D Generation

Xiaofeng Yang, Yiwen Chen, Cheng Chen et al.

ECCV 2024

#8523

Discovering Unwritten Visual Classifiers with Large Language Models

Mia Chiquier, Utkarsh Mall, Carl Vondrick

ECCV 2024

#8524

Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach

Taolin Zhang, Jiawang Bai, Zhihe Lu et al.

ECCV 2024arXiv:2407.06964

#8525

On the Approximation Risk of Few-Shot Class-Incremental Learning

Xuan Wang, Zhong Ji, Xiyao Liu et al.

ECCV 2024

#8526

In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation

Dahyun Kang, Minsu Cho

ECCV 2024arXiv:2408.04961

#8527

URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields

Bo Xu, Liu Ziao, Mengqi GUO et al.

ECCV 2024arXiv:2403.10119

#8528

Hierarchically Structured Neural Bones for Reconstructing Animatable Objects from Casual Videos

Subin Jeon, In Cho, Minsu Kim et al.

ECCV 2024arXiv:2408.00351

#8529

Protecting NeRFs' Copyright via Plug-And-Play Watermarking Base Model

Qi Song, Ziyuan Luo, Ka Chun Cheung et al.

ECCV 2024arXiv:2407.07735

#8530

MOD-UV: Learning Mobile Object Detectors from Unlabeled Videos

Yihong Sun, Bharath Hariharan

ECCV 2024arXiv:2405.14841

#8531

V-Trans4Style: Visual Transition Recommendation for Video Production Style Adaptation

Pooja Guhan, Tsung-Wei Huang, Guan-Ming Su et al.

ECCV 2024arXiv:2501.07983

#8532

WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation

Jiachen Lu, Ze Huang, Zeyu Yang et al.

ECCV 2024arXiv:2312.02934

#8533

Uncertainty-Driven Spectral Compressive Imaging with Spatial-Frequency Transformer

Lintao Peng, Siyu Xie, Liheng Bian

ECCV 2024

#8534

Weakly-Supervised Spatio-Temporal Video Grounding with Variational Cross-Modal Alignment

Yang Jin, Yadong Mu

ECCV 2024

#8535

Chronologically Accurate Retrieval for Temporal Grounding of Motion-Language Models

Kent Fujiwara, Mikihiro Tanaka, Qing Yu

ECCV 2024arXiv:2407.15408

#8536

SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow

Yuanzhi Zhu, Xingchao Liu, Qiang Liu

ECCV 2024arXiv:2407.12718

#8537

Domain Reduction Strategy for Non-Line-of-Sight Imaging

Hyunbo Shim, In Cho, Daekyu Kwon et al.

ECCV 2024arXiv:2308.10269

#8538

Learning to Enhance Aperture Phasor Field for Non-Line-of-Sight Imaging

In Cho, Hyunbo Shim, Seon Joo Kim

ECCV 2024arXiv:2407.18574

#8539

FRI-Net: Floorplan Reconstruction via Room-wise Implicit Representation

Honghao Xu, Juzhan Xu, Zeyu Huang et al.

ECCV 2024arXiv:2407.10687

#8540

A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation

Riccardo Fogliato, Pratik Patil, Mathew Monfort et al.

ECCV 2024arXiv:2406.07320

#8541

DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation

Rakshith Subramanyam, Kowshik Thopalli, Vivek Sivaraman Narayanaswamy et al.

ECCV 2024arXiv:2408.00331

#8542

ExMatch: Self-guided Exploitation for Semi-Supervised Learning with Scarce Labeled Samples

Noo-ri Kim, Jin-Seop Lee, Jee-Hyong LEE

ECCV 2024

#8543

CaesarNeRF: Calibrated Semantic Representation for Few-Shot Generalizable Neural Rendering

Haidong Zhu, Tianyu Ding, Tianyi Chen et al.

ECCV 2024arXiv:2311.15510

#8544

Open-Vocabulary RGB-Thermal Semantic Segmentation

Guoqiang Zhao, JunJie Huang, Xiaoyun Yan et al.

ECCV 2024

#8545

UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening

Siyuan Cheng, Guangyu Shen, Kaiyuan Zhang et al.

ECCV 2024arXiv:2407.11372

#8546

Unsupervised Moving Object Segmentation with Atmospheric Turbulence

Dehao Qin, Ripon Saha, Woojeh Chung et al.

ECCV 2024

#8547

Modeling Label Correlations with Latent Context for Multi-Label Recognition

Zhao-Min Chen, Quan Cui, Ruoxi Deng et al.

ECCV 2024

#8548

Towards Reliable Advertising Image Generation Using Human Feedback

Zhenbang Du, Wei Feng, Haohan Wang et al.

ECCV 2024arXiv:2408.00418

#8549

Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection

Kwanyong Park, Kuniaki Saito, Donghyun Kim

ECCV 2024arXiv:2407.15296

#8550

TurboEdit: Real-time text-based disentangled real image editing

Zongze Wu, Nicholas I Kolkin, Jonathan Brandt et al.

ECCV 2024

#8551

The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers

Seungwoo Son, Jegwang Ryu, Namhoon Lee et al.

ECCV 2024arXiv:2302.10494

#8552

Improving Vision and Language Concepts Understanding with Multimodal Counterfactual Samples

Chengen Lai, Shengli Song, Sitong Yan et al.

ECCV 2024

#8553

Functional Transform-Based Low-Rank Tensor Factorization for Multi-Dimensional Data Recovery

Jian-Li Wang, Xi-Le Zhao

ECCV 2024

#8554

Clean & Compact: Efficient Data-Free Backdoor Defense with Model Compactness

Huy Phan, Jinqi Xiao, Yang Sui et al.

ECCV 2024

#8555

Language-Assisted Skeleton Action Understanding for Skeleton-Based Temporal Action Segmentation

Haoyu Ji, Bowen Chen, Xinglong Xu et al.

ECCV 2024

#8556

A Geometric Distortion Immunized Deep Watermarking Framework with Robustness Generalizability

Linfeng Ma, Han Fang, Tianyi Wei et al.

ECCV 2024

#8557

VideoAgent: Long-form Video Understanding with Large Language Model as Agent

Xiaohan Wang, Yuhui Zhang, Orr Zohar et al.

ECCV 2024arXiv:2403.10517

#8558

MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration

Yulin Ren, Xin Li, Bingchen Li et al.

ECCV 2024arXiv:2407.10833

#8559

Adaptive Human Trajectory Prediction via Latent Corridors

Neerja Thakkar, Karttikeya Mangalam, Andrea Bajcsy et al.

ECCV 2024arXiv:2312.06653

#8560

Generalizable Facial Expression Recognition

Yuhang Zhang, Xiuqi Zheng, Chenyi Liang et al.

ECCV 2024arXiv:2408.10614

#8561

RS-NeRF: Neural Radiance Fields from Rolling Shutter Images

Muyao Niu, Tong Chen, Yifan Zhan et al.

ECCV 2024arXiv:2407.10267

#8562

MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space Terrain

Timothy Chase, Karthik Dantu

ECCV 2024arXiv:2410.05182

#8563

Grid-Attention: Enhancing Computational Efficiency of Large Vision Models without Fine-Tuning

Pengyu Li, Biao Wang, Tianchu Guo et al.

ECCV 2024

#8564

Enhanced Motion Forecasting with Visual Relation Reasoning

Sungjune Kim, Hadam Baek, Seunggwan Lee et al.

ECCV 2024

#8565

DSA: Discriminative Scatter Analysis for Early Smoke Segmentation

Lujian Yao, Haitao Zhao, Jingchao Peng et al.

ECCV 2024

#8566

DualBEV: Unifying Dual View Transformation with Probabilistic Correspondences

Peidong Li, Wancheng Shen, Qihao Huang et al.

ECCV 2024arXiv:2403.05402

#8567

Continuous SO(3) Equivariant Convolution for 3D Point Cloud Analysis

Jaein Kim, HEE BIN YOO, Dong-Sig Han et al.

ECCV 2024

#8568

MedRAT: Unpaired Medical Report Generation via Auxiliary Tasks

Elad Hirsch, Gefen Dawidowicz, Ayellet Tal

ECCV 2024arXiv:2407.03919

#8569

Towards Unified Representation of Invariant-Specific Features in Missing Modality Face Anti-Spoofing

Guanghao Zheng, Yuchen Liu, Wenrui Dai et al.

ECCV 2024

#8570

OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web

Raghav Kapoor, Yash Parag Butala, Melisa A Russak et al.

ECCV 2024arXiv:2402.17553

#8571

Self-Supervised Underwater Caustics Removal and Descattering via Deep Monocular SLAM

Jonathan Sauder, Devis TUIA

ECCV 2024

#8572

SCAPE: A Simple and Strong Category-Agnostic Pose Estimator

Yujia Liang, Zixuan Ye, Wenze Liu et al.

ECCV 2024arXiv:2407.13483

#8573

Image-to-Lidar Relational Distillation for Autonomous Driving Data

Anas Mahmoud, Ali Harakeh, Steven Waslander

ECCV 2024arXiv:2409.00845

#8574

IGNORE: Information Gap-based False Negative Loss Rejection for Single Positive Multi-Label Learning

Gyeong Ryeol Song, Noo-ri Kim, Jin-Seop Lee et al.

ECCV 2024

#8575

CoLA: Conditional Dropout and Language-driven Robust Dual-modal Salient Object Detection

Shuang Hao, Chunlin Zhong, He Tang

ECCV 2024arXiv:2407.06780

#8576

Siamese Vision Transformers are Scalable Audio-visual Learners

Yan-Bo Lin, Gedas Bertasius

ECCV 2024arXiv:2403.19638

#8577

Visual Relationship Transformation

Xiaoyu Xu, Jiayan Qiu, Baosheng Yu et al.

ECCV 2024

#8578

Scene-aware Human Motion Forecasting via Mutual Distance Prediction

Chaoyue Xing, Wei Mao, Miaomiao LIU

ECCV 2024arXiv:2310.00615

#8579

Elysium: Exploring Object-level Perception in Videos through Semantic Integration Using MLLMs

Han Wang, Yanjie Wang, Ye Yongjie et al.

ECCV 2024

#8580

Rethinking Data Bias: Dataset Copyright Protection via Embedding Class-wise Hidden Bias

Jinhyeok Jang, ByungOk Han, Jaehong Kim et al.

ECCV 2024

#8581

Federated Learning with Local Openset Noisy Labels

Zonglin Di, Zhaowei Zhu, Xiaoxiao Li et al.

ECCV 2024

#8582

Match-Stereo-Videos: Bidirectional Alignment for Consistent Dynamic Stereo Matching

Junpeng Jing, Ye Mao, Krystian Mikolajczyk

ECCV 2024arXiv:2403.10755

#8583

PoseSOR: Human Pose Can Guide Our Attention

Huankang Guan, Rynson W.H. Lau

ECCV 2024

#8584

SpeedUpNet: A Plug-and-Play Adapter Network for Accelerating Text-to-Image Diffusion Models

Weilong Chai, Dandan Zheng, Jiajiong Cao et al.

ECCV 2024arXiv:2312.08887

#8585

Rethinking Tree-Ring Watermarking for Enhanced Multi-Key Identification

Hai Ci, Pei Yang, Yiren Song et al.

ECCV 2024arXiv:2404.14055

#8586

Dual-stage Hyperspectral Image Classification Model with Spectral Supertoken

Peifu Liu, Tingfa Xu, Jie Wang et al.

ECCV 2024arXiv:2407.07307

#8587

Optimal Transport of Diverse Unsupervised Tasks for Robust Learning from Noisy Few-Shot Data

Xiaofan Que, Qi Yu

ECCV 2024

#8588

LITA: Language Instructed Temporal-Localization Assistant

De-An Huang, Shijia Liao, Subhashree Radhakrishnan et al.

ECCV 2024arXiv:2403.19046

#8589

BurstM: Deep Burst Multi-scale SR using Fourier Space with Optical Flow

EungGu Kang, Byeonghun Lee, Sunghoon Im et al.

ECCV 2024arXiv:2409.15384

#8590

Unsupervised Dense Prediction using Differentiable Normalized Cuts

Yanbin Liu, Stephen Gould

ECCV 2024

#8591

uCAP: An Unsupervised Prompting Method for Vision-Language Models

A. Tuan Nguyen, Kai Sheng Tai, Bor-Chun Chen et al.

ECCV 2024

#8592

Flexible Distribution Alignment: Towards Long-tailed Semi-supervised Learning with Proper Calibration

Emanuel Sanchez Aimar, Nathaniel D Helgesen, Yonghao Xu et al.

ECCV 2024arXiv:2306.04621

#8593

Efficient Frequency-Domain Image Deraining with Contrastive Regularization

Ning Gao, xingyu jiang, Xiuhui Zhang et al.

ECCV 2024

#8594

Deep Cost Ray Fusion for Sparse Depth Video Completion

Jungeon Kim, Soongjin Kim, Jaesik Park et al.

ECCV 2024arXiv:2409.14935

#8595

SSL-Cleanse: Trojan Detection and Mitigation in Self-Supervised Learning

Mengxin Zheng, Jiaqi Xue, Zihao Wang et al.

ECCV 2024arXiv:2303.09079

#8596

Norma: A Noise Robust Memory-Augmented Framework for Whole Slide Image Classification

Yu Bai, Bo Zhang, Zheng Zhang et al.

ECCV 2024

#8597

Adaptive High-Frequency Transformer for Diverse Wildlife Re-Identification

Chenyue Li, Shuoyi Chen, Mang Ye

ECCV 2024arXiv:2410.06977

#8598

An accurate detection is not all you need to combat label noise in web-noisy datasets

Paul Albert, Kevin McGuinness, Eric Arazo et al.

ECCV 2024arXiv:2407.05528

#8599

Gated Temporal Diffusion for Stochastic Long-term Dense Anticipation

Olga Zatsarynna, Emad Bahrami, Yazan Abu Farha et al.

ECCV 2024arXiv:2407.11954

#8600

Causal Subgraphs and Information Bottlenecks: Redefining OOD Robustness in Graph Neural Networks

Weizhi An, Wenliang Zhong, Feng Jiang et al.

ECCV 2024

← Previous

1...41 42 43 44 45...62