Most Cited 2024 "conceptual generalization" Papers

12,324 papers found • Page 43 of 62

#8401

A Multimodal Benchmark Dataset and Model for Crop Disease Diagnosis

Xiang Liu, Zhaoxiang Liu, Huan Hu et al.

ECCV 2024arXiv:2503.06973
#8402

Missing Modality Prediction for Unpaired Multimodal Learning via Joint Embedding of Unimodal Models

Taesup Kim, Donggeun Kim

ECCV 2024arXiv:2407.12616
#8403

Fast Diffusion-Based Counterfactuals for Shortcut Removal and Generation

Nina Weng, Paraskevas Pegios, Eike Petersen et al.

ECCV 2024arXiv:2312.14223
#8404

GroCo: Ground Constraint for Metric Self-Supervised Monocular Depth

Aurélien Cecille, Stefan Duffner, Franck DAVOINE et al.

ECCV 2024arXiv:2409.14850
#8405

EMIE-MAP: Large-Scale Road Surface Reconstruction Based on Explicit Mesh and Implicit Encoding

Wenhua Wu, Qi Wang, Guangming Wang et al.

ECCV 2024arXiv:2403.11789
#8406

HyperSpaceX: Radial and Angular Exploration of HyperSpherical Dimensions

Chiranjeev Chiranjeev, Muskan Dosi, Kartik Thakral et al.

ECCV 2024arXiv:2408.02494
#8407

Common Sense Reasoning for Deep Fake Detection

Yue Zhang, Ben Colman, Xiao Guo et al.

ECCV 2024
#8408

Tight and Efficient Upper Bound on Spectral Norm of Convolutional Layers

Ekaterina Grishina, Mikhail Gorbunov, Maxim Rakhuba

ECCV 2024arXiv:2409.11859
#8409

Deciphering the Role of Representation Disentanglement: Investigating Compositional Generalization in CLIP Models

Reza Abbasi, Mohammad Rohban, Mahdieh Soleymani Baghshah

ECCV 2024arXiv:2407.05897
#8410

ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling

Siming Yan, Min Bai, Weifeng Chen et al.

ECCV 2024arXiv:2402.06118
#8411

Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training Strategy

Tao Li, Weisen Jiang, Fanghui Liu et al.

ECCV 2024arXiv:2407.03641
#8412

Deep Companion Learning: Enhancing Generalization Through Historical Consistency

Ruizhao Zhu, Venkatesh Saligrama

ECCV 2024arXiv:2407.18821
#8413

Straightforward Layer-wise Pruning for More Efficient Visual Adaptation

Ruizi Han, Jinglei Tang

ECCV 2024arXiv:2407.14330
#8414

ABC Easy as 123: A Blind Counter for Exemplar-Free Multi-Class Class-agnostic Counting

Michael A Hobley, Victor Adrian Prisacariu

ECCV 2024arXiv:2309.04820
#8415

CrossScore: A Multi-View Approach to Image Evaluation and Scoring

Zirui Wang, Wenjing Bian, Victor Adrian Prisacariu

ECCV 2024
#8416

CPM: Class-conditional Prompting Machine for Audio-visual Segmentation

Yuanhong Chen, Chong Wang, Yuyuan Liu et al.

ECCV 2024arXiv:2407.05358
#8417

DiffClass: Diffusion-Based Class Incremental Learning

Zichong Meng, Jie Zhang, Changdi Yang et al.

ECCV 2024arXiv:2403.05016
#8418

Dual-Decoupling Learning and Metric-Adaptive Thresholding for Semi-Supervised Multi-Label Learning

Jiahao Xiao, Ming-Kun Xie, Heng-Bo Fan et al.

ECCV 2024arXiv:2407.18624
#8419

SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

Lingchen Meng, Shiyi Lan, Hengduo Li et al.

ECCV 2024arXiv:2311.14671
#8420

DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing

Minghao Chen, Iro Laina, Andrea Vedaldi

ECCV 2024arXiv:2404.18929
#8421

Handling The Non-Smooth Challenge in Tensor SVD: A Multi-Objective Tensor Recovery Framework

Jingjing Zheng, Wanglong Lu, Wenzhe Wang et al.

ECCV 2024arXiv:2311.13958
#8422

3D Gaussian Parametric Head Model

Yuelang Xu, Lizhen Wang, Zerong Zheng et al.

ECCV 2024
#8423

Dynamic Neural Radiance Field From Defocused Monocular Video

Xianrui Luo, Huiqiang Sun, Juewen Peng et al.

ECCV 2024arXiv:2407.05586
#8424

4Diff: 3D-Aware Diffusion Model for Third-to-First Viewpoint Translation

Feng Cheng, Mi Luo, Huiyu Wang et al.

ECCV 2024
#8425

Realistic Human Motion Generation with Cross-Diffusion Models

Zeping Ren, Shaoli Huang, Xiu Li

ECCV 2024arXiv:2312.10993
#8426

UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model

Xiangyu Fan, Jiaqi Li, Zhiqian Lin et al.

ECCV 2024arXiv:2408.00762
#8427

PartCraft: Crafting Creative Objects by Parts

Kam Woh Ng, Xiatian Zhu, Yi-Zhe Song et al.

ECCV 2024arXiv:2407.04604
#8428

AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation

Sun Yanan, Yanchen Liu, Yinhao Tang et al.

ECCV 2024arXiv:2406.18958
#8429

MERLiN: Single-Shot Material Estimation and Relighting for Photometric Stereo

Ashish Tiwari, Satoshi Ikehata, Shanmuganathan Raman

ECCV 2024arXiv:2409.00674
#8430

Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design

Li, zhihao shu, Jie Ji et al.

ECCV 2024arXiv:2407.02813
#8431

BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation

Hee Suk Yoon, Eunseop Yoon, Joshua Tian Jin Tee et al.

ECCV 2024arXiv:2408.05926
#8432

PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation

Renjie Lu, Jing-Ke Meng, WEISHI ZHENG

ECCV 2024arXiv:2407.11487
#8433

Rethinking Few-shot Class-incremental Learning: Learning from Yourself

Yu-Ming Tang, Yi-Xing Peng, Jing-Ke Meng et al.

ECCV 2024arXiv:2407.07468
#8434

Long-CLIP: Unlocking the Long-Text Capability of CLIP

Beichen Zhang, Pan Zhang, Xiaoyi Dong et al.

ECCV 2024arXiv:2403.15378
#8435

RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF

Sibi Catley-Chandar, Richard Shaw, Greg Slabaugh et al.

ECCV 2024arXiv:2403.11909
#8436

FuseTeacher: Modality-fused Encoders are Strong Vision Supervisors

Chen-Wei Xie, Siyang Sun, Liming Zhao et al.

ECCV 2024
#8437

MVDD: Multi-View Depth Diffusion Models

Zhen Wang, Qiangeng Xu, Feitong Tan et al.

ECCV 2024arXiv:2312.04875
#8438

Learning with Counterfactual Explanations for Radiology Report Generation

Mingjie Li, Haokun Lin, Liang Qiu et al.

ECCV 2024arXiv:2407.14474
#8439

Pseudo-Embedding for Generalized Few-Shot Point Cloud Segmentation

Chih-Jung Tsai, Hwann-Tzong Chen, Tyng-Luh Liu

ECCV 2024
#8440

Wavelet Convolutions for Large Receptive Fields

Shahaf Finder, Roy Amoyal, Eran Treister et al.

ECCV 2024arXiv:2407.05848
#8441

AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer

Zhuguanyu Wu, Jiaxin Chen, Hanwen Zhong et al.

ECCV 2024arXiv:2407.12951
#8442

Gradient-based Out-of-Distribution Detection

Taha Entesari, Sina Sharifi, Bardia Safaei et al.

ECCV 2024
#8443

Veil Privacy on Visual Data: Concealing Privacy for Humans, Unveiling for DNNs

Shuchao Pang, Ruhao Ma, Bing Li et al.

ECCV 2024
#8444

Simple Unsupervised Knowledge Distillation With Space Similarity

Aditya Singh, Haohan Wang

ECCV 2024arXiv:2409.13939
#8445

Learning Natural Consistency Representation for Face Forgery Video Detection

Daichi Zhang, Zihao Xiao, Shikun Li et al.

ECCV 2024arXiv:2407.10550
#8446

View-Consistent 3D Editing with Gaussian Splatting

Yuxuan Wang, Xuanyu Yi, Zike Wu et al.

ECCV 2024arXiv:2403.11868
#8447

HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes

Zhuopeng Li, Yilin Zhang, Chenming Wu et al.

ECCV 2024arXiv:2403.20032
#8448

Generating Human Interaction Motions in Scenes with Text Control

Hongwei Yi, Justus Thies, Michael J. Black et al.

ECCV 2024arXiv:2404.10685
#8449

Optimizing Illuminant Estimation in Dual-Exposure HDR Imaging

Mahmoud Afifi, Zhenhua Hu, Liang Liang

ECCV 2024arXiv:2403.02449
#8450

Instruction Tuning-free Visual Token Complement for Multimodal LLMs

Dongsheng Wang, Jiequan Cui, Miaoge Li et al.

ECCV 2024arXiv:2408.05019
#8451

Improving Point-based Crowd Counting and Localization Based on Auxiliary Point Guidance

I-HSIANG CHEN, Wei-Ting Chen, Yu-Wei Liu et al.

ECCV 2024arXiv:2405.10589
#8452

Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts

Shuangkang Fang, Yufeng Wang, Yi-Hsuan Tsai et al.

ECCV 2024arXiv:2407.06842
#8453

HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation

Shanyan Guan, Yanhao Ge, Ying Tai et al.

ECCV 2024arXiv:2410.08192
#8454

SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer

Zijie Wu, Chaohui Yu, Yanqin Jiang et al.

ECCV 2024arXiv:2404.03736
#8455

Revisit Self-supervision with Local Structure-from-Motion

Shengjie Zhu, Xiaoming Liu

ECCV 2024
#8456

On the Viability of Monocular Depth Pre-training for Semantic Segmentation

DONG LAO, Fengyu Yang, Daniel Wang et al.

ECCV 2024arXiv:2203.13987
#8457

Weakly-supervised Camera Localization by Ground-to-satellite Image Registration

Yujiao Shi, HONGDONG LI, Akhil Perincherry et al.

ECCV 2024arXiv:2409.06471
#8458

GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection

Ziying Song, Lei Yang, Shaoqing Xu et al.

ECCV 2024arXiv:2403.11848
#8459

ProtoComp: Diverse Point Cloud Completion with Controllable Prototype

Xumin Yu, Yanbo Wang, Jie Zhou et al.

ECCV 2024
#8460

Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture

Xuanchen Li, Yuhao Cheng, Xingyu Ren et al.

ECCV 2024arXiv:2406.00440
#8461

EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis

Shuai Tan, Bin Ji, Mengxiao Bi et al.

ECCV 2024arXiv:2404.01647
#8462

Put Myself in Your Shoes: Lifting the Egocentric Perspective from Exocentric Videos

Mi Luo, Zihui Xue, Alex Dimakis et al.

ECCV 2024arXiv:2403.06351
#8463

LivePhoto: Real Image Animation with Text-guided Motion Control

Xi Chen, Zhiheng Liu, Mengting Chen et al.

ECCV 2024arXiv:2312.02928
#8464

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion

Wendi Zheng, Jiayan Teng, Zhuoyi Yang et al.

ECCV 2024arXiv:2403.05121
#8465

OAPT: Offset-Aware Partition Transformer for Double JPEG Artifacts Removal

Qiao Mo, Yukang Ding, Jinhua Hao et al.

ECCV 2024arXiv:2408.11480
#8466

Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration

shihao zhou, Jinshan Pan, Jinglei Shi et al.

ECCV 2024arXiv:2404.00288
#8467

Animate Your Motion: Turning Still Images into Dynamic Videos

Mingxiao Li, Bo Wan, Marie-Francine Moens et al.

ECCV 2024arXiv:2403.10179
#8468

Spatial-Temporal Multi-level Association for Video Object Segmentation

Deshui Miao, Xin Li, Zhenyu He et al.

ECCV 2024arXiv:2404.06265
#8469

Context-Aware Action Recognition: Introducing a Comprehensive Dataset for Behavior Contrast

Tatsuya Sasaki, Yoshiki Ito, Satoshi Kondo

ECCV 2024
#8470

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Jinbo Xing, Menghan Xia, Yong Zhang et al.

ECCV 2024arXiv:2310.12190
#8471

UniProcessor: A Text-induced Unified Low-level Image Processor

Huiyu Duan, Xiongkuo Min, Sijing Wu et al.

ECCV 2024arXiv:2407.20928
#8472

Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors

Tongkun Guan, Wei Shen, Xue Yang et al.

ECCV 2024arXiv:2312.05286
#8473

Learning Chain of Counterfactual Thought for Bias-Robust Vision-Language Reasoning

Yifeng Zhang, Ming Jiang, Qi Zhao

ECCV 2024
#8474

Let the Avatar Talk using Texts without Paired Training Data

Xiuzhe Wu, Yang-Tian Sun, Handi Chen et al.

ECCV 2024
#8475

Attention Beats Linear for Fast Implicit Neural Representation Generation

Shuyi Zhang, Ke Liu, Jingjun Gu et al.

ECCV 2024arXiv:2407.15355
#8476

Prompt-Based Test-Time Real Image Dehazing: A Novel Pipeline

Zixuan Chen, Zewei He, Ziqian Lu et al.

ECCV 2024arXiv:2309.17389
#8477

RCS-Prompt: Learning Prompt to Rearrange Class Space for Prompt-based Continual Learning

Longrong Yang, Hanbin Zhao, Yunlong Yu et al.

ECCV 2024
#8478

Dynamic Guidance Adversarial Distillation with Enhanced Teacher Knowledge

Hyejin Park, Dongbo Min

ECCV 2024arXiv:2409.01627
#8479

Gaussian Grouping: Segment and Edit Anything in 3D Scenes

Mingqiao Ye, Martin Danelljan, Fisher Yu et al.

ECCV 2024arXiv:2312.00732
#8480

3D Hand Sequence Recovery from Real Blurry Images and Event Stream

Joonkyu Park, Gyeongsik Moon, Weipeng Xu et al.

ECCV 2024
#8481

Segmentation-guided Layer-wise Image Vectorization with Gradient Fills

Hengyu Zhou, Hui Zhang, Bin Wang

ECCV 2024arXiv:2408.15741
#8482

HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting

Zhenglin Zhou, Fan Ma, Hehe Fan et al.

ECCV 2024arXiv:2402.06149
#8483

Explicitly Guided Information Interaction Network for Cross-modal Point Cloud Completion

Hang Xu, Chen Long, Wenxiao Zhang et al.

ECCV 2024arXiv:2407.02887
#8484

StructLDM: Structured Latent Diffusion for 3D Human Generation

Tao Hu, Fangzhou Hong, Ziwei Liu

ECCV 2024arXiv:2404.01241
#8485

High-Fidelity Modeling of Generalizable Wrinkle Deformation

Jingfan Guo, Jae Shin Yoon, Shunsuke Saito et al.

ECCV 2024
#8486

COMPOSE: Comprehensive Portrait Shadow Editing

Andrew Hou, Zhixin Shu, Xuaner Zhang et al.

ECCV 2024arXiv:2408.13922
#8487

EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion

Guangyao Zhai, Evin Pınar Örnek, Dave Zhenyu Chen et al.

ECCV 2024arXiv:2405.00915
#8488

Learning Representations from Foundation Models for Domain Generalized Stereo Matching

Yongjian Zhang, Longguang Wang, Kunhong Li et al.

ECCV 2024
#8489

NeRF-XL: NeRF at Any Scale with Multi-GPU

Ruilong Li, Sanja Fidler, Angjoo Kanazawa et al.

ECCV 2024
#8490

3D Hand Pose Estimation in Everyday Egocentric Images

Aditya Prakash, Ruisen Tu, Matthew Chang et al.

ECCV 2024arXiv:2312.06583
#8491

Controllable Human-Object Interaction Synthesis

Jiaman Li, Alexander Clegg, Roozbeh Mottaghi et al.

ECCV 2024arXiv:2312.03913
#8492

Nymeria: A Massive Collection of Egocentric Multi-modal Human Motion in the Wild

Lingni Ma, Yuting Ye, Rowan Postyeni et al.

ECCV 2024
#8493

Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene

Ruiyang Zhang, Hu Zhang, Hang Yu et al.

ECCV 2024arXiv:2407.08569
#8494

Six-Point Method for Multi-Camera Systems with Reduced Solution Space

Banglei Guan, Ji Zhao, Laurent Kneip

ECCV 2024arXiv:2402.18066
#8495

Tuning-Free Image Customization with Image and Text Guidance

Pengzhi Li, Qiang Nie, Ying Chen et al.

ECCV 2024arXiv:2403.12658
#8496

MegaScenes: Scene-Level View Synthesis at Scale

Joseph Tung, Gene Chou, Ruojin Cai et al.

ECCV 2024arXiv:2406.11819
#8497

Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation

Jinfeng Liu, Lingtong Kong, Bo Li et al.

ECCV 2024arXiv:2407.14126
#8498

Idea2Img: Iterative Self-Refinement with GPT-4V for Automatic Image Design and Generation

Zhengyuan Yang, Jianfeng Wang, Linjie Li et al.

ECCV 2024
#8499

Preventing Catastrophic Forgetting through Memory Networks in Continuous Detection

Gaurav Bhatt, Leonid Sigal, James Ross

ECCV 2024arXiv:2403.14797
#8500

COIN-Matting: Confounder Intervention for Image Matting

Zhaohe Liao, Jiangtong Li, Jun Lan et al.

ECCV 2024
#8501

Score Distillation Sampling with Learned Manifold Corrective

Thiemo Alldieck, Nikos Kolotouros, Cristian Sminchisescu

ECCV 2024arXiv:2401.05293
#8502

Rethinking Weakly-supervised Video Temporal Grounding From a Game Perspective

Xiang Fang, Zeyu Xiong, Wanlong Fang et al.

ECCV 2024
#8503

AdaDiffSR: Adaptive Region-aware Dynamic acceleration Diffusion Model for Real-World Image Super-Resolution

Yuanting Fan, Chengxu Liu, Nengzhong Yin et al.

ECCV 2024arXiv:2410.17752
#8504

Domesticating SAM for Breast Ultrasound Image Segmentation via Spatial-frequency Fusion and Uncertainty Correction

Wanting Zhang, Huisi Wu, Jing Qin

ECCV 2024
#8505

Interaction-centric Spatio-Temporal Context Reasoning for Multi-Person Video HOI Recognition

Yisong Wang, Nan Xi, Jingjing Meng et al.

ECCV 2024
#8506

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Yanwei Li, Chengyao Wang, Jiaya Jia

ECCV 2024arXiv:2311.17043
#8507

Agent3D-Zero: An Agent for Zero-shot 3D Understanding

Sha Zhang, Di Huang, Jiajun Deng et al.

ECCV 2024arXiv:2403.11835
#8508

Structured-NeRF: Hierarchical Scene Graph with Neural Representation

Zhide Zhong, Jiakai Cao, songen gu et al.

ECCV 2024
#8509

APL: Anchor-based Prompt Learning for One-stage Weakly Supervised Referring Expression Comprehension

Yaxin Luo, Jiayi Ji, Xiaofu Chen et al.

ECCV 2024
#8510

DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency

Xiaojing Zhong, Xinyi Huang, Xiaofeng Yang et al.

ECCV 2024arXiv:2408.07481
#8511

MeshFeat: Multi-Resolution Features for Neural Fields on Meshes

Mihir Mahajan, Florian Hofherr, Daniel Cremers

ECCV 2024arXiv:2407.13592
#8512

TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias

Sanghyun Jo, Soohyun Ryu, Sungyub Kim et al.

ECCV 2024arXiv:2404.00384
#8513

DragAPart: Learning a Part-Level Motion Prior for Articulated Objects

Ruining Li, Chuanxia Zheng, Christian Rupprecht et al.

ECCV 2024arXiv:2403.15382
#8514

Learning to Unlearn for Robust Machine Unlearning

Mark HUANG, Lin Geng Foo, Jun Liu

ECCV 2024arXiv:2407.10494
#8515

Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers

Zhengbo Zhang, Li Xu, Duo Peng et al.

ECCV 2024arXiv:2407.08394
#8516

Echoes of the Past: Boosting Long-tail Recognition via Reflective Learning

Qihao Zhao, YALUN DAI, Shen Lin et al.

ECCV 2024
#8517

Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models

Saman Motamed, Danda Pani Paudel, Luc Van Gool

ECCV 2024arXiv:2311.13833
#8518

Taming CLIP for Fine-grained and Structured Visual Understanding of Museum Exhibits

Ada-Astrid Balauca, Danda Paudel, Kristina Toutanova et al.

ECCV 2024arXiv:2409.01690
#8519

Visual Text Generation in the Wild

Yuanzhi Zhu, Jiawei Liu, Feiyu Gao et al.

ECCV 2024arXiv:2407.14138
#8520

A Unified Image Compression Method for Human Perception and Multiple Vision Tasks

Sha Guo, Sui Lin, Chen-Lin Zhang et al.

ECCV 2024
#8521

Learning Quantized Adaptive Conditions for Diffusion Models

Yuchen Liang, Yuchuan Tian, Lei Yu et al.

ECCV 2024arXiv:2409.17487
#8522

Learn to Optimize Denoising Scores: A Unified and Improved Diffusion Prior for 3D Generation

Xiaofeng Yang, Yiwen Chen, Cheng Chen et al.

ECCV 2024
#8523

Discovering Unwritten Visual Classifiers with Large Language Models

Mia Chiquier, Utkarsh Mall, Carl Vondrick

ECCV 2024
#8524

Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach

Taolin Zhang, Jiawang Bai, Zhihe Lu et al.

ECCV 2024arXiv:2407.06964
#8525

On the Approximation Risk of Few-Shot Class-Incremental Learning

Xuan Wang, Zhong Ji, Xiyao Liu et al.

ECCV 2024
#8526

In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation

Dahyun Kang, Minsu Cho

ECCV 2024arXiv:2408.04961
#8527

URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields

Bo Xu, Liu Ziao, Mengqi GUO et al.

ECCV 2024arXiv:2403.10119
#8528

Hierarchically Structured Neural Bones for Reconstructing Animatable Objects from Casual Videos

Subin Jeon, In Cho, Minsu Kim et al.

ECCV 2024arXiv:2408.00351
#8529

Protecting NeRFs' Copyright via Plug-And-Play Watermarking Base Model

Qi Song, Ziyuan Luo, Ka Chun Cheung et al.

ECCV 2024arXiv:2407.07735
#8530

MOD-UV: Learning Mobile Object Detectors from Unlabeled Videos

Yihong Sun, Bharath Hariharan

ECCV 2024arXiv:2405.14841
#8531

V-Trans4Style: Visual Transition Recommendation for Video Production Style Adaptation

Pooja Guhan, Tsung-Wei Huang, Guan-Ming Su et al.

ECCV 2024arXiv:2501.07983
#8532

WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation

Jiachen Lu, Ze Huang, Zeyu Yang et al.

ECCV 2024arXiv:2312.02934
#8533

Uncertainty-Driven Spectral Compressive Imaging with Spatial-Frequency Transformer

Lintao Peng, Siyu Xie, Liheng Bian

ECCV 2024
#8534

Weakly-Supervised Spatio-Temporal Video Grounding with Variational Cross-Modal Alignment

Yang Jin, Yadong Mu

ECCV 2024
#8535

Chronologically Accurate Retrieval for Temporal Grounding of Motion-Language Models

Kent Fujiwara, Mikihiro Tanaka, Qing Yu

ECCV 2024arXiv:2407.15408
#8536

SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow

Yuanzhi Zhu, Xingchao Liu, Qiang Liu

ECCV 2024arXiv:2407.12718
#8537

Domain Reduction Strategy for Non-Line-of-Sight Imaging

Hyunbo Shim, In Cho, Daekyu Kwon et al.

ECCV 2024arXiv:2308.10269
#8538

Learning to Enhance Aperture Phasor Field for Non-Line-of-Sight Imaging

In Cho, Hyunbo Shim, Seon Joo Kim

ECCV 2024arXiv:2407.18574
#8539

FRI-Net: Floorplan Reconstruction via Room-wise Implicit Representation

Honghao Xu, Juzhan Xu, Zeyu Huang et al.

ECCV 2024arXiv:2407.10687
#8540

A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation

Riccardo Fogliato, Pratik Patil, Mathew Monfort et al.

ECCV 2024arXiv:2406.07320
#8541

DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation

Rakshith Subramanyam, Kowshik Thopalli, Vivek Sivaraman Narayanaswamy et al.

ECCV 2024arXiv:2408.00331
#8542

ExMatch: Self-guided Exploitation for Semi-Supervised Learning with Scarce Labeled Samples

Noo-ri Kim, Jin-Seop Lee, Jee-Hyong LEE

ECCV 2024
#8543

CaesarNeRF: Calibrated Semantic Representation for Few-Shot Generalizable Neural Rendering

Haidong Zhu, Tianyu Ding, Tianyi Chen et al.

ECCV 2024arXiv:2311.15510
#8544

Open-Vocabulary RGB-Thermal Semantic Segmentation

Guoqiang Zhao, JunJie Huang, Xiaoyun Yan et al.

ECCV 2024
#8545

UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening

Siyuan Cheng, Guangyu Shen, Kaiyuan Zhang et al.

ECCV 2024arXiv:2407.11372
#8546

Unsupervised Moving Object Segmentation with Atmospheric Turbulence

Dehao Qin, Ripon Saha, Woojeh Chung et al.

ECCV 2024
#8547

Modeling Label Correlations with Latent Context for Multi-Label Recognition

Zhao-Min Chen, Quan Cui, Ruoxi Deng et al.

ECCV 2024
#8548

Towards Reliable Advertising Image Generation Using Human Feedback

Zhenbang Du, Wei Feng, Haohan Wang et al.

ECCV 2024arXiv:2408.00418
#8549

Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection

Kwanyong Park, Kuniaki Saito, Donghyun Kim

ECCV 2024arXiv:2407.15296
#8550

TurboEdit: Real-time text-based disentangled real image editing

Zongze Wu, Nicholas I Kolkin, Jonathan Brandt et al.

ECCV 2024
#8551

The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers

Seungwoo Son, Jegwang Ryu, Namhoon Lee et al.

ECCV 2024arXiv:2302.10494
#8552

Improving Vision and Language Concepts Understanding with Multimodal Counterfactual Samples

Chengen Lai, Shengli Song, Sitong Yan et al.

ECCV 2024
#8553

Functional Transform-Based Low-Rank Tensor Factorization for Multi-Dimensional Data Recovery

Jian-Li Wang, Xi-Le Zhao

ECCV 2024
#8554

Clean & Compact: Efficient Data-Free Backdoor Defense with Model Compactness

Huy Phan, Jinqi Xiao, Yang Sui et al.

ECCV 2024
#8555

Language-Assisted Skeleton Action Understanding for Skeleton-Based Temporal Action Segmentation

Haoyu Ji, Bowen Chen, Xinglong Xu et al.

ECCV 2024
#8556

A Geometric Distortion Immunized Deep Watermarking Framework with Robustness Generalizability

Linfeng Ma, Han Fang, Tianyi Wei et al.

ECCV 2024
#8557

VideoAgent: Long-form Video Understanding with Large Language Model as Agent

Xiaohan Wang, Yuhui Zhang, Orr Zohar et al.

ECCV 2024arXiv:2403.10517
#8558

MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration

Yulin Ren, Xin Li, Bingchen Li et al.

ECCV 2024arXiv:2407.10833
#8559

Adaptive Human Trajectory Prediction via Latent Corridors

Neerja Thakkar, Karttikeya Mangalam, Andrea Bajcsy et al.

ECCV 2024arXiv:2312.06653
#8560

Generalizable Facial Expression Recognition

Yuhang Zhang, Xiuqi Zheng, Chenyi Liang et al.

ECCV 2024arXiv:2408.10614
#8561

RS-NeRF: Neural Radiance Fields from Rolling Shutter Images

Muyao Niu, Tong Chen, Yifan Zhan et al.

ECCV 2024arXiv:2407.10267
#8562

MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space Terrain

Timothy Chase, Karthik Dantu

ECCV 2024arXiv:2410.05182
#8563

Grid-Attention: Enhancing Computational Efficiency of Large Vision Models without Fine-Tuning

Pengyu Li, Biao Wang, Tianchu Guo et al.

ECCV 2024
#8564

Enhanced Motion Forecasting with Visual Relation Reasoning

Sungjune Kim, Hadam Baek, Seunggwan Lee et al.

ECCV 2024
#8565

DSA: Discriminative Scatter Analysis for Early Smoke Segmentation

Lujian Yao, Haitao Zhao, Jingchao Peng et al.

ECCV 2024
#8566

DualBEV: Unifying Dual View Transformation with Probabilistic Correspondences

Peidong Li, Wancheng Shen, Qihao Huang et al.

ECCV 2024arXiv:2403.05402
#8567

Continuous SO(3) Equivariant Convolution for 3D Point Cloud Analysis

Jaein Kim, HEE BIN YOO, Dong-Sig Han et al.

ECCV 2024
#8568

MedRAT: Unpaired Medical Report Generation via Auxiliary Tasks

Elad Hirsch, Gefen Dawidowicz, Ayellet Tal

ECCV 2024arXiv:2407.03919
#8569

Towards Unified Representation of Invariant-Specific Features in Missing Modality Face Anti-Spoofing

Guanghao Zheng, Yuchen Liu, Wenrui Dai et al.

ECCV 2024
#8570

OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web

Raghav Kapoor, Yash Parag Butala, Melisa A Russak et al.

ECCV 2024arXiv:2402.17553
#8571

Self-Supervised Underwater Caustics Removal and Descattering via Deep Monocular SLAM

Jonathan Sauder, Devis TUIA

ECCV 2024
#8572

SCAPE: A Simple and Strong Category-Agnostic Pose Estimator

Yujia Liang, Zixuan Ye, Wenze Liu et al.

ECCV 2024arXiv:2407.13483
#8573

Image-to-Lidar Relational Distillation for Autonomous Driving Data

Anas Mahmoud, Ali Harakeh, Steven Waslander

ECCV 2024arXiv:2409.00845
#8574

IGNORE: Information Gap-based False Negative Loss Rejection for Single Positive Multi-Label Learning

Gyeong Ryeol Song, Noo-ri Kim, Jin-Seop Lee et al.

ECCV 2024
#8575

CoLA: Conditional Dropout and Language-driven Robust Dual-modal Salient Object Detection

Shuang Hao, Chunlin Zhong, He Tang

ECCV 2024arXiv:2407.06780
#8576

Siamese Vision Transformers are Scalable Audio-visual Learners

Yan-Bo Lin, Gedas Bertasius

ECCV 2024arXiv:2403.19638
#8577

Visual Relationship Transformation

Xiaoyu Xu, Jiayan Qiu, Baosheng Yu et al.

ECCV 2024
#8578

Scene-aware Human Motion Forecasting via Mutual Distance Prediction

Chaoyue Xing, Wei Mao, Miaomiao LIU

ECCV 2024arXiv:2310.00615
#8579

Elysium: Exploring Object-level Perception in Videos through Semantic Integration Using MLLMs

Han Wang, Yanjie Wang, Ye Yongjie et al.

ECCV 2024
#8580

Rethinking Data Bias: Dataset Copyright Protection via Embedding Class-wise Hidden Bias

Jinhyeok Jang, ByungOk Han, Jaehong Kim et al.

ECCV 2024
#8581

Federated Learning with Local Openset Noisy Labels

Zonglin Di, Zhaowei Zhu, Xiaoxiao Li et al.

ECCV 2024
#8582

Match-Stereo-Videos: Bidirectional Alignment for Consistent Dynamic Stereo Matching

Junpeng Jing, Ye Mao, Krystian Mikolajczyk

ECCV 2024arXiv:2403.10755
#8583

PoseSOR: Human Pose Can Guide Our Attention

Huankang Guan, Rynson W.H. Lau

ECCV 2024
#8584

SpeedUpNet: A Plug-and-Play Adapter Network for Accelerating Text-to-Image Diffusion Models

Weilong Chai, Dandan Zheng, Jiajiong Cao et al.

ECCV 2024arXiv:2312.08887
#8585

Rethinking Tree-Ring Watermarking for Enhanced Multi-Key Identification

Hai Ci, Pei Yang, Yiren Song et al.

ECCV 2024arXiv:2404.14055
#8586

Dual-stage Hyperspectral Image Classification Model with Spectral Supertoken

Peifu Liu, Tingfa Xu, Jie Wang et al.

ECCV 2024arXiv:2407.07307
#8587

Optimal Transport of Diverse Unsupervised Tasks for Robust Learning from Noisy Few-Shot Data

Xiaofan Que, Qi Yu

ECCV 2024
#8588

LITA: Language Instructed Temporal-Localization Assistant

De-An Huang, Shijia Liao, Subhashree Radhakrishnan et al.

ECCV 2024arXiv:2403.19046
#8589

BurstM: Deep Burst Multi-scale SR using Fourier Space with Optical Flow

EungGu Kang, Byeonghun Lee, Sunghoon Im et al.

ECCV 2024arXiv:2409.15384
#8590

Unsupervised Dense Prediction using Differentiable Normalized Cuts

Yanbin Liu, Stephen Gould

ECCV 2024
#8591

uCAP: An Unsupervised Prompting Method for Vision-Language Models

A. Tuan Nguyen, Kai Sheng Tai, Bor-Chun Chen et al.

ECCV 2024
#8592

Flexible Distribution Alignment: Towards Long-tailed Semi-supervised Learning with Proper Calibration

Emanuel Sanchez Aimar, Nathaniel D Helgesen, Yonghao Xu et al.

ECCV 2024arXiv:2306.04621
#8593

Efficient Frequency-Domain Image Deraining with Contrastive Regularization

Ning Gao, xingyu jiang, Xiuhui Zhang et al.

ECCV 2024
#8594

Deep Cost Ray Fusion for Sparse Depth Video Completion

Jungeon Kim, Soongjin Kim, Jaesik Park et al.

ECCV 2024arXiv:2409.14935
#8595

SSL-Cleanse: Trojan Detection and Mitigation in Self-Supervised Learning

Mengxin Zheng, Jiaqi Xue, Zihao Wang et al.

ECCV 2024arXiv:2303.09079
#8596

Norma: A Noise Robust Memory-Augmented Framework for Whole Slide Image Classification

Yu Bai, Bo Zhang, Zheng Zhang et al.

ECCV 2024
#8597

Adaptive High-Frequency Transformer for Diverse Wildlife Re-Identification

Chenyue Li, Shuoyi Chen, Mang Ye

ECCV 2024arXiv:2410.06977
#8598

An accurate detection is not all you need to combat label noise in web-noisy datasets

Paul Albert, Kevin McGuinness, Eric Arazo et al.

ECCV 2024arXiv:2407.05528
#8599

Gated Temporal Diffusion for Stochastic Long-term Dense Anticipation

Olga Zatsarynna, Emad Bahrami, Yazan Abu Farha et al.

ECCV 2024arXiv:2407.11954
#8600

Causal Subgraphs and Information Bottlenecks: Redefining OOD Robustness in Graph Neural Networks

Weizhi An, Wenliang Zhong, Feng Jiang et al.

ECCV 2024