Most Cited ECCV "autonomous agent training" Papers

2,387 papers found • Page 11 of 12

Filters:Most Cited ECCV autonomous agent training Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#2001

Delving into Adversarial Robustness on Document Tampering Localization

Huiru Shao, Zhuang Qian, Kaizhu Huang et al.

ECCV 2024poster

#2002

SceneTeller: Language-to-3D Scene Generation

Basak Melis Ocal, Maxim Tatarchenko, Sezer Karaoglu et al.

ECCV 2024poster

#2003

MagMax: Leveraging Model Merging for Seamless Continual Learning

Daniel Marczak, Bartlomiej Twardowski, Tomasz Trzcinski et al.

ECCV 2024posterarXiv:2407.06322

#2004

Spline-based Transformers

Prashanth Chandran, Agon Serifi, Markus Gross et al.

ECCV 2024posterarXiv:2504.02797

#2005

Efficient NeRF Optimization - Not All Samples Remain Equally Hard

Juuso Korhonen, Goutham Rangu, Hamed Rezazadegan Tavakoli et al.

ECCV 2024posterarXiv:2408.03193

#2006

VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks

Xiangxiang Chu, Jianlin Su, Bo Zhang et al.

ECCV 2024posterarXiv:2403.00522

#2007

Missing Modality Prediction for Unpaired Multimodal Learning via Joint Embedding of Unimodal Models

Taesup Kim, Donggeun Kim

ECCV 2024posterarXiv:2407.12616

#2008

Fast Diffusion-Based Counterfactuals for Shortcut Removal and Generation

Nina Weng, Paraskevas Pegios, Eike Petersen et al.

ECCV 2024posterarXiv:2312.14223

#2009

Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention

Zuyao Chen, Jinlin Wu, Zhen Lei et al.

ECCV 2024posterarXiv:2311.10988

#2010

FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation

Xinzhi MU, Li Chen, Bohan CHEN et al.

ECCV 2024posterarXiv:2406.08392

#2011

GroCo: Ground Constraint for Metric Self-Supervised Monocular Depth

Aurélien Cecille, Stefan Duffner, Franck DAVOINE et al.

ECCV 2024posterarXiv:2409.14850

#2012

MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory Prediction

Seongju Lee, Junseok Lee, Yeonguk Yu et al.

ECCV 2024posterarXiv:2407.21635

#2013

Distributed Active Client Selection With Noisy Clients Using Model Association Scores

Kwang In Kim

ECCV 2024poster

#2014

Tight and Efficient Upper Bound on Spectral Norm of Convolutional Layers

Ekaterina Grishina, Mikhail Gorbunov, Maxim Rakhuba

ECCV 2024posterarXiv:2409.11859

#2015

Deciphering the Role of Representation Disentanglement: Investigating Compositional Generalization in CLIP Models

Reza Abbasi, Mohammad Rohban, Mahdieh Soleymani Baghshah

ECCV 2024posterarXiv:2407.05897

#2016

A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars

Ronglai Zuo, Fangyun Wei, Zenggui Chen et al.

ECCV 2024posterarXiv:2401.04730

#2017

Unveiling Privacy Risks in Stochastic Neural Networks Training: Effective Image Reconstruction from Gradients

Yiming Chen, Xiangyu Yang, Nikos Deligiannis

ECCV 2024poster

#2018

Towards compact reversible image representations for neural style transfer

Xiyao Liu, Siyu Yang, Jian Zhang et al.

ECCV 2024poster

#2019

SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision

Ankit Vani, Bac Nguyen, Samuel Lavoie et al.

ECCV 2024posterarXiv:2404.15721

#2020

Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training Strategy

Tao Li, Weisen Jiang, Fanghui Liu et al.

ECCV 2024posterarXiv:2407.03641

#2021

Straightforward Layer-wise Pruning for More Efficient Visual Adaptation

Ruizi Han, Jinglei Tang

ECCV 2024posterarXiv:2407.14330

#2022

Dual-Decoupling Learning and Metric-Adaptive Thresholding for Semi-Supervised Multi-Label Learning

Jiahao Xiao, Ming-Kun Xie, Heng-Bo Fan et al.

ECCV 2024posterarXiv:2407.18624

#2023

SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

Lingchen Meng, Shiyi Lan, Hengduo Li et al.

ECCV 2024posterarXiv:2311.14671

#2024

Handling The Non-Smooth Challenge in Tensor SVD: A Multi-Objective Tensor Recovery Framework

Jingjing Zheng, Wanglong Lu, Wenzhe Wang et al.

ECCV 2024posterarXiv:2311.13958

#2025

DoubleTake: Geometry Guided Depth Estimation

Mohamed Sayed, Filippo Aleotti, Jamie Watson et al.

ECCV 2024posterarXiv:2406.18387

#2026

Sur^2f: A Hybrid Representation for High-Quality and Efficient Surface Reconstruction from Multi-view Images

Zhangjin Huang, Zhihao Liang, Kui Jia

ECCV 2024poster

#2027

UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model

Xiangyu Fan, Jiaqi Li, Zhiqian Lin et al.

ECCV 2024posterarXiv:2408.00762

#2028

PartCraft: Crafting Creative Objects by Parts

Kam Woh Ng, Xiatian Zhu, Yi-Zhe Song et al.

ECCV 2024posterarXiv:2407.04604

#2029

Coarse-to-Fine Implicit Representation Learning for 3D Hand-Object Reconstruction from a Single RGB-D Image

Xingyu Liu, Pengfei Ren, Jingyu Wang et al.

ECCV 2024poster

#2030

AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation

Sun Yanan, Yanchen Liu, Yinhao Tang et al.

ECCV 2024posterarXiv:2406.18958

#2031

Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design

Li, zhihao shu, Jie Ji et al.

ECCV 2024posterarXiv:2407.02813

#2032

PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation

Renjie Lu, Jing-Ke Meng, WEISHI ZHENG

ECCV 2024posterarXiv:2407.11487

#2033

Long-CLIP: Unlocking the Long-Text Capability of CLIP

Beichen Zhang, Pan Zhang, Xiaoyi Dong et al.

ECCV 2024posterarXiv:2403.15378

#2034

Learning with Counterfactual Explanations for Radiology Report Generation

Mingjie Li, Haokun Lin, Liang Qiu et al.

ECCV 2024posterarXiv:2407.14474

#2035

Pseudo-Embedding for Generalized Few-Shot Point Cloud Segmentation

Chih-Jung Tsai, Hwann-Tzong Chen, Tyng-Luh Liu

ECCV 2024poster

#2036

AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer

Zhuguanyu Wu, Jiaxin Chen, Hanwen Zhong et al.

ECCV 2024posterarXiv:2407.12951

#2037

Oulu Remote-photoplethysmography Physical Domain Attacks Database (ORPDAD)

Marko Savic, Guoying Zhao

ECCV 2024poster

#2038

Optimizing Illuminant Estimation in Dual-Exposure HDR Imaging

Mahmoud Afifi, Zhenhua Hu, Liang Liang

ECCV 2024posterarXiv:2403.02449

#2039

Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts

Shuangkang Fang, Yufeng Wang, Yi-Hsuan Tsai et al.

ECCV 2024posterarXiv:2407.06842

#2040

Embodied Understanding of Driving Scenarios

Yunsong Zhou, Linyan Huang, Qingwen Bu et al.

ECCV 2024posterarXiv:2403.04593

#2041

HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation

Shanyan Guan, Yanhao Ge, Ying Tai et al.

ECCV 2024posterarXiv:2410.08192

#2042

On the Viability of Monocular Depth Pre-training for Semantic Segmentation

DONG LAO, Fengyu Yang, Daniel Wang et al.

ECCV 2024posterarXiv:2203.13987

#2043

Weakly-supervised Camera Localization by Ground-to-satellite Image Registration

Yujiao Shi, HONGDONG LI, Akhil Perincherry et al.

ECCV 2024posterarXiv:2409.06471

#2044

ProtoComp: Diverse Point Cloud Completion with Controllable Prototype

Xumin Yu, Yanbo Wang, Jie Zhou et al.

ECCV 2024poster

#2045

Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture

Xuanchen Li, Yuhao Cheng, Xingyu Ren et al.

ECCV 2024posterarXiv:2406.00440

#2046

SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models

Ziyi Lin, Dongyang Liu, Renrui Zhang et al.

ECCV 2024poster

#2047

Diffusion Models as Data Mining Tools

Ioannis Siglidis, Aleksander Holynski, Alexei Efros et al.

ECCV 2024posterarXiv:2408.02752

#2048

SCPNet: Unsupervised Cross-modal Homography Estimation via Intra-modal Self-supervised Learning

Runmin Zhang, Jun Ma, Lun Luo et al.

ECCV 2024posterarXiv:2407.08148

#2049

Leveraging Representations from Intermediate Encoder-blocks for Synthetic Image Detection

Christos Koutlis, Symeon Papadopoulos

ECCV 2024posterarXiv:2402.19091

#2050

Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration

shihao zhou, Jinshan Pan, Jinglei Shi et al.

ECCV 2024posterarXiv:2404.00288

#2051

Animate Your Motion: Turning Still Images into Dynamic Videos

Mingxiao Li, Bo Wan, Marie-Francine Moens et al.

ECCV 2024posterarXiv:2403.10179

#2052

Spatial-Temporal Multi-level Association for Video Object Segmentation

Deshui Miao, Xin Li, Zhenyu He et al.

ECCV 2024posterarXiv:2404.06265

#2053

Adaptive Multi-head Contrastive Learning

Lei Wang, Piotr Koniusz, Tom Gedeon et al.

ECCV 2024posterarXiv:2310.05615

#2054

UniProcessor: A Text-induced Unified Low-level Image Processor

Huiyu Duan, Xiongkuo Min, Sijing Wu et al.

ECCV 2024posterarXiv:2407.20928

#2055

GRACE: Graph-Based Contextual Debiasing for Fair Visual Question Answering

Yifeng Zhang, Ming Jiang, Qi Zhao

ECCV 2024poster

#2056

Learning Chain of Counterfactual Thought for Bias-Robust Vision-Language Reasoning

Yifeng Zhang, Ming Jiang, Qi Zhao

ECCV 2024poster

#2057

Generalizing to Unseen Domains via Text-guided Augmentation

Daiqing Qi, Handong Zhao, Aidong Zhang et al.

ECCV 2024poster

#2058

On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines

Selim Kuzucu, Kemal Oksuz, Jonathan Sadeghi et al.

ECCV 2024posterarXiv:2405.20459

#2059

IVTP: Instruction-guided Visual Token Pruning for Large Vision-Language Models

Kai Huang, Hao Zou, Ye Xi et al.

ECCV 2024poster

#2060

Unsqueeze [CLS] Bottleneck to Learn Rich Representations

Qing Su, Shihao Ji

ECCV 2024posterarXiv:2407.17671

#2061

RCS-Prompt: Learning Prompt to Rearrange Class Space for Prompt-based Continual Learning

Longrong Yang, Hanbin Zhao, Yunlong Yu et al.

ECCV 2024poster

#2062

Dynamic Guidance Adversarial Distillation with Enhanced Teacher Knowledge

Hyejin Park, Dongbo Min

ECCV 2024posterarXiv:2409.01627

#2063

LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models

Yabin Zhang, Wenjie Zhu, Chenhang He et al.

ECCV 2024posterarXiv:2407.08966

#2064

Gaussian Grouping: Segment and Edit Anything in 3D Scenes

Mingqiao Ye, Martin Danelljan, Fisher Yu et al.

ECCV 2024posterarXiv:2312.00732

#2065

StructLDM: Structured Latent Diffusion for 3D Human Generation

Tao Hu, Fangzhou Hong, Ziwei Liu

ECCV 2024posterarXiv:2404.01241

#2066

High-Fidelity Modeling of Generalizable Wrinkle Deformation

Jingfan Guo, Jae Shin Yoon, Shunsuke Saito et al.

ECCV 2024poster

#2067

EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion

Guangyao Zhai, Evin Pınar Örnek, Dave Zhenyu Chen et al.

ECCV 2024posterarXiv:2405.00915

#2068

NeRF-XL: NeRF at Any Scale with Multi-GPU

Ruilong Li, Sanja Fidler, Angjoo Kanazawa et al.

ECCV 2024poster

#2069

Learning Neural Deformation Representation for 4D Dynamic Shape Generation

Gyojin Han, Jiwan Hur, Jaehyun Choi et al.

ECCV 2024poster

#2070

3D Hand Pose Estimation in Everyday Egocentric Images

Aditya Prakash, Ruisen Tu, Matthew Chang et al.

ECCV 2024posterarXiv:2312.06583

#2071

Towards Robust Event-based Networks for Nighttime via Unpaired Day-to-Night Event Translation

Yuhwan Jeong, Hoonhee Cho, Kuk-Jin Yoon

ECCV 2024posterarXiv:2407.10703

#2072

Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene

Ruiyang Zhang, Hu Zhang, Hang Yu et al.

ECCV 2024posterarXiv:2407.08569

#2073

Robust Nearest Neighbors for Source-Free Domain Adaptation under Class Distribution Shift

Antonio Tejero-de-Pablos, Riku Togashi, Mayu Otani et al.

ECCV 2024poster

#2074

Six-Point Method for Multi-Camera Systems with Reduced Solution Space

Banglei Guan, Ji Zhao, Laurent Kneip

ECCV 2024posterarXiv:2402.18066

#2075

Physics-informed Knowledge Transfer for Underwater Monocular Depth Estimation

Jinghe Yang, Mingming Gong, Ye Pu

ECCV 2024poster

#2076

Learning Equilibrium Transformation for Gamut Expansion and Color Restoration

JUN XIAO, Changjian Shui, Zhi-Song Liu et al.

ECCV 2024poster

#2077

COIN-Matting: Confounder Intervention for Image Matting

Zhaohe Liao, Jiangtong Li, Jun Lan et al.

ECCV 2024poster

#2078

FoundPose: Unseen Object Pose Estimation with Foundation Features

Evin Pınar Örnek, Yann Labbé, Bugra Tekin et al.

ECCV 2024posterarXiv:2311.18809

#2079

SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras

Yingqi Tang, Zhaotie Meng, Guoliang Chen et al.

ECCV 2024posterarXiv:2403.10353

#2080

ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction

Shaozhe Hao, Kai Han, Zhengyao Lv et al.

ECCV 2024posterarXiv:2407.07077

#2081

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Yanwei Li, Chengyao Wang, Jiaya Jia

ECCV 2024posterarXiv:2311.17043

#2082

DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing

Hyeonho Jeong, Jinho Chang, GEON YEONG PARK et al.

ECCV 2024posterarXiv:2403.12002

#2083

Agent3D-Zero: An Agent for Zero-shot 3D Understanding

Sha Zhang, Di Huang, Jiajun Deng et al.

ECCV 2024posterarXiv:2403.11835

#2084

Structured-NeRF: Hierarchical Scene Graph with Neural Representation

Zhide Zhong, Jiakai Cao, songen gu et al.

ECCV 2024poster

#2085

Robustness Preserving Fine-tuning using Neuron Importance

Guangrui Li, Rahul Duggal, Aaditya Singh et al.

ECCV 2024poster

#2086

DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency

Xiaojing Zhong, Xinyi Huang, Xiaofeng Yang et al.

ECCV 2024posterarXiv:2408.07481

#2087

MeshFeat: Multi-Resolution Features for Neural Fields on Meshes

Mihir Mahajan, Florian Hofherr, Daniel Cremers

ECCV 2024posterarXiv:2407.13592

#2088

DragAPart: Learning a Part-Level Motion Prior for Articulated Objects

Ruining Li, Chuanxia Zheng, Christian Rupprecht et al.

ECCV 2024posterarXiv:2403.15382

#2089

CLR-GAN: Improving GANs Stability and Quality via Consistent Latent Representation and Reconstruction

Shengke Sun, Ziqian Luan, Zhanshan Zhao et al.

ECCV 2024poster

#2090

Learning to Unlearn for Robust Machine Unlearning

Mark HUANG, Lin Geng Foo, Jun Liu

ECCV 2024posterarXiv:2407.10494

#2091

Taming CLIP for Fine-grained and Structured Visual Understanding of Museum Exhibits

Ada-Astrid Balauca, Danda Paudel, Kristina Toutanova et al.

ECCV 2024posterarXiv:2409.01690

#2092

Visual Text Generation in the Wild

Yuanzhi Zhu, Jiawei Liu, Feiyu Gao et al.

ECCV 2024posterarXiv:2407.14138

#2093

FreeInit: Bridging Initialization Gap in Video Diffusion Models

Tianxing Wu, Chenyang Si, Yuming Jiang et al.

ECCV 2024posterarXiv:2312.07537

#2094

Learning Quantized Adaptive Conditions for Diffusion Models

Yuchen Liang, Yuchuan Tian, Lei Yu et al.

ECCV 2024posterarXiv:2409.17487

#2095

Learn to Optimize Denoising Scores: A Unified and Improved Diffusion Prior for 3D Generation

Xiaofeng Yang, Yiwen Chen, Cheng Chen et al.

ECCV 2024poster

#2096

Rethinking Fast Adversarial Training: A Splitting Technique To Overcome Catastrophic Overfitting

Masoumeh Zareapoor, Pourya Shamsolmoali

ECCV 2024poster

#2097

Similarity of Neural Architectures using Adversarial Attack Transferability

Jaehui Hwang, Dongyoon Han, Byeongho Heo et al.

ECCV 2024posterarXiv:2210.11407

#2098

In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation

Dahyun Kang, Minsu Cho

ECCV 2024posterarXiv:2408.04961

#2099

CadVLM: Bridging Language and Vision in the Generation of Parametric CAD Sketches

Sifan Wu, Amir Hosein Khasahmadi, Mor Katz et al.

ECCV 2024posterarXiv:2409.17457

#2100

AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild

Junho Park, Kyeongbo Kong, Suk-Ju Kang

ECCV 2024posterarXiv:2407.18034

#2101

Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection

Hu Cao, Zehua Zhang, Yan Xia et al.

ECCV 2024posterarXiv:2407.12582

#2102

TimeLens-XL: Real-time Event-based Video Frame Interpolation with Large Motion

Shi Guo, Yutian Chen, Tianfan Xue et al.

ECCV 2024poster

#2103

Identity-Consistent Diffusion Network for Grading Knee Osteoarthritis Progression in Radiographic Imaging

Wenhua Wu, Kun Hu, Wenxi Yue et al.

ECCV 2024posterarXiv:2407.21381

#2104

Teach CLIP to Develop a Number Sense for Ordinal Regression

Yao DU, Qiang Zhai, Weihang Dai et al.

ECCV 2024posterarXiv:2408.03574

#2105

Compact 3D Scene Representation via Self-Organizing Gaussian Grids

Wieland Morgenstern, Florian Barthel, Anna Hilsmann et al.

ECCV 2024posterarXiv:2312.13299

#2106

Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats

Mingyang Xie, Haoming Cai, Sachin Shah et al.

ECCV 2024posterarXiv:2410.02764

#2107

Instant Uncertainty Calibration of NeRFs Using a Meta-Calibrator

Niki Amini-Naieni, Tomas Jakab, Andrea Vedaldi et al.

ECCV 2024posterarXiv:2312.02350

#2108

SHIC: Shape-Image Correspondences with no Keypoint Supervision

Aleksandar Shtedritski, Christian Rupprecht, Andrea Vedaldi

ECCV 2024posterarXiv:2407.18907

#2109

Debiasing surgeon: fantastic weights and how to find them

Remi Nahon, Ivan Luiz De Moura Matos, Van-Tam Nguyen et al.

ECCV 2024posterarXiv:2403.14200

#2110

R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model

Changhoon Kim, Kyle Min, Yezhou Yang

ECCV 2024posterarXiv:2405.16341

#2111

Labeled Data Selection for Category Discovery

Bingchen Zhao, Nico Lang, Serge Belongie et al.

ECCV 2024posterarXiv:2406.04898

#2112

Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network

Sukwon Yun, Jie Peng, Alexandro E Trevino et al.

ECCV 2024posterarXiv:2407.17857

#2113

A Multimodal Benchmark Dataset and Model for Crop Disease Diagnosis

Xiang Liu, Zhaoxiang Liu, Huan Hu et al.

ECCV 2024posterarXiv:2503.06973

#2114

Understanding and Mitigating Human-Labelling Errors in Supervised Contrastive Learning

Zijun Long, Lipeng Zhuang, George W Killick et al.

ECCV 2024posterarXiv:2403.06289

#2115

MOD-UV: Learning Mobile Object Detectors from Unlabeled Videos

Yihong Sun, Bharath Hariharan

ECCV 2024posterarXiv:2405.14841

#2116

Leveraging scale- and orientation-covariant features for planar motion estimation

Marcus Valtonen Örnhag, Alberto Jaenal

ECCV 2024poster

#2117

WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation

Jiachen Lu, Ze Huang, Zeyu Yang et al.

ECCV 2024posterarXiv:2312.02934

#2118

EMIE-MAP: Large-Scale Road Surface Reconstruction Based on Explicit Mesh and Implicit Encoding

Wenhua Wu, Qi Wang, Guangming Wang et al.

ECCV 2024posterarXiv:2403.11789

#2119

HyperSpaceX: Radial and Angular Exploration of HyperSpherical Dimensions

Chiranjeev Chiranjeev, Muskan Dosi, Kartik Thakral et al.

ECCV 2024posterarXiv:2408.02494

#2120

Nonverbal Interaction Detection

Jianan Wei, Tianfei Zhou, Yi Yang et al.

ECCV 2024posterarXiv:2407.08133

#2121

PoseSOR: Human Pose Can Guide Our Attention

Huankang Guan, Rynson W.H. Lau

ECCV 2024poster

#2122

Generalized Coverage for More Robust Low-Budget Active Learning

Wonho Bae, Junhyug Noh, Danica J. Sutherland

ECCV 2024posterarXiv:2407.12212

#2123

Track Everything Everywhere Fast and Robustly

Yunzhou Song, Jiahui Lei, Ziyun Wang et al.

ECCV 2024posterarXiv:2403.17931

#2124

Common Sense Reasoning for Deep Fake Detection

Yue Zhang, Ben Colman, Xiao Guo et al.

ECCV 2024poster

#2125

Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models

Samuele Poppi, Tobia Poppi, Federico Cocchi et al.

ECCV 2024posterarXiv:2311.16254

#2126

Idling Neurons, Appropriately Lenient Workload During Fine-tuning Leads to Better Generalization

Hongjing Niu, Hanting Li, Bin Li et al.

ECCV 2024poster

#2127

Improving image synthesis with diffusion-negative sampling

Alakh Desai, Nuno Vasconcelos

ECCV 2024posterarXiv:2411.05473

#2128

WRIM-Net: Wide-Ranging Information Mining Network for Visible-Infrared Person Re-Identification

Yonggan Wu, Ling-Chao Meng, Yuan Zichao et al.

ECCV 2024posterarXiv:2408.10624

#2129

CatchBackdoor: Backdoor Detection via Critical Trojan Neural Path Fuzzing

Haibo Jin, Ruoxi Chen, Jinyin Chen et al.

ECCV 2024posterarXiv:2112.13064

#2130

DriveLM: Driving with Graph Visual Question Answering

Chonghao Sima, Katrin Renz, Kashyap Chitta et al.

ECCV 2024posterarXiv:2312.14150

#2131

ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling

Siming Yan, Min Bai, Weifeng Chen et al.

ECCV 2024posterarXiv:2402.06118

#2132

Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning

Meixuan Li, Tianyu Li, Guoqing Wang et al.

ECCV 2024posterarXiv:2403.10252

#2133

Deep Companion Learning: Enhancing Generalization Through Historical Consistency

Ruizhao Zhu, Venkatesh Saligrama

ECCV 2024posterarXiv:2407.18821

#2134

ABC Easy as 123: A Blind Counter for Exemplar-Free Multi-Class Class-agnostic Counting

Michael A Hobley, Victor Adrian Prisacariu

ECCV 2024posterarXiv:2309.04820

#2135

CrossScore: A Multi-View Approach to Image Evaluation and Scoring

Zirui Wang, Wenjing Bian, Victor Adrian Prisacariu

ECCV 2024poster

#2136

CPM: Class-conditional Prompting Machine for Audio-visual Segmentation

Yuanhong Chen, Chong Wang, Yuyuan Liu et al.

ECCV 2024posterarXiv:2407.05358

#2137

DiffClass: Diffusion-Based Class Incremental Learning

Zichong Meng, Jie Zhang, Changdi Yang et al.

ECCV 2024posterarXiv:2403.05016

#2138

Dual-Rain: Video Rain Removal using Assertive and Gentle Teachers

Tingting Chen, Beibei Lin, Yeying Jin et al.

ECCV 2024poster

#2139

DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing

Minghao Chen, Iro Laina, Andrea Vedaldi

ECCV 2024posterarXiv:2404.18929

#2140

Dynamic Neural Radiance Field From Defocused Monocular Video

Xianrui Luo, Huiqiang Sun, Juewen Peng et al.

ECCV 2024posterarXiv:2407.05586

#2141

4Diff: 3D-Aware Diffusion Model for Third-to-First Viewpoint Translation

Feng Cheng, Mi Luo, Huiyu Wang et al.

ECCV 2024poster

#2142

Chronologically Accurate Retrieval for Temporal Grounding of Motion-Language Models

Kent Fujiwara, Mikihiro Tanaka, Qing Yu

ECCV 2024posterarXiv:2407.15408

#2143

Realistic Human Motion Generation with Cross-Diffusion Models

Zeping Ren, Shaoli Huang, Xiu Li

ECCV 2024posterarXiv:2312.10993

#2144

Diffusion Models as Optimizers for Efficient Planning in Offline RL

Renming Huang, Yunqiang Pei, Guoqing Wang et al.

ECCV 2024posterarXiv:2407.16142

#2145

MERLiN: Single-Shot Material Estimation and Relighting for Photometric Stereo

Ashish Tiwari, Satoshi Ikehata, Shanmuganathan Raman

ECCV 2024posterarXiv:2409.00674

#2146

BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation

Hee Suk Yoon, Eunseop Yoon, Joshua Tian Jin Tee et al.

ECCV 2024posterarXiv:2408.05926

#2147

Rethinking Few-shot Class-incremental Learning: Learning from Yourself

Yu-Ming Tang, Yi-Xing Peng, Jing-Ke Meng et al.

ECCV 2024posterarXiv:2407.07468

#2148

Attention Decomposition for Cross-Domain Semantic Segmentation

Liqiang He, Sinisa Todorovic

ECCV 2024poster

#2149

RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF

Sibi Catley-Chandar, Richard Shaw, Greg Slabaugh et al.

ECCV 2024posterarXiv:2403.11909

#2150

FuseTeacher: Modality-fused Encoders are Strong Vision Supervisors

Chen-Wei Xie, Siyang Sun, Liming Zhao et al.

ECCV 2024poster

#2151

MVDD: Multi-View Depth Diffusion Models

Zhen Wang, Qiangeng Xu, Feitong Tan et al.

ECCV 2024posterarXiv:2312.04875

#2152

Wavelet Convolutions for Large Receptive Fields

Shahaf Finder, Roy Amoyal, Eran Treister et al.

ECCV 2024posterarXiv:2407.05848

#2153

Gradient-based Out-of-Distribution Detection

Taha Entesari, Sina Sharifi, Bardia Safaei et al.

ECCV 2024poster

#2154

Veil Privacy on Visual Data: Concealing Privacy for Humans, Unveiling for DNNs

Shuchao Pang, Ruhao Ma, Bing Li et al.

ECCV 2024poster

#2155

FYI: Flip Your Images for Dataset Distillation

Byunggwan Son, Youngmin Oh, Donghyeon Baek et al.

ECCV 2024posterarXiv:2407.08113

#2156

SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow

Yuanzhi Zhu, Xingchao Liu, Qiang Liu

ECCV 2024posterarXiv:2407.12718

#2157

Simple Unsupervised Knowledge Distillation With Space Similarity

Aditya Singh, Haohan Wang

ECCV 2024posterarXiv:2409.13939

#2158

Efficient Vision Transformers with Partial Attention

Xuan-Thuy Vo, Duy-Linh Nguyen, Adri Priadana et al.

ECCV 2024poster

#2159

Learning Natural Consistency Representation for Face Forgery Video Detection

Daichi Zhang, Zihao Xiao, Shikun Li et al.

ECCV 2024posterarXiv:2407.10550

#2160

Towards Stable 3D Object Detection

Jiabao Wang, Qiang Meng, Guochao Liu et al.

ECCV 2024posterarXiv:2407.04305

#2161

Revisit Human-Scene Interaction via Space Occupancy

Xinpeng Liu, Haowen Hou, Yanchao Yang et al.

ECCV 2024posterarXiv:2312.02700

#2162

View-Consistent 3D Editing with Gaussian Splatting

Yuxuan Wang, Xuanyu Yi, Zike Wu et al.

ECCV 2024posterarXiv:2403.11868

#2163

HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes

Zhuopeng Li, Yilin Zhang, Chenming Wu et al.

ECCV 2024posterarXiv:2403.20032

#2164

Generating Human Interaction Motions in Scenes with Text Control

Hongwei Yi, Justus Thies, Michael J. Black et al.

ECCV 2024posterarXiv:2404.10685

#2165

Multi-branch Collaborative Learning Network for 3D Visual Grounding

Zhipeng Qian, Yiwei Ma, Zhekai Lin et al.

ECCV 2024posterarXiv:2407.05363

#2166

KeypointDETR: An End-to-End 3D Keypoint Detector

Hairong Jin, Yuefan Shen, Jianwen Lou et al.

ECCV 2024poster

#2167

Instruction Tuning-free Visual Token Complement for Multimodal LLMs

Dongsheng Wang, Jiequan Cui, Miaoge Li et al.

ECCV 2024posterarXiv:2408.05019

#2168

Improving Point-based Crowd Counting and Localization Based on Auxiliary Point Guidance

I-HSIANG CHEN, Wei-Ting Chen, Yu-Wei Liu et al.

ECCV 2024posterarXiv:2405.10589

#2169

Online Temporal Action Localization with Memory-Augmented Transformer

Youngkil Song, Dongkeun Kim, Minsu Cho et al.

ECCV 2024posterarXiv:2408.02957

#2170

Bayesian Self-Training for Semi-Supervised 3D Segmentation

Ozan Unal, Christos Sakaridis, Luc Van Gool

ECCV 2024posterarXiv:2409.08102

#2171

SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer

Zijie Wu, Chaohui Yu, Yanqin Jiang et al.

ECCV 2024posterarXiv:2404.03736

#2172

Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models

Xiaoshi Wu, Yiming Hao, Manyuan Zhang et al.

ECCV 2024posterarXiv:2405.00760

#2173

Fully Authentic Visual Question Answering Dataset from Online Communities

Chongyan Chen, Mengchen Liu, Noel C Codella et al.

ECCV 2024posterarXiv:2311.15562

#2174

Revisit Self-supervision with Local Structure-from-Motion

Shengjie Zhu, Xiaoming Liu

ECCV 2024poster

#2175

Open-Vocabulary Camouflaged Object Segmentation

Youwei Pang, Xiaoqi Zhao, JiaMing Zuo et al.

ECCV 2024posterarXiv:2311.11241

#2176

Revisiting Feature Disentanglement Strategy in Diffusion Training and Breaking Conditional Independence Assumption in Sampling

Wonwoong Cho, Hareesh Ravi, Midhun Harikumar et al.

ECCV 2024poster

#2177

TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation

Yufei Liu, Junwei Zhu, Junshu Tang et al.

ECCV 2024posterarXiv:2403.12906

#2178

GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection

Ziying Song, Lei Yang, Shaoqing Xu et al.

ECCV 2024posterarXiv:2403.11848

#2179

EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis

Shuai Tan, Bin Ji, Mengxiao Bi et al.

ECCV 2024posterarXiv:2404.01647

#2180

Put Myself in Your Shoes: Lifting the Egocentric Perspective from Exocentric Videos

Mi Luo, Zihui Xue, Alex Dimakis et al.

ECCV 2024posterarXiv:2403.06351

#2181

SRPose: Two-view Relative Pose Estimation with Sparse Keypoints

Rui Yin, Yulun Zhang, Zherong Pan et al.

ECCV 2024posterarXiv:2407.08199

#2182

LivePhoto: Real Image Animation with Text-guided Motion Control

Xi Chen, Zhiheng Liu, Mengting Chen et al.

ECCV 2024posterarXiv:2312.02928

#2183

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion

Wendi Zheng, Jiayan Teng, Zhuoyi Yang et al.

ECCV 2024posterarXiv:2403.05121

#2184

OAPT: Offset-Aware Partition Transformer for Double JPEG Artifacts Removal

Qiao Mo, Yukang Ding, Jinhua Hao et al.

ECCV 2024posterarXiv:2408.11480

#2185

Context-Aware Action Recognition: Introducing a Comprehensive Dataset for Behavior Contrast

Tatsuya Sasaki, Yoshiki Ito, Satoshi Kondo

ECCV 2024poster

#2186

Privacy-Preserving Adaptive Re-Identification without Image Transfer

Hamza Rami, Jhony H. Giraldo, Nicolas Winckler et al.

ECCV 2024posterarXiv:2407.12589

#2187

FRI-Net: Floorplan Reconstruction via Room-wise Implicit Representation

Honghao Xu, Juzhan Xu, Zeyu Huang et al.

ECCV 2024posterarXiv:2407.10687

#2188

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Jinbo Xing, Menghan Xia, Yong Zhang et al.

ECCV 2024posterarXiv:2310.12190

#2189

Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors

Tongkun Guan, Wei Shen, Xue Yang et al.

ECCV 2024posterarXiv:2312.05286

#2190

Motion Aware Event Representation-driven Image Deblurring

Zhijing Sun, Xueyang Fu, Longzhuo Huang et al.

ECCV 2024poster

#2191

OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing

Pranav Gupta, Rishubh Singh, Pradeep Shenoy et al.

ECCV 2024posterarXiv:2411.02858

#2192

Let the Avatar Talk using Texts without Paired Training Data

Xiuzhe Wu, Yang-Tian Sun, Handi Chen et al.

ECCV 2024poster

#2193

Uncertainty Calibration with Energy Based Instance-wise Scaling in the Wild Dataset

Mijoo Kim, Junseok Kwon

ECCV 2024posterarXiv:2407.12330

#2194

Attention Beats Linear for Fast Implicit Neural Representation Generation

Shuyi Zhang, Ke Liu, Jingjun Gu et al.

ECCV 2024posterarXiv:2407.15355

#2195

Prompt-Based Test-Time Real Image Dehazing: A Novel Pipeline

Zixuan Chen, Zewei He, Ziqian Lu et al.

ECCV 2024posterarXiv:2309.17389

#2196

Category Adaptation Meets Projected Distillation in Generalized Continual Category Discovery

Grzegorz Rypesc, Daniel Marczak, Sebastian Cygert et al.

ECCV 2024poster

#2197

3D Hand Sequence Recovery from Real Blurry Images and Event Stream

Joonkyu Park, Gyeongsik Moon, Weipeng Xu et al.

ECCV 2024poster

#2198

SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization

Yiyang Chen, Siyan Dong, Xulong Wang et al.

ECCV 2024posterarXiv:2407.12667

#2199

Segmentation-guided Layer-wise Image Vectorization with Gradient Fills

Hengyu Zhou, Hui Zhang, Bin Wang

ECCV 2024posterarXiv:2408.15741

#2200

ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images

Xiangtian Xue, Jiasong Wu, Youyong Kong et al.

ECCV 2024posterarXiv:2403.10004

← Previous

1...9 10 11 12