Most Cited ECCV "autonomous agent training" Papers

2,387 papers found • Page 11 of 12

#2001

Delving into Adversarial Robustness on Document Tampering Localization

Huiru Shao, Zhuang Qian, Kaizhu Huang et al.

ECCV 2024poster
#2002

SceneTeller: Language-to-3D Scene Generation

Basak Melis Ocal, Maxim Tatarchenko, Sezer Karaoglu et al.

ECCV 2024poster
#2003

MagMax: Leveraging Model Merging for Seamless Continual Learning

Daniel Marczak, Bartlomiej Twardowski, Tomasz Trzcinski et al.

ECCV 2024posterarXiv:2407.06322
#2004

Spline-based Transformers

Prashanth Chandran, Agon Serifi, Markus Gross et al.

ECCV 2024posterarXiv:2504.02797
#2005

Efficient NeRF Optimization - Not All Samples Remain Equally Hard

Juuso Korhonen, Goutham Rangu, Hamed Rezazadegan Tavakoli et al.

ECCV 2024posterarXiv:2408.03193
#2006

VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks

Xiangxiang Chu, Jianlin Su, Bo Zhang et al.

ECCV 2024posterarXiv:2403.00522
#2007

Missing Modality Prediction for Unpaired Multimodal Learning via Joint Embedding of Unimodal Models

Taesup Kim, Donggeun Kim

ECCV 2024posterarXiv:2407.12616
#2008

Fast Diffusion-Based Counterfactuals for Shortcut Removal and Generation

Nina Weng, Paraskevas Pegios, Eike Petersen et al.

ECCV 2024posterarXiv:2312.14223
#2009

Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention

Zuyao Chen, Jinlin Wu, Zhen Lei et al.

ECCV 2024posterarXiv:2311.10988
#2010

FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation

Xinzhi MU, Li Chen, Bohan CHEN et al.

ECCV 2024posterarXiv:2406.08392
#2011

GroCo: Ground Constraint for Metric Self-Supervised Monocular Depth

Aurélien Cecille, Stefan Duffner, Franck DAVOINE et al.

ECCV 2024posterarXiv:2409.14850
#2012

MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory Prediction

Seongju Lee, Junseok Lee, Yeonguk Yu et al.

ECCV 2024posterarXiv:2407.21635
#2013

Distributed Active Client Selection With Noisy Clients Using Model Association Scores

Kwang In Kim

ECCV 2024poster
#2014

Tight and Efficient Upper Bound on Spectral Norm of Convolutional Layers

Ekaterina Grishina, Mikhail Gorbunov, Maxim Rakhuba

ECCV 2024posterarXiv:2409.11859
#2015

Deciphering the Role of Representation Disentanglement: Investigating Compositional Generalization in CLIP Models

Reza Abbasi, Mohammad Rohban, Mahdieh Soleymani Baghshah

ECCV 2024posterarXiv:2407.05897
#2016

A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars

Ronglai Zuo, Fangyun Wei, Zenggui Chen et al.

ECCV 2024posterarXiv:2401.04730
#2017

Unveiling Privacy Risks in Stochastic Neural Networks Training: Effective Image Reconstruction from Gradients

Yiming Chen, Xiangyu Yang, Nikos Deligiannis

ECCV 2024poster
#2018

Towards compact reversible image representations for neural style transfer

Xiyao Liu, Siyu Yang, Jian Zhang et al.

ECCV 2024poster
#2019

SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision

Ankit Vani, Bac Nguyen, Samuel Lavoie et al.

ECCV 2024posterarXiv:2404.15721
#2020

Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training Strategy

Tao Li, Weisen Jiang, Fanghui Liu et al.

ECCV 2024posterarXiv:2407.03641
#2021

Straightforward Layer-wise Pruning for More Efficient Visual Adaptation

Ruizi Han, Jinglei Tang

ECCV 2024posterarXiv:2407.14330
#2022

Dual-Decoupling Learning and Metric-Adaptive Thresholding for Semi-Supervised Multi-Label Learning

Jiahao Xiao, Ming-Kun Xie, Heng-Bo Fan et al.

ECCV 2024posterarXiv:2407.18624
#2023

SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

Lingchen Meng, Shiyi Lan, Hengduo Li et al.

ECCV 2024posterarXiv:2311.14671
#2024

Handling The Non-Smooth Challenge in Tensor SVD: A Multi-Objective Tensor Recovery Framework

Jingjing Zheng, Wanglong Lu, Wenzhe Wang et al.

ECCV 2024posterarXiv:2311.13958
#2025

DoubleTake: Geometry Guided Depth Estimation

Mohamed Sayed, Filippo Aleotti, Jamie Watson et al.

ECCV 2024posterarXiv:2406.18387
#2026

Sur^2f: A Hybrid Representation for High-Quality and Efficient Surface Reconstruction from Multi-view Images

Zhangjin Huang, Zhihao Liang, Kui Jia

ECCV 2024poster
#2027

UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model

Xiangyu Fan, Jiaqi Li, Zhiqian Lin et al.

ECCV 2024posterarXiv:2408.00762
#2028

PartCraft: Crafting Creative Objects by Parts

Kam Woh Ng, Xiatian Zhu, Yi-Zhe Song et al.

ECCV 2024posterarXiv:2407.04604
#2029

Coarse-to-Fine Implicit Representation Learning for 3D Hand-Object Reconstruction from a Single RGB-D Image

Xingyu Liu, Pengfei Ren, Jingyu Wang et al.

ECCV 2024poster
#2030

AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation

Sun Yanan, Yanchen Liu, Yinhao Tang et al.

ECCV 2024posterarXiv:2406.18958
#2031

Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design

Li, zhihao shu, Jie Ji et al.

ECCV 2024posterarXiv:2407.02813
#2032

PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation

Renjie Lu, Jing-Ke Meng, WEISHI ZHENG

ECCV 2024posterarXiv:2407.11487
#2033

Long-CLIP: Unlocking the Long-Text Capability of CLIP

Beichen Zhang, Pan Zhang, Xiaoyi Dong et al.

ECCV 2024posterarXiv:2403.15378
#2034

Learning with Counterfactual Explanations for Radiology Report Generation

Mingjie Li, Haokun Lin, Liang Qiu et al.

ECCV 2024posterarXiv:2407.14474
#2035

Pseudo-Embedding for Generalized Few-Shot Point Cloud Segmentation

Chih-Jung Tsai, Hwann-Tzong Chen, Tyng-Luh Liu

ECCV 2024poster
#2036

AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer

Zhuguanyu Wu, Jiaxin Chen, Hanwen Zhong et al.

ECCV 2024posterarXiv:2407.12951
#2037

Oulu Remote-photoplethysmography Physical Domain Attacks Database (ORPDAD)

Marko Savic, Guoying Zhao

ECCV 2024poster
#2038

Optimizing Illuminant Estimation in Dual-Exposure HDR Imaging

Mahmoud Afifi, Zhenhua Hu, Liang Liang

ECCV 2024posterarXiv:2403.02449
#2039

Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts

Shuangkang Fang, Yufeng Wang, Yi-Hsuan Tsai et al.

ECCV 2024posterarXiv:2407.06842
#2040

Embodied Understanding of Driving Scenarios

Yunsong Zhou, Linyan Huang, Qingwen Bu et al.

ECCV 2024posterarXiv:2403.04593
#2041

HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation

Shanyan Guan, Yanhao Ge, Ying Tai et al.

ECCV 2024posterarXiv:2410.08192
#2042

On the Viability of Monocular Depth Pre-training for Semantic Segmentation

DONG LAO, Fengyu Yang, Daniel Wang et al.

ECCV 2024posterarXiv:2203.13987
#2043

Weakly-supervised Camera Localization by Ground-to-satellite Image Registration

Yujiao Shi, HONGDONG LI, Akhil Perincherry et al.

ECCV 2024posterarXiv:2409.06471
#2044

ProtoComp: Diverse Point Cloud Completion with Controllable Prototype

Xumin Yu, Yanbo Wang, Jie Zhou et al.

ECCV 2024poster
#2045

Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture

Xuanchen Li, Yuhao Cheng, Xingyu Ren et al.

ECCV 2024posterarXiv:2406.00440
#2046

SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models

Ziyi Lin, Dongyang Liu, Renrui Zhang et al.

ECCV 2024poster
#2047

Diffusion Models as Data Mining Tools

Ioannis Siglidis, Aleksander Holynski, Alexei Efros et al.

ECCV 2024posterarXiv:2408.02752
#2048

SCPNet: Unsupervised Cross-modal Homography Estimation via Intra-modal Self-supervised Learning

Runmin Zhang, Jun Ma, Lun Luo et al.

ECCV 2024posterarXiv:2407.08148
#2049

Leveraging Representations from Intermediate Encoder-blocks for Synthetic Image Detection

Christos Koutlis, Symeon Papadopoulos

ECCV 2024posterarXiv:2402.19091
#2050

Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration

shihao zhou, Jinshan Pan, Jinglei Shi et al.

ECCV 2024posterarXiv:2404.00288
#2051

Animate Your Motion: Turning Still Images into Dynamic Videos

Mingxiao Li, Bo Wan, Marie-Francine Moens et al.

ECCV 2024posterarXiv:2403.10179
#2052

Spatial-Temporal Multi-level Association for Video Object Segmentation

Deshui Miao, Xin Li, Zhenyu He et al.

ECCV 2024posterarXiv:2404.06265
#2053

Adaptive Multi-head Contrastive Learning

Lei Wang, Piotr Koniusz, Tom Gedeon et al.

ECCV 2024posterarXiv:2310.05615
#2054

UniProcessor: A Text-induced Unified Low-level Image Processor

Huiyu Duan, Xiongkuo Min, Sijing Wu et al.

ECCV 2024posterarXiv:2407.20928
#2055

GRACE: Graph-Based Contextual Debiasing for Fair Visual Question Answering

Yifeng Zhang, Ming Jiang, Qi Zhao

ECCV 2024poster
#2056

Learning Chain of Counterfactual Thought for Bias-Robust Vision-Language Reasoning

Yifeng Zhang, Ming Jiang, Qi Zhao

ECCV 2024poster
#2057

Generalizing to Unseen Domains via Text-guided Augmentation

Daiqing Qi, Handong Zhao, Aidong Zhang et al.

ECCV 2024poster
#2058

On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines

Selim Kuzucu, Kemal Oksuz, Jonathan Sadeghi et al.

ECCV 2024posterarXiv:2405.20459
#2059

IVTP: Instruction-guided Visual Token Pruning for Large Vision-Language Models

Kai Huang, Hao Zou, Ye Xi et al.

ECCV 2024poster
#2060

Unsqueeze [CLS] Bottleneck to Learn Rich Representations

Qing Su, Shihao Ji

ECCV 2024posterarXiv:2407.17671
#2061

RCS-Prompt: Learning Prompt to Rearrange Class Space for Prompt-based Continual Learning

Longrong Yang, Hanbin Zhao, Yunlong Yu et al.

ECCV 2024poster
#2062

Dynamic Guidance Adversarial Distillation with Enhanced Teacher Knowledge

Hyejin Park, Dongbo Min

ECCV 2024posterarXiv:2409.01627
#2063

LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models

Yabin Zhang, Wenjie Zhu, Chenhang He et al.

ECCV 2024posterarXiv:2407.08966
#2064

Gaussian Grouping: Segment and Edit Anything in 3D Scenes

Mingqiao Ye, Martin Danelljan, Fisher Yu et al.

ECCV 2024posterarXiv:2312.00732
#2065

StructLDM: Structured Latent Diffusion for 3D Human Generation

Tao Hu, Fangzhou Hong, Ziwei Liu

ECCV 2024posterarXiv:2404.01241
#2066

High-Fidelity Modeling of Generalizable Wrinkle Deformation

Jingfan Guo, Jae Shin Yoon, Shunsuke Saito et al.

ECCV 2024poster
#2067

EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion

Guangyao Zhai, Evin Pınar Örnek, Dave Zhenyu Chen et al.

ECCV 2024posterarXiv:2405.00915
#2068

NeRF-XL: NeRF at Any Scale with Multi-GPU

Ruilong Li, Sanja Fidler, Angjoo Kanazawa et al.

ECCV 2024poster
#2069

Learning Neural Deformation Representation for 4D Dynamic Shape Generation

Gyojin Han, Jiwan Hur, Jaehyun Choi et al.

ECCV 2024poster
#2070

3D Hand Pose Estimation in Everyday Egocentric Images

Aditya Prakash, Ruisen Tu, Matthew Chang et al.

ECCV 2024posterarXiv:2312.06583
#2071

Towards Robust Event-based Networks for Nighttime via Unpaired Day-to-Night Event Translation

Yuhwan Jeong, Hoonhee Cho, Kuk-Jin Yoon

ECCV 2024posterarXiv:2407.10703
#2072

Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene

Ruiyang Zhang, Hu Zhang, Hang Yu et al.

ECCV 2024posterarXiv:2407.08569
#2073

Robust Nearest Neighbors for Source-Free Domain Adaptation under Class Distribution Shift

Antonio Tejero-de-Pablos, Riku Togashi, Mayu Otani et al.

ECCV 2024poster
#2074

Six-Point Method for Multi-Camera Systems with Reduced Solution Space

Banglei Guan, Ji Zhao, Laurent Kneip

ECCV 2024posterarXiv:2402.18066
#2075

Physics-informed Knowledge Transfer for Underwater Monocular Depth Estimation

Jinghe Yang, Mingming Gong, Ye Pu

ECCV 2024poster
#2076

Learning Equilibrium Transformation for Gamut Expansion and Color Restoration

JUN XIAO, Changjian Shui, Zhi-Song Liu et al.

ECCV 2024poster
#2077

COIN-Matting: Confounder Intervention for Image Matting

Zhaohe Liao, Jiangtong Li, Jun Lan et al.

ECCV 2024poster
#2078

FoundPose: Unseen Object Pose Estimation with Foundation Features

Evin Pınar Örnek, Yann Labbé, Bugra Tekin et al.

ECCV 2024posterarXiv:2311.18809
#2079

SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras

Yingqi Tang, Zhaotie Meng, Guoliang Chen et al.

ECCV 2024posterarXiv:2403.10353
#2080

ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction

Shaozhe Hao, Kai Han, Zhengyao Lv et al.

ECCV 2024posterarXiv:2407.07077
#2081

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Yanwei Li, Chengyao Wang, Jiaya Jia

ECCV 2024posterarXiv:2311.17043
#2082

DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing

Hyeonho Jeong, Jinho Chang, GEON YEONG PARK et al.

ECCV 2024posterarXiv:2403.12002
#2083

Agent3D-Zero: An Agent for Zero-shot 3D Understanding

Sha Zhang, Di Huang, Jiajun Deng et al.

ECCV 2024posterarXiv:2403.11835
#2084

Structured-NeRF: Hierarchical Scene Graph with Neural Representation

Zhide Zhong, Jiakai Cao, songen gu et al.

ECCV 2024poster
#2085

Robustness Preserving Fine-tuning using Neuron Importance

Guangrui Li, Rahul Duggal, Aaditya Singh et al.

ECCV 2024poster
#2086

DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency

Xiaojing Zhong, Xinyi Huang, Xiaofeng Yang et al.

ECCV 2024posterarXiv:2408.07481
#2087

MeshFeat: Multi-Resolution Features for Neural Fields on Meshes

Mihir Mahajan, Florian Hofherr, Daniel Cremers

ECCV 2024posterarXiv:2407.13592
#2088

DragAPart: Learning a Part-Level Motion Prior for Articulated Objects

Ruining Li, Chuanxia Zheng, Christian Rupprecht et al.

ECCV 2024posterarXiv:2403.15382
#2089

CLR-GAN: Improving GANs Stability and Quality via Consistent Latent Representation and Reconstruction

Shengke Sun, Ziqian Luan, Zhanshan Zhao et al.

ECCV 2024poster
#2090

Learning to Unlearn for Robust Machine Unlearning

Mark HUANG, Lin Geng Foo, Jun Liu

ECCV 2024posterarXiv:2407.10494
#2091

Taming CLIP for Fine-grained and Structured Visual Understanding of Museum Exhibits

Ada-Astrid Balauca, Danda Paudel, Kristina Toutanova et al.

ECCV 2024posterarXiv:2409.01690
#2092

Visual Text Generation in the Wild

Yuanzhi Zhu, Jiawei Liu, Feiyu Gao et al.

ECCV 2024posterarXiv:2407.14138
#2093

FreeInit: Bridging Initialization Gap in Video Diffusion Models

Tianxing Wu, Chenyang Si, Yuming Jiang et al.

ECCV 2024posterarXiv:2312.07537
#2094

Learning Quantized Adaptive Conditions for Diffusion Models

Yuchen Liang, Yuchuan Tian, Lei Yu et al.

ECCV 2024posterarXiv:2409.17487
#2095

Learn to Optimize Denoising Scores: A Unified and Improved Diffusion Prior for 3D Generation

Xiaofeng Yang, Yiwen Chen, Cheng Chen et al.

ECCV 2024poster
#2096

Rethinking Fast Adversarial Training: A Splitting Technique To Overcome Catastrophic Overfitting

Masoumeh Zareapoor, Pourya Shamsolmoali

ECCV 2024poster
#2097

Similarity of Neural Architectures using Adversarial Attack Transferability

Jaehui Hwang, Dongyoon Han, Byeongho Heo et al.

ECCV 2024posterarXiv:2210.11407
#2098

In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation

Dahyun Kang, Minsu Cho

ECCV 2024posterarXiv:2408.04961
#2099

CadVLM: Bridging Language and Vision in the Generation of Parametric CAD Sketches

Sifan Wu, Amir Hosein Khasahmadi, Mor Katz et al.

ECCV 2024posterarXiv:2409.17457
#2100

AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild

Junho Park, Kyeongbo Kong, Suk-Ju Kang

ECCV 2024posterarXiv:2407.18034
#2101

Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection

Hu Cao, Zehua Zhang, Yan Xia et al.

ECCV 2024posterarXiv:2407.12582
#2102

TimeLens-XL: Real-time Event-based Video Frame Interpolation with Large Motion

Shi Guo, Yutian Chen, Tianfan Xue et al.

ECCV 2024poster
#2103

Identity-Consistent Diffusion Network for Grading Knee Osteoarthritis Progression in Radiographic Imaging

Wenhua Wu, Kun Hu, Wenxi Yue et al.

ECCV 2024posterarXiv:2407.21381
#2104

Teach CLIP to Develop a Number Sense for Ordinal Regression

Yao DU, Qiang Zhai, Weihang Dai et al.

ECCV 2024posterarXiv:2408.03574
#2105

Compact 3D Scene Representation via Self-Organizing Gaussian Grids

Wieland Morgenstern, Florian Barthel, Anna Hilsmann et al.

ECCV 2024posterarXiv:2312.13299
#2106

Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats

Mingyang Xie, Haoming Cai, Sachin Shah et al.

ECCV 2024posterarXiv:2410.02764
#2107

Instant Uncertainty Calibration of NeRFs Using a Meta-Calibrator

Niki Amini-Naieni, Tomas Jakab, Andrea Vedaldi et al.

ECCV 2024posterarXiv:2312.02350
#2108

SHIC: Shape-Image Correspondences with no Keypoint Supervision

Aleksandar Shtedritski, Christian Rupprecht, Andrea Vedaldi

ECCV 2024posterarXiv:2407.18907
#2109

Debiasing surgeon: fantastic weights and how to find them

Remi Nahon, Ivan Luiz De Moura Matos, Van-Tam Nguyen et al.

ECCV 2024posterarXiv:2403.14200
#2110

R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model

Changhoon Kim, Kyle Min, Yezhou Yang

ECCV 2024posterarXiv:2405.16341
#2111

Labeled Data Selection for Category Discovery

Bingchen Zhao, Nico Lang, Serge Belongie et al.

ECCV 2024posterarXiv:2406.04898
#2112

Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network

Sukwon Yun, Jie Peng, Alexandro E Trevino et al.

ECCV 2024posterarXiv:2407.17857
#2113

A Multimodal Benchmark Dataset and Model for Crop Disease Diagnosis

Xiang Liu, Zhaoxiang Liu, Huan Hu et al.

ECCV 2024posterarXiv:2503.06973
#2114

Understanding and Mitigating Human-Labelling Errors in Supervised Contrastive Learning

Zijun Long, Lipeng Zhuang, George W Killick et al.

ECCV 2024posterarXiv:2403.06289
#2115

MOD-UV: Learning Mobile Object Detectors from Unlabeled Videos

Yihong Sun, Bharath Hariharan

ECCV 2024posterarXiv:2405.14841
#2116

Leveraging scale- and orientation-covariant features for planar motion estimation

Marcus Valtonen Örnhag, Alberto Jaenal

ECCV 2024poster
#2117

WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation

Jiachen Lu, Ze Huang, Zeyu Yang et al.

ECCV 2024posterarXiv:2312.02934
#2118

EMIE-MAP: Large-Scale Road Surface Reconstruction Based on Explicit Mesh and Implicit Encoding

Wenhua Wu, Qi Wang, Guangming Wang et al.

ECCV 2024posterarXiv:2403.11789
#2119

HyperSpaceX: Radial and Angular Exploration of HyperSpherical Dimensions

Chiranjeev Chiranjeev, Muskan Dosi, Kartik Thakral et al.

ECCV 2024posterarXiv:2408.02494
#2120

Nonverbal Interaction Detection

Jianan Wei, Tianfei Zhou, Yi Yang et al.

ECCV 2024posterarXiv:2407.08133
#2121

PoseSOR: Human Pose Can Guide Our Attention

Huankang Guan, Rynson W.H. Lau

ECCV 2024poster
#2122

Generalized Coverage for More Robust Low-Budget Active Learning

Wonho Bae, Junhyug Noh, Danica J. Sutherland

ECCV 2024posterarXiv:2407.12212
#2123

Track Everything Everywhere Fast and Robustly

Yunzhou Song, Jiahui Lei, Ziyun Wang et al.

ECCV 2024posterarXiv:2403.17931
#2124

Common Sense Reasoning for Deep Fake Detection

Yue Zhang, Ben Colman, Xiao Guo et al.

ECCV 2024poster
#2125

Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models

Samuele Poppi, Tobia Poppi, Federico Cocchi et al.

ECCV 2024posterarXiv:2311.16254
#2126

Idling Neurons, Appropriately Lenient Workload During Fine-tuning Leads to Better Generalization

Hongjing Niu, Hanting Li, Bin Li et al.

ECCV 2024poster
#2127

Improving image synthesis with diffusion-negative sampling

Alakh Desai, Nuno Vasconcelos

ECCV 2024posterarXiv:2411.05473
#2128

WRIM-Net: Wide-Ranging Information Mining Network for Visible-Infrared Person Re-Identification

Yonggan Wu, Ling-Chao Meng, Yuan Zichao et al.

ECCV 2024posterarXiv:2408.10624
#2129

CatchBackdoor: Backdoor Detection via Critical Trojan Neural Path Fuzzing

Haibo Jin, Ruoxi Chen, Jinyin Chen et al.

ECCV 2024posterarXiv:2112.13064
#2130

DriveLM: Driving with Graph Visual Question Answering

Chonghao Sima, Katrin Renz, Kashyap Chitta et al.

ECCV 2024posterarXiv:2312.14150
#2131

ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling

Siming Yan, Min Bai, Weifeng Chen et al.

ECCV 2024posterarXiv:2402.06118
#2132

Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning

Meixuan Li, Tianyu Li, Guoqing Wang et al.

ECCV 2024posterarXiv:2403.10252
#2133

Deep Companion Learning: Enhancing Generalization Through Historical Consistency

Ruizhao Zhu, Venkatesh Saligrama

ECCV 2024posterarXiv:2407.18821
#2134

ABC Easy as 123: A Blind Counter for Exemplar-Free Multi-Class Class-agnostic Counting

Michael A Hobley, Victor Adrian Prisacariu

ECCV 2024posterarXiv:2309.04820
#2135

CrossScore: A Multi-View Approach to Image Evaluation and Scoring

Zirui Wang, Wenjing Bian, Victor Adrian Prisacariu

ECCV 2024poster
#2136

CPM: Class-conditional Prompting Machine for Audio-visual Segmentation

Yuanhong Chen, Chong Wang, Yuyuan Liu et al.

ECCV 2024posterarXiv:2407.05358
#2137

DiffClass: Diffusion-Based Class Incremental Learning

Zichong Meng, Jie Zhang, Changdi Yang et al.

ECCV 2024posterarXiv:2403.05016
#2138

Dual-Rain: Video Rain Removal using Assertive and Gentle Teachers

Tingting Chen, Beibei Lin, Yeying Jin et al.

ECCV 2024poster
#2139

DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing

Minghao Chen, Iro Laina, Andrea Vedaldi

ECCV 2024posterarXiv:2404.18929
#2140

Dynamic Neural Radiance Field From Defocused Monocular Video

Xianrui Luo, Huiqiang Sun, Juewen Peng et al.

ECCV 2024posterarXiv:2407.05586
#2141

4Diff: 3D-Aware Diffusion Model for Third-to-First Viewpoint Translation

Feng Cheng, Mi Luo, Huiyu Wang et al.

ECCV 2024poster
#2142

Chronologically Accurate Retrieval for Temporal Grounding of Motion-Language Models

Kent Fujiwara, Mikihiro Tanaka, Qing Yu

ECCV 2024posterarXiv:2407.15408
#2143

Realistic Human Motion Generation with Cross-Diffusion Models

Zeping Ren, Shaoli Huang, Xiu Li

ECCV 2024posterarXiv:2312.10993
#2144

Diffusion Models as Optimizers for Efficient Planning in Offline RL

Renming Huang, Yunqiang Pei, Guoqing Wang et al.

ECCV 2024posterarXiv:2407.16142
#2145

MERLiN: Single-Shot Material Estimation and Relighting for Photometric Stereo

Ashish Tiwari, Satoshi Ikehata, Shanmuganathan Raman

ECCV 2024posterarXiv:2409.00674
#2146

BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation

Hee Suk Yoon, Eunseop Yoon, Joshua Tian Jin Tee et al.

ECCV 2024posterarXiv:2408.05926
#2147

Rethinking Few-shot Class-incremental Learning: Learning from Yourself

Yu-Ming Tang, Yi-Xing Peng, Jing-Ke Meng et al.

ECCV 2024posterarXiv:2407.07468
#2148

Attention Decomposition for Cross-Domain Semantic Segmentation

Liqiang He, Sinisa Todorovic

ECCV 2024poster
#2149

RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF

Sibi Catley-Chandar, Richard Shaw, Greg Slabaugh et al.

ECCV 2024posterarXiv:2403.11909
#2150

FuseTeacher: Modality-fused Encoders are Strong Vision Supervisors

Chen-Wei Xie, Siyang Sun, Liming Zhao et al.

ECCV 2024poster
#2151

MVDD: Multi-View Depth Diffusion Models

Zhen Wang, Qiangeng Xu, Feitong Tan et al.

ECCV 2024posterarXiv:2312.04875
#2152

Wavelet Convolutions for Large Receptive Fields

Shahaf Finder, Roy Amoyal, Eran Treister et al.

ECCV 2024posterarXiv:2407.05848
#2153

Gradient-based Out-of-Distribution Detection

Taha Entesari, Sina Sharifi, Bardia Safaei et al.

ECCV 2024poster
#2154

Veil Privacy on Visual Data: Concealing Privacy for Humans, Unveiling for DNNs

Shuchao Pang, Ruhao Ma, Bing Li et al.

ECCV 2024poster
#2155

FYI: Flip Your Images for Dataset Distillation

Byunggwan Son, Youngmin Oh, Donghyeon Baek et al.

ECCV 2024posterarXiv:2407.08113
#2156

SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow

Yuanzhi Zhu, Xingchao Liu, Qiang Liu

ECCV 2024posterarXiv:2407.12718
#2157

Simple Unsupervised Knowledge Distillation With Space Similarity

Aditya Singh, Haohan Wang

ECCV 2024posterarXiv:2409.13939
#2158

Efficient Vision Transformers with Partial Attention

Xuan-Thuy Vo, Duy-Linh Nguyen, Adri Priadana et al.

ECCV 2024poster
#2159

Learning Natural Consistency Representation for Face Forgery Video Detection

Daichi Zhang, Zihao Xiao, Shikun Li et al.

ECCV 2024posterarXiv:2407.10550
#2160

Towards Stable 3D Object Detection

Jiabao Wang, Qiang Meng, Guochao Liu et al.

ECCV 2024posterarXiv:2407.04305
#2161

Revisit Human-Scene Interaction via Space Occupancy

Xinpeng Liu, Haowen Hou, Yanchao Yang et al.

ECCV 2024posterarXiv:2312.02700
#2162

View-Consistent 3D Editing with Gaussian Splatting

Yuxuan Wang, Xuanyu Yi, Zike Wu et al.

ECCV 2024posterarXiv:2403.11868
#2163

HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes

Zhuopeng Li, Yilin Zhang, Chenming Wu et al.

ECCV 2024posterarXiv:2403.20032
#2164

Generating Human Interaction Motions in Scenes with Text Control

Hongwei Yi, Justus Thies, Michael J. Black et al.

ECCV 2024posterarXiv:2404.10685
#2165

Multi-branch Collaborative Learning Network for 3D Visual Grounding

Zhipeng Qian, Yiwei Ma, Zhekai Lin et al.

ECCV 2024posterarXiv:2407.05363
#2166

KeypointDETR: An End-to-End 3D Keypoint Detector

Hairong Jin, Yuefan Shen, Jianwen Lou et al.

ECCV 2024poster
#2167

Instruction Tuning-free Visual Token Complement for Multimodal LLMs

Dongsheng Wang, Jiequan Cui, Miaoge Li et al.

ECCV 2024posterarXiv:2408.05019
#2168

Improving Point-based Crowd Counting and Localization Based on Auxiliary Point Guidance

I-HSIANG CHEN, Wei-Ting Chen, Yu-Wei Liu et al.

ECCV 2024posterarXiv:2405.10589
#2169

Online Temporal Action Localization with Memory-Augmented Transformer

Youngkil Song, Dongkeun Kim, Minsu Cho et al.

ECCV 2024posterarXiv:2408.02957
#2170

Bayesian Self-Training for Semi-Supervised 3D Segmentation

Ozan Unal, Christos Sakaridis, Luc Van Gool

ECCV 2024posterarXiv:2409.08102
#2171

SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer

Zijie Wu, Chaohui Yu, Yanqin Jiang et al.

ECCV 2024posterarXiv:2404.03736
#2172

Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models

Xiaoshi Wu, Yiming Hao, Manyuan Zhang et al.

ECCV 2024posterarXiv:2405.00760
#2173

Fully Authentic Visual Question Answering Dataset from Online Communities

Chongyan Chen, Mengchen Liu, Noel C Codella et al.

ECCV 2024posterarXiv:2311.15562
#2174

Revisit Self-supervision with Local Structure-from-Motion

Shengjie Zhu, Xiaoming Liu

ECCV 2024poster
#2175

Open-Vocabulary Camouflaged Object Segmentation

Youwei Pang, Xiaoqi Zhao, JiaMing Zuo et al.

ECCV 2024posterarXiv:2311.11241
#2176

Revisiting Feature Disentanglement Strategy in Diffusion Training and Breaking Conditional Independence Assumption in Sampling

Wonwoong Cho, Hareesh Ravi, Midhun Harikumar et al.

ECCV 2024poster
#2177

TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation

Yufei Liu, Junwei Zhu, Junshu Tang et al.

ECCV 2024posterarXiv:2403.12906
#2178

GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection

Ziying Song, Lei Yang, Shaoqing Xu et al.

ECCV 2024posterarXiv:2403.11848
#2179

EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis

Shuai Tan, Bin Ji, Mengxiao Bi et al.

ECCV 2024posterarXiv:2404.01647
#2180

Put Myself in Your Shoes: Lifting the Egocentric Perspective from Exocentric Videos

Mi Luo, Zihui Xue, Alex Dimakis et al.

ECCV 2024posterarXiv:2403.06351
#2181

SRPose: Two-view Relative Pose Estimation with Sparse Keypoints

Rui Yin, Yulun Zhang, Zherong Pan et al.

ECCV 2024posterarXiv:2407.08199
#2182

LivePhoto: Real Image Animation with Text-guided Motion Control

Xi Chen, Zhiheng Liu, Mengting Chen et al.

ECCV 2024posterarXiv:2312.02928
#2183

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion

Wendi Zheng, Jiayan Teng, Zhuoyi Yang et al.

ECCV 2024posterarXiv:2403.05121
#2184

OAPT: Offset-Aware Partition Transformer for Double JPEG Artifacts Removal

Qiao Mo, Yukang Ding, Jinhua Hao et al.

ECCV 2024posterarXiv:2408.11480
#2185

Context-Aware Action Recognition: Introducing a Comprehensive Dataset for Behavior Contrast

Tatsuya Sasaki, Yoshiki Ito, Satoshi Kondo

ECCV 2024poster
#2186

Privacy-Preserving Adaptive Re-Identification without Image Transfer

Hamza Rami, Jhony H. Giraldo, Nicolas Winckler et al.

ECCV 2024posterarXiv:2407.12589
#2187

FRI-Net: Floorplan Reconstruction via Room-wise Implicit Representation

Honghao Xu, Juzhan Xu, Zeyu Huang et al.

ECCV 2024posterarXiv:2407.10687
#2188

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Jinbo Xing, Menghan Xia, Yong Zhang et al.

ECCV 2024posterarXiv:2310.12190
#2189

Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors

Tongkun Guan, Wei Shen, Xue Yang et al.

ECCV 2024posterarXiv:2312.05286
#2190

Motion Aware Event Representation-driven Image Deblurring

Zhijing Sun, Xueyang Fu, Longzhuo Huang et al.

ECCV 2024poster
#2191

OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing

Pranav Gupta, Rishubh Singh, Pradeep Shenoy et al.

ECCV 2024posterarXiv:2411.02858
#2192

Let the Avatar Talk using Texts without Paired Training Data

Xiuzhe Wu, Yang-Tian Sun, Handi Chen et al.

ECCV 2024poster
#2193

Uncertainty Calibration with Energy Based Instance-wise Scaling in the Wild Dataset

Mijoo Kim, Junseok Kwon

ECCV 2024posterarXiv:2407.12330
#2194

Attention Beats Linear for Fast Implicit Neural Representation Generation

Shuyi Zhang, Ke Liu, Jingjun Gu et al.

ECCV 2024posterarXiv:2407.15355
#2195

Prompt-Based Test-Time Real Image Dehazing: A Novel Pipeline

Zixuan Chen, Zewei He, Ziqian Lu et al.

ECCV 2024posterarXiv:2309.17389
#2196

Category Adaptation Meets Projected Distillation in Generalized Continual Category Discovery

Grzegorz Rypesc, Daniel Marczak, Sebastian Cygert et al.

ECCV 2024poster
#2197

3D Hand Sequence Recovery from Real Blurry Images and Event Stream

Joonkyu Park, Gyeongsik Moon, Weipeng Xu et al.

ECCV 2024poster
#2198

SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization

Yiyang Chen, Siyan Dong, Xulong Wang et al.

ECCV 2024posterarXiv:2407.12667
#2199

Segmentation-guided Layer-wise Image Vectorization with Gradient Fills

Hengyu Zhou, Hui Zhang, Bin Wang

ECCV 2024posterarXiv:2408.15741
#2200

ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images

Xiangtian Xue, Jiasong Wu, Youyong Kong et al.

ECCV 2024posterarXiv:2403.10004