Most Cited ECCV "llm web agents" Papers

2,387 papers found • Page 5 of 12

Filters:Most Cited ECCV llm web agents Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

#801

Motion and Structure from Event-based Normal Flow

Zhongyang Ren, Bangyan Liao, Delei Kong et al.

ECCV 2024posterarXiv:2407.12239

citations

#802

AdaDistill: Adaptive Knowledge Distillation for Deep Face Recognition

Fadi Boutros, Vitomir Struc, Naser Damer

ECCV 2024posterarXiv:2407.01332

citations

#803

Generating Physically Realistic and Directable Human Motions from Multi-Modal Inputs

Aayam Shrestha, Pan Liu, German Ros et al.

ECCV 2024posterarXiv:2502.05641

citations

#804

KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval

Xianwei Zhuang, Hongxiang Li, Xuxin Cheng et al.

ECCV 2024poster

citations

#805

ByteEdit: Boost, Comply and Accelerate Generative Image Editing

YUXI REN, Jie Wu, Yanzuo Lu et al.

ECCV 2024posterarXiv:2404.04860

citations

#806

CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation

Hajin Shim, Changhun Kim, Eunho Yang

ECCV 2024posterarXiv:2407.16193

citations

#807

Graph Neural Network Causal Explanation via Neural Causal Models

Arman Behnam, Binghui Wang

ECCV 2024posterarXiv:2407.09378

citations

#808

Open-Set Recognition in the Age of Vision-Language Models

Dimity Miller, Niko Suenderhauf, Alex Kenna et al.

ECCV 2024posterarXiv:2403.16528

citations

#809

Temporal-Mapping Photography for Event Cameras

Yuhan Bao, Lei Sun, Yuqin Ma et al.

ECCV 2024posterarXiv:2403.06443

citations

#810

Placing Objects in Context via Inpainting for Out-of-distribution Segmentation

Pau de Jorge Aranda, Riccardo Volpi, Puneet Dokania et al.

ECCV 2024posterarXiv:2402.16392

citations

#811

RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception

Shen Jianbing, Chunliang Li, Wencheng Han et al.

ECCV 2024posterarXiv:2407.10876

citations

#812

Hiding Imperceptible Noise in Curvature-Aware Patches for 3D Point Cloud Attack

Mingyu Yang, Daizong Liu, Keke Tang et al.

ECCV 2024poster

citations

#813

Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models

Minchan Kim, Minyeong Kim, Junik Bae et al.

ECCV 2024posterarXiv:2403.16167

citations

#814

Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos

Mohaiminul Islam, Tushar Nagarajan, Huiyu Wang et al.

ECCV 2024posterarXiv:2409.20557

citations

#815

Distributionally Robust Loss for Long-Tailed Multi-Label Image Classification

Dekun Lin, Zhe Cui, Rui Chen et al.

ECCV 2024poster

citations

#816

BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models

Ye-Bin Moon, Nam Hyeon-Woo, Wonseok Choi et al.

ECCV 2024posterarXiv:2407.13442

citations

#817

Multiscale Sliced Wasserstein Distances as Perceptual Color Difference Measures

Jiaqi He, Zhihua Wang, Leon Wang et al.

ECCV 2024posterarXiv:2407.10181

citations

#818

Idempotent Unsupervised Representation Learning for Skeleton-Based Action Recognition

Lilang Lin, Lehong Wu, Jiahang Zhang et al.

ECCV 2024posterarXiv:2410.20349

citations

#819

Few-shot NeRF by Adaptive Rendering Loss Regularization

Qingshan Xu, Xuanyu Yi, Jianyao Xu et al.

ECCV 2024posterarXiv:2410.17839

citations

#820

Emerging Property of Masked Token for Effective Pre-training

Hyesong Choi, Hunsang Lee, Seyoung Joung et al.

ECCV 2024posterarXiv:2404.08330

citations

#821

VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space

Guénolé Fiche, Simon Leglaive, Xavier Alameda-Pineda et al.

ECCV 2024posterarXiv:2312.08291

citations

#822

Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data

Tuo FENG, Wenguan Wang, Ruijie Quan et al.

ECCV 2024posterarXiv:2407.10200

citations

#823

Learning Semantic Latent Directions for Accurate and Controllable Human Motion Prediction

Guowei Xu, Jiale Tao, Wen Li et al.

ECCV 2024posterarXiv:2407.11494

citations

#824

Event-Aided Time-To-Collision Estimation for Autonomous Driving

Jinghang Li, Bangyan Liao, Xiuyuan LU et al.

ECCV 2024posterarXiv:2407.07324

citations

#825

3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance

Xiaoxu Xu, Yitian Yuan, Jinlong Li et al.

ECCV 2024posterarXiv:2407.09826

citations

#826

Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models

Claudio Rota, Marco Buzzelli, Joost Van de Weijer

ECCV 2024posterarXiv:2311.15908

citations

#827

Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation

Ruijie Xu, Chuyu Zhang, Hui Ren et al.

ECCV 2024posterarXiv:2407.12489

citations

#828

BlenderAlchemy: Editing 3D Graphics with Vision-Language Models

Ian Huang, Guandao Yang, Leonidas Guibas

ECCV 2024posterarXiv:2404.17672

citations

#829

Exploring Vulnerabilities in Spiking Neural Networks: Direct Adversarial Attacks on Raw Event Data

Yanmeng Yao, Xiaohan Zhao, Bin Gu

ECCV 2024poster

citations

#830

ActionVOS: Actions as Prompts for Video Object Segmentation

LIANGYANG OUYANG, Ruicong Liu, Yifei Huang et al.

ECCV 2024posterarXiv:2407.07402

citations

#831

Take A Step Back: Rethinking the Two Stages in Visual Reasoning

Mingyu Zhang, Jiting Cai, Mingyu Liu et al.

ECCV 2024posterarXiv:2407.19666

citations

#832

Certifiably Robust Image Watermark

Zhengyuan Jiang, Moyang Guo, Yuepeng Hu et al.

ECCV 2024posterarXiv:2407.04086

citations

#833

CountFormer: Multi-View Crowd Counting Transformer

Hong Mo, Xiong Zhang, Jianchao Tan et al.

ECCV 2024posterarXiv:2407.02047

citations

#834

Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model

chen rao, Guangyuan Li, Zehua Lan et al.

ECCV 2024posterarXiv:2408.13459

citations

#835

T-CorresNet: Template Guided 3D Point Cloud Completion with Correspondence Pooling Query Generation Strategy

Fan Duan, Jiahao Yu, Li Chen

ECCV 2024posterarXiv:2407.05008

citations

#836

DNI: Dilutional Noise Initialization for Diffusion Video Editing

Sunjae Yoon, Gwanhyeong Koo, Ji Woo Hong et al.

ECCV 2024posterarXiv:2409.13037

citations

#837

EraseDraw : Learning to Insert Objects by Erasing Them from Images

Alper Canberk, Maksym Bondarenko, Ege Ozguroglu et al.

ECCV 2024poster

citations

#838

Learning Diffusion Models for Multi-View Anomaly Detection

Chieh Liu, Yu-Min Chu, Ting-I Hsieh et al.

ECCV 2024poster

citations

#839

Self-supervised visual learning from interactions with objects

Arthur Aubret, Céline Teulière, Jochen Triesch

ECCV 2024posterarXiv:2407.06704

citations

#840

CanonicalFusion: Generating Drivable 3D Human Avatars from Multiple Images

Jisu Shin, Junmyeong Lee, Seongmin Lee et al.

ECCV 2024posterarXiv:2407.04345

citations

#841

SMILe: Leveraging Submodular Mutual Information For Robust Few-Shot Object Detection

Anay Majee, Ryan X Sharp, Rishabh Iyer

ECCV 2024posterarXiv:2407.02665

citations

#842

Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment

Chong Li, Xuelin Qian, Yun Wang et al.

ECCV 2024poster

citations

#843

NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image

Yoonwoo Jeong, Jinwoo Lee, Chiheon Kim et al.

ECCV 2024posterarXiv:2312.07315

citations

#844

LG-Gaze: Learning Geometry-aware Continuous Prompts for Language-Guided Gaze Estimation

Pengwei Yin, Jingjing Wang, Guanzhong Zeng et al.

ECCV 2024posterarXiv:2411.08606

citations

#845

Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers’ Opinion Scores

Lucas Goncalves, Prashant Mathur, Chandrashekhar Lavania et al.

ECCV 2024posterarXiv:2404.07336

citations

#846

Brain Netflix: Scaling Data to Reconstruct Videos from Brain Signals

Camilo Fosco, Benjamin Lahner, Bowen Pan et al.

ECCV 2024poster

citations

#847

Learning to Make Keypoints Sub-Pixel Accurate

Shinjeong Kim, Marc Pollefeys, Daniel Barath

ECCV 2024posterarXiv:2407.11668

citations

#848

Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities

Kaiwen Cai, ZheKai Duan, Gaowen Liu et al.

ECCV 2024posterarXiv:2403.04908

citations

#849

3D Small Object Detection with Dynamic Spatial Pruning

Xiuwei Xu, Zhihao Sun, Ziwei Wang et al.

ECCV 2024posterarXiv:2305.03716

citations

#850

GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning

Xiaojie Li, Yibo Yang, Xiangtai Li et al.

ECCV 2024posterarXiv:2403.12003

citations

#851

Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation

Shoumeng Qiu, Jie Chen, Xinrun Li et al.

ECCV 2024posterarXiv:2407.13254

citations

#852

HSR: Holistic 3D Human-Scene Reconstruction from Monocular Videos

Lixin Xue, Chen Guo, Chengwei Zheng et al.

ECCV 2024poster

citations

#853

Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model

Danni Yang, Ruohan Dong, Jiayi Ji et al.

ECCV 2024posterarXiv:2407.05352

citations

#854

GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence

Pengyuan Wang, Takuya Ikeda, Robert Lee et al.

ECCV 2024posterarXiv:2311.13777

citations

#855

Quantized Prompt for Efficient Generalization of Vision-Language Models

Tianxiang Hao, Xiaohan Ding, Juexiao Feng et al.

ECCV 2024posterarXiv:2407.10704

citations

#856

PQ-SAM: Post-training Quantization for Segment Anything Model

Xiaoyu Liu, Xin Ding, Lei Yu et al.

ECCV 2024poster

citations

#857

RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception

Xiaosu Zhu, Hualian Sheng, Sijia Cai et al.

ECCV 2024posterarXiv:2405.09883

citations

#858

PiTe: Pixel-Temporal Alignment for Large Video-Language Model

Yang Liu, Pengxiang Ding, Siteng Huang et al.

ECCV 2024posterarXiv:2409.07239

citations

#859

Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing

Wonjun Kang, Kevin Galim, Hyung Il Koo

ECCV 2024posterarXiv:2403.09468

citations

#860

RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting

Qi Wang, Ruijie Lu, Xudong XU et al.

ECCV 2024posterarXiv:2406.02461

citations

#861

Statewide Visual Geolocalization in the Wild

Florian Fervers, Sebastian Bullinger, Christoph Bodensteiner et al.

ECCV 2024posterarXiv:2409.16763

citations

#862

PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines

Zidong Wang, Zeyu Lu, Di Huang et al.

ECCV 2024posterarXiv:2407.08418

citations

#863

Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion

Sanghyun Kim, Seohyeon Jung, Balhae Kim et al.

ECCV 2024posterarXiv:2407.21032

citations

#864

On Pretraining Data Diversity for Self-Supervised Learning

Hasan Abed El Kader Hammoud, Tuhin Das, Fabio Pizzati et al.

ECCV 2024posterarXiv:2403.13808

citations

#865

GenRC: Generative 3D Room Completion from Sparse Image Collections

Ming-Feng Li, Yueh-Feng Ku, Hong-Xuan Yen et al.

ECCV 2024posterarXiv:2407.12939

citations

#866

X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs

Swetha Sirnam, Jinyu Yang, Tal Neiman et al.

ECCV 2024posterarXiv:2407.13851

citations

#867

BK-SDM: A Lightweight, Fast, and Cheap Version of Stable Diffusion

Bo-Kyeong Kim, Hyoung-Kyu Song, Thibault Castells et al.

ECCV 2024posterarXiv:2305.15798

citations

#868

Layer-Wise Relevance Propagation with Conservation Property for ResNet

Seitaro Otsuki, Tsumugi Iida, Félix Doublet et al.

ECCV 2024posterarXiv:2407.09115

citations

#869

Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation

Jiawei Han, Kaiqi Liu, Wei Li et al.

ECCV 2024posterarXiv:2408.10537

citations

#870

Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation

Wei Cong, Yang Cong, Yuyang Liu et al.

ECCV 2024posterarXiv:2407.09047

citations

#871

O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation

Muer Tie, Julong Wei, Zhengjun Wang et al.

ECCV 2024posterarXiv:2404.06836

citations

#872

Compress3D: a Compressed Latent Space for 3D Generation from a Single Image

Bowen Zhang, Tianyu Yang, Yu Li et al.

ECCV 2024posterarXiv:2403.13524

citations

#873

DynoSurf: Neural Deformation-based Temporally Consistent Dynamic Surface Reconstruction

Yuxin Yao, Siyu Ren, Junhui Hou et al.

ECCV 2024posterarXiv:2403.11586

citations

#874

AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation

Lorenzo Mur Labadia, Ruben Martinez-Cantin, Jose J Guerrero et al.

ECCV 2024posterarXiv:2406.01194

citations

#875

AddBiomechanics Dataset: Capturing the Physics of Human Motion at Scale

Keenon Werling, Janelle M Kaneda, Tian Tan et al.

ECCV 2024posterarXiv:2406.18537

citations

#876

CityGuessr: City-Level Video Geo-Localization on a Global Scale

Parth Parag Kulkarni, Gaurav Kumar Nayak, Shah Mubarak

ECCV 2024posterarXiv:2411.06344

citations

#877

Taming Lookup Tables for Efficient Image Retouching

Sidi Yang, Binxiao Huang, Mingdeng Cao et al.

ECCV 2024posterarXiv:2403.19238

citations

#878

ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model

Fu-Yun Wang, Zhaoyang Huang, Qiang Ma et al.

ECCV 2024poster

citations

#879

PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion

Runsong Zhu, Shi Qiu, Qianyi Wu et al.

ECCV 2024posterarXiv:2410.10659

citations

#880

Flying with Photons: Rendering Novel Views of Propagating Light

Anagh Malik, Noah Juravsky, Ryan Po et al.

ECCV 2024posterarXiv:2404.06493

citations

#881

TriNeRFLet: A Wavelet Based Triplane NeRF Representation

Rajaei Khatib, RAJA GIRYES

ECCV 2024posterarXiv:2401.06191

citations

#882

Bi-TTA: Bidirectional Test-Time Adapter for Remote Physiological Measurement

Haodong LI, Hao LU, Yingcong Chen

ECCV 2024posterarXiv:2409.17316

citations

#883

MinD-3D: Reconstruct High-quality 3D objects in Human Brain

Jianxiong Gao, Yuqian Fu, Yun Wang et al.

ECCV 2024posterarXiv:2312.07485

citations

#884

MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering

Guoxing Sun, Rishabh Dabral, Pascal Fua et al.

ECCV 2024posterarXiv:2403.18820

citations

#885

Zero-Shot Adaptation for Approximate Posterior Sampling of Diffusion Models in Inverse Problems

Yasar Utku Alcalar, Mehmet Akcakaya

ECCV 2024posterarXiv:2407.11288

citations

#886

Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment

Simon Weber, Je Hyeong Hong, Daniel Cremers

ECCV 2024posterarXiv:2405.05079

citations

#887

Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection

Tim Salzmann, Markus Ryll, Alex Bewley et al.

ECCV 2024posterarXiv:2403.14270

citations

#888

Self-Supervised Representation Learning for Adversarial Attack Detection

Yi Li, Plamen Angelov, Neeraj Suri

ECCV 2024posterarXiv:2407.04382

citations

#889

Zero-Shot Image Feature Consensus with Deep Functional Maps

Xinle Cheng, Congyue Deng, Adam Harley et al.

ECCV 2024posterarXiv:2403.12038

citations

#890

Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models

Vitali Petsiuk, Kate Saenko

ECCV 2024posterarXiv:2404.13706

citations

#891

Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention

Xunjiang Gu, Guanyu Song, Igor Gilitschenski et al.

ECCV 2024posterarXiv:2407.06683

citations

#892

Auto-DAS: Automated Proxy Discovery for Training-free Distillation-aware Architecture Search

Haosen SUN, Lujun Li, Peijie Dong et al.

ECCV 2024poster

citations

#893

Parameterized Quasi-Physical Simulators for Dexterous Manipulations Transfer

Xueyi Liu, Kangbo Lyu, jieqiong zhang et al.

ECCV 2024posterarXiv:2404.07988

citations

#894

PoseCrafter: One-Shot Personalized Video Synthesis Following Flexible Pose Control

Yong Zhong, Min Zhao, Zebin You et al.

ECCV 2024posterarXiv:2405.14582

citations

#895

Shape from Heat Conduction

Sriram Narayanan, Mani Ramanagopal, Mark Sheinin et al.

ECCV 2024poster

citations

#896

FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection

Zheng Jiang, Jinqing Zhang, Yanan Zhang et al.

ECCV 2024posterarXiv:2407.10135

citations

#897

Enriching Information and Preserving Semantic Consistency in Expanding Curvilinear Object Segmentation Datasets

Qin Lei, Jiang Zhong, Qizhu Dai

ECCV 2024posterarXiv:2407.08209

citations

#898

Towards Scene Graph Anticipation

Rohith Peddi, Saksham Singh, Saurabh . et al.

ECCV 2024posterarXiv:2403.04899

citations

#899

UniCal: Unified Neural Sensor Calibration

Ze Yang, George G Chen, Haowei Zhang et al.

ECCV 2024posterarXiv:2409.18953

citations

#900

Made to Order: Discovering monotonic temporal changes via self-supervised video ordering

Charig Yang, Weidi Xie, Andrew ZISSERMAN

ECCV 2024posterarXiv:2404.16828

citations

#901

Preventing Catastrophic Overfitting in Fast Adversarial Training: A Bi-level Optimization Perspective

Zhaoxin Wang, Handing Wang, Cong Tian et al.

ECCV 2024posterarXiv:2407.12443

citations

#902

SLIM: Spuriousness Mitigation with Minimal Human Annotations

Xiwei Xuan, Ziquan Deng, Hsuan-Tien Lin et al.

ECCV 2024posterarXiv:2407.05594

citations

#903

D4-VTON: Dynamic Semantics Disentangling for Differential Diffusion based Virtual Try-On

Zhaotong Yang, Zicheng Jiang, Xinzhe Li et al.

ECCV 2024poster

citations

#904

RAVE: Residual Vector Embedding for CLIP-Guided Backlit Image Enhancement

Tatiana Gaintseva, Martin Benning, Greg Slabaugh

ECCV 2024posterarXiv:2404.01889

citations

#905

Toward Tiny and High-quality Facial Makeup with Data Amplify Learning

Qiaoqiao Jin, Xuanhong Chen, Meiguang Jin et al.

ECCV 2024posterarXiv:2403.15033

citations

#906

You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception

Sheng Jin, Shuhuai Li, Tong Li et al.

ECCV 2024posterarXiv:2312.05525

citations

#907

Plan, Posture and Go: Towards Open-vocabulary Text-to-Motion Generation

Jinpeng Liu, Wenxun Dai, Chunyu Wang et al.

ECCV 2024poster

citations

#908

Flash Cache: Reducing Bias in Radiance Cache Based Inverse Rendering

Benjamin Attal, Dor Verbin, Ben Mildenhall et al.

ECCV 2024posterarXiv:2409.05867

citations

#909

RePOSE: 3D Human Pose Estimation via Spatio-Temporal Depth Relational Consistency

Ziming Sun, Yuan Liang, Zejun Ma et al.

ECCV 2024poster

citations

#910

Bottom-Up Domain Prompt Tuning for Generalized Face Anti-Spoofing

SI-QI LIU, Qirui Wang, Pong Chi Yuen

ECCV 2024poster

citations

#911

PolyOculus: Simultaneous Multi-view Image-based Novel View Synthesis

Jason Yu, Tristan Aumentado-Armstrong, Fereshteh Forghani et al.

ECCV 2024posterarXiv:2402.17986

citations

#912

Bidirectional Progressive Transformer for Interaction Intention Anticipation

Zichen Zhang, Hongchen Luo, Wei Zhai et al.

ECCV 2024posterarXiv:2405.05552

citations

#913

Memory-Efficient Fine-Tuning for Quantized Diffusion Model

Hyogon Ryu, Seohyun Lim, Hyunjung Shim

ECCV 2024posterarXiv:2401.04339

citations

#914

TTT-MIM: Test-Time Training with Masked Image Modeling for Denoising Distribution Shifts

Youssef Mansour, Xuyang Zhong, Serdar Caglar et al.

ECCV 2024poster

citations

#915

Layered Rendering Diffusion Model for Controllable Zero-Shot Image Synthesis

Zipeng Qi, Guoxi Huang, Chenyang Liu et al.

ECCV 2024posterarXiv:2311.18435

citations

#916

Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model

Zhening Liu, XINJIE ZHANG, Jiawei Shao et al.

ECCV 2024posterarXiv:2407.10632

citations

#917

DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation

Soojin Jang, JungMin Yun, JuneHyoung Kwon et al.

ECCV 2024posterarXiv:2409.15801

citations

#918

Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable Repainting

Junwu Zhang, Zhenyu Tang, Yatian Pang et al.

ECCV 2024poster

citations

#919

Relightable Neural Actor with Intrinsic Decomposition and Pose Control

Diogo Carbonera Luvizon, Vladislav Golyanik, Adam Kortylewski et al.

ECCV 2024posterarXiv:2312.11587

citations

#920

PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation

Jian Ma, Chen Chen, Qingsong Xie et al.

ECCV 2024posterarXiv:2311.17086

citations

#921

DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation

Wenliang Zhao, Haolin Wang, Jie Zhou et al.

ECCV 2024posterarXiv:2409.03755

citations

#922

An Explainable Vision Question Answer Model via Diffusion Chain-of-Thought

Chunhao LU, Qiang Lu, Jake Luo

ECCV 2024poster

citations

#923

Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos

Ekta Prashnani, Koki Nagano, Shalini De Mello et al.

ECCV 2024posterarXiv:2305.03713

citations

#924

High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior

Shen Jianbing, Wencheng Han

ECCV 2024posterarXiv:2408.00361

citations

#925

GazeXplain: Learning to Predict Natural Language Explanations of Visual Scanpaths

Xianyu Chen, Ming Jiang, Qi Zhao

ECCV 2024posterarXiv:2408.02788

citations

#926

Un-EVIMO: Unsupervised Event-based Independent Motion Segmentation

Ziyun Wang, Jinyuan Guo, Kostas Daniilidis

ECCV 2024posterarXiv:2312.00114

citations

#927

WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation

Tianjian Jiang, Johsan Billingham, Sebastian Müksch et al.

ECCV 2024posterarXiv:2501.02771

citations

#928

Prompting Future Driven Diffusion Model for Hand Motion Prediction

Bowen Tang, Kaihao Zhang, Wenhan Luo et al.

ECCV 2024poster

citations

#929

ARoFace: Alignment Robustness to Improve Low-quality Face Recognition

Mohammad Saeed Ebrahimi Saadabadi, Sahar Rahimi Malakshan, Ali Dabouei et al.

ECCV 2024posterarXiv:2407.14972

citations

#930

GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields

Xiufeng HUANG, Ka Chun Cheung, Simon See et al.

ECCV 2024posterarXiv:2407.13390

citations

#931

COMO: Compact Mapping and Odometry

Eric Dexheimer, Andrew Davison

ECCV 2024posterarXiv:2404.03531

citations

#932

Within the Dynamic Context: Inertia-aware 3D Human Modeling with Pose Sequence

Yutong Chen, Yifan Zhan, Zhihang Zhong et al.

ECCV 2024posterarXiv:2403.19160

citations

#933

ProCreate, Don't Reproduce! Propulsive Energy Diffusion for Creative Generation

Jack Lu, Ryan Teehan, Mengye Ren

ECCV 2024posterarXiv:2408.02226

citations

#934

Learning Neural Volumetric Pose Features for Camera Localization

Jingyu Lin, Jiaqi Gu, Bojian Wu et al.

ECCV 2024posterarXiv:2403.12800

citations

#935

Anytime Continual Learning for Open Vocabulary Classification

Zhen Zhu, Yiming Gong, Derek Hoiem

ECCV 2024posterarXiv:2409.08518

citations

#936

Markov Knowledge Distillation: Make Nasty Teachers trained by Self-undermining Knowledge Distillation Fully Distillable

En-Hui Yang, Linfeng Ye

ECCV 2024poster

citations

#937

ConDense: Consistent 2D-3D Pre-training for Dense and Sparse Features from Multi-View Images

Xiaoshuai Zhang, Zhicheng Wang, Howard Zhou et al.

ECCV 2024posterarXiv:2408.17027

citations

#938

Distill Gold from Massive Ores: Bi-level Data Pruning towards Efficient Dataset Distillation

YUE XU, Yong-Lu Li, Kaitong Cui et al.

ECCV 2024posterarXiv:2305.18381

citations

#939

Improving Knowledge Distillation via Regularizing Feature Direction and Norm

Yuzhu Wang, Lechao Cheng, Manni Duan et al.

ECCV 2024poster

citations

#940

Self-supervised co-salient object detection via feature correspondences at multiple scales

Souradeep Chakraborty, Dimitris Samaras

ECCV 2024posterarXiv:2403.11107

citations

#941

DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays

Baochang Zhang, Zhi Qiao, Runkun Liu et al.

ECCV 2024posterarXiv:2407.13545

citations

#942

AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval

Pavel Suma, Giorgos Kordopatis-Zilos, Ahmet Iscen et al.

ECCV 2024posterarXiv:2408.03282

citations

#943

Camera Calibration using a Collimator System

Shunkun Liang, Banglei Guan, Zhenbao Yu et al.

ECCV 2024posterarXiv:2409.20034

citations

#944

Quality Assured: Rethinking Annotation Strategies in Imaging AI

Tim Rädsch, Annika Reinke, Vivienn Weru et al.

ECCV 2024posterarXiv:2407.17596

citations

#945

Snuffy: Efficient Whole Slide Image Classifier

Hossein Jafarinia, Alireza Alipanah, Saeed Razavi et al.

ECCV 2024posterarXiv:2408.08258

citations

#946

FARSE-CNN: Fully Asynchronous, Recurrent and Sparse Event-Based CNN

Riccardo Santambrogio, Marco Cannici, Matteo Matteucci

ECCV 2024poster

citations

#947

Hyperion – A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM

David Hug, Ignacio Alzugaray Lopez, Margarita Chli

ECCV 2024posterarXiv:2407.07074

citations

#948

AWOL: Analysis WithOut synthesis using Language

Silvia Zuffi, Michael J. Black

ECCV 2024posterarXiv:2404.03042

citations

#949

SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking

Siyuan Li, Lei Ke, Yung-Hsu Yang et al.

ECCV 2024posterarXiv:2409.11235

citations

#950

Trainable Highly-expressive Activation Functions

Irit Chelly, Shahaf Finder, Shira Ifergane et al.

ECCV 2024posterarXiv:2407.07564

citations

#951

RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection

Ming Chang, Xishan Zhang, Rui Zhang et al.

ECCV 2024poster

citations

#952

UniINR: Event-guided Unified Rolling Shutter Correction, Deblurring, and Interpolation

Yunfan Lu, Guoqiang Liang, Yusheng Wang et al.

ECCV 2024posterarXiv:2305.15078

citations

#953

PartImageNet++ Dataset: Scaling up Part-based Models for Robust Recognition

Xiao Li, Yining Liu, Na Dong et al.

ECCV 2024posterarXiv:2407.10918

citations

#954

IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception

Shaohong Wang, Lu Bin, Xinyu Xiao et al.

ECCV 2024posterarXiv:2407.09857

citations

#955

VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing

Shang Liu, Chaohui Yu, Chenjie Cao et al.

ECCV 2024posterarXiv:2407.04461

citations

#956

Camera-LiDAR Cross-modality Gait Recognition

Wenxuan Guo, Yingping Liang, Zhiyu Pan et al.

ECCV 2024posterarXiv:2407.02038

citations

#957

PairingNet: A Learning-based Pair-searching and -matching Network for Image Fragments

rixin zhou, Ding Xia, YI ZHANG et al.

ECCV 2024posterarXiv:2312.08704

citations

#958

SemReg: Semantics Constrained Point Cloud Registration

Sheldon Fung, Xuequan Lu, Dasith de Silva Edirimuni et al.

ECCV 2024poster

citations

#959

Personalized Video Relighting With an At-Home Light Stage

Jun Myeong Choi, Max Christman, Roni Sengupta

ECCV 2024posterarXiv:2311.08843

citations

#960

Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems

Sojin Lee, Dogyun Park, Inho Kong et al.

ECCV 2024posterarXiv:2407.16125

citations

#961

Temporal Residual Jacobians for Rig-free Motion Transfer

Sanjeev Muralikrishnan, Niladri Shekhar Dutt, Siddhartha Chaudhuri et al.

ECCV 2024posterarXiv:2407.14958

citations

#962

From Fake to Real: Pretraining on Balanced Synthetic Images to Prevent Spurious Correlations in Image Recognition

Maan Qraitem, Kate Saenko, Bryan Plummer

ECCV 2024posterarXiv:2308.04553

citations

#963

Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images

Chuanrui Zhang, Yonggen Ling, Minglei Lu et al.

ECCV 2024posterarXiv:2407.06984

citations

#964

Data Augmentation via Latent Diffusion for Saliency Prediction

Bahar Aydemir, Deblina Bhattacharjee, Tong Zhang et al.

ECCV 2024posterarXiv:2409.07307

citations

#965

Understanding Physical Dynamics with Counterfactual World Modeling

Rahul Mysore Venkatesh, Honglin Chen, Kevin Feigelis et al.

ECCV 2024posterarXiv:2312.06721

citations

#966

Insect Identification in the Wild: The AMI Dataset

Aditya Jain, Fagner Cunha, Michael J Bunsen et al.

ECCV 2024posterarXiv:2406.12452

citations

#967

HGL: Hierarchical Geometry Learning for Test-time Adaptation in 3D Point Cloud Segmentation

Tianpei Zou, Sanqing Qu, Zhijun Li et al.

ECCV 2024posterarXiv:2407.12387

citations

#968

Rawformer: Unpaired Raw-to-Raw Translation for Learnable Camera ISPs

Georgy Perevozchikov, Nancy Mehta, Mahmoud Afifi et al.

ECCV 2024posterarXiv:2404.10700

citations

#969

Self-Supervised Audio-Visual Soundscape Stylization

Tingle Li, Renhao Wang, Po-Yao Huang et al.

ECCV 2024posterarXiv:2409.14340

citations

#970

Improving Robustness to Model Inversion Attacks via Sparse Coding Architectures

Sayanton Vhaduri Dibbo, Adam Breuer, Juston Moore et al.

ECCV 2024posterarXiv:2403.14772

citations

#971

Zero-Shot Multi-Object Scene Completion

Shun Iwase, Katherine Liu, Vitor Guizilini et al.

ECCV 2024posterarXiv:2403.14628

citations

#972

Agglomerative Token Clustering

Joakim Bruslund Haurum, Sergio Escalera, Graham W. Taylor et al.

ECCV 2024posterarXiv:2409.11923

citations

#973

Weight Conditioning for Smooth Optimization of Neural Networks

Hemanth Saratchandran, Thomas X Wang, Simon Lucey

ECCV 2024posterarXiv:2409.03424

citations

#974

Layout-Corrector: Alleviating Layout Sticking Phenomenon in Discrete Diffusion Model

Shoma Iwai, Atsuki Osanai, Shunsuke Kitada et al.

ECCV 2024posterarXiv:2409.16689

citations

#975

External Knowledge Enhanced 3D Scene Generation from Sketch

Zijie Wu, Mingtao Feng, Yaonan Wang et al.

ECCV 2024posterarXiv:2403.14121

citations

#976

Click Prompt Learning with Optimal Transport for Interactive Segmentation

Jie Liu, haochen wang, Wenzhe Yin et al.

ECCV 2024poster

citations

#977

PARIS3D: Reasoning-based 3D Part Segmentation Using Large Multimodal Model

Amrin Kareem, Jean Lahoud, Hisham Cholakkal

ECCV 2024posterarXiv:2404.03836

citations

#978

A high-quality robust diffusion framework for corrupted dataset

Quan Dao, Binh Ta, Tung Pham et al.

ECCV 2024posterarXiv:2311.17101

citations

#979

Unsupervised Multi-modal Medical Image Registration via Invertible Translation

Mengjie Guo

ECCV 2024poster

citations

#980

Shedding More Light on Robust Classifiers under the lens of Energy-based Models

Mujtaba Hussain Mirza, Maria Rosaria Briglia, Senad Beadini et al.

ECCV 2024posterarXiv:2407.06315

citations

#981

Reshaping the Online Data Buffering and Organizing Mechanism for Continual Test-Time Adaptation

Zhilin Zhu, Xiaopeng Hong, Zhiheng Ma et al.

ECCV 2024posterarXiv:2407.09367

citations

#982

DiffCD: A Symmetric Differentiable Chamfer Distance for Neural Implicit Surface Fitting

Linus Härenstam-Nielsen, Lu Sang, Abhishek Saroha et al.

ECCV 2024posterarXiv:2407.17058

citations

#983

Any Target Can be Offense: Adversarial Example Generation via Generalized Latent Infection

Youheng Sun, Shengming Yuan, Xuanhan Wang et al.

ECCV 2024posterarXiv:2407.12292

citations

#984

Improving Domain Generalization in Self-Supervised Monocular Depth Estimation via Stabilized Adversarial Training

Yuanqi Yao, Gang Wu, Kui Jiang et al.

ECCV 2024posterarXiv:2411.02149

citations

#985

Improving Zero-Shot Generalization for CLIP with Variational Adapter

Ziqian Lu, Fengli Shen, Mushui Liu et al.

ECCV 2024poster

citations

#986

Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM

Baicheng Li, Zike Yan, Dong Wu et al.

ECCV 2024posterarXiv:2407.13338

citations

#987

Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation

Junsung Lee, Minsoo Kang, Bohyung Han

ECCV 2024posterarXiv:2409.08077

citations

#988

EA-VTR: Event-Aware Video-Text Retrieval

Zongyang Ma, Ziqi Zhang, Yuxin Chen et al.

ECCV 2024posterarXiv:2407.07478

citations

#989

Audio-visual Generalized Zero-shot Learning the Easy Way

Shentong Mo, Pedro Morgado

ECCV 2024posterarXiv:2407.13095

citations

#990

Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning Based on Visually Grounded Conversations

KILICHBEK HAYDAROV, Xiaoqian Shen, Avinash Madasu et al.

ECCV 2024posterarXiv:2308.16349

citations

#991

Learning Cross-hand Policies of High-DOF Reaching and Grasping

Qijin She, Shishun Zhang, Yunfan Ye et al.

ECCV 2024posterarXiv:2404.09150

citations

#992

DySeT: a Dynamic Masked Self-distillation Approach for Robust Trajectory Prediction

MOZHGAN POURKESHAVARZ, Arielle Zhang, Amir Rasouli

ECCV 2024poster

citations

#993

MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection

Youngmin Oh, Hyung-Il Kim, Seong Tae Kim et al.

ECCV 2024posterarXiv:2407.16448

citations

#994

FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance

Jiedong Zhuang, Jiaqi Hu, Lianrui Mu et al.

ECCV 2024posterarXiv:2407.05578

citations

#995

VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement

Hanjung Kim, Jaehyun Kang, Miran Heo et al.

ECCV 2024posterarXiv:2312.04885

citations

#996

Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics

Shishira R Maiya, Anubhav Anubhav, Matthew Gwilliam et al.

ECCV 2024posterarXiv:2408.02672

citations

#997

SkyScenes: A Synthetic Dataset for Aerial Scene Understanding

Sahil Santosh Khose, Anisha Pal, Aayushi Agarwal et al.

ECCV 2024posterarXiv:2312.06719

citations

#998

Concise Plane Arrangements for Low-Poly Surface and Volume Modelling

Raphael Sulzer, Florent Lafarge

ECCV 2024posterarXiv:2404.06154

citations

#999

FrePolad: Frequency-Rectified Point Latent Diffusion for Point Cloud Generation

Chenliang Zhou, Fangcheng Zhong, Param Hanji et al.

ECCV 2024posterarXiv:2311.12090

citations

#1000

DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors

Zizheng Yan, Jiapeng Zhou, Fanpeng Meng et al.

ECCV 2024posterarXiv:2407.16260

citations

← Previous

1...3 4 5 6 7...12