Most Cited ECCV "message-passing networks" Papers

2,387 papers found • Page 10 of 12

#1801

Towards Stable 3D Object Detection

Jiabao Wang, Qiang Meng, Guochao Liu et al.

ECCV 2024 • arXiv:2407.04305
#1802

HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes

Zhuopeng Li, Yilin Zhang, Chenming Wu et al.

ECCV 2024 • arXiv:2403.20032
#1803

KeypointDETR: An End-to-End 3D Keypoint Detector

Hairong Jin, Yuefan Shen, Jianwen Lou et al.

ECCV 2024
#1804

Generating Human Interaction Motions in Scenes with Text Control

Hongwei Yi, Justus Thies, Michael J. Black et al.

ECCV 2024 • arXiv:2404.10685
#1805

Optimizing Illuminant Estimation in Dual-Exposure HDR Imaging

Mahmoud Afifi, Zhenhua Hu, Liang Liang

ECCV 2024 • arXiv:2403.02449
#1806

Revisit Human-Scene Interaction via Space Occupancy

Xinpeng Liu, Haowen Hou, Yanchao Yang et al.

ECCV 2024 • arXiv:2312.02700
#1807

Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation

Hyun Seok Seong, WonJun Moon, SuBeen Lee et al.

ECCV 2024 • arXiv:2407.12463
#1808

Multi-branch Collaborative Learning Network for 3D Visual Grounding

Zhipeng Qian, Yiwei Ma, Zhekai Lin et al.

ECCV 2024 • arXiv:2407.05363
#1809

FLAT: Flux-aware Imperceptible Adversarial Attacks on 3D Point Clouds

Keke Tang, Lujie Huang, Weilong Peng et al.

ECCV 2024
#1810

Instruction Tuning-free Visual Token Complement for Multimodal LLMs

Dongsheng Wang, Jiequan Cui, Miaoge Li et al.

ECCV 2024 • arXiv:2408.05019
#1811

Improving Point-based Crowd Counting and Localization Based on Auxiliary Point Guidance

I-Hsiang Chen, Wei-Ting Chen, Yu-Wei Liu et al.

ECCV 2024 • arXiv:2405.10589
#1812

JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation

ChenHan Jiang, Yihan Zeng, Tianyang Hu et al.

ECCV 2024 • arXiv:2407.12291
#1813

Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts

Shuangkang Fang, Yufeng Wang, Yi-Hsuan Tsai et al.

ECCV 2024 • arXiv:2407.06842
#1814

Online Temporal Action Localization with Memory-Augmented Transformer

Youngkil Song, Dongkeun Kim, Minsu Cho et al.

ECCV 2024 • arXiv:2408.02957
#1815

Disentangled Generation and Aggregation for Robust Radiance Fields

Shihe Shen, Huachen Gao, Wangze Xu et al.

ECCV 2024 • arXiv:2409.15715
#1816

MANIKIN: Biomechanically Accurate Neural Inverse Kinematics for Human Motion Estimation

Jiaxi Jiang, Paul Streli, Xuejing Luo et al.

ECCV 2024
#1817

Click-Gaussian: Interactive Segmentation to Any 3D Gaussians

Seokhun Choi, Hyeonseop Song, Jaechul Kim et al.

ECCV 2024 • arXiv:2407.11793
#1818

Online Vectorized HD Map Construction using Geometry

Zhixin Zhang, Yiyuan Zhang, Xiaohan Ding et al.

ECCV 2024 • arXiv:2312.03341
#1819

HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation

Shanyan Guan, Yanhao Ge, Ying Tai et al.

ECCV 2024 • arXiv:2410.08192
#1820

Diffusion-Guided Weakly Supervised Semantic Segmentation

Sung-Hoon Yoon, Hoyong Kwon, Jaeseok Jeong et al.

ECCV 2024
#1821

Panel-Specific Degradation Representation for Raw Under-Display Camera Image Restoration

Youngjin Oh, Keuntek Lee, Jooyoung Lee et al.

ECCV 2024
#1822

SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer

Zijie Wu, Chaohui Yu, Yanqin Jiang et al.

ECCV 2024 • arXiv:2404.03736
#1823

Fully Authentic Visual Question Answering Dataset from Online Communities

Chongyan Chen, Mengchen Liu, Noel C Codella et al.

ECCV 2024 • arXiv:2311.15562
#1824

CoMusion: Towards Consistent Stochastic Human Motion Prediction via Motion Diffusion

Jiarui Sun, Girish Chowdhary

ECCV 2024 • arXiv:2305.12554
#1825

Real-data-driven 2000 FPS Color Video from Mosaicked Chromatic Spikes

Siqi Yang, Zhaojun Huang, Yakun Chang et al.

ECCV 2024
#1826

Revisit Self-supervision with Local Structure-from-Motion

Shengjie Zhu, Xiaoming Liu

ECCV 2024
#1827

On the Viability of Monocular Depth Pre-training for Semantic Segmentation

Dong Lao, Fengyu Yang, Daniel Wang et al.

ECCV 2024 • arXiv:2203.13987
#1828

Weakly-supervised Camera Localization by Ground-to-satellite Image Registration

Yujiao Shi, Hongdong Li, Akhil Perincherry et al.

ECCV 2024 • arXiv:2409.06471
#1829

GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval

Han Zhou, Wei Dong, Xiaohong Liu et al.

ECCV 2024 • arXiv:2407.12431
#1830

Open-Vocabulary Camouflaged Object Segmentation

Youwei Pang, Xiaoqi Zhao, JiaMing Zuo et al.

ECCV 2024 • arXiv:2311.11241
#1831

ProMerge: Prompt and Merge for Unsupervised Instance Segmentation

Dylan Li, Gyungin Shin

ECCV 2024 • arXiv:2409.18961
#1832

GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection

Ziying Song, Lei Yang, Shaoqing Xu et al.

ECCV 2024 • arXiv:2403.11848
#1833

Revisiting Feature Disentanglement Strategy in Diffusion Training and Breaking Conditional Independence Assumption in Sampling

Wonwoong Cho, Hareesh Ravi, Midhun Harikumar et al.

ECCV 2024
#1834

ProtoComp: Diverse Point Cloud Completion with Controllable Prototype

Xumin Yu, Yanbo Wang, Jie Zhou et al.

ECCV 2024
#1835

IAM-VFI: Interpolate Any Motion for Video Frame Interpolation with motion complexity map

Kihwan Yoon, Yong Han Kim, Sungjei Kim et al.

ECCV 2024
#1836

Co-Student: Collaborating Strong and Weak Students for Sparsely Annotated Object Detection

Lianjun Wu, Jiangxiao Han, Zengqiang Zheng et al.

ECCV 2024
#1837

Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture

Xuanchen Li, Yuhao Cheng, Xingyu Ren et al.

ECCV 2024 • arXiv:2406.00440
#1838

EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis

Shuai Tan, Bin Ji, Mengxiao Bi et al.

ECCV 2024 • arXiv:2404.01647
#1839

Geospecific View Generation - Geometry-Context Aware High-resolution Ground View Inference from Satellite Views

Ningli Xu, Rongjun Qin

ECCV 2024 • arXiv:2407.08061
#1840

Put Myself in Your Shoes: Lifting the Egocentric Perspective from Exocentric Videos

Mi Luo, Zihui Xue, Alex Dimakis et al.

ECCV 2024 • arXiv:2403.06351
#1841

TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation

Yufei Liu, Junwei Zhu, Junshu Tang et al.

ECCV 2024 • arXiv:2403.12906
#1842

Privacy-Preserving Adaptive Re-Identification without Image Transfer

Hamza Rami, Jhony H. Giraldo, Nicolas Winckler et al.

ECCV 2024 • arXiv:2407.12589
#1843

LivePhoto: Real Image Animation with Text-guided Motion Control

Xi Chen, Zhiheng Liu, Mengting Chen et al.

ECCV 2024 • arXiv:2312.02928
#1844

GroupDiff: Diffusion-based Group Portrait Editing

Yuming Jiang, Nanxuan Zhao, Qing Liu et al.

ECCV 2024 • arXiv:2409.14379
#1845

Motion Aware Event Representation-driven Image Deblurring

Zhijing Sun, Xueyang Fu, Longzhuo Huang et al.

ECCV 2024
#1846

DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction

Yanlong Li, Chamara Madarasingha, Kanchana Thilakarathna

ECCV 2024 • arXiv:2312.03298
#1847

OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing

Pranav Gupta, Rishubh Singh, Pradeep Shenoy et al.

ECCV 2024 • arXiv:2411.02858
#1848

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion

Wendi Zheng, Jiayan Teng, Zhuoyi Yang et al.

ECCV 2024 • arXiv:2403.05121
#1849

Uncertainty Calibration with Energy Based Instance-wise Scaling in the Wild Dataset

Mijoo Kim, Junseok Kwon

ECCV 2024 • arXiv:2407.12330
#1850

OAPT: Offset-Aware Partition Transformer for Double JPEG Artifacts Removal

Qiao Mo, Yukang Ding, Jinhua Hao et al.

ECCV 2024 • arXiv:2408.11480
#1851

Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration

Shihao Zhou, Jinshan Pan, Jinglei Shi et al.

ECCV 2024 • arXiv:2404.00288
#1852

Category Adaptation Meets Projected Distillation in Generalized Continual Category Discovery

Grzegorz Rypesc, Daniel Marczak, Sebastian Cygert et al.

ECCV 2024
#1853

Animate Your Motion: Turning Still Images into Dynamic Videos

Mingxiao Li, Bo Wan, Marie-Francine Moens et al.

ECCV 2024 • arXiv:2403.10179
#1854

Spatial-Temporal Multi-level Association for Video Object Segmentation

Deshui Miao, Xin Li, Zhenyu He et al.

ECCV 2024 • arXiv:2404.06265
#1855

High-Resolution and Few-shot View Synthesis from Asymmetric Dual-lens Inputs

Ruikang Xu, Mingde Yao, Yue Li et al.

ECCV 2024
#1856

MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation

Xiaoshuai Hao, Ruikai Li, Hui Zhang et al.

ECCV 2024 • arXiv:2407.11682
#1857

Context-Aware Action Recognition: Introducing a Comprehensive Dataset for Behavior Contrast

Tatsuya Sasaki, Yoshiki Ito, Satoshi Kondo

ECCV 2024
#1858

ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images

Xiangtian Xue, Jiasong Wu, Youyong Kong et al.

ECCV 2024 • arXiv:2403.10004
#1859

AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models

Xuelong Dai, Kaisheng Liang, Bin Xiao

ECCV 2024 • arXiv:2307.12499
#1860

Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition

Masashi Hatano, Ryo Hachiuma, Ryo Fujii et al.

ECCV 2024 • arXiv:2405.19917
#1861

Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized Visual Class Discovery

Haiyang Zheng, Pu Nan, Wenjing Li et al.

ECCV 2024 • arXiv:2403.07369
#1862

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Jinbo Xing, Menghan Xia, Yong Zhang et al.

ECCV 2024 • arXiv:2310.12190
#1863

UniProcessor: A Text-induced Unified Low-level Image Processor

Huiyu Duan, Xiongkuo Min, Sijing Wu et al.

ECCV 2024 • arXiv:2407.20928
#1864

Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors

Tongkun Guan, Wei Shen, Xue Yang et al.

ECCV 2024 • arXiv:2312.05286
#1865

Textual Grounding for Open-vocabulary Visual Information Extraction in Layout-diversified Documents

Mengjun Cheng, Chengquan Zhang, Chang Liu et al.

ECCV 2024
#1866

TAPTR: Tracking Any Point with Transformers as Detection

Hongyang Li, Hao Zhang, Shilong Liu et al.

ECCV 2024 • arXiv:2403.13042
#1867

Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach

Shizhou Zhang, Wenlong Luo, De Cheng et al.

ECCV 2024 • arXiv:2408.07500
#1868

Learning Chain of Counterfactual Thought for Bias-Robust Vision-Language Reasoning

Yifeng Zhang, Ming Jiang, Qi Zhao

ECCV 2024
#1869

Text2Place: Affordance-aware Text Guided Human Placement

Rishubh Parihar, Harsh Gupta, Sachidanand VS et al.

ECCV 2024 • arXiv:2407.15446
#1870

CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance

Zhipeng Hu, Yongqiang Zhang, Chen Liu et al.

ECCV 2024
#1871

Relightable 3D Gaussians: Realistic Point Cloud Relighting with BRDF Decomposition and Ray Tracing

Jian Gao, Chun Gu, Youtian Lin et al.

ECCV 2024 • arXiv:2311.16043
#1872

A Closer Look at GAN Priors: Exploiting Intermediate Features for Enhanced Model Inversion Attacks

Yixiang Qiu, Hao Fang, Hongyao Yu et al.

ECCV 2024 • arXiv:2407.13863
#1873

BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering

Xinmin Qiu, Congying Han, Zicheng Zhang et al.

ECCV 2024 • arXiv:2403.06243
#1874

Let the Avatar Talk using Texts without Paired Training Data

Xiuzhe Wu, Yang-Tian Sun, Handi Chen et al.

ECCV 2024
#1875

SAFARI: Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation

Sayan Nag, Koustava Goswami, Srikrishna Karanam

ECCV 2024
#1876

MAD-DR: Map Compression for Visual Localization with Matchness Aware Descriptor Dimension Reduction

Qiang Wang

ECCV 2024
#1877

Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution

Xi Yang, Chenhang He, Jianqi Ma et al.

ECCV 2024 • arXiv:2312.00853
#1878

LLM as Copilot for Coarse-grained Vision-and-Language Navigation

Yanyuan Qiao, Qianyi Liu, Jiajun Liu et al.

ECCV 2024
#1879

Physically Plausible Color Correction for Neural Radiance Fields

Qi Zhang, Ying Feng, Hongdong Li

ECCV 2024
#1880

Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation

Fu-Yun Wang, Xiaoshi Wu, Zhaoyang Huang et al.

ECCV 2024 • arXiv:2403.13745
#1881

Attention Beats Linear for Fast Implicit Neural Representation Generation

Shuyi Zhang, Ke Liu, Jingjun Gu et al.

ECCV 2024 • arXiv:2407.15355
#1882

Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning

Yunbin Tu, Liang Li, Li Su et al.

ECCV 2024 • arXiv:2407.11683
#1883

Prompt-Driven Contrastive Learning for Transferable Adversarial Attacks

Hunmin Yang, Jongoh Jeong, Kuk-Jin Yoon

ECCV 2024 • arXiv:2407.20657
#1884

PLOT: Text-based Person Search with Part Slot Attention for Corresponding Part Discovery

Jicheol Park, Dongwon Kim, Boseung Jeong et al.

ECCV 2024 • arXiv:2409.13475
#1885

Prompt-Based Test-Time Real Image Dehazing: A Novel Pipeline

Zixuan Chen, Zewei He, Ziqian Lu et al.

ECCV 2024 • arXiv:2309.17389
#1886

RCS-Prompt: Learning Prompt to Rearrange Class Space for Prompt-based Continual Learning

Longrong Yang, Hanbin Zhao, Yunlong Yu et al.

ECCV 2024
#1887

Dynamic Guidance Adversarial Distillation with Enhanced Teacher Knowledge

Hyejin Park, Dongbo Min

ECCV 2024 • arXiv:2409.01627
#1888

Solving Motion Planning Tasks with a Scalable Generative Model

Yihan Hu, Siqi Chai, Zhening Yang et al.

ECCV 2024 • arXiv:2407.02797
#1889

Versatile Incremental Learning: Towards Class and Domain-Agnostic Incremental Learning

Minyeong Park, Jae-Ho Lee, Gyeong-Moon Park

ECCV 2024 • arXiv:2409.10956
#1890

Parrot Captions Teach CLIP to Spot Text

Yiqi Lin, Conghui He, Alex Jinpeng Wang et al.

ECCV 2024 • arXiv:2312.14232
#1891

Gaussian Grouping: Segment and Edit Anything in 3D Scenes

Mingqiao Ye, Martin Danelljan, Fisher Yu et al.

ECCV 2024 • arXiv:2312.00732
#1892

3D Hand Sequence Recovery from Real Blurry Images and Event Stream

Joonkyu Park, Gyeongsik Moon, Weipeng Xu et al.

ECCV 2024
#1893

A Direct Approach to Viewing Graph Solvability

Federica Arrigoni, Andrea Fusiello, Tomas Pajdla

ECCV 2024
#1894

Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediction Tasks

Manyuan Zhang, Guanglu Song, Xiaoyu Shi et al.

ECCV 2024
#1895

Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models

Yasi Zhang, Peiyu Yu, Ying Nian Wu

ECCV 2024 • arXiv:2404.07389
#1896

Unsqueeze [CLS] Bottleneck to Learn Rich Representations

Qing Su, Shihao Ji

ECCV 2024 • arXiv:2407.17671
#1897

DomainFusion: Generalizing To Unseen Domains with Latent Diffusion Models

Yuyang Huang, Yabo Chen, Yuchen Liu et al.

ECCV 2024
#1898

Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors

Jae Joong Lee, Bosheng Li, Sara Beery et al.

ECCV 2024 • arXiv:2407.10330
#1899

Segmentation-guided Layer-wise Image Vectorization with Gradient Fills

Hengyu Zhou, Hui Zhang, Bin Wang

ECCV 2024 • arXiv:2408.15741
#1900

Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects

Zicong Fan, Takehiko Ohkawa, Linlin Yang et al.

ECCV 2024 • arXiv:2403.16428
#1901

TrajPrompt: Aligning Color Trajectory with Vision-Language Representations

Li-Wu Tsao, Hao-Tang Tsui, Yu-Rou Tuan et al.

ECCV 2024
#1902

Strike a Balance in Continual Panoptic Segmentation

Jinpeng Chen, Runmin Cong, Yuxuan Luo et al.

ECCV 2024 • arXiv:2407.16354
#1903

Expressive Whole-Body 3D Gaussian Avatar

Gyeongsik Moon, Takaaki Shiratori, Shunsuke Saito

ECCV 2024 • arXiv:2407.21686
#1904

HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting

Zhenglin Zhou, Fan Ma, Hehe Fan et al.

ECCV 2024 • arXiv:2402.06149
#1905

Explicitly Guided Information Interaction Network for Cross-modal Point Cloud Completion

Hang Xu, Chen Long, Wenxiao Zhang et al.

ECCV 2024 • arXiv:2407.02887
#1906

Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture

ShahRukh Athar, Shunsuke Saito, Stanislav Pidhorskyi et al.

ECCV 2024 • arXiv:2407.19593
#1907

StructLDM: Structured Latent Diffusion for 3D Human Generation

Tao Hu, Fangzhou Hong, Ziwei Liu

ECCV 2024 • arXiv:2404.01241
#1908

Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers

Chi-Pin Huang, Kai-Po Chang, Chung-Ting Tsai et al.

ECCV 2024 • arXiv:2311.17717
#1909

HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts

Wonjae Kim, Sanghyuk Chun, Taekyung Kim et al.

ECCV 2024 • arXiv:2404.17507
#1910

High-Fidelity Modeling of Generalizable Wrinkle Deformation

Jingfan Guo, Jae Shin Yoon, Shunsuke Saito et al.

ECCV 2024
#1911

COMPOSE: Comprehensive Portrait Shadow Editing

Andrew Hou, Zhixin Shu, Xuaner Zhang et al.

ECCV 2024 • arXiv:2408.13922
#1912

EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion

Guangyao Zhai, Evin Pınar Örnek, Dave Zhenyu Chen et al.

ECCV 2024 • arXiv:2405.00915
#1913

MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation

Kunpeng Song, Yizhe Zhu, Bingchen Liu et al.

ECCV 2024 • arXiv:2404.05674
#1914

Learning Representations from Foundation Models for Domain Generalized Stereo Matching

Yongjian Zhang, Longguang Wang, Kunhong Li et al.

ECCV 2024
#1915

Global Structure-from-Motion Revisited

Linfei Pan, Daniel Barath, Marc Pollefeys et al.

ECCV 2024 • arXiv:2407.20219
#1916

NeRF-XL: NeRF at Any Scale with Multi-GPU

Ruilong Li, Sanja Fidler, Angjoo Kanazawa et al.

ECCV 2024
#1917

ReMatching: Low-Resolution Representations for Scalable Shape Correspondence

Filippo Maggioli, Daniele Baieri, Emanuele Rodola et al.

ECCV 2024 • arXiv:2305.09274
#1918

3D Hand Pose Estimation in Everyday Egocentric Images

Aditya Prakash, Ruisen Tu, Matthew Chang et al.

ECCV 2024 • arXiv:2312.06583
#1919

DEAL: Disentangle and Localize Concept-level Explanations for VLMs

Tang Li, Mengmeng Ma, Xi Peng

ECCV 2024 • arXiv:2407.14412
#1920

Controllable Human-Object Interaction Synthesis

Jiaman Li, Alexander Clegg, Roozbeh Mottaghi et al.

ECCV 2024 • arXiv:2312.03913
#1921

Nymeria: A Massive Collection of Egocentric Multi-modal Human Motion in the Wild

Lingni Ma, Yuting Ye, Rowan Postyeni et al.

ECCV 2024
#1922

MTKD: Multi-Teacher Knowledge Distillation for Image Super-Resolution

Yuxuan Jiang, Chen Feng, Fan Zhang et al.

ECCV 2024 • arXiv:2404.09571
#1923

Appearance-based Refinement for Object-Centric Motion Segmentation

Junyu Xie, Weidi Xie, Andrew Zisserman

ECCV 2024 • arXiv:2312.11463
#1924

iMatching: Imperative Correspondence Learning

Chen Wang, Dasong Gao, Yun-Jou Lin et al.

ECCV 2024
#1925

AnyHome: Open-Vocabulary Large-Scale Indoor Scene Generation with First-Person View Exploration

Rao Fu, Zehao Wen, Zichen Liu et al.

ECCV 2024
#1926

Towards High-Quality 3D Motion Transfer with Realistic Apparel Animation

Rong Wang, Wei Mao, Changsheng Lu et al.

ECCV 2024 • arXiv:2407.11266
#1927

Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene

Ruiyang Zhang, Hu Zhang, Hang Yu et al.

ECCV 2024 • arXiv:2407.08569
#1928

SlotLifter: Slot-guided Feature Lifting for Learning Object-Centric Radiance Fields

Yu Liu, Baoxiong Jia, Yixin Chen et al.

ECCV 2024 • arXiv:2408.06697
#1929

Confidence Self-Calibration for Multi-Label Class-Incremental Learning

Kaile Du, Yifan Zhou, Fan Lyu et al.

ECCV 2024 • arXiv:2403.12559
#1930

Fast View Synthesis of Casual Videos with Soup-of-Planes

Yao-Chih Lee, Zhoutong Zhang, Kevin Blackburn-Matzen et al.

ECCV 2024 • arXiv:2312.02135
#1931

Six-Point Method for Multi-Camera Systems with Reduced Solution Space

Banglei Guan, Ji Zhao, Laurent Kneip

ECCV 2024 • arXiv:2402.18066
#1932

A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties

Junfei Xiao, Ziqi Zhou, Wenxuan Li et al.

ECCV 2024 • arXiv:2312.13764
#1933

Tuning-Free Image Customization with Image and Text Guidance

Pengzhi Li, Qiang Nie, Ying Chen et al.

ECCV 2024 • arXiv:2403.12658
#1934

MegaScenes: Scene-Level View Synthesis at Scale

Joseph Tung, Gene Chou, Ruojin Cai et al.

ECCV 2024 • arXiv:2406.11819
#1935

Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation

Jinfeng Liu, Lingtong Kong, Bo Li et al.

ECCV 2024 • arXiv:2407.14126
#1936

Watch Your Steps: Local Image and Scene Editing by Text Instructions

Ashkan Mirzaei, Tristan T Aumentado-Armstrong, Marcus A Brubaker et al.

ECCV 2024 • arXiv:2308.08947
#1937

ControlCap: Controllable Region-level Captioning

Yuzhong Zhao, Liu Yue, Zonghao Guo et al.

ECCV 2024 • arXiv:2401.17910
#1938

Neural graphics texture compression supporting random access

Farzad Farhadzadeh, Qiqi Hou, Hoang Le et al.

ECCV 2024 • arXiv:2407.00021
#1939

U-COPE: Taking a Further Step to Universal 9D Category-level Object Pose Estimation

Li Zhang, Weiqing Meng, Yan Zhong et al.

ECCV 2024
#1940

Idea2Img: Iterative Self-Refinement with GPT-4V for Automatic Image Design and Generation

Zhengyuan Yang, Jianfeng Wang, Linjie Li et al.

ECCV 2024
#1941

Preventing Catastrophic Forgetting through Memory Networks in Continuous Detection

Gaurav Bhatt, Leonid Sigal, James Ross

ECCV 2024 • arXiv:2403.14797
#1942

Trajectory-aligned Space-time Tokens for Few-shot Action Recognition

Pulkit Kumar, Namitha Padmanabhan, Luke Luo et al.

ECCV 2024 • arXiv:2407.18249
#1943

Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models

Longxiang Tang, Zhuotao Tian, Kai Li et al.

ECCV 2024 • arXiv:2407.05342
#1944

Robust Incremental Structure-from-Motion with Hybrid Features

Shaohui Liu, Yidan Gao, Tianyi Zhang et al.

ECCV 2024 • arXiv:2409.19811
#1945

COIN-Matting: Confounder Intervention for Image Matting

Zhaohe Liao, Jiangtong Li, Jun Lan et al.

ECCV 2024
#1946

E3V-K5: An Authentic Benchmark for Redefining Video-Based Energy Expenditure Estimation

Shengxuming Zhang, Lei Jin, Yifan Wang et al.

ECCV 2024
#1947

Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection

Xingyu Peng, Yan Bai, Chen Gao et al.

ECCV 2024 • arXiv:2407.08931
#1948

Score Distillation Sampling with Learned Manifold Corrective

Thiemo Alldieck, Nikos Kolotouros, Cristian Sminchisescu

ECCV 2024 • arXiv:2401.05293
#1949

EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation

Nikolai Körber, Eduard Kromer, Andreas Siebert et al.

ECCV 2024 • arXiv:2309.03244
#1950

Rethinking Weakly-supervised Video Temporal Grounding From a Game Perspective

Xiang Fang, Zeyu Xiong, Wanlong Fang et al.

ECCV 2024
#1951

FlexAttention for Efficient High-Resolution Vision-Language Models

Junyan Li, Delin Chen, Tianle Cai et al.

ECCV 2024 • arXiv:2407.20228
#1952

AdaDiffSR: Adaptive Region-aware Dynamic acceleration Diffusion Model for Real-World Image Super-Resolution

Yuanting Fan, Chengxu Liu, Nengzhong Yin et al.

ECCV 2024 • arXiv:2410.17752
#1953

Asymmetric Mask Scheme for Self-Supervised Real Image Denoising

Xiangyu Liao, Tianheng Zheng, Jiayu Zhong et al.

ECCV 2024 • arXiv:2407.06514
#1954

Pathformer3D: A 3D Scanpath Transformer for 360° Images

Rong Quan, Yantao Lai, Mengyu Qiu et al.

ECCV 2024 • arXiv:2407.10563
#1955

AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection

Yunkang Cao, Jiangning Zhang, Luca Frittoli et al.

ECCV 2024 • arXiv:2407.15795
#1956

Visual Prompting via Partial Optimal Transport

Mengyu Zheng, Zhiwei Hao, Yehui Tang et al.

ECCV 2024
#1957

LiteSAM is Actually what you Need for segment Everything

Jianhai Fu, Yuanjie Yu, Ningchuan Li et al.

ECCV 2024
#1958

Deep Patch Visual SLAM

Lahav Lipson, Zachary Teed, Jia Deng

ECCV 2024 • arXiv:2408.01654
#1959

Efficient Training of Spiking Neural Networks with Multi-Parallel Implicit Stream Architecture

Zhigao Cao, Meng Li, Xiashuang Wang et al.

ECCV 2024
#1960

SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction

Marko Mihajlovic, Sergey Prokudin, Siyu Tang et al.

ECCV 2024 • arXiv:2409.11211
#1961

Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels

Rui Huang, Songyou Peng, Ayca Takmaz et al.

ECCV 2024 • arXiv:2312.17232
#1962

Domesticating SAM for Breast Ultrasound Image Segmentation via Spatial-frequency Fusion and Uncertainty Correction

Wanting Zhang, Huisi Wu, Jing Qin

ECCV 2024
#1963

BRAVE: Broadening the visual encoding of vision-language models

Oguzhan Fatih Kar, Alessio Tonioni, Petra Poklukar et al.

ECCV 2024 • arXiv:2404.07204
#1964

Diff3DETR: Agent-based Diffusion Model for Semi-supervised 3D Object Detection

Jiacheng Deng, Jiahao Lu, Tianzhu Zhang

ECCV 2024
#1965

MarineInst: A Foundation Model for Marine Image Analysis with Instance Visual Description

Ziqiang Zheng, Yiwei Chen, Huimin Zeng et al.

ECCV 2024
#1966

Interaction-centric Spatio-Temporal Context Reasoning for Multi-Person Video HOI Recognition

Yisong Wang, Nan Xi, Jingjing Meng et al.

ECCV 2024
#1967

Multi-modal Relation Distillation for Unified 3D Representation Learning

Huiqun Wang, Yiping Bao, Panwang Pan et al.

ECCV 2024 • arXiv:2407.14007
#1968

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Yanwei Li, Chengyao Wang, Jiaya Jia

ECCV 2024 • arXiv:2311.17043
#1969

Masked Angle-Aware Autoencoder for Remote Sensing Images

Zhihao Li, Biao Hou, Siteng Ma et al.

ECCV 2024 • arXiv:2408.01946
#1970

6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry

Sungho Chun, Ju Yong Chang

ECCV 2024
#1971

Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos

Akshay Paruchuri, Samuel Ehrenstein, Shuxian Wang et al.

ECCV 2024 • arXiv:2403.17915
#1972

ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation

Yi Zhang, Yun Tang, Wenjie Ruan et al.

ECCV 2024 • arXiv:2402.15429
#1973

Agent3D-Zero: An Agent for Zero-shot 3D Understanding

Sha Zhang, Di Huang, Jiajun Deng et al.

ECCV 2024 • arXiv:2403.11835
#1974

S-JEPA: A Joint Embedding Predictive Architecture for Skeletal Action Recognition

Mohamed Abdelfattah, Alexandre Alahi

ECCV 2024
#1975

Integer-Valued Training and Spike-driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection

Xinhao Luo, Man Yao, Yuhong Chou et al.

ECCV 2024 • arXiv:2407.20708
#1976

Structured-NeRF: Hierarchical Scene Graph with Neural Representation

Zhide Zhong, Jiakai Cao, Songen Gu et al.

ECCV 2024
#1977

Improving Unsupervised Domain Adaptation: A Pseudo-Candidate Set Approach

Aveen Dayal, Rishabh Lalla, Linga Reddy Cenkeramaddi et al.

ECCV 2024
#1978

GiT: Towards Generalist Vision Transformer through Universal Language Interface

Haiyang Wang, Hao Tang, Li Jiang et al.

ECCV 2024 • arXiv:2403.09394
#1979

PoseAugment: Generative Human Pose Data Augmentation with Physical Plausibility for IMU-based Motion Capture

Zhuojun Li, Chun Yu, Chen Liang et al.

ECCV 2024 • arXiv:2409.14101
#1980

APL: Anchor-based Prompt Learning for One-stage Weakly Supervised Referring Expression Comprehension

Yaxin Luo, Jiayi Ji, Xiaofu Chen et al.

ECCV 2024
#1981

SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis

Huan-ang Gao, Mingju Gao, Jiaju Li et al.

ECCV 2024 • arXiv:2403.09638
#1982

DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency

Xiaojing Zhong, Xinyi Huang, Xiaofeng Yang et al.

ECCV 2024 • arXiv:2408.07481
#1983

MeshFeat: Multi-Resolution Features for Neural Fields on Meshes

Mihir Mahajan, Florian Hofherr, Daniel Cremers

ECCV 2024 • arXiv:2407.13592
#1984

TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias

Sanghyun Jo, Soohyun Ryu, Sungyub Kim et al.

ECCV 2024 • arXiv:2404.00384
#1985

DragAPart: Learning a Part-Level Motion Prior for Articulated Objects

Ruining Li, Chuanxia Zheng, Christian Rupprecht et al.

ECCV 2024 • arXiv:2403.15382
#1986

BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream

Wenpu Li, Pian Wan, Peng Wang et al.

ECCV 2024 • arXiv:2407.02174
#1987

Enhancing Optimization Robustness in 1-bit Neural Networks through Stochastic Sign Descent

NianHui Guo, Hong Guo, Christoph Meinel et al.

ECCV 2024
#1988

Learning to Unlearn for Robust Machine Unlearning

Mark Huang, Lin Geng Foo, Jun Liu

ECCV 2024 • arXiv:2407.10494
#1989

Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers

Zhengbo Zhang, Li Xu, Duo Peng et al.

ECCV 2024 • arXiv:2407.08394
#1990

Echoes of the Past: Boosting Long-tail Recognition via Reflective Learning

Qihao Zhao, Yalun Dai, Shen Lin et al.

ECCV 2024
#1991

OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models

Kong Zhe, Yong Zhang, Tianyu Yang et al.

ECCV 2024 • arXiv:2403.10983
#1992

Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models

Saman Motamed, Danda Pani Paudel, Luc Van Gool

ECCV 2024 • arXiv:2311.13833
#1993

Taming CLIP for Fine-grained and Structured Visual Understanding of Museum Exhibits

Ada-Astrid Balauca, Danda Paudel, Kristina Toutanova et al.

ECCV 2024 • arXiv:2409.01690
#1994

Visual Text Generation in the Wild

Yuanzhi Zhu, Jiawei Liu, Feiyu Gao et al.

ECCV 2024 • arXiv:2407.14138
#1995

Cross-Domain Learning for Video Anomaly Detection with Limited Supervision

Yashika Jain, Ali Dabouei, Min Xu

ECCV 2024 • arXiv:2408.05191
#1996

DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing

Hyeonho Jeong, Jinho Chang, Geon Yeong Park et al.

ECCV 2024 • arXiv:2403.12002
#1997

A Unified Image Compression Method for Human Perception and Multiple Vision Tasks

Sha Guo, Sui Lin, Chen-Lin Zhang et al.

ECCV 2024
#1998

Domain-adaptive Video Deblurring via Test-time Blurring

Jin-Ting He, Fu-Jen Tsai, Jia-Hao Wu et al.

ECCV 2024 • arXiv:2407.09059
#1999

3DEgo: 3D Editing on the Go!

Umar Khalid, Hasan Iqbal, Azib Farooq et al.

ECCV 2024 • arXiv:2407.10102
#2000

Unleashing the Power of Prompt-driven Nucleus Instance Segmentation

Zhongyi Shui, Yunlong Zhang, Kai Yao et al.

ECCV 2024 • arXiv:2311.15939