Most Cited ECCV "saturation effect" Papers

2,387 papers found • Page 10 of 12

#1801

HiDiffusion: Unlocking Higher-Resolution Creativity and Efficiency in Pretrained Diffusion Models

Shen Zhang, Zhaowei CHEN, Zhenyu Zhao et al.

ECCV 2024posterarXiv:2311.17528
#1802

Imaging Interiors: An Implicit Solution to Electromagnetic Inverse Scattering Problems

Ziyuan Luo, Boxin Shi, Haoliang Li et al.

ECCV 2024posterarXiv:2407.09352
#1803

EgoBody3M: Egocentric Body Tracking on a VR Headset using a Diverse Dataset

Amy Zhao, Chengcheng Tang, Lezi Wang et al.

ECCV 2024poster
#1804

StableDrag: Stable Dragging for Point-based Image Editing

Yutao Cui, Xiaotong Zhao, Guozhen Zhang et al.

ECCV 2024posterarXiv:2403.04437
#1805

MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation

Yuxiang WEI, Zhilong Ji, Jinfeng Bai et al.

ECCV 2024posterarXiv:2405.05806
#1806

PointRegGPT: Boosting 3D Point Cloud Registration using Generative Point-Cloud Pairs for Training

SUYI CHEN, Hao Xu, Haipeng Li et al.

ECCV 2024posterarXiv:2407.14054
#1807

DIFFender: Diffusion-Based Adversarial Defense against Patch Attacks

Caixin Kang, Yinpeng Dong, Zhengyi Wang et al.

ECCV 2024posterarXiv:2306.09124
#1808

Rethinking Data Bias: Dataset Copyright Protection via Embedding Class-wise Hidden Bias

Jinhyeok Jang, ByungOk Han, Jaehong Kim et al.

ECCV 2024poster
#1809

Visual Relationship Transformation

Xiaoyu Xu, Jiayan Qiu, Baosheng Yu et al.

ECCV 2024poster
#1810

Q&A Prompts: Discovering Rich Visual Clues through Mining Question-Answer Prompts for VQA requiring Diverse World Knowledge

Haibo Wang, Weifeng Ge

ECCV 2024posterarXiv:2401.10712
#1811

HARIVO: Harnessing Text-to-Image Models for Video Generation

Mingi Kwon, Seoung Wug Oh, Yang Zhou et al.

ECCV 2024posterarXiv:2410.07763
#1812

Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models

Chao Gong, Kai Chen, Zhipeng Wei et al.

ECCV 2024posterarXiv:2407.12383
#1813

Length-Aware Motion Synthesis via Latent Diffusion

Alessio Sampieri, Alessio Palma, Indro Spinelli et al.

ECCV 2024posterarXiv:2407.11532
#1814

Clean & Compact: Efficient Data-Free Backdoor Defense with Model Compactness

Huy Phan, Jinqi Xiao, Yang Sui et al.

ECCV 2024poster
#1815

Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off

Levente Ferenc Halmosi, Bálint Mohos, Márk Jelasity

ECCV 2024posterarXiv:2407.09150
#1816

SignGen: End-to-End Sign Language Video Generation with Latent Diffusion

Fan Qi, Yu Duan, Changsheng Xu et al.

ECCV 2024poster
#1817

GRA: Detecting Oriented Objects through Group-wise Rotating and Attention

Jiangshan Wang, Yifan Pu, Yizeng Han et al.

ECCV 2024posterarXiv:2403.11127
#1818

Label-free Neural Semantic Image Synthesis

Jiayi Wang, Kevin Alexander Laube, Yumeng Li et al.

ECCV 2024posterarXiv:2407.01790
#1819

Causal Subgraphs and Information Bottlenecks: Redefining OOD Robustness in Graph Neural Networks

Weizhi An, Wenliang Zhong, Feng Jiang et al.

ECCV 2024poster
#1820

Image-to-Lidar Relational Distillation for Autonomous Driving Data

Anas Mahmoud, Ali Harakeh, Steven Waslander

ECCV 2024posterarXiv:2409.00845
#1821

Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning

Fanyue Wei, Wei Zeng, Zhenyang Li et al.

ECCV 2024posterarXiv:2407.06642
#1822

SemanticHuman-HD: High Resolution Semantic disentangled 3D Human Generation

Peng Zheng, Tao Liu, Zili Yi et al.

ECCV 2024posterarXiv:2403.10166
#1823

HiEI: A Universal Framework for Generating High-quality Emerging Images from Natural Images

Jingmeng Li, Lukang Fu, Surun Yang et al.

ECCV 2024poster
#1824

The Sky's the Limit: Relightable Outdoor Scenes via a Sky-pixel Constrained Illumination Prior and Outside-In Visibility

James Gardner, Evgenii Kashin, Bernhard Egger et al.

ECCV 2024poster
#1825

Neural Spectral Decomposition for Dataset Distillation

Yang Shaolei, Shen Cheng, Mingbo Hong et al.

ECCV 2024posterarXiv:2408.16236
#1826

COSMU: Complete 3D human shape from monocular unconstrained images

Marco Pesavento, Marco Volino, Adrian Hilton

ECCV 2024posterarXiv:2407.10586
#1827

Phase Concentration and Shortcut Suppression for Weakly Supervised Semantic Segmentation

Hoyong Kwon, Jaeseok Jeong, Sung-Hoon Yoon et al.

ECCV 2024poster
#1828

Language-Assisted Skeleton Action Understanding for Skeleton-Based Temporal Action Segmentation

Haoyu Ji, Bowen Chen, Xinglong Xu et al.

ECCV 2024poster
#1829

HERGen: Elevating Radiology Report Generation with Longitudinal Data

Fuying Wang, Shenghui Du, Lequan Yu

ECCV 2024posterarXiv:2407.15158
#1830

Hierarchical Unsupervised Relation Distillation for Source Free Domain Adaptation

Bowei Xing, Xianghua Ying, Ruibin Wang et al.

ECCV 2024poster
#1831

GMT: Enhancing Generalizable Neural Rendering via Geometry-Driven Multi-Reference Texture Transfer

Youngho Yoon, Hyun-Kurl Jang, Kuk-Jin Yoon

ECCV 2024posterarXiv:2410.00672
#1832

SNeRV: Spectra-preserving Neural Representation for Video

Jina Kim, Jihoo Lee, Jewon Kang

ECCV 2024posterarXiv:2501.01681
#1833

L-DiffER: Single Image Reflection Removal with Language-based Diffusion Model

Yuchen Hong, Haofeng Zhong, Shuchen Weng et al.

ECCV 2024poster
#1834

WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models

xinjian wu, Ruisong Zhang, Jie Qin et al.

ECCV 2024posterarXiv:2407.10131
#1835

Analysis-by-Synthesis Transformer for Single-View 3D Reconstruction

Dian Jia, Xiaoqian Ruan, Kun Xia et al.

ECCV 2024poster
#1836

DMiT: Deformable Mipmapped Tri-Plane Representation for Dynamic Scenes

Jing-Wen Yang, Jia-Mu Sun, Yong-Liang Yang et al.

ECCV 2024poster
#1837

A Diffusion Model for Simulation Ready Coronary Anatomy with Morpho-skeletal Control

Karim Kadry, Shreya Gupta, Jonas Sogbadji et al.

ECCV 2024posterarXiv:2407.15631
#1838

Motion-Oriented Compositional Neural Radiance Fields for Monocular Dynamic Human Modeling

Jaehyeok Kim, Dongyoon Wee, Dan Xu

ECCV 2024posterarXiv:2407.11962
#1839

KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding

Zhihao Xu, Shengjie Gong, Jiapeng Tang et al.

ECCV 2024posterarXiv:2409.01113
#1840

Revisiting Domain-Adaptive Object Detection in Adverse Weather by the Generation and Composition of High-Quality Pseudo-Labels

Rui Zhao, Huibin Yan, Shuoyao Wang

ECCV 2024poster
#1841

Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models

Siao Tang, Xin Wang, Hong Chen et al.

ECCV 2024poster
#1842

DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level Control

Xinyu Xu, Shengcheng Luo, Yanchao Yang et al.

ECCV 2024posterarXiv:2407.14758
#1843

Textual Query-Driven Mask Transformer for Domain Generalized Segmentation

Byeonghyun Pak, Byeongju Woo, Sunghwan Kim et al.

ECCV 2024posterarXiv:2407.09033
#1844

Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors

Wei Shang, Dongwei Ren, Wanying Zhang et al.

ECCV 2024posterarXiv:2407.09919
#1845

A Unified Image Compression Method for Human Perception and Multiple Vision Tasks

Sha Guo, Sui Lin, Chen-Lin Zhang et al.

ECCV 2024poster
#1846

APL: Anchor-based Prompt Learning for One-stage Weakly Supervised Referring Expression Comprehension

Yaxin Luo, Jiayi Ji, Xiaofu Chen et al.

ECCV 2024poster
#1847

Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation

Hyunwoo Yu, Yubin Cho, Beoungwoo Kang et al.

ECCV 2024posterarXiv:2407.17261
#1848

Combining Generative and Geometry Priors for Wide-Angle Portrait Correction

Lan Yao, Chaofeng Chen, Xiaoming Li et al.

ECCV 2024posterarXiv:2410.09911
#1849

To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now

Yimeng Zhang, jinghan jia, Xin Chen et al.

ECCV 2024posterarXiv:2310.11868
#1850

To Supervise or Not to Supervise: Understanding and Addressing the Key Challenges of Point Cloud Transfer Learning

Souhail Hadgi, Lei Li, Maks Ovsjanikov

ECCV 2024posterarXiv:2403.17869
#1851

3D Reconstruction of Objects in Hands without Real World 3D Supervision

Aditya Prakash, Matthew Chang, Matthew Jin et al.

ECCV 2024posterarXiv:2305.03036
#1852

Forbes: Face Obfuscation Rendering via Backpropagation Refinement Scheme

Jintae Kim, Seungwon Yang, Seong-Gyun Jeong et al.

ECCV 2024posterarXiv:2407.14170
#1853

SeA: Semantic Adversarial Augmentation for Last Layer Features from Unsupervised Representation Learning

Qi Qian, Yuanhong Xu, JUHUA HU

ECCV 2024posterarXiv:2408.13351
#1854

DualDn: Dual-domain Denoising via Differentiable ISP

Ruikang Li, Yujin Wang, Shiqi Chen et al.

ECCV 2024posterarXiv:2409.18783
#1855

VideoStudio: Generating Consistent-Content and Multi-Scene Videos

Fuchen Long, Zhaofan Qiu, Ting Yao et al.

ECCV 2024posterarXiv:2401.01256
#1856

Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation

Zhaoyang Li, Yuan Wang, Wangkai Li et al.

ECCV 2024posterarXiv:2408.13752
#1857

AdaIFL: Adaptive Image Forgery Localization via a Dynamic and Importance-aware Transformer Network

Yuxi Li, Fuyuan Cheng, Wangbo Yu et al.

ECCV 2024poster
#1858

Event-based Head Pose Estimation: Benchmark and Method

jiahui yuan, Hebei Li, Yansong Peng et al.

ECCV 2024poster
#1859

CoLA: Conditional Dropout and Language-driven Robust Dual-modal Salient Object Detection

Shuang Hao, Chunlin Zhong, He Tang

ECCV 2024posterarXiv:2407.06780
#1860

Visual Alignment Pre-training for Sign Language Translation

Peiqi Jiao, Yuecong Min, Xilin CHEN

ECCV 2024poster
#1861

Rethinking Image-to-Video Adaptation: An Object-centric Perspective

Rui Qian, Shuangrui Ding, Dahua Lin

ECCV 2024posterarXiv:2407.06871
#1862

Siamese Vision Transformers are Scalable Audio-visual Learners

Yan-Bo Lin, Gedas Bertasius

ECCV 2024posterarXiv:2403.19638
#1863

Learning Unsigned Distance Functions from Multi-view Images with Volume Rendering Priors

Wen Yuan Zhang, Kanle Shi, Yushen Liu et al.

ECCV 2024poster
#1864

Assessing Sample Quality via the Latent Space of Generative Models

Jingyi Xu, Hieu Le, Dimitris Samaras

ECCV 2024posterarXiv:2407.15171
#1865

Responsible Visual Editing

Minheng Ni, Yeli Shen, Yabin Zhang et al.

ECCV 2024posterarXiv:2404.05580
#1866

Consistent 3D Line Mapping

Xulong Bai, Hainan Cui, Shuhan Shen

ECCV 2024poster
#1867

Physical-Based Event Camera Simulator

Haiqian Han, Jiacheng Lyu, Jianing Li et al.

ECCV 2024poster
#1868

LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis

Kevin Xie, Tianshi Cao, Jonathan P Lorraine et al.

ECCV 2024posterarXiv:2403.15385
#1869

Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame Interpolation

Zhihang Zhong, Gurunandan Krishnan, Xiao Sun et al.

ECCV 2024poster
#1870

Scene-aware Human Motion Forecasting via Mutual Distance Prediction

Chaoyue Xing, Wei Mao, Miaomiao LIU

ECCV 2024posterarXiv:2310.00615
#1871

MotionDirector: Motion Customization of Text-to-Video Diffusion Models

Rui Zhao, Yuchao Gu, Jay Zhangjie Wu et al.

ECCV 2024posterarXiv:2310.08465
#1872

FunQA: Towards Surprising Video Comprehension

Binzhu Xie, Sicheng Zhang, Zitang Zhou et al.

ECCV 2024posterarXiv:2306.14899
#1873

Photon Inhibition for Energy-Efficient Single-Photon Imaging

Lucas Koerner, Shantanu Gupta, Atul N Ingle et al.

ECCV 2024posterarXiv:2409.18337
#1874

OccGen: Generative Multi-modal 3D Occupancy Prediction for Autonomous Driving

Guoqing Wang, Zhongdao Wang, Pin Tang et al.

ECCV 2024posterarXiv:2404.15014
#1875

Probabilistic Image-Driven Traffic Modeling via Remote Sensing

Scott Workman, Armin Hadzic

ECCV 2024posterarXiv:2403.05521
#1876

Smoothness, Synthesis, and Sampling: Re-thinking Unsupervised Multi-View Stereo with DIV Loss

Alex Rich, Noah Stier, Pradeep Sen et al.

ECCV 2024poster
#1877

Beyond MOT: Semantic Multi-Object Tracking

Yunhao Li, Qin Li, Hao Wang et al.

ECCV 2024posterarXiv:2403.05021
#1878

UAV First-Person Viewers Are Radiance Field Learners

Liqi Yan, Qifan Wang, Junhan Zhao et al.

ECCV 2024poster
#1879

Knowledge-enhanced Visual-Language Pretraining for Computational Pathology

Xiao Zhou, Xiaoman Zhang, Chaoyi Wu et al.

ECCV 2024posterarXiv:2404.09942
#1880

Pick-a-back: Selective Device-to-Device Knowledge Transfer in Federated Continual Learning

JinYi Yoon, HyungJune Lee

ECCV 2024poster
#1881

Situated Instruction Following

So Yeon Min, Xavier Puig, Devendra Singh Chaplot et al.

ECCV 2024posterarXiv:2407.12061
#1882

Curved Diffusion: A Generative Model With Optical Geometry Control

Andrey Voynov, Amir Hertz, Moab Arar et al.

ECCV 2024posterarXiv:2311.17609
#1883

Holodepth: Programmable Depth-Varying Projection via Computer-Generated Holography

Dorian Chan, Matthew O'Toole, Sizhuo Ma et al.

ECCV 2024poster
#1884

Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators

Yifan Pu, Xia Zhuofan, Jiayi Guo et al.

ECCV 2024posterarXiv:2408.05710
#1885

Two-Stage Video Shadow Detection via Temporal-Spatial Adaption

Xin Duan, Yu Cao, Lei Zhu et al.

ECCV 2024poster
#1886

CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation

Monika Wysoczanska, Oriane Siméoni, Michaël Ramamonjisoa et al.

ECCV 2024poster
#1887

M^2Depth: Self-supervised Two-Frame Multi-camera Metric Depth Estimation

Yingshuang Zou, Yikang Ding, Xi Qiu et al.

ECCV 2024poster
#1888

3D Gaussian Parametric Head Model

Yuelang Xu, Lizhen Wang, Zerong Zheng et al.

ECCV 2024poster
#1889

Improving Adversarial Transferability via Model Alignment

Avery Ma, Amir-massoud Farahmand, Yangchen Pan et al.

ECCV 2024posterarXiv:2311.18495
#1890

RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios

Wenhao Ding, Yulong Cao, DING ZHAO et al.

ECCV 2024posterarXiv:2312.13303
#1891

Information Bottleneck Based Data Correction in Continual Learning

Shuai Chen, mingyi zhang, Junge Zhang et al.

ECCV 2024poster
#1892

Factorizing Text-to-Video Generation by Explicit Image Conditioning

Rohit Girdhar, Mannat Singh, Andrew Brown et al.

ECCV 2024posterarXiv:2311.10709
#1893

REDIR: Refocus-free Event-based De-occlusion Image Reconstruction

Qi Guo, Hailong Shi, Huan Li et al.

ECCV 2024poster
#1894

Cut out the Middleman: Revisiting Pose-based Gait Recognition

YANG FU, Saihui Hou, Shibei Meng et al.

ECCV 2024poster
#1895

Fast Registration of Photorealistic Avatars for VR Facial Animation

Chaitanya Patel, Shaojie Bai, Te-Li Wang et al.

ECCV 2024posterarXiv:2401.11002
#1896

Shapefusion: 3D localized human diffusion models

Rolandos Alexandros Potamias, Michael Tarasiou, Stylianos Ploumpis et al.

ECCV 2024poster
#1897

Frontier-enhanced Topological Memory with Improved Exploration Awareness for Embodied Visual Navigation

Xinru Cui, Qiming Liu, Zhe Liu et al.

ECCV 2024poster
#1898

Caltech Aerial RGB-Thermal Dataset in the Wild

Connor Lee, Matthew Anderson, Nikhil Ranganathan et al.

ECCV 2024posterarXiv:2403.08997
#1899

Diagnosing and Re-learning for Balanced Multimodal Learning

Yake Wei, Siwei Li, Ruoxuan Feng et al.

ECCV 2024posterarXiv:2407.09705
#1900

MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning

Vishal Nedungadi, Ankit Kariryaa, Stefan Oehmcke et al.

ECCV 2024posterarXiv:2405.02771
#1901

Loc3Diff: Local Diffusion for 3D Human Head Synthesis and Editing

Yushi Lan, Feitong Tan, Qiangeng Xu et al.

ECCV 2024poster
#1902

Learning to Distinguish Samples for Generalized Category Discovery

Fengxiang Yang, Pu Nan, Wenjing Li et al.

ECCV 2024poster
#1903

WBP: Training-time Backdoor Attacks through Hardware-based Weight Bit Poisoning

Kunbei Cai, Zhenkai Zhang, Qian Lou et al.

ECCV 2024poster
#1904

UL-VIO: Ultra-lightweight Visual-Inertial Odometry with Noise Robust Test-time Adaptation

Jinho Park, Se Young Chun, Mingoo Seok

ECCV 2024posterarXiv:2409.13106
#1905

Get Your Embedding Space in Order: Domain-Adaptive Regression for Forest Monitoring

Sizhuo Li, Dimitri Gominski, Martin Brandt et al.

ECCV 2024posterarXiv:2405.00514
#1906

CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs

Yassine Ouali, Adrian Bulat, Brais Martinez et al.

ECCV 2024posterarXiv:2408.10433
#1907

HVCLIP: High-dimensional Vector in CLIP for Unsupervised Domain Adaptation

Noranart Vesdapunt, Kah Kuen Fu, Yue Wu et al.

ECCV 2024poster
#1908

Improving 3D Semi-supervised Learning by Effectively Utilizing All Unlabelled Data

Sneha Paul, Zachary Patterson, Nizar Bouguila

ECCV 2024posterarXiv:2409.13977
#1909

Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling

Zixiao Wang, Hongtao Xie, YuXin Wang et al.

ECCV 2024posterarXiv:2409.13431
#1910

EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS

Sharath Girish, Kamal Gupta, Abhinav Shrivastava

ECCV 2024posterarXiv:2312.04564
#1911

Thinking Outside the BBox: Unconstrained Generative Object Compositing

Gemma Canet Tarrés, Zhe Lin, Zhifei Zhang et al.

ECCV 2024posterarXiv:2409.04559
#1912

CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs

Akshat Ramachandran, Souvik Kundu, Tushar Krishna

ECCV 2024posterarXiv:2407.05266
#1913

A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures

Tahmina Khanam, Mohammed Bennamoun, Guan Wang et al.

ECCV 2024posterarXiv:2408.12443
#1914

Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively

Haobo Yuan, Xiangtai Li, Chong Zhou et al.

ECCV 2024posterarXiv:2401.02955
#1915

ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation

Mengcheng Lan, Chaofeng Chen, Yiping Ke et al.

ECCV 2024posterarXiv:2408.04883
#1916

LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model

Runhui Huang, Kaixin Cai, Jianhua Han et al.

ECCV 2024posterarXiv:2403.11929
#1917

Unsupervised Variational Translator for Bridging Image Restoration and High-Level Vision Tasks

Jiawei Wu, Zhi Jin

ECCV 2024posterarXiv:2408.08149
#1918

Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement

Hao Xu, Xi Zhang, Xiaolin Wu

ECCV 2024posterarXiv:2408.02966
#1919

Scene-Conditional 3D Object Stylization and Composition

Jinghao Zhou, Tomas Jakab, Philip Torr et al.

ECCV 2024posterarXiv:2312.12419
#1920

RANRAC: Robust Neural Scene Representations via Random Ray Consensus

Benno Buschmann, Andreea Dogaru, Elmar Eisemann et al.

ECCV 2024posterarXiv:2312.09780
#1921

Vision-Language Dual-Pattern Matching for Out-of-Distribution Detection

Zihan Zhang, Zhuo Xu, Xiang Xiang

ECCV 2024poster
#1922

MarineInst: A Foundation Model for Marine Image Analysis with Instance Visual Description

Ziqiang Zheng, Yiwei Chen, Huimin Zeng et al.

ECCV 2024poster
#1923

Contextual Correspondence Matters: Bidirectional Graph Matching for Video Summarization

yunzuo zhang, Yameng Liu

ECCV 2024poster
#1924

Linking in Style: Understanding learned features in deep learning models

Maren Wehrheim, Pamela Osuna Vargas, Matthias Kaschube

ECCV 2024posterarXiv:2409.16865
#1925

COD: Learning Conditional Invariant Representation for Domain Adaptation Regression

Hao-Ran Yang, Chuan-Xian Ren, You-Wei Luo

ECCV 2024posterarXiv:2408.06638
#1926

Easing 3D Pattern Reasoning with Side-view Features for Semantic Scene Completion

Linxi Huan, Mingyue Dong, Linwei Yue et al.

ECCV 2024poster
#1927

Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression

Animesh Sinha, Bo Sun, Anmol Kalia et al.

ECCV 2024posterarXiv:2311.10794
#1928

High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering

Xin Ming, Jiawei Li, Jingwang Ling et al.

ECCV 2024posterarXiv:2401.08398
#1929

InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction

Xulong Wang, Siyan Dong, Youyi Zheng et al.

ECCV 2024posterarXiv:2407.12661
#1930

DreamReward: Aligning Human Preference in Text-to-3D Generation

junliang ye, Fangfu Liu, Qixiu Li et al.

ECCV 2024poster
#1931

Towards Image Ambient Lighting Normalization

Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei Wu et al.

ECCV 2024posterarXiv:2403.18730
#1932

FedHide: Federated Learning by Hiding in the Neighbors

Hyunsin Park, Sungrack Yun

ECCV 2024posterarXiv:2409.07808
#1933

Superpixel-informed Implicit Neural Representation for Multi-Dimensional Data

Jiayi Li, Xi-Le Zhao, Jian-Li Wang et al.

ECCV 2024posterarXiv:2411.11356
#1934

FedHARM: Harmonizing Model Architectural Diversity in Federated Learning

Anestis Kastellos, Athanasios Psaltis, Charalampos Z Patrikakis et al.

ECCV 2024poster
#1935

Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis

Brian Isaac Medina, Yona Falinie Abdul Gaus, Neelanjan Bhowmik et al.

ECCV 2024posterarXiv:2407.15763
#1936

DiffSurf: A Transformer-based Diffusion Model for Generating and Reconstructing 3D Surfaces in Pose

Yoshiyasu Yusuke, Leyuan Sun

ECCV 2024posterarXiv:2408.14860
#1937

LPViT: Low-Power Semi-structured Pruning for Vision Transformers

KAIXIN Xu, Zhe Wang, Chunyun Chen et al.

ECCV 2024posterarXiv:2407.02068
#1938

Weighted Ensemble Models Are Strong Continual Learners

Imad Eddine Marouf, Subhankar Roy, Enzo Tartaglione et al.

ECCV 2024posterarXiv:2312.08977
#1939

GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time

Hao Li, Yuanyuan Gao, Dingwen Zhang et al.

ECCV 2024poster
#1940

Chains of Diffusion Models

Yanheng Wei, Lianghua Huang, Zhi-Fan Wu et al.

ECCV 2024poster
#1941

Robustness Tokens: Towards Adversarial Robustness of Transformers

Brian Pulfer, Yury Belousov, Slava Voloshynovskiy

ECCV 2024posterarXiv:2503.10191
#1942

Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking

Jiyao Zhang, Weiyao Huang, Bo Peng et al.

ECCV 2024posterarXiv:2406.04316
#1943

EINet: Point Cloud Completion via Extrapolation and Interpolation

Pingping Cai, Canyu Zhang, LINGJIA SHI et al.

ECCV 2024poster
#1944

Bridging the Gap Between Human Motion and Action Semantics via Kinematics Phrases

Xinpeng Liu, Yong-Lu Li, AILING ZENG et al.

ECCV 2024posterarXiv:2310.04189
#1945

DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks

Sarah Jabbour, Gregory Kondas, Ella Kazerooni et al.

ECCV 2024posterarXiv:2407.14509
#1946

Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance

Donghoon Ahn, Hyoungwon Cho, Jaewon Min et al.

ECCV 2024posterarXiv:2403.17377
#1947

MONTRAGE: Monitoring Training for Attribution of Generative Diffusion Models

Jonathan Brokman, Omer Hofman, Roman Vainshtein et al.

ECCV 2024poster
#1948

Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition

Sergio Izquierdo, Javier Civera

ECCV 2024posterarXiv:2407.02422
#1949

SWAG: Splatting in the Wild images with Appearance-conditioned Gaussians

Hiba Dahmani, Moussab Bennehar, Nathan Piasco et al.

ECCV 2024posterarXiv:2403.10427
#1950

TAG: Text Prompt Augmentation for Zero-Shot Out-of-Distribution Detection

Xixi Liu, Christopher Zach

ECCV 2024poster
#1951

Can Textual Semantics Mitigate Sounding Object Segmentation Preference?

Yaoting Wang, Peiwen Sun, Yuanchao Li et al.

ECCV 2024posterarXiv:2407.10947
#1952

Continual Learning and Unknown Object Discovery in 3D Scenes via Self-Distillation

Mohamed El Amine Boudjoghra, Jean Lahoud, Salman Khan et al.

ECCV 2024poster
#1953

Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking

Lorenzo Vaquero, Yihong XU, Xavier Alameda-Pineda et al.

ECCV 2024posterarXiv:2407.10151
#1954

Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time

Sanjoy Chowdhury, Sayan Nag, Subhrajyoti Dasgupta et al.

ECCV 2024posterarXiv:2407.01851
#1955

MLPHand: Real Time Multi-View 3D Hand Reconstruction via MLP Modeling

Jian Yang, Jiakun Li, Guoming Li et al.

ECCV 2024poster
#1956

How Far Can a 1-Pixel Camera Go? Solving Vision Tasks using Photoreceptors and Computationally Designed Visual Morphology

Andrei Atanov, Rishubh Singh, Jiawei Fu et al.

ECCV 2024poster
#1957

Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier

Prantik Howlader, Srijan Das, Hieu Le et al.

ECCV 2024posterarXiv:2407.04036
#1958

Spiking Wavelet Transformer

Yuetong Fang, Ziqing Wang, Lingfeng Zhang et al.

ECCV 2024posterarXiv:2403.11138
#1959

WAVE: Warping DDIM Inversion Features for Zero-shot Text-to-Video Editing

Yutang Feng, Sicheng Gao, Yuxiang Bao et al.

ECCV 2024poster
#1960

HoloADMM: High-Quality Holographic Complex Field Recovery

Mazen Mel, Paul Springer, Pietro Zanuttigh et al.

ECCV 2024poster
#1961

Few-shot Defect Image Generation based on Consistency Modeling

Qingfeng Shi, Jing Wei, Fei Shen et al.

ECCV 2024posterarXiv:2408.00372
#1962

BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion

Gwanghyun Kim, Hayeon Kim, Hoigi Seo et al.

ECCV 2024posterarXiv:2404.04544
#1963

All You Need is Your Voice: Emotional Face Representation with Audio Perspective for Emotional Talking Face Generation

Seongho Kim, Byung Cheol Song

ECCV 2024poster
#1964

AnimateMe: 4D Facial Expressions via Diffusion Models

Dimitrios Gerogiannis, Foivos Paraperas Papantoniou, Rolandos Alexandros Potamias et al.

ECCV 2024posterarXiv:2403.17213
#1965

iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning

Tom Fischer, Yaoyao Liu, Artur Jesslen et al.

ECCV 2024posterarXiv:2407.09271
#1966

Pose Guided Fine-Grained Sign Language Video Generation

Tongkai Shi, Lianyu Hu, Fanhua Shang et al.

ECCV 2024poster
#1967

POET: Prompt Offset Tuning for Continual Human Action Adaptation

Prachi Garg, Joseph K J, Vineeth N Balasubramanian et al.

ECCV 2024posterarXiv:2504.18059
#1968

SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning

Bac Nguyen, Stefan Uhlich, Fabien Cardinaux et al.

ECCV 2024posterarXiv:2407.03036
#1969

SAH-SCI: Self-Supervised Adapter for Efficient Hyperspectral Snapshot Compressive Imaging

Haijin Zeng, Yuxi Liu, Yongyong Chen et al.

ECCV 2024poster
#1970

Optimization-based Uncertainty Attribution Via Learning Informative Perturbations

Hanjing Wang, Bashirul Azam Biswas, Qiang Ji

ECCV 2024poster
#1971

ReCON: Training-Free Acceleration for Text-to-Image Synthesis with Retrieval of Concept Prompt Trajectories

Chen-yi Lu, Shubham Agarwal, Mehrab Tanjim et al.

ECCV 2024poster
#1972

Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval

Naoya Sogi, Takashi Shibata, Makoto Terao

ECCV 2024posterarXiv:2407.12346
#1973

GRiT: A Generative Region-to-text Transformer for Object Understanding

Jialian Wu, Jianfeng Wang, Zhengyuan Yang et al.

ECCV 2024posterarXiv:2212.00280
#1974

LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System

Hongbeen Park, Minjeong Park, Giljoo Nam et al.

ECCV 2024posterarXiv:2506.10567
#1975

BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling

Cheng Peng, Yutao Tang, Yifan Zhou et al.

ECCV 2024posterarXiv:2403.04926
#1976

DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly

Fenggen Yu, Yiming Qian, Xu Zhang et al.

ECCV 2024posterarXiv:2404.00875
#1977

Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation

Juncheng Ma, Peiwen Sun, Yaoting Wang et al.

ECCV 2024posterarXiv:2407.11820
#1978

Reinforcement Learning via Auxillary Task Distillation

Abhinav Narayan Harish, Larry Heck, Josiah P Hanna et al.

ECCV 2024poster
#1979

Plug and Play: A Representation Enhanced Domain Adapter for Collaborative Perception

TIANYOU LUO, Quan Yuan, Yuchen Xia et al.

ECCV 2024poster
#1980

Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models

Yuchen Yang, Kwonjoon Lee, Behzad Dariush et al.

ECCV 2024posterarXiv:2407.10299
#1981

Computing the Lipschitz constant needed for fast scene recovery from CASSI measurements

Niels Chr. Overgaard, Anders Holst

ECCV 2024poster
#1982

DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling

Haoran Li, Haolin Shi, Wenli Zhang et al.

ECCV 2024posterarXiv:2404.03575
#1983

Improving Hyperbolic Representations via Gromov-Wasserstein Regularization

yifei Yang, Wonjun Lee, Dongmian Zou et al.

ECCV 2024posterarXiv:2407.10495
#1984

IAM-VFI : Interpolate Any Motion for Video Frame Interpolation with motion complexity map

Kihwan Yoon, Yong Han Kim, Sungjei Kim et al.

ECCV 2024poster
#1985

Depth-Aware Blind Image Decomposition for Real-World Adverse Weather Recovery

Chao Wang, Zhedong Zheng, Ruijie Quan et al.

ECCV 2024poster
#1986

DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation

Jeongsol Kim, Geon Yeong Park, Jong Chul Ye

ECCV 2024posterarXiv:2403.11415
#1987

Training A Small Emotional Vision Language Model for Visual Art Comprehension

Jing Zhang, Liang Zheng, Meng Wang et al.

ECCV 2024posterarXiv:2403.11150
#1988

PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology

YUXUAN SUN, Hao Wu, Chenglu Zhu et al.

ECCV 2024posterarXiv:2401.16355
#1989

Kinetic Typography Diffusion Model

Seonmi Park, Inhwan Bae, Seunghyun Shin et al.

ECCV 2024posterarXiv:2407.10476
#1990

Free-ATM: Harnessing Free Attention Masks for Representation Learning on Diffusion-Generated Images

Junhao Zhang, Mutian Xu, Jay Zhangjie Wu et al.

ECCV 2024poster
#1991

EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding

Yuanming Li, Wei-Jin Huang, An-Lan Wang et al.

ECCV 2024posterarXiv:2406.08877
#1992

TrafficNight : An Aerial Multimodal Benchmark For Nighttime Vehicle Surveillance

Guoxing Zhang, Yiming Liu, xiaoyu yang et al.

ECCV 2024poster
#1993

Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning

Amandeep Kumar, Muhammad Awais, Sanath Narayan et al.

ECCV 2024posterarXiv:2406.04413
#1994

COM Kitchens: An Unedited Overhead-view Procedural Videos Dataset a Vision-Language Benchmark

Atsushi Hashimoto, Koki Maeda, Tosho Hirasawa et al.

ECCV 2024poster
#1995

DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance Fields

Yu Chi, Fangneng Zhan, Sibo Wu et al.

ECCV 2024posterarXiv:2311.12063
#1996

Unsupervised Representation Learning by Balanced Self Attention Matching

Daniel Shalam, Simon Korman

ECCV 2024posterarXiv:2408.02014
#1997

A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment

Tianhe Wu, Kede Ma, Jie Liang et al.

ECCV 2024posterarXiv:2403.10854
#1998

Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation

Fangfu Liu, Hanyang Wang, Weiliang Chen et al.

ECCV 2024posterarXiv:2403.09625
#1999

Towards Dual Transparent Liquid Level Estimation in Biomedical Lab: Dataset, Methods and Practice

Xiayu Wang, Ke Ma, Ruiyun Zhong et al.

ECCV 2024poster
#2000

On the Topology Awareness and Generalization Performance of Graph Neural Networks

Junwei Su, Chuan Wu

ECCV 2024posterarXiv:2403.04482