Most Cited 2024 &quot;audio-driven facial animation&quot; Papers

CVPR 2024posterarXiv:2403.01781

#2802

Integrating Efficient Optimal Transport and Functional Maps For Unsupervised Shape Correspondence Learning

Tung Le, Khai Nguyen, Shanlin Sun et al.

#2803

Hiding Imperceptible Noise in Curvature-Aware Patches for 3D Point Cloud Attack

Mingyu Yang, Daizong Liu, Keke Tang et al.

ECCV 2024posterarXiv:2408.12316

#2804

Unrolled Decomposed Unpaired Learning for Controllable Low-Light Video Enhancement

Lingyu Zhu, Wenhan Yang, Baoliang Chen et al.

AAAI 2024paperarXiv:2307.05892

#2805

SC-NeuS: Consistent Neural Surface Reconstruction from Sparse and Noisy Views

Shi-Sheng Huang, Zixin Zou, Yichi Zhang et al.

ICLR 2024spotlightarXiv:2310.10434

#2806

Equivariant Matrix Function Neural Networks

Ilyes Batatia, Lars Leon Schaaf, Gábor Csányi et al.

ECCV 2024posterarXiv:2403.13808

#2807

On Pretraining Data Diversity for Self-Supervised Learning

Hasan Abed El Kader Hammoud, Tuhin Das, Fabio Pizzati et al.

ECCV 2024posterarXiv:2409.17316

#2808

Bi-TTA: Bidirectional Test-Time Adapter for Remote Physiological Measurement

Haodong LI, Hao LU, Yingcong Chen

CVPR 2024posterarXiv:2403.13647

#2809

Meta-Point Learning and Refining for Category-Agnostic Pose Estimation

Junjie Chen, Jiebin Yan, Yuming Fang et al.

ECCV 2024posterarXiv:2407.11494

#2810

Learning Semantic Latent Directions for Accurate and Controllable Human Motion Prediction

Guowei Xu, Jiale Tao, Wen Li et al.

#2811

TurboSL: Dense Accurate and Fast 3D by Neural Inverse Structured Light

Parsa Mirdehghan, Maxx Wu, Wenzheng Chen et al.

ECCV 2024posterarXiv:2408.13752

#2812

Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation

Zhaoyang Li, Yuan Wang, Wangkai Li et al.

CVPR 2024posterarXiv:2403.07244

#2813

Time-Efficient Light-Field Acquisition Using Coded Aperture and Events

Shuji Habuchi, Keita Takahashi, Chihiro Tsutake et al.

ECCV 2024posterarXiv:2408.13459

#2814

Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model

chen rao, Guangyuan Li, Zehua Lan et al.

ECCV 2024posterarXiv:2305.15798

#2815

BK-SDM: A Lightweight, Fast, and Cheap Version of Stable Diffusion

Bo-Kyeong Kim, Hyoung-Kyu Song, Thibault Castells et al.

ICLR 2024posterarXiv:2403.09274

#2816

EventRPG: Event Data Augmentation with Relevance Propagation Guidance

Mingyuan Sun, Donghao Zhang, Zongyuan Ge et al.

CVPR 2024posterarXiv:2308.06699

#2817

Neural Super-Resolution for Real-time Rendering with Radiance Demodulation

Jia Li, Ziling Chen, Xiaolong Wu et al.

ECCV 2024posterarXiv:2305.03716

#2818

3D Small Object Detection with Dynamic Spatial Pruning

Xiuwei Xu, Zhihao Sun, Ziwei Wang et al.

ECCV 2024posterarXiv:2312.07315

#2819

NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image

Yoonwoo Jeong, Jinwoo Lee, Chiheon Kim et al.

CVPR 2024posterarXiv:2406.04155

#2820

Improving Physics-Augmented Continuum Neural Radiance Field-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization

Takuhiro Kaneko

ECCV 2024posterarXiv:2407.07324

#2821

Event-Aided Time-To-Collision Estimation for Autonomous Driving

Jinghang Li, Bangyan Liao, Xiuyuan LU et al.

ECCV 2024posterarXiv:2404.06493

#2822

Flying with Photons: Rendering Novel Views of Propagating Light

Anagh Malik, Noah Juravsky, Ryan Po et al.

CVPR 2024posterarXiv:2405.12509

#2823

Active Object Detection with Knowledge Aggregation and Distillation from Large Models

Dejie Yang, Yang Liu

ECCV 2024posterarXiv:2407.05352

#2824

Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model

Danni Yang, Ruohan Dong, Jiayi Ji et al.

ECCV 2024posterarXiv:2407.09826

#2825

3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance

Xiaoxu Xu, Yitian Yuan, Jinlong Li et al.

CVPR 2024posterarXiv:2311.15744

#2826

One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls

Minghui Hu, Jianbin Zheng, Chuanxia Zheng et al.

#2827

Fusing Personal and Environmental Cues for Identification and Segmentation of First-Person Camera Wearers in Third-Person Views

Ziwei Zhao, Yuchen Wang, Chuhua Wang

CVPR 2024highlightarXiv:2401.15261

#2828

Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes

Diandian Guo, Deng-Ping Fan, Tongyu Lu et al.

ECCV 2024posterarXiv:2311.13777

#2829

GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence

Pengyuan Wang, Takuya Ikeda, Robert Lee et al.

ECCV 2024posterarXiv:2409.07239

#2830

PiTe: Pixel-Temporal Alignment for Large Video-Language Model

Yang Liu, Pengxiang Ding, Siteng Huang et al.

#2831

Brain Netflix: Scaling Data to Reconstruct Videos from Brain Signals

Camilo Fosco, Benjamin Lahner, Bowen Pan et al.

#2832

Exploring Vulnerabilities in Spiking Neural Networks: Direct Adversarial Attacks on Raw Event Data

Yanmeng Yao, Xiaohan Zhao, Bin Gu

ECCV 2024posterarXiv:2403.09468

#2833

Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing

Wonjun Kang, Kevin Galim, Hyung Il Koo

CVPR 2024posterarXiv:2002.07756

#2834

Hierarchical Correlation Clustering and Tree Preserving Embedding

Morteza Haghir Chehreghani, Mostafa Haghir Chehreghani

CVPR 2024posterarXiv:2404.19696

#2835

Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners

Chun Feng, Joy Hsu, Weiyu Liu et al.

ICLR 2024posterarXiv:2401.10556

#2836

Symbol as Points: Panoptic Symbol Spotting via Point-based Representation

Wenlong Liu, Tianyu Yang, Yuhan Wang et al.

ICLR 2024posterarXiv:2306.15876

#2837

Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners

Bowen Shi, XIAOPENG ZHANG, Yaoming Wang et al.

ECCV 2024posterarXiv:2311.15908

#2838

Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models

Claudio Rota, Marco Buzzelli, Joost Van de Weijer

CVPR 2024posterarXiv:2311.17951

#2839

C3Net: Compound Conditioned ControlNet for Multimodal Content Generation

Juntao Zhang, Yuehuai LIU, Yu-Wing Tai et al.

CVPR 2024posterarXiv:2401.06146

#2840

AAMDM: Accelerated Auto-regressive Motion Diffusion Model

Tianyu Li, Calvin Zhuhan Qiao, Ren Guanqiao et al.

ECCV 2024posterarXiv:2407.05008

#2841

T-CorresNet: Template Guided 3D Point Cloud Completion with Correspondence Pooling Query Generation Strategy

Fan Duan, Jiahao Yu, Li Chen

CVPR 2024posterarXiv:2404.00931

#2842

GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields

Fangyin Wei, Hanlin Chen, Gim Hee Lee

#2843

Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment

Chong Li, Xuelin Qian, Yun Wang et al.

ECCV 2024posterarXiv:2411.08606

#2844

LG-Gaze: Learning Geometry-aware Continuous Prompts for Language-Guided Gaze Estimation

Pengwei Yin, Jingjing Wang, Guanzhong Zeng et al.

#2845

Epitopological learning and Cannistraci-Hebb network shape intelligence brain-inspired theory for ultra-sparse advantage in deep learning

Yingtao Zhang, Jialin Zhao, Wenjing Wu et al.

ICLR 2024poster

ICLR 2024oralarXiv:2311.03309

#2846

Neural structure learning with stochastic differential equations

Benjie Wang, Joel Jennings, Wenbo Gong

ICLR 2024posterarXiv:2308.10632

#2847

Foundation Model-oriented Robustness: Robust Image Model Evaluation with Pretrained Models

Peiyan Zhang, Haoyang Liu, Chaozhuo Li et al.

ECCV 2024posterarXiv:2403.18820

#2848

MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering

Guoxing Sun, Rishabh Dabral, Pascal Fua et al.

CVPR 2024posterarXiv:2403.19501

#2849

RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method

Ming Yan, Yan Zhang, Shuqiang Cai et al.

ICLR 2024posterarXiv:2401.10474

#2850

LDReg: Local Dimensionality Regularized Self-Supervised Learning

Hanxun Huang, Ricardo Campello, Sarah Erfani et al.

ECCV 2024posterarXiv:2404.07336

#2851

Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers’ Opinion Scores

Lucas Goncalves, Prashant Mathur, Chandrashekhar Lavania et al.

#2852

Motion Diversification Networks

Hee Jae Kim, Eshed Ohn-Bar

#2853

ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model

Fu-Yun Wang, Zhaoyang Huang, Qiang Ma et al.

CVPR 2024posterarXiv:2406.17219

#2854

Facial Identity Anonymization via Intrinsic and Extrinsic Attention Distraction

Zhenzhong Kuang, Xiaochen Yang, Yingjie Shen et al.

ECCV 2024posterarXiv:2407.04086

#2855

Certifiably Robust Image Watermark

Zhengyuan Jiang, Moyang Guo, Yuepeng Hu et al.

AAAI 2024paperarXiv:2306.12681

#2856

One at a Time: Progressive Multi-Step Volumetric Probability Learning for Reliable 3D Scene Perception

Bohan Li, Yasheng Sun, Jingxin Dong et al.

AAAI 2024paperarXiv:2406.08799

#2857

Pareto Front-Diverse Batch Multi-Objective Bayesian Optimization

Alaleh Ahmadianshalchi, Syrine Belakaria, Janardhan Rao Doppa

CVPR 2024posterarXiv:2405.02608

#2858

UnSAMFlow: Unsupervised Optical Flow Guided by Segment Anything Model

Shuai Yuan, Lei Luo, Zhuo Hui et al.

AAAI 2024paperarXiv:2402.19119

#2859

VIXEN: Visual Text Comparison Network for Image Difference Captioning

Alexander Black, Jing Shi, Yifei Fan et al.

#2860

REGLO: Provable Neural Network Repair for Global Robustness Properties

Feisi Fu, Zhilu Wang, Weichao Zhou et al.

#2861

TULIP: Multi-camera 3D Precision Assessment of Parkinson’s Disease

Kyungdo Kim, Sihan Lyu, Sneha Mantri et al.

CVPR 2024posterarXiv:2311.17833

#2862

DiG-IN: Diffusion Guidance for Investigating Networks - Uncovering Classifier Differences Neuron Visualisations and Visual Counterfactual Explanations

Maximilian Augustin, Yannic Neuhaus, Matthias Hein

AAAI 2024paperarXiv:2401.12497

#2863

Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning

Zizhao Wang, Caroline Wang, Xuesu Xiao et al.

ECCV 2024posterarXiv:2407.06704

#2864

Self-supervised visual learning from interactions with objects

Arthur Aubret, Céline Teulière, Jochen Triesch

AAAI 2024paperarXiv:2501.00009

#2865

Model-Driven Deep Neural Network for Enhanced AoA Estimation Using 5G gNB

Shengheng Liu, Xingkang Li, Zihuan Mao et al.

AAAI 2024paperarXiv:2312.17263

#2866

TACIT: A Target-Agnostic Feature Disentanglement Framework for Cross-Domain Text Classification

Rui Song, Fausto Giunchiglia, Yingji Li et al.

CVPR 2024posterarXiv:2403.10988

#2867

Boosting Flow-based Generative Super-Resolution Models via Learned Prior

Li-Yuan Tsao, Yi-Chen Lo, Chia-Che Chang et al.

ECCV 2024posterarXiv:2407.19666

#2868

Take A Step Back: Rethinking the Two Stages in Visual Reasoning

Mingyu Zhang, Jiting Cai, Mingyu Liu et al.

#2869

Context Enhanced Transformer for Single Image Object Detection in Video Data

Seungjun An, Seonghoon Park, Gyeongnyeon Kim et al.

CVPR 2024posterarXiv:2312.03102

#2870

Fully Convolutional Slice-to-Volume Reconstruction for Single-Stack MRI

Sean I. Young, Yaël Balbastre, Bruce Fischl et al.

AAAI 2024paperarXiv:2312.13066

#2871

PPEA-Depth: Progressive Parameter-Efficient Adaptation for Self-Supervised Monocular Depth Estimation

Yue-Jiang Dong, Yuan-Chen Guo, Ying-Tian Liu et al.

AAAI 2024paperarXiv:2312.10314

#2872

DeepCalliFont: Few-Shot Chinese Calligraphy Font Synthesis by Integrating Dual-Modality Generative Models

Yitian Liu, Zhouhui Lian

#2873

HSR: Holistic 3D Human-Scene Reconstruction from Monocular Videos

Lixin Xue, Chen Guo, Chengwei Zheng et al.

#2874

Link Prediction in Multilayer Networks via Cross-Network Embedding

Guojing Ren, Xiao Ding, Xiao-Ke Xu et al.

AAAI 2024paperarXiv:2401.15603

#2875

Improving Expressive Power of Spectral Graph Neural Networks with Eigenvalue Correction

Kangkang Lu, Yanhua Yu, Hao Fei et al.

AAAI 2024paperarXiv:2402.16312

#2876

Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users

Hantao Yang, Xutong Liu, Zhiyong Wang et al.

CVPR 2024posterarXiv:2406.04999

#2877

ProMotion: Prototypes As Motion Learners

Yawen Lu, Dongfang Liu, Qifan Wang et al.

AAAI 2024paperarXiv:2411.19451

#2878

Learning Visual Abstract Reasoning through Dual-Stream Networks

Kai Zhao, Chang Xu, Bailu Si

#2879

Efficient Axiomatization of OWL 2 EL Ontologies from Data by Means of Formal Concept Analysis

Francesco Kriegel

CVPR 2024posterarXiv:2310.09469

#2880

Towards More Accurate Diffusion Model Acceleration with A Timestep Tuner

Mengfei Xia, Yujun Shen, Changsong Lei et al.

ECCV 2024posterarXiv:2407.07402

#2881

ActionVOS: Actions as Prompts for Video Object Segmentation

LIANGYANG OUYANG, Ruicong Liu, Yifei Huang et al.

#2882

ZOOM: Learning Video Mirror Detection with Extremely-Weak Supervision

Ke Xu, Tsun Wai Siu, Rynson W.H. Lau

ECCV 2024posterarXiv:2405.09883

#2883

RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception

Xiaosu Zhu, Hualian Sheng, Sijia Cai et al.

AAAI 2024paperarXiv:2310.03131

#2884

Axiomatic Aggregations of Abductive Explanations

Gagan Biradar, Yacine Izza, Elita Lobo et al.

AAAI 2024paperarXiv:2401.11649

#2885

A Multimodal, Multi-Task Adapting Framework for Video Action Recognition

Mengmeng Wang, Jiazheng Xing, Boyuan Jiang et al.

#2886

Comprehensive View Embedding Learning for Single-Cell Multimodal Integration

Zhenchao Tang, Jiehui Huang, Guanxing Chen et al.

#2887

Making Visual Sense of Oracle Bones for You and Me

Runqi Qiao, LAN YANG, Kaiyue Pang et al.

#2888

PQ-SAM: Post-training Quantization for Segment Anything Model

Xiaoyu Liu, Xin Ding, Lei Yu et al.

ICLR 2024posterarXiv:2305.17342

#2889

Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL

Xiangyu Liu, Souradip Chakraborty, Yanchao Sun et al.

ECCV 2024posterarXiv:2407.02047

#2890

CountFormer: Multi-View Crowd Counting Transformer

Hong Mo, Xiong Zhang, Jianchao Tan et al.

ICLR 2024spotlightarXiv:2305.01521

#2891

Unlocking the Power of Representations in Long-term Novelty-based Exploration

Alaa Saade, Steven Kapturowski, Daniele Calandriello et al.

CVPR 2024posterarXiv:2404.01543

#2892

Efficient 3D Implicit Head Avatar with Mesh-anchored Hash Table Blendshapes

Ziqian Bai, Feitong Tan, Sean Fanello et al.

ECCV 2024posterarXiv:2406.02461

#2893

RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting

Qi Wang, Ruijie Lu, Xudong XU et al.

ECCV 2024posterarXiv:2403.12003

#2894

GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning

Xiaojie Li, Yibo Yang, Xiangtai Li et al.

ICLR 2024posterarXiv:2309.17175

#2895

TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields

Tianyu Huang, Yihan Zeng, Bowen Dong et al.

CVPR 2024posterarXiv:2405.10037

#2896

Bilateral Event Mining and Complementary for Event Stream Super-Resolution

Zhilin Huang, Quanmin Liang, Yijie Yu et al.

CVPR 2024posterarXiv:2403.15835

#2897

Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression

Hancheng Ye, Chong Yu, Peng Ye et al.

#2898

GLDL: Graph Label Distribution Learning

Yufei Jin, Richard Gao, Yi He et al.

ICLR 2024posterarXiv:2311.13541

#2899

Linear Log-Normal Attention with Unbiased Concentration

Yury Nahshan, Joseph Kampeas, Emir Haleva

ECCV 2024posterarXiv:2407.21032

#2900

Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion

Sanghyun Kim, Seohyeon Jung, Balhae Kim et al.

#2901

Factorized Diffusion Autoencoder for Unsupervised Disentangled Representation Learning

Ancong Wu, Wei-shi Zheng

ECCV 2024posterarXiv:2409.13803

#2902

Intrinsic Single-Image HDR Reconstruction

Sebastian Dille, Chris Careaga, Yagiz Aksoy

AAAI 2024paperarXiv:2401.06799

#2903

Make Prompts Adaptable: Bayesian Modeling for Vision-Language Prompt Learning with Data-Dependent Prior

Youngjae Cho, HeeSun Bae, Seungjae Shin et al.

ICLR 2024posterarXiv:2403.05490

#2904

Poly-View Contrastive Learning

Amitis Shidani, R Devon Hjelm, Jason Ramapuram et al.

#2905

Unsupervised Group Re-identification via Adaptive Clustering-Driven Progressive Learning

Hongxu Chen, Quan Zhang, Jian-Huang Lai et al.

CVPR 2024posterarXiv:2403.20249

#2906

Relation Rectification in Diffusion Model

Yinwei Wu, Xingyi Yang, Xinchao Wang

AAAI 2024paperarXiv:2308.07301

#2907

A Unified Masked Autoencoder with Patchified Skeletons for Motion Synthesis

Esteve Valls Mascaro, Hyemin Ahn, Dongheui Lee

#2908

Learning Robust Rationales for Model Explainability: A Guidance-Based Approach

Shuaibo Hu, Kui Yu

AAAI 2024paperarXiv:2401.13621

#2909

DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning

Xinghao Wang, Junliang He, Pengyu Wang et al.

ECCV 2024posterarXiv:2407.10704

#2910

Quantized Prompt for Efficient Generalization of Vision-Language Models

Tianxiang Hao, Xiaohan Ding, Juexiao Feng et al.

AAAI 2024paperarXiv:2210.08106

#2911

A Primal-Dual Algorithm for Hybrid Federated Learning

Tom Overman, Garrett Blum, Diego Klabjan

CVPR 2024posterarXiv:2312.03442

#2912

High-Quality Facial Geometry and Appearance Capture at Home

Yuxuan Han, Junfeng Lyu, Feng Xu

CVPR 2024posterarXiv:2404.01243

#2913

A Unified and Interpretable Emotion Representation and Expression Generation

Reni Paskaleva, Mykyta Holubakha, Andela Ilic et al.

#2914

DIUSum: Dynamic Image Utilization for Multimodal Summarization

Min Xiao, Junnan Zhu, Feifei Zhai et al.

AAAI 2024paperarXiv:2403.05660

#2915

Decoupling Degradations with Recurrent Network for Video Restoration in Under-Display Camera

Chengxu Liu, Xuan Wang, Yuanting Fan et al.

ECCV 2024posterarXiv:2411.06344

#2916

CityGuessr: City-Level Video Geo-Localization on a Global Scale

Parth Parag Kulkarni, Gaurav Kumar Nayak, Shah Mubarak

AAAI 2024paperarXiv:2403.02063

#2917

Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views

Shuai Guo, Qiuwen Wang, Yijie Gao et al.

ICLR 2024spotlightarXiv:2307.15396

#2918

Noisy Interpolation Learning with Shallow Univariate ReLU Networks

Nirmit Joshi, Gal Vardi, Nathan Srebro

ICLR 2024posterarXiv:2310.04966

#2919

Improved Active Learning via Dependent Leverage Score Sampling

Atsushi Shimizu, Xiaoou Cheng, Christopher Musco et al.

CVPR 2024posterarXiv:2312.01964

#2920

Semantics-aware Motion Retargeting with Vision-Language Models

Haodong Zhang, ZhiKe Chen, Haocheng Xu et al.

ICLR 2024posterarXiv:2405.12398

#2921

ASMR: Activation-Sharing Multi-Resolution Coordinate Networks for Efficient Inference

Jason Chun Lok Li, Steven Luo, Le Xu et al.

ECCV 2024posterarXiv:2407.12939

#2922

GenRC: Generative 3D Room Completion from Sparse Image Collections

Ming-Feng Li, Yueh-Feng Ku, Hong-Xuan Yen et al.

ICLR 2024posterarXiv:2306.14268

#2923

Adaptive Window Pruning for Efficient Local Motion Deblurring

Haoying Li, Jixin Zhao, Shangchen Zhou et al.

AAAI 2024paperarXiv:2401.17186

#2924

Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning

Bang Yang, Yong Dai, Xuxin Cheng et al.

#2925

CDFormer: When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution

Qingguo Liu, Chenyi Zhuang, Pan Gao et al.

AAAI 2024paperarXiv:2404.07962

#2926

Live and Learn: Continual Action Clustering with Incremental Views

Xiaoqiang Yan, Yingtao Gan, Yiqiao Mao et al.

AAAI 2024paperarXiv:2312.15707

#2927

High-Fidelity Diffusion-Based Image Editing

Chen Hou, Guoqiang Wei, Zhibo Chen

ECCV 2024posterarXiv:2407.13254

#2928

Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation

Shoumeng Qiu, Jie Chen, Xinrun Li et al.

ECCV 2024posterarXiv:2407.09115

#2929

Layer-Wise Relevance Propagation with Conservation Property for ResNet

Seitaro Otsuki, Tsumugi Iida, Félix Doublet et al.

ECCV 2024posterarXiv:2409.16763

#2930

Statewide Visual Geolocalization in the Wild

Florian Fervers, Sebastian Bullinger, Christoph Bodensteiner et al.

CVPR 2024posterarXiv:2404.00330

#2931

Memory-Scalable and Simplified Functional Map Learning

Robin Magnet, Maks Ovsjanikov

ECCV 2024posterarXiv:2407.17596

#2932

Quality Assured: Rethinking Annotation Strategies in Imaging AI

Tim Rädsch, Annika Reinke, Vivienn Weru et al.

ICLR 2024posterarXiv:2303.12306

#2933

Understanding Expressivity of GNN in Rule Learning

Haiquan Qiu, Yongqi Zhang, Yong Li et al.

AAAI 2024paperarXiv:2312.15425

#2934

Knowledge Guided Semi-supervised Learning for Quality Assessment of User Generated Videos

Shankhanil Mitra, Rajiv Soundararajan

#2935

Evidential Uncertainty-Guided Mitochondria Segmentation for 3D EM Images

Ruohua Shi, Lingyu Duan, Tiejun Huang et al.

ECCV 2024posterarXiv:2403.13524

#2936

Compress3D: a Compressed Latent Space for 3D Generation from a Single Image

Bowen Zhang, Tianyu Yang, Yu Li et al.

ICLR 2024posterarXiv:2312.16414

#2937

Bellman Optimal Stepsize Straightening of Flow-Matching Models

Bao Nguyen, Binh Nguyen, Viet Anh Nguyen

ECCV 2024posterarXiv:2404.06836

#2938

O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation

Muer Tie, Julong Wei, Zhengjun Wang et al.

ECCV 2024posterarXiv:2407.09047

#2939

Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation

Wei Cong, Yang Cong, Yuyang Liu et al.

ECCV 2024posterarXiv:2406.18537

#2940

AddBiomechanics Dataset: Capturing the Physics of Human Motion at Scale

Keenon Werling, Janelle M Kaneda, Tian Tan et al.

CVPR 2024posterarXiv:2404.13605

#2941

Turb-Seg-Res: A Segment-then-Restore Pipeline for Dynamic Videos with Atmospheric Turbulence

Ripon Saha, Dehao Qin, Nianyi Li et al.

AAAI 2024paperarXiv:2312.12080

#2942

Learning Subject-Aware Cropping by Outpainting Professional Photos

James Hong, Lu Yuan, Michaël Gharbi et al.

ECCV 2024posterarXiv:2407.13390

#2943

GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields

Xiufeng HUANG, Ka Chun Cheung, Simon See et al.

ECCV 2024posterarXiv:2407.12489

#2944

Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation

Ruijie Xu, Chuyu Zhang, Hui Ren et al.

ECCV 2024posterarXiv:2406.01194

#2945

AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation

Lorenzo Mur Labadia, Ruben Martinez-Cantin, Jose J Guerrero et al.

#2946

Intrinsic Phase-Preserving Networks for Depth Super Resolution

Xuanhong Chen, Hang Wang, Jinfan Liu et al.

#2947

Learning Diffusion Models for Multi-View Anomaly Detection

Chieh Liu, Yu-Min Chu, Ting-I Hsieh et al.

ECCV 2024posterarXiv:2403.11586

#2948

DynoSurf: Neural Deformation-based Temporally Consistent Dynamic Surface Reconstruction

Yuxin Yao, Siyu Ren, Junhui Hou et al.

ECCV 2024posterarXiv:2406.08392

#2949

FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation

Xinzhi MU, Li Chen, Bohan CHEN et al.

ECCV 2024posterarXiv:2407.08418

#2950

PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines

Zidong Wang, Zeyu Lu, Di Huang et al.

ECCV 2024posterarXiv:2407.02665

#2951

SMILe: Leveraging Submodular Mutual Information For Robust Few-Shot Object Detection

Anay Majee, Ryan X Sharp, Rishabh Iyer

ECCV 2024posterarXiv:2403.19238

#2952

Taming Lookup Tables for Efficient Image Retouching

Sidi Yang, Binxiao Huang, Mingdeng Cao et al.

#2953

EraseDraw : Learning to Insert Objects by Erasing Them from Images

Alper Canberk, Maksym Bondarenko, Ege Ozguroglu et al.

AAAI 2024paperarXiv:2303.06678

#2954

PointPatchMix: Point Cloud Mixing with Patch Scoring

Yi Wang, Jiaze Wang, Jinpeng Li et al.

#2955

Data Disparity and Temporal Unavailability Aware Asynchronous Federated Learning for Predictive Maintenance on Transportation Fleets

Leonie von Wahl, Niklas Heidenreich, Prasenjit Mitra et al.

CVPR 2024posterarXiv:2307.04760

#2956

Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos

Sagnik Majumder, Ziad Al-Halah, Kristen Grauman

ECCV 2024posterarXiv:2403.15033

#2957

Toward Tiny and High-quality Facial Makeup with Data Amplify Learning

Qiaoqiao Jin, Xuanhong Chen, Meiguang Jin et al.

AAAI 2024paperarXiv:2207.05631

#2958

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization

Wenze Chen, Shiyu Huang, Yuan Chiang et al.

ECCV 2024posterarXiv:2401.06191

#2959

TriNeRFLet: A Wavelet Based Triplane NeRF Representation

Rajaei Khatib, RAJA GIRYES

CVPR 2024posterarXiv:2404.16123

#2960

FairDeDup: Detecting and Mitigating Vision-Language Fairness Disparities in Semantic Dataset Deduplication

Eric Slyman, Stefan Lee, Scott Cohen et al.

AAAI 2024paperarXiv:2407.09787

#2961

Semi-supervised 3D Object Detection with PatchTeacher and PillarMix

Xiaopei Wu, Liang Peng, Liang Xie et al.

ECCV 2024posterarXiv:2312.07485

#2962

MinD-3D: Reconstruct High-quality 3D objects in Human Brain

Jianxiong Gao, Yuqian Fu, Yun Wang et al.

ECCV 2024posterarXiv:2410.10659

#2963

PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion

Runsong Zhu, Shi Qiu, Qianyi Wu et al.

ECCV 2024posterarXiv:2407.07074

#2964

Hyperion – A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM

David Hug, Ignacio Alzugaray Lopez, Margarita Chli

CVPR 2024posterarXiv:2212.08251

#2965

Task-Adaptive Saliency Guidance for Exemplar-free Class Incremental Learning

Xialei Liu, Jiang-Tian Zhai, Andrew Bagdanov et al.

CVPR 2024posterarXiv:2310.10700

#2966

PELA: Learning Parameter-Efficient Models with Low-Rank Approximation

Yangyang Guo, Guangzhi Wang, Mohan Kankanhalli

#2967

Diffusion-FOF: Single-View Clothed Human Reconstruction via Diffusion-Based Fourier Occupancy Field

Yuanzhen Li, Fei LUO, Chunxia Xiao

ECCV 2024posterarXiv:2305.15078

#2968

UniINR: Event-guided Unified Rolling Shutter Correction, Deblurring, and Interpolation

Yunfan Lu, Guoqiang Liang, Yusheng Wang et al.

ECCV 2024posterarXiv:2404.03531

#2969

COMO: Compact Mapping and Odometry

Eric Dexheimer, Andrew Davison

ECCV 2024posterarXiv:2407.10135

#2970

FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection

Zheng Jiang, Jinqing Zhang, Yanan Zhang et al.

#2971

General Point Model Pretraining with Autoencoding and Autoregressive

Zhe Li, Zhangyang Gao, Cheng Tan et al.

CVPR 2024highlightarXiv:2302.09585

#2972

StreamingFlow: Streaming Occupancy Forecasting with Asynchronous Multi-modal Data Streams via Neural Ordinary Differential Equation

Yining Shi, Kun JIANG, Ke Wang et al.

CVPR 2024highlightarXiv:2405.06216

#2973

Event-based Structure-from-Orbit

Ethan Elms, Yasir Latif, Tae Ha Park et al.

ECCV 2024posterarXiv:2407.10918

#2974

PartImageNet++ Dataset: Scaling up Part-based Models for Robust Recognition

Xiao Li, Yining Liu, Na Dong et al.

#2975

Cross Initialization for Face Personalization of Text-to-Image Models

Lianyu Pang, Jian Yin, Haoran Xie et al.

#2976

Relational Matching for Weakly Semi-Supervised Oriented Object Detection

Wenhao Wu, Hau San Wong, Si Wu et al.

ECCV 2024posterarXiv:2407.06871

#2977

Rethinking Image-to-Video Adaptation: An Object-centric Perspective

Rui Qian, Shuangrui Ding, Dahua Lin

ECCV 2024posterarXiv:2408.17027

#2978

ConDense: Consistent 2D-3D Pre-training for Dense and Sparse Features from Multi-View Images

Xiaoshuai Zhang, Zhicheng Wang, Howard Zhou et al.

#2979

FARSE-CNN: Fully Asynchronous, Recurrent and Sparse Event-Based CNN

Riccardo Santambrogio, Marco Cannici, Matteo Matteucci

ECCV 2024posterarXiv:2405.05079

#2980

Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment

Simon Weber, Je Hyeong Hong, Daniel Cremers

CVPR 2024posterarXiv:2404.19294

#2981

Masked Spatial Propagation Network for Sparsity-Adaptive Depth Refinement

Jinyoung Jun, Jae-Han Lee, Chang-Su Kim

ECCV 2024posterarXiv:2407.12443

#2982

Preventing Catastrophic Overfitting in Fast Adversarial Training: A Bi-level Optimization Perspective

Zhaoxin Wang, Handing Wang, Cong Tian et al.

CVPR 2024posterarXiv:2406.08960

#2983

AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings

Jamie Watson, Filippo Aleotti, Mohamed Sayed et al.

ECCV 2024posterarXiv:2409.20034

#2984

Camera Calibration using a Collimator System

Shunkun Liang, Banglei Guan, Zhenbao Yu et al.

CVPR 2024posterarXiv:2302.04871

#2985

In-N-Out: Faithful 3D GAN Inversion with Volumetric Decomposition for Face Editing

Yiran Xu, Zhixin Shu, Cameron Smith et al.

ECCV 2024posterarXiv:2409.05867

#2986

Flash Cache: Reducing Bias in Radiance Cache Based Inverse Rendering

Benjamin Attal, Dor Verbin, Ben Mildenhall et al.

ECCV 2024posterarXiv:2407.09857

#2987

IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception

Shaohong Wang, Lu Bin, Xinyu Xiao et al.

#2988

Plan, Posture and Go: Towards Open-vocabulary Text-to-Motion Generation

Jinpeng Liu, Wenxun Dai, Chunyu Wang et al.

ECCV 2024posterarXiv:2407.04382

#2989

Self-Supervised Representation Learning for Adversarial Attack Detection

Yi Li, Plamen Angelov, Neeraj Suri

ECCV 2024posterarXiv:2407.06683

#2990

Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention

Xunjiang Gu, Guanyu Song, Igor Gilitschenski et al.

CVPR 2024posterarXiv:2405.19902

#2991

Learning Discriminative Dynamics with Label Corruption for Noisy Label Detection

Suyeon Kim, Dongha Lee, SeongKu Kang et al.

#2992

Sur^2f: A Hybrid Representation for High-Quality and Efficient Surface Reconstruction from Multi-view Images

Zhangjin Huang, Zhihao Liang, Kui Jia

ICLR 2024posterarXiv:2401.10632

#2993

Interventional Fairness on Partially Known Causal Graphs: A Constrained Optimization Approach

Aoqi Zuo, yiqing li, Susan Wei et al.

ICLR 2024posterarXiv:2306.16688

#2994

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores

Zhiyu Mei, Wei Fu, Jiaxuan Gao et al.

CVPR 2024posterarXiv:2404.04848

#2995

Task-Aware Encoder Control for Deep Video Compression

Xingtong Ge, Jixiang Luo, XINJIE ZHANG et al.

ECCV 2024posterarXiv:2408.08258

#2996

Snuffy: Efficient Whole Slide Image Classifier

Hossein Jafarinia, Alireza Alipanah, Saeed Razavi et al.

#2997

Markov Knowledge Distillation: Make Nasty Teachers trained by Self-undermining Knowledge Distillation Fully Distillable

En-Hui Yang, Linfeng Ye

CVPR 2024posterarXiv:2406.06813

#2998

Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation

Dong Zhao, Shuang Wang, Qi Zang et al.

ECCV 2024posterarXiv:2404.13706

#2999

Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models

Vitali Petsiuk, Kate Saenko

ECCV 2024posterarXiv:2403.04899

#3000

Towards Scene Graph Anticipation

Rohith Peddi, Saksham Singh, Saurabh . et al.