Most Cited 2024 &quot;sparse neural networks&quot; Papers

ECCV 2024posterarXiv:2407.13851

#2802

X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs

Swetha Sirnam, Jinyu Yang, Tal Neiman et al.

AAAI 2024paperarXiv:2401.06443

#2803

BOK-VQA: Bilingual outside Knowledge-Based Visual Question Answering via Graph Representation Pretraining

Minjun Kim, SeungWoo Song, Youhan Lee et al.

ECCV 2024posterarXiv:2408.12352

#2804

GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections

Shiyue Zhang, Zheng Chong, Xujie Zhang et al.

ECCV 2024posterarXiv:2405.10690

#2805

CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing

Faegheh Sardari, Armin Mustafa, Philip JB Jackson et al.

AAAI 2024paperarXiv:2312.05551

#2806

Multi-Dimensional Fair Federated Learning

Cong Su, Guoxian Yu, Jun Wang et al.

ICLR 2024posterarXiv:2401.17992

#2807

Multilinear Operator Networks

Yixin Cheng, Grigorios Chrysos, Markos Georgopoulos et al.

ECCV 2024posterarXiv:2403.06443

#2808

Temporal-Mapping Photography for Event Cameras

Yuhan Bao, Lei Sun, Yuqin Ma et al.

AAAI 2024paperarXiv:2312.08009

#2809

Semi-supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix

Kewei Wang, Yizheng Wu, Zhiyu Pan et al.

ICLR 2024posterarXiv:2310.00115

#2810

Learning Over Molecular Conformer Ensembles: Datasets and Benchmarks

Yanqiao Zhu, Jeehyun Hwang, Keir Adams et al.

#2811

Cross-Modal Match for Language Conditioned 3D Object Grounding

Yachao Zhang, Runze Hu, Ronghui Li et al.

ICLR 2024spotlightarXiv:2310.12975

#2812

Variational Inference for SDEs Driven by Fractional Noise

Rembert Daems, Manfred Opper, Guillaume Crevecoeur et al.

AAAI 2024paperarXiv:2402.03561

#2813

VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation

Jialu Li, Aishwarya Padmakumar, Gaurav Sukhatme et al.

ECCV 2024posterarXiv:2403.16167

#2814

Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models

Minchan Kim, Minyeong Kim, Junik Bae et al.

CVPR 2024posterarXiv:2404.03477

#2815

Towards Automated Movie Trailer Generation

Dawit Argaw Argaw, Mattia Soldan, Alejandro Pardo et al.

#2816

Multi-View Dynamic Reflection Prior for Video Glass Surface Detection

Fang Liu, Yuhao Liu, Jiaying Lin et al.

#2817

Identification of Necessary Semantic Undertakers in the Causal View for Image-Text Matching

Huatian Zhang, Lei Zhang, Kun Zhang et al.

#2818

Distributionally Robust Loss for Long-Tailed Multi-Label Image Classification

Dekun Lin, Zhe Cui, Rui Chen et al.

ECCV 2024posterarXiv:2407.12676

#2819

CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems

Jiankun Zhao, Bowen Song, Liyue Shen

AAAI 2024paperarXiv:2407.09787

#2820

Semi-supervised 3D Object Detection with PatchTeacher and PillarMix

Xiaopei Wu, Liang Peng, Liang Xie et al.

#2821

CatFormer: Category-Level 6D Object Pose Estimation with Transformer

Sheng Yu, Dihua Zhai, Yuanqing Xia

ICLR 2024posterarXiv:2310.10780

#2822

Demystifying Poisoning Backdoor Attacks from a Statistical Perspective

Ganghua Wang, Xun Xian, Ashish Kundu et al.

#2823

Mixture of Weak and Strong Experts on Graphs

Hanqing Zeng, Hanjia Lyu, Diyi Hu et al.

ICLR 2024poster

CVPR 2024posterarXiv:2312.00598

#2824

Learning from One Continuous Video Stream

Joao Carreira, Michael King, Viorica Patraucean et al.

#2825

Self-Training Based Few-Shot Node Classification by Knowledge Distillation

Zongqian Wu, Yujie Mo, Peng Zhou et al.

ECCV 2024posterarXiv:2409.20557

#2826

Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos

Mohaiminul Islam, Tushar Nagarajan, Huiyu Wang et al.

#2827

LTA-PCS: Learnable Task-Agnostic Point Cloud Sampling

Jiaheng Liu, Jianhao Li, Kaisiyuan Wang et al.

#2828

Hiding Imperceptible Noise in Curvature-Aware Patches for 3D Point Cloud Attack

Mingyu Yang, Daizong Liu, Keke Tang et al.

ECCV 2024posterarXiv:2408.12316

#2829

Unrolled Decomposed Unpaired Learning for Controllable Low-Light Video Enhancement

Lingyu Zhu, Wenhan Yang, Baoliang Chen et al.

CVPR 2024posterarXiv:2212.05315

#2830

Mind The Edge: Refining Depth Edges in Sparsely-Supervised Monocular Depth Estimation

Lior Talker, Aviad Cohen, Erez Yosef et al.

AAAI 2024paperarXiv:2307.05892

#2831

SC-NeuS: Consistent Neural Surface Reconstruction from Sparse and Noisy Views

Shi-Sheng Huang, Zixin Zou, Yichi Zhang et al.

ICLR 2024posterarXiv:2404.07863

#2832

Backdoor Contrastive Learning via Bi-level Trigger Optimization

Weiyu Sun, Xinyu Zhang, Hao LU et al.

CVPR 2024posterarXiv:2002.07756

#2833

Hierarchical Correlation Clustering and Tree Preserving Embedding

Morteza Haghir Chehreghani, Mostafa Haghir Chehreghani

ECCV 2024posterarXiv:2403.13808

#2834

On Pretraining Data Diversity for Self-Supervised Learning

Hasan Abed El Kader Hammoud, Tuhin Das, Fabio Pizzati et al.

ECCV 2024posterarXiv:2409.17316

#2835

Bi-TTA: Bidirectional Test-Time Adapter for Remote Physiological Measurement

Haodong LI, Hao LU, Yingcong Chen

#2836

Epitopological learning and Cannistraci-Hebb network shape intelligence brain-inspired theory for ultra-sparse advantage in deep learning

Yingtao Zhang, Jialin Zhao, Wenjing Wu et al.

ICLR 2024poster

ECCV 2024posterarXiv:2407.11494

#2837

Learning Semantic Latent Directions for Accurate and Controllable Human Motion Prediction

Guowei Xu, Jiale Tao, Wen Li et al.

CVPR 2024posterarXiv:2403.07244

#2838

Time-Efficient Light-Field Acquisition Using Coded Aperture and Events

Shuji Habuchi, Keita Takahashi, Chihiro Tsutake et al.

ECCV 2024posterarXiv:2408.13752

#2839

Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation

Zhaoyang Li, Yuan Wang, Wangkai Li et al.

CVPR 2024posterarXiv:2308.06699

#2840

Neural Super-Resolution for Real-time Rendering with Radiance Demodulation

Jia Li, Ziling Chen, Xiaolong Wu et al.

ICLR 2024posterarXiv:2308.10632

#2841

Foundation Model-oriented Robustness: Robust Image Model Evaluation with Pretrained Models

Peiyan Zhang, Haoyang Liu, Chaozhuo Li et al.

CVPR 2024posterarXiv:2406.04155

#2842

Improving Physics-Augmented Continuum Neural Radiance Field-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization

Takuhiro Kaneko

ECCV 2024posterarXiv:2408.13459

#2843

Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model

chen rao, Guangyuan Li, Zehua Lan et al.

ECCV 2024posterarXiv:2305.15798

#2844

BK-SDM: A Lightweight, Fast, and Cheap Version of Stable Diffusion

Bo-Kyeong Kim, Hyoung-Kyu Song, Thibault Castells et al.

ICLR 2024posterarXiv:2306.15876

#2845

Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners

Bowen Shi, XIAOPENG ZHANG, Yaoming Wang et al.

ICLR 2024posterarXiv:2401.10556

#2846

Symbol as Points: Panoptic Symbol Spotting via Point-based Representation

Wenlong Liu, Tianyu Yang, Yuhan Wang et al.

CVPR 2024posterarXiv:2405.12509

#2847

Active Object Detection with Knowledge Aggregation and Distillation from Large Models

Dejie Yang, Yang Liu

ECCV 2024posterarXiv:2305.03716

#2848

3D Small Object Detection with Dynamic Spatial Pruning

Xiuwei Xu, Zhihao Sun, Ziwei Wang et al.

ECCV 2024posterarXiv:2312.07315

#2849

NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image

Yoonwoo Jeong, Jinwoo Lee, Chiheon Kim et al.

ECCV 2024posterarXiv:2407.07324

#2850

Event-Aided Time-To-Collision Estimation for Autonomous Driving

Jinghang Li, Bangyan Liao, Xiuyuan LU et al.

ECCV 2024posterarXiv:2404.06493

#2851

Flying with Photons: Rendering Novel Views of Propagating Light

Anagh Malik, Noah Juravsky, Ryan Po et al.

CVPR 2024posterarXiv:2404.00931

#2852

GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields

Fangyin Wei, Hanlin Chen, Gim Hee Lee

CVPR 2024posterarXiv:2311.15744

#2853

One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls

Minghui Hu, Jianbin Zheng, Chuanxia Zheng et al.

#2854

Fusing Personal and Environmental Cues for Identification and Segmentation of First-Person Camera Wearers in Third-Person Views

Ziwei Zhao, Yuchen Wang, Chuhua Wang

ICLR 2024oralarXiv:2311.03309

#2855

Neural structure learning with stochastic differential equations

Benjie Wang, Joel Jennings, Wenbo Gong

CVPR 2024posterarXiv:2405.02608

#2856

UnSAMFlow: Unsupervised Optical Flow Guided by Segment Anything Model

Shuai Yuan, Lei Luo, Zhuo Hui et al.

ECCV 2024posterarXiv:2407.05352

#2857

Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model

Danni Yang, Ruohan Dong, Jiayi Ji et al.

ECCV 2024posterarXiv:2407.09826

#2858

3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance

Xiaoxu Xu, Yitian Yuan, Jinlong Li et al.

CVPR 2024posterarXiv:2311.17833

#2859

DiG-IN: Diffusion Guidance for Investigating Networks - Uncovering Classifier Differences Neuron Visualisations and Visual Counterfactual Explanations

Maximilian Augustin, Yannic Neuhaus, Matthias Hein

ICLR 2024posterarXiv:2401.10474

#2860

LDReg: Local Dimensionality Regularized Self-Supervised Learning

Hanxun Huang, Ricardo Campello, Sarah Erfani et al.

ECCV 2024posterarXiv:2311.13777

#2861

GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence

Pengyuan Wang, Takuya Ikeda, Robert Lee et al.

ECCV 2024posterarXiv:2409.07239

#2862

PiTe: Pixel-Temporal Alignment for Large Video-Language Model

Yang Liu, Pengxiang Ding, Siteng Huang et al.

#2863

TULIP: Multi-camera 3D Precision Assessment of Parkinson’s Disease

Kyungdo Kim, Sihan Lyu, Sneha Mantri et al.

#2864

Brain Netflix: Scaling Data to Reconstruct Videos from Brain Signals

Camilo Fosco, Benjamin Lahner, Bowen Pan et al.

CVPR 2024posterarXiv:2404.19696

#2865

Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners

Chun Feng, Joy Hsu, Weiyu Liu et al.

#2866

Exploring Vulnerabilities in Spiking Neural Networks: Direct Adversarial Attacks on Raw Event Data

Yanmeng Yao, Xiaohan Zhao, Bin Gu

ECCV 2024posterarXiv:2403.09468

#2867

Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing

Wonjun Kang, Kevin Galim, Hyung Il Koo

CVPR 2024posterarXiv:2401.06146

#2868

AAMDM: Accelerated Auto-regressive Motion Diffusion Model

Tianyu Li, Calvin Zhuhan Qiao, Ren Guanqiao et al.

CVPR 2024posterarXiv:2311.17951

#2869

C3Net: Compound Conditioned ControlNet for Multimodal Content Generation

Juntao Zhang, Yuehuai LIU, Yu-Wing Tai et al.

CVPR 2024posterarXiv:2406.17219

#2870

Facial Identity Anonymization via Intrinsic and Extrinsic Attention Distraction

Zhenzhong Kuang, Xiaochen Yang, Yingjie Shen et al.

ECCV 2024posterarXiv:2311.15908

#2871

Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models

Claudio Rota, Marco Buzzelli, Joost Van de Weijer

CVPR 2024posterarXiv:2403.10988

#2872

Boosting Flow-based Generative Super-Resolution Models via Learned Prior

Li-Yuan Tsao, Yi-Chen Lo, Chia-Che Chang et al.

ECCV 2024posterarXiv:2407.05008

#2873

T-CorresNet: Template Guided 3D Point Cloud Completion with Correspondence Pooling Query Generation Strategy

Fan Duan, Jiahao Yu, Li Chen

#2874

Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment

Chong Li, Xuelin Qian, Yun Wang et al.

CVPR 2024posterarXiv:2403.19501

#2875

RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method

Ming Yan, Yan Zhang, Shuqiang Cai et al.

ECCV 2024posterarXiv:2411.08606

#2876

LG-Gaze: Learning Geometry-aware Continuous Prompts for Language-Guided Gaze Estimation

Pengwei Yin, Jingjing Wang, Guanzhong Zeng et al.

ECCV 2024posterarXiv:2403.18820

#2877

MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering

Guoxing Sun, Rishabh Dabral, Pascal Fua et al.

#2878

Motion Diversification Networks

Hee Jae Kim, Eshed Ohn-Bar

CVPR 2024posterarXiv:2312.03102

#2879

Fully Convolutional Slice-to-Volume Reconstruction for Single-Stack MRI

Sean I. Young, Yaël Balbastre, Bruce Fischl et al.

AAAI 2024paperarXiv:2402.19119

#2880

VIXEN: Visual Text Comparison Network for Image Difference Captioning

Alexander Black, Jing Shi, Yifei Fan et al.

AAAI 2024paperarXiv:2306.12681

#2881

One at a Time: Progressive Multi-Step Volumetric Probability Learning for Reliable 3D Scene Perception

Bohan Li, Yasheng Sun, Jingxin Dong et al.

ECCV 2024posterarXiv:2404.07336

#2882

Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers’ Opinion Scores

Lucas Goncalves, Prashant Mathur, Chandrashekhar Lavania et al.

AAAI 2024paperarXiv:2406.08799

#2883

Pareto Front-Diverse Batch Multi-Objective Bayesian Optimization

Alaleh Ahmadianshalchi, Syrine Belakaria, Janardhan Rao Doppa

#2884

ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model

Fu-Yun Wang, Zhaoyang Huang, Qiang Ma et al.

ECCV 2024posterarXiv:2407.04086

#2885

Certifiably Robust Image Watermark

Zhengyuan Jiang, Moyang Guo, Yuepeng Hu et al.

CVPR 2024posterarXiv:2406.04999

#2886

ProMotion: Prototypes As Motion Learners

Yawen Lu, Dongfang Liu, Qifan Wang et al.

#2887

REGLO: Provable Neural Network Repair for Global Robustness Properties

Feisi Fu, Zhilu Wang, Weichao Zhou et al.

ICLR 2024spotlightarXiv:2305.01521

#2888

Unlocking the Power of Representations in Long-term Novelty-based Exploration

Alaa Saade, Steven Kapturowski, Daniele Calandriello et al.

AAAI 2024paperarXiv:2401.12497

#2889

Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning

Zizhao Wang, Caroline Wang, Xuesu Xiao et al.

AAAI 2024paperarXiv:2312.10572

#2890

Improved Anonymous Multi Agent Path Finding Algorithm

Zain Alabedeen Ali, Konstantin Yakovlev

AAAI 2024paperarXiv:2402.00084

#2891

EPSD: Early Pruning with Self-Distillation for Efficient Model Compression

Dong Chen, Ning Liu, Yichen Zhu et al.

#2892

Making Visual Sense of Oracle Bones for You and Me

Runqi Qiao, LAN YANG, Kaiyue Pang et al.

CVPR 2024posterarXiv:2310.09469

#2893

Towards More Accurate Diffusion Model Acceleration with A Timestep Tuner

Mengfei Xia, Yujun Shen, Changsong Lei et al.

AAAI 2024paperarXiv:2501.00009

#2894

Model-Driven Deep Neural Network for Enhanced AoA Estimation Using 5G gNB

Shengheng Liu, Xingkang Li, Zihuan Mao et al.

ECCV 2024posterarXiv:2407.06704

#2895

Self-supervised visual learning from interactions with objects

Arthur Aubret, Céline Teulière, Jochen Triesch

AAAI 2024paperarXiv:2312.17263

#2896

TACIT: A Target-Agnostic Feature Disentanglement Framework for Cross-Domain Text Classification

Rui Song, Fausto Giunchiglia, Yingji Li et al.

ICLR 2024posterarXiv:2311.13541

#2897

Linear Log-Normal Attention with Unbiased Concentration

Yury Nahshan, Joseph Kampeas, Emir Haleva

CVPR 2024posterarXiv:2405.10037

#2898

Bilateral Event Mining and Complementary for Event Stream Super-Resolution

Zhilin Huang, Quanmin Liang, Yijie Yu et al.

ECCV 2024posterarXiv:2407.19666

#2899

Take A Step Back: Rethinking the Two Stages in Visual Reasoning

Mingyu Zhang, Jiting Cai, Mingyu Liu et al.

CVPR 2024posterarXiv:2403.15835

#2900

Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression

Hancheng Ye, Chong Yu, Peng Ye et al.

AAAI 2024paperarXiv:2312.13066

#2901

PPEA-Depth: Progressive Parameter-Efficient Adaptation for Self-Supervised Monocular Depth Estimation

Yue-Jiang Dong, Yuan-Chen Guo, Ying-Tian Liu et al.

CVPR 2024posterarXiv:2404.01543

#2902

Efficient 3D Implicit Head Avatar with Mesh-anchored Hash Table Blendshapes

Ziqian Bai, Feitong Tan, Sean Fanello et al.

AAAI 2024paperarXiv:2312.10314

#2903

DeepCalliFont: Few-Shot Chinese Calligraphy Font Synthesis by Integrating Dual-Modality Generative Models

Yitian Liu, Zhouhui Lian

#2904

Context Enhanced Transformer for Single Image Object Detection in Video Data

Seungjun An, Seonghoon Park, Gyeongnyeon Kim et al.

ICLR 2024posterarXiv:2403.05490

#2905

Poly-View Contrastive Learning

Amitis Shidani, R Devon Hjelm, Jason Ramapuram et al.

#2906

HSR: Holistic 3D Human-Scene Reconstruction from Monocular Videos

Lixin Xue, Chen Guo, Chengwei Zheng et al.

AAAI 2024paperarXiv:2402.16312

#2907

Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users

Hantao Yang, Xutong Liu, Zhiyong Wang et al.

#2908

Link Prediction in Multilayer Networks via Cross-Network Embedding

Guojing Ren, Xiao Ding, Xiao-Ke Xu et al.

ICLR 2024posterarXiv:2309.17175

#2909

TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields

Tianyu Huang, Yihan Zeng, Bowen Dong et al.

AAAI 2024paperarXiv:2411.19451

#2910

Learning Visual Abstract Reasoning through Dual-Stream Networks

Kai Zhao, Chang Xu, Bailu Si

AAAI 2024paperarXiv:2401.15603

#2911

Improving Expressive Power of Spectral Graph Neural Networks with Eigenvalue Correction

Kangkang Lu, Yanhua Yu, Hao Fei et al.

#2912

Efficient Axiomatization of OWL 2 EL Ontologies from Data by Means of Formal Concept Analysis

Francesco Kriegel

ICLR 2024spotlightarXiv:2307.15396

#2913

Noisy Interpolation Learning with Shallow Univariate ReLU Networks

Nirmit Joshi, Gal Vardi, Nathan Srebro

AAAI 2024paperarXiv:2310.03131

#2914

Axiomatic Aggregations of Abductive Explanations

Gagan Biradar, Yacine Izza, Elita Lobo et al.

ECCV 2024posterarXiv:2407.07402

#2915

ActionVOS: Actions as Prompts for Video Object Segmentation

LIANGYANG OUYANG, Ruicong Liu, Yifei Huang et al.

CVPR 2024posterarXiv:2312.03442

#2916

High-Quality Facial Geometry and Appearance Capture at Home

Yuxuan Han, Junfeng Lyu, Feng Xu

#2917

ZOOM: Learning Video Mirror Detection with Extremely-Weak Supervision

Ke Xu, Tsun Wai Siu, Rynson W.H. Lau

CVPR 2024posterarXiv:2404.01243

#2918

A Unified and Interpretable Emotion Representation and Expression Generation

Reni Paskaleva, Mykyta Holubakha, Andela Ilic et al.

ECCV 2024posterarXiv:2405.09883

#2919

RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception

Xiaosu Zhu, Hualian Sheng, Sijia Cai et al.

AAAI 2024paperarXiv:2401.11649

#2920

A Multimodal, Multi-Task Adapting Framework for Video Action Recognition

Mengmeng Wang, Jiazheng Xing, Boyuan Jiang et al.

#2921

Comprehensive View Embedding Learning for Single-Cell Multimodal Integration

Zhenchao Tang, Jiehui Huang, Guanxing Chen et al.

#2922

PQ-SAM: Post-training Quantization for Segment Anything Model

Xiaoyu Liu, Xin Ding, Lei Yu et al.

ICLR 2024posterarXiv:2405.12398

#2923

ASMR: Activation-Sharing Multi-Resolution Coordinate Networks for Efficient Inference

Jason Chun Lok Li, Steven Luo, Le Xu et al.

ECCV 2024posterarXiv:2407.02047

#2924

CountFormer: Multi-View Crowd Counting Transformer

Hong Mo, Xiong Zhang, Jianchao Tan et al.

#2925

Improve Robustness of Reinforcement Learning against Observation Perturbations via l∞ Lipschitz Policy Networks

Buqing Nie, Jingtian Ji, Yangqing Fu et al.

ICLR 2024posterarXiv:2310.04966

#2926

Improved Active Learning via Dependent Leverage Score Sampling

Atsushi Shimizu, Xiaoou Cheng, Christopher Musco et al.

ECCV 2024posterarXiv:2406.02461

#2927

RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting

Qi Wang, Ruijie Lu, Xudong XU et al.

ECCV 2024posterarXiv:2403.12003

#2928

GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning

Xiaojie Li, Yibo Yang, Xiangtai Li et al.

ICLR 2024posterarXiv:2306.14268

#2929

Adaptive Window Pruning for Efficient Local Motion Deblurring

Haoying Li, Jixin Zhao, Shangchen Zhou et al.

ICLR 2024posterarXiv:2303.12306

#2930

Understanding Expressivity of GNN in Rule Learning

Haiquan Qiu, Yongqi Zhang, Yong Li et al.

#2931

GLDL: Graph Label Distribution Learning

Yufei Jin, Richard Gao, Yi He et al.

#2932

Factorized Diffusion Autoencoder for Unsupervised Disentangled Representation Learning

Ancong Wu, Wei-shi Zheng

ECCV 2024posterarXiv:2407.21032

#2933

Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion

Sanghyun Kim, Seohyeon Jung, Balhae Kim et al.

#2934

Unsupervised Group Re-identification via Adaptive Clustering-Driven Progressive Learning

Hongxu Chen, Quan Zhang, Jian-Huang Lai et al.

ECCV 2024posterarXiv:2409.13803

#2935

Intrinsic Single-Image HDR Reconstruction

Sebastian Dille, Chris Careaga, Yagiz Aksoy

AAAI 2024paperarXiv:2401.06799

#2936

Make Prompts Adaptable: Bayesian Modeling for Vision-Language Prompt Learning with Data-Dependent Prior

Youngjae Cho, HeeSun Bae, Seungjae Shin et al.

AAAI 2024paperarXiv:2312.17526

#2937

Noise-Free Optimization in Early Training Steps for Image Super-resolution

MinKyu Lee, Jae-Pil Heo

ICLR 2024posterarXiv:2305.17342

#2938

Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL

Xiangyu Liu, Souradip Chakraborty, Yanchao Sun et al.

AAAI 2024paperarXiv:2401.13621

#2939

DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning

Xinghao Wang, Junliang He, Pengyu Wang et al.

ICLR 2024posterarXiv:2312.16414

#2940

Bellman Optimal Stepsize Straightening of Flow-Matching Models

Bao Nguyen, Binh Nguyen, Viet Anh Nguyen

AAAI 2024paperarXiv:2308.07301

#2941

A Unified Masked Autoencoder with Patchified Skeletons for Motion Synthesis

Esteve Valls Mascaro, Hyemin Ahn, Dongheui Lee

#2942

Learning Robust Rationales for Model Explainability: A Guidance-Based Approach

Shuaibo Hu, Kui Yu

CVPR 2024posterarXiv:2404.00330

#2943

Memory-Scalable and Simplified Functional Map Learning

Robin Magnet, Maks Ovsjanikov

CVPR 2024posterarXiv:2403.20249

#2944

Relation Rectification in Diffusion Model

Yinwei Wu, Xingyi Yang, Xinchao Wang

CVPR 2024posterarXiv:2312.01964

#2945

Semantics-aware Motion Retargeting with Vision-Language Models

Haodong Zhang, ZhiKe Chen, Haocheng Xu et al.

AAAI 2024paperarXiv:2210.08106

#2946

A Primal-Dual Algorithm for Hybrid Federated Learning

Tom Overman, Garrett Blum, Diego Klabjan

ECCV 2024posterarXiv:2407.10704

#2947

Quantized Prompt for Efficient Generalization of Vision-Language Models

Tianxiang Hao, Xiaohan Ding, Juexiao Feng et al.

AAAI 2024paperarXiv:2403.02063

#2948

Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views

Shuai Guo, Qiuwen Wang, Yijie Gao et al.

#2949

DIUSum: Dynamic Image Utilization for Multimodal Summarization

Min Xiao, Junnan Zhu, Feifei Zhai et al.

AAAI 2024paperarXiv:2403.05660

#2950

Decoupling Degradations with Recurrent Network for Video Restoration in Under-Display Camera

Chengxu Liu, Xuan Wang, Yuanting Fan et al.

ECCV 2024posterarXiv:2411.06344

#2951

CityGuessr: City-Level Video Geo-Localization on a Global Scale

Parth Parag Kulkarni, Gaurav Kumar Nayak, Shah Mubarak

ECCV 2024posterarXiv:2407.12939

#2952

GenRC: Generative 3D Room Completion from Sparse Image Collections

Ming-Feng Li, Yueh-Feng Ku, Hong-Xuan Yen et al.

CVPR 2024posterarXiv:2404.13605

#2953

Turb-Seg-Res: A Segment-then-Restore Pipeline for Dynamic Videos with Atmospheric Turbulence

Ripon Saha, Dehao Qin, Nianyi Li et al.

AAAI 2024paperarXiv:2401.17186

#2954

Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning

Bang Yang, Yong Dai, Xuxin Cheng et al.

AAAI 2024paperarXiv:2312.15707

#2955

High-Fidelity Diffusion-Based Image Editing

Chen Hou, Guoqiang Wei, Zhibo Chen

AAAI 2024paperarXiv:2404.07962

#2956

Live and Learn: Continual Action Clustering with Incremental Views

Xiaoqiang Yan, Yingtao Gan, Yiqiao Mao et al.

ECCV 2024posterarXiv:2407.13254

#2957

Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation

Shoumeng Qiu, Jie Chen, Xinrun Li et al.

ECCV 2024posterarXiv:2407.09115

#2958

Layer-Wise Relevance Propagation with Conservation Property for ResNet

Seitaro Otsuki, Tsumugi Iida, Félix Doublet et al.

ECCV 2024posterarXiv:2409.16763

#2959

Statewide Visual Geolocalization in the Wild

Florian Fervers, Sebastian Bullinger, Christoph Bodensteiner et al.

AAAI 2024paperarXiv:2312.15425

#2960

Knowledge Guided Semi-supervised Learning for Quality Assessment of User Generated Videos

Shankhanil Mitra, Rajiv Soundararajan

ECCV 2024posterarXiv:2407.17596

#2961

Quality Assured: Rethinking Annotation Strategies in Imaging AI

Tim Rädsch, Annika Reinke, Vivienn Weru et al.

#2962

Evidential Uncertainty-Guided Mitochondria Segmentation for 3D EM Images

Ruohua Shi, Lingyu Duan, Tiejun Huang et al.

ECCV 2024posterarXiv:2403.13524

#2963

Compress3D: a Compressed Latent Space for 3D Generation from a Single Image

Bowen Zhang, Tianyu Yang, Yu Li et al.

CVPR 2024posterarXiv:2403.13647

#2964

Meta-Point Learning and Refining for Category-Agnostic Pose Estimation

Junjie Chen, Jiebin Yan, Yuming Fang et al.

ECCV 2024posterarXiv:2404.06836

#2965

O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation

Muer Tie, Julong Wei, Zhengjun Wang et al.

ICLR 2024spotlightarXiv:2310.10434

#2966

Equivariant Matrix Function Neural Networks

Ilyes Batatia, Lars Leon Schaaf, Gábor Csányi et al.

ECCV 2024posterarXiv:2407.09047

#2967

Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation

Wei Cong, Yang Cong, Yuyang Liu et al.

CVPR 2024posterarXiv:2404.16123

#2968

FairDeDup: Detecting and Mitigating Vision-Language Fairness Disparities in Semantic Dataset Deduplication

Eric Slyman, Stefan Lee, Scott Cohen et al.

ECCV 2024posterarXiv:2406.18537

#2969

AddBiomechanics Dataset: Capturing the Physics of Human Motion at Scale

Keenon Werling, Janelle M Kaneda, Tian Tan et al.

#2970

TurboSL: Dense Accurate and Fast 3D by Neural Inverse Structured Light

Parsa Mirdehghan, Maxx Wu, Wenzheng Chen et al.

AAAI 2024paperarXiv:2312.12080

#2971

Learning Subject-Aware Cropping by Outpainting Professional Photos

James Hong, Lu Yuan, Michaël Gharbi et al.

ECCV 2024posterarXiv:2407.13390

#2972

GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields

Xiufeng HUANG, Ka Chun Cheung, Simon See et al.

ECCV 2024posterarXiv:2407.12489

#2973

Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation

Ruijie Xu, Chuyu Zhang, Hui Ren et al.

ECCV 2024posterarXiv:2406.01194

#2974

AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation

Lorenzo Mur Labadia, Ruben Martinez-Cantin, Jose J Guerrero et al.

#2975

Intrinsic Phase-Preserving Networks for Depth Super Resolution

Xuanhong Chen, Hang Wang, Jinfan Liu et al.

#2976

Learning Diffusion Models for Multi-View Anomaly Detection

Chieh Liu, Yu-Min Chu, Ting-I Hsieh et al.

ECCV 2024posterarXiv:2403.11586

#2977

DynoSurf: Neural Deformation-based Temporally Consistent Dynamic Surface Reconstruction

Yuxin Yao, Siyu Ren, Junhui Hou et al.

ICLR 2024posterarXiv:2403.09274

#2978

EventRPG: Event Data Augmentation with Relevance Propagation Guidance

Mingyuan Sun, Donghao Zhang, Zongyuan Ge et al.

CVPR 2024highlightarXiv:2401.15261

#2979

Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes

Diandian Guo, Deng-Ping Fan, Tongyu Lu et al.

ECCV 2024posterarXiv:2406.08392

#2980

FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation

Xinzhi MU, Li Chen, Bohan CHEN et al.

ECCV 2024posterarXiv:2407.08418

#2981

PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines

Zidong Wang, Zeyu Lu, Di Huang et al.

ECCV 2024posterarXiv:2407.02665

#2982

SMILe: Leveraging Submodular Mutual Information For Robust Few-Shot Object Detection

Anay Majee, Ryan X Sharp, Rishabh Iyer

ECCV 2024posterarXiv:2403.19238

#2983

Taming Lookup Tables for Efficient Image Retouching

Sidi Yang, Binxiao Huang, Mingdeng Cao et al.

#2984

EraseDraw : Learning to Insert Objects by Erasing Them from Images

Alper Canberk, Maksym Bondarenko, Ege Ozguroglu et al.

AAAI 2024paperarXiv:2303.06678

#2985

PointPatchMix: Point Cloud Mixing with Patch Scoring

Yi Wang, Jiaze Wang, Jinpeng Li et al.

#2986

Data Disparity and Temporal Unavailability Aware Asynchronous Federated Learning for Predictive Maintenance on Transportation Fleets

Leonie von Wahl, Niklas Heidenreich, Prasenjit Mitra et al.

ECCV 2024posterarXiv:2403.15033

#2987

Toward Tiny and High-quality Facial Makeup with Data Amplify Learning

Qiaoqiao Jin, Xuanhong Chen, Meiguang Jin et al.

ECCV 2024posterarXiv:2401.06191

#2988

TriNeRFLet: A Wavelet Based Triplane NeRF Representation

Rajaei Khatib, RAJA GIRYES

#2989

CDFormer: When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution

Qingguo Liu, Chenyi Zhuang, Pan Gao et al.

AAAI 2024paperarXiv:2207.05631

#2990

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization

Wenze Chen, Shiyu Huang, Yuan Chiang et al.

CVPR 2024posterarXiv:2307.04760

#2991

Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos

Sagnik Majumder, Ziad Al-Halah, Kristen Grauman

ECCV 2024posterarXiv:2312.07485

#2992

MinD-3D: Reconstruct High-quality 3D objects in Human Brain

Jianxiong Gao, Yuqian Fu, Yun Wang et al.

ECCV 2024posterarXiv:2410.10659

#2993

PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion

Runsong Zhu, Shi Qiu, Qianyi Wu et al.

CVPR 2024posterarXiv:2403.12202

#2994

DeCoTR: Enhancing Depth Completion with 2D and 3D Attentions

Yunxiao Shi, Manish Singh, Hong Cai et al.

ECCV 2024posterarXiv:2407.07074

#2995

Hyperion – A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM

David Hug, Ignacio Alzugaray Lopez, Margarita Chli

CVPR 2024posterarXiv:2403.04245

#2996

A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition

Yusheng Dai, HangChen, Jun Du et al.

#2997

Learning Personalized Causally Invariant Representations for Heterogeneous Federated Clients

Xueyang Tang, Song Guo, Jie ZHANG et al.

ICLR 2024poster

#2998

General Point Model Pretraining with Autoencoding and Autoregressive

Zhe Li, Zhangyang Gao, Cheng Tan et al.

ECCV 2024posterarXiv:2305.15078

#2999

UniINR: Event-guided Unified Rolling Shutter Correction, Deblurring, and Interpolation

Yunfan Lu, Guoqiang Liang, Yusheng Wang et al.

CVPR 2024highlightarXiv:2405.06216

#3000

Event-based Structure-from-Orbit

Ethan Elms, Yasir Latif, Tae Ha Park et al.