Most Cited 2024 &quot;latent dimension alignment&quot; Papers

#2802

ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model

Fu-Yun Wang, Zhaoyang Huang, Qiang Ma et al.

CVPR 2024arXiv:2403.19501

#2803

RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method

Ming Yan, Yan Zhang, Shuqiang Cai et al.

ECCV 2024arXiv:2407.04086

#2804

Certifiably Robust Image Watermark

Zhengyuan Jiang, Moyang Guo, Yuepeng Hu et al.

#2805

Motion Diversification Networks

Hee Jae Kim, Eshed Ohn-Bar

ICLR 2024spotlightarXiv:2310.10434

#2806

Equivariant Matrix Function Neural Networks

Ilyes Batatia, Lars Leon Schaaf, Gábor Csányi et al.

AAAI 2024paperarXiv:2306.12681

#2807

One at a Time: Progressive Multi-Step Volumetric Probability Learning for Reliable 3D Scene Perception

Bohan Li, Yasheng Sun, Jingxin Dong et al.

AAAI 2024paperarXiv:2402.19119

#2808

VIXEN: Visual Text Comparison Network for Image Difference Captioning

Alexander Black, Jing Shi, Yifei Fan et al.

AAAI 2024paperarXiv:2406.08799

#2809

Pareto Front-Diverse Batch Multi-Objective Bayesian Optimization

Alaleh Ahmadianshalchi, Syrine Belakaria, Janardhan Rao Doppa

ECCV 2024arXiv:2407.06704

#2810

Self-supervised visual learning from interactions with objects

Arthur Aubret, Céline Teulière, Jochen Triesch

AAAI 2024paperarXiv:2401.12497

#2811

Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning

Zizhao Wang, Caroline Wang, Xuesu Xiao et al.

AAAI 2024paperarXiv:2312.17263

#2812

TACIT: A Target-Agnostic Feature Disentanglement Framework for Cross-Domain Text Classification

Rui Song, Fausto Giunchiglia, Yingji Li et al.

ECCV 2024arXiv:2407.19666

#2813

Take A Step Back: Rethinking the Two Stages in Visual Reasoning

Mingyu Zhang, Jiting Cai, Mingyu Liu et al.

#2814

REGLO: Provable Neural Network Repair for Global Robustness Properties

Feisi Fu, Zhilu Wang, Weichao Zhou et al.

CVPR 2024highlightarXiv:2401.15261

#2815

Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes

Diandian Guo, Deng-Ping Fan, Tongyu Lu et al.

#2816

Context Enhanced Transformer for Single Image Object Detection in Video Data

Seungjun An, Seonghoon Park, Gyeongnyeon Kim et al.

#2817

HSR: Holistic 3D Human-Scene Reconstruction from Monocular Videos

Lixin Xue, Chen Guo, Chengwei Zheng et al.

#2818

Link Prediction in Multilayer Networks via Cross-Network Embedding

Guojing Ren, Xiao Ding, Xiao-Ke Xu et al.

AAAI 2024paperarXiv:2401.15603

#2819

Improving Expressive Power of Spectral Graph Neural Networks with Eigenvalue Correction

Kangkang Lu, Yanhua Yu, Hao Fei et al.

AAAI 2024paperarXiv:2312.10314

#2820

DeepCalliFont: Few-Shot Chinese Calligraphy Font Synthesis by Integrating Dual-Modality Generative Models

Yitian Liu, Zhouhui Lian

AAAI 2024paperarXiv:2501.00009

#2821

Model-Driven Deep Neural Network for Enhanced AoA Estimation Using 5G gNB

Shengheng Liu, Xingkang Li, Zihuan Mao et al.

AAAI 2024paperarXiv:2402.16312

#2822

Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users

Hantao Yang, Xutong Liu, Zhiyong Wang et al.

AAAI 2024paperarXiv:2411.19451

#2823

Learning Visual Abstract Reasoning through Dual-Stream Networks

Kai Zhao, Chang Xu, Bailu Si

#2824

Epitopological learning and Cannistraci-Hebb network shape intelligence brain-inspired theory for ultra-sparse advantage in deep learning

Yingtao Zhang, Jialin Zhao, Wenjing Wu et al.

ICLR 2024

ECCV 2024arXiv:2407.07402

#2825

ActionVOS: Actions as Prompts for Video Object Segmentation

LIANGYANG OUYANG, Ruicong Liu, Yifei Huang et al.

#2826

ZOOM: Learning Video Mirror Detection with Extremely-Weak Supervision

Ke Xu, Tsun Wai Siu, Rynson W.H. Lau

CVPR 2024arXiv:2404.00931

#2827

GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields

Fangyin Wei, Hanlin Chen, Gim Hee Lee

ECCV 2024arXiv:2405.09883

#2828

RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception

Xiaosu Zhu, Hualian Sheng, Sijia Cai et al.

ICLR 2024arXiv:2401.10556

#2829

Symbol as Points: Panoptic Symbol Spotting via Point-based Representation

Wenlong Liu, Tianyu Yang, Yuhan Wang et al.

AAAI 2024paperarXiv:2401.11649

#2830

A Multimodal, Multi-Task Adapting Framework for Video Action Recognition

Mengmeng Wang, Jiazheng Xing, Boyuan Jiang et al.

#2831

Comprehensive View Embedding Learning for Single-Cell Multimodal Integration

Zhenchao Tang, Jiehui Huang, Guanxing Chen et al.

#2832

PQ-SAM: Post-training Quantization for Segment Anything Model

Xiaoyu Liu, Xin Ding, Lei Yu et al.

#2833

Efficient Axiomatization of OWL 2 EL Ontologies from Data by Means of Formal Concept Analysis

Francesco Kriegel

ECCV 2024arXiv:2407.02047

#2834

CountFormer: Multi-View Crowd Counting Transformer

Hong Mo, Xiong Zhang, Jianchao Tan et al.

AAAI 2024paperarXiv:2310.03131

#2835

Axiomatic Aggregations of Abductive Explanations

Gagan Biradar, Yacine Izza, Elita Lobo et al.

ICLR 2024arXiv:2306.15876

#2836

Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners

Bowen Shi, XIAOPENG ZHANG, Yaoming Wang et al.

ECCV 2024arXiv:2406.02461

#2837

RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting

Qi Wang, Ruijie Lu, Xudong XU et al.

ECCV 2024arXiv:2403.12003

#2838

GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning

Xiaojie Li, Yibo Yang, Xiangtai Li et al.

CVPR 2024arXiv:2002.07756

#2839

Hierarchical Correlation Clustering and Tree Preserving Embedding

Morteza Haghir Chehreghani, Mostafa Haghir Chehreghani

AAAI 2024paperarXiv:2312.13594

#2840

Towards More Faithful Natural Language Explanation Using Multi-Level Contrastive Learning in VQA

Chengen Lai, Shengli Song, Shiqi Meng et al.

CVPR 2024arXiv:2405.02608

#2841

UnSAMFlow: Unsupervised Optical Flow Guided by Segment Anything Model

Shuai Yuan, Lei Luo, Zhuo Hui et al.

ICLR 2024arXiv:2401.10474

#2842

LDReg: Local Dimensionality Regularized Self-Supervised Learning

Hanxun Huang, Ricardo Campello, Sarah Erfani et al.

ICLR 2024oralarXiv:2311.03309

#2843

Neural structure learning with stochastic differential equations

Benjie Wang, Joel Jennings, Wenbo Gong

CVPR 2024arXiv:2311.17833

#2844

DiG-IN: Diffusion Guidance for Investigating Networks - Uncovering Classifier Differences Neuron Visualisations and Visual Counterfactual Explanations

Maximilian Augustin, Yannic Neuhaus, Matthias Hein

#2845

GLDL: Graph Label Distribution Learning

Yufei Jin, Richard Gao, Yi He et al.

ECCV 2024arXiv:2407.21032

#2846

Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion

Sanghyun Kim, Seohyeon Jung, Balhae Kim et al.

ECCV 2024arXiv:2409.13803

#2847

Intrinsic Single-Image HDR Reconstruction

Sebastian Dille, Chris Careaga, Yagiz Aksoy

ICLR 2024arXiv:2308.10632

#2848

Foundation Model-oriented Robustness: Robust Image Model Evaluation with Pretrained Models

Peiyan Zhang, Haoyang Liu, Chaozhuo Li et al.

#2849

Factorized Diffusion Autoencoder for Unsupervised Disentangled Representation Learning

Ancong Wu, Wei-shi Zheng

AAAI 2024paperarXiv:2401.04331

#2850

Coupling Graph Neural Networks with Fractional Order Continuous Dynamics: A Robustness Study

Qiyu Kang, Kai Zhao, Yang Song et al.

AAAI 2024paperarXiv:2401.06799

#2851

Make Prompts Adaptable: Bayesian Modeling for Vision-Language Prompt Learning with Data-Dependent Prior

Youngjae Cho, HeeSun Bae, Seungjae Shin et al.

#2852

Unsupervised Group Re-identification via Adaptive Clustering-Driven Progressive Learning

Hongxu Chen, Quan Zhang, Jian-Huang Lai et al.

AAAI 2024paperarXiv:2401.13621

#2853

DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning

Xinghao Wang, Junliang He, Pengyu Wang et al.

AAAI 2024paperarXiv:2308.07301

#2854

A Unified Masked Autoencoder with Patchified Skeletons for Motion Synthesis

Esteve Valls Mascaro, Hyemin Ahn, Dongheui Lee

ECCV 2024arXiv:2407.10704

#2855

Quantized Prompt for Efficient Generalization of Vision-Language Models

Tianxiang Hao, Xiaohan Ding, Juexiao Feng et al.

AAAI 2024paperarXiv:2210.08106

#2856

A Primal-Dual Algorithm for Hybrid Federated Learning

Tom Overman, Garrett Blum, Diego Klabjan

CVPR 2024arXiv:2403.10988

#2857

Boosting Flow-based Generative Super-Resolution Models via Learned Prior

Li-Yuan Tsao, Yi-Chen Lo, Chia-Che Chang et al.

AAAI 2024paperarXiv:2403.02063

#2858

Depth-Guided Robust and Fast Point Cloud Fusion NeRF for Sparse Input Views

Shuai Guo, Qiuwen Wang, Yijie Gao et al.

#2859

DIUSum: Dynamic Image Utilization for Multimodal Summarization

Min Xiao, Junnan Zhu, Feifei Zhai et al.

ECCV 2024arXiv:2411.06344

#2860

CityGuessr: City-Level Video Geo-Localization on a Global Scale

Parth Parag Kulkarni, Gaurav Kumar Nayak, Shah Mubarak

AAAI 2024paperarXiv:2403.05660

#2861

Decoupling Degradations with Recurrent Network for Video Restoration in Under-Display Camera

Chengxu Liu, Xuan Wang, Yuanting Fan et al.

CVPR 2024arXiv:2406.04999

#2862

ProMotion: Prototypes As Motion Learners

Yawen Lu, Dongfang Liu, Qifan Wang et al.

#2863

Learning Robust Rationales for Model Explainability: A Guidance-Based Approach

Shuaibo Hu, Kui Yu

ECCV 2024arXiv:2407.12939

#2864

GenRC: Generative 3D Room Completion from Sparse Image Collections

Ming-Feng Li, Yueh-Feng Ku, Hong-Xuan Yen et al.

CVPR 2024arXiv:2310.09469

#2865

Towards More Accurate Diffusion Model Acceleration with A Timestep Tuner

Mengfei Xia, Yujun Shen, Changsong Lei et al.

CVPR 2024arXiv:2312.03102

#2866

Fully Convolutional Slice-to-Volume Reconstruction for Single-Stack MRI

Sean I. Young, Yaël Balbastre, Bruce Fischl et al.

AAAI 2024paperarXiv:2401.17186

#2867

Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning

Bang Yang, Yong Dai, Xuxin Cheng et al.

#2868

Making Visual Sense of Oracle Bones for You and Me

Runqi Qiao, LAN YANG, Kaiyue Pang et al.

CVPR 2024arXiv:2405.10037

#2869

Bilateral Event Mining and Complementary for Event Stream Super-Resolution

Zhilin Huang, Quanmin Liang, Yijie Yu et al.

AAAI 2024paperarXiv:2312.15707

#2870

High-Fidelity Diffusion-Based Image Editing

Chen Hou, Guoqiang Wei, Zhibo Chen

AAAI 2024paperarXiv:2404.07962

#2871

Live and Learn: Continual Action Clustering with Incremental Views

Xiaoqiang Yan, Yingtao Gan, Yiqiao Mao et al.

ECCV 2024arXiv:2407.13254

#2872

Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation

Shoumeng Qiu, Jie Chen, Xinrun Li et al.

ECCV 2024arXiv:2407.09115

#2873

Layer-Wise Relevance Propagation with Conservation Property for ResNet

Seitaro Otsuki, Tsumugi Iida, Félix Doublet et al.

ECCV 2024arXiv:2409.16763

#2874

Statewide Visual Geolocalization in the Wild

Florian Fervers, Sebastian Bullinger, Christoph Bodensteiner et al.

ECCV 2024arXiv:2407.17596

#2875

Quality Assured: Rethinking Annotation Strategies in Imaging AI

Tim Rädsch, Annika Reinke, Vivienn Weru et al.

AAAI 2024paperarXiv:2312.15425

#2876

Knowledge Guided Semi-supervised Learning for Quality Assessment of User Generated Videos

Shankhanil Mitra, Rajiv Soundararajan

#2877

Evidential Uncertainty-Guided Mitochondria Segmentation for 3D EM Images

Ruohua Shi, Lingyu Duan, Tiejun Huang et al.

ECCV 2024arXiv:2403.13524

#2878

Compress3D: a Compressed Latent Space for 3D Generation from a Single Image

Bowen Zhang, Tianyu Yang, Yu Li et al.

ICLR 2024spotlightarXiv:2305.01521

#2879

Unlocking the Power of Representations in Long-term Novelty-based Exploration

Alaa Saade, Steven Kapturowski, Daniele Calandriello et al.

ECCV 2024arXiv:2404.06836

#2880

O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation

Muer Tie, Julong Wei, Zhengjun Wang et al.

ECCV 2024arXiv:2407.09047

#2881

Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation

Wei Cong, Yang Cong, Yuyang Liu et al.

CVPR 2024arXiv:2404.01543

#2882

Efficient 3D Implicit Head Avatar with Mesh-anchored Hash Table Blendshapes

Ziqian Bai, Feitong Tan, Sean Fanello et al.

ICLR 2024arXiv:2309.17175

#2883

TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields

Tianyu Huang, Yihan Zeng, Bowen Dong et al.

CVPR 2024arXiv:2404.01243

#2884

A Unified and Interpretable Emotion Representation and Expression Generation

Reni Paskaleva, Mykyta Holubakha, Andela Ilic et al.

ECCV 2024arXiv:2406.18537

#2885

AddBiomechanics Dataset: Capturing the Physics of Human Motion at Scale

Keenon Werling, Janelle M Kaneda, Tian Tan et al.

ICLR 2024arXiv:2311.13541

#2886

Linear Log-Normal Attention with Unbiased Concentration

Yury Nahshan, Joseph Kampeas, Emir Haleva

AAAI 2024paperarXiv:2312.12080

#2887

Learning Subject-Aware Cropping by Outpainting Professional Photos

James Hong, Lu Yuan, Michaël Gharbi et al.

ECCV 2024arXiv:2407.13390

#2888

GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields

Xiufeng HUANG, Ka Chun Cheung, Simon See et al.

ECCV 2024arXiv:2407.12489

#2889

Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation

Ruijie Xu, Chuyu Zhang, Hui Ren et al.

ECCV 2024arXiv:2406.01194

#2890

AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation

Lorenzo Mur Labadia, Ruben Martinez-Cantin, Jose J Guerrero et al.

CVPR 2024arXiv:2403.20249

#2891

Relation Rectification in Diffusion Model

Yinwei Wu, Xingyi Yang, Xinchao Wang

#2892

Learning Diffusion Models for Multi-View Anomaly Detection

Chieh Liu, Yu-Min Chu, Ting-I Hsieh et al.

ECCV 2024arXiv:2403.11586

#2893

DynoSurf: Neural Deformation-based Temporally Consistent Dynamic Surface Reconstruction

Yuxin Yao, Siyu Ren, Junhui Hou et al.

ICLR 2024arXiv:2305.17342

#2894

Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL

Xiangyu Liu, Souradip Chakraborty, Yanchao Sun et al.

CVPR 2024arXiv:2312.01964

#2895

Semantics-aware Motion Retargeting with Vision-Language Models

Haodong Zhang, ZhiKe Chen, Haocheng Xu et al.

#2896

Intrinsic Phase-Preserving Networks for Depth Super Resolution

Xuanhong Chen, Hang Wang, Jinfan Liu et al.

CVPR 2024arXiv:2403.15835

#2897

Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression

Hancheng Ye, Chong Yu, Peng Ye et al.

ICLR 2024arXiv:2403.05490

#2898

Poly-View Contrastive Learning

Amitis Shidani, R Devon Hjelm, Jason Ramapuram et al.

ECCV 2024arXiv:2406.08392

#2899

FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation

Xinzhi MU, Li Chen, Bohan CHEN et al.

ECCV 2024arXiv:2407.08418

#2900

PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines

Zidong Wang, Zeyu Lu, Di Huang et al.

ECCV 2024arXiv:2407.02665

#2901

SMILe: Leveraging Submodular Mutual Information For Robust Few-Shot Object Detection

Anay Majee, Ryan X Sharp, Rishabh Iyer

ICLR 2024arXiv:2312.16414

#2902

Bellman Optimal Stepsize Straightening of Flow-Matching Models

Bao Nguyen, Binh Nguyen, Viet Anh Nguyen

ECCV 2024arXiv:2403.19238

#2903

Taming Lookup Tables for Efficient Image Retouching

Sidi Yang, Binxiao Huang, Mingdeng Cao et al.

#2904

EraseDraw : Learning to Insert Objects by Erasing Them from Images

Alper Canberk, Maksym Bondarenko, Ege Ozguroglu et al.

AAAI 2024paperarXiv:2303.06678

#2905

PointPatchMix: Point Cloud Mixing with Patch Scoring

Yi Wang, Jiaze Wang, Jinpeng Li et al.

#2906

CDFormer: When Degradation Prediction Embraces Diffusion Model for Blind Image Super-Resolution

Qingguo Liu, Chenyi Zhuang, Pan Gao et al.

#2907

Data Disparity and Temporal Unavailability Aware Asynchronous Federated Learning for Predictive Maintenance on Transportation Fleets

Leonie von Wahl, Niklas Heidenreich, Prasenjit Mitra et al.

AAAI 2024paperarXiv:2207.05631

#2908

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization

Wenze Chen, Shiyu Huang, Yuan Chiang et al.

ECCV 2024arXiv:2403.15033

#2909

Toward Tiny and High-quality Facial Makeup with Data Amplify Learning

Qiaoqiao Jin, Xuanhong Chen, Meiguang Jin et al.

ECCV 2024arXiv:2401.06191

#2910

TriNeRFLet: A Wavelet Based Triplane NeRF Representation

Rajaei Khatib, RAJA GIRYES

CVPR 2024arXiv:2403.03037

#2911

A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives

Simone Alberto Peirone, Francesca Pistilli, Antonio Alliegro et al.

ICLR 2024arXiv:2405.12398

#2912

ASMR: Activation-Sharing Multi-Resolution Coordinate Networks for Efficient Inference

Jason Chun Lok Li, Steven Luo, Le Xu et al.

AAAI 2024paperarXiv:2407.09787

#2913

Semi-supervised 3D Object Detection with PatchTeacher and PillarMix

Xiaopei Wu, Liang Peng, Liang Xie et al.

CVPR 2024arXiv:2312.03442

#2914

High-Quality Facial Geometry and Appearance Capture at Home

Yuxuan Han, Junfeng Lyu, Feng Xu

CVPR 2024arXiv:2307.04760

#2915

Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos

Sagnik Majumder, Ziad Al-Halah, Kristen Grauman

ECCV 2024arXiv:2312.07485

#2916

MinD-3D: Reconstruct High-quality 3D objects in Human Brain

Jianxiong Gao, Yuqian Fu, Yun Wang et al.

ECCV 2024arXiv:2410.10659

#2917

PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion

Runsong Zhu, Shi Qiu, Qianyi Wu et al.

ECCV 2024arXiv:2407.07074

#2918

Hyperion – A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM

David Hug, Ignacio Alzugaray Lopez, Margarita Chli

CVPR 2024arXiv:2403.14870

#2919

VidLA: Video-Language Alignment at Scale

Mamshad Nayeem Rizve, Fan Fei, Jayakrishnan Unnikrishnan et al.

ICLR 2024spotlightarXiv:2402.09872

#2920

Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community

Arman Isajanyan, Artur Shatveryan, David Kocharian et al.

#2921

Adversarially Robust Few-shot Learning via Parameter Co-distillation of Similarity and Class Concept Learners

Junhao Dong, Piotr Koniusz, Junxi Chen et al.

ECCV 2024arXiv:2305.15078

#2922

UniINR: Event-guided Unified Rolling Shutter Correction, Deblurring, and Interpolation

Yunfan Lu, Guoqiang Liang, Yusheng Wang et al.

CVPR 2024arXiv:2310.10700

#2923

PELA: Learning Parameter-Efficient Models with Low-Rank Approximation

Yangyang Guo, Guangzhi Wang, Mohan Kankanhalli

ECCV 2024arXiv:2404.03531

#2924

COMO: Compact Mapping and Odometry

Eric Dexheimer, Andrew Davison

ECCV 2024arXiv:2407.10135

#2925

FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection

Zheng Jiang, Jinqing Zhang, Yanan Zhang et al.

ECCV 2024arXiv:2407.10918

#2926

PartImageNet++ Dataset: Scaling up Part-based Models for Robust Recognition

Xiao Li, Yining Liu, Na Dong et al.

ECCV 2024arXiv:2407.06871

#2927

Rethinking Image-to-Video Adaptation: An Object-centric Perspective

Rui Qian, Shuangrui Ding, Dahua Lin

#2928

General Point Model Pretraining with Autoencoding and Autoregressive

Zhe Li, Zhangyang Gao, Cheng Tan et al.

ECCV 2024arXiv:2408.17027

#2929

ConDense: Consistent 2D-3D Pre-training for Dense and Sparse Features from Multi-View Images

Xiaoshuai Zhang, Zhicheng Wang, Howard Zhou et al.

#2930

FARSE-CNN: Fully Asynchronous, Recurrent and Sparse Event-Based CNN

Riccardo Santambrogio, Marco Cannici, Matteo Matteucci

ECCV 2024arXiv:2405.05079

#2931

Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment

Simon Weber, Je Hyeong Hong, Daniel Cremers

CVPR 2024arXiv:2403.07244

#2932

Time-Efficient Light-Field Acquisition Using Coded Aperture and Events

Shuji Habuchi, Keita Takahashi, Chihiro Tsutake et al.

CVPR 2024highlightarXiv:2405.06216

#2933

Event-based Structure-from-Orbit

Ethan Elms, Yasir Latif, Tae Ha Park et al.

CVPR 2024highlightarXiv:2302.09585

#2934

StreamingFlow: Streaming Occupancy Forecasting with Asynchronous Multi-modal Data Streams via Neural Ordinary Differential Equation

Yining Shi, Kun JIANG, Ke Wang et al.

#2935

Cross Initialization for Face Personalization of Text-to-Image Models

Lianyu Pang, Jian Yin, Haoran Xie et al.

ECCV 2024arXiv:2407.12443

#2936

Preventing Catastrophic Overfitting in Fast Adversarial Training: A Bi-level Optimization Perspective

Zhaoxin Wang, Handing Wang, Cong Tian et al.

ECCV 2024arXiv:2409.20034

#2937

Camera Calibration using a Collimator System

Shunkun Liang, Banglei Guan, Zhenbao Yu et al.

ICLR 2024arXiv:2402.06706

#2938

CoRe-GD: A Hierarchical Framework for Scalable Graph Visualization with GNNs

Florian Grötschla, Joël Mathys, Róbert Veres et al.

ECCV 2024arXiv:2409.05867

#2939

Flash Cache: Reducing Bias in Radiance Cache Based Inverse Rendering

Benjamin Attal, Dor Verbin, Ben Mildenhall et al.

ECCV 2024arXiv:2407.09857

#2940

IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception

Shaohong Wang, Lu Bin, Xinyu Xiao et al.

#2941

Plan, Posture and Go: Towards Open-vocabulary Text-to-Motion Generation

Jinpeng Liu, Wenxun Dai, Chunyu Wang et al.

ECCV 2024arXiv:2407.04382

#2942

Self-Supervised Representation Learning for Adversarial Attack Detection

Yi Li, Plamen Angelov, Neeraj Suri

ICLR 2024arXiv:2306.00740

#2943

On the Limitations of Temperature Scaling for Distributions with Overlaps

Muthu Chidambaram, Rong Ge

ECCV 2024arXiv:2407.06683

#2944

Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention

Xunjiang Gu, Guanyu Song, Igor Gilitschenski et al.

#2945

Sur^2f: A Hybrid Representation for High-Quality and Efficient Surface Reconstruction from Multi-view Images

Zhangjin Huang, Zhihao Liang, Kui Jia

#2946

Uncertainty-aware Graph-based Hyperspectral Image Classification

Linlin Yu, Yifei Lou, Feng Chen

ICLR 2024

CVPR 2024arXiv:2405.19902

#2947

Learning Discriminative Dynamics with Label Corruption for Noisy Label Detection

Suyeon Kim, Dongha Lee, SeongKu Kang et al.

ECCV 2024arXiv:2408.08258

#2948

Snuffy: Efficient Whole Slide Image Classifier

Hossein Jafarinia, Alireza Alipanah, Saeed Razavi et al.

#2949

Markov Knowledge Distillation: Make Nasty Teachers trained by Self-undermining Knowledge Distillation Fully Distillable

En-Hui Yang, Linfeng Ye

ECCV 2024arXiv:2404.13706

#2950

Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models

Vitali Petsiuk, Kate Saenko

ECCV 2024arXiv:2403.04899

#2951

Towards Scene Graph Anticipation

Rohith Peddi, Saksham Singh, Saurabh . et al.

ECCV 2024arXiv:2403.19160

#2952

Within the Dynamic Context: Inertia-aware 3D Human Modeling with Pose Sequence

Yutong Chen, Yifan Zhan, Zhihang Zhong et al.

CVPR 2024arXiv:2404.04848

#2953

Task-Aware Encoder Control for Deep Video Compression

Xingtong Ge, Jixiang Luo, XINJIE ZHANG et al.

ECCV 2024arXiv:2401.04339

#2954

Memory-Efficient Fine-Tuning for Quantized Diffusion Model

Hyogon Ryu, Seohyun Lim, Hyunjung Shim

ICLR 2024oralarXiv:2401.08328

#2955

Un-Mixing Test-Time Normalization Statistics: Combatting Label Temporal Correlation

Devavrat Tomar, Guillaume Vray, Jean-Philippe Thiran et al.

ECCV 2024arXiv:2408.08671

#2956

Towards Physical World Backdoor Attacks against Skeleton Action Recognition

Qichen Zheng, Yi Yu, SIYUAN YANG et al.

ICLR 2024arXiv:2401.11098

#2957

Neural Auto-designer for Enhanced Quantum Kernels

Cong Lei, Yuxuan Du, Peng Mi et al.

ECCV 2024arXiv:2407.08209

#2958

Enriching Information and Preserving Semantic Consistency in Expanding Curvilinear Object Segmentation Datasets

Qin Lei, Jiang Zhong, Qizhu Dai

ECCV 2024arXiv:2403.14270

#2959

Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection

Tim Salzmann, Markus Ryll, Alex Bewley et al.

ECCV 2024arXiv:2407.11288

#2960

Zero-Shot Adaptation for Approximate Posterior Sampling of Diffusion Models in Inverse Problems

Yasar Utku Alcalar, Mehmet Akcakaya

ECCV 2024arXiv:2403.12038

#2961

Zero-Shot Image Feature Consensus with Deep Functional Maps

Xinle Cheng, Congyue Deng, Adam Harley et al.

ICLR 2024arXiv:2309.13957

#2962

Beam Enumeration: Probabilistic Explainability For Sample Efficient Self-conditioned Molecular Design

Jeff Guo, Philippe Schwaller

#2963

Prompting Future Driven Diffusion Model for Hand Motion Prediction

Bowen Tang, Kaihao Zhang, Wenhan Luo et al.

#2964

Auto-DAS: Automated Proxy Discovery for Training-free Distillation-aware Architecture Search

Haosen SUN, Lujun Li, Peijie Dong et al.

ECCV 2024arXiv:2407.05594

#2965

SLIM: Spuriousness Mitigation with Minimal Human Annotations

Xiwei Xuan, Ziquan Deng, Hsuan-Tien Lin et al.

#2966

Cross-Dimension Affinity Distillation for 3D EM Neuron Segmentation

Xiaoyu Liu, Miaomiao Cai, Yinda Chen et al.

CVPR 2024arXiv:2404.00254

#2967

Clustering for Protein Representation Learning

Ruijie Quan, Wenguan Wang, Fan Ma et al.

ECCV 2024arXiv:2404.01889

#2968

RAVE: Residual Vector Embedding for CLIP-Guided Backlit Image Enhancement

Tatiana Gaintseva, Martin Benning, Greg Slabaugh

#2969

D4-VTON: Dynamic Semantics Disentangling for Differential Diffusion based Virtual Try-On

Zhaotong Yang, Zicheng Jiang, Xinzhe Li et al.

ICLR 2024arXiv:2308.06703

#2970

Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods

Avery Ma, Yangchen Pan, Amir-massoud Farahmand

ECCV 2024arXiv:2404.07988

#2971

Parameterized Quasi-Physical Simulators for Dexterous Manipulations Transfer

Xueyi Liu, Kangbo Lyu, jieqiong zhang et al.

CVPR 2024arXiv:2212.08251

#2972

Task-Adaptive Saliency Guidance for Exemplar-free Class Incremental Learning

Xialei Liu, Jiang-Tian Zhai, Andrew Bagdanov et al.

CVPR 2024arXiv:2302.04871

#2973

In-N-Out: Faithful 3D GAN Inversion with Volumetric Decomposition for Face Editing

Yiran Xu, Zhixin Shu, Cameron Smith et al.

#2974

Relational Matching for Weakly Semi-Supervised Oriented Object Detection

Wenhao Wu, Hau San Wong, Si Wu et al.

ECCV 2024arXiv:2405.14582

#2975

PoseCrafter: One-Shot Personalized Video Synthesis Following Flexible Pose Control

Yong Zhong, Min Zhao, Zebin You et al.

AAAI 2024paperarXiv:2406.07967

#2976

Better than Random: Reliable NLG Human Evaluation with Constrained Active Sampling

Jie Ruan, Xiao Pu, Mingqi Gao et al.

AAAI 2024paperarXiv:2312.07122

#2977

Neural Reasoning about Agents’ Goals, Preferences, and Actions

Matteo Bortoletto, Lei Shi, Andreas Bulling

AAAI 2024paperarXiv:2303.13077

#2978

An Efficient Knowledge Transfer Strategy for Spiking Neural Networks from Static to Event Domain

Xiang He, Dongcheng Zhao, Yang Li et al.

AAAI 2024paperarXiv:2310.04884

#2979

Regret Analysis of Repeated Delegated Choice

Suho Shin, Keivan Rezaei, Mohammad Hajiaghayi et al.

#2980

Diffusion-FOF: Single-View Clothed Human Reconstruction via Diffusion-Based Fourier Occupancy Field

Yuanzhen Li, Fei LUO, Chunxia Xiao

AAAI 2024paperarXiv:2312.11545

#2981

Robust Communicative Multi-Agent Reinforcement Learning with Active Defense

Lebin Yu, Yunbo Qiu, Quanming Yao et al.

ICLR 2024arXiv:2306.16688

#2982

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores

Zhiyu Mei, Wei Fu, Jiaxuan Gao et al.

CVPR 2024arXiv:2406.08960

#2983

AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings

Jamie Watson, Filippo Aleotti, Mohamed Sayed et al.

ECCV 2024arXiv:2404.16828

#2984

Made to Order: Discovering monotonic temporal changes via self-supervised video ordering

Charig Yang, Weidi Xie, Andrew ZISSERMAN

AAAI 2024paperarXiv:2312.10572

#2985

Improved Anonymous Multi Agent Path Finding Algorithm

Zain Alabedeen Ali, Konstantin Yakovlev

AAAI 2024paperarXiv:2402.00084

#2986

EPSD: Early Pruning with Self-Distillation for Efficient Model Compression

Dong Chen, Ning Liu, Yichen Zhu et al.

#2987

Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Agent Preference-Based Reinforcement Learning

Tianchen Zhu, Yue Qiu, Haoyi Zhou et al.

#2988

Gaze from Origin: Learning for Generalized Gaze Estimation by Embedding the Gaze Frontalization Process

Mingjie Xu, Feng Lu

AAAI 2024paperarXiv:2403.05093

#2989

Spectrum Translation for Refinement of Image Generation (STIG) Based on Contrastive Learning and Spectral Filter Profile

Seokjun Lee, Seung-Won Jung, Hyunseok Seo

AAAI 2024paperarXiv:2312.11091

#2990

Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling

Jakob Hollenstein, Georg Martius, Justus Piater

AAAI 2024paperarXiv:2312.09783

#2991

Keep the Faith: Faithful Explanations in Convolutional Neural Networks for Case-Based Reasoning

Tom Nuno Wolf, Fabian Bongratz, Anne-Marie Rickmann et al.

AAAI 2024paperarXiv:2401.09067

#2992

Towards Continual Learning Desiderata via HSIC-Bottleneck Orthogonalization and Equiangular Embedding

Depeng Li, Tianqi Wang, Junwei Chen et al.

AAAI 2024paperarXiv:2305.08372

#2993

Hierarchical Aligned Multimodal Learning for NER on Tweet Posts

Peipei Liu, Hong Li, Yimo Ren et al.

AAAI 2024paperarXiv:2312.13066

#2994

PPEA-Depth: Progressive Parameter-Efficient Adaptation for Self-Supervised Monocular Depth Estimation

Yue-Jiang Dong, Yuan-Chen Guo, Ying-Tian Liu et al.

#2995

RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection

Ming Chang, Xishan Zhang, Rui Zhang et al.

AAAI 2024paperarXiv:2412.03825

#2996

Residual Hyperbolic Graph Convolution Networks

Yangkai Xue, Jindou Dai, Zhipeng Lu et al.

AAAI 2024paperarXiv:2305.14024

#2997

Improved Metric Distortion via Threshold Approvals

Elliot Anshelevich, Aris Filos-Ratsikas, Christopher Jerrett et al.

AAAI 2024paperarXiv:2305.06741

#2998

IVP-VAE: Modeling EHR Time Series with Initial Value Problem Solvers

Jingge Xiao, Leonie Basso, Wolfgang Nejdl et al.

AAAI 2024paperarXiv:2312.15971

#2999

Graph Context Transformation Learning for Progressive Correspondence Pruning

Junwen Guo, Guobao Xiao, Shiping Wang et al.

CVPR 2024arXiv:2403.12202

#3000

DeCoTR: Enhancing Depth Completion with 2D and 3D Attentions

Yunxiao Shi, Manish Singh, Hong Cai et al.