Most Cited 2024 "feature redundancy reduction" Papers

12,324 papers found • Page 24 of 62

#4601

Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis

Qian Chen, Shihao Shu, Xiangzhi Bai

ECCV 2024 • arXiv:2409.08042
16 citations
#4602

Gaussian Shadow Casting for Neural Characters

Luis Bolanos, Shih-Yang Su, Helge Rhodin

CVPR 2024 • arXiv:2401.06116
16 citations
#4603

Beyond MOT: Semantic Multi-Object Tracking

Yunhao Li, Qin Li, Hao Wang et al.

ECCV 2024 • arXiv:2403.05021
16 citations
#4604

Breaking Physical and Linguistic Borders: Multilingual Federated Prompt Tuning for Low-Resource Languages

Wanru Zhao, Yihong Chen, Royson Lee et al.

ICLR 2024 • arXiv:2507.03003
16 citations
#4605

History Matters: Temporal Knowledge Editing in Large Language Model

Xunjian Yin, Jin Jiang, Liming Yang et al.

AAAI 2024 • arXiv:2312.05497
16 citations
#4606

Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models

Yu-Chu Yu, Chi-Pin Huang, Jr-Jen Chen et al.

ECCV 2024 • arXiv:2403.09296
16 citations
#4607

SurfPro: Functional Protein Design Based on Continuous Surface

Zhenqiao Song, Tinglin Huang, Lei Li et al.

ICML 2024 • arXiv:2405.06693
16 citations
#4608

Harnessing the Power of Neural Operators with Automatically Encoded Conservation Laws

Ning Liu, Yiming Fan, Xianyi Zeng et al.

ICML 2024 • spotlight • arXiv:2312.11176
16 citations
#4609

LookupViT: Compressing visual information to a limited number of tokens

Rajat Koner, Gagan Jain, Sujoy Paul et al.

ECCV 2024 • arXiv:2407.12753
16 citations
#4610

ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting

Chen Duan, Pei Fu, Shan Guo et al.

CVPR 2024 • arXiv:2403.00303
16 citations
#4611

The Hard Positive Truth about Vision-Language Compositionality

Amita Kamath, Cheng-Yu Hsieh, Kai-Wei Chang et al.

ECCV 2024 • arXiv:2409.17958
16 citations
#4612

DiffEnc: Variational Diffusion with a Learned Encoder

Beatrix M. G. Nielsen, Anders Christensen, Andrea Dittadi et al.

ICLR 2024 • arXiv:2310.19789
16 citations
#4613

LoCoCo: Dropping In Convolutions for Long Context Compression

Ruisi Cai, Yuandong Tian, Zhangyang “Atlas” Wang et al.

ICML 2024 • arXiv:2406.05317
16 citations
#4614

RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation

Luis Li, Hubert P. H. Shum, Toby P Breckon

ECCV 2024 • arXiv:2407.10159
16 citations
#4615

EMIE-MAP: Large-Scale Road Surface Reconstruction Based on Explicit Mesh and Implicit Encoding

Wenhua Wu, Qi Wang, Guangming Wang et al.

ECCV 2024 • arXiv:2403.11789
16 citations
#4616

CSL: Class-Agnostic Structure-Constrained Learning for Segmentation including the Unseen

Hao Zhang, Fang Li, Lu Qi et al.

AAAI 2024 • arXiv:2312.05538
16 citations
#4617

Masked Completion via Structured Diffusion with White-Box Transformers

Druv Pai, Sam Buchanan, Ziyang Wu et al.

ICLR 2024 • arXiv:2404.02446
16 citations
#4618

Teach CLIP to Develop a Number Sense for Ordinal Regression

Yao Du, Qiang Zhai, Weihang Dai et al.

ECCV 2024 • arXiv:2408.03574
16 citations
#4619

Adversarial Score Distillation: When score distillation meets GAN

Min Wei, Jingkai Zhou, Junyao Sun et al.

CVPR 2024 • arXiv:2312.00739
16 citations
#4620

TimeLens-XL: Real-time Event-based Video Frame Interpolation with Large Motion

Shi Guo, Yutian Chen, Tianfan Xue et al.

ECCV 2024
16 citations
#4621

DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition

Qi Wang, Zhou Xu, Yuming Lin et al.

ECCV 2024 • arXiv:2407.05106
16 citations
#4622

Robustifying State-space Models for Long Sequences via Approximate Diagonalization

Annan Yu, Arnur Nigmetov, Dmitriy Morozov et al.

ICLR 2024 • spotlight • arXiv:2310.01698
16 citations
#4623

DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery

Yixuan Zhu, Ao Li, Yansong Tang et al.

CVPR 2024 • arXiv:2404.01424
16 citations
#4624

FRIH: Fine-Grained Region-Aware Image Harmonization

Jinlong Peng, Zekun Luo, Liang Liu et al.

AAAI 2024 • arXiv:2205.06448
16 citations
#4625

Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation

Shihao Zhao, Shaozhe Hao, Bojia Zi et al.

ECCV 2024 • arXiv:2403.07860
16 citations
#4626

Image Demoireing in RAW and sRGB Domains

Shuning Xu, Binbin Song, Xiangyu Chen et al.

ECCV 2024 • arXiv:2312.09063
16 citations
#4627

Multimarginal Generative Modeling with Stochastic Interpolants

Michael Albergo, Nicholas Boffi, Michael Lindsey et al.

ICLR 2024 • arXiv:2310.03695
16 citations
#4628

Towards Improved Proxy-Based Deep Metric Learning via Data-Augmented Domain Adaptation

Li Ren, Chen Chen, Liqiang Wang et al.

AAAI 2024 • arXiv:2401.00617
16 citations
#4629

SURER: Structure-Adaptive Unified Graph Neural Network for Multi-View Clustering

Jing Wang, Songhe Feng, Gengyu Lyu et al.

AAAI 2024
16 citations
#4630

Combating Data Imbalances in Federated Semi-supervised Learning with Dual Regulators

Sikai Bai, Shuaicheng Li, Weiming Zhuang et al.

AAAI 2024 • arXiv:2307.05358
16 citations
#4631

PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling

Xiaoyun Zheng, Liwei Liao, Xufeng Li et al.

CVPR 2024 • arXiv:2403.16080
16 citations
#4632

AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation

Jeongsoo Choi, Se Jin Park, Minsu Kim et al.

CVPR 2024 • highlight • arXiv:2312.02512
16 citations
#4633

Non-stationary Projection-Free Online Learning with Dynamic and Adaptive Regret Guarantees

Yibo Wang, Wenhao Yang, Wei Jiang et al.

AAAI 2024 • arXiv:2305.11726
16 citations
#4634

Progressive Divide-and-Conquer via Subsampling Decomposition for Accelerated MRI

Chong Wang, Lanqing Guo, Yufei Wang et al.

CVPR 2024 • highlight • arXiv:2403.10064
16 citations
#4635

Auto-GAS: Automated Proxy Discovery for Training-free Generative Architecture Search

Lujun Li, Haosen Sun, Shiwen Li et al.

ECCV 2024
16 citations
#4636

BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference

Siqi Kou, Lei Gan, Dequan Wang et al.

ICLR 2024 • arXiv:2310.11142
16 citations
#4637

Self-Supervised Multi-Modal Knowledge Graph Contrastive Hashing for Cross-Modal Search

Meiyu Liang, Junping Du, Zhengyang Liang et al.

AAAI 2024
16 citations
#4638

Keypoint-based Progressive Chain-of-Thought Distillation for LLMs

Kaituo Feng, Changsheng Li, Xiaolu Zhang et al.

ICML 2024 • arXiv:2405.16064
16 citations
#4639

A Comprehensive Augmentation Framework for Anomaly Detection

Lin Jiang, Yaping Yan

AAAI 2024 • arXiv:2308.15068
16 citations
#4640

What Matters to You? Towards Visual Representation Alignment for Robot Learning

Thomas Tian, Chenfeng Xu, Masayoshi Tomizuka et al.

ICLR 2024 • oral • arXiv:2310.07932
16 citations
#4641

Diffusion Bridges for 3D Point Cloud Denoising

Mathias Vogel, Keisuke Tateno, Marc Pollefeys et al.

ECCV 2024 • arXiv:2408.16325
16 citations
#4642

Make Me a BNN: A Simple Strategy for Estimating Bayesian Uncertainty from Pre-trained Models

Gianni Franchi, Olivier Laurent, Maxence Leguéry et al.

CVPR 2024 • arXiv:2312.15297
16 citations
#4643

IS-DARTS: Stabilizing DARTS through Precise Measurement on Candidate Importance

Hongyi He, Longjun Liu, Haonan Zhang et al.

AAAI 2024 • arXiv:2312.12648
16 citations
#4644

Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition

Sergio Izquierdo, Javier Civera

ECCV 2024 • arXiv:2407.02422
16 citations
#4645

On Differentially Private Federated Linear Contextual Bandits

Xingyu Zhou, Sayak Ray Chowdhury

ICLR 2024 • arXiv:2302.13945
16 citations
#4646

NeRF Director: Revisiting View Selection in Neural Volume Rendering

Wenhui Xiao, Rodrigo Santa Cruz, David Ahmedt-Aristizabal et al.

CVPR 2024 • arXiv:2406.08839
16 citations
#4647

Non-negative Contrastive Learning

Yifei Wang, Qi Zhang, Yaoyu Guo et al.

ICLR 2024 • arXiv:2403.12459
16 citations
#4648

Transfer and Alignment Network for Generalized Category Discovery

Wenbin An, Feng Tian, Wenkai Shi et al.

AAAI 2024 • arXiv:2312.16467
16 citations
#4649

InNeRF360: Text-Guided 3D-Consistent Object Inpainting on 360-degree Neural Radiance Fields

Dongqing Wang, Tong Zhang, Alaa Abboud et al.

CVPR 2024 • arXiv:2305.15094
16 citations
#4650

Multiple View Geometry Transformers for 3D Human Pose Estimation

Ziwei Liao, Jialiang Zhu, Chunyu Wang et al.

CVPR 2024 • arXiv:2311.10983
16 citations
#4651

TP2O: Creative Text Pair-to-Object Generation using Balance Swap-Sampling

Jun Li, Zedong Zhang, Jian Yang

ECCV 2024 • arXiv:2310.01819
16 citations
#4652

DualAD: Disentangling the Dynamic and Static World for End-to-End Driving

Simon Doll, Niklas Hanselmann, Lukas Schneider et al.

CVPR 2024 • arXiv:2406.06264
16 citations
#4653

Acquiring Diverse Skills using Curriculum Reinforcement Learning with Mixture of Experts

Onur Celik, Aleksandar Taranovic, Gerhard Neumann

ICML 2024 • arXiv:2403.06966
16 citations
#4654

Confronting Ambiguity in 6D Object Pose Estimation via Score-Based Diffusion on SE(3)

Tsu-Ching Hsiao, Hao-Wei Chen, Hsuan-Kung Yang et al.

CVPR 2024 • arXiv:2305.15873
16 citations
#4655

ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object

Chenshuang Zhang, Fei Pan, Junmo Kim et al.

CVPR 2024 • highlight • arXiv:2403.18775
16 citations
#4656

Gaussian Mixture Proposals with Pull-Push Learning Scheme to Capture Diverse Events for Weakly Supervised Temporal Video Grounding

AAAI 2024 • arXiv:2312.16388
16 citations
#4657

Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches

Qing Yu, Mikihiro Tanaka, Kent Fujiwara

CVPR 2024 • arXiv:2405.04771
16 citations
#4658

Wired Perspectives: Multi-View Wire Art Embraces Generative AI

Zhiyu Qu, Lan Yang, Honggang Zhang et al.

CVPR 2024 • arXiv:2311.15421
16 citations
#4659

Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training

Yipeng Gao, Zeyu Wang, Wei-Shi Zheng et al.

CVPR 2024 • arXiv:2311.01734
16 citations
#4660

CoT3DRef: Chain-of-Thoughts Data-Efficient 3D Visual Grounding

Eslam Abdelrahman, Mohamed Ayman Mohamed, Mahmoud Ahmed et al.

ICLR 2024 • arXiv:2310.06214
16 citations
#4661

Truly No-Regret Learning in Constrained MDPs

Adrian Müller, Pragnya Alatur, Volkan Cevher et al.

ICML 2024 • spotlight • arXiv:2402.15776
16 citations
#4662

Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective

Ming Zhong, Chenxin An, Weizhu Chen et al.

ICLR 2024 • arXiv:2310.11451
16 citations
#4663

Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models

Matthew Kowal, Richard P. Wildes, Kosta Derpanis

CVPR 2024 • highlight • arXiv:2404.02233
16 citations
#4664

DART: Implicit Doppler Tomography for Radar Novel View Synthesis

Tianshu Huang, John Miller, Akarsh Prabhakara et al.

CVPR 2024 • arXiv:2403.03896
16 citations
#4665

E2HQV: High-Quality Video Generation from Event Camera via Theory-Inspired Model-Aided Deep Learning

Qiang Qu, Yiran Shen, Xiaoming Chen et al.

AAAI 2024 • arXiv:2401.08117
16 citations
#4666

PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation

Zhenyu Li, Shariq Farooq Bhat, Peter Wonka

ECCV 2024 • arXiv:2406.06679
16 citations
#4667

A Simple Background Augmentation Method for Object Detection with Diffusion Model

Yuhang Li, Xin Dong, Chen Chen et al.

ECCV 2024 • arXiv:2408.00350
16 citations
#4668

Joint Demosaicing and Denoising for Spike Camera

Yanchen Dong, Ruiqin Xiong, Jing Zhao et al.

AAAI 2024
16 citations
#4669

What, How, and When Should Object Detectors Update in Continually Changing Test Domains?

Jayeon Yoo, Dongkwan Lee, Inseop Chung et al.

CVPR 2024 • arXiv:2312.08875
16 citations
#4670

Sliced Wasserstein Estimation with Control Variates

Khai Nguyen, Nhat Ho

ICLR 2024 • arXiv:2305.00402
16 citations
#4671

Programmable Motion Generation for Open-Set Motion Control Tasks

Hanchao Liu, Xiaohang Zhan, Shaoli Huang et al.

CVPR 2024 • highlight • arXiv:2405.19283
16 citations
#4672

Revealing the Proximate Long-Tail Distribution in Compositional Zero-Shot Learning

Chenyi Jiang, Haofeng Zhang

AAAI 2024 • arXiv:2312.15923
16 citations
#4673

In-Context Learning Agents Are Asymmetric Belief Updaters

Johannes A. Schubert, Akshay Kumar Jagadish, Marcel Binz et al.

ICML 2024 • arXiv:2402.03969
16 citations
#4674

$H$-Consistency Guarantees for Regression

Anqi Mao, Mehryar Mohri, Yutao Zhong

ICML 2024 • arXiv:2403.19480
16 citations
#4675

KD-DETR: Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling

Yu Wang, Xin Li, Shengzhao Wen et al.

CVPR 2024 • arXiv:2211.08071
16 citations
#4676

TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Autonomous Driving

Cheng Zhao, Su Sun, Ruoyu Wang et al.

ECCV 2024 • arXiv:2404.02410
16 citations
#4677

Grounded Object-Centric Learning

Avinash Kori, Francesco Locatello, Fabio De Sousa Ribeiro et al.

ICLR 2024
16 citations
#4678

Interactive3D: Create What You Want by Interactive 3D Generation

Shaocong Dong, Lihe Ding, Zhanpeng Huang et al.

CVPR 2024 • arXiv:2404.16510
16 citations
#4679

LDP: Language-driven Dual-Pixel Image Defocus Deblurring Network

Hao Yang, Liyuan Pan, Yan Yang et al.

CVPR 2024 • arXiv:2307.09815
16 citations
#4680

Commonsense Prototype for Outdoor Unsupervised 3D Object Detection

Hai Wu, Shijia Zhao, Xun Huang et al.

CVPR 2024 • arXiv:2404.16493
16 citations
#4681

SURE: SUrvey REcipes for building reliable and robust deep networks

Yuting Li, Yingyi Chen, Xuanlong Yu et al.

CVPR 2024 • arXiv:2403.00543
16 citations
#4682

Language Models Represent Beliefs of Self and Others

Wentao Zhu, Zhining Zhang, Yizhou Wang

ICML 2024 • arXiv:2402.18496
16 citations
#4683

RobustTSF: Towards Theory and Design of Robust Time Series Forecasting with Anomalies

Hao Cheng, Qingsong Wen, Yang Liu et al.

ICLR 2024 • arXiv:2402.02032
16 citations
#4684

Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning

Kaibin Tian, Yanhua Cheng, Yi Liu et al.

AAAI 2024 • arXiv:2401.00701
16 citations
#4685

Neural Spectral Methods: Self-supervised learning in the spectral domain

Yiheng Du, Nithin Chalapathi, Aditi Krishnapriyan

ICLR 2024 • oral • arXiv:2312.05225
16 citations
#4686

The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective

Wenqi Jia, Miao Liu, Hao Jiang et al.

CVPR 2024 • arXiv:2312.12870
16 citations
#4687

DiffSCI: Zero-Shot Snapshot Compressive Imaging via Iterative Spectral Diffusion Model

Zhenghao Pan, Haijin Zeng, Jiezhang Cao et al.

CVPR 2024 • arXiv:2311.11417
16 citations
#4688

Generating Content for HDR Deghosting from Frequency View

Tao Hu, Qingsen Yan, Yuankai Qi et al.

CVPR 2024 • arXiv:2404.00849
16 citations
#4689

TapMo: Shape-aware Motion Generation of Skeleton-free Characters

Jiaxu Zhang, Shaoli Huang, Zhigang Tu et al.

ICLR 2024 • arXiv:2310.12678
16 citations
#4690

TransCAD: A Hierarchical Transformer for CAD Sequence Inference from Point Clouds

Elona Dupont, Kseniya Cherenkova, Dimitrios Mallis et al.

ECCV 2024 • arXiv:2407.12702
16 citations
#4691

Unlocking Pre-trained Image Backbones for Semantic Image Synthesis

Tariq Berrada, Jakob Verbeek, Camille Couprie et al.

CVPR 2024 • arXiv:2312.13314
16 citations
#4692

InstructDET: Diversifying Referring Object Detection with Generalized Instructions

Ronghao Dang, Jiangyan Feng, Haodong Zhang et al.

ICLR 2024 • arXiv:2310.05136
16 citations
#4693

CoLoRA: Continuous low-rank adaptation for reduced implicit neural modeling of parameterized partial differential equations

Jules Berman, Benjamin Peherstorfer

ICML 2024 • arXiv:2402.14646
16 citations
#4694

SLICE: Stabilized LIME for Consistent Explanations for Image Classification

Revoti Prasad Bora, Kiran Raja, Philipp Terhörst et al.

CVPR 2024 • highlight
16 citations
#4695

Towards Detailed Text-to-Motion Synthesis via Basic-to-Advanced Hierarchical Diffusion Model

Zhenyu Xie, Yang Wu, Xuehao Gao et al.

AAAI 2024 • arXiv:2312.10960
16 citations
#4696

I Can't Believe It's Not Scene Flow!

Ishan Khatri, Kyle Vedder, Neehar Peri et al.

ECCV 2024 • arXiv:2403.04739
16 citations
#4697

Characteristic Guidance: Non-linear Correction for Diffusion Model at Large Guidance Scale

Candi Zheng, Yuan Lan

ICML 2024 • arXiv:2312.07586
16 citations
#4698

Towards Robust Fidelity for Evaluating Explainability of Graph Neural Networks

Xu Zheng, Farhad Shirani, Tianchun Wang et al.

ICLR 2024 • spotlight • arXiv:2310.01820
16 citations
#4699

Neural Diffusion Models

Grigory Bartosh, Dmitry Vetrov, Christian Andersson Naesseth

ICML 2024 • arXiv:2310.08337
16 citations
#4700

Token Transformation Matters: Towards Faithful Post-hoc Explanation for Vision Transformer

Junyi Wu, Bin Duan, Weitai Kang et al.

CVPR 2024 • arXiv:2403.14552
16 citations
#4701

Robust and Conjugate Gaussian Process Regression

Matias Altamirano, Francois-Xavier Briol, Jeremias Knoblauch

ICML 2024 • spotlight • arXiv:2311.00463
16 citations
#4702

ConceptBed: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models

Maitreya Patel, Tejas Gokhale, Chitta Baral et al.

AAAI 2024 • arXiv:2306.04695
16 citations
#4703

Learning Instance-Aware Correspondences for Robust Multi-Instance Point Cloud Registration in Cluttered Scenes

Zhiyuan Yu, Zheng Qin, Lintao Zheng et al.

CVPR 2024 • arXiv:2404.04557
16 citations
#4704

Compositional Generative Inverse Design

Tailin Wu, Takashi Maruyama, Long Wei et al.

ICLR 2024 • spotlight • arXiv:2401.13171
16 citations
#4705

Content-Style Decoupling for Unsupervised Makeup Transfer without Generating Pseudo Ground Truth

Zhaoyang Sun, Shengwu Xiong, Yaxiong Chen et al.

CVPR 2024 • arXiv:2405.17240
16 citations
#4706

Learning Encodings for Constructive Neural Combinatorial Optimization Needs to Regret

Rui Sun, Zhi Zheng, Zhenkun Wang

AAAI 2024
16 citations
#4707

Learning 3D Particle-based Simulators from RGB-D Videos

William Whitney, Tatiana Lopez-Guevara, Tobias Pfaff et al.

ICLR 2024 • arXiv:2312.05359
16 citations
#4708

Understanding Unimodal Bias in Multimodal Deep Linear Networks

Yedi Zhang, Peter Latham, Andrew Saxe

ICML 2024 • arXiv:2312.00935
16 citations
#4709

ScanFormer: Referring Expression Comprehension by Iteratively Scanning

Wei Su, Peihan Miao, Huanzhang Dou et al.

CVPR 2024 • arXiv:2406.18048
16 citations
#4710

FrameQuant: Flexible Low-Bit Quantization for Transformers

Harshavardhan Adepu, Zhanpeng Zeng, Li Zhang et al.

ICML 2024 • arXiv:2403.06082
16 citations
#4711

ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open Vocabulary Object Detection

Joonhyun Jeong, Geondo Park, Jayeon Yoo et al.

AAAI 2024 • arXiv:2312.07266
16 citations
#4712

Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields

Haoyuan Wang, Wenbo Hu, Lei Zhu et al.

CVPR 2024 • arXiv:2403.16224
16 citations
#4713

Position: A Call to Action for a Human-Centered AutoML Paradigm

Marius Lindauer, Florian Karl, Anne Klier et al.

ICML 2024 • arXiv:2406.03348
16 citations
#4714

Online Cascade Learning for Efficient Inference over Streams

Lunyiu Nie, Zhimin Ding, Erdong Hu et al.

ICML 2024 • arXiv:2402.04513
16 citations
#4715

PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control

Ruijie Zheng, Ching-An Cheng, Hal Daumé et al.

ICML 2024 • oral • arXiv:2402.10450
16 citations
#4716

ViewFusion: Towards Multi-View Consistency via Interpolated Denoising

Xianghui Yang, Gil Avraham, Yan Zuo et al.

CVPR 2024 • arXiv:2402.18842
16 citations
#4717

End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations

Lirui Luo, Guoxi Zhang, Hongming Xu et al.

ICML 2024 • spotlight • arXiv:2403.12451
16 citations
#4718

DDMI: Domain-agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations

Dogyun Park, Sihyeon Kim, Sojin Lee et al.

ICLR 2024 • arXiv:2401.12517
16 citations
#4719

Video Prediction by Modeling Videos as Continuous Multi-Dimensional Processes

Gaurav Shrivastava, Abhinav Shrivastava

CVPR 2024
16 citations
#4720

Guess The Unseen: Dynamic 3D Scene Reconstruction from Partial 2D Glimpses

Inhee Lee, Byungjun Kim, Hanbyul Joo

CVPR 2024 • arXiv:2404.14410
16 citations
#4721

Data-Efficient Learning via Clustering-Based Sensitivity Sampling: Foundation Models and Beyond

Kyriakos Axiotis, Vincent Cohen-Addad, Monika Henzinger et al.

ICML 2024 • arXiv:2402.17327
16 citations
#4722

MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration

Yulin Ren, Xin Li, Bingchen Li et al.

ECCV 2024 • arXiv:2407.10833
16 citations
#4723

MolTailor: Tailoring Chemical Molecular Representation to Specific Tasks via Text Prompts

Haoqiang Guo, Sendong Zhao, Haochun Wang et al.

AAAI 2024 • arXiv:2401.11403
16 citations
#4724

A Geometric Explanation of the Likelihood OOD Detection Paradox

Hamidreza Kamkari, Brendan Ross, Jesse Cresswell et al.

ICML 2024 • arXiv:2403.18910
16 citations
#4725

AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation

Zanlin Ni, Yulin Wang, Renping Zhou et al.

ECCV 2024 • arXiv:2409.00342
16 citations
#4726

Label-Noise Robust Diffusion Models

Byeonghu Na, Yeongmin Kim, HeeSun Bae et al.

ICLR 2024 • arXiv:2402.17517
16 citations
#4727

Hybrid Functional Maps for Crease-Aware Non-Isometric Shape Matching

Lennart Bastian, Yizheng Xie, Nassir Navab et al.

CVPR 2024 • arXiv:2312.03678
16 citations
#4728

Factorized Explainer for Graph Neural Networks

AAAI 2024 • arXiv:2312.05596
16 citations
#4729

Jointly-Learned Exit and Inference for a Dynamic Neural Network

Florence Regol, Joud Chataoui, Mark Coates

ICLR 2024 • arXiv:2310.09163
16 citations
#4730

MCPNet: An Interpretable Classifier via Multi-Level Concept Prototypes

Bor Shiun Wang, Chien-Yi Wang, Wei-Chen Chiu

CVPR 2024 • arXiv:2404.08968
16 citations
#4731

SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection

Yun Zhu, Le Hui, Yaqi Shen et al.

AAAI 2024 • arXiv:2312.13641
16 citations
#4732

S$^2$AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic

Safa Messaoud, Billel Mokeddem, Zhenghai Xue et al.

ICLR 2024 • arXiv:2405.00987
16 citations
#4733

How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval?

Subhadeep Koley, Ayan Kumar Bhunia, Aneeshan Sain et al.

CVPR 2024 • arXiv:2403.07203
16 citations
#4734

Iterated Learning Improves Compositionality in Large Vision-Language Models

Chenhao Zheng, Jieyu Zhang, Aniruddha Kembhavi et al.

CVPR 2024 • arXiv:2404.02145
16 citations
#4735

Frozen Feature Augmentation for Few-Shot Image Classification

Andreas Bär, Neil Houlsby, Mostafa Dehghani et al.

CVPR 2024 • arXiv:2403.10519
16 citations
#4736

SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis

Hanrong Ye, Jason Wen Yong Kuen, Qing Liu et al.

ECCV 2024 • arXiv:2311.03355
16 citations
#4737

Revisiting Deep Audio-Text Retrieval Through the Lens of Transportation

Tien Manh Luong, Khai Nguyen, Nhat Ho et al.

ICLR 2024 • arXiv:2405.10084
16 citations
#4738

Efficient Meshflow and Optical Flow Estimation from Event Cameras

Xinglong Luo, Ao Luo, Zhengning Wang et al.

CVPR 2024
16 citations
#4739

Self-Interpretable Graph Learning with Sufficient and Necessary Explanations

Jiale Deng, Yanyan Shen

AAAI 2024
16 citations
#4740

Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection

Taeheon Kim, Sebin Shin, Youngjoon Yu et al.

CVPR 2024 • arXiv:2403.01300
16 citations
#4741

Quantum Implicit Neural Representations

Jiaming Zhao, Wenbo Qiao, Peng Zhang et al.

ICML 2024 • arXiv:2406.03873
16 citations
#4742

Binarized Low-light Raw Video Enhancement

Gengchen Zhang, Yulun Zhang, Xin Yuan et al.

CVPR 2024 • arXiv:2403.19944
16 citations
#4743

Learning Optimal Advantage from Preferences and Mistaking It for Reward

W Bradley Knox, Stephane Hatgis-Kessell, Sigurdur Orn Adalgeirsson et al.

AAAI 2024 • arXiv:2310.02456
16 citations
#4744

VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling

Siyuan Li, Zedong Wang, Zicheng Liu et al.

ICML 2024 • arXiv:2405.10812
16 citations
#4745

En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data

Yifang Men, Biwen Lei, Yuan Yao et al.

CVPR 2024 • arXiv:2401.01173
15 citations
#4746

Reinforcement Learning Friendly Vision-Language Model for Minecraft

Haobin Jiang, Junpeng Yue, Hao Luo et al.

ECCV 2024 • arXiv:2303.10571
15 citations
#4747

A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing

Maomao Li, Yu Li, Tianyu Yang et al.

CVPR 2024 • arXiv:2312.05856
15 citations
#4748

BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation

Yunhao Ge, Yihe Tang, Jiashu Xu et al.

CVPR 2024 • highlight • arXiv:2405.09546
15 citations
#4749

Neural Clustering based Visual Representation Learning

Guikun Chen, Xia Li, Yi Yang et al.

CVPR 2024 • arXiv:2403.17409
15 citations
#4750

A Noisy Elephant in the Room: Is Your Out-of-Distribution Detector Robust to Label Noise?

Galadrielle Humblot-Renaux, Sergio Escalera, Thomas B. Moeslund

CVPR 2024 • arXiv:2404.01775
15 citations
#4751

Context-Guided Diffusion for Out-of-Distribution Molecular and Protein Design

Leo Klarner, Tim G. J. Rudner, Garrett Morris et al.

ICML 2024 • arXiv:2407.11942
15 citations
#4752

Diffusion in Diffusion: Cyclic One-Way Diffusion for Text-Vision-Conditioned Generation

Ruoyu Wang, Yongqi Yang, Zhihao Qian et al.

ICLR 2024 • arXiv:2306.08247
15 citations
#4753

SAFNet: Selective Alignment Fusion Network for Efficient HDR Imaging

Lingtong Kong, Bo Li, Yike Xiong et al.

ECCV 2024 • arXiv:2407.16308
15 citations
#4754

Borda Regret Minimization for Generalized Linear Dueling Bandits

Yue Wu, Tao Jin, Qiwei Di et al.

ICML 2024 • arXiv:2303.08816
15 citations
#4755

Hebbian Learning based Orthogonal Projection for Continual Learning of Spiking Neural Networks

Mingqing Xiao, Qingyan Meng, Zongpeng Zhang et al.

ICLR 2024 • arXiv:2402.11984
15 citations
#4756

On the Markov Property of Neural Algorithmic Reasoning: Analyses and Methods

Montgomery Bohde, Meng Liu, Alexandra Saxton et al.

ICLR 2024 • spotlight • arXiv:2403.04929
15 citations
#4757

Unlocking the Power of Open Set: A New Perspective for Open-Set Noisy Label Learning

Wenhai Wan, Shao-Yuan Li, Xinrui Wang et al.

AAAI 2024 • arXiv:2305.04203
15 citations
#4758

Learning Camouflaged Object Detection from Noisy Pseudo Label

Jin Zhang, Ruiheng Zhang, Yanjiao Shi et al.

ECCV 2024 • arXiv:2407.13157
15 citations
#4759

HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning

Shengchao Hu, Ziqing Fan, Li Shen et al.

ICML 2024 • arXiv:2405.18080
15 citations
#4760

UniProcessor: A Text-induced Unified Low-level Image Processor

Huiyu Duan, Xiongkuo Min, Sijing Wu et al.

ECCV 2024 • arXiv:2407.20928
15 citations
#4761

Generative Unlearning for Any Identity

Juwon Seo, Sung-Hoon Lee, Tae-Young Lee et al.

CVPR 2024 • arXiv:2405.09879
15 citations
#4762

Complementing Event Streams and RGB Frames for Hand Mesh Reconstruction

Jianping Jiang, Xinyu Zhou, Bingxuan Wang et al.

CVPR 2024 • arXiv:2403.07346
15 citations
#4763

Table of Contents

Pengfei Hu, Zhenrong Zhang, Jianshu Zhang et al.

AAAI 2024 • arXiv:2212.02896
15 citations
#4764

VEON: Vocabulary-Enhanced Occupancy Prediction

Jilai Zheng, Pin Tang, Zhongdao Wang et al.

ECCV 2024 • arXiv:2407.12294
15 citations
#4765

Dynamic Cues-Assisted Transformer for Robust Point Cloud Registration

Hong Chen, Pei Yan, Sihe Xiang et al.

CVPR 2024 • highlight
15 citations
#4766

A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting

Wouter Van Gansbeke, Bert De Brabandere

ECCV 2024 • arXiv:2401.10227
15 citations
#4767

PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts

Bang An, Sicheng Zhu, Michael-Andrei Panaitescu-Liess et al.

ICLR 2024 • arXiv:2308.01313
15 citations
#4768

ESM All-Atom: Multi-Scale Protein Language Model for Unified Molecular Modeling

Kangjie Zheng, Siyu Long, Tianyu Lu et al.

ICML 2024 • arXiv:2403.12995
15 citations
#4769

MH-pFLID: Model Heterogeneous personalized Federated Learning via Injection and Distillation for Medical Data Analysis

Luyuan Xie, Manqing Lin, Tianyu Luan et al.

ICML 2024 • arXiv:2405.06822
15 citations
#4770

Accelerating Image Generation with Sub-path Linear Approximation Model

Chen Xu, Tianhui Song, Weixin Feng et al.

ECCV 2024 • arXiv:2404.13903
15 citations
#4771

The Need for Speed: Pruning Transformers with One Recipe

Samir Khaki, Konstantinos Plataniotis

ICLR 2024 • arXiv:2403.17921
15 citations
#4772

Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts

Shuangkang Fang, Yufeng Wang, Yi-Hsuan Tsai et al.

ECCV 2024 • arXiv:2407.06842
15 citations
#4773

MeshSegmenter: Zero-Shot Mesh Segmentation via Texture Synthesis

Ziming Zhong, Yanyu Xu, Jing Li et al.

ECCV 2024
15 citations
#4774

CREAD: A Classification-Restoration Framework with Error Adaptive Discretization for Watch Time Prediction in Video Recommender Systems

Jie Sun, Zhao Ying Ding, Xiaoshuang Chen et al.

AAAI 2024 • arXiv:2401.07521
15 citations
#4775

CLIP-Guided Generative Networks for Transferable Targeted Adversarial Attacks

Hao Fang, Jiawei Kong, Bin Chen et al.

ECCV 2024 • arXiv:2407.10179
15 citations
#4776

PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs

Michael Dorkenwald, Nimrod Barazani, Cees G. M. Snoek et al.

CVPR 2024 • arXiv:2402.08657
15 citations
#4777

Partial-to-Partial Shape Matching with Geometric Consistency

Viktoria Ehm, Maolin Gao, Paul Roetzer et al.

CVPR 2024 • arXiv:2404.12209
15 citations
#4778

Memory-Consistent Neural Networks for Imitation Learning

Kaustubh Sridhar, Souradeep Dutta, Dinesh Jayaraman et al.

ICLR 2024 • arXiv:2310.06171
15 citations
#4779

Arithmetic Feature Interaction Is Necessary for Deep Tabular Learning

Yi Cheng, Renjun Hu, Haochao Ying et al.

AAAI 2024 • arXiv:2402.02334
15 citations
#4780

Learning to Make Adherence-aware Advice

Guanting Chen, Xiaocheng Li, Chunlin Sun et al.

ICLR 2024 • arXiv:2310.00817
15 citations
#4781

Simplicial Representation Learning with Neural $k$-Forms

Kelly Maggs, Celia Hacker, Bastian Rieck

ICLR 2024 • arXiv:2312.08515
15 citations
#4782

Cross-Gate MLP with Protein Complex Invariant Embedding Is a One-Shot Antibody Designer

Cheng Tan, Zhangyang Gao, Lirong Wu et al.

AAAI 2024 • arXiv:2305.09480
15 citations
#4783

Maximum Likelihood Estimation is All You Need for Well-Specified Covariate Shift

Jiawei Ge, Shange Tang, Jianqing Fan et al.

ICLR 2024 • arXiv:2311.15961
15 citations
#4784

HiLo: Detailed and Robust 3D Clothed Human Reconstruction with High- and Low-Frequency Information of Parametric Models

Yifan Yang, Dong Liu, Shuhai Zhang et al.

CVPR 2024 • arXiv:2404.04876
15 citations
#4785

Interpretable Deep Clustering for Tabular Data

Jonathan Svirsky, Ofir Lindenbaum

ICML 2024 • arXiv:2306.04785
15 citations
#4786

Customization Assistant for Text-to-Image Generation

Yufan Zhou, Ruiyi Zhang, Jiuxiang Gu et al.

CVPR 2024 • arXiv:2312.03045
15 citations
#4787

UFORecon: Generalizable Sparse-View Surface Reconstruction from Arbitrary and Unfavorable Sets

Youngju Na, Woo Jae Kim, Kyu Han et al.

CVPR 2024 • arXiv:2403.05086
15 citations
#4788

Differentiable Euler Characteristic Transforms for Shape Classification

Ernst Roell, Bastian Rieck

ICLR 2024 • arXiv:2310.07630
15 citations
#4789

Towards Robust and Efficient Cloud-Edge Elastic Model Adaptation via Selective Entropy Distillation

Yaofo Chen, Shuaicheng Niu, Yaowei Wang et al.

ICLR 2024 • arXiv:2402.17316
15 citations
#4790

ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing

Jun-Kun Chen, Samuel Rota Bulò, Norman Müller et al.

CVPR 2024 • arXiv:2406.09404
15 citations
#4791

SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation

Lingchen Meng, Shiyi Lan, Hengduo Li et al.

ECCV 2024 • arXiv:2311.14671
15 citations
#4792

Learning Linear Block Error Correction Codes

Yoni Choukroun, Lior Wolf

ICML 2024 • arXiv:2405.04050
15 citations
#4793

Dissecting Sample Hardness: A Fine-Grained Analysis of Hardness Characterization Methods for Data-Centric AI

Nabeel Seedat, Fergus Imrie, Mihaela van der Schaar

ICLR 2024 • arXiv:2403.04551
15 citations
#4794

Graph Neural PDE Solvers with Conservation and Similarity-Equivariance

Masanobu Horie, Naoto Mitsume

ICML 2024 • arXiv:2405.16183
15 citations
#4795

Instance-Aware Group Quantization for Vision Transformers

Jaehyeon Moon, Dohyung Kim, Jun Yong Cheon et al.

CVPR 2024 • arXiv:2404.00928
15 citations
#4796

Revisit and Outstrip Entity Alignment: A Perspective of Generative Models

Lingbing Guo, Zhuo Chen, Jiaoyan Chen et al.

ICLR 2024 • arXiv:2305.14651
15 citations
#4797

A Unified Framework for Microscopy Defocus Deblur with Multi-Pyramid Transformer and Contrastive Learning

Yuelin Zhang, Pengyu Zheng, Wanquan Yan et al.

CVPR 2024 • arXiv:2403.02611
15 citations
#4798

The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?

Qinyu Zhao, Ming Xu, Kartik Gupta et al.

ECCV 2024 • arXiv:2403.09037
15 citations
#4799

Improving Text-guided Object Inpainting with Semantic Pre-inpainting

Yifu Chen, Jingwen Chen, Yingwei Pan et al.

ECCV 2024 • arXiv:2409.08260
15 citations
#4800

Data-Free Generalized Zero-Shot Learning

Bowen Tang, Jing Zhang, Yan Long et al.

AAAI 2024 • arXiv:2401.15657
15 citations