Most Cited 2024 "facial action unit detection" Papers

12,324 papers found • Page 28 of 62

#5401

Improving Neural Additive Models with Bayesian Principles

Kouroche Bouchiat, Alexander Immer, Hugo Yèche et al.

ICML 2024arXiv:2305.16905
13
citations
#5402

How to Train the Teacher Model for Effective Knowledge Distillation

Shayan Mohajer Hamidi, Xizhen Deng, Renhao Tan et al.

ECCV 2024arXiv:2407.18041
13
citations
#5403

Approval-Based Committee Voting in Practice: A Case Study of (over-)Representation in the Polkadot Blockchain

Niclas Boehmer, Markus Brill, Alfonso Cevallos et al.

AAAI 2024paperarXiv:2312.11408
13
citations
#5404

Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting

xinlu zhang, Shiyang Li, Xianjun Yang et al.

ICLR 2024arXiv:2305.12723
13
citations
#5405

Real-World Mobile Image Denoising Dataset with Efficient Baselines

Roman Flepp, Andrey Ignatov, Radu Timofte et al.

CVPR 2024
13
citations
#5406

FedLoGe: Joint Local and Generic Federated Learning under Long-tailed Data

Zikai Xiao, Zihan Chen, Liyinglan Liu et al.

ICLR 2024arXiv:2401.08977
13
citations
#5407

Reward-Free Curricula for Training Robust World Models

Marc Rigter, Minqi Jiang, Ingmar Posner

ICLR 2024arXiv:2306.09205
13
citations
#5408

EvTexture: Event-driven Texture Enhancement for Video Super-Resolution

Dachun Kai, Jiayao Lu, Yueyi Zhang et al.

ICML 2024oralarXiv:2406.13457
13
citations
#5409

SocialCVAE: Predicting Pedestrian Trajectory via Interaction Conditioned Latents

Wei Xiang, Haoteng YIN, He Wang et al.

AAAI 2024paperarXiv:2402.17339
13
citations
#5410

Towards More Unified In-context Visual Understanding

Dianmo Sheng, Dongdong Chen, Zhentao Tan et al.

CVPR 2024arXiv:2312.02520
13
citations
#5411

Efficient Conditional Diffusion Model with Probability Flow Sampling for Image Super-resolution

Yutao Yuan, Chun Yuan

AAAI 2024paperarXiv:2404.10688
13
citations
#5412

Align and Aggregate: Compositional Reasoning with Video Alignment and Answer Aggregation for Video Question-Answering

Zhaohe Liao, Jiangtong Li, Li Niu et al.

CVPR 2024arXiv:2407.03008
13
citations
#5413

MovingParts: Motion-based 3D Part Discovery in Dynamic Radiance Field

Kaizhi Yang, Xiaoshuai Zhang, Zhiao Huang et al.

ICLR 2024spotlightarXiv:2303.05703
13
citations
#5414

PRES: Toward Scalable Memory-Based Dynamic Graph Neural Networks

Junwei Su, Difan Zou, Chuan Wu

ICLR 2024oralarXiv:2402.04284
13
citations
#5415

Realistic Unsupervised CLIP Fine-tuning with Universal Entropy Optimization

Jian Liang, Sheng, Zhengbo Wang et al.

ICML 2024spotlightarXiv:2308.12919
13
citations
#5416

Language-Guided Transformer for Federated Multi-Label Classification

I-Jieh Liu, Ci-Siang Lin, Fu-En Yang et al.

AAAI 2024paperarXiv:2312.07165
13
citations
#5417

Federated Online Adaptation for Deep Stereo

Matteo Poggi, Fabio Tosi

CVPR 2024arXiv:2405.14873
13
citations
#5418

CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner

Tingbing Yan, Wenzheng Zeng, Yang Xiao et al.

ECCV 2024arXiv:2403.10082
13
citations
#5419

PAC Prediction Sets Under Label Shift

Wenwen Si, Sangdon Park, Insup Lee et al.

ICLR 2024arXiv:2310.12964
13
citations
#5420

Retro-fallback: retrosynthetic planning in an uncertain world

Austin Tripp, Krzysztof Maziarz, Sarah Lewis et al.

ICLR 2024arXiv:2310.09270
13
citations
#5421

CHAI: Clustered Head Attention for Efficient LLM Inference

Saurabh Agarwal, Bilge Acun, Basil Hosmer et al.

ICML 2024arXiv:2403.08058
13
citations
#5422

Geometrically-driven Aggregation for Zero-shot 3D Point Cloud Understanding

Guofeng Mei, Luigi Riz, Yiming Wang et al.

CVPR 2024highlightarXiv:2312.02244
12
citations
#5423

OODRobustBench: a Benchmark and Large-Scale Analysis of Adversarial Robustness under Distribution Shift

Lin Li, Yifei Wang, Chawin Sitawarin et al.

ICML 2024arXiv:2310.12793
12
citations
#5424

FedBAT: Communication-Efficient Federated Learning via Learnable Binarization

Shiwei Li, Wenchao Xu, Haozhao Wang et al.

ICML 2024arXiv:2408.03215
12
citations
#5425

Multi-Sentence Grounding for Long-term Instructional Video

Zeqian Li, QIRUI CHEN, Tengda Han et al.

ECCV 2024arXiv:2312.14055
12
citations
#5426

Neural Atoms: Propagating Long-range Interaction in Molecular Graphs through Efficient Communication Channel

Xuan Li, Zhanke Zhou, Jiangchao Yao et al.

ICLR 2024arXiv:2311.01276
12
citations
#5427

D3: A Methodological Exploration of Domain Division, Modeling, and Balance in Multi-Domain Recommendations

Pengyue Jia, Yichao Wang, Shanru LIN et al.

AAAI 2024paper
12
citations
#5428

RNb-NeuS: Reflectance and Normal-based Multi-View 3D Reconstruction

Baptiste Brument, Robin Bruneau, Yvain Queau et al.

CVPR 2024arXiv:2312.01215
12
citations
#5429

Task-Disruptive Background Suppression for Few-Shot Segmentation

Suho Park, SuBeen Lee, Sangeek Hyun et al.

AAAI 2024paperarXiv:2312.15894
12
citations
#5430

Learning Latent Dynamic Robust Representations for World Models

Ruixiang Sun, Hongyu Zang, Xin Li et al.

ICML 2024oralarXiv:2405.06263
12
citations
#5431

Mitigating Noisy Correspondence by Geometrical Structure Consistency Learning

Zihua Zhao, Mengxi Chen, Tianjie Dai et al.

CVPR 2024arXiv:2405.16996
12
citations
#5432

LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection

Sanmin Kim, Youngseok Kim, Sihwan Hwang et al.

ECCV 2024arXiv:2407.10164
12
citations
#5433

Neural Collapse in Multi-label Learning with Pick-all-label Loss

Pengyu Li, Xiao Li, Yutong Wang et al.

ICML 2024arXiv:2310.15903
12
citations
#5434

Optimal Sample Complexity of Contrastive Learning

Noga Alon, Dmitrii Avdiukhin, Dor Elboim et al.

ICLR 2024spotlightarXiv:2312.00379
12
citations
#5435

DeCoOp: Robust Prompt Tuning with Out-of-Distribution Detection

Zhi Zhou, Ming Yang, Jiang-Xin Shi et al.

ICML 2024arXiv:2406.00345
12
citations
#5436

SoundingActions: Learning How Actions Sound from Narrated Egocentric Videos

Changan Chen, Kumar Ashutosh, Rohit Girdhar et al.

CVPR 2024arXiv:2404.05206
12
citations
#5437

Adapting Fine-Grained Cross-View Localization to Areas without Fine Ground Truth

Zimin Xia, Yujiao Shi, HONGDONG LI et al.

ECCV 2024arXiv:2406.00474
12
citations
#5438

Decouple Content and Motion for Conditional Image-to-Video Generation

Cuifeng Shen, Yulu Gan, Chen Chen et al.

AAAI 2024paperarXiv:2311.14294
12
citations
#5439

SNeRV: Spectra-preserving Neural Representation for Video

Jina Kim, Jihoo Lee, Jewon Kang

ECCV 2024arXiv:2501.01681
12
citations
#5440

Stable Anisotropic Regularization

William Rudman, Carsten Eickhoff

ICLR 2024arXiv:2305.19358
12
citations
#5441

EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens

Sunil Hwang, Jaehong Yoon, Youngwan Lee et al.

ICML 2024oralarXiv:2211.10636
12
citations
#5442

Generalized Planning for the Abstraction and Reasoning Corpus

Chao Lei, Nir Lipovetzky, Krista A. Ehinger

AAAI 2024paperarXiv:2401.07426
12
citations
#5443

Bring Event into RGB and LiDAR: Hierarchical Visual-Motion Fusion for Scene Flow

Hanyu Zhou, Yi Chang, Zhiwei Shi

CVPR 2024arXiv:2403.07432
12
citations
#5444

Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics

Luca Grillotti, Maxence Faldor, Borja G. León et al.

ICML 2024arXiv:2403.09930
12
citations
#5445

Refining Latent Homophilic Structures over Heterophilic Graphs for Robust Graph Convolution Networks

Chenyang Qiu, Guoshun Nan, Tianyu Xiong et al.

AAAI 2024paperarXiv:2312.16418
12
citations
#5446

DAFA: Distance-Aware Fair Adversarial Training

Hyungyu Lee, Saehyung Lee, Hyemi Jang et al.

ICLR 2024arXiv:2401.12532
12
citations
#5447

Conformalized Adaptive Forecasting of Heterogeneous Trajectories

Yanfei Zhou, Lars Lindemann, Matteo Sesia

ICML 2024arXiv:2402.09623
12
citations
#5448

Backdoor Contrastive Learning via Bi-level Trigger Optimization

Weiyu Sun, Xinyu Zhang, Hao LU et al.

ICLR 2024arXiv:2404.07863
12
citations
#5449

DiffFAS: Face Anti-Spoofing via Generative Diffusion Models

Xinxu Ge, Xin Liu, Zitong Yu et al.

ECCV 2024arXiv:2409.08572
12
citations
#5450

S3O: A Dual-Phase Approach for Reconstructing Dynamic Shape and Skeleton of Articulated Objects from Single Monocular Video

Hao Zhang, Fang Li, Samyak Rawlekar et al.

ICML 2024arXiv:2405.12607
12
citations
#5451

EquiAV: Leveraging Equivariance for Audio-Visual Contrastive Learning

Jongsuk Kim, Hyeongkeun Lee, Kyeongha Rho et al.

ICML 2024arXiv:2403.09502
12
citations
#5452

Physical-Based Event Camera Simulator

Haiqian Han, Jiacheng Lyu, Jianing Li et al.

ECCV 2024
12
citations
#5453

Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis

Zicheng Zhang, RUOBING ZHENG, Bonan Li et al.

CVPR 2024arXiv:2402.17364
12
citations
#5454

Replicable Learning of Large-Margin Halfspaces

Alkis Kalavasis, Amin Karbasi, Kasper Green Larsen et al.

ICML 2024spotlightarXiv:2402.13857
12
citations
#5455

Discounted Adaptive Online Learning: Towards Better Regularization

Zhiyu Zhang, David Bombara, Heng Yang

ICML 2024arXiv:2402.02720
12
citations
#5456

DUPLEX: Dual GAT for Complex Embedding of Directed Graphs

Zhaoru Ke, Hang Yu, Jianguo Li et al.

ICML 2024arXiv:2406.05391
12
citations
#5457

Robustly Learning Single-Index Models via Alignment Sharpness

Nikos Zarifis, Puqian Wang, Ilias Diakonikolas et al.

ICML 2024arXiv:2402.17756
12
citations
#5458

FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior

Zhekai Chen, Wen Wang, Zhen Yang et al.

ECCV 2024arXiv:2407.04947
12
citations
#5459

11293 Cross-Class Feature Augmentation for Class Incremental Learning

Taehoon Kim, JaeYoo Park, Bohyung Han

AAAI 2024paper
12
citations
#5460

Illusory Attacks: Information-theoretic detectability matters in adversarial attacks

Tim Franzmeyer, Stephen McAleer, Joao F. Henriques et al.

ICLR 2024spotlightarXiv:2207.10170
12
citations
#5461

RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos

Tanveer Hannan, Mohaiminul Islam, Thomas Seidl et al.

ECCV 2024arXiv:2312.06729
12
citations
#5462

Deep Copula-Based Survival Analysis for Dependent Censoring with Identifiability Guarantees

Weijia Zhang, Chun Kai Ling, Xuanhui Zhang

AAAI 2024paperarXiv:2312.15566
12
citations
#5463

SEAL: A Framework for Systematic Evaluation of Real-World Super-Resolution

Wenlong Zhang, Xiaohui Li, Xiangyu Chen et al.

ICLR 2024spotlightarXiv:2309.03020
12
citations
#5464

UpFusion: Novel View Diffusion from Unposed Sparse View Observations

Bharath Raj Nagoor Kani, Hsin-Ying Lee, Sergey Tulyakov et al.

ECCV 2024arXiv:2312.06661
12
citations
#5465

VQAttack: Transferable Adversarial Attacks on Visual Question Answering via Pre-trained Models

Ziyi Yin, Muchao Ye, Tianrong Zhang et al.

AAAI 2024paperarXiv:2402.11083
12
citations
#5466

Retrieval is Accurate Generation

Bowen Cao, Deng Cai, Leyang Cui et al.

ICLR 2024arXiv:2402.17532
12
citations
#5467

Discriminative Sample-Guided and Parameter-Efficient Feature Space Adaptation for Cross-Domain Few-Shot Learning

Rashindrie Perera, Saman Halgamuge

CVPR 2024arXiv:2403.04492
12
citations
#5468

USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields

Moyang Li, Peng Wang, Lingzhe Zhao et al.

ICLR 2024arXiv:2310.02687
12
citations
#5469

Exploiting Code Symmetries for Learning Program Semantics

Kexin Pei, Weichen Li, Qirui Jin et al.

ICML 2024spotlightarXiv:2308.03312
12
citations
#5470

Balancing Similarity and Complementarity for Federated Learning

Kunda Yan, Sen Cui, Abudukelimu Wuerkaixi et al.

ICML 2024arXiv:2405.09892
12
citations
#5471

A Space Group Symmetry Informed Network for O(3) Equivariant Crystal Tensor Prediction

Keqiang Yan, Alexandra Saxton, Xiaofeng Qian et al.

ICML 2024arXiv:2406.12888
12
citations
#5472

QLABGrad: A Hyperparameter-Free and Convergence-Guaranteed Scheme for Deep Learning

Fang-Xiang Wu, Minghan Fu

AAAI 2024paperarXiv:2302.00252
12
citations
#5473

Linear Log-Normal Attention with Unbiased Concentration

Yury Nahshan, Joseph Kampeas, Emir Haleva

ICLR 2024arXiv:2311.13541
12
citations
#5474

Real Appearance Modeling for More General Deepfake Detection

Jiahe Tian, Yu Cai, Xi Wang et al.

ECCV 2024
12
citations
#5475

PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance

Aoming Liu, Zhong Li, Zhang Chen et al.

ECCV 2024arXiv:2408.02157
12
citations
#5476

CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments

Xiulong Liu, Sudipta Paul, Moitreya Chatterjee et al.

AAAI 2024paperarXiv:2306.04047
12
citations
#5477

Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation

Zhuohang Dang, Minnan Luo, Chengyou Jia et al.

AAAI 2024paperarXiv:2312.16478
12
citations
#5478

Realistic Human Motion Generation with Cross-Diffusion Models

Zeping Ren, Shaoli Huang, Xiu Li

ECCV 2024arXiv:2312.10993
12
citations
#5479

Out-of-Distribution Detection via Deep Multi-Comprehension Ensemble

Chenhui Xu, Fuxun Yu, Zirui Xu et al.

ICML 2024arXiv:2403.16260
12
citations
#5480

Unifying Feature and Cost Aggregation with Transformers for Semantic and Visual Correspondence

Sunghwan Hong, Seokju Cho, Seungryong Kim et al.

ICLR 2024arXiv:2403.11120
12
citations
#5481

Benchmarking Spurious Bias in Few-Shot Image Classifiers

Guangtao Zheng, Wenqian Ye, Aidong Zhang

ECCV 2024arXiv:2409.02882
12
citations
#5482

Temporally Consistent Stereo Matching

Jiaxi Zeng, Chengtang Yao, Yuwei Wu et al.

ECCV 2024arXiv:2407.11950
12
citations
#5483

Seeing the Unseen: Visual Common Sense for Semantic Placement

Ram Ramrakhya, Aniruddha Kembhavi, Dhruv Batra et al.

CVPR 2024arXiv:2401.07770
12
citations
#5484

CarFormer: Self-Driving with Learned Object-Centric Representations

Shadi Hamdan, Fatma Guney

ECCV 2024arXiv:2407.15843
12
citations
#5485

OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection

Jinghua Hou, Tong Wang, Xiaoqing Ye et al.

ECCV 2024arXiv:2407.10753
12
citations
#5486

P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering

Chuyu Zhang, Hui Ren, Xuming He

ICLR 2024arXiv:2401.09266
12
citations
#5487

Rethinking Mesh Watermark: Towards Highly Robust and Adaptable Deep 3D Mesh Watermarking

Xingyu Zhu, Guanhui Ye, Xiapu Luo et al.

AAAI 2024paperarXiv:2307.11628
12
citations
#5488

Deciphering the Role of Representation Disentanglement: Investigating Compositional Generalization in CLIP Models

Reza Abbasi, Mohammad Rohban, Mahdieh Soleymani Baghshah

ECCV 2024arXiv:2407.05897
12
citations
#5489

Mitigating Background Shift in Class-Incremental Semantic Segmentation

gilhan Park, WonJun Moon, SuBeen Lee et al.

ECCV 2024arXiv:2407.11859
12
citations
#5490

DeTra: A Unified Model for Object Detection and Trajectory Forecasting

Sergio Casas, Ben T Agro, Jiageng Mao et al.

ECCV 2024arXiv:2406.04426
12
citations
#5491

Puff-Net: Efficient Style Transfer with Pure Content and Style Feature Fusion Network

Sizhe Zheng, Pan Gao, Peng Zhou et al.

CVPR 2024arXiv:2405.19775
12
citations
#5492

Learning Task-Aware Language-Image Representation for Class-Incremental Object Detection

Hongquan Zhang, Bin-Bin Gao, Yi Zeng et al.

AAAI 2024paper
12
citations
#5493

Learning Video Context as Interleaved Multimodal Sequences

Qinghong Lin, Pengchuan Zhang, Difei Gao et al.

ECCV 2024arXiv:2407.21757
12
citations
#5494

Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction

Xiaoyang Lyu, Chirui Chang, Peng Dai et al.

CVPR 2024highlightarXiv:2403.19314
12
citations
#5495

Generalizable Fourier Augmentation for Unsupervised Video Object Segmentation

Huihui Song, Tiankang Su, Yuhui Zheng et al.

AAAI 2024paper
12
citations
#5496

Classes Are Not Equal: An Empirical Study on Image Recognition Fairness

Jiequan Cui, Beier Zhu, Xin Wen et al.

CVPR 2024arXiv:2402.18133
12
citations
#5497

AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models

Jiachun Pan, Jiachun Pan, Jun Hao Liew et al.

ICLR 2024arXiv:2307.10711
12
citations
#5498

Emergent Equivariance in Deep Ensembles

Jan Gerken, Pan Kessel

ICML 2024arXiv:2403.03103
12
citations
#5499

Instance Tracking in 3D Scenes from Egocentric Videos

Yunhan Zhao, Haoyu Ma, Shu Kong et al.

CVPR 2024arXiv:2312.04117
12
citations
#5500

Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning

Kyle Hsu, Jubayer Ibn Hamid, Kaylee Burns et al.

ICML 2024arXiv:2404.10282
12
citations
#5501

Learning Useful Representations of Recurrent Neural Network Weight Matrices

Vincent Herrmann, Francesco Faccio, Jürgen Schmidhuber

ICML 2024arXiv:2403.11998
12
citations
#5502

Considering Nonstationary within Multivariate Time Series with Variational Hierarchical Transformer for Forecasting

Muyao Wang, Wenchao Chen, Bo Chen

AAAI 2024paperarXiv:2403.05406
12
citations
#5503

Bidirectional Temporal Plan Graph: Enabling Switchable Passing Orders for More Efficient Multi-Agent Path Finding Plan Execution

Yifan Su, Rishi Veerapaneni, Jiaoyang Li

AAAI 2024paperarXiv:2401.00315
12
citations
#5504

Learning Efficient and Robust Multi-Agent Communication via Graph Information Bottleneck

Shifei Ding, Wei Du, Ling Ding et al.

AAAI 2024paper
12
citations
#5505

PointInfinity: Resolution-Invariant Point Diffusion Models

Zixuan Huang, Justin Johnson, Shoubhik Debnath et al.

CVPR 2024arXiv:2404.03566
12
citations
#5506

Improving Bird's Eye View Semantic Segmentation by Task Decomposition

Tianhao Zhao, Yongcan Chen, Yu Wu et al.

CVPR 2024arXiv:2404.01925
12
citations
#5507

Advancing the Lower Bounds: an Accelerated, Stochastic, Second-order Method with Optimal Adaptation to Inexactness

Artem Agafonov, Dmitry Kamzolov, Alexander Gasnikov et al.

ICLR 2024arXiv:2309.01570
12
citations
#5508

No More Shortcuts: Realizing the Potential of Temporal Self-Supervision

Ishan Rajendrakumar Dave, Simon Jenni, Mubarak Shah

AAAI 2024paperarXiv:2312.13008
12
citations
#5509

Hyperbolic Graph Diffusion Model

Lingfeng Wen, Xuan Tang, Mingjie Ouyang et al.

AAAI 2024paperarXiv:2306.07618
12
citations
#5510

Improving Interpretation Faithfulness for Vision Transformers

Lijie Hu, Yixin Liu, Ninghao Liu et al.

ICML 2024spotlightarXiv:2311.17983
12
citations
#5511

UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt

Xin Li, Bingchen Li, Yeying Jin et al.

ECCV 2024arXiv:2407.13108
12
citations
#5512

Dense Hand-Object(HO) GraspNet with Full Grasping Taxonomy and Dynamics

Woojin Cho, Jihyun Lee, Minjae Yi et al.

ECCV 2024arXiv:2409.04033
12
citations
#5513

Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding

Yichi Zhang, Zhihao Duan, Ming Lu et al.

AAAI 2024paperarXiv:2401.11615
12
citations
#5514

Defense Against Adversarial Attacks on No-Reference Image Quality Models with Gradient Norm Regularization

Yujia Liu, Chenxi Yang, Dingquan Li et al.

CVPR 2024arXiv:2403.11397
12
citations
#5515

Variance Reduced Halpern Iteration for Finite-Sum Monotone Inclusions

Xufeng Cai, Ahmet Alacaoglu, Jelena Diakonikolas

ICLR 2024arXiv:2310.02987
12
citations
#5516

∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions

Minh Quan Le, Alexandros Graikos, Srikar Yellapragada et al.

ECCV 2024arXiv:2407.14709
12
citations
#5517

DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly

Fenggen Yu, Yiming Qian, Xu Zhang et al.

ECCV 2024arXiv:2404.00875
12
citations
#5518

Cauchy-Schwarz Divergence Information Bottleneck for Regression

Shujian Yu, Xi Yu, Sigurd Løkse et al.

ICLR 2024arXiv:2404.17951
12
citations
#5519

3D-Aware Hypothesis & Verification for Generalizable Relative Object Pose Estimation

Chen Zhao, Tong Zhang, Mathieu Salzmann

ICLR 2024arXiv:2310.03534
12
citations
#5520

GRANDE: Gradient-Based Decision Tree Ensembles for Tabular Data

Sascha Marton, Stefan Lüdtke, Christian Bartelt et al.

ICLR 2024arXiv:2309.17130
12
citations
#5521

Learning to Pivot as a Smart Expert

Tianhao Liu, Shanwen Pu, Dongdong Ge et al.

AAAI 2024paperarXiv:2308.08171
12
citations
#5522

D3T: Distinctive Dual-Domain Teacher Zigzagging Across RGB-Thermal Gap for Domain-Adaptive Object Detection

Dinh Phat Do, Taehoon Kim, JAEMIN NA et al.

CVPR 2024arXiv:2403.09359
12
citations
#5523

Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models

Francesco Croce, Naman D. Singh, Matthias Hein

ECCV 2024arXiv:2306.12941
12
citations
#5524

Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models

Seungcheol Park, Hojun Choi, U Kang

ICLR 2024arXiv:2308.03449
12
citations
#5525

CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection

Xunfa Lai, Zhiyu Yang, Jie Hu et al.

ECCV 2024arXiv:2408.08050
12
citations
#5526

Data-efficient Large Vision Models through Sequential Autoregression

Zhiwei Hao, Jianyuan Guo, Chengcheng Wang et al.

ICML 2024arXiv:2402.04841
12
citations
#5527

Closed-Loop Unsupervised Representation Disentanglement with $\beta$-VAE Distillation and Diffusion Probabilistic Feedback

Xin Jin, Bohan Li, Baao Xie et al.

ECCV 2024
12
citations
#5528

Explorative Inbetweening of Time and Space

Haiwen Feng, Zheng Ding, Zhihao Xia et al.

ECCV 2024arXiv:2403.14611
12
citations
#5529

REST: Efficient and Accelerated EEG Seizure Analysis through Residual State Updates

Arshia Afzal, Grigorios Chrysos, Volkan Cevher et al.

ICML 2024oralarXiv:2406.16906
12
citations
#5530

BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues

Sara Sarto, Marcella Cornia, Lorenzo Baraldi et al.

ECCV 2024arXiv:2407.20341
12
citations
#5531

Bridging Vision and Language Spaces with Assignment Prediction

Jungin Park, Jiyoung Lee, Kwanghoon Sohn

ICLR 2024arXiv:2404.09632
12
citations
#5532

Pareto Deep Long-Tailed Recognition: A Conflict-Averse Solution

Zhipeng Zhou, Liu Liu, Peilin Zhao et al.

ICLR 2024oral
12
citations
#5533

Topological Neural Networks go Persistent, Equivariant, and Continuous

Yogesh Verma, Amauri Souza, Vikas Garg

ICML 2024arXiv:2406.03164
12
citations
#5534

Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing

Zizheng Yang, Hu Yu, Bing Li et al.

ECCV 2024arXiv:2509.20091
12
citations
#5535

Constrained Decoding for Cross-lingual Label Projection

Duong Le, Yang Chen, Alan Ritter et al.

ICLR 2024arXiv:2402.03131
12
citations
#5536

Neurosymbolic Grounding for Compositional World Models

Atharva Sehgal, Arya Grayeli, Jennifer Sun et al.

ICLR 2024arXiv:2310.12690
12
citations
#5537

Weakly Supervised Monocular 3D Detection with a Single-View Image

Xueying Jiang, Sheng Jin, Lewei Lu et al.

CVPR 2024arXiv:2402.19144
12
citations
#5538

R-EDL: Relaxing Nonessential Settings of Evidential Deep Learning

Mengyuan Chen, Junyu Gao, Changsheng Xu

ICLR 2024spotlight
12
citations
#5539

Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks

Yixuan Weng, Minjun Zhu, Fei Xia et al.

ICLR 2024arXiv:2304.01665
12
citations
#5540

Jointly Improving the Sample and Communication Complexities in Decentralized Stochastic Minimax Optimization

Xuan Zhang, Gabriel Mancino-Ball, Necdet Serhat Aybat et al.

AAAI 2024paperarXiv:2307.09421
12
citations
#5541

Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sampling

Xinhang Liu, Yu-Wing Tai, Chi-Keung Tang et al.

CVPR 2024highlightarXiv:2406.03723
12
citations
#5542

Improving Group Robustness on Spurious Correlation Requires Preciser Group Inference

Yujin Han, Difan Zou

ICML 2024arXiv:2404.13815
12
citations
#5543

Explaining Graph Neural Networks via Structure-aware Interaction Index

Ngoc Bui, Trung Hieu Nguyen, Viet Anh Nguyen et al.

ICML 2024arXiv:2405.14352
12
citations
#5544

Phoneme Hallucinator: One-Shot Voice Conversion via Set Expansion

Siyuan Shan, Yang Li, Amartya Banerjee et al.

AAAI 2024paperarXiv:2308.06382
12
citations
#5545

A Universal Class of Sharpness-Aware Minimization Algorithms

Behrooz Tahmasebi, Ashkan Soleymani, Dara Bahri et al.

ICML 2024arXiv:2406.03682
12
citations
#5546

Multi-modal Crowd Counting via a Broker Modality

Haoliang Meng, Xiaopeng Hong, Chenhao Wang et al.

ECCV 2024arXiv:2407.07518
12
citations
#5547

The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization

Jiafeng Mao, Xueting Wang, Kiyoharu Aizawa

ECCV 2024arXiv:2312.08872
12
citations
#5548

Modeling and Driving Human Body Soundfields through Acoustic Primitives

Chao Huang, Dejan Markovic, Chenliang Xu et al.

ECCV 2024arXiv:2407.13083
12
citations
#5549

A Plug-and-Play Image Registration Network

JUNHAO HU, Weijie Gan, Zhixin Sun et al.

ICLR 2024arXiv:2310.04297
12
citations
#5550

Sparse is Enough in Fine-tuning Pre-trained Large Language Models

Weixi Song, Zuchao Li, Lefei Zhang et al.

ICML 2024spotlightarXiv:2312.11875
12
citations
#5551

Eclipse: Disambiguating Illumination and Materials using Unintended Shadows

Dor Verbin, Ben Mildenhall, Peter Hedman et al.

CVPR 2024arXiv:2305.16321
12
citations
#5552

Action Detection via an Image Diffusion Process

Lin Geng Foo, Tianjiao Li, Hossein Rahmani et al.

CVPR 2024arXiv:2404.01051
12
citations
#5553

COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation

Liu He, Daniel Aliaga

ECCV 2024arXiv:2407.11294
12
citations
#5554

Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph

Zhengcen Li, Xinle Chang, Yueran Li et al.

ECCV 2024arXiv:2407.19497
12
citations
#5555

Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations

Yongyuan Liang, Yanchao Sun, Ruijie Zheng et al.

ICLR 2024oralarXiv:2307.12062
12
citations
#5556

Image Content Generation with Causal Reasoning

Xiaochuan Li, Baoyu Fan, Run Zhang et al.

AAAI 2024paperarXiv:2312.07132
12
citations
#5557

InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation

Jacob Si, Wendy Yusi Cheng, Michael Cooper et al.

ICML 2024spotlightarXiv:2406.00426
12
citations
#5558

Adaptive Discovering and Merging for Incremental Novel Class Discovery

Guangyao Chen, Peixi Peng, Yangru Huang et al.

AAAI 2024paperarXiv:2403.03382
12
citations
#5559

MultiPhys: Multi-Person Physics-aware 3D Motion Estimation

Nicolás Ugrinovic, Boxiao Pan, Georgios Pavlakos et al.

CVPR 2024arXiv:2404.11987
12
citations
#5560

Discriminative Probing and Tuning for Text-to-Image Generation

Leigang Qu, Wenjie Wang, Yongqi Li et al.

CVPR 2024arXiv:2403.04321
12
citations
#5561

A Simple and Scalable Representation for Graph Generation

Yunhui Jang, Seul Lee, Sungsoo Ahn

ICLR 2024arXiv:2312.02230
12
citations
#5562

Symbolic Regression Enhanced Decision Trees for Classification Tasks

Kei Sen Fong, Mehul Motani

AAAI 2024paper
12
citations
#5563

S²MVTC: a Simple yet Efficient Scalable Multi-View Tensor Clustering

Zhen Long, Qiyuan Wang, Yazhou Ren et al.

CVPR 2024
12
citations
#5564

Improving Robustness for Joint Optimization of Camera Pose and Decomposed Low-Rank Tensorial Radiance Fields

BOYU Chen, Wei-Chen Chiu, Yu-Lun Liu

AAAI 2024paperarXiv:2402.13252
12
citations
#5565

Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning

Jin Hwa Lee, Stefano Mannelli, Andrew Saxe

ICML 2024arXiv:2402.18361
12
citations
#5566

Any-Stereo: Arbitrary Scale Disparity Estimation for Iterative Stereo Matching

Zhaohuai Liang, Changhe Li

AAAI 2024paper
12
citations
#5567

Lewis's Signaling Game as beta-VAE For Natural Word Lengths and Segments

Ryo Ueda, TADAHIRO TANIGUCHI

ICLR 2024arXiv:2311.04453
12
citations
#5568

Asymmetric Masked Distillation for Pre-Training Small Foundation Models

Zhiyu Zhao, Bingkun Huang, Sen Xing et al.

CVPR 2024arXiv:2311.03149
12
citations
#5569

Self-Consistency Training for Density-Functional-Theory Hamiltonian Prediction

He Zhang, Chang Liu, wang et al.

ICML 2024arXiv:2403.09560
12
citations
#5570

Language-Informed Visual Concept Learning

Sharon Lee, Yunzhi Zhang, Shangzhe Wu et al.

ICLR 2024arXiv:2312.03587
12
citations
#5571

Data-Efficient Multimodal Fusion on a Single GPU

Noël Vouitsis, Zhaoyan Liu, Satya Krishna Gorti et al.

CVPR 2024highlightarXiv:2312.10144
12
citations
#5572

Sparse and Structured Hopfield Networks

Saúl Santos, Vlad Niculae, Daniel McNamee et al.

ICML 2024spotlightarXiv:2402.13725
12
citations
#5573

RECOMBINER: Robust and Enhanced Compression with Bayesian Implicit Neural Representations

Jiajun He, Gergely Flamich, Zongyu Guo et al.

ICLR 2024arXiv:2309.17182
12
citations
#5574

Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation

Seonghoon Yu, Paul Hongsuck Seo, Jeany Son

ECCV 2024arXiv:2407.07412
12
citations
#5575

Finding Lottery Tickets in Vision Models via Data-driven Spectral Foresight Pruning

Leonardo Iurada, Marco Ciccone, Tatiana Tommasi

CVPR 2024arXiv:2406.01820
12
citations
#5576

Random Exploration in Bayesian Optimization: Order-Optimal Regret and Computational Efficiency

Sudeep Salgia, Sattar Vakili, Qing Zhao

ICML 2024arXiv:2310.15351
12
citations
#5577

Double-Layer Hybrid-Label Identification Feature Selection for Multi-View Multi-Label Learning

Pingting Hao, Kunpeng Liu, Wanfu Gao

AAAI 2024paper
12
citations
#5578

Minimum-Norm Interpolation Under Covariate Shift

Neil Mallinar, Austin Zane, Spencer Frei et al.

ICML 2024arXiv:2404.00522
12
citations
#5579

Transformer as Linear Expansion of Learngene

Shiyu Xia, Miaosen Zhang, Xu Yang et al.

AAAI 2024paperarXiv:2312.05614
12
citations
#5580

ZeroFlow: Scalable Scene Flow via Distillation

Kyle Vedder, Neehar Peri, Nathaniel Chodosh et al.

ICLR 2024oralarXiv:2305.10424
12
citations
#5581

A Generalized Shuffle Framework for Privacy Amplification: Strengthening Privacy Guarantees and Enhancing Utility

Chen E, Yang Cao, Ge Yifei

AAAI 2024paperarXiv:2312.14388
12
citations
#5582

Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models

Pengze Zhang, Hubery Yin, Chen Li et al.

CVPR 2024highlightarXiv:2403.08381
12
citations
#5583

OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising

Haichao Zhang, Yi Xu, Hongsheng Lu et al.

CVPR 2024arXiv:2404.02227
12
citations
#5584

FocusMAE: Gallbladder Cancer Detection from Ultrasound Videos with Focused Masked Autoencoders

Soumen Basu, Mayuna Gupta, Chetan Madan et al.

CVPR 2024arXiv:2403.08848
12
citations
#5585

Neural Implicit Morphing of Face Images

Guilherme Schardong, Tiago Novello, Hallison Paz et al.

CVPR 2024arXiv:2308.13888
12
citations
#5586

Point2SSM: Learning Morphological Variations of Anatomies from Point Clouds

Jadie Adams, Shireen Elhabian

ICLR 2024spotlightarXiv:2305.14486
12
citations
#5587

One Step Closer to Unbiased Aleatoric Uncertainty Estimation

Wang Zhang, Ziwen Martin Ma, Subhro Das et al.

AAAI 2024paperarXiv:2312.10469
12
citations
#5588

PANDA: Expanded Width-Aware Message Passing Beyond Rewiring

Jeongwhan Choi, Sumin Parksumin, Hyowon Wi et al.

ICML 2024arXiv:2406.03671
12
citations
#5589

MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation

Linyan Yang, Lukas Hoyer, Mark Weber et al.

ECCV 2024arXiv:2408.16478
12
citations
#5590

MGNet: Learning Correspondences via Multiple Graphs

Dai Luanyuan, Xiaoyu Du, Hanwang Zhang et al.

AAAI 2024paperarXiv:2401.04984
12
citations
#5591

Mechanistic Neural Networks for Scientific Machine Learning

Adeel Pervez, Francesco Locatello, Efstratios Gavves

ICML 2024arXiv:2402.13077
12
citations
#5592

DemoCaricature: Democratising Caricature Generation with a Rough Sketch

Dar-Yen Chen, Ayan Kumar Bhunia, Subhadeep Koley et al.

CVPR 2024arXiv:2312.04364
12
citations
#5593

Abstractors and relational cross-attention: An inductive bias for explicit relational reasoning in Transformers

Awni Altabaa, Taylor Webb, Jonathan Cohen et al.

ICLR 2024arXiv:2304.00195
12
citations
#5594

STARC: A General Framework For Quantifying Differences Between Reward Functions

Joar Skalse, Lucy Farnik, Sumeet Motwani et al.

ICLR 2024arXiv:2309.15257
12
citations
#5595

Prompt Risk Control: A Rigorous Framework for Responsible Deployment of Large Language Models

Thomas Zollo, Todd Morrill, Zhun Deng et al.

ICLR 2024arXiv:2311.13628
12
citations
#5596

Improving Token-Based World Models with Parallel Observation Prediction

Lior Cohen, Kaixin Wang, Bingyi Kang et al.

ICML 2024arXiv:2402.05643
12
citations
#5597

Image Neural Field Diffusion Models

Yinbo Chen, Oliver Wang, Richard Zhang et al.

CVPR 2024highlightarXiv:2406.07480
12
citations
#5598

Efficient Integrators for Diffusion Generative Models

Kushagra Pandey, Maja Rudolph, Stephan Mandt

ICLR 2024arXiv:2310.07894
12
citations
#5599

Data-Efficient Unsupervised Interpolation Without Any Intermediate Frame for 4D Medical Images

JungEun Kim, Hangyul Yoon, Geondo Park et al.

CVPR 2024arXiv:2404.01464
12
citations
#5600

Federated Modality-Specific Encoders and Multimodal Anchors for Personalized Brain Tumor Segmentation

AAAI 2024paperarXiv:2403.11803
12
citations