Most Cited ICLR "facial behavior generation" Papers

6,124 papers found • Page 20 of 31

#3801

Denoising with a Joint-Embedding Predictive Architecture

Chen Dengsheng, Jie Hu, Xiaoming Wei et al.

ICLR 2025arXiv:2410.03755
5
citations
#3802

Unlocking Point Processes through Point Set Diffusion

David Lüdke, Enric Rabasseda Raventós, Marcel Kollovieh et al.

ICLR 2025oralarXiv:2410.22493
5
citations
#3803

Learning Spatial-Semantic Features for Robust Video Object Segmentation

Xin Li, Deshui Miao, Zhenyu He et al.

ICLR 2025arXiv:2407.07760
5
citations
#3804

Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation

Yi-Chen Li, Fuxiang Zhang, Wenjie Qiu et al.

ICLR 2025arXiv:2407.03856
5
citations
#3805

NoVo: Norm Voting off Hallucinations with Attention Heads in Large Language Models

Zhengyi Ho, Siyuan Liang, Sen Zhang et al.

ICLR 2025arXiv:2410.08970
5
citations
#3806

CryoFM: A Flow-based Foundation Model for Cryo-EM Densities

Yi Zhou, Yilai Li, Jing Yuan et al.

ICLR 2025arXiv:2410.08631
5
citations
#3807

LICORICE: Label-Efficient Concept-Based Interpretable Reinforcement Learning

Zhuorui Ye, Stephanie Milani, Geoff Gordon et al.

ICLR 2025arXiv:2407.15786
5
citations
#3808

Credal Wrapper of Model Averaging for Uncertainty Estimation in Classification

Kaizheng Wang, Fabio Cuzzolin, Keivan Shariatmadar et al.

ICLR 2025arXiv:2405.15047
5
citations
#3809

Active Learning for Continual Learning: Keeping the Past Alive in the Present

Jaehyun Park, Dongmin Park, Jae-Gil Lee

ICLR 2025arXiv:2501.14278
5
citations
#3810

Let SSMs be ConvNets: State-space Modeling with Optimal Tensor Contractions

Yan Ru Pei

ICLR 2025arXiv:2501.13230
5
citations
#3811

Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models

Donghoon Kim, Minji Bae, Kyuhong Shim et al.

ICLR 2025arXiv:2505.08622
5
citations
#3812

How to Verify Any (Reasonable) Distribution Property: Computationally Sound Argument Systems for Distributions

Tal Herman, Guy Rothblum

ICLR 2025arXiv:2409.06594
5
citations
#3813

UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting

Haoyuan Li, Yanpeng Zhou, Tao Tang et al.

ICLR 2025arXiv:2502.17860
5
citations
#3814

Generating Physical Dynamics under Priors

Zihan Zhou, Xiaoxue Wang, Tianshu Yu

ICLR 2025arXiv:2409.00730
5
citations
#3815

Refine-by-Align: Reference-Guided Artifacts Refinement through Semantic Alignment

Yizhi Song, Liu He, Zhifei Zhang et al.

ICLR 2025arXiv:2412.00306
5
citations
#3816

Group-robust Sample Reweighting for Subpopulation Shifts via Influence Functions

Rui Qiao, Zhaoxuan Wu, Jingtan Wang et al.

ICLR 2025arXiv:2503.07315
5
citations
#3817

A Solvable Attention for Neural Scaling Laws

Bochen Lyu, Di Wang, Zhanxing Zhu

ICLR 2025
5
citations
#3818

3D Reconstruction with Generalizable Neural Fields using Scene Priors

Yang Fu, Shalini De Mello, Xueting Li et al.

ICLR 2024arXiv:2309.15164
5
citations
#3819

ZIP: An Efficient Zeroth-order Prompt Tuning for Black-box Vision-Language Models

Seonghwan Park, Jaehyeon Jeong, Yongjun Kim et al.

ICLR 2025arXiv:2504.06838
5
citations
#3820

Temporal Flexibility in Spiking Neural Networks: Towards Generalization Across Time Steps and Deployment Friendliness

Kangrui Du, Yuhang Wu, Shikuang Deng et al.

ICLR 2025oralarXiv:2503.17394
5
citations
#3821

Finding Shared Decodable Concepts and their Negations in the Brain

Cory Efird, Alex Murphy, Joel Zylberberg et al.

ICLR 2025arXiv:2405.17663
5
citations
#3822

Compute-Optimal LLMs Provably Generalize Better with Scale

Marc Finzi, Sanyam Kapoor, Diego Granziol et al.

ICLR 2025arXiv:2504.15208
5
citations
#3823

Epistemic Monte Carlo Tree Search

Yaniv Oren, Viliam Vadocz, Matthijs T. J. Spaan et al.

ICLR 2025arXiv:2210.13455
5
citations
#3824

BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks

Juan A. Rodriguez, Xiangru Jian, Siba Smarak Panigrahi et al.

ICLR 2025arXiv:2412.04626
5
citations
#3825

Chemistry-Inspired Diffusion with Non-Differentiable Guidance

Yuchen Shen, Chenhao Zhang, Sijie Fu et al.

ICLR 2025arXiv:2410.06502
5
citations
#3826

NetFormer: An interpretable model for recovering dynamical connectivity in neuronal population dynamics

Ziyu Lu, Wuwei Zhang, Trung Le et al.

ICLR 2025oral
5
citations
#3827

Towards Improving Exploration through Sibling Augmented GFlowNets

Kanika Madan, Alex Lamb, Emmanuel Bengio et al.

ICLR 2025
5
citations
#3828

A Riemannian Framework for Learning Reduced-order Lagrangian Dynamics

Katharina Friedl, Noémie Jaquier, Jens Lundell et al.

ICLR 2025arXiv:2410.18868
5
citations
#3829

DRSM: De-Randomized Smoothing on Malware Classifier Providing Certified Robustness

Shoumik Saha, Wenxiao Wang, Yigitcan Kaya et al.

ICLR 2024arXiv:2303.13372
5
citations
#3830

Unify ML4TSP: Drawing Methodological Principles for TSP and Beyond from Streamlined Design Space of Learning and Search

Yang Li, Jiale Ma, Wenzheng Pan et al.

ICLR 2025
5
citations
#3831

U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language Models

Tung-Yu Wu, Melody Lo

ICLR 2025arXiv:2410.01692
5
citations
#3832

Don't Take Things Out of Context: Attention Intervention for Enhancing Chain-of-Thought Reasoning in Large Language Models

Shaotian Yan, Chen Shen, Wenxiao Wang et al.

ICLR 2025arXiv:2503.11154
5
citations
#3833

RA-TTA: Retrieval-Augmented Test-Time Adaptation for Vision-Language Models

Youngjun Lee, Doyoung Kim, Junhyeok Kang et al.

ICLR 2025
5
citations
#3834

Novel Quadratic Constraints for Extending LipSDP beyond Slope-Restricted Activations

Patricia Pauli, Aaron Havens, Alexandre Araujo et al.

ICLR 2024arXiv:2401.14033
5
citations
#3835

Revisiting Source-Free Domain Adaptation: a New Perspective via Uncertainty Control

Gezheng Xu, Hui GUO, Li Yi et al.

ICLR 2025
5
citations
#3836

SPDIM: Source-Free Unsupervised Conditional and Label Shift Adaptation in EEG

Shanglin Li, Motoaki Kawanabe, Reinmar Kobler

ICLR 2025arXiv:2411.07249
5
citations
#3837

Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning

Kwanyoung Park, Youngwoon Lee

ICLR 2025arXiv:2407.00699
5
citations
#3838

Zero-shot Model-based Reinforcement Learning using Large Language Models

Abdelhakim Benechehab, Youssef Attia El Hili, Ambroise Odonnat et al.

ICLR 2025arXiv:2410.11711
5
citations
#3839

An Intuitive Multi-Frequency Feature Representation for SO(3)-Equivariant Networks

Dongwon Son, Jaehyung Kim, Sanghyeon Son et al.

ICLR 2024arXiv:2405.04537
5
citations
#3840

Robust Feature Learning for Multi-Index Models in High Dimensions

Alireza Mousavi-Hosseini, Adel Javanmard, Murat A Erdogdu

ICLR 2025arXiv:2410.16449
5
citations
#3841

Learning in reverse causal strategic environments with ramifications on two sided markets

Seamus Somerstep, Yuekai Sun, Yaacov Ritov

ICLR 2024arXiv:2404.13240
5
citations
#3842

Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning

Xiaochuan Li, Zichun Yu, Chenyan Xiong

ICLR 2025arXiv:2410.14208
5
citations
#3843

Sensitivity-Constrained Fourier Neural Operators for Forward and Inverse Problems in Parametric Differential Equations

Abdolmehdi Behroozi, Chaopeng Shen, Daniel Kifer

ICLR 2025arXiv:2505.08740
5
citations
#3844

SeRA: Self-Reviewing and Alignment of LLMs using Implicit Reward Margins

Jongwoo Ko, Saket Dingliwal, Bhavana Ganesh et al.

ICLR 2025
5
citations
#3845

Understanding Matrix Function Normalizations in Covariance Pooling through the Lens of Riemannian Geometry

Ziheng Chen, Yue Song, Xiaojun Wu et al.

ICLR 2025arXiv:2407.10484
5
citations
#3846

Streamlining Prediction in Bayesian Deep Learning

Rui Li, Marcus Klasson, Arno Solin et al.

ICLR 2025arXiv:2411.18425
5
citations
#3847

Latent Radiance Fields with 3D-aware 2D Representations

Chaoyi Zhou, Xi Liu, Feng Luo et al.

ICLR 2025arXiv:2502.09613
5
citations
#3848

Toward Generalizing Visual Brain Decoding to Unseen Subjects

Xiangtao Kong, Kexin Huang, Ping Li et al.

ICLR 2025arXiv:2410.14445
5
citations
#3849

Correlating instruction-tuning (in multimodal models) with vision-language processing (in the brain)

SUBBA REDDY OOTA, Akshett Rai Jindal, Ishani Mondal et al.

ICLR 2025arXiv:2505.20029
5
citations
#3850

Convergence of Distributed Adaptive Optimization with Local Updates

Ziheng Cheng, Margalit Glasgow

ICLR 2025arXiv:2409.13155
5
citations
#3851

Feedback Favors the Generalization of Neural ODEs

Jindou Jia, Zihan Yang, Meng Wang et al.

ICLR 2025arXiv:2410.10253
5
citations
#3852

ADMM for Nonconvex Optimization under Minimal Continuity Assumption

Ganzhao Yuan

ICLR 2025arXiv:2405.03233
5
citations
#3853

Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning

Calarina Muslimani, Matthew E Taylor

ICLR 2025arXiv:2405.00746
5
citations
#3854

Beyond Random Masking: When Dropout meets Graph Convolutional Networks

Yuankai Luo, Xiao-Ming Wu, Hao Zhu

ICLR 2025
5
citations
#3855

A Simple Approach to Unifying Diffusion-based Conditional Generation

Xirui Li, Charles Herrmann, Kelvin Chan et al.

ICLR 2025arXiv:2410.11439
5
citations
#3856

CHAMP: Conformalized 3D Human Multi-Hypothesis Pose Estimators

Harry Zhang, Luca Carlone

ICLR 2025arXiv:2407.06141
5
citations
#3857

Learning-Guided Rolling Horizon Optimization for Long-Horizon Flexible Job-Shop Scheduling

Sirui Li, Wenbin Ouyang, Yining Ma et al.

ICLR 2025arXiv:2502.15791
5
citations
#3858

Topological Schrödinger Bridge Matching

Maosheng Yang

ICLR 2025arXiv:2504.04799
5
citations
#3859

Differentially Private Federated Learning with Time-Adaptive Privacy Spending

Shahrzad Kianidehkordi, Nupur Kulkarni, Adam Dziedzic et al.

ICLR 2025
5
citations
#3860

Rodimus*: Breaking the Accuracy-Efficiency Trade-Off with Efficient Attentions

Zhihao He, Hang Yu, Zi Gong et al.

ICLR 2025arXiv:2410.06577
5
citations
#3861

Optimization by Parallel Quasi-Quantum Annealing with Gradient-Based Sampling

Yuma Ichikawa, Yamato Arai

ICLR 2025arXiv:2409.02135
5
citations
#3862

ContextRef: Evaluating Referenceless Metrics for Image Description Generation

Elisa Kreiss, Elisa Kreiss, Eric Zelikman et al.

ICLR 2024arXiv:2309.11710
5
citations
#3863

Advancing Graph Generation through Beta Diffusion

Xinyang Liu, Yilin He, Bo Chen et al.

ICLR 2025arXiv:2406.09357
5
citations
#3864

Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning

Samuel Garcin, Trevor McInroe, Pablo Samuel Castro et al.

ICLR 2025arXiv:2503.06343
5
citations
#3865

Improving Probabilistic Diffusion Models With Optimal Diagonal Covariance Matching

Zijing Ou, Mingtian Zhang, Andi Zhang et al.

ICLR 2025arXiv:2406.10808
5
citations
#3866

ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization

The Viet Bui, Thanh Nguyen, Tien Mai

ICLR 2025arXiv:2410.01954
5
citations
#3867

When LLMs Play the Telephone Game: Cultural Attractors as Conceptual Tools to Evaluate LLMs in Multi-turn Settings

Jérémy Perez, Grgur Kovac, Corentin Léger et al.

ICLR 2025arXiv:2407.04503
5
citations
#3868

VLMaterial: Procedural Material Generation with Large Vision-Language Models

Beichen Li, Rundi Wu, Armando Solar-Lezama et al.

ICLR 2025arXiv:2501.18623
5
citations
#3869

Uncertainty and Influence aware Reward Model Refinement for Reinforcement Learning from Human Feedback

Zexu Sun, Yiju Guo, Yankai Lin et al.

ICLR 2025
5
citations
#3870

On the Hölder Stability of Multiset and Graph Neural Networks

Yair Davidson, Nadav Dym

ICLR 2025arXiv:2406.06984
5
citations
#3871

Decongestion by Representation: Learning to Improve Economic Welfare in Marketplaces

Omer Nahum, Gali Noti, David Parkes et al.

ICLR 2024arXiv:2306.10606
5
citations
#3872

DreamCatalyst: Fast and High-Quality 3D Editing via Controlling Editability and Identity Preservation

Jiwook Kim, Seonho Lee, Jaeyo Shin et al.

ICLR 2025arXiv:2407.11394
5
citations
#3873

KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models

Fan Wang, Juyong Jiang, Chansung Park et al.

ICLR 2025arXiv:2412.06071
5
citations
#3874

SMI-Editor: Edit-based SMILES Language Model with Fragment-level Supervision

Kangjie Zheng, Siyue Liang, Junwei Yang et al.

ICLR 2025arXiv:2412.05569
5
citations
#3875

Interpreting Language Reward Models via Contrastive Explanations

Junqi Jiang, Tom Bewley, Saumitra Mishra et al.

ICLR 2025arXiv:2411.16502
5
citations
#3876

Composable Interventions for Language Models

Arinbjörn Kolbeinsson, Kyle O'Brien, Tianjin Huang et al.

ICLR 2025arXiv:2407.06483
5
citations
#3877

Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning

Mingde Zhao, Safa Alver, Harm Seijen et al.

ICLR 2024oralarXiv:2310.00229
5
citations
#3878

Shifting the Paradigm: A Diffeomorphism Between Time Series Data Manifolds for Achieving Shift-Invariancy in Deep Learning

Berken Utku Demirel, Christian Holz

ICLR 2025arXiv:2502.19921
5
citations
#3879

Risk-Controlling Model Selection via Guided Bayesian Optimization

Adam Fisch, Regina Barzilay, Bracha Laufer-Goldshtein et al.

ICLR 2025arXiv:2312.01692
5
citations
#3880

Solving Inverse Problems with Model Mismatch using Untrained Neural Networks within Model-based Architectures

Peimeng Guan, Naveed Iqbal, Mark Davenport et al.

ICLR 2025arXiv:2403.04847
5
citations
#3881

Handling Delay in Real-Time Reinforcement Learning

Ivan Anokhin, Rishav Rishav, Matt Riemer et al.

ICLR 2025oralarXiv:2503.23478
5
citations
#3882

Lift Your Molecules: Molecular Graph Generation in Latent Euclidean Space

Mohamed Amine Ketata, Nicholas Gao, Johanna Sommer et al.

ICLR 2025arXiv:2406.10513
5
citations
#3883

Pushing the Limits of All-Atom Geometric Graph Neural Networks: Pre-Training, Scaling, and Zero-Shot Transfer

Zihan Pengmei, Zhengyuan Shen, Zichen Wang et al.

ICLR 2025arXiv:2410.21683
5
citations
#3884

Hierarchical Uncertainty Estimation for Learning-based Registration in Neuroimaging

Xiaoling Hu, Karthik Gopinath, Peirong Liu et al.

ICLR 2025arXiv:2410.09299
5
citations
#3885

Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning

Yujian Liu, Shiyu Chang, Tommi Jaakkola et al.

ICLR 2025arXiv:2410.19290
5
citations
#3886

IFORMER: INTEGRATING CONVNET AND TRANSFORMER FOR MOBILE APPLICATION

Chuanyang Zheng

ICLR 2025arXiv:2501.15369
5
citations
#3887

Synergy Between Sufficient Changes and Sparse Mixing Procedure for Disentangled Representation Learning

Zijian Li, Shunxing Fan, Yujia Zheng et al.

ICLR 2025arXiv:2503.00639
5
citations
#3888

Reconsidering Faithfulness in Regular, Self-Explainable and Domain Invariant GNNs

Steve Azzolin, Antonio Longa, Stefano Teso et al.

ICLR 2025arXiv:2406.15156
5
citations
#3889

Hessian-Free Online Certified Unlearning

Xinbao Qiao, Meng Zhang, Ming Tang et al.

ICLR 2025arXiv:2404.01712
5
citations
#3890

Unraveling the Enigma of Double Descent: An In-depth Analysis through the Lens of Learned Feature Space

Yufei Gu, Xiaoqing Zheng, Tomaso Aste

ICLR 2024arXiv:2310.13572
5
citations
#3891

MAPS: Advancing Multi-Modal Reasoning in Expert-Level Physical Science

Erle Zhu, Yadi Liu, Zhe Zhang et al.

ICLR 2025arXiv:2501.10768
5
citations
#3892

Neuron based Personality Trait Induction in Large Language Models

Jia Deng, Tianyi Tang, Yanbin Yin et al.

ICLR 2025arXiv:2410.12327
5
citations
#3893

Unknown Domain Inconsistency Minimization for Domain Generalization

Seungjae Shin, HeeSun Bae, Byeonghu Na et al.

ICLR 2024arXiv:2403.07329
5
citations
#3894

Robust Barycenter Estimation using Semi-Unbalanced Neural Optimal Transport

Milena Gazdieva, Jaemoo Choi, Alexander Kolesov et al.

ICLR 2025arXiv:2410.03974
5
citations
#3895

Selective Task Group Updates for Multi-Task Optimization

Wooseong Jeong, Kuk-Jin Yoon

ICLR 2025arXiv:2502.11986
5
citations
#3896

EBMDock: Neural Probabilistic Protein-Protein Docking via a Differentiable Energy Model

Huaijin Wu, Wei Liu, Yatao Bian et al.

ICLR 2024
5
citations
#3897

SpaCE: The Spatial Confounding Environment

Mauricio Tec, Ana Trisovic, Michelle Audirac et al.

ICLR 2024arXiv:2312.00710
5
citations
#3898

AutoCLIP: Auto-tuning Zero-Shot Classifiers for Vision-Language Models

Jan Metzen, Piyapat Saranrittichai, Chaithanya Kumar Mummadi

ICLR 2025arXiv:2309.16414
5
citations
#3899

Plugin estimators for selective classification with out-of-distribution detection

Harikrishna Narasimhan, Aditya Krishna Menon, Wittawat Jitkrittum et al.

ICLR 2024arXiv:2301.12386
5
citations
#3900

Differentiable and Learnable Wireless Simulation with Geometric Transformers

Thomas Hehn, Markus Peschl, Tribhuvanesh Orekondy et al.

ICLR 2025arXiv:2406.14995
5
citations
#3901

Motion Control of High-Dimensional Musculoskeletal Systems with Hierarchical Model-Based Planning

Yunyue Wei, Shanning Zhuang, Vincent Zhuang et al.

ICLR 2025arXiv:2505.08238
5
citations
#3902

TS-LIF: A Temporal Segment Spiking Neuron Network for Time Series Forecasting

Shibo Feng, Wanjin Feng, Xingyu Gao et al.

ICLR 2025oralarXiv:2503.05108
5
citations
#3903

Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation

Sungnyun Kim, Sungwoo Cho, Sangmin Bae et al.

ICLR 2025arXiv:2504.18539
5
citations
#3904

Learning Equivariant Non-Local Electron Density Functionals

Nicholas Gao, Eike Eberhard, Stephan Günnemann

ICLR 2025arXiv:2410.07972
5
citations
#3905

Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Learning

Xinran Li, Xiaolu Wang, Chenjia Bai et al.

ICLR 2025arXiv:2502.19717
5
citations
#3906

Learning Robust Generalizable Radiance Field with Visibility and Feature Augmented Point Representation

Jiaxu Wang, Ziyi Zhang, Renjing Xu

ICLR 2024arXiv:2401.14354
5
citations
#3907

Towards the Fundamental Limits of Knowledge Transfer over Finite Domains

Qingyue Zhao, Banghua Zhu

ICLR 2024arXiv:2310.07838
5
citations
#3908

Improving Large Language Model Planning with Action Sequence Similarity

Xinran Zhao, Hanie Sedghi, Bernd Bohnet et al.

ICLR 2025arXiv:2505.01009
5
citations
#3909

Treatment Effects Estimation By Uniform Transformer

Ruoqi Yu, Shulei Wang

ICLR 2024arXiv:2008.03738
5
citations
#3910

Efficient Perplexity Bound and Ratio Matching in Discrete Diffusion Language Models

Etrit Haxholli, Yeti Z. Gurbuz, Oğul Can et al.

ICLR 2025arXiv:2507.04341
5
citations
#3911

DenoiseVAE: Learning Molecule-Adaptive Noise Distributions for Denoising-based 3D Molecular Pre-training

Yurou Liu, Jiahao Chen, Rui Jiao et al.

ICLR 2025
5
citations
#3912

A Coefficient Makes SVRG Effective

Yida Yin, Zhiqiu Xu, Zhiyuan Li et al.

ICLR 2025arXiv:2311.05589
5
citations
#3913

MixEval-X: Any-to-any Evaluations from Real-world Data Mixture

Jinjie Ni, Yifan Song, Deepanway Ghosal et al.

ICLR 2025arXiv:2410.13754
5
citations
#3914

Wasserstein-Regularized Conformal Prediction under General Distribution Shift

Rui Xu, Chao Chen, Yue Sun et al.

ICLR 2025arXiv:2501.13430
5
citations
#3915

Zeroth-Order Fine-Tuning of LLMs with Transferable Static Sparsity

Wentao Guo, Jikai Long, Yimeng Zeng et al.

ICLR 2025
5
citations
#3916

HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents

Tristan Tomilin, Meng Fang, Mykola Pechenizkiy

ICLR 2025arXiv:2503.08241
5
citations
#3917

ANaGRAM: A Natural Gradient Relative to Adapted Model for efficient PINNs learning

Nilo Schwencke, Cyril Furtlehner

ICLR 2025arXiv:2412.10782
5
citations
#3918

HALL-E: Hierarchical Neural Codec Language Model for Minute-Long Zero-Shot Text-to-Speech Synthesis

Yuto Nishimura, Takumi Hirose, Masanari Ohi et al.

ICLR 2025arXiv:2410.04380
5
citations
#3919

Beyond Next Token Prediction: Patch-Level Training for Large Language Models

Chenze Shao, Fandong Meng, Jie Zhou

ICLR 2025arXiv:2407.12665
5
citations
#3920

Calibrating Expressions of Certainty

Peiqi Wang, Barbara Lam, Yingcheng Liu et al.

ICLR 2025arXiv:2410.04315
5
citations
#3921

From an LLM Swarm to a PDDL-empowered Hive: Planning Self-executed Instructions in a Multi-modal Jungle

Kaustubh Vyas, Damien Graux, Yijun Yang et al.

ICLR 2025arXiv:2412.12839
5
citations
#3922

Context Clues: Evaluating Long Context Models for Clinical Prediction Tasks on EHR Data

Michael Wornow, Suhana Bedi, Miguel Angel Fuentes Hernandez et al.

ICLR 2025
5
citations
#3923

Self-Updatable Large Language Models by Integrating Context into Model Parameters

Yu Wang, Xinshuang Liu, Xiusi Chen et al.

ICLR 2025arXiv:2410.00487
5
citations
#3924

HR-Extreme: A High-Resolution Dataset for Extreme Weather Forecasting

Nian Ran, Peng Xiao, Yue Wang et al.

ICLR 2025arXiv:2409.18885
5
citations
#3925

Shallow diffusion networks provably learn hidden low-dimensional structure

Nicholas Boffi, Arthur Jacot, Stephen Tu et al.

ICLR 2025arXiv:2410.11275
5
citations
#3926

Taming Transformer Without Using Learning Rate Warmup

Xianbiao Qi, Yelin He, Jiaquan Ye et al.

ICLR 2025arXiv:2505.21910
5
citations
#3927

Learning Interleaved Image-Text Comprehension in Vision-Language Large Models

Chenyu Zhou, Mengdan Zhang, Peixian Chen et al.

ICLR 2025arXiv:2406.10228
5
citations
#3928

Energy-conserving equivariant GNN for elasticity of lattice architected metamaterials

Ivan Grega, Ilyes Batatia, Gábor Csányi et al.

ICLR 2024arXiv:2401.16914
5
citations
#3929

Constructing Confidence Intervals for Average Treatment Effects from Multiple Datasets

Yuxin Wang, Maresa Schröder, Dennis Frauen et al.

ICLR 2025arXiv:2412.11511
5
citations
#3930

MrSteve: Instruction-Following Agents in Minecraft with What-Where-When Memory

Junyeong Park, Junmo Cho, Sungjin Ahn

ICLR 2025arXiv:2411.06736
5
citations
#3931

Learning Causal Alignment for Reliable Disease Diagnosis

Mingzhou Liu, Ching-Wen Lee, Xinwei Sun et al.

ICLR 2025arXiv:2310.01766
5
citations
#3932

QA-Calibration of Language Model Confidence Scores

Putra Manggala, Atalanti A Mastakouri, Elke Kirschbaum et al.

ICLR 2025arXiv:2410.06615
5
citations
#3933

ComPC: Completing a 3D Point Cloud with 2D Diffusion Priors

Tianxin Huang, Zhiwen Yan, Yuyang Zhao et al.

ICLR 2025arXiv:2404.06814
5
citations
#3934

Sharpness-Aware Black-Box Optimization

Feiyang YE, YUEMING LYU, Xuehao Wang et al.

ICLR 2025arXiv:2410.12457
5
citations
#3935

Hierarchically Encapsulated Representation for Protocol Design in Self-Driving Labs

Yu-Zhe Shi, Mingchen Liu, Fanxu Meng et al.

ICLR 2025arXiv:2504.03810
5
citations
#3936

ECHOPulse: ECG Controlled Echocardio-gram Video Generation

Yiwei Li, Sekeun Kim, Zihao Wu et al.

ICLR 2025arXiv:2410.03143
5
citations
#3937

Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions

Xiaoran Jiao, Weian Mao, Wengong Jin et al.

ICLR 2025arXiv:2410.09543
5
citations
#3938

Unsupervised Pretraining for Fact Verification by Language Model Distillation

Adrian Bazaga, Pietro Lio, Gos Micklem

ICLR 2024arXiv:2309.16540
5
citations
#3939

SetCSE: Set Operations using Contrastive Learning of Sentence Embeddings

Kang Liu

ICLR 2024arXiv:2404.17606
5
citations
#3940

SEPARATE: A Simple Low-rank Projection for Gradient Compression in Modern Large-scale Model Training Process

Hanzhen Zhao, Xingyu Xie, Cong Fang et al.

ICLR 2025
5
citations
#3941

MOFFlow: Flow Matching for Structure Prediction of Metal-Organic Frameworks

Nayoung Kim, Seongsu Kim, Minsu Kim et al.

ICLR 2025arXiv:2410.17270
5
citations
#3942

SymmetricDiffusers: Learning Discrete Diffusion on Finite Symmetric Groups

Yongxing Zhang, Donglin Yang, Renjie Liao

ICLR 2025arXiv:2410.02942
5
citations
#3943

Learning High-Degree Parities: The Crucial Role of the Initialization

Emmanuel Abbe, Elisabetta Cornacchia, Jan Hązła et al.

ICLR 2025arXiv:2412.04910
5
citations
#3944

Microcanonical Langevin Ensembles: Advancing the Sampling of Bayesian Neural Networks

Emanuel Sommer, Jakob Robnik, Giorgi Nozadze et al.

ICLR 2025arXiv:2502.06335
5
citations
#3945

Amortized Control of Continuous State Space Feynman-Kac Model for Irregular Time Series

Byoungwoo Park, Hyungi Lee, Juho Lee

ICLR 2025arXiv:2410.05602
5
citations
#3946

Automatic Functional Differentiation in JAX

Min Lin

ICLR 2024arXiv:2311.18727
5
citations
#3947

CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features

Po-han Li, Sandeep Chinchali, ufuk topcu

ICLR 2025arXiv:2410.07610
5
citations
#3948

A Generic Framework for Conformal Fairness

Aditya Vadlamani, Anutam Srinivasan, Pranav Maneriker et al.

ICLR 2025arXiv:2505.16115
5
citations
#3949

Conditional Diffusion Models are Minimax-Optimal and Manifold-Adaptive for Conditional Distribution Estimation

Rong Tang, Lizhen Lin, Yun Yang

ICLR 2025arXiv:2409.20124
5
citations
#3950

Progressive Compression with Universally Quantized Diffusion Models

Yibo Yang, Justus Will, Stephan Mandt

ICLR 2025arXiv:2412.10935
5
citations
#3951

Multi-Label Test-Time Adaptation with Bound Entropy Minimization

Xiangyu Wu, Feng Yu, Yang Yang et al.

ICLR 2025arXiv:2502.03777
5
citations
#3952

Flavors of Margin: Implicit Bias of Steepest Descent in Homogeneous Neural Networks

Nikolaos Tsilivis, Gal Vardi, Julia Kempe

ICLR 2025arXiv:2410.22069
5
citations
#3953

HiGen: Hierarchical Graph Generative Networks

Mahdi Karami

ICLR 2024arXiv:2305.19337
5
citations
#3954

Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models

Chengyu Du, Jinyi Han, Yizhou Ying et al.

ICLR 2025arXiv:2410.13413
5
citations
#3955

Structural Estimation of Partially Observed Linear Non-Gaussian Acyclic Model: A Practical Approach with Identifiability

Songyao Jin, Feng Xie, Guangyi Chen et al.

ICLR 2024
5
citations
#3956

Optimal Non-Asymptotic Rates of Value Iteration for Average-Reward Markov Decision Processes

Jongmin Lee, Ernest Ryu

ICLR 2025arXiv:2504.09913
5
citations
#3957

Demonstration-Regularized RL

Daniil Tiapkin, Denis Belomestny, Daniele Calandriello et al.

ICLR 2024arXiv:2310.17303
5
citations
#3958

Convolutional Deep Kernel Machines

Edward Milsom, Ben Anson, Laurence Aitchison

ICLR 2024arXiv:2309.09814
5
citations
#3959

Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models

Linh Tran, Wei Sun, Stacy Patterson et al.

ICLR 2025arXiv:2501.13904
5
citations
#3960

Leveraging Hyperbolic Embeddings for Coarse-to-Fine Robot Design

Heng Dong, Junyu Zhang, Chongjie Zhang

ICLR 2024arXiv:2311.00462
5
citations
#3961

Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality

Ge Ya Luo, Gian M Favero, Zhi Hao Luo et al.

ICLR 2025oral
5
citations
#3962

The Update-Equivalence Framework for Decision-Time Planning

Samuel Sokota, Gabriele Farina, David Wu et al.

ICLR 2024arXiv:2304.13138
5
citations
#3963

Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias

Rui Lu, Runzhe Wang, Kaifeng Lyu et al.

ICLR 2025arXiv:2503.03595
5
citations
#3964

Model-Free Offline Reinforcement Learning with Enhanced Robustness

Chi Zhang, Zain Ulabedeen Farhat, George Atia et al.

ICLR 2025
5
citations
#3965

Conformal Generative Modeling with Improved Sample Efficiency through Sequential Greedy Filtering

Klaus-Rudolf Kladny, Bernhard Schölkopf, Michael Muehlebach

ICLR 2025arXiv:2410.01660
5
citations
#3966

Two Sparse Matrices are Better than One: Sparsifying Neural Networks with Double Sparse Factorization

Vladimir Boza, Vladimir Macko

ICLR 2025
5
citations
#3967

RaSA: Rank-Sharing Low-Rank Adaptation

Zhiwei He, Zhaopeng Tu, Xing Wang et al.

ICLR 2025arXiv:2503.12576
5
citations
#3968

GLoRa: A Benchmark to Evaluate the Ability to Learn Long-Range Dependencies in Graphs

Dongzhuoran Zhou, Evgeny Kharlamov, Egor Kostylev

ICLR 2025
5
citations
#3969

Feature-Based Online Bilateral Trade

Solenne Gaucher, Martino Bernasconi, Matteo Castiglioni et al.

ICLR 2025arXiv:2405.18183
5
citations
#3970

Towards Continuous Reuse of Graph Models via Holistic Memory Diversification

Ziyue Qiao, Junren Xiao, Qingqiang Sun et al.

ICLR 2025arXiv:2406.07413
5
citations
#3971

Understanding the Generalization of In-Context Learning in Transformers: An Empirical Study

Xingxuan Zhang, Haoran Wang, Jiansheng Li et al.

ICLR 2025arXiv:2503.15579
5
citations
#3972

BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL

Yu Heng Hung, Kai-Jie Lin, Yu-Heng Lin et al.

ICLR 2025arXiv:2505.21974
5
citations
#3973

From Probability to Counterfactuals: the Increasing Complexity of Satisfiability in Pearl's Causal Hierarchy

Julian Dörfler, Benito van der Zander, Markus Bläser et al.

ICLR 2025arXiv:2405.07373
5
citations
#3974

Natural Language Inference Improves Compositionality in Vision-Language Models

Paola Cascante-Bonilla, Yu (Hope) Hou, Yang Cao et al.

ICLR 2025arXiv:2410.22315
5
citations
#3975

Learning to Communicate Through Implicit Communication Channels

Han Wang, Binbin Chen, zhang et al.

ICLR 2025arXiv:2411.01553
5
citations
#3976

MGDA Converges under Generalized Smoothness, Provably

Qi Zhang, Peiyao Xiao, Shaofeng Zou et al.

ICLR 2025arXiv:2405.19440
5
citations
#3977

On Generalization Across Environments In Multi-Objective Reinforcement Learning

Jayden Teoh, Pradeep Varakantham, Peter Vamplew

ICLR 2025arXiv:2503.00799
5
citations
#3978

On-the-fly Preference Alignment via Principle-Guided Decoding

Mingye Zhu, Yi Liu, Lei Zhang et al.

ICLR 2025arXiv:2502.14204
5
citations
#3979

SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation

Song Duong, Florian Le Bronnec, Alexandre Allauzen et al.

ICLR 2025arXiv:2502.13674
5
citations
#3980

NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval

Sepanta Zeighami, Zac Wellmer, Aditya Parameswaran

ICLR 2025arXiv:2409.02343
5
citations
#3981

ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization

Chen Bo Calvin Zhang, Zhang-Wei Hong, Aldo Pacchiano et al.

ICLR 2025arXiv:2410.13837
5
citations
#3982

Self-Supervised Diffusion MRI Denoising via Iterative and Stable Refinement

Chenxu Wu, Qingpeng Kong, Zihang Jiang et al.

ICLR 2025oralarXiv:2501.13514
5
citations
#3983

Towards Understanding the Universality of Transformers for Next-Token Prediction

Michael Sander, Gabriel Peyré

ICLR 2025arXiv:2410.03011
5
citations
#3984

Neural Interactive Proofs

Lewis Hammond, Sam Adam-Day

ICLR 2025arXiv:2412.08897
5
citations
#3985

Det-CGD: Compressed Gradient Descent with Matrix Stepsizes for Non-Convex Optimization

Hanmin Li, Avetik Karagulyan, Peter Richtarik

ICLR 2024arXiv:2305.12568
5
citations
#3986

Data-adaptive Differentially Private Prompt Synthesis for In-Context Learning

Fengyu Gao, Ruida Zhou, Tianhao Wang et al.

ICLR 2025arXiv:2410.12085
5
citations
#3987

ZETA: Leveraging $Z$-order Curves for Efficient Top-$k$ Attention

Qiuhao Zeng, Jierui Huang, Peng Lu et al.

ICLR 2025arXiv:2501.14577
5
citations
#3988

Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization

Yuxin Jiang, Bo Huang, Yufei Wang et al.

ICLR 2025arXiv:2408.07471
5
citations
#3989

Simple, Good, Fast: Self-Supervised World Models Free of Baggage

Jan Robine, Marc Höftmann, Stefan Harmeling

ICLR 2025arXiv:2506.02612
5
citations
#3990

A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement

Hui Yuan, Yifan Zeng, Yue Wu et al.

ICLR 2025arXiv:2410.13828
5
citations
#3991

What should a neuron aim for? Designing local objective functions based on information theory

Andreas C. Schneider, Valentin Neuhaus, David Ehrlich et al.

ICLR 2025arXiv:2412.02482
5
citations
#3992

ReFusion: Improving Natural Language Understanding with Computation-Efficient Retrieval Representation Fusion

Shangyu Wu, Ying Xiong, Yufei CUI et al.

ICLR 2024arXiv:2401.02993
5
citations
#3993

Lightweight Predictive 3D Gaussian Splats

Junli Cao, Vidit Goel, Chaoyang Wang et al.

ICLR 2025arXiv:2406.19434
5
citations
#3994

Understanding the Robustness of Multi-modal Contrastive Learning to Distribution Shift

Yihao Xue, Siddharth Joshi, Dang Nguyen et al.

ICLR 2024arXiv:2310.04971
5
citations
#3995

LeanVec: Searching vectors faster by making them fit

Ishwar Bhati, Cecilia Aguerrebere, Mark Hildebrand et al.

ICLR 2025arXiv:2312.16335
5
citations
#3996

No Free Lunch: Fundamental Limits of Learning Non-Hallucinating Generative Models

Changlong Wu, Ananth Grama, Wojciech Szpankowski

ICLR 2025arXiv:2410.19217
5
citations
#3997

Attribute-based Visual Reprogramming for Vision-Language Models

Chengyi Cai, Zesheng Ye, Lei Feng et al.

ICLR 2025arXiv:2501.13982
5
citations
#3998

Estimating Shape Distances on Neural Representations with Limited Samples

Dean Pospisil, Brett Larsen, Sarah Harvey et al.

ICLR 2024arXiv:2310.05742
5
citations
#3999

On Representation Complexity of Model-based and Model-free Reinforcement Learning

Hanlin Zhu, Baihe Huang, Stuart Russell

ICLR 2024arXiv:2310.01706
5
citations
#4000

Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization

Timofei Gritsaev, Nikita Morozov, Sergey Samsonov et al.

ICLR 2025arXiv:2410.15474
5
citations