Most Cited ICLR "lightweight model design" Papers

#5206

Understanding Warmup-Stable-Decay Learning Rates: A River Valley Loss Landscape View

Kaiyue Wen, Zhiyuan Li, Jason Wang et al.

ICLR 2025 arXiv:2412.12361

#5207

The Ramanujan Library - Automated Discovery on the Hypergraph of Integer Relations

Itay Beit Halachmi, Ido Kaminer

#5208

LLCP: Learning Latent Causal Processes for Reasoning-based Video Question Answer

Guangyi Chen, Yuke Li, Xiao Liu et al.

#5209

Fiber Monte Carlo

Nick Richardson, Deniz Oktay, Yaniv Ovadia et al.

ICLR 2024 arXiv:2310.07923

#5210

The Expressive Power of Transformers with Chain of Thought

William Merrill, Ashish Sabharwal

#5211

Inner Classifier-Free Guidance and Its Taylor Expansion for Diffusion Models

Shikun Sun, Longhui Wei, Zhicai Wang et al.

#5212

Linear Recurrences Accessible to Everyone

Felix Sarnthein

#5213

Efficient and Robust Neural Combinatorial Optimization via Wasserstein-Based Coresets

Xu Wang, Fuyou Miao, Wenjie Liu et al.

#5214

Compressing Latent Space via Least Volume

Qiuyi Chen, Mark Fuge

#5215

Bayesian Image Regression with Soft-thresholded Conditional Autoregressive Prior

Yuliang Xu, Jian Kang

ICLR 2025 oral arXiv:2507.03393

#5216

Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos

Yufan Zhou, Zhaobo Qi, Lingshuai Lin et al.

#5217

An interpretable error correction method for enhancing code-to-code translation

Min Xue, Artur Andrzejak, Marla Leuther

#5218

Self-Supervised Diffusion Models for Electron-Aware Molecular Representation Learning

Gyoung S. Na, Chanyoung Park

#5219

A Unified Framework for Bayesian Optimization under Contextual Uncertainty

Sebastian Shenghong Tay, Chuan-Sheng Foo, Daisuke Urano et al.

#5220

Learning Large DAGs is Harder than you Think: Many Losses are Minimal for the Wrong DAG

Jonas Seng, Matej Zečević, Devendra Singh Dhami et al.

ICLR 2025 oral arXiv:2405.14650

#5221

PhiNets: Brain-inspired Non-contrastive Learning Based on Temporal Prediction Hypothesis

Satoki Ishikawa, Makoto Yamada, Han Bao et al.

#5222

Active Retrosynthetic Planning Aware of Route Quality

Luotian Yuan, Yemin Yu, Ying Wei et al.

#5223

Zero-Mean Regularized Spectral Contrastive Learning: Implicitly Mitigating Wrong Connections in Positive-Pair Graphs

Xiong Zhou, Xianming Liu, feilong zhang et al.

ICLR 2024 arXiv:2403.01058

#5224

Neural Field Classifiers via Target Encoding and Classification Loss

Xindi Yang, Zeke Xie, Xiong Zhou et al.

#5225

Safety-Prioritizing Curricula for Constrained Reinforcement Learning

Cevahir Koprulu, Thiago Simão, Nils Jansen et al.

ICLR 2025 arXiv:2406.03337

#5226

Identifying latent state transitions in non-linear dynamical systems

Çağlar Hızlı, Çağatay Yıldız, Matthias Bethge et al.

#5227

GaussianAnything: Interactive Point Cloud Flow Matching for 3D Generation

Yushi LAN, Shangchen Zhou, Zhaoyang Lyu et al.

ICLR 2025 arXiv:2510.08858

#5228

Sparse components distinguish visual pathways & their alignment to neural networks

Ammar I Marvi, Nancy Kanwisher, Meenakshi Khosla

#5229

Are Bert Family Good Instruction Followers? A Study on Their Potential And Limitations

yisheng xiao, Juntao Li, Zechen Sun et al.

#5230

Synergistic Patch Pruning for Vision Transformer: Unifying Intra- & Inter-Layer Patch Importance

Yuyao Zhang, Lan Wei, Nikolaos Freris

#5231

Scaling In-the-Wild Training for Diffusion-based Illumination Harmonization and Editing by Imposing Consistent Light Transport

Lvmin Zhang, Anyi Rao, Maneesh Agrawala

#5232

On Bias-Variance Alignment in Deep Models

Lin Chen, Michal Lukasik, Wittawat Jitkrittum et al.

ICLR 2025 arXiv:2504.11831

#5233

Support is All You Need for Certified VAE Training

Changming Xu, Debangshu Banerjee, Deepak Vasisht et al.

#5234

Contrastive Preference Learning: Learning from Human Feedback without Reinforcement Learning

Joey Hejna, Rafael Rafailov, Harshit Sikchi et al.

#5235

Transformers Provably Learn Two-Mixture of Linear Classification via Gradient Flow

Hongru Yang, Zhangyang Wang, Jason Lee et al.

#5236

The Reversal Curse: LLMs trained on “A is B” fail to learn “B is A”

Lukas Berglund, Meg Tong, Maximilian Kaufmann et al.

#5237

Variance-enlarged Poisson Learning for Graph-based Semi-Supervised Learning with Extremely Sparse Labeled Data

Xiong Zhou, Xianming Liu, Hao Yu et al.

#5238

Forewarned is Forearmed: Harnessing LLMs for Data Synthesis via Failure-induced Exploration

Qintong Li, Jiahui Gao, Sheng Wang et al.

#5239

CheapNet: Cross-attention on Hierarchical representations for Efficient protein-ligand binding Affinity Prediction

Hyukjun Lim, Sun Kim, Sangseon Lee

#5240

Factual Context Validation and Simplification: A Scalable Method to Enhance GPT Trustworthiness and Efficiency

Tianyi Huang

#5241

Solving Homogeneous and Heterogeneous Cooperative Tasks with Greedy Sequential Execution

Shanqi Liu, Dong Xing, Pengjie Gu et al.

#5242

Century: A Framework and Dataset for Evaluating Historical Contextualisation of Sensitive Images

Canfer Akbulut, Kevin Robinson, Maribeth Rauh et al.

#5243

FreeDyG: Frequency Enhanced Continuous-Time Dynamic Graph Model for Link Prediction

Yuxing Tian, Yiyan Qi, Fan Guo

#5244

Concept Bottleneck Generative Models

Aya Abdelsalam Ismail, Julius Adebayo, Hector Corrada Bravo et al.

#5245

TC-MoE: Augmenting Mixture of Experts with Ternary Expert Choice

Shen Yan, Xingyan Bin, Sijun Zhang et al.

#5246

Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games

Stephen McAleer, John Banister Lanier, Kevin A. Wang et al.

ICLR 2025 arXiv:2408.16916

#5247

A Computational Framework for Modeling Emergence of Color Vision in the Human Brain

Atsunobu Kotani, Yi-Ren Ng

#5248

Synthesizing Realistic fMRI: A Physiological Dynamics-Driven Hierarchical Diffusion Model for Efficient fMRI Acquisition

Yufan Hu, Jiang, Wuyang Li et al.

#5249

Improving Neural Network Accuracy by Concurrently Training with a Twin Network

Benjamin Vandersmissen, Lucas Deckers, Jose Oramas

#5250

DeepSPF: Spherical SO(3)-Equivariant Patches for Scan-to-CAD Estimation

Driton Salihu, Adam Misik, Yuankai Wu et al.

#5251

The Trickle-down Impact of Reward Inconsistency on RLHF

Lingfeng Shen, Sihao Chen et al.

ICLR 2025 arXiv:2410.01208

#5252

StringLLM: Understanding the String Processing Capability of Large Language Models

Xilong Wang, Hao Fu, Jindong Wang et al.

#5253

Probabilistic Adaptation of Black-Box Text-to-Video Models

Sherry Yang, Yilun Du, Bo Dai et al.

ICLR 2024 arXiv:2402.09237

#5254

Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency

Yannis Kalantidis, Mert Bulent SARIYILDIZ, Rafael Rezende et al.

#5255

DreamClean: Restoring Clean Image Using Deep Diffusion Prior

Jie Xiao, Ruili Feng, Han Zhang et al.

#5256

How to visualize training dynamics in neural networks

Michael Hu, Shreyans Jain, Sangam Chaulagain et al.

ICLR 2025 arXiv:2405.12001

#5257

Scrutinize What We Ignore: Reining In Task Representation Shift Of Context-Based Offline Meta Reinforcement Learning

Hai Zhang, Boyuan Zheng, Tianying Ji et al.

#5258

Learning and aligning single-neuron invariance manifolds in visual cortex

Mohammad Bashiri, Luca Baroni, Ján Antolík et al.

ICLR 2025 oral arXiv:2506.08306

#5259

AstroCompress: A benchmark dataset for multi-purpose compression of astronomical data

Tuan Truong, Rithwik Sudharsan, Yibo Yang et al.

#5260

True Knowledge Comes from Practice: Aligning Large Language Models with Embodied Environments via Reinforcement Learning

Weihao Tan, Wentao Zhang, Shanqi Liu et al.

ICLR 2025 arXiv:2502.09122

#5261

Improving Deep Regression with Tightness

Shihao Zhang, Yuguang Yan, Angela Yao

#5262

How much of my dataset did you use? Quantitative Data Usage Inference in Machine Learning

Yao Tong, Jiayuan Ye, Sajjad Zarifzadeh et al.

#5263

Private Mechanism Design via Quantile Estimation

Yuanyuan Yang, Tao Xiao, Bhuvesh Kumar et al.

#5264

Value-aligned Behavior Cloning for Offline Reinforcement Learning via Bi-level Optimization

Xingyu Jiang, Ning Gao, Xiuhui Zhang et al.

#5265

Watch Less, Do More: Implicit Skill Discovery for Video-Conditioned Policy

Wang, Zongqing Lu

#5266

RB-Modulation: Training-Free Stylization using Reference-Based Modulation

Litu Rout, Yujia Chen, Nataniel Ruiz et al.

#5267

Weaker MVI Condition: Extragradient Methods with Multi-Step Exploration

Yifeng Fan, Yongqiang Li, Bo Chen

#5268

ZeRO++: Extremely Efficient Collective Communication for Large Model Training

Guanhua Wang, Heyang Qin, Sam Jacobs et al.

#5269

Hybrid Directional Graph Neural Network for Molecules

Junyi An, Chao Qu, Zhipeng Zhou et al.

#5270

On the Hardness of Online Nonconvex Optimization with Single Oracle Feedback

Ziwei Guan, Yi Zhou, Yingbin Liang

ICLR 2025 arXiv:2410.15346

#5271

YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary

Hao-Tang Tsui, Chien-Yao Wang, Hong-Yuan Liao

#5272

SFS: Smarter Code Space Search improves LLM Inference Scaling

Jonathan Light, Yue Wu, Yiyou Sun et al.

#5273

Mastering Task Arithmetic: $\tau$Jp as a Key Indicator for Weight Disentanglement

Kotaro Yoshida, Yuji Naraki, Takafumi Horie et al.

#5274

Recovery of Causal Graph Involving Latent Variables via Homologous Surrogates

Xiuchuan Li, Jun Wang, Tongliang Liu

#5275

MMD Graph Kernel: Effective Metric Learning for Graphs via Maximum Mean Discrepancy

Yan Sun, Jicong Fan

#5276

An improved analysis of per-sample and per-update clipping in federated learning

Bo Li, Xiaowen Jiang, Mikkel N. Schmidt et al.

#5277

Class Probability Matching with Calibrated Networks for Label Shift Adaption

Hongwei Wen, Annika Betken, Hanyuan Hang

#5278

Towards Offline Opponent Modeling with In-context Learning

Yuheng Jing, Kai Li, Bingyun Liu et al.

#5279

Mechanistic Interpretability Meets Vision Language Models: Insights and Limitations

Yiming Liu, Yuhui Zhang, Serena Yeung

#5280

Learning Polynomial Problems with $SL(2, \mathbb{R})$-Equivariance

Hannah Lawrence, Mitchell Harris

#5281

$\texttt{NAISR}$: A 3D Neural Additive Model for Interpretable Shape Representation

Yining Jiao, Carlton ZDANSKI, Julia Kimbell et al.

#5282

InterpGNN: Understand and Improve Generalization Ability of Transdutive GNNs through the Lens of Interplay between Train and Test Nodes

Jiawei Sun, Kailai Li, Ruoxin Chen et al.

#5283

A Progressive Training Framework for Spiking Neural Networks with Learnable Multi-hierarchical Model

Zecheng Hao, Xinyu Shi, Zihan Huang et al.

#5284

Generative Learning for Financial Time Series with Irregular and Scale-Invariant Patterns

Hongbin Huang, Minghua Chen, Xiao Qiao

#5285

Chain-of-Thought Provably Enables Learning the (Otherwise) Unlearnable

Chenxiao Yang, Zhiyuan Li, David Wipf

#5286

Scaling Long Context Training Data by Long-Distance Referrals

Yonghao Zhuang, Lanxiang Hu, Longfei Yun et al.

#5287

Grammar Reinforcement Learning: path and cycle counting in graphs with a Context-Free Grammar and Transformer approach

Jason Piquenot, Maxime Berar, Romain Raveaux et al.

#5288

Bridging the Gap between Variational Inference and Stochastic Gradient MCMC in Function Space

Mengjing Wu, Junyu Xuan, Jie Lu

ICLR 2024 spotlight arXiv:2310.02246

#5289

Learning to Relax: Setting Solver Parameters Across a Sequence of Linear System Instances

Mikhail Khodak, Edmond Chow, Nina Balcan et al.

#5290

Würstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models

Pablo Pernías, Dominic Rampas, Mats L. Richter et al.

#5291

Medium-Difficulty Samples Constitute Smoothed Decision Boundary for Knowledge Distillation on Pruned Datasets

Yudong Chen, Xuwei Xu, Frank de Hoog et al.

#5292

Understanding Convergence and Generalization in Federated Learning through Feature Learning Theory

Wei Huang, Ye Shi, Zhongyi Cai et al.

#5293

Robust System Identification: Finite-sample Guarantees and Connection to Regularization

Hank Park, Grani A. Hanasusanto, Yingying Li

#5294

GDrag: Towards General-Purpose Interactive Editing with Anti-ambiguity Point Diffusion

Xiaojian Lin, Hanhui Li, Yuhao Cheng et al.

#5295

Gaussian-Based Instance-Adaptive Intensity Modeling for Point-Supervised Facial Expression Spotting

Yicheng Deng, Hideaki Hayashi, Hajime Nagahara

#5296

BP-Modified Local Loss for Efficient Training of Deep Neural Networks

REN Lianhai, Qianxiao Li

#5297

ST-GCond: Self-supervised and Transferable Graph Dataset Condensation

Beining Yang, Qingyun Sun, Cheng Ji et al.

#5298

Simulating Training Dynamics to Reconstruct Training Data from Deep Neural Networks

Hanling Tian, Yuhang Liu, Mingzhen He et al.

#5299

Sparse MoE with Language Guided Routing for Multilingual Machine Translation

Xinyu Zhao, Xuxi Chen, Yu Cheng et al.

ICLR 2024 arXiv:2307.07919

#5300

Neural Architecture Retrieval

Xiaohuan Pei, Yanxi Li, Minjing Dong et al.

#5301

Neural SDF Flow for 3D Reconstruction of Dynamic Scenes

wei mao, Richard Hartley, Mathieu Salzmann et al.

#5302

A new framework for evaluating model out-of-distribution generalisation for the biochemical domain

Raul Fernandez-Diaz, Hoang Thanh Lam, Vanessa Lopez et al.

ICLR 2025 oral

#5303

Numerical Accounting in the Shuffle Model of Differential Privacy

Antti Koskela, Antti Honkela, Mikko Heikkilä

#5304

Unsupervised Multiple Kernel Learning for Graphs via Ordinality Preservation

Yan Sun, Stanley Kok

#5305

ADOPD: A Large-Scale Document Page Decomposition Dataset

Jiuxiang Gu, Xiangxi Shi, Jason Kuen et al.

#5306

Score-based generative models break the curse of dimensionality in learning a family of sub-Gaussian distributions

Frank Cole, Yulong Lu

#5307

Avoid Overclaims: Summary of Complexity Bounds for Algorithms in Minimization and Minimax Optimization

Siqi Zhang, Yifan Hu

#5308

To the Cutoff... and Beyond? A Longitudinal Perspective on LLM Data Contamination

Manley Roberts, Himanshu Thakur, Christine Herlihy et al.

#5309

XAIguiFormer: explainable artificial intelligence guided transformer for brain disorder identification

Hanning Guo, Farah Abdellatif, Yu Fu et al.

#5310

Minimalistic Predictions for Online Class Constraint Scheduling

Dorian Guyot, Alexandra Lassota

#5311

Mutual Effort for Efficiency: A Similarity-based Token Pruning for Vision Transformers in Self-Supervised Learning

Sheng Li, Qitao Tan, Yue Dai et al.

#5312

Resolution Attack: Exploiting Image Compression to Deceive Deep Neural Networks

Wangjia Yu, Xiaomeng Fu, Qiao Li et al.

#5313

Multi-Resolution Diffusion Models for Time Series Forecasting

Lifeng Shen, Weiyu Chen, James Kwok

ICLR 2024 arXiv:2310.13550

#5314

Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes

Ruiquan Huang, Yuan Cheng, Jing Yang et al.

#5315

Straightness of Rectified Flow: A Theoretical Insight into Wasserstein Convergence

Saptarshi Roy, Vansh Bansal, Purnamrita Sarkar et al.

ICLR 2025 arXiv:2502.21110

#5316

Rare event modeling with self-regularized normalizing flows: what can we learn from a single failure?

Charles Dawson, Van Tran, Max Li et al.

#5317

Multi-Label Node Classification with Label Influence Propagation

Yifei Sun, Zemin Liu, Bryan Hooi et al.

#5318

Language Control Diffusion: Efficiently Scaling through Space, Time, and Tasks

David Bell, Yujie Lu, Shinda Huang et al.

ICLR 2024 arXiv:2504.02142

#5319

Like Oil and Water: Group Robustness Methods and Poisoning Defenses May Be at Odds

Michael-Andrei Panaitescu-Liess, Yigitcan Kaya, Sicheng Zhu et al.

#5320

Collaborative Discrete-Continuous Black-Box Prompt Learning for Language Models

Hualin Zhang, Haozhen Zhang, Zhekai Liu et al.

#5321

RESuM: A Rare Event Surrogate Model for Physics Detector Design

Ann-Kathrin Schuetz, Alan Poon, Aobo Li

#5322

SPD Attack - Prevention of AI Powered Image Editing by Image Immunization

Parth Badgujar, Shorya Singhal, Devansh Bhardwaj

#5323

Vision and Language Synergy for Rehearsal Free Continual Learning

Muhammad Anwar Masum, Mahardhika Pratama, Savitha Ramasamy et al.

ICLR 2025 arXiv:2407.14618

#5324

SOREL: A Stochastic Algorithm for Spectral Risks Minimization

Yuze Ge, Rujun Jiang

#5325

Federated Few-Shot Class-Incremental Learning

Muhammad Anwar Masum, Mahardhika Pratama, Lin Liu et al.

#5326

FairDen: Fair Density-Based Clustering

Lena Krieger, Anna Beer, Pernille Matthews et al.

#5327

Building Blocks of Differentially Private Training

Mahmoud Hegazy, Aymeric Dieuleveut

#5328

Boundary Denoising for Video Activity Localization

Mengmeng Xu, Mattia Soldan, Jialin Gao et al.

#5329

SIMPL: Scalable and hassle-free optimisation of neural representations from behaviour

Tom George, Pierre Glaser, Kimberly Stachenfeld et al.

#5330

Generative Learning for Solving Non-Convex Problem with Multi-Valued Input-Solution Mapping

Enming Liang, Minghua Chen

#5331

Deep Networks Learn Features From Local Discontinuities in the Label Function

Prithaj Banerjee, Harish G Ramaswamy, Mahesh Yadav et al.

#5332

On Trajectory Augmentations for Off-Policy Evaluation

Ge Gao, Qitong Gao, Xi Yang et al.

#5333

Learn hybrid prototypes for multivariate time series anomaly detection

Ke-Yuan Shen

#5334

Graph-Constrained Diffusion for End-to-End Path Planning

Dingyuan Shi, Yongxin Tong, Zimu Zhou et al.

#5335

Combinatorial Bandits for Maximum Value Reward Function under Value-Index Feedback

Yiliu Wang, Wei Chen, Milan Vojnovic

#5336

From Complexity to Clarity: Analytical Expressions of Deep Neural Network Weights via Clifford Algebra and Convexity

Mert Pilanci

#5337

On the Fourier analysis in the SO(3) space: the EquiLoPO Network

Dmitrii Zhemchuzhnikov, Sergei Grudinin

ICLR 2024 arXiv:2406.00198

#5338

ImplicitSLIM and How it Improves Embedding-based Collaborative Filtering

Ilya Shenbin, Sergey Nikolenko

#5339

PixArt-$\alpha$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Junsong Chen, Jincheng YU, Chongjian GE et al.

#5340

Balancing Bias in Two-sided Markets for Fair Stable Matchings

Siyuan Wu, Leong Hou U, Panagiotis Karras

#5341

Convergence of Bayesian Bilevel Optimization

Shi Fu, Fengxiang He, Xinmei Tian et al.

#5342

Towards more rigorous evaluations of language models

Desi R Ivanova, Ilija Ilievski, Momchil Konstantinov

#5343

Consistency Training with Learnable Data Augmentation for Graph Anomaly Detection with Limited Supervision

Nan Chen, Zemin Liu, Bryan Hooi et al.

#5344

Rotation Has Two Sides: Evaluating Data Augmentation for Deep One-class Classification

Guodong Wang, Yunhong Wang, Xiuguo Bao et al.

#5345

Digi-Q: Learning VLM Q-Value Functions for Training Device-Control Agents

Hao Bai, Yifei Zhou, Li Li et al.

ICLR 2025 oral

#5346

Lion Secretly Solves a Constrained Optimization: As Lyapunov Predicts

Lizhang Chen, Bo Liu, Kaizhao Liang et al.

#5347

Aligned LLMs Are Not Aligned Browser Agents

Priyanshu Kumar, Elaine Lau, Saranya Vijayakumar et al.

#5348

On the Effect of Batch Size in Byzantine-Robust Distributed Learning

Yi-Rui Yang, Chang-Wei Shi, Wu-Jun Li

#5349

Three-in-One: Fast and Accurate Transducer for Hybrid-Autoregressive ASR

Hainan Xu, Travis Bartley, Vladimir Bataev et al.

#5350

Node2ket: Efficient High-Dimensional Network Embedding in Quantum Hilbert Space

Hao Xiong, Yehui Tang, Yunlin He et al.

#5351

Towards LLM4QPE: Unsupervised Pretraining of Quantum Property Estimation and A Benchmark

Yehui Tang, Hao Xiong, Nianzu Yang et al.

#5352

Sensitivity Verification for Additive Decision Tree Ensembles

Arhaan Ahmad, Tanay Tayal, Ashutosh Gupta et al.

#5353

Diffusion Models and Gaussian Flow Matching: Two Sides of the Same Coin

Ruiqi Gao, Emiel Hoogeboom, Jonathan Heek et al.

#5354

PAE: Reinforcement Learning from External Knowledge for Efficient Exploration

Zhe Wu, Haofei Lu, Junliang Xing et al.

#5355

LOIRE: LifelOng learning on Incremental data via pre-trained language model gRowth Efficiently

Xue Han, Yitong Wang, Junlan Feng et al.

#5356

Continual Slow-and-Fast Adaptation of Latent Neural Dynamics (CoSFan): Meta-Learning What-How & When to Adapt

Ryan Missel, Linwei Wang

ICLR 2024 arXiv:2512.00351

#5357

Provable Memory Efficient Self-Play Algorithm for Model-free Reinforcement Learning

Na Li, Yuchen Jiao, Hangguan Shan et al.

#5358

Sketching for Convex and Nonconvex Regularized Least Squares with Sharp Guarantees

Yingzhen Yang, Ping Li

ICLR 2025 arXiv:2311.01806

#5359

Problem-Parameter-Free Federated Learning

Wenjing Yan, Kai Zhang, Xiaolu Wang et al.

#5360

From Decoupling to Adaptive Transformation: a Wider Optimization Space for PTQ

Zhaojing Wen, Qiulin Zhang, Yuan Zhang et al.

#5361

Exploiting Hidden Symmetry to Improve Objective Perturbation for DP Linear Learners with a Nonsmooth L1-Norm

Du Chen, Geoffrey A. Chua

ICLR 2025 arXiv:2601.01465

#5362

Leveraging Flatness to Improve Information-Theoretic Generalization Bounds for SGD

Ze Peng, Jian Zhang, Yisen Wang et al.

#5363

Reveal Object in Lensless Photography via Region Gaze and Amplification

Xiangjun Yin, Huihui Yue

#5364

Fast Imitation via Behavior Foundation Models

Matteo Pirotta, Andrea Tirinzoni, Ahmed Touati et al.

ICLR 2025 arXiv:2501.10202

#5365

Provably Safeguarding a Classifier from OOD and Adversarial Samples

Nicolas Atienza, Johanne Cohen, Christophe Labreuche et al.

#5366

Federated Text-driven Prompt Generation for Vision-Language Models

Chen Qiu, Xingyu Li, Chaithanya Kumar Mummadi et al.

#5367

Ins-DetCLIP: Aligning Detection Model to Follow Human-Language Instruction

Renjie Pi, Lewei Yao, Jianhua Han et al.

#5368

GANDALF: Generative AttentioN based Data Augmentation and predictive modeLing Framework for personalized cancer treatment

Aishwarya Jayagopal, Yanrong Zhang, Robert Walsh et al.

ICLR 2025 arXiv:2502.20957

#5369

Reward Dimension Reduction for Scalable Multi-Objective Reinforcement Learning

Giseung Park, Youngchul Sung

#5370

Finding and Only Finding Differential Nash Equilibria by Both Pretending to be a Follower

Guodong Zhang, Xuchan Bao

ICLR 2024 arXiv:2402.09164

#5371

Less is More: Fewer Interpretable Region via Submodular Subset Selection

Ruoyu Chen, Hua Zhang, Siyuan Liang et al.

#5372

On the Inherent Privacy Properties of Discrete Denoising Diffusion Models

Eli Chien, Pan Li, Vamsi Potluru et al.

#5373

Generalized Policy Iteration using Tensor Approximation for Hybrid Control

Suhan Shetty, Teng Xue, Sylvain Calinon

#5374

ACTIVE: Offline Reinforcement Learning via Adaptive Imitation and In-sample $V$-Ensemble

Tianyuan Chen, Ronglong Cai, Faguo Wu et al.

#5375

On LLM Knowledge Distillation - A Comparison between Forward KL and Reverse KL

Yihan Cao, Yanbin Kang

#5376

Diffusion Models for Multi-Task Generative Modeling

Changyou Chen, Han Ding, Bunyamin Sisman et al.

#5377

Score-based free-form architectures for high-dimensional Fokker-Planck equations

Feng Liu, Faguo Wu, Xiao Zhang

#5378

Causal Modelling Agents: Causal Graph Discovery through Synergising Metadata- and Data-driven Reasoning

Ahmed Abdulaal, Adamos Hadjivasiliou, Nina Montaña-Brown et al.

ICLR 2025 arXiv:2502.06919

#5379

Select before Act: Spatially Decoupled Action Repetition for Continuous Control

Buqing Nie, Yangqing Fu, Yue Gao

#5380

Fugatto 1: Foundational Generative Audio Transformer Opus 1

Rafael Valle, Rohan Badlani, Zhifeng Kong et al.

#5381

Efficient Interpolation between Extragradient and Proximal Methods for Weak MVIs

Thomas Pethick, Ioannis Mavrothalassitis, Volkan Cevher

#5382

Latent 3D Graph Diffusion

Yuning You, Ruida Zhou, Jiwoong Park et al.

#5383

Does Progress On Object Recognition Benchmarks Improve Generalization on Crowdsourced, Global Data?

Megan Richards, Polina Kirichenko, Diane Bouchacourt et al.

#5384

Scalable Monotonic Neural Networks

Hyunho Kim, Jong-Seok Lee

#5385

Understanding Methods for Scalable MCTS

Will Knipe

#5386

Combining Axes Preconditioners through Kronecker Approximation for Deep Learning

Venkata Sai Surya Subramanyam Duvvuri, Fnu Devvrit, Rohan Anil et al.

#5387

RetroInText: A Multimodal Large Language Model Enhanced Framework for Retrosynthetic Planning via In-Context Representation Learning

Chenglong Kang, Xiaoyi Liu, Fei Guo

#5388

FedTrans: Client-Transparent Utility Estimation for Robust Federated Learning

Mingkun Yang, Ran Zhu, Qing Wang et al.

#5389

The mechanistic basis of data dependence and abrupt learning in an in-context classification task

Gautam Reddy Nallamala

#5390

NuwaDynamics: Discovering and Updating in Causal Spatio-Temporal Modeling

Kun Wang, Hao Wu, Yifan Duan et al.

#5391

Rethinking Information-theoretic Generalization: Loss Entropy Induced PAC Bounds

Yuxin Dong, Tieliang Gong, Hong Chen et al.

ICLR 2024 arXiv:2202.09914

#5392

SOInter: A Novel Deep Energy-Based Interpretation Method for Explaining Structured Output Models

S. Fatemeh Seyyedsalehi, Mahdieh Baghshah, Hamid Rabiee

#5393

Mind the Gap: Examining the Self-Improvement Capabilities of Large Language Models

Yuda Song, Hanlin Zhang, Carson Eisenach et al.

ICLR 2025 arXiv:2412.02674

#5394

Connectome Mapping: Shape-Memory Network via Interpretation of Contextual Semantic Information

Kyungsu Lee, Haeyun Lee, Jae Youn Hwang

#5395

GAFormer: Enhancing Timeseries Transformers Through Group-Aware Embeddings

Jingyun Xiao, Ran Liu, Eva Dyer

#5396

Fat-to-Thin Policy Optimization: Offline Reinforcement Learning with Sparse Policies

Lingwei Zhu, Han Wang, Yukie Nagai

#5397

SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models

Xin Zhang, Dong Zhang, Shimin Li et al.

#5398

Prompt Learning with Quaternion Networks

Boya Shi, Zhengqin Xu, Shuai Jia et al.

#5399

Lost in Prediction: Why Social Media Narratives Don't Help Macroeconomic Forecasting?

Almog Gueta, Roi Reichart, Amir Feder et al.

#5400

Global Identifiability of Overcomplete Dictionary Learning via L1 and Volume Minimization

Yuchen Sun, Kejun Huang