Most Cited 2025 &quot;molecular large language models&quot; Papers

ICML 2025arXiv:2505.04741

#7202

When Bad Data Leads to Good Models

Kenneth Li, Yida Chen, Fernanda Viégas et al.

ICML 2025arXiv:2502.01342

#7203

Activation by Interval-wise Dropout: A Simple Way to Prevent Neural Networks from Plasticity Loss

Sangyeon Park, Isaac Han, Seungwon Oh et al.

ICLR 2025arXiv:2410.11826

#7204

Bayesian Experimental Design Via Contrastive Diffusions

Jacopo Iollo, Christophe Heinkelé, Pierre Alliez et al.

ICML 2025spotlightarXiv:2504.02854

#7205

Efficient First-Order Optimization on the Pareto Set for Multi-Objective Learning under Preference Guidance

Lisha Chen, Quan Xiao, Ellen Fukuda et al.

ICLR 2025arXiv:2505.17126

#7206

Conformal Language Model Reasoning with Coherent Factuality

Maxon Rubin-Toles, Maya Gambhir, Keshav Ramji et al.

ICLR 2025arXiv:2407.16615

#7207

Lawma: The Power of Specialization for Legal Annotation

Ricardo Dominguez-Olmedo, Vedant Nanda, Rediet Abebe et al.

AAAI 2025paperarXiv:2412.11253

#7208

Are Expressive Models Truly Necessary for Offline RL?

Guan Wang, Haoyi Niu, Jianxiong Li et al.

ICLR 2025arXiv:2411.01992

#7209

Ask, and it shall be given: On the Turing completeness of prompting

Ruizhong Qiu, Zhe Xu, Wenxuan Bao et al.

ICLR 2025arXiv:2504.13292

#7210

Let Me Grok for You: Accelerating Grokking via Embedding Transfer from a Weaker Model

Zhiwei Xu, Zhiyu Ni, Yixin Wang et al.

NEURIPS 2025arXiv:2505.20219

#7211

New Perspectives on the Polyak Stepsize: Surrogate Functions and Negative Results

Francesco Orabona, Ryan D'Orazio

ICML 2025arXiv:2411.18612

#7212

Robust Offline Reinforcement Learning with Linearly Structured $f$-Divergence Regularization

Cheng Tang, Zhishuai Liu, Pan Xu

ICLR 2025arXiv:2502.15315

#7213

Tight Clusters Make Specialized Experts

Stefan Nielsen, Rachel Teo, Laziz Abdullaev et al.

ICML 2025spotlightarXiv:2502.18147

#7214

Jacobian Sparse Autoencoders: Sparsify Computations, Not Just Activations

Lucy Farnik, Tim Lawson, Conor Houghton et al.

ICLR 2025arXiv:2406.10354

#7215

SigDiffusions: Score-Based Diffusion Models for Time Series via Log-Signature Embeddings

Barbora Barancikova, Zhuoyue Huang, Cristopher Salvi

ICML 2025arXiv:2410.22316

#7216

Understanding Synthetic Context Extension via Retrieval Heads

Xinyu Zhao, Fangcong Yin, Greg Durrett

ICML 2025arXiv:2501.01144

#7217

BlockDialect: Block-wise Fine-grained Mixed Format Quantization for Energy-Efficient LLM Inference

Wonsuk Jang, Thierry Tambe

#7218

Dynamic Graph Learning with Static Relations for Credit Risk Assessment

Qi Yuan, Yang Liu, Yateng Tang et al.

ICLR 2025arXiv:2404.02241

#7219

Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better

Enshu Liu, Junyi Zhu, Zinan Lin et al.

AAAI 2025paperarXiv:2408.07397

#7220

Bridging Training and Execution via Dynamic Directed Graph-Based Communication in Cooperative Multi-Agent Systems

Zhuohui Zhang, Bin He, Bin Cheng et al.

NEURIPS 2025arXiv:2507.04103

#7221

How to Train Your LLM Web Agent: A Statistical Diagnosis

Dheeraj Vattikonda, Santhoshi Ravichandran, Emiliano Penaloza et al.

#7222

FedTMOS: Efficient One-Shot Federated Learning with Tsetlin Machine

Shannon How, Jagmohan Chauhan, Geoff Merrett et al.

ICML 2025arXiv:2502.14760

#7223

EquivaMap: Leveraging LLMs for Automatic Equivalence Checking of Optimization Formulations

Haotian Zhai, Connor Lawless, Ellen Vitercik et al.

ICLR 2025arXiv:2502.18821

#7224

CAMEx: Curvature-aware Merging of Experts

Dung Viet Nguyen, Minh Nguyen, Luc Nguyen et al.

AAAI 2025paperarXiv:2503.06974

#7225

Asymmetric Visual Semantic Embedding Framework for Efficient Vision-Language Alignment

Yang Liu, Mengyuan Liu, Shudong Huang et al.

ICML 2025oralarXiv:2505.00612

#7226

Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation

D. Sculley, William Cukierski, Phil Culliton et al.

AAAI 2025paperarXiv:2412.16969

#7227

Multifaceted User Modeling in Recommendation: A Federated Foundation Models Approach

Chunxu Zhang, Guodong Long, Hongkuan Guo et al.

#7228

WaterDiffusion: Learning a Prior-involved Unrolling Diffusion for Joint Underwater Saliency Detection and Visual Restoration

Laibin Chang, Yunke Wang, Longxiang Deng et al.

ICML 2025arXiv:2506.04870

#7229

Aligning Multimodal Representations through an Information Bottleneck

Antonio Almudévar, Jose Miguel Hernandez-Lobato, Sameer Khurana et al.

ICLR 2025arXiv:2410.01322

#7230

Forte : Finding Outliers with Representation Typicality Estimation

Debargha Ganguly, Warren Morningstar, Andrew Yu et al.

ICLR 2025arXiv:2503.18871

#7231

Bootstrapped Model Predictive Control

Yuhang Wang, Hanwei Guo, Sizhe Wang et al.

AAAI 2025paperarXiv:2412.13734

#7232

Text2Relight: Creative Portrait Relighting with Text Guidance

Junuk Cha, Mengwei Ren, Krishna Kumar Singh et al.

AAAI 2025paperarXiv:2412.15526

#7233

SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation

Ke Yan, Qing Cai, Fan Zhang et al.

AAAI 2025paperarXiv:2408.11297

#7234

Making Large Vision Language Models to Be Good Few-Shot Learners

Fan Liu, Wenwen Cai, Jian Huo et al.

AAAI 2025paperarXiv:2412.11807

#7235

PhysAug: A Physical-guided and Frequency-based Data Augmentation for Single-Domain Generalized Object Detection

Xiaoran Xu, Jiangang Yang, Wenhui Shi et al.

ICLR 2025arXiv:2412.14421

#7236

Comparing noisy neural population dynamics using optimal transport distances

Amin Nejatbakhsh, Victor Geadah, Alex Williams et al.

NEURIPS 2025arXiv:2508.05954

#7237

Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents

Han Lin, Jaemin Cho, Amir Zadeh et al.

AAAI 2025paperarXiv:2409.11283

#7238

Zero-resource Hallucination Detection for Text Generation via Graph-based Contextual Knowledge Triples Modeling

Xinyue Fang, Zhen Huang, Zhiliang Tian et al.

ICLR 2025arXiv:2408.08558

#7239

Linear combinations of latents in generative models: subspaces and beyond

Erik Bodin, Alexandru Stere, Dragos Margineantu et al.

ICML 2025arXiv:2110.06257

#7240

Causal Discovery from Conditionally Stationary Time Series

Carles Balsells-Rodas, Xavier Sumba, Tanmayee Narendra et al.

ICLR 2025arXiv:2408.07249

#7241

Bridging Information Asymmetry in Text-video Retrieval: A Data-centric Approach

Zechen Bai, Tianjun Xiao, Tong He et al.

AAAI 2025paperarXiv:2503.03135

#7242

Bridging Molecular Graphs and Large Language Models

Runze Wang, Mingqi Yang, Yanming Shen

ICML 2025arXiv:2411.00171

#7243

EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization

Mujin Cheon, Jay Lee, Dong-Yeun Koh et al.

ICLR 2025arXiv:2412.02856

#7244

Is Large-scale Pretraining the Secret to Good Domain Generalization?

Piotr Teterwak, Kuniaki Saito, Theodoros Tsiligkaridis et al.

AAAI 2025paperarXiv:2405.16579

#7245

Automatically Generating Numerous Context-Driven SFT Data for LLMs Across Diverse Granularity

Shanghaoran Quan

ICML 2025arXiv:2507.17135

#7246

SADA: Stability-guided Adaptive Diffusion Acceleration

Ting Jiang, Yixiao Wang, Hancheng Ye et al.

ICLR 2025arXiv:2409.10362

#7247

Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning

Amin Karimi Monsefi, Mengxi Zhou, Nastaran Monsefi et al.

ICML 2025oralarXiv:2411.07061

#7248

General framework for online-to-nonconvex conversion: Schedule-free SGD is also effective for nonconvex optimization

Kwangjun Ahn, Gagik Magakyan, Ashok Cutkosky

AAAI 2025paperarXiv:2501.00910

#7249

Population Aware Diffusion for Time Series Generation

Yang Li, Han Meng, Zhenyu Bi et al.

ICLR 2025arXiv:2502.08958

#7250

Biologically Plausible Brain Graph Transformer

Ciyuan Peng, Yuelong Huang, Qichao Dong et al.

ICLR 2025arXiv:2406.14022

#7251

Investigating the Pre-Training Dynamics of In-Context Learning: Task Recognition vs. Task Learning

Xiaolei Wang, Xinyu Tang, Junyi Li et al.

ICML 2025arXiv:2502.01330

#7252

Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity

Alessandro Pierro, Steven Abreu, Jonathan Timcheck et al.

AAAI 2025paperarXiv:2404.17288

#7253

ExcluIR: Exclusionary Neural Information Retrieval

Wenhao Zhang, Mengqi Zhang, Shiguang Wu et al.

NEURIPS 2025arXiv:2506.17368

#7254

SAFEx: Analyzing Vulnerabilities of MoE-Based LLMs via Stable Safety-critical Expert Identification

Zhenglin Lai, Mengyao Liao, Bingzhe Wu et al.

ICLR 2025arXiv:2402.01943

#7255

Precedence-Constrained Winter Value for Effective Graph Data Valuation

Hongliang Chi, Wei Jin, Charu Aggarwal et al.

ICML 2025arXiv:2502.04807

#7256

Robust Conformal Outlier Detection under Contaminated Reference Data

Meshi Bashari, Matteo Sesia, Yaniv Romano

ICLR 2025arXiv:2410.02309

#7257

Decoupling Layout from Glyph in Online Chinese Handwriting Generation

Minsi Ren, Yan-Ming Zhang, yi chen

NEURIPS 2025oralarXiv:2410.15392

#7258

EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting

Bohao Liao, Wei Zhai, Zengyu Wan et al.

NEURIPS 2025arXiv:2506.05551

#7259

When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding

Yan Shu, Hangui Lin, Yexin Liu et al.

AAAI 2025paperarXiv:2409.18073

#7260

Infer Human’s Intentions Before Following Natural Language Instructions

Yanming Wan, Yue Wu, Yiping Wang et al.

ICML 2025arXiv:2412.11044

#7261

Understanding and Mitigating Memorization in Diffusion Models for Tabular Data

Zhengyu Fang, Zhimeng Jiang, Huiyuan Chen et al.

AAAI 2025paperarXiv:2412.15499

#7262

A Robust Prototype-Based Network with Interpretable RBF Classifier Foundations

Sascha Saralajew, Ashish Rana, Thomas Villmann et al.

ICML 2025arXiv:2411.09858

#7263

One Leaf Reveals the Season: Occlusion-Based Contrastive Learning with Semantic-Aware Views for Efficient Visual Representation

Xiaoyu Yang, Lijian Xu, Hongsheng Li et al.

ICML 2025arXiv:2410.08067

#7264

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

Shenao Zhang, Zhihan Liu, Boyi Liu et al.

AAAI 2025paperarXiv:2501.09428

#7265

AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring

Xinyi Wang, Na Zhao, Zhiyuan Han et al.

#7266

MLC-NC: Long-Tailed Multi-Label Image Classification Through the Lens of Neural Collapse

Zijian Tao, Shao-Yuan Li, Wenhai Wan et al.

ICLR 2025oralarXiv:2411.19455

#7267

Autocorrelation Matters: Understanding the Role of Initialization Schemes for State Space Models

Fusheng Liu, Qianxiao Li

ICLR 2025arXiv:2410.12592

#7268

Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion

Minkyoung Cho, Yulong Cao, Jiachen Sun et al.

ICML 2025arXiv:2502.16075

#7269

Implicit Bias of Gradient Descent for Non-Homogeneous Deep Networks

Yuhang Cai, Kangjie Zhou, Jingfeng Wu et al.

ICLR 2025arXiv:2410.02275

#7270

Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization

Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi et al.

AAAI 2025paperarXiv:2502.08974

#7271

Topo2Seq: Enhanced Topology Reasoning via Topology Sequence Learning

Yiming Yang, Yueru Luo, Bingkun He et al.

#7272

DUO: Diverse, Uncertain, On-Policy Query Generation and Selection for Reinforcement Learning from Human Feedback

Xuening Feng, Zhaohui Jiang, Timo Kaufmann et al.

AAAI 2025paperarXiv:2412.15655

#7273

MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula

Sieun Hyeon, Kyudan Jung, Jaehee Won et al.

ICML 2025arXiv:2503.04429

#7274

Activation Space Interventions Can Be Transferred Between Large Language Models

Narmeen Oozeer, Dhruv Nathawani, Nirmalendu Prakash et al.

ICLR 2025arXiv:2412.12540

#7275

Stiefel Flow Matching for Moment-Constrained Structure Elucidation

Austin H Cheng, Alston Lo, Kin Long Kelvin Lee et al.

AAAI 2025paperarXiv:2407.03757

#7276

DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts

Zheng-Peng Duan, Jiawei Zhang, Zheng Lin et al.

ICLR 2025arXiv:2402.09099

#7277

Neuron-based Multifractal Analysis of Neuron Interaction Dynamics in Large Models

Xiongye Xiao, Heng Ping, Chenyu Zhou et al.

ICLR 2025oralarXiv:2308.01170

#7278

Revisiting a Design Choice in Gradient Temporal Difference Learning

Xiaochi Qian, Shangtong Zhang

ICLR 2025arXiv:2412.14355

#7279

Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference

Matt Riemer, Gopeshh Raaj Subbaraj, Glen Berseth et al.

AAAI 2025paperarXiv:2412.01857

#7280

Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language Navigation

Yiyuan Pan, Yunzhe Xu, Zhe Liu et al.

AAAI 2025paperarXiv:2503.18042

#7281

DualCP: Rehearsal-Free Domain-Incremental Learning via Dual-Level Concept Prototype

Qiang Wang, Yuhang He, Songlin Dong et al.

ICLR 2025arXiv:2503.11005

#7282

Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection

Chuhan ZHANG, Chaoyang Zhu, Pingcheng Dong et al.

ICML 2025oralarXiv:2501.19328

#7283

Capturing Temporal Dynamics in Large-Scale Canopy Tree Height Estimation

Jan Pauls, Max Zimmer, Berkant Turan et al.

ICML 2025arXiv:2410.04959

#7284

Collapse-Proof Non-Contrastive Self-Supervised Learning

EMANUELE SANSONE, Tim Lebailly, Tinne Tuytelaars

AAAI 2025paperarXiv:2501.05906

#7285

Q-MAML: Quantum Model-Agnostic Meta-Learning for Variational Quantum Algorithms

Junyong Lee, Jeihee Cho, Shiho Kim

ICML 2025arXiv:2502.07203

#7286

Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion

Xingpei Ma, Jiaran Cai, Yuansheng Guan et al.

NEURIPS 2025arXiv:2505.07233

#7287

DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation

Jiashuo Sun, Xianrui Zhong, Sizhe Zhou et al.

ICML 2025arXiv:2505.18545

#7288

B-score: Detecting biases in large language models using response history

An Vo, Mohammad Reza Taesiri, Daeyoung Kim et al.

ICML 2025arXiv:2507.05502

#7289

Predicting mutational effects on protein binding from folding energy

Arthur Deng, Karsten Householder, Fang Wu et al.

ICML 2025arXiv:2411.02083

#7290

Regress, Don't Guess: A Regression-like Loss on Number Tokens for Language Models

Jonas Zausinger, Lars Pennig, Anamarija Kozina et al.

AAAI 2025paperarXiv:2408.10605

#7291

Muses: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration

Yanbo Ding, Shaobin Zhuang, Kunchang Li et al.

#7292

PoseLLaVA: Pose Centric Multimodal LLM for Fine-Grained 3D Pose Manipulation

Dong Feng, Ping Guo, Encheng Peng et al.

ICML 2025oralarXiv:2406.19593

#7293

SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs

Xin Su, Man Luo, Kris Pan et al.

ICML 2025arXiv:2502.20727

#7294

SPD: Sync-Point Drop for Efficient Tensor Parallelism of Large Language Models

Han-Byul Kim, Duc Hoang, Arnav Kundu et al.

AAAI 2025paperarXiv:2408.10613

#7295

Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval

Guangyuan Ma, Yongliang Ma, Xing Wu et al.

ICML 2025arXiv:2502.18699

#7296

MPO: An Efficient Post-Processing Framework for Mixing Diverse Preference Alignment

Tianze Wang, Dongnan Gui, Yifan Hu et al.

ICLR 2025arXiv:2410.13413

#7297

Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models

Chengyu Du, Jinyi Han, Yizhou Ying et al.

ICLR 2025arXiv:2409.20124

#7298

Conditional Diffusion Models are Minimax-Optimal and Manifold-Adaptive for Conditional Distribution Estimation

Rong Tang, Lizhen Lin, Yun Yang

AAAI 2025paperarXiv:2501.01196

#7299

Sparis: Neural Implicit Surface Reconstruction of Indoor Scenes from Sparse Views

Yulun Wu, Han Huang, Wenyuan Zhang et al.

AAAI 2025paperarXiv:2406.04612

#7300

Faithful and Accurate Self-Attention Attribution for Message Passing Neural Networks via the Computation Tree Viewpoint

Yong-Min Shin, Siqing Li, Xin Cao et al.

ICLR 2025oralarXiv:2503.23478

#7301

Handling Delay in Real-Time Reinforcement Learning

Ivan Anokhin, Rishav Rishav, Matt Riemer et al.

#7302

Scaling Sparse Feature Circuits For Studying In-Context Learning

Dmitrii Kharlapenko, Stepan Shabalin, Arthur Conmy et al.

ICML 2025

ICML 2025arXiv:2502.06994

#7303

SyncMind: Measuring Agent Out-of-Sync Recovery in Collaborative Software Engineering

Xuehang Guo, Xingyao Wang, Yangyi Chen et al.

ICLR 2025arXiv:2504.09913

#7304

Optimal Non-Asymptotic Rates of Value Iteration for Average-Reward Markov Decision Processes

Jongmin Lee, Ernest Ryu

NEURIPS 2025arXiv:2506.05454

#7305

Zeroth-Order Optimization Finds Flat Minima

Liang Zhang, Bingcong Li, Kiran Thekumparampil et al.

ICLR 2025arXiv:2505.16115

#7306

A Generic Framework for Conformal Fairness

Aditya Vadlamani, Anutam Srinivasan, Pranav Maneriker et al.

ICML 2025arXiv:2411.14003

#7307

Generative Intervention Models for Causal Perturbation Modeling

Nora Schneider, Lars Lorch, Niki Kilbertus et al.

ICML 2025arXiv:2505.11131

#7308

One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework

Feiran Li, Qianqian Xu, Shilong Bao et al.

NEURIPS 2025spotlightarXiv:2409.03817

#7309

Neural Entropy

Akhil Premkumar

ICML 2025arXiv:2503.15704

#7310

Tuning Sequential Monte Carlo Samplers via Greedy Incremental Divergence Minimization

Kyurae Kim, Zuheng Xu, Jacob Gardner et al.

NEURIPS 2025arXiv:2506.15707

#7311

Every Rollout Counts: Optimal Resource Allocation for Efficient Test-Time Scaling

Xinglin Wang, Yiwei Li, Shaoxiong Feng et al.

AAAI 2025paperarXiv:2412.08879

#7312

Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark

Yongliang Wu, Wenbo Zhu, Jiawang Cao et al.

ICML 2025arXiv:2506.02557

#7313

Kernel-based Unsupervised Embedding Alignment for Enhanced Visual Representation in Vision-language Models

Shizhan Gong, Yankai Jiang, DOU QI et al.

ICML 2025arXiv:2505.05922

#7314

Cape: Context-Aware Prompt Perturbation Mechanism with Differential Privacy

Haoqi Wu, Wei Dai, Wang Li et al.

ICLR 2025arXiv:2501.13904

#7315

Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models

Linh Tran, Wei Sun, Stacy Patterson et al.

ICLR 2025arXiv:2410.07610

#7316

CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features

Po-han Li, Sandeep Chinchali, ufuk topcu

AAAI 2025paperarXiv:2504.09608

#7317

ERL-MPP: Evolutionary Reinforcement Learning with Multi-head Puzzle Perception for Solving Large-scale Jigsaw Puzzles of Eroded Gaps

Xingke Song, Xiaoying Yang, Chenglin Yao et al.

ICML 2025arXiv:2409.17275

#7318

On the Vulnerability of Applying Retrieval-Augmented Generation within Knowledge-Intensive Application Domains

Xun Xian, Ganghua Wang, Xuan Bi et al.

ICML 2025arXiv:2410.22944

#7319

Focus On This, Not That! Steering LLMs with Adaptive Feature Specification

Tom A. Lamb, Adam Davies, Alasdair J Paren et al.

ICML 2025arXiv:2502.21075

#7320

Spatial Reasoning with Denoising Models

Christopher Wewer, Bartlomiej Pogodzinski, Bernt Schiele et al.

ICML 2025spotlightarXiv:2507.08285

#7321

FlowDrag: 3D-aware Drag-based Image Editing with Mesh-guided Deformation Vector Flow Fields

Gwanhyeong Koo, Sunjae Yoon, Younghwan Lee et al.

ICLR 2025arXiv:2412.06071

#7322

KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models

Fan Wang, Juyong Jiang, Chansung Park et al.

AAAI 2025paperarXiv:2410.19796

#7323

Feature Clipping for Uncertainty Calibration

Linwei Tao, Minjing Dong, Chang Xu

ICML 2025arXiv:2308.01358

#7324

Compressed and distributed least-squares regression: convergence rates with applications to federated learning

Constantin Philippenko, Aymeric Dieuleveut

ICML 2025arXiv:2505.09768

#7325

Self-Consuming Generative Models with Adversarially Curated Data

Xiukun Wei, Xueru Zhang

ICML 2025arXiv:2506.19031

#7326

When Diffusion Models Memorize: Inductive Biases in Probability Flow of Minimum-Norm Shallow Neural Nets

Chen Zeno, Hila Manor, Gregory Ongie et al.

ICLR 2025arXiv:2405.14318

#7327

Adaptive Retention & Correction: Test-Time Training for Continual Learning

Haoran Chen, Micah Goldblum, Zuxuan Wu et al.

AAAI 2025paperarXiv:2412.08222

#7328

Structured IB: Improving Information Bottleneck with Structured Feature Learning

Hanzhe Yang, Youlong Wu, Dingzhu Wen et al.

ICML 2025arXiv:2506.00772

#7329

LIFT the Veil for the Truth: Principal Weights Emerge after Rank Reduction for Reasoning-Focused Supervised Fine-Tuning

Zihang Liu, Tianyu Pang, Oleg Balabanov et al.

ICML 2025arXiv:2505.02130

#7330

Attention Mechanisms Perspective: Exploring LLM Processing of Graph-Structured Data

Guan Zhong, Likang Wu, Hongke Zhao et al.

#7331

HeterGP: Bridging Heterogeneity in Graph Neural Networks with Multi-View Prompting

Fengyu Yan, Xiaobao Wang, Dongxiao He et al.

ICLR 2025arXiv:2410.05602

#7332

Amortized Control of Continuous State Space Feynman-Kac Model for Irregular Time Series

Byoungwoo Park, Hyungi Lee, Juho Lee

AAAI 2025paperarXiv:2412.11744

#7333

Conditional Diffusion Models Based Conditional Independence Testing

Yanfeng Yang, Shuai Li, Yingjie Zhang et al.

ICML 2025arXiv:2502.03678

#7334

Reflection-Window Decoding: Text Generation with Selective Refinement

Zeyu Tang, Zhenhao Chen, Xiangchen Song et al.

AAAI 2025paperarXiv:2412.20487

#7335

Multimodal Variational Autoencoder: A Barycentric View

Peijie Qiu, Wenhui Zhu, Sayantan Kumar et al.

ICLR 2025arXiv:2411.18425

#7336

Streamlining Prediction in Bayesian Deep Learning

Rui Li, Marcus Klasson, Arno Solin et al.

AAAI 2025paperarXiv:2409.11212

#7337

Self-Evolutionary Large Language Models Through Uncertainty-Enhanced Preference Optimization

Jianing Wang, Yang Zhou, Xiaocheng Zhang et al.

ICML 2025arXiv:2410.09795

#7338

WGFormer: An SE(3)-Transformer Driven by Wasserstein Gradient Flows for Molecular Ground-State Conformation Prediction

Fanmeng Wang, Minjie Cheng, Hongteng Xu

#7339

SeRA: Self-Reviewing and Alignment of LLMs using Implicit Reward Margins

Jongwoo Ko, Saket Dingliwal, Bhavana Ganesh et al.

ICLR 2025arXiv:2502.06335

#7340

Microcanonical Langevin Ensembles: Advancing the Sampling of Bayesian Neural Networks

Emanuel Sommer, Jakob Robnik, Giorgi Nozadze et al.

ICLR 2025arXiv:2412.04910

#7341

Learning High-Degree Parities: The Crucial Role of the Initialization

Emmanuel Abbe, Elisabetta Cornacchia, Jan Hązła et al.

#7342

Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality

Ge Ya Luo, Gian M Favero, Zhi Hao Luo et al.

ICLR 2025oral

ICML 2025arXiv:2503.11842

#7343

Test-Time Training Provably Improves Transformers as In-context Learners

Halil Alperen Gozeten, Muhammed Emrullah Ildiz, Xuechen Zhang et al.

ICML 2025oralarXiv:2502.12082

#7344

AdaSplash: Adaptive Sparse Flash Attention

Nuno Gonçalves, Marcos V. Treviso, Andre Martins

ICLR 2025arXiv:2410.17270

#7345

MOFFlow: Flow Matching for Structure Prediction of Metal-Organic Frameworks

Nayoung Kim, Seongsu Kim, Minsu Kim et al.

ICML 2025arXiv:2502.07244

#7346

Linear Transformers as VAR Models: Aligning Autoregressive Attention Mechanisms with Autoregressive Forecasting

Jiecheng Lu, Shihao Yang

ICML 2025arXiv:2502.04757

#7347

ELITE: Enhanced Language-Image Toxicity Evaluation for Safety

Wonjun Lee, Doehyeon Lee, Eugene Choi et al.

ICML 2025arXiv:2502.07460

#7348

Logarithmic Regret for Online KL-Regularized Reinforcement Learning

Heyang Zhao, Chenlu Ye, Wei Xiong et al.

NEURIPS 2025arXiv:2505.18044

#7349

Linear Mixture Distributionally Robust Markov Decision Processes

Zhishuai Liu, Pan Xu

ICML 2025arXiv:2505.18532

#7350

Preserving AUC Fairness in Learning with Noisy Protected Groups

Mingyang Wu, Li Lin, Wenbin Zhang et al.

ICML 2025arXiv:2504.18574

#7351

Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism

Aviv Bick, Eric Xing, Albert Gu

ICML 2025arXiv:2506.21602

#7352

BiMark: Unbiased Multilayer Watermarking for Large Language Models

Xiaoyan Feng, He Zhang, Yanjun Zhang et al.

AAAI 2025paperarXiv:2402.01371

#7353

Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation

Prashansa Panda, Shalabh Bhatnagar

ICML 2025arXiv:2407.20444

#7354

Importance Corrected Neural JKO Sampling

Johannes Hertrich, Robert Gruhlke

NEURIPS 2025arXiv:2505.11194

#7355

Prot2Text-V2: Protein Function Prediction with Multimodal Contrastive Alignment

Xiao Fei, Michail Chatzianastasis, Sarah Carneiro et al.

ICML 2025arXiv:2505.03792

#7356

Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning

Lang Feng, Weihao Tan, Zhiyi Lyu et al.

ICML 2025arXiv:2505.20089

#7357

Homophily Enhanced Graph Domain Adaptation

Ruiyi Fang, Bingheng Li, Jingyu Zhao et al.

ICLR 2025arXiv:2410.14445

#7358

Toward Generalizing Visual Brain Decoding to Unseen Subjects

Xiangtao Kong, Kexin Huang, Ping Li et al.

ICLR 2025arXiv:2503.03595

#7359

Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias

Rui Lu, Runzhe Wang, Kaifeng Lyu et al.

ICML 2025spotlightarXiv:2505.23017

#7360

$K^2$VAE: A Koopman-Kalman Enhanced Variational AutoEncoder for Probabilistic Time Series Forecasting

Xingjian Wu, Xiangfei Qiu, Hongfan Gao et al.

#7361

Model-Free Offline Reinforcement Learning with Enhanced Robustness

Chi Zhang, Zain Ulabedeen Farhat, George Atia et al.

AAAI 2025paperarXiv:2403.11464

#7362

FedSPU: Personalized Federated Learning for Resource-Constrained Devices with Stochastic Parameter Update

Ziru Niu, Hai Dong, A. K. Qin

NEURIPS 2025oralarXiv:2507.00583

#7363

AI-Generated Video Detection via Perceptual Straightening

Christian Internò, Robert Geirhos, Markus Olhofer et al.

ICLR 2025arXiv:2410.09543

#7364

Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions

Xiaoran Jiao, Weian Mao, Wengong Jin et al.

ICML 2025arXiv:2502.02367

#7365

Field Matching: an Electrostatic Paradigm to Generate and Transfer Data

Alexander Kolesov, S. Manukhov, Vladimir Palyulin et al.

#7366

Scaling Laws for Floating–Point Quantization Training

Xingwu Sun, Shuaipeng Li, Ruobing Xie et al.

ICML 2025

ICML 2025arXiv:2503.15748

#7367

PARQ: Piecewise-Affine Regularized Quantization

Lisa Jin, Jianhao Ma, Zechun Liu et al.

ICML 2025oralarXiv:2505.06892

#7368

Learning Soft Sparse Shapes for Efficient Time-Series Classification

Zhen Liu, Yicheng Luo, Boyuan Li et al.

#7369

GLoRa: A Benchmark to Evaluate the Ability to Learn Long-Range Dependencies in Graphs

Dongzhuoran Zhou, Evgeny Kharlamov, Egor Kostylev

NEURIPS 2025oralarXiv:2506.16055

#7370

Knee-Deep in C-RASP: A Transformer Depth Hierarchy

Andy J Yang, Michaël Cadilhac, David Chiang

ICML 2025arXiv:2502.01362

#7371

Inverse Bridge Matching Distillation

Nikita Gushchin, David Li, Daniil Selikhanovych et al.

ICLR 2025arXiv:2405.18183

#7372

Feature-Based Online Bilateral Trade

Solenne Gaucher, Martino Bernasconi, Matteo Castiglioni et al.

ICLR 2025arXiv:2505.20029

#7373

Correlating instruction-tuning (in multimodal models) with vision-language processing (in the brain)

SUBBA REDDY OOTA, Akshett Rai Jindal, Ishani Mondal et al.

ICLR 2025arXiv:2503.15579

#7374

Understanding the Generalization of In-Context Learning in Transformers: An Empirical Study

Xingxuan Zhang, Haoran Wang, Jiansheng Li et al.

ICML 2025arXiv:2503.16398

#7375

The Global Convergence Time of Stochastic Gradient Descent in Non-Convex Landscapes: Sharp Estimates via Large Deviations

Waïss Azizian, Franck Iutzeler, Jérôme Malick et al.

ICLR 2025arXiv:2411.16502

#7376

Interpreting Language Reward Models via Contrastive Explanations

Junqi Jiang, Tom Bewley, Saumitra Mishra et al.

ICLR 2025arXiv:2504.03810

#7377

Hierarchically Encapsulated Representation for Protocol Design in Self-Driving Labs

Yu-Zhe Shi, Mingchen Liu, Fanxu Meng et al.

ICML 2025arXiv:2502.05807

#7378

Devil is in the Details: Density Guidance for Detail-Aware Generation with Flow Models

Rafał Karczewski, Markus Heinonen, Vikas Garg

AAAI 2025paperarXiv:2412.17856

#7379

Graph Structure Refinement with Energy-based Contrastive Learning

Xianlin Zeng, Yufeng Wang, Yuqi Sun et al.

AAAI 2025paperarXiv:2503.17017

#7380

Specifying What You Know or Not for Multi-Label Class-Incremental Learning

Aoting Zhang, Dongbao Yang, Chang Liu et al.

NEURIPS 2025spotlightarXiv:2505.17534

#7381

Co-Reinforcement Learning for Unified Multimodal Understanding and Generation

Jingjing Jiang, Chongjie Si, Jun Luo et al.

ICLR 2025arXiv:2505.21974

#7382

BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL

Yu Heng Hung, Kai-Jie Lin, Yu-Heng Lin et al.

ICLR 2025arXiv:2405.07373

#7383

From Probability to Counterfactuals: the Increasing Complexity of Satisfiability in Pearl's Causal Hierarchy

Julian Dörfler, Benito van der Zander, Markus Bläser et al.

ICLR 2025arXiv:2505.08740

#7384

Sensitivity-Constrained Fourier Neural Operators for Forward and Inverse Problems in Parametric Differential Equations

Abdolmehdi Behroozi, Chaopeng Shen, Daniel Kifer

ICLR 2025arXiv:2410.12457

#7385

Sharpness-Aware Black-Box Optimization

Feiyang YE, YUEMING LYU, Xuehao Wang et al.

ICLR 2025arXiv:2411.01553

#7386

Learning to Communicate Through Implicit Communication Channels

Han Wang, Binbin Chen, zhang et al.

ICLR 2025arXiv:2405.19440

#7387

MGDA Converges under Generalized Smoothness, Provably

Qi Zhang, Peiyao Xiao, Shaofeng Zou et al.

ICML 2025arXiv:2507.04610

#7388

any4: Learned 4-bit Numeric Representation for LLMs

Mostafa Elhoushi, Jeff Johnson

ICML 2025arXiv:2506.11039

#7389

Angle Domain Guidance: Latent Diffusion Requires Rotation Rather Than Extrapolation

Cheng Jin, Zhenyu Xiao, Chutao Liu et al.

ICLR 2025arXiv:2503.00799

#7390

On Generalization Across Environments In Multi-Objective Reinforcement Learning

Jayden Teoh, Pradeep Varakantham, Peter Vamplew

AAAI 2025paperarXiv:2401.09953

#7391

Through the Dual-Prism: A Spectral Perspective on Graph Data Augmentation for Graph Classifications

Yutong Xia, Runpeng Yu, Yuxuan Liang et al.

ICML 2025arXiv:2505.02288

#7392

Universal Approximation Theorem of Deep Q-Networks

Qian Qi

ICLR 2025arXiv:2502.14204

#7393

On-the-fly Preference Alignment via Principle-Guided Decoding

Mingye Zhu, Yi Liu, Lei Zhang et al.

ICML 2025arXiv:2505.18956

#7394

How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation

Yining Pan, Qiongjie Cui, Xulei Yang et al.

AAAI 2025paperarXiv:2502.10675

#7395

Hierarchically-Structured Open-Vocabulary Indoor Scene Synthesis with Pre-trained Large Language Model

Weilin Sun, Xinran Li, Manyi Li et al.

ICML 2025arXiv:2502.08141

#7396

LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits

Zikai Zhou, Qizheng Zhang, Hermann Kumbong et al.

AAAI 2025paperarXiv:2503.18317

#7397

Improved Rates of Differentially Private Nonconvex-Strongly-Concave Minimax Optimization

Ruijia Zhang, Mingxi Lei, Meng Ding et al.

ICLR 2025arXiv:2410.10253

#7398

Feedback Favors the Generalization of Neural ODEs

Jindou Jia, Zihan Yang, Meng Wang et al.

ICLR 2025arXiv:2502.13674

#7399

SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation

Song Duong, Florian Le Bronnec, Alexandre Allauzen et al.

ICML 2025arXiv:2505.13652

#7400

Guided Search Strategies in Non-Serializable Environments with Applications to Software Engineering Agents

Karina Zainullina, Aleksandr Golubev, Maria Trofimova et al.