Most Cited 2025 "molecular large language models" Papers

22,274 papers found

#7201

Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues

Yan Zhang, Gangyan Zeng, Huawen Shen et al.

AAAI 2025 • paper • arXiv:2412.12502
6 citations
#7202

When Bad Data Leads to Good Models

Kenneth Li, Yida Chen, Fernanda Viégas et al.

ICML 2025 • arXiv:2505.04741
6 citations
#7203

Activation by Interval-wise Dropout: A Simple Way to Prevent Neural Networks from Plasticity Loss

Sangyeon Park, Isaac Han, Seungwon Oh et al.

ICML 2025 • arXiv:2502.01342
6 citations
#7204

Bayesian Experimental Design Via Contrastive Diffusions

Jacopo Iollo, Christophe Heinkelé, Pierre Alliez et al.

ICLR 2025 • arXiv:2410.11826
6 citations
#7205

Efficient First-Order Optimization on the Pareto Set for Multi-Objective Learning under Preference Guidance

Lisha Chen, Quan Xiao, Ellen Fukuda et al.

ICML 2025 • spotlight • arXiv:2504.02854
6 citations
#7206

Conformal Language Model Reasoning with Coherent Factuality

Maxon Rubin-Toles, Maya Gambhir, Keshav Ramji et al.

ICLR 2025 • arXiv:2505.17126
6 citations
#7207

Lawma: The Power of Specialization for Legal Annotation

Ricardo Dominguez-Olmedo, Vedant Nanda, Rediet Abebe et al.

ICLR 2025 • arXiv:2407.16615
6 citations
#7208

Are Expressive Models Truly Necessary for Offline RL?

Guan Wang, Haoyi Niu, Jianxiong Li et al.

AAAI 2025 • paper • arXiv:2412.11253
6 citations
#7209

Ask, and it shall be given: On the Turing completeness of prompting

Ruizhong Qiu, Zhe Xu, Wenxuan Bao et al.

ICLR 2025 • arXiv:2411.01992
6 citations
#7210

Let Me Grok for You: Accelerating Grokking via Embedding Transfer from a Weaker Model

Zhiwei Xu, Zhiyu Ni, Yixin Wang et al.

ICLR 2025 • arXiv:2504.13292
6 citations
#7211

New Perspectives on the Polyak Stepsize: Surrogate Functions and Negative Results

Francesco Orabona, Ryan D'Orazio

NeurIPS 2025 • arXiv:2505.20219
6 citations
#7212

Robust Offline Reinforcement Learning with Linearly Structured $f$-Divergence Regularization

Cheng Tang, Zhishuai Liu, Pan Xu

ICML 2025 • arXiv:2411.18612
6 citations
#7213

Tight Clusters Make Specialized Experts

Stefan Nielsen, Rachel Teo, Laziz Abdullaev et al.

ICLR 2025 • arXiv:2502.15315
6 citations
#7214

Jacobian Sparse Autoencoders: Sparsify Computations, Not Just Activations

Lucy Farnik, Tim Lawson, Conor Houghton et al.

ICML 2025 • spotlight • arXiv:2502.18147
6 citations
#7215

SigDiffusions: Score-Based Diffusion Models for Time Series via Log-Signature Embeddings

Barbora Barancikova, Zhuoyue Huang, Cristopher Salvi

ICLR 2025 • arXiv:2406.10354
6 citations
#7216

Understanding Synthetic Context Extension via Retrieval Heads

Xinyu Zhao, Fangcong Yin, Greg Durrett

ICML 2025 • arXiv:2410.22316
6 citations
#7217

BlockDialect: Block-wise Fine-grained Mixed Format Quantization for Energy-Efficient LLM Inference

Wonsuk Jang, Thierry Tambe

ICML 2025 • arXiv:2501.01144
6 citations
#7218

Dynamic Graph Learning with Static Relations for Credit Risk Assessment

Qi Yuan, Yang Liu, Yateng Tang et al.

AAAI 2025 • paper
6 citations
#7219

Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better

Enshu Liu, Junyi Zhu, Zinan Lin et al.

ICLR 2025 • arXiv:2404.02241
6 citations
#7220

Bridging Training and Execution via Dynamic Directed Graph-Based Communication in Cooperative Multi-Agent Systems

Zhuohui Zhang, Bin He, Bin Cheng et al.

AAAI 2025 • paper • arXiv:2408.07397
6 citations
#7221

How to Train Your LLM Web Agent: A Statistical Diagnosis

Dheeraj Vattikonda, Santhoshi Ravichandran, Emiliano Penaloza et al.

NeurIPS 2025 • arXiv:2507.04103
6 citations
#7222

FedTMOS: Efficient One-Shot Federated Learning with Tsetlin Machine

Shannon How, Jagmohan Chauhan, Geoff Merrett et al.

ICLR 2025
6 citations
#7223

EquivaMap: Leveraging LLMs for Automatic Equivalence Checking of Optimization Formulations

Haotian Zhai, Connor Lawless, Ellen Vitercik et al.

ICML 2025 • arXiv:2502.14760
6 citations
#7224

CAMEx: Curvature-aware Merging of Experts

Dung Viet Nguyen, Minh Nguyen, Luc Nguyen et al.

ICLR 2025 • arXiv:2502.18821
6 citations
#7225

Asymmetric Visual Semantic Embedding Framework for Efficient Vision-Language Alignment

Yang Liu, Mengyuan Liu, Shudong Huang et al.

AAAI 2025 • paper • arXiv:2503.06974
6 citations
#7226

Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation

D. Sculley, William Cukierski, Phil Culliton et al.

ICML 2025 • oral • arXiv:2505.00612
6 citations
#7227

Multifaceted User Modeling in Recommendation: A Federated Foundation Models Approach

Chunxu Zhang, Guodong Long, Hongkuan Guo et al.

AAAI 2025 • paper • arXiv:2412.16969
6 citations
#7228

WaterDiffusion: Learning a Prior-involved Unrolling Diffusion for Joint Underwater Saliency Detection and Visual Restoration

Laibin Chang, Yunke Wang, Longxiang Deng et al.

AAAI 2025 • paper
6 citations
#7229

Aligning Multimodal Representations through an Information Bottleneck

Antonio Almudévar, Jose Miguel Hernandez-Lobato, Sameer Khurana et al.

ICML 2025 • arXiv:2506.04870
6 citations
#7230

Forte: Finding Outliers with Representation Typicality Estimation

Debargha Ganguly, Warren Morningstar, Andrew Yu et al.

ICLR 2025 • arXiv:2410.01322
6 citations
#7231

Bootstrapped Model Predictive Control

Yuhang Wang, Hanwei Guo, Sizhe Wang et al.

ICLR 2025 • arXiv:2503.18871
6 citations
#7232

Text2Relight: Creative Portrait Relighting with Text Guidance

Junuk Cha, Mengwei Ren, Krishna Kumar Singh et al.

AAAI 2025 • paper • arXiv:2412.13734
6 citations
#7233

SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation

Ke Yan, Qing Cai, Fan Zhang et al.

AAAI 2025 • paper • arXiv:2412.15526
6 citations
#7234

Making Large Vision Language Models to Be Good Few-Shot Learners

Fan Liu, Wenwen Cai, Jian Huo et al.

AAAI 2025 • paper • arXiv:2408.11297
6 citations
#7235

PhysAug: A Physical-guided and Frequency-based Data Augmentation for Single-Domain Generalized Object Detection

Xiaoran Xu, Jiangang Yang, Wenhui Shi et al.

AAAI 2025 • paper • arXiv:2412.11807
6 citations
#7236

Comparing noisy neural population dynamics using optimal transport distances

Amin Nejatbakhsh, Victor Geadah, Alex Williams et al.

ICLR 2025 • arXiv:2412.14421
6 citations
#7237

Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents

Han Lin, Jaemin Cho, Amir Zadeh et al.

NeurIPS 2025 • arXiv:2508.05954
6 citations
#7238

Zero-resource Hallucination Detection for Text Generation via Graph-based Contextual Knowledge Triples Modeling

Xinyue Fang, Zhen Huang, Zhiliang Tian et al.

AAAI 2025 • paper • arXiv:2409.11283
6 citations
#7239

Linear combinations of latents in generative models: subspaces and beyond

Erik Bodin, Alexandru Stere, Dragos Margineantu et al.

ICLR 2025 • arXiv:2408.08558
6 citations
#7240

Causal Discovery from Conditionally Stationary Time Series

Carles Balsells-Rodas, Xavier Sumba, Tanmayee Narendra et al.

ICML 2025 • arXiv:2110.06257
6 citations
#7241

Bridging Information Asymmetry in Text-video Retrieval: A Data-centric Approach

Zechen Bai, Tianjun Xiao, Tong He et al.

ICLR 2025 • arXiv:2408.07249
6 citations
#7242

Bridging Molecular Graphs and Large Language Models

Runze Wang, Mingqi Yang, Yanming Shen

AAAI 2025 • paper • arXiv:2503.03135
6 citations
#7243

EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization

Mujin Cheon, Jay Lee, Dong-Yeun Koh et al.

ICML 2025 • arXiv:2411.00171
6 citations
#7244

Is Large-scale Pretraining the Secret to Good Domain Generalization?

Piotr Teterwak, Kuniaki Saito, Theodoros Tsiligkaridis et al.

ICLR 2025 • arXiv:2412.02856
6 citations
#7245

Automatically Generating Numerous Context-Driven SFT Data for LLMs Across Diverse Granularity

Shanghaoran Quan

AAAI 2025 • paper • arXiv:2405.16579
6 citations
#7246

SADA: Stability-guided Adaptive Diffusion Acceleration

Ting Jiang, Yixiao Wang, Hancheng Ye et al.

ICML 2025 • arXiv:2507.17135
6 citations
#7247

Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning

Amin Karimi Monsefi, Mengxi Zhou, Nastaran Monsefi et al.

ICLR 2025 • arXiv:2409.10362
6 citations
#7248

General framework for online-to-nonconvex conversion: Schedule-free SGD is also effective for nonconvex optimization

Kwangjun Ahn, Gagik Magakyan, Ashok Cutkosky

ICML 2025 • oral • arXiv:2411.07061
6 citations
#7249

Population Aware Diffusion for Time Series Generation

Yang Li, Han Meng, Zhenyu Bi et al.

AAAI 2025 • paper • arXiv:2501.00910
6 citations
#7250

Biologically Plausible Brain Graph Transformer

Ciyuan Peng, Yuelong Huang, Qichao Dong et al.

ICLR 2025 • arXiv:2502.08958
6 citations
#7251

Investigating the Pre-Training Dynamics of In-Context Learning: Task Recognition vs. Task Learning

Xiaolei Wang, Xinyu Tang, Junyi Li et al.

ICLR 2025 • arXiv:2406.14022
6 citations
#7252

Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity

Alessandro Pierro, Steven Abreu, Jonathan Timcheck et al.

ICML 2025 • arXiv:2502.01330
6 citations
#7253

ExcluIR: Exclusionary Neural Information Retrieval

Wenhao Zhang, Mengqi Zhang, Shiguang Wu et al.

AAAI 2025 • paper • arXiv:2404.17288
6 citations
#7254

SAFEx: Analyzing Vulnerabilities of MoE-Based LLMs via Stable Safety-critical Expert Identification

Zhenglin Lai, Mengyao Liao, Bingzhe Wu et al.

NeurIPS 2025 • arXiv:2506.17368
6 citations
#7255

Precedence-Constrained Winter Value for Effective Graph Data Valuation

Hongliang Chi, Wei Jin, Charu Aggarwal et al.

ICLR 2025 • arXiv:2402.01943
6 citations
#7256

Robust Conformal Outlier Detection under Contaminated Reference Data

Meshi Bashari, Matteo Sesia, Yaniv Romano

ICML 2025 • arXiv:2502.04807
6 citations
#7257

Decoupling Layout from Glyph in Online Chinese Handwriting Generation

Minsi Ren, Yan-Ming Zhang, Yi Chen

ICLR 2025 • arXiv:2410.02309
6 citations
#7258

EF-3DGS: Event-Aided Free-Trajectory 3D Gaussian Splatting

Bohao Liao, Wei Zhai, Zengyu Wan et al.

NeurIPS 2025 • oral • arXiv:2410.15392
6 citations
#7259

When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding

Yan Shu, Hangui Lin, Yexin Liu et al.

NeurIPS 2025 • arXiv:2506.05551
6 citations
#7260

Infer Human’s Intentions Before Following Natural Language Instructions

Yanming Wan, Yue Wu, Yiping Wang et al.

AAAI 2025 • paper • arXiv:2409.18073
6 citations
#7261

Understanding and Mitigating Memorization in Diffusion Models for Tabular Data

Zhengyu Fang, Zhimeng Jiang, Huiyuan Chen et al.

ICML 2025 • arXiv:2412.11044
6 citations
#7262

A Robust Prototype-Based Network with Interpretable RBF Classifier Foundations

Sascha Saralajew, Ashish Rana, Thomas Villmann et al.

AAAI 2025 • paper • arXiv:2412.15499
6 citations
#7263

One Leaf Reveals the Season: Occlusion-Based Contrastive Learning with Semantic-Aware Views for Efficient Visual Representation

Xiaoyu Yang, Lijian Xu, Hongsheng Li et al.

ICML 2025 • arXiv:2411.09858
6 citations
#7264

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

Shenao Zhang, Zhihan Liu, Boyi Liu et al.

ICML 2025 • arXiv:2410.08067
6 citations
#7265

AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring

Xinyi Wang, Na Zhao, Zhiyuan Han et al.

AAAI 2025 • paper • arXiv:2501.09428
6 citations
#7266

MLC-NC: Long-Tailed Multi-Label Image Classification Through the Lens of Neural Collapse

Zijian Tao, Shao-Yuan Li, Wenhai Wan et al.

AAAI 2025 • paper
6 citations
#7267

Autocorrelation Matters: Understanding the Role of Initialization Schemes for State Space Models

Fusheng Liu, Qianxiao Li

ICLR 2025 • oral • arXiv:2411.19455
6 citations
#7268

Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion

Minkyoung Cho, Yulong Cao, Jiachen Sun et al.

ICLR 2025 • arXiv:2410.12592
6 citations
#7269

Implicit Bias of Gradient Descent for Non-Homogeneous Deep Networks

Yuhang Cai, Kangjie Zhou, Jingfeng Wu et al.

ICML 2025 • arXiv:2502.16075
6 citations
#7270

Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization

Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi et al.

ICLR 2025 • arXiv:2410.02275
6 citations
#7271

Topo2Seq: Enhanced Topology Reasoning via Topology Sequence Learning

Yiming Yang, Yueru Luo, Bingkun He et al.

AAAI 2025 • paper • arXiv:2502.08974
6 citations
#7272

DUO: Diverse, Uncertain, On-Policy Query Generation and Selection for Reinforcement Learning from Human Feedback

Xuening Feng, Zhaohui Jiang, Timo Kaufmann et al.

AAAI 2025 • paper
6 citations
#7273

MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula

Sieun Hyeon, Kyudan Jung, Jaehee Won et al.

AAAI 2025 • paper • arXiv:2412.15655
6 citations
#7274

Activation Space Interventions Can Be Transferred Between Large Language Models

Narmeen Oozeer, Dhruv Nathawani, Nirmalendu Prakash et al.

ICML 2025 • arXiv:2503.04429
6 citations
#7275

Stiefel Flow Matching for Moment-Constrained Structure Elucidation

Austin H Cheng, Alston Lo, Kin Long Kelvin Lee et al.

ICLR 2025 • arXiv:2412.12540
6 citations
#7276

DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts

Zheng-Peng Duan, Jiawei Zhang, Zheng Lin et al.

AAAI 2025 • paper • arXiv:2407.03757
6 citations
#7277

Neuron-based Multifractal Analysis of Neuron Interaction Dynamics in Large Models

Xiongye Xiao, Heng Ping, Chenyu Zhou et al.

ICLR 2025 • arXiv:2402.09099
6 citations
#7278

Revisiting a Design Choice in Gradient Temporal Difference Learning

Xiaochi Qian, Shangtong Zhang

ICLR 2025 • oral • arXiv:2308.01170
6 citations
#7279

Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference

Matt Riemer, Gopeshh Raaj Subbaraj, Glen Berseth et al.

ICLR 2025 • arXiv:2412.14355
6 citations
#7280

Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language Navigation

Yiyuan Pan, Yunzhe Xu, Zhe Liu et al.

AAAI 2025 • paper • arXiv:2412.01857
6 citations
#7281

DualCP: Rehearsal-Free Domain-Incremental Learning via Dual-Level Concept Prototype

Qiang Wang, Yuhang He, Songlin Dong et al.

AAAI 2025 • paper • arXiv:2503.18042
6 citations
#7282

Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection

Chuhan Zhang, Chaoyang Zhu, Pingcheng Dong et al.

ICLR 2025 • arXiv:2503.11005
6 citations
#7283

Capturing Temporal Dynamics in Large-Scale Canopy Tree Height Estimation

Jan Pauls, Max Zimmer, Berkant Turan et al.

ICML 2025 • oral • arXiv:2501.19328
6 citations
#7284

Collapse-Proof Non-Contrastive Self-Supervised Learning

Emanuele Sansone, Tim Lebailly, Tinne Tuytelaars

ICML 2025 • arXiv:2410.04959
6 citations
#7285

Q-MAML: Quantum Model-Agnostic Meta-Learning for Variational Quantum Algorithms

Junyong Lee, Jeihee Cho, Shiho Kim

AAAI 2025 • paper • arXiv:2501.05906
6 citations
#7286

Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion

Xingpei Ma, Jiaran Cai, Yuansheng Guan et al.

ICML 2025 • arXiv:2502.07203
6 citations
#7287

DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation

Jiashuo Sun, Xianrui Zhong, Sizhe Zhou et al.

NeurIPS 2025 • arXiv:2505.07233
6 citations
#7288

B-score: Detecting biases in large language models using response history

An Vo, Mohammad Reza Taesiri, Daeyoung Kim et al.

ICML 2025 • arXiv:2505.18545
6 citations
#7289

Predicting mutational effects on protein binding from folding energy

Arthur Deng, Karsten Householder, Fang Wu et al.

ICML 2025 • arXiv:2507.05502
6 citations
#7290

Regress, Don't Guess: A Regression-like Loss on Number Tokens for Language Models

Jonas Zausinger, Lars Pennig, Anamarija Kozina et al.

ICML 2025 • arXiv:2411.02083
6 citations
#7291

Muses: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration

Yanbo Ding, Shaobin Zhuang, Kunchang Li et al.

AAAI 2025 • paper • arXiv:2408.10605
6 citations
#7292

PoseLLaVA: Pose Centric Multimodal LLM for Fine-Grained 3D Pose Manipulation

Dong Feng, Ping Guo, Encheng Peng et al.

AAAI 2025 • paper
6 citations
#7293

SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs

Xin Su, Man Luo, Kris Pan et al.

ICML 2025 • oral • arXiv:2406.19593
6 citations
#7294

SPD: Sync-Point Drop for Efficient Tensor Parallelism of Large Language Models

Han-Byul Kim, Duc Hoang, Arnav Kundu et al.

ICML 2025 • arXiv:2502.20727
6 citations
#7295

Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval

Guangyuan Ma, Yongliang Ma, Xing Wu et al.

AAAI 2025 • paper • arXiv:2408.10613
6 citations
#7296

MPO: An Efficient Post-Processing Framework for Mixing Diverse Preference Alignment

Tianze Wang, Dongnan Gui, Yifan Hu et al.

ICML 2025 • arXiv:2502.18699
5 citations
#7297

Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models

Chengyu Du, Jinyi Han, Yizhou Ying et al.

ICLR 2025 • arXiv:2410.13413
5 citations
#7298

Conditional Diffusion Models are Minimax-Optimal and Manifold-Adaptive for Conditional Distribution Estimation

Rong Tang, Lizhen Lin, Yun Yang

ICLR 2025 • arXiv:2409.20124
5 citations
#7299

Sparis: Neural Implicit Surface Reconstruction of Indoor Scenes from Sparse Views

Yulun Wu, Han Huang, Wenyuan Zhang et al.

AAAI 2025 • paper • arXiv:2501.01196
5 citations
#7300

Faithful and Accurate Self-Attention Attribution for Message Passing Neural Networks via the Computation Tree Viewpoint

Yong-Min Shin, Siqing Li, Xin Cao et al.

AAAI 2025 • paper • arXiv:2406.04612
5 citations
#7301

Handling Delay in Real-Time Reinforcement Learning

Ivan Anokhin, Rishav Rishav, Matt Riemer et al.

ICLR 2025 • oral • arXiv:2503.23478
5 citations
#7302

Scaling Sparse Feature Circuits For Studying In-Context Learning

Dmitrii Kharlapenko, Stepan Shabalin, Arthur Conmy et al.

ICML 2025
5 citations
#7303

SyncMind: Measuring Agent Out-of-Sync Recovery in Collaborative Software Engineering

Xuehang Guo, Xingyao Wang, Yangyi Chen et al.

ICML 2025 • arXiv:2502.06994
5 citations
#7304

Optimal Non-Asymptotic Rates of Value Iteration for Average-Reward Markov Decision Processes

Jongmin Lee, Ernest Ryu

ICLR 2025 • arXiv:2504.09913
5 citations
#7305

Zeroth-Order Optimization Finds Flat Minima

Liang Zhang, Bingcong Li, Kiran Thekumparampil et al.

NeurIPS 2025 • arXiv:2506.05454
5 citations
#7306

A Generic Framework for Conformal Fairness

Aditya Vadlamani, Anutam Srinivasan, Pranav Maneriker et al.

ICLR 2025 • arXiv:2505.16115
5 citations
#7307

Generative Intervention Models for Causal Perturbation Modeling

Nora Schneider, Lars Lorch, Niki Kilbertus et al.

ICML 2025 • arXiv:2411.14003
5 citations
#7308

One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework

Feiran Li, Qianqian Xu, Shilong Bao et al.

ICML 2025 • arXiv:2505.11131
5 citations
#7309

Neural Entropy

Akhil Premkumar

NeurIPS 2025 • spotlight • arXiv:2409.03817
5 citations
#7310

Tuning Sequential Monte Carlo Samplers via Greedy Incremental Divergence Minimization

Kyurae Kim, Zuheng Xu, Jacob Gardner et al.

ICML 2025 • arXiv:2503.15704
5 citations
#7311

Every Rollout Counts: Optimal Resource Allocation for Efficient Test-Time Scaling

Xinglin Wang, Yiwei Li, Shaoxiong Feng et al.

NeurIPS 2025 • arXiv:2506.15707
5 citations
#7312

Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark

Yongliang Wu, Wenbo Zhu, Jiawang Cao et al.

AAAI 2025 • paper • arXiv:2412.08879
5 citations
#7313

Kernel-based Unsupervised Embedding Alignment for Enhanced Visual Representation in Vision-language Models

Shizhan Gong, Yankai Jiang, Dou Qi et al.

ICML 2025 • arXiv:2506.02557
5 citations
#7314

Cape: Context-Aware Prompt Perturbation Mechanism with Differential Privacy

Haoqi Wu, Wei Dai, Wang Li et al.

ICML 2025 • arXiv:2505.05922
5 citations
#7315

Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models

Linh Tran, Wei Sun, Stacy Patterson et al.

ICLR 2025 • arXiv:2501.13904
5 citations
#7316

CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features

Po-han Li, Sandeep Chinchali, Ufuk Topcu

ICLR 2025 • arXiv:2410.07610
5 citations
#7317

ERL-MPP: Evolutionary Reinforcement Learning with Multi-head Puzzle Perception for Solving Large-scale Jigsaw Puzzles of Eroded Gaps

Xingke Song, Xiaoying Yang, Chenglin Yao et al.

AAAI 2025 • paper • arXiv:2504.09608
5 citations
#7318

On the Vulnerability of Applying Retrieval-Augmented Generation within Knowledge-Intensive Application Domains

Xun Xian, Ganghua Wang, Xuan Bi et al.

ICML 2025 • arXiv:2409.17275
5 citations
#7319

Focus On This, Not That! Steering LLMs with Adaptive Feature Specification

Tom A. Lamb, Adam Davies, Alasdair J Paren et al.

ICML 2025 • arXiv:2410.22944
5 citations
#7320

Spatial Reasoning with Denoising Models

Christopher Wewer, Bartlomiej Pogodzinski, Bernt Schiele et al.

ICML 2025 • arXiv:2502.21075
5 citations
#7321

FlowDrag: 3D-aware Drag-based Image Editing with Mesh-guided Deformation Vector Flow Fields

Gwanhyeong Koo, Sunjae Yoon, Younghwan Lee et al.

ICML 2025 • spotlight • arXiv:2507.08285
5 citations
#7322

KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models

Fan Wang, Juyong Jiang, Chansung Park et al.

ICLR 2025 • arXiv:2412.06071
5 citations
#7323

Feature Clipping for Uncertainty Calibration

Linwei Tao, Minjing Dong, Chang Xu

AAAI 2025 • paper • arXiv:2410.19796
5 citations
#7324

Compressed and distributed least-squares regression: convergence rates with applications to federated learning

Constantin Philippenko, Aymeric Dieuleveut

ICML 2025 • arXiv:2308.01358
5 citations
#7325

Self-Consuming Generative Models with Adversarially Curated Data

Xiukun Wei, Xueru Zhang

ICML 2025 • arXiv:2505.09768
5 citations
#7326

When Diffusion Models Memorize: Inductive Biases in Probability Flow of Minimum-Norm Shallow Neural Nets

Chen Zeno, Hila Manor, Gregory Ongie et al.

ICML 2025 • arXiv:2506.19031
5 citations
#7327

Adaptive Retention & Correction: Test-Time Training for Continual Learning

Haoran Chen, Micah Goldblum, Zuxuan Wu et al.

ICLR 2025 • arXiv:2405.14318
5 citations
#7328

Structured IB: Improving Information Bottleneck with Structured Feature Learning

Hanzhe Yang, Youlong Wu, Dingzhu Wen et al.

AAAI 2025 • paper • arXiv:2412.08222
5 citations
#7329

LIFT the Veil for the Truth: Principal Weights Emerge after Rank Reduction for Reasoning-Focused Supervised Fine-Tuning

Zihang Liu, Tianyu Pang, Oleg Balabanov et al.

ICML 2025 • arXiv:2506.00772
5 citations
#7330

Attention Mechanisms Perspective: Exploring LLM Processing of Graph-Structured Data

Guan Zhong, Likang Wu, Hongke Zhao et al.

ICML 2025 • arXiv:2505.02130
5 citations
#7331

HeterGP: Bridging Heterogeneity in Graph Neural Networks with Multi-View Prompting

Fengyu Yan, Xiaobao Wang, Dongxiao He et al.

AAAI 2025 • paper
5 citations
#7332

Amortized Control of Continuous State Space Feynman-Kac Model for Irregular Time Series

Byoungwoo Park, Hyungi Lee, Juho Lee

ICLR 2025 • arXiv:2410.05602
5 citations
#7333

Conditional Diffusion Models Based Conditional Independence Testing

Yanfeng Yang, Shuai Li, Yingjie Zhang et al.

AAAI 2025 • paper • arXiv:2412.11744
5 citations
#7334

Reflection-Window Decoding: Text Generation with Selective Refinement

Zeyu Tang, Zhenhao Chen, Xiangchen Song et al.

ICML 2025 • arXiv:2502.03678
5 citations
#7335

Multimodal Variational Autoencoder: A Barycentric View

Peijie Qiu, Wenhui Zhu, Sayantan Kumar et al.

AAAI 2025 • paper • arXiv:2412.20487
5 citations
#7336

Streamlining Prediction in Bayesian Deep Learning

Rui Li, Marcus Klasson, Arno Solin et al.

ICLR 2025 • arXiv:2411.18425
5 citations
#7337

Self-Evolutionary Large Language Models Through Uncertainty-Enhanced Preference Optimization

Jianing Wang, Yang Zhou, Xiaocheng Zhang et al.

AAAI 2025 • paper • arXiv:2409.11212
5 citations
#7338

WGFormer: An SE(3)-Transformer Driven by Wasserstein Gradient Flows for Molecular Ground-State Conformation Prediction

Fanmeng Wang, Minjie Cheng, Hongteng Xu

ICML 2025 • arXiv:2410.09795
5 citations
#7339

SeRA: Self-Reviewing and Alignment of LLMs using Implicit Reward Margins

Jongwoo Ko, Saket Dingliwal, Bhavana Ganesh et al.

ICLR 2025
5 citations
#7340

Microcanonical Langevin Ensembles: Advancing the Sampling of Bayesian Neural Networks

Emanuel Sommer, Jakob Robnik, Giorgi Nozadze et al.

ICLR 2025 • arXiv:2502.06335
5 citations
#7341

Learning High-Degree Parities: The Crucial Role of the Initialization

Emmanuel Abbe, Elisabetta Cornacchia, Jan Hązła et al.

ICLR 2025 • arXiv:2412.04910
5 citations
#7342

Beyond FVD: An Enhanced Evaluation Metrics for Video Generation Distribution Quality

Ge Ya Luo, Gian M Favero, Zhi Hao Luo et al.

ICLR 2025 • oral
5 citations
#7343

Test-Time Training Provably Improves Transformers as In-context Learners

Halil Alperen Gozeten, Muhammed Emrullah Ildiz, Xuechen Zhang et al.

ICML 2025 • arXiv:2503.11842
5 citations
#7344

AdaSplash: Adaptive Sparse Flash Attention

Nuno Gonçalves, Marcos V. Treviso, Andre Martins

ICML 2025 • oral • arXiv:2502.12082
5 citations
#7345

MOFFlow: Flow Matching for Structure Prediction of Metal-Organic Frameworks

Nayoung Kim, Seongsu Kim, Minsu Kim et al.

ICLR 2025 • arXiv:2410.17270
5 citations
#7346

Linear Transformers as VAR Models: Aligning Autoregressive Attention Mechanisms with Autoregressive Forecasting

Jiecheng Lu, Shihao Yang

ICML 2025 • arXiv:2502.07244
5 citations
#7347

ELITE: Enhanced Language-Image Toxicity Evaluation for Safety

Wonjun Lee, Doehyeon Lee, Eugene Choi et al.

ICML 2025 • arXiv:2502.04757
5 citations
#7348

Logarithmic Regret for Online KL-Regularized Reinforcement Learning

Heyang Zhao, Chenlu Ye, Wei Xiong et al.

ICML 2025 • arXiv:2502.07460
5 citations
#7349

Linear Mixture Distributionally Robust Markov Decision Processes

Zhishuai Liu, Pan Xu

NeurIPS 2025 • arXiv:2505.18044
5 citations
#7350

Preserving AUC Fairness in Learning with Noisy Protected Groups

Mingyang Wu, Li Lin, Wenbin Zhang et al.

ICML 2025 • arXiv:2505.18532
5 citations
#7351

Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism

Aviv Bick, Eric Xing, Albert Gu

ICML 2025 • arXiv:2504.18574
5 citations
#7352

BiMark: Unbiased Multilayer Watermarking for Large Language Models

Xiaoyan Feng, He Zhang, Yanjun Zhang et al.

ICML 2025 • arXiv:2506.21602
5 citations
#7353

Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation

Prashansa Panda, Shalabh Bhatnagar

AAAI 2025 • paper • arXiv:2402.01371
5 citations
#7354

Importance Corrected Neural JKO Sampling

Johannes Hertrich, Robert Gruhlke

ICML 2025 • arXiv:2407.20444
5 citations
#7355

Prot2Text-V2: Protein Function Prediction with Multimodal Contrastive Alignment

Xiao Fei, Michail Chatzianastasis, Sarah Carneiro et al.

NeurIPS 2025 • arXiv:2505.11194
5 citations
#7356

Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning

Lang Feng, Weihao Tan, Zhiyi Lyu et al.

ICML 2025 • arXiv:2505.03792
5 citations
#7357

Homophily Enhanced Graph Domain Adaptation

Ruiyi Fang, Bingheng Li, Jingyu Zhao et al.

ICML 2025 • arXiv:2505.20089
5 citations
#7358

Toward Generalizing Visual Brain Decoding to Unseen Subjects

Xiangtao Kong, Kexin Huang, Ping Li et al.

ICLR 2025 • arXiv:2410.14445
5 citations
#7359

Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias

Rui Lu, Runzhe Wang, Kaifeng Lyu et al.

ICLR 2025 • arXiv:2503.03595
5 citations
#7360

$K^2$VAE: A Koopman-Kalman Enhanced Variational AutoEncoder for Probabilistic Time Series Forecasting

Xingjian Wu, Xiangfei Qiu, Hongfan Gao et al.

ICML 2025 • spotlight • arXiv:2505.23017
5 citations
#7361

Model-Free Offline Reinforcement Learning with Enhanced Robustness

Chi Zhang, Zain Ulabedeen Farhat, George Atia et al.

ICLR 2025
5 citations
#7362

FedSPU: Personalized Federated Learning for Resource-Constrained Devices with Stochastic Parameter Update

Ziru Niu, Hai Dong, A. K. Qin

AAAI 2025 • paper • arXiv:2403.11464
5 citations
#7363

AI-Generated Video Detection via Perceptual Straightening

Christian Internò, Robert Geirhos, Markus Olhofer et al.

NeurIPS 2025 • oral • arXiv:2507.00583
5 citations
#7364

Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions

Xiaoran Jiao, Weian Mao, Wengong Jin et al.

ICLR 2025 • arXiv:2410.09543
5 citations
#7365

Field Matching: an Electrostatic Paradigm to Generate and Transfer Data

Alexander Kolesov, S. Manukhov, Vladimir Palyulin et al.

ICML 2025 • arXiv:2502.02367
5 citations
#7366

Scaling Laws for Floating-Point Quantization Training

Xingwu Sun, Shuaipeng Li, Ruobing Xie et al.

ICML 2025
5 citations
#7367

PARQ: Piecewise-Affine Regularized Quantization

Lisa Jin, Jianhao Ma, Zechun Liu et al.

ICML 2025 • arXiv:2503.15748
5 citations
#7368

Learning Soft Sparse Shapes for Efficient Time-Series Classification

Zhen Liu, Yicheng Luo, Boyuan Li et al.

ICML 2025 • oral • arXiv:2505.06892
5 citations
#7369

GLoRa: A Benchmark to Evaluate the Ability to Learn Long-Range Dependencies in Graphs

Dongzhuoran Zhou, Evgeny Kharlamov, Egor Kostylev

ICLR 2025
5 citations
#7370

Knee-Deep in C-RASP: A Transformer Depth Hierarchy

Andy J Yang, Michaël Cadilhac, David Chiang

NeurIPS 2025 • oral • arXiv:2506.16055
5 citations
#7371

Inverse Bridge Matching Distillation

Nikita Gushchin, David Li, Daniil Selikhanovych et al.

ICML 2025 • arXiv:2502.01362
5 citations
#7372

Feature-Based Online Bilateral Trade

Solenne Gaucher, Martino Bernasconi, Matteo Castiglioni et al.

ICLR 2025 • arXiv:2405.18183
5 citations
#7373

Correlating instruction-tuning (in multimodal models) with vision-language processing (in the brain)

Subba Reddy Oota, Akshett Rai Jindal, Ishani Mondal et al.

ICLR 2025 • arXiv:2505.20029
5 citations
#7374

Understanding the Generalization of In-Context Learning in Transformers: An Empirical Study

Xingxuan Zhang, Haoran Wang, Jiansheng Li et al.

ICLR 2025 • arXiv:2503.15579
5 citations
#7375

The Global Convergence Time of Stochastic Gradient Descent in Non-Convex Landscapes: Sharp Estimates via Large Deviations

Waïss Azizian, Franck Iutzeler, Jérôme Malick et al.

ICML 2025 • arXiv:2503.16398
5 citations
#7376

Interpreting Language Reward Models via Contrastive Explanations

Junqi Jiang, Tom Bewley, Saumitra Mishra et al.

ICLR 2025 • arXiv:2411.16502
5 citations
#7377

Hierarchically Encapsulated Representation for Protocol Design in Self-Driving Labs

Yu-Zhe Shi, Mingchen Liu, Fanxu Meng et al.

ICLR 2025 • arXiv:2504.03810
5 citations
#7378

Devil is in the Details: Density Guidance for Detail-Aware Generation with Flow Models

Rafał Karczewski, Markus Heinonen, Vikas Garg

ICML 2025 • arXiv:2502.05807
5 citations
#7379

Graph Structure Refinement with Energy-based Contrastive Learning

Xianlin Zeng, Yufeng Wang, Yuqi Sun et al.

AAAI 2025 • paper • arXiv:2412.17856
5 citations
#7380

Specifying What You Know or Not for Multi-Label Class-Incremental Learning

Aoting Zhang, Dongbao Yang, Chang Liu et al.

AAAI 2025 • paper • arXiv:2503.17017
5 citations
#7381

Co-Reinforcement Learning for Unified Multimodal Understanding and Generation

Jingjing Jiang, Chongjie Si, Jun Luo et al.

NeurIPS 2025 • spotlight • arXiv:2505.17534
5 citations
#7382

BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL

Yu Heng Hung, Kai-Jie Lin, Yu-Heng Lin et al.

ICLR 2025 • arXiv:2505.21974
5 citations
#7383

From Probability to Counterfactuals: the Increasing Complexity of Satisfiability in Pearl's Causal Hierarchy

Julian Dörfler, Benito van der Zander, Markus Bläser et al.

ICLR 2025 • arXiv:2405.07373
5 citations
#7384

Sensitivity-Constrained Fourier Neural Operators for Forward and Inverse Problems in Parametric Differential Equations

Abdolmehdi Behroozi, Chaopeng Shen, Daniel Kifer

ICLR 2025 • arXiv:2505.08740
5 citations
#7385

Sharpness-Aware Black-Box Optimization

Feiyang Ye, Yueming Lyu, Xuehao Wang et al.

ICLR 2025 • arXiv:2410.12457
5 citations
#7386

Learning to Communicate Through Implicit Communication Channels

Han Wang, Binbin Chen, zhang et al.

ICLR 2025 • arXiv:2411.01553
5 citations
#7387

MGDA Converges under Generalized Smoothness, Provably

Qi Zhang, Peiyao Xiao, Shaofeng Zou et al.

ICLR 2025 • arXiv:2405.19440
5 citations
#7388

any4: Learned 4-bit Numeric Representation for LLMs

Mostafa Elhoushi, Jeff Johnson

ICML 2025 • arXiv:2507.04610
5 citations
#7389

Angle Domain Guidance: Latent Diffusion Requires Rotation Rather Than Extrapolation

Cheng Jin, Zhenyu Xiao, Chutao Liu et al.

ICML 2025 • arXiv:2506.11039
5 citations
#7390

On Generalization Across Environments In Multi-Objective Reinforcement Learning

Jayden Teoh, Pradeep Varakantham, Peter Vamplew

ICLR 2025 • arXiv:2503.00799
5 citations
#7391

Through the Dual-Prism: A Spectral Perspective on Graph Data Augmentation for Graph Classifications

Yutong Xia, Runpeng Yu, Yuxuan Liang et al.

AAAI 2025 • paper • arXiv:2401.09953
5 citations
#7392

Universal Approximation Theorem of Deep Q-Networks

Qian Qi

ICML 2025 • arXiv:2505.02288
5 citations
#7393

On-the-fly Preference Alignment via Principle-Guided Decoding

Mingye Zhu, Yi Liu, Lei Zhang et al.

ICLR 2025 • arXiv:2502.14204
5 citations
#7394

How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation

Yining Pan, Qiongjie Cui, Xulei Yang et al.

ICML 2025 • arXiv:2505.18956
5 citations
#7395

Hierarchically-Structured Open-Vocabulary Indoor Scene Synthesis with Pre-trained Large Language Model

Weilin Sun, Xinran Li, Manyi Li et al.

AAAI 2025 • paper • arXiv:2502.10675
5 citations
#7396

LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits

Zikai Zhou, Qizheng Zhang, Hermann Kumbong et al.

ICML 2025 • arXiv:2502.08141
5 citations
#7397

Improved Rates of Differentially Private Nonconvex-Strongly-Concave Minimax Optimization

Ruijia Zhang, Mingxi Lei, Meng Ding et al.

AAAI 2025 • paper • arXiv:2503.18317
5 citations
#7398

Feedback Favors the Generalization of Neural ODEs

Jindou Jia, Zihan Yang, Meng Wang et al.

ICLR 2025 • arXiv:2410.10253
5 citations
#7399

SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation

Song Duong, Florian Le Bronnec, Alexandre Allauzen et al.

ICLR 2025 • arXiv:2502.13674
5 citations
#7400

Guided Search Strategies in Non-Serializable Environments with Applications to Software Engineering Agents

Karina Zainullina, Aleksandr Golubev, Maria Trofimova et al.

ICML 2025 • arXiv:2505.13652
5 citations