Most Cited ICML "primitive number scheduling" Papers

5,975 papers found • Page 7 of 30

#1201

Prediction-powered Generalization of Causal Inferences

Ilker Demirel, Ahmed Alaa, Anthony Philippakis et al.

ICML 2024arXiv:2406.02873
15
citations
#1202

PILAF: Optimal Human Preference Sampling for Reward Modeling

Yunzhen Feng, Ariel Kwiatkowski, Kunhao Zheng et al.

ICML 2025arXiv:2502.04270
15
citations
#1203

LOCATE 3D: Real-World Object Localization via Self-Supervised Learning in 3D

Paul McVay, Sergio Arnaud, Ada Martin et al.

ICML 2025spotlightarXiv:2504.14151
15
citations
#1204

MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation Models

Mahir Labib Dihan, Tanvir Hassan, Md Tanvir Parvez et al.

ICML 2025spotlightarXiv:2501.00316
15
citations
#1205

ContPhy: Continuum Physical Concept Learning and Reasoning from Videos

Zhicheng Zheng, Xin Yan, Zhenfang Chen et al.

ICML 2024arXiv:2402.06119
15
citations
#1206

Position: Editing Large Language Models Poses Serious Safety Risks

Paul Youssef, Zhixue Zhao, Daniel Braun et al.

ICML 2025arXiv:2502.02958
15
citations
#1207

ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under Domain Shifts

Samar Khanna, Medhanie Irgau, David Lobell et al.

ICML 2025arXiv:2406.10973
14
citations
#1208

Autoformalizing Euclidean Geometry

Logan Murphy, Kaiyu Yang, Jialiang Sun et al.

ICML 2024arXiv:2405.17216
14
citations
#1209

How to Synthesize Text Data without Model Collapse?

Xuekai Zhu, Daixuan Cheng, Hengli Li et al.

ICML 2025arXiv:2412.14689
14
citations
#1210

RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning

Yuanhuiyi Lyu, Xu Zheng, Lutao Jiang et al.

ICML 2025arXiv:2502.00848
14
citations
#1211

How Compositional Generalization and Creativity Improve as Diffusion Models are Trained

Alessandro Favero, Antonio Sclocchi, Francesco Cagnetta et al.

ICML 2025arXiv:2502.12089
14
citations
#1212

A Hitchhiker's Guide to Scaling Law Estimation

Leshem Choshen, Yang Zhang, Jacob Andreas

ICML 2025arXiv:2410.11840
14
citations
#1213

Positional Knowledge is All You Need: Position-induced Transformer (PiT) for Operator Learning

Junfeng CHEN, Kailiang Wu

ICML 2024arXiv:2405.09285
14
citations
#1214

Position: Benchmarking is Limited in Reinforcement Learning Research

Scott Jordan, Adam White, Bruno da Silva et al.

ICML 2024arXiv:2406.16241
14
citations
#1215

Test-Time Degradation Adaptation for Open-Set Image Restoration

Yuanbiao Gou, Haiyu Zhao, Boyun Li et al.

ICML 2024spotlightarXiv:2312.02197
14
citations
#1216

Federated Optimization with Doubly Regularized Drift Correction

Xiaowen Jiang, Anton Rodomanov, Sebastian Stich

ICML 2024arXiv:2404.08447
14
citations
#1217

RLVF: Learning from Verbal Feedback without Overgeneralization

Moritz Stephan, Alexander Khazatsky, Eric Mitchell et al.

ICML 2024arXiv:2402.10893
14
citations
#1218

One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation

Jianze Li, Jiezhang Cao, Yong Guo et al.

ICML 2025arXiv:2502.01993
14
citations
#1219

The Double-Ellipsoid Geometry of CLIP

Meir Yossef Levi, Guy Gilboa

ICML 2025arXiv:2411.14517
14
citations
#1220

DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space

Mang Ning, Mingxiao Li, Jianlin Su et al.

ICML 2025arXiv:2412.15032
14
citations
#1221

Rethinking Decision Transformer via Hierarchical Reinforcement Learning

Yi Ma, Jianye Hao, Hebin Liang et al.

ICML 2024arXiv:2311.00267
14
citations
#1222

Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient

Jan Ludziejewski, Maciej Pióro, Jakub Krajewski et al.

ICML 2025arXiv:2502.05172
14
citations
#1223

SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models

Xudong LU, Aojun Zhou, Yuhui Xu et al.

ICML 2024arXiv:2405.16057
14
citations
#1224

A Manifold Perspective on the Statistical Generalization of Graph Neural Networks

Zhiyang Wang, Juan Cervino, Alejandro Ribeiro

ICML 2025arXiv:2406.05225
14
citations
#1225

BRIDGE: Bootstrapping Text to Control Time-Series Generation via Multi-Agent Iterative Optimization and Diffusion Modeling

Hao Li, Yu-Hao Huang, Chang Xu et al.

ICML 2025oralarXiv:2503.02445
14
citations
#1226

Domain Generalisation via Imprecise Learning

Anurag Singh, Siu Lun Chau, Shahine Bouabid et al.

ICML 2024spotlightarXiv:2404.04669
14
citations
#1227

Two Heads Are Better Than One: Boosting Graph Sparse Training via Semantic and Topological Awareness

Guibin Zhang, Yanwei Yue, kun wang et al.

ICML 2024arXiv:2402.01242
14
citations
#1228

DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning

Jianxiong Li, Jinliang Zheng, Yinan Zheng et al.

ICML 2024oralarXiv:2402.18137
14
citations
#1229

Understanding the Emergence of Multimodal Representation Alignment

Megan Tjandrasuwita, Chanakya Ekbote, Liu Ziyin et al.

ICML 2025arXiv:2502.16282
14
citations
#1230

Multi-Sender Persuasion: A Computational Perspective

Safwan Hossain, Tonghan Wang, Tao Lin et al.

ICML 2024arXiv:2402.04971
14
citations
#1231

Attention Meets Post-hoc Interpretability: A Mathematical Perspective

Gianluigi Lopardo, Frederic Precioso, Damien Garreau

ICML 2024arXiv:2402.03485
14
citations
#1232

CROW: Eliminating Backdoors from Large Language Models via Internal Consistency Regularization

Nay Myat Min, Long H. Pham, Yige Li et al.

ICML 2025arXiv:2411.12768
14
citations
#1233

Reward-Guided Iterative Refinement in Diffusion Models at Test-Time with Applications to Protein and DNA Design

Masatoshi Uehara, su, Yulai Zhao et al.

ICML 2025arXiv:2502.14944
14
citations
#1234

Verifying message-passing neural networks via topology-based bounds tightening

Christopher Hojny, Shiqiang Zhang, Juan Campos et al.

ICML 2024arXiv:2402.13937
14
citations
#1235

Beyond Sensor Data: Foundation Models of Behavioral Data from Wearables Improve Health Predictions

Eray Erturk, Fahad Kamran, Salar Abbaspourazad et al.

ICML 2025oralarXiv:2507.00191
14
citations
#1236

GRAM: A Generative Foundation Reward Model for Reward Generalization

Chenglong Wang, Yang Gan, Yifu Huo et al.

ICML 2025arXiv:2506.14175
14
citations
#1237

Rejuvenating image-GPT as Strong Visual Representation Learners

Sucheng Ren, Zeyu Wang, Hongru Zhu et al.

ICML 2024arXiv:2312.02147
14
citations
#1238

How to Escape Sharp Minima with Random Perturbations

Kwangjun Ahn, Ali Jadbabaie, Suvrit Sra

ICML 2024arXiv:2305.15659
14
citations
#1239

On The Complexity of First-Order Methods in Stochastic Bilevel Optimization

Jeongyeol Kwon, Dohyun Kwon, Hanbaek Lyu

ICML 2024arXiv:2402.07101
14
citations
#1240

SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning

Chaoqun Du, Yizeng Han, Gao Huang

ICML 2024arXiv:2402.13505
14
citations
#1241

Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection

Feiran Li, Qianqian Xu, Shilong Bao et al.

ICML 2024spotlightarXiv:2405.09782
14
citations
#1242

The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling

Jiajun Ma, Shuchen Xue, Tianyang Hu et al.

ICML 2024arXiv:2402.15170
14
citations
#1243

HyperFields: Towards Zero-Shot Generation of NeRFs from Text

Sudarshan Babu, Richard Liu, Zi Yu Zhou et al.

ICML 2024arXiv:2310.17075
14
citations
#1244

BayOTIDE: Bayesian Online Multivariate Time Series Imputation with Functional Decomposition

Shikai Fang, Qingsong Wen, Yingtao Luo et al.

ICML 2024oralarXiv:2308.14906
14
citations
#1245

DPCore: Dynamic Prompt Coreset for Continual Test-Time Adaptation

Yunbei Zhang, Akshay Mehra, Shuaicheng Niu et al.

ICML 2025arXiv:2406.10737
14
citations
#1246

Hyperspherical Normalization for Scalable Deep Reinforcement Learning

Hojoon Lee, Youngdo Lee, Takuma Seno et al.

ICML 2025spotlightarXiv:2502.15280
14
citations
#1247

Interpreting and Improving Diffusion Models from an Optimization Perspective

Frank Permenter, Chenyang Yuan

ICML 2024arXiv:2306.04848
14
citations
#1248

SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation

Danni Yang, Jiayi Ji, Yiwei Ma et al.

ICML 2024arXiv:2406.01451
14
citations
#1249

BOtied: Multi-objective Bayesian optimization with tied multivariate ranks

Ji Won Park, Natasa Tagasovska, Michael Maser et al.

ICML 2024arXiv:2306.00344
14
citations
#1250

Raising the Bar: Investigating the Values of Large Language Models via Generative Evolving Testing

Han Jiang, Xiaoyuan Yi, Zhihua Wei et al.

ICML 2025oralarXiv:2406.14230
14
citations
#1251

Trainable Transformer in Transformer

Abhishek Panigrahi, Sadhika Malladi, Mengzhou Xia et al.

ICML 2024arXiv:2307.01189
14
citations
#1252

Prompt-guided Precise Audio Editing with Diffusion Models

Manjie Xu, Chenxing Li, Duzhen Zhang et al.

ICML 2024arXiv:2406.04350
14
citations
#1253

Let Go of Your Labels with Unsupervised Transfer

Artyom Gadetsky, Yulun Jiang, Maria Brbic

ICML 2024arXiv:2406.07236
14
citations
#1254

Discovering Bias in Latent Space: An Unsupervised Debiasing Approach

Dyah Adila, Shuai Zhang, Boran Han et al.

ICML 2024arXiv:2406.03631
14
citations
#1255

AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization

Junkang Wu, xue wang, Zhengyi Yang et al.

ICML 2025arXiv:2410.10148
14
citations
#1256

Enhancing Adversarial Robustness in SNNs with Sparse Gradients

Yujia Liu, Tong Bu, Ding Jianhao et al.

ICML 2024arXiv:2405.20355
14
citations
#1257

LlavaGuard: An Open VLM-based Framework for Safeguarding Vision Datasets and Models

Lukas Helff, Felix Friedrich, Manuel Brack et al.

ICML 2025arXiv:2406.05113
14
citations
#1258

DeepPolar: Inventing Nonlinear Large-Kernel Polar Codes via Deep Learning

Ashwin Hebbar, Sravan Kumar Ankireddy, Hyeji Kim et al.

ICML 2024arXiv:2402.08864
14
citations
#1259

MIB: A Mechanistic Interpretability Benchmark

Aaron Mueller, Atticus Geiger, Sarah Wiegreffe et al.

ICML 2025arXiv:2504.13151
14
citations
#1260

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Yung-Sung Chuang, Benjamin Cohen-Wang, Shannon Shen et al.

ICML 2025arXiv:2502.09604
14
citations
#1261

Principled Preferential Bayesian Optimization

Wenjie Xu, Wenbin Wang, Yuning Jiang et al.

ICML 2024arXiv:2402.05367
14
citations
#1262

SECOND: Mitigating Perceptual Hallucination in Vision-Language Models via Selective and Contrastive Decoding

Woohyeon Park, Woojin Kim, Jaeik Kim et al.

ICML 2025arXiv:2506.08391
14
citations
#1263

On the Impact of Hard Adversarial Instances on Overfitting in Adversarial Training

Chen Liu, Zhichao Huang, Mathieu Salzmann et al.

ICML 2025arXiv:2112.07324
14
citations
#1264

Arrows of Time for Large Language Models

Vassilis Papadopoulos, Jérémie Wenger, Clement Hongler

ICML 2024arXiv:2401.17505
14
citations
#1265

AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence

Yuliang Liu, Junjie Lu, Chaofeng Qu et al.

ICML 2025arXiv:2502.13943
14
citations
#1266

Integrating Multimodal Data for Joint Generative Modeling of Complex Dynamics

Manuel Brenner, Florian Hess, Georgia Koppe et al.

ICML 2024oralarXiv:2212.07892
14
citations
#1267

Improving Gradient-Guided Nested Sampling for Posterior Inference

Pablo Lemos, Nikolay Malkin, Will Handley et al.

ICML 2024arXiv:2312.03911
14
citations
#1268

What is the Long-Run Distribution of Stochastic Gradient Descent? A Large Deviations Analysis

Waïss Azizian, Franck Iutzeler, Jérôme Malick et al.

ICML 2024arXiv:2406.09241
14
citations
#1269

Policy Learning for Balancing Short-Term and Long-Term Rewards

Peng Wu, Ziyu Shen, Feng Xie et al.

ICML 2024arXiv:2405.03329
14
citations
#1270

SITCOM: Step-wise Triple-Consistent Diffusion Sampling For Inverse Problems

Ismail Alkhouri, Shijun Liang, Cheng-Han Huang et al.

ICML 2025arXiv:2410.04479
14
citations
#1271

Variational Rectified Flow Matching

Pengsheng Guo, Alex Schwing

ICML 2025arXiv:2502.09616
14
citations
#1272

MiraGe: Editable 2D Images using Gaussian Splatting

Joanna Waczyńska, Tomasz Szczepanik, Piotr Borycki et al.

ICML 2025arXiv:2410.01521
14
citations
#1273

Accelerating PDE Data Generation via Differential Operator Action in Solution Space

huanshuo dong, Hong Wang, Haoyang Liu et al.

ICML 2024arXiv:2402.05957
14
citations
#1274

Open-Vocabulary Calibration for Fine-tuned CLIP

Shuoyuan Wang, Jindong Wang, Guoqing Wang et al.

ICML 2024arXiv:2402.04655
14
citations
#1275

Contextual Bandits for Unbounded Context Distributions

Puning Zhao, Rongfei Fan, Shaowei Wang et al.

ICML 2025arXiv:2408.09655
14
citations
#1276

Robust Stable Spiking Neural Networks

Ding Jianhao, Zhiyu Pan, Yujia Liu et al.

ICML 2024arXiv:2405.20694
14
citations
#1277

Expand-and-Cluster: Parameter Recovery of Neural Networks

Flavio Martinelli, Berfin Simsek, Wulfram Gerstner et al.

ICML 2024arXiv:2304.12794
14
citations
#1278

Editable Concept Bottleneck Models

Lijie Hu, Chenyang Ren, Zhengyu Hu et al.

ICML 2025arXiv:2405.15476
14
citations
#1279

Deliberation in Latent Space via Differentiable Cache Augmentation

Luyang Liu, Jonas Pfeiffer, Jiaxing Wu et al.

ICML 2025arXiv:2412.17747
14
citations
#1280

QuEST: Stable Training of LLMs with 1-Bit Weights and Activations

Andrei Panferov, Jiale Chen, Rush Tabesh et al.

ICML 2025arXiv:2502.05003
14
citations
#1281

ProSec: Fortifying Code LLMs with Proactive Security Alignment

Xiangzhe Xu, Zian Su, Jinyao Guo et al.

ICML 2025arXiv:2411.12882
13
citations
#1282

Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries

Junhyuck Kim, Jongho Park, Jaewoong Cho et al.

ICML 2025arXiv:2412.08890
13
citations
#1283

ETTA: Elucidating the Design Space of Text-to-Audio Models

Sang-gil Lee, Zhifeng Kong, ARUSHI GOEL et al.

ICML 2025arXiv:2412.19351
13
citations
#1284

A Circuit Domain Generalization Framework for Efficient Logic Synthesis in Chip Design

Zhihai Wang, Lei Chen, Jie Wang et al.

ICML 2024spotlightarXiv:2309.03208
13
citations
#1285

Geometry Informed Tokenization of Molecules for Language Model Generation

Xiner Li, Limei Wang, Youzhi Luo et al.

ICML 2025arXiv:2408.10120
13
citations
#1286

Instruction-Following Pruning for Large Language Models

Bairu Hou, Qibin Chen, Jianyu Wang et al.

ICML 2025arXiv:2501.02086
13
citations
#1287

Provable Multi-Task Representation Learning by Two-Layer ReLU Neural Networks

Liam Collins, Hamed Hassani, Mahdi Soltanolkotabi et al.

ICML 2024arXiv:2307.06887
13
citations
#1288

Sparse Inducing Points in Deep Gaussian Processes: Enhancing Modeling with Denoising Diffusion Variational Inference

JIAN XU, Delu Zeng, John Paisley

ICML 2024arXiv:2407.17033
13
citations
#1289

Knowledge Transfer from Vision Foundation Models for Efficient Training of Small Task-specific Models

Raviteja Vemulapalli, Hadi Pouransari, Fartash Faghri et al.

ICML 2024arXiv:2311.18237
13
citations
#1290

Locality-Sensitive Hashing-Based Efficient Point Transformer with Applications in High-Energy Physics

Siqi Miao, Zhiyuan Lu, Mia Liu et al.

ICML 2024arXiv:2402.12535
13
citations
#1291

Superposition Prompting: Improving and Accelerating Retrieval-Augmented Generation

Thomas Merth, Qichen Fu, Mohammad Rastegari et al.

ICML 2024arXiv:2404.06910
13
citations
#1292

Tuning-Free Stochastic Optimization

Ahmed Khaled, Chi Jin

ICML 2024spotlightarXiv:2402.07793
13
citations
#1293

Liger: Linearizing Large Language Models to Gated Recurrent Structures

Disen Lan, Weigao Sun, Jiaxi Hu et al.

ICML 2025arXiv:2503.01496
13
citations
#1294

Community-Invariant Graph Contrastive Learning

Shiyin Tan, Dongyuan Li, Renhe Jiang et al.

ICML 2024arXiv:2405.01350
13
citations
#1295

Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning

Tenglong Liu, Yang Li, Yixing Lan et al.

ICML 2024arXiv:2405.19909
13
citations
#1296

OSSCAR: One-Shot Structured Pruning in Vision and Language Models with Combinatorial Optimization

Xiang Meng, Shibal Ibrahim, Kayhan Behdin et al.

ICML 2024arXiv:2403.12983
13
citations
#1297

EvTexture: Event-driven Texture Enhancement for Video Super-Resolution

Dachun Kai, Jiayao Lu, Yueyi Zhang et al.

ICML 2024oralarXiv:2406.13457
13
citations
#1298

Listenable Maps for Audio Classifiers

Francesco Paissan, Mirco Ravanelli, Cem Subakan

ICML 2024arXiv:2403.13086
13
citations
#1299

Latent Action Learning Requires Supervision in the Presence of Distractors

Alexander Nikulin, Ilya Zisman, Denis Tarasov et al.

ICML 2025arXiv:2502.00379
13
citations
#1300

Improving LLM Video Understanding with 16 Frames Per Second

Yixuan Li, Changli Tang, Jimin Zhuang et al.

ICML 2025oralarXiv:2503.13956
13
citations
#1301

PDE-Controller: LLMs for Autoformalization and Reasoning of PDEs

Mauricio Soroco, Jialin Song, Mengzhou Xia et al.

ICML 2025arXiv:2502.00963
13
citations
#1302

An Image is Worth Multiple Words: Discovering Object Level Concepts using Multi-Concept Prompt Learning

Chen Jin, Ryutaro Tanno, Amrutha Saseendran et al.

ICML 2024arXiv:2310.12274
13
citations
#1303

Polygonal Unadjusted Langevin Algorithms: Creating stable and efficient adaptive algorithms for neural networks

Dongyoung Lim, Sotirios Sabanis

ICML 2024arXiv:2105.13937
13
citations
#1304

Cost-efficient Collaboration between On-device and Cloud Language Models

Avanika Narayan, Dan Biderman, Sabri Eyuboglu et al.

ICML 2025arXiv:2502.15964
13
citations
#1305

Taylor Videos for Action Recognition

Lei Wang, Xiuyuan Yuan, Tom Gedeon et al.

ICML 2024oralarXiv:2402.03019
13
citations
#1306

Zero-Shot Reinforcement Learning via Function Encoders

Tyler Ingebrand, Amy Zhang, Ufuk Topcu

ICML 2024arXiv:2401.17173
13
citations
#1307

CHAI: Clustered Head Attention for Efficient LLM Inference

Saurabh Agarwal, Bilge Acun, Basil Hosmer et al.

ICML 2024arXiv:2403.08058
13
citations
#1308

Simplifying DINO via Coding Rate Regularization

Ziyang Wu, Jingyuan Zhang, Druv Pai et al.

ICML 2025arXiv:2502.10385
13
citations
#1309

Sample-specific Masks for Visual Reprogramming-based Prompting

Chengyi Cai, Zesheng Ye, Lei Feng et al.

ICML 2024spotlightarXiv:2406.03150
13
citations
#1310

Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language Models

Qitan Lv, Jie Wang, Hanzhu Chen et al.

ICML 2024arXiv:2410.15116
13
citations
#1311

Can Classic GNNs Be Strong Baselines for Graph-level Tasks? Simple Architectures Meet Excellence

Yuankai Luo, Lei Shi, Xiao-Ming Wu

ICML 2025arXiv:2502.09263
13
citations
#1312

Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation

Tiansheng Wen, Yifei Wang, Zequn Zeng et al.

ICML 2025oralarXiv:2503.01776
13
citations
#1313

Sign is Not a Remedy: Multiset-to-Multiset Message Passing for Learning on Heterophilic Graphs

Langzhang Liang, Sunwoo Kim, Kijung Shin et al.

ICML 2024arXiv:2405.20652
13
citations
#1314

SAPG: Split and Aggregate Policy Gradients

Jayesh Singla, Ananye Agarwal, Deepak Pathak

ICML 2024arXiv:2407.20230
13
citations
#1315

Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator

Kaiwen Zheng, Yongxin Chen, Huayu Chen et al.

ICML 2025spotlightarXiv:2503.01103
13
citations
#1316

Byzantine-Robust Federated Learning: Impact of Client Subsampling and Local Updates

Youssef Allouah, Sadegh Farhadkhani, Rachid Guerraoui et al.

ICML 2024arXiv:2402.12780
13
citations
#1317

OmniAudio: Generating Spatial Audio from 360-Degree Video

Huadai Liu, Tianyi Luo, Kaicheng Luo et al.

ICML 2025arXiv:2504.14906
13
citations
#1318

What is Dataset Distillation Learning?

William Yang, Ye Zhu, Zhiwei Deng et al.

ICML 2024arXiv:2406.04284
13
citations
#1319

Bayesian Uncertainty for Gradient Aggregation in Multi-Task Learning

Idan Achituve, Idit Diamant, Arnon Netzer et al.

ICML 2024arXiv:2402.04005
13
citations
#1320

Optimal Hessian/Jacobian-Free Nonconvex-PL Bilevel Optimization

Feihu Huang

ICML 2024arXiv:2407.17823
13
citations
#1321

Active Evaluation Acquisition for Efficient LLM Benchmarking

Yang Li, Jie Ma, Miguel Ballesteros et al.

ICML 2025arXiv:2410.05952
13
citations
#1322

Training a Generally Curious Agent

Fahim Tajwar, Yiding Jiang, Abitha Thankaraj et al.

ICML 2025oralarXiv:2502.17543
13
citations
#1323

Epsilon-VAE: Denoising as Visual Decoding

Long Zhao, Sanghyun Woo, Ziyu Wan et al.

ICML 2025arXiv:2410.04081
13
citations
#1324

Latent Thought Models with Variational Bayes Inference-Time Computation

Deqian Kong, Minglu Zhao, Dehong Xu et al.

ICML 2025arXiv:2502.01567
13
citations
#1325

Learning High-Order Relationships of Brain Regions

Weikang Qiu, Huangrui Chu, Selena Wang et al.

ICML 2024arXiv:2312.02203
13
citations
#1326

High-Fidelity Simultaneous Speech-To-Speech Translation

Tom Labiausse, Laurent Mazaré, Edouard Grave et al.

ICML 2025arXiv:2502.03382
13
citations
#1327

Inverse problems with experiment-guided AlphaFold

Sai Advaith Maddipatla, Nadav Bojan, Meital Bojan et al.

ICML 2025arXiv:2502.09372
13
citations
#1328

Unveiling Privacy, Memorization, and Input Curvature Links

Deepak Ravikumar, Efstathia Soufleri, Abolfazl Hashemi et al.

ICML 2024arXiv:2402.18726
13
citations
#1329

Retrieval-Augmented Score Distillation for Text-to-3D Generation

Junyoung Seo, Susung Hong, Wooseok Jang et al.

ICML 2024arXiv:2402.02972
13
citations
#1330

Restoring balance: principled under/oversampling of data for optimal classification

Emanuele Loffredo, Mauro Pastore, Simona Cocco et al.

ICML 2024arXiv:2405.09535
13
citations
#1331

The Relative Value of Prediction in Algorithmic Decision Making

Juan Perdomo

ICML 2024arXiv:2312.08511
13
citations
#1332

LLMs on the Line: Data Determines Loss-to-Loss Scaling Laws

Prasanna Mayilvahanan, Thaddäus Wiedemer, Sayak Mallick et al.

ICML 2025arXiv:2502.12120
13
citations
#1333

Recurrent Early Exits for Federated Learning with Heterogeneous Clients

Royson Lee, Javier Fernandez-Marques, Xu Hu et al.

ICML 2024arXiv:2405.14791
13
citations
#1334

UPOCR: Towards Unified Pixel-Level OCR Interface

Dezhi Peng, Zhenhua Yang, Jiaxin Zhang et al.

ICML 2024arXiv:2312.02694
13
citations
#1335

Realistic Unsupervised CLIP Fine-tuning with Universal Entropy Optimization

Jian Liang, Sheng, Zhengbo Wang et al.

ICML 2024spotlightarXiv:2308.12919
13
citations
#1336

X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP

Hanxun Huang, Sarah Erfani, Yige Li et al.

ICML 2025arXiv:2505.05528
13
citations
#1337

Deeper or Wider: A Perspective from Optimal Generalization Error with Sobolev Loss

Yahong Yang, Juncai He

ICML 2024arXiv:2402.00152
13
citations
#1338

Offline Multi-Objective Optimization

Ke Xue, Rong-Xi Tan, Xiaobin Huang et al.

ICML 2024arXiv:2406.03722
13
citations
#1339

Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty

Meera Hahn, Wenjun Zeng, Nithish Kannen et al.

ICML 2025arXiv:2412.06771
13
citations
#1340

Random Masking Finds Winning Tickets for Parameter Efficient Fine-tuning

Jing Xu, Jingzhao Zhang

ICML 2024arXiv:2405.02596
13
citations
#1341

Recovering the Pre-Fine-Tuning Weights of Generative Models

Eliahu Horwitz, Jonathan Kahana, Yedid Hoshen

ICML 2024arXiv:2402.10208
13
citations
#1342

Policy Filtration for RLHF to Mitigate Noise in Reward Models

Chuheng Zhang, Wei Shen, Li Zhao et al.

ICML 2025arXiv:2409.06957
13
citations
#1343

Non-convex Stochastic Composite Optimization with Polyak Momentum

Yuan Gao, Anton Rodomanov, Sebastian Stich

ICML 2024arXiv:2403.02967
13
citations
#1344

A Neural-Guided Dynamic Symbolic Network for Exploring Mathematical Expressions from Data

Wenqiang Li, Weijun Li, Lina Yu et al.

ICML 2024arXiv:2309.13705
13
citations
#1345

How Contaminated Is Your Benchmark? Measuring Dataset Leakage in Large Language Models with Kernel Divergence

Hyeong Kyu Choi, Maxim Khanov, Hongxin Wei et al.

ICML 2025
13
citations
#1346

Distributional Diffusion Models with Scoring Rules

Valentin De Bortoli, Alexandre Galashov, J Swaroop Guntupalli et al.

ICML 2025arXiv:2502.02483
13
citations
#1347

One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs

Yinghui Li, Jiayi Kuang, Haojing Huang et al.

ICML 2025arXiv:2502.10454
13
citations
#1348

MusicFlow: Cascaded Flow Matching for Text Guided Music Generation

Prajwal K R, Bowen Shi, Matthew Le et al.

ICML 2024arXiv:2410.20478
13
citations
#1349

Recurrent Distance Filtering for Graph Representation Learning

Yuhui Ding, Antonio Orvieto, Bobby He et al.

ICML 2024arXiv:2312.01538
13
citations
#1350

Rethinking Generative Large Language Model Evaluation for Semantic Comprehension

Fangyun Wei, Xi Chen, Lin Luo

ICML 2024arXiv:2403.07872
13
citations
#1351

Large Language-Geometry Model: When LLM meets Equivariance

Zongzhao Li, Jiacheng Cen, Bing Su et al.

ICML 2025arXiv:2502.11149
13
citations
#1352

MindLLM: A Subject-Agnostic and Versatile Model for fMRI-to-text Decoding

Weikang Qiu, Zheng Huang, Haoyu Hu et al.

ICML 2025arXiv:2502.15786
13
citations
#1353

A Memory Efficient Randomized Subspace Optimization Method for Training Large Language Models

Yiming Chen, yuan zhang, Yin Liu et al.

ICML 2025arXiv:2502.07222
13
citations
#1354

Adaptive Proximal Gradient Methods Are Universal Without Approximation

Konstantinos Oikonomidis, Emanuel Laude, Puya Latafat et al.

ICML 2024spotlightarXiv:2402.06271
13
citations
#1355

Learning the RoPEs: Better 2D and 3D Position Encodings with STRING

Connor Schenck, Isaac Reid, Mithun Jacob et al.

ICML 2025spotlightarXiv:2502.02562
13
citations
#1356

Multi-Fidelity Residual Neural Processes for Scalable Surrogate Modeling

Brooks(Ruijia) Niu, Dongxia Wu, Kai Kim et al.

ICML 2024arXiv:2402.18846
13
citations
#1357

Safe Delta: Consistently Preserving Safety when Fine-Tuning LLMs on Diverse Datasets

Ning LU, Shengcai Liu, Jiahao Wu et al.

ICML 2025arXiv:2505.12038
13
citations
#1358

Improving Neural Additive Models with Bayesian Principles

Kouroche Bouchiat, Alexander Immer, Hugo Yèche et al.

ICML 2024arXiv:2305.16905
13
citations
#1359

FADAS: Towards Federated Adaptive Asynchronous Optimization

Yujia Wang, Shiqiang Wang, Songtao Lu et al.

ICML 2024arXiv:2407.18365
13
citations
#1360

PIDformer: Transformer Meets Control Theory

Tam Nguyen, Cesar Uribe, Tan Nguyen et al.

ICML 2024arXiv:2402.15989
12
citations
#1361

Improving Token-Based World Models with Parallel Observation Prediction

Lior Cohen, Kaixin Wang, Bingyi Kang et al.

ICML 2024arXiv:2402.05643
12
citations
#1362

Harmony in Diversity: Merging Neural Networks with Canonical Correlation Analysis

Stefan Horoi, Albert Manuel Orozco Camacho, Eugene Belilovsky et al.

ICML 2024arXiv:2407.05385
12
citations
#1363

Black-Box Adversarial Attacks on LLM-Based Code Completion

Slobodan Jenko, Niels Mündler, Jingxuan He et al.

ICML 2025arXiv:2408.02509
12
citations
#1364

Topological Neural Networks go Persistent, Equivariant, and Continuous

Yogesh Verma, Amauri Souza, Vikas Garg

ICML 2024arXiv:2406.03164
12
citations
#1365

Taming Knowledge Conflicts in Language Models

Gaotang Li, Yuzhong Chen, Hanghang Tong

ICML 2025spotlightarXiv:2503.10996
12
citations
#1366

Unlocking the Capabilities of Large Vision-Language Models for Generalizable and Explainable Deepfake Detection

Peipeng Yu, Jianwei Fei, Hui Gao et al.

ICML 2025arXiv:2503.14853
12
citations
#1367

Otter: Generating Tests from Issues to Validate SWE Patches

Toufique Ahmed, Jatin Ganhotra, Rangeet Pan et al.

ICML 2025arXiv:2502.05368
12
citations
#1368

EquiAV: Leveraging Equivariance for Audio-Visual Contrastive Learning

Jongsuk Kim, Hyeongkeun Lee, Kyeongha Rho et al.

ICML 2024arXiv:2403.09502
12
citations
#1369

DUPLEX: Dual GAT for Complex Embedding of Directed Graphs

Zhaoru Ke, Hang Yu, Jianguo Li et al.

ICML 2024arXiv:2406.05391
12
citations
#1370

Conditioning Diffusions Using Malliavin Calculus

Jakiw Pidstrigach, Elizabeth Baker, Carles Domingo i Enrich et al.

ICML 2025arXiv:2504.03461
12
citations
#1371

Replicable Learning of Large-Margin Halfspaces

Alkis Kalavasis, Amin Karbasi, Kasper Green Larsen et al.

ICML 2024spotlightarXiv:2402.13857
12
citations
#1372

MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance

Zhixuan Chen, Xing Hu, Dawei Yang et al.

ICML 2025arXiv:2505.03804
12
citations
#1373

Strategy Coopetition Explains the Emergence and Transience of In-Context Learning

Aaditya Singh, Ted Moskovitz, Sara Dragutinović et al.

ICML 2025oralarXiv:2503.05631
12
citations
#1374

Metastable Dynamics of Chain-of-Thought Reasoning: Provable Benefits of Search, RL and Distillation

Juno Kim, Denny Wu, Jason Lee et al.

ICML 2025arXiv:2502.01694
12
citations
#1375

A Universal Class of Sharpness-Aware Minimization Algorithms

Behrooz Tahmasebi, Ashkan Soleymani, Dara Bahri et al.

ICML 2024arXiv:2406.03682
12
citations
#1376

REST: Efficient and Accelerated EEG Seizure Analysis through Residual State Updates

Arshia Afzal, Grigorios Chrysos, Volkan Cevher et al.

ICML 2024oralarXiv:2406.16906
12
citations
#1377

Learning Adversarial MDPs with Stochastic Hard Constraints

Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi et al.

ICML 2025arXiv:2403.03672
12
citations
#1378

RODEO: Robust Outlier Detection via Exposing Adaptive Out-of-Distribution Samples

Hossein Mirzaei, Mohammad Jafari Varnousfaderani, Hamid Reza Dehbashi et al.

ICML 2024arXiv:2501.16971
12
citations
#1379

DeCoOp: Robust Prompt Tuning with Out-of-Distribution Detection

Zhi Zhou, Ming Yang, Jiang-Xin Shi et al.

ICML 2024arXiv:2406.00345
12
citations
#1380

Performative Prediction with Bandit Feedback: Learning through Reparameterization

Yatong Chen, Wei Tang, Chien-Ju Ho et al.

ICML 2024arXiv:2305.01094
12
citations
#1381

Emergent Equivariance in Deep Ensembles

Jan Gerken, Pan Kessel

ICML 2024arXiv:2403.03103
12
citations
#1382

Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards

Yangsibo Huang, Milad Nasr, Anastasios Angelopoulos et al.

ICML 2025oralarXiv:2501.07493
12
citations
#1383

Online Algorithms with Uncertainty-Quantified Predictions

Bo Sun, Jerry Huang, Nicolas Christianson et al.

ICML 2024arXiv:2310.11558
12
citations
#1384

Conformalized Adaptive Forecasting of Heterogeneous Trajectories

Yanfei Zhou, Lars Lindemann, Matteo Sesia

ICML 2024arXiv:2402.09623
12
citations
#1385

CAD-Editor: A Locate-then-Infill Framework with Automated Training Data Synthesis for Text-Based CAD Editing

Yu Yuan, Shizhao Sun, Qi Liu et al.

ICML 2025arXiv:2502.03997
12
citations
#1386

Mixture of Experts Made Intrinsically Interpretable

Xingyi Yang, Constantin Venhoff, Ashkan Khakzar et al.

ICML 2025arXiv:2503.07639
12
citations
#1387

Sparse is Enough in Fine-tuning Pre-trained Large Language Models

Weixi Song, Zuchao Li, Lefei Zhang et al.

ICML 2024spotlightarXiv:2312.11875
12
citations
#1388

Compositional Image Decomposition with Diffusion Models

Jocelin Su, Nan Liu, Yanbo Wang et al.

ICML 2024arXiv:2406.19298
12
citations
#1389

Using AI Uncertainty Quantification to Improve Human Decision-Making

Laura Marusich, Jonathan Bakdash, Yan Zhou et al.

ICML 2024oralarXiv:2309.10852
12
citations
#1390

The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking

Yuchun Miao, Sen Zhang, Liang Ding et al.

ICML 2025arXiv:2501.19358
12
citations
#1391

TimeDART: A Diffusion Autoregressive Transformer for Self-Supervised Time Series Representation

Daoyu Wang, Mingyue Cheng, Zhiding Liu et al.

ICML 2025arXiv:2410.05711
12
citations
#1392

Scaling Inference-Efficient Language Models

Song Bian, Minghao Yan, Shivaram Venkataraman

ICML 2025arXiv:2501.18107
12
citations
#1393

Scaling Laws for Differentially Private Language Models

Ryan McKenna, Yangsibo Huang, Amer Sinha et al.

ICML 2025arXiv:2501.18914
12
citations
#1394

Characterizing Overfitting in Kernel Ridgeless Regression Through the Eigenspectrum

Tin Sum Cheng, Aurelien Lucchi, Anastasis Kratsios et al.

ICML 2024arXiv:2402.01297
12
citations
#1395

Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning

Jin Hwa Lee, Stefano Mannelli, Andrew Saxe

ICML 2024arXiv:2402.18361
12
citations
#1396

Improving Interpretation Faithfulness for Vision Transformers

Lijie Hu, Yixin Liu, Ninghao Liu et al.

ICML 2024spotlightarXiv:2311.17983
12
citations
#1397

NTPP: Generative Speech Language Modeling for Dual-Channel Spoken Dialogue via Next-Token-Pair Prediction

Qichao Wang, Ziqiao Meng, Wenqian Cui et al.

ICML 2025arXiv:2506.00975
12
citations
#1398

Towards Graph Foundation Models: Learning Generalities Across Graphs via Task-Trees

Zehong Wang, Zheyuan Zhang, Tianyi MA et al.

ICML 2025arXiv:2412.16441
12
citations
#1399

Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces

Kevin Rojas, Yuchen Zhu, Sichen Zhu et al.

ICML 2025arXiv:2506.07903
12
citations
#1400

Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning

Kyle Hsu, Jubayer Ibn Hamid, Kaylee Burns et al.

ICML 2024arXiv:2404.10282
12
citations