ICML Papers
From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications
Ajay Jaiswal, Yifan Wang, Lu Yin et al.
From Mechanistic Interpretability to Mechanistic Biology: Training, Evaluating, and Interpreting Sparse Autoencoders on Protein Language Models
Etowah Adams, Liam Bai, Minji Lee et al.
From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?
Zhanke Zhou, Xiao Feng, Zhaocheng Zhu et al.
From Pixels to Perception: Interpretable Predictions via Instance-wise Grouped Feature Selection
Moritz Vandenhirtz, Julia Vogt
From RAG to Memory: Non-Parametric Continual Learning for Large Language Models
Bernal Jimenez Gutierrez, Yiheng Shu, Weijian Qi et al.
From Spectrum-free towards Baseline-view-free: Double-track Proximity Driven Multi-view Clustering
Shengju Yu, Dong Zhibin, Siwei Wang et al.
From Theory to Practice: Rethinking Green and Martin Kernels for Unleashing Graph Transformers
Yoon Hyeok Lee, Jaemin Park, Taejin Paik et al.
From Thousands to Billions: 3D Visual Language Grounding via Render-Supervised Distillation from 2D VLMs
Ang Cao, Sergio Arnaud, Oleksandr Maksymets et al.
From Token to Rhythm: A Multi-Scale Approach for ECG-Language Pretraining
Fuying Wang, Jiacheng Xu, Lequan Yu
From Uncertain to Safe: Conformal Adaptation of Diffusion Models for Safe PDE Control
Peiyan Hu, Xiaowei Qian, Wenhao Deng et al.
From Weight-Based to State-Based Fine-Tuning: Further Memory Reduction on LoRA with Parallel Control
Chi Zhang, REN Lianhai, Jingpu Cheng et al.
FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training
Filipp Zmushko, Aleksandr Beznosikov, Martin Takac et al.
FSL-SAGE: Accelerating Federated Split Learning via Smashed Activation Gradient Estimation
Srijith Nair, Michael Lin, Peizhong Ju et al.
FSTLLM: Spatio-Temporal LLM for Few Shot Time Series Forecasting
Yue Jiang, Yile Chen, Xiucheng Li et al.
Fully Dynamic Embedding into $\ell_p$ Spaces
Kiarash Banihashem, Xiang Chen, MohammadTaghi Hajiaghayi et al.
Fully Dynamic Euclidean Bi-Chromatic Matching in Sublinear Update Time
Gramoz Goranci, Peter Kiss, Neel Patel et al.
Fully Heteroscedastic Count Regression with Deep Double Poisson Networks
Spencer Young, Porter Jenkins, Longchao Da et al.
FunBO: Discovering Acquisition Functions for Bayesian Optimization with FunSearch
Virginia Aglietti, Ira Ktena, Jessica Schrouff et al.
Functional Alignment Can Mislead: Examining Model Stitching
Damian Smith, Harvey Mannering, Antonia Marcu
Function Encoders: A Principled Approach to Transfer Learning in Hilbert Spaces
Tyler Ingebrand, Adam Thorpe, Ufuk Topcu
Function-Space Learning Rates
Edward Milsom, Ben Anson, Laurence Aitchison
Function-to-Style Guidance of LLMs for Code Translation
Longhui Zhang, Bin Wang, Jiahao Wang et al.
Fundamental Bias in Inverting Random Sampling Matrices with Application to Sub-sampled Newton
Chengmei Niu, Zhenyu Liao, Zenan Ling et al.
Fundamental limits of learning in sequence multi-index models and deep attention networks: high-dimensional asymptotics and sharp thresholds
Emanuele Troiani, Hugo Cui, Yatin Dandi et al.
Fundamental Limits of Visual Autoregressive Transformers: Universal Approximation Abilities
Yifang Chen, Xiaoyu Li, Yingyu Liang et al.
FuseUNet: A Multi-Scale Feature Fusion Method for U-like Networks
Quansong He, Xiangde Min, Kaishen Wang et al.
Fusing Reward and Dueling Feedback in Stochastic Bandits
Xuchuang Wang, Qirun Zeng, Jinhang Zuo et al.
G-Adaptivity: optimised graph-based mesh relocation for finite element methods
James Rowbottom, Georg Maierhofer, Teo Deveney et al.
Galileo: Learning Global & Local Features of Many Remote Sensing Modalities
Gabriel Tseng, Anthony Fuller, Marlena Reil et al.
Gamma Distribution PCA-Enhanced Feature Learning for Angle-Robust SAR Target Recognition
Chong Zhang, Peng Zhang, Mengke Li
Gandalf the Red: Adaptive Security for LLMs
Niklas Pfister, Václav Volhejn, Manuel Knott et al.
GANQ: GPU-Adaptive Non-Uniform Quantization for Large Language Models
Pengxiang Zhao, Xiaoming Yuan
Gap-Dependent Bounds for Federated $Q$-Learning
Haochen Zhang, Zhong Zheng, Lingzhou Xue
GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model
Zixiang Ai, Zichen Liu, Yuanhang Lei et al.
Gaussian Mixture Flow Matching Models
Hansheng Chen, Kai Zhang, Hao Tan et al.
GaussMark: A Practical Approach for Structural Watermarking of Language Models
Adam Block, Alexander Rakhlin, Ayush Sekhari
GaussMarker: Robust Dual-Domain Watermark for Diffusion Models
Kecen Li, Zhicong Huang, Xinwen Hou et al.
GCAL: Adapting Graph Models to Evolving Domain Shifts
Ziyue Qiao, Qianyi Cai, Hao Dong et al.
G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks
Guibin Zhang, Yanwei Yue, Xiangguo Sun et al.
GEFA: A General Feature Attribution Framework Using Proxy Gradient Estimation
Yi Cai, Thibaud Ardoin, Gerhard Wunder
General agents need world models
Jonathan Richens, Tom Everitt, David Abel
General framework for online-to-nonconvex conversion: Schedule-free SGD is also effective for nonconvex optimization
Kwangjun Ahn, Gagik Magakyan, Ashok Cutkosky
Generalists vs. Specialists: Evaluating LLMs on Highly-Constrained Biophysical Sequence Optimization Tasks
Angelica Chen, Samuel Stanton, Frances Ding et al.
Generalizable Multi-Camera 3D Object Detection from a Single Source via Fourier Cross-View Learning
Xue Zhao, Qinying Gu, Xinbing Wang et al.
Generalization Analysis for Controllable Learning
Yi-Fan Zhang, Xiao Zhang, Min-Ling Zhang
Generalization Analysis for Supervised Contrastive Representation Learning under Non-IID Settings
Minh Hieu Nong, Antoine Ledent
Generalization and Robustness of the Tilted Empirical Risk
Gholamali Aminian, Amir R. Asadi, Tian Li et al.
Generalization Bounds via Meta-Learned Model Representations: PAC-Bayes and Sample Compression Hypernetworks
Benjamin Leblanc, Mathieu Bazinet, Nathaniel D'Amours et al.
Generalization in Federated Learning: A Conditional Mutual Information Framework
Ziqiao Wang, Cheng Long, Yongyi Mao
Generalization of noisy SGD in unbounded non-convex settings
Leello Dadi, Volkan Cevher