Most Cited ICML "learning rate schedule" Papers
5,975 papers found • Page 28 of 30
Conference
Latent Variable Estimation in Bayesian Black-Litterman Models
Thomas Y.L. Lin, Jerry Yao-Chieh Hu, Wan-Jiun Paul Chiou et al.
Convergence of Mean-Field Langevin Stochastic Descent-Ascent for Distributional Minimax Optimization
Zhangyi Liu, Feng Liu, Rui Gao et al.
A-PSRO: A Unified Strategy Learning Method with Advantage Metric for Normal-form Games
Yudong Hu, Haoran Li, Congying Han et al.
In-Context Fine-Tuning for Time-Series Foundation Models
Matthew Faw, Rajat Sen, Yichen Zhou et al.
High-Dimensional Tensor Regression With Oracle Properties
Wenbin Wang, Yu Shi, Ziping Zhao
FourierMamba: Fourier Learning Integration with State Space Models for Image Deraining
Dong Li, Yidi Liu, Xueyang Fu et al.
Analytical Lyapunov Function Discovery: An RL-based Generative Approach
Haohan Zou, Jie Feng, Hao Zhao et al.
T1: Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling
Zhenyu Hou, Xin Lv, Rui Lu et al.
Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent
Santhosh Karnik, Anna Veselovska, Mark Iwen et al.
MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning
Suning Huang, Zheyu Zhang, Tianhai Liang et al.
Efficient Federated Incomplete Multi-View Clustering
Suyuan Liu, Hao Yu, Hao Tan et al.
TraceGrad: a Framework Learning Expressive SO(3)-equivariant Non-linear Representations for Electronic-Structure Hamiltonian Prediction
Shi Yin, Xinyang Pan, fengyan wang et al.
CommVQ: Commutative Vector Quantization for KV Cache Compression
Junyan Li, Yang Zhang, Muhammad Yusuf Hassan et al.
Addressing Concept Mislabeling in Concept Bottleneck Models Through Preference Optimization
Emiliano Penaloza, Tianyue Zhang, Laurent Charlin et al.
On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents
Jen-Tse Huang, Jiaxu Zhou, Tailin Jin et al.
Bayesian Weight Enhancement with Steady-State Adaptation for Test-time Adaptation in Dynamic Environments
Jae-Hong Lee
Are Sparse Autoencoders Useful? A Case Study in Sparse Probing
Subhash Kantamneni, Josh Engels, Senthooran Rajamanoharan et al.
Securing Equal Share: A Principled Approach for Learning Multiplayer Symmetric Games
Jiawei Ge, Yuanhao Wang, Wenzhe Li et al.
Exactly Tight Information-theoretic Generalization Bounds via Binary Jensen-Shannon Divergence
Yuxin Dong, Haoran Guo, Tieliang Gong et al.
All-Purpose Mean Estimation over R: Optimal Sub-Gaussianity with Outlier Robustness and Low Moments Performance
Jasper Lee, Walter McKelvie, Maoyuan Song et al.
Self-Play $Q$-Learners Can Provably Collude in the Iterated Prisoner's Dilemma
Quentin Bertrand, Juan Duque, Emilio Calvano et al.
DiffAdvMAP: Flexible Diffusion-Based Framework for Generating Natural Unrestricted Adversarial Examples
Zhengzhao Pan, Hua Chen, Xiaogang Zhang
Enhancing Visual Localization with Cross-Domain Image Generation
Yuanze Wang, Yichao Yan, Shiming Song et al.
Deep Reinforcement Learning from Hierarchical Preference Design
Alexander Bukharin, Yixiao Li, Pengcheng He et al.
One Stone, Two Birds: Enhancing Adversarial Defense Through the Lens of Distributional Discrepancy
Jiacheng Zhang, Benjamin Rubinstein, Jingfeng Zhang et al.
Reflect-then-Plan: Offline Model-Based Planning through a Doubly Bayesian Lens
Jihwan Jeong, Xiaoyu Wang, Jingmin Wang et al.
Unlocking the Power of Rehearsal in Continual Learning: A Theoretical Perspective
Junze Deng, Qinhang Wu, Peizhong Ju et al.
A Tale of Two Structures: Do LLMs Capture the Fractal Complexity of Language?
Ibrahim Alabdulmohsin, Andreas Steiner
On the Private Estimation of Smooth Transport Maps
Clément Lalanne, Franck Iutzeler, Loubes Jean-Michel et al.
Data-driven Design of Randomized Control Trials with Guaranteed Treatment Effects
Santiago Cortes-Gomez, Naveen Raman, Aarti Singh et al.
LoRA Training Provably Converges to a Low-Rank Global Minimum Or It Fails Loudly (But it Probably Won't Fail)
Junsu Kim, Jaeyeon Kim, Ernest Ryu
Larger or Smaller Reward Margins to Select Preferences for LLM Alignment?
Kexin Huang, Junkang Wu, Ziqian Chen et al.
Causal Logistic Bandits with Counterfactual Fairness Constraints
Jiajun Chen, Jin Tian, Chris Quinn
Rethink GraphODE Generalization within Coupled Dynamical System
Guancheng Wan, Zijie Huang, Wanjia Zhao et al.
Identifying Neural Dynamics Using Interventional State Space Models
Amin Nejatbakhsh, Yixin Wang
Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning
Yiran Wang, Chenshu Liu, Yunfan Li et al.
Invariant Deep Uplift Modeling for Incentive Assignment in Online Marketing via Probability of Necessity and Sufficiency
Zexu Sun, Qiyu Han, Hao Yang et al.
An Improved Clique-Picking Algorithm for Counting Markov Equivalent DAGs via Super Cliques Transfer
Lifu Liu, Shiyuan He, Jianhua Guo
An Online Learning Approach to Prompt-based Selection of Generative Models and LLMs
Xiaoyan Hu, Ho-fung Leung, Farzan Farnia
HYGMA: Hypergraph Coordination Networks with Dynamic Grouping for Multi-Agent Reinforcement Learning
Chiqiang Liu, Dazi Li
Continual Reinforcement Learning by Planning with Online World Models
Zichen Liu, Guoji Fu, Chao Du et al.
AEQA-NAT : Adaptive End-to-end Quantization Alignment Training Framework for Non-autoregressive Machine Translation
Xiangyu Qu, Guojing Liu, Liang Li
Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning
Wanyun Xie, Francesco Tonin, Volkan Cevher
Functional Alignment Can Mislead: Examining Model Stitching
Damian Smith, Harvey Mannering, Antonia Marcu
FicGCN: Unveiling the Homomorphic Encryption Efficiency from Irregular Graph Convolutional Networks
Zhaoxuan Kan, Husheng Han, shangyi shi et al.
Fishers for Free? Approximating the Fisher Information Matrix by Recycling the Squared Gradient Accumulator
YuXin Li, Felix Dangel, Derek Tam et al.
LBI-FL: Low-Bit Integerized Federated Learning with Temporally Dynamic Bit-Width Allocation
Li Ding, Hao Zhang, Wenrui Dai et al.
Making Hard Problems Easier with Custom Data Distributions and Loss Regularization: A Case Study in Modular Arithmetic
Eshika Saxena, Alberto Alfarano, Emily Wenger et al.
Directed Graph Grammars for Sequence-based Learning
Michael Sun, Orion Foo, Gang Liu et al.
SDMG: Smoothing Your Diffusion Models for Powerful Graph Representation Learning
Junyou Zhu, Langzhou He, Chao Gao et al.
Best of Both Worlds: Regret Minimization versus Minimax Play
Adrian Müller, Jon Schneider, EFSTRATIOS PANTELEIMON SKOULAKIS et al.
Training High Performance Spiking Neural Network by Temporal Model Calibration
Jiaqi Yan, Changping Wang, De Ma et al.
Limitations of measure-first protocols in quantum machine learning
Casper Gyurik, Riccardo Molteni, Vedran Dunjko
LDMol: A Text-to-Molecule Diffusion Model with Structurally Informative Latent Space Surpasses AR Models
Jinho Chang, Jong Chul YE
Scalable Non-Equivariant 3D Molecule Generation via Rotational Alignment
Yuhui Ding, Thomas Hofmann
Explicit Discovery of Nonlinear Symmetries from Dynamic Data
Lexiang Hu, Yikang Li, Zhouchen Lin
Falsification of Unconfoundedness by Testing Independence of Causal Mechanisms
Rickard K.A. Karlsson, Jesse H. Krijthe
A Closer Look at Transformers for Time Series Forecasting: Understanding Why They Work and Where They Struggle
Yu Chen, Nathalia Céspedes, Payam Barnaghi
Revisiting Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model
Kaito Ariu, Alexandre Proutiere, Se-Young Yun
Better to Teach than to Give: Domain Generalized Semantic Segmentation via Agent Queries with Diffusion Model Guidance
Fan Li, Xuan Wang, Min Qi et al.
Global Context-aware Representation Learning for Spatially Resolved Transcriptomics
Yunhak Oh, Junseok Lee, Yeongmin Kim et al.
Provably Efficient Algorithm for Best Scoring Rule Identification in Online Principal-Agent Information Acquisition
Zichen Wang, Chuanhao Li, Huazheng Wang
Testing the Limits of Fine-Tuning for Improving Visual Cognition in Vision Language Models
Luca M. Schulze Buschoff, Konstantinos Voudouris, Elif Akata et al.
Black-Box Adversarial Attacks on LLM-Based Code Completion
Slobodan Jenko, Niels Mündler, Jingxuan He et al.
Behavior-agnostic Task Inference for Robust Offline In-context Reinforcement Learning
Long Ma, Fangwei Zhong, Yizhou Wang
Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making
Stelios Triantafyllou, Aleksa Sukovic, Yasaman Zolfimoselo et al.
DeepCrossAttention: Supercharging Transformer Residual Connections
Mike Heddes, Adel Javanmard, Kyriakos Axiotis et al.
BaxBench: Can LLMs Generate Correct and Secure Backends?
Mark Vero, Niels Mündler, Viktor Chibotaru et al.
Automated Hypothesis Validation with Agentic Sequential Falsifications
Kexin Huang, Ying Jin, Ryan Li et al.
ZeroFlow: Overcoming Catastrophic Forgetting is Easier than You Think
Tao Feng, Wei Li, Didi Zhu et al.
TransPL: VQ-Code Transition Matrices for Pseudo-Labeling of Time Series Unsupervised Domain Adaptation
Jaeho Kim, Seulki Lee
Strengthen Out-of-Distribution Detection Capability with Progressive Self-Knowledge Distillation
Yang Yang, Haonan Xu
Learning Joint Interventional Effects from Single-Variable Interventions in Additive Models
Armin Kekić, Sergio Hernan Garrido Mejia, Bernhard Schölkopf
Weakly Supervised Anomaly Detection via Dual-Tailed Kernel
Walid Durani, Tobias Nitzl, Claudia Plant et al.
Q-Supervised Contrastive Representation: A State Decoupling Framework for Safe Offline Reinforcement Learning
Zhihe Yang, Yunjian Xu, Yang Zhang
Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding
Jinze Li, Yixing Xu, Haiduo Huang et al.
iDPA: Instance Decoupled Prompt Attention for Incremental Medical Object Detection
Huahui Yi, Wei Xu, Ziyuan Qin et al.
Stay Hungry, Keep Learning: Sustainable Plasticity for Deep Reinforcement Learning
Huaicheng Zhou, Zifeng Zhuang, Donglin Wang
Fully Heteroscedastic Count Regression with Deep Double Poisson Networks
Spencer Young, Porter Jenkins, Longchao Da et al.
Fully Dynamic Embedding into $\ell_p$ Spaces
Kiarash Banihashem, Xiang Chen, MohammadTaghi Hajiaghayi et al.
PiD: Generalized AI-Generated Images Detection with Pixelwise Decomposition Residuals
Xinghe Fu, Zhiyuan Yan, Zheng Yang et al.
Learning from Sample Stability for Deep Clustering
Zhixin Li, Yuheng Jia, Hui LIU et al.
Flow Matching for Denoised Social Recommendation
Yinxuan Huang, KE LIANG, Zhuofan Dong et al.
Auditing Prompt Caching in Language Model APIs
Chenchen Gu, Xiang Li, Rohith Kuditipudi et al.
Reinforcement Learning with Segment Feedback
Yihan Du, Anna Winnicki, Gal Dalal et al.
Implicit degree bias in the link prediction task
Rachith Aiyappa, Xin Wang, Munjung Kim et al.
Rethinking Score Distilling Sampling for 3D Editing and Generation
Xingyu Miao, Haoran Duan, Yang Long et al.
WildChat-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training
Benjamin Feuer, Chinmay Hegde
HEAP: Hyper Extended A-PDHG Operator for Constrained High-dim PDEs
Mingquan Feng, Weixin Liao, Yixin Huang et al.
Contour Integration Underlies Human-Like Vision
Ben Lonnqvist, Elsa Scialom, Abdulkadir Gokce et al.
BSemiFL: Semi-supervised Federated Learning via a Bayesian Approach
Haozhao Wang, Shengyu Wang, Jiaming Li et al.
Multi-band Frequency Reconstruction for Neural Psychoacoustic Coding
Dianwen Ng, Kun Zhou, Yi-Wen Chao et al.
Instance-Optimal Pure Exploration for Linear Bandits on Continuous Arms
Sho Takemori, Yuhei Umeda, Aditya Gopalan
Flat-LoRA: Low-Rank Adaptation over a Flat Loss Landscape
Tao Li, Zhengbao He, Yujun Li et al.
Square$\chi$PO: Differentially Private and Robust $\chi^2$-Preference Optimization in Offline Direct Alignment
Xingyu Zhou, Yulian Wu, Wenqian Weng et al.
Equivariant Polynomial Functional Networks
Thieu Vo, Viet Hoang Tran, Tho Tran Huu et al.
An Effective and Secure Federated Multi-View Clustering Method with Information-Theoretic Perspective
Xinyue Chen, Jinfeng Peng, Yuhao Li et al.
Deep Neural Cellular Potts Models
Koen Minartz, Tim d'Hondt, Leon Hillmann et al.
The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability
Jiachen Hu, Rui Ai, Han Zhong et al.
Understanding the Logic of Direct Preference Alignment through Logic
Kyle Richardson, Vivek Srikumar, Ashish Sabharwal
Contextual Optimization Under Model Misspecification: A Tractable and Generalizable Approach
Omar Bennouna, Jiawei Zhang, Saurabh Amin et al.
Modularized Self-Reflected Video Reasoner for Multimodal LLM with Application to Video Question Answering
Zihan Song, Xin Wang, Zi Qian et al.
General agents need world models
Jonathan Richens, Tom Everitt, David Abel
Discovering Symbolic Cognitive Models from Human and Animal Behavior
Pablo Samuel Castro, Nenad Tomasev, Ankit Anand et al.
Scalable Approximation Algorithms for $p$-Wasserstein Distance and Its Variants
Nathaniel Lahn, Sharath Raghvendra, Emma Saarinen et al.
SkipGPT: Each Token is One of a Kind
Anhao Zhao, Fanghua Ye, Yingqi Fan et al.
Conservative Offline Goal-Conditioned Implicit V-Learning
Ke Kaiqiang, qian lin, Zongkai Liu et al.
CERTAIN: Context Uncertainty-aware One-Shot Adaptation for Context-based Offline Meta Reinforcement Learning
Hongtu Zhou, Ruiling Yang, Yakun Zhu et al.
InfoCons: Identifying Interpretable Critical Concepts in Point Clouds via Information Theory
Feifei Li, Mi Zhang, Zhaoxiang Wang et al.
Unsupervised Learning for Class Distribution Mismatch
Pan Du, Zhao, Xinai Lu et al.
Be a Goldfish: Forgetting Bad Conditioning in Sparse Linear Regression via Variational Autoencoders
Kuheli Pratihar, Debdeep Mukhopadhyay
Portable Reward Tuning: Towards Reusable Fine-Tuning across Different Pretrained Models
Daiki Chijiwa, Taku Hasegawa, Kyosuke Nishida et al.
Large Language Models to Diffusion Finetuning
Edoardo Cetin, Tianyu Zhao, Yujin Tang
Learning Fused State Representations for Control from Multi-View Observations
Zeyu Wang, Yao-Hui Li, Xin Li et al.
EvFocus: Learning to Reconstruct Sharp Images from Out-of-Focus Event Streams
Lin Zhu, Xiantao Ma, Xiao Wang et al.
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations
Yucheng Hu, Yanjiang Guo, Pengchao Wang et al.
Be Confident: Uncovering Overfitting in MLLM Multi-Task Tuning
Wenke Huang, Jian Liang, Guancheng Wan et al.
Runtime Analysis of Evolutionary NAS for Multiclass Classification
Zeqiong Lv, Chao Qian, Yun Liu et al.
Discrete Markov Probabilistic Models: An Improved Discrete Score-Based Framework with sharp convergence bounds under minimal assumptions
Le Tuyet Nhi PHAM, Dario Shariatian, Antonio Ocello et al.
SeedLoRA: A Fusion Approach to Efficient LLM Fine-Tuning
Yong Liu, Di Fu, Shenggan Cheng et al.
IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models
Hanting Wang, Tao Jin, Wang Lin et al.
Splitting & Integrating: Out-of-Distribution Detection via Adversarial Gradient Attribution
Jiayu Zhang, Xinyi Wang, Zhibo Jin et al.
SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation
Haoquan Fang, Markus Grotz, Wilbert Pumacay et al.
Offline Learning for Combinatorial Multi-armed Bandits
Xutong Liu, Xiangxiang Dai, Jinhang Zuo et al.
Compositional Condition Question Answering in Tabular Understanding
Jun-Peng Jiang, Tao Zhou, De-Chuan Zhan et al.
Federated Incomplete Multi-view Clustering with Globally Fused Graph Guidance
Guoqing Chao, Zhenghao Zhang, Lei Meng et al.
Risk-Sensitive Theory of Mind: Coordinating with Agents of Unknown Bias using Cumulative Prospect Theory
Mason O. Smith, Wenlong Zhang
The Limits of Predicting Agents from Behaviour
Alexis Bellot, Jonathan Richens, Tom Everitt
Reinforcement Learning Control of a Physical Robot Device for Assisted Human Walking without a Simulator
junmin zhong, Emiliano Quinones Yumbla, Seyed Yousef Soltanian et al.
MoHAVE: Mixture of Hierarchical Audio-Visual Experts for Robust Speech Recognition
Sungnyun Kim, Kangwook Jang, Sangmin Bae et al.
Physics Aware Neural Networks for Unsupervised Binding Energy Prediction
Ke Liu, Hao Chen, Chunhua Shen
MARGE: Improving Math Reasoning with Guided Exploration
Jingyue Gao, Runji Lin, Keming Lu et al.
EcoMapper: Generative Modeling for Climate-Aware Satellite Imagery
Muhammed Göktepe, Amir Hossein Shamseddin, Erencan Uysal et al.
Adaptive Flow Matching for Resolving Small-Scale Physics
Stathi Fotiadis, Noah Brenowitz, Tomas Geffner et al.
Learning Mean Field Control on Sparse Graphs
Christian Fabian, Kai Cui, Heinz Koeppl
Safe-EF: Error Feedback for Non-smooth Constrained Optimization
Rustem Islamov, Yarden As, Ilyas Fatkhullin
Fast Tensor Completion via Approximate Richardson Iteration
Mehrdad Ghadiri, Matthew Fahrbach, Yunbum Kook et al.
Habitizing Diffusion Planning for Efficient and Effective Decision Making
Haofei Lu, Yifei Shen, Dongsheng Li et al.
BAME: Block-Aware Mask Evolution for Efficient N:M Sparse Training
Chenyi yang, Wenjie Nie, Yuxin Zhang et al.
Non-Asymptotic and Non-Lipschitzian Bounds on Optimal Values in Stochastic Optimization Under Heavy Tails
Jindong Tong, Hongcheng Liu, Johannes Royset
When to retrain a machine learning model
Florence Regol, Leo Schwinn, Kyle Sprague et al.
Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts
Yike Yuan, Ziyu Wang, Zihao Huang et al.
Feature Shift Localization Network
Míriam Barrabés, Daniel Mas Montserrat, Kapal Dev et al.
Generalization and Robustness of the Tilted Empirical Risk
Gholamali Aminian, Amir R. Asadi, Tian Li et al.
ProDiff: Prototype-Guided Diffusion for Minimal Information Trajectory Imputation
Tianci Bu, Le Zhou, Wenchuan Yang et al.
Phase and Amplitude-aware Prompting for Enhancing Adversarial Robustness
Yibo Xu, Dawei Zhou, Decheng Liu et al.
MVA: Linear Attention with High-order Query-Keys Integration and Multi-level Vocabulary Decomposition
ning wang, Zekun Li, Tongxin Bai et al.
Harmonizing Geometry and Uncertainty: Diffusion with Hyperspheres
Muskan Dosi, Chiranjeev Chiranjeev, Kartik Thakral et al.
Private Model Personalization Revisited
Conor Snedeker, Xinyu Zhou, Raef Bassily
Online Learning in the Random-Order Model
Martino Bernasconi, Andrea Celli, Riccardo Colini Baldeschi et al.
LEVIS: Large Exact Verifiable Input Spaces for Neural Networks
Mohamad Chehade, Wenting Li, Brian Bell et al.
Batch List-Decodable Linear Regression via Higher Moments
Ilias Diakonikolas, Daniel Kane, Sushrut Karmalkar et al.
DiffusionVLA: Scaling Robot Foundation Models via Unified Diffusion and Autoregression
Junjie Wen, Yichen Zhu, Minjie Zhu et al.
TabSDS: a Lightweight, Fully Non-Parametric, and Model Free Approach for Generating Synthetic Tabular Data
Elias Chaibub Neto
Token Signature: Predicting Chain-of-Thought Gains with Token Decoding Feature in Large Language Models
peijie liu, Fengli Xu, Yong Li
Efficient Core-set Selection for Deep Learning Through Squared Loss Minimization
Jianting Chen
A Sub-Problem Quantum Alternating Operator Ansatz for Correlation Clustering
Lucas Fabian Naumann, Jannik Irmai, Bjoern Andres
Rethinking Benign Overfitting in Two-Layer Neural Networks
Ruichen Xu, Kexin Chen
Efficient Noise Calculation in Deep Learning-based MRI Reconstructions
Onat Dalmaz, Arjun Desai, Reinhard Heckel et al.
Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity
Alessandro Pierro, Steven Abreu, Jonathan Timcheck et al.
Can DBNNs Robust to Environmental Noise for Resource-constrained Scenarios?
Wendong Zheng, Junyang Chen, Husheng Guo et al.
Probably Approximately Global Robustness Certification
Peter Blohm, Patrick Indri, Thomas Gärtner et al.
Graph Adaptive Autoregressive Moving Average Models
Moshe Eliasof, Alessio Gravina, Andrea Ceni et al.
Understanding the difficulties of posterior predictive estimation
Abhinav Agrawal, Justin Domke
Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting
Chen Huang, Skyler Seto, Hadi Pouransari et al.
Stochastic Forward–Backward Deconvolution: Training Diffusion Models with Finite Noisy Datasets
Haoye Lu, Qifan Wu, Yaoliang Yu
Generative Audio Language Modeling with Continuous-valued Tokens and Masked Next-Token Prediction
Shu-wen Yang, Byeonggeun Kim, Kuan Po Huang et al.
What Has a Foundation Model Found? Inductive Bias Reveals World Models
Keyon Vafa, Peter Chang, Ashesh Rambachan et al.
Dialogue Without Limits: Constant-Sized KV Caches for Extended Response in LLMs
Ravi Ghadia, Avinash Kumar, Gaurav Jain et al.
Tree-Sliced Wasserstein Distance: A Geometric Perspective
Viet Hoang Tran, Trang Pham, Tho Tran Huu et al.
Off-Policy Evaluation under Nonignorable Missing Data
Han Wang, Yang Xu, Wenbin Lu et al.
Self-supervised Adversarial Purification for Graph Neural Networks
Woohyun Lee, Hogun Park
Deep Linear Network Training Dynamics from Random Initialization: Data, Width, Depth, and Hyperparameter Transfer
Blake Bordelon, Cengiz Pehlevan
LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models
Dachuan Shi, Yonggan Fu, Xiangchi Yuan et al.
Chaos Meets Attention: Transformers for Large-Scale Dynamical Prediction
Yi He, Yiming Yang, Xiaoyuan Cheng et al.
Improving Diversity in Language Models: When Temperature Fails, Change the Loss
Alexandre Verine, Florian Le Bronnec, Kunhao Zheng et al.
Efficient Network Automatic Relevance Determination
Hongwei Zhang, Ziqi Ye, Xinyuan Wang et al.
Learning Compact Semantic Information for Incomplete Multi-View Missing Multi-Label Classification
Jie Wen, Yadong Liu, Zhanyan Tang et al.
Sparse Causal Discovery with Generative Intervention for Unsupervised Graph Domain Adaptation
Junyu Luo, Yuhao Tang, Yiwei Fu et al.
Theoretically Unmasking Inference Attacks Against LDP-Protected Clients in Federated Vision Models
Quan Nguyen, Minh Vu, Truc Nguyen et al.
Federated Causal Structure Learning with Non-identical Variable Sets
Yunxia Wang, Fuyuan CAO, Kui Yu et al.
NExtLong: Toward Effective Long-Context Training without Long Documents
Chaochen Gao, Xing W, Zijia Lin et al.
Optimizing Social Network Interventions via Hypergradient-Based Recommender System Design
Marino Kühne, Panagiotis D. Grontas, Giulia De Pasquale et al.
Guarantees of a Preconditioned Subgradient Algorithm for Overparameterized Asymmetric Low-rank Matrix Recovery
Paris Giampouras, HanQin Cai, Rene Vidal
New Bounds for Sparse Variational Gaussian Processes
Michalis Titsias
Adapter Naturally Serves as Decoupler for Cross-Domain Few-Shot Semantic Segmentation
Jintao Tong, Ran Ma, Yixiong Zou et al.
Learning with Selectively Labeled Data from Multiple Decision-makers
Jian Chen, Zhehao Li, Xiaojie Mao
MiraGe: Editable 2D Images using Gaussian Splatting
Joanna Waczyńska, Tomasz Szczepanik, Piotr Borycki et al.
Scalable Generation of Spatial Transcriptomics from Histology Images via Whole-Slide Flow Matching
Tinglin Huang, Tianyu Liu, Mehrtash Babadi et al.
Enhancing Spectral GNNs: From Topology and Perturbation Perspectives
Taoyang Qin, Ke-Jia CHEN, Zheng Liu
GRU: Mitigating the Trade-off between Unlearning and Retention for LLMs
Yue Wang, Qizhou Wang, Feng Liu et al.
TinyMIG: Transferring Generalization from Vision Foundation Models to Single-Domain Medical Imaging
Chuang Liu, Hongyan Xu, Yichao Cao et al.
Scalable Gaussian Processes with Latent Kronecker Structure
Jihao Andreas Lin, Sebastian Ament, Maximilian Balandat et al.
LGDM: Latent Guidance in Diffusion Models for Perceptual Evaluations
Shreshth Saini, Ru-Ling Liao, Yan Ye et al.
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic
Stefano Viel, Luca Viano, Volkan Cevher
DS-VLM: Diffusion Supervision Vision Language Model
Zhen Sun, Yunhang Shen, Jie Li et al.
Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation
Taehyun Cho, Seungyub Han, Seokhun Ju et al.
Fragments to Facts: Partial-Information Fragment Inference from LLMs
Lucas Rosenblatt, Bin Han, Robert Wolfe et al.
Soup-of-Experts: Pretraining Specialist Models via Parameters Averaging
Pierre Ablin, Angelos Katharopoulos, Skyler Seto et al.
Navigating Semantic Drift in Task-Agnostic Class-Incremental Learning
Fangwen Wu, Lechao Cheng, Shengeng Tang et al.