2025 "online learning" Papers
31 papers found
Agnostic Continuous-Time Online Learning
Pramith Devulapalli, Changlong Wu, Ananth Grama et al.
Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs
Zhaowei Zhang, Fengshuo Bai, Qizhi Chen et al.
Conformal Online Learning of Deep Koopman Linear Embeddings
Ben Gao, Jordan Patracone, Stephane Chretien et al.
Conservative classifiers do consistently well with improving agents: characterizing statistical and online learning
Dravyansh Sharma, Alec Sun
Event-Driven Online Vertical Federated Learning
Ganyu Wang, Boyu Wang, Bin Gu et al.
Exploring the Noise Robustness of Online Conformal Prediction
HuaJun Xi, Kangdao Liu, Hao Zeng et al.
Fast Direct: Query-Efficient Online Black-box Guidance for Diffusion-model Target Generation
Kim Yong Tan, Yueming Lyu, Ivor Tsang et al.
FCOM: A Federated Collaborative Online Monitoring Framework via Representation Learning
Tanapol Kosolwattana, Huazheng Wang, Raed Al Kontar et al.
Feature-Based Online Bilateral Trade
Solenne Gaucher, Martino Bernasconi, Matteo Castiglioni et al.
Improved Bounds for Swap Multicalibration and Swap Omniprediction
Haipeng Luo, Spandan Senapati, Vatsal Sharan
Improved Regret and Contextual Linear Extension for Pandora's Box and Prophet Inequality
Junyan Liu, Ziyun Chen, Kun Wang et al.
Learning-Augmented Algorithms for $k$-median via Online Learning
Anish Hebbar, Rong Ge, Amit Kumar et al.
Lifelong Test-Time Adaptation via Online Learning in Tracked Low-Dimensional Subspace
Dexin Duan, Rui Xu, Peilin Liu et al.
Longhorn: State Space Models are Amortized Online Learners
Bo Liu, Rui Wang, Lemeng Wu et al.
Markov Persuasion Processes: Learning to Persuade From Scratch
Francesco Bacchiocchi, Francesco Emanuele Stradi, Matteo Castiglioni et al.
Near-Optimal Regret-Queue Length Tradeoff in Online Learning for Two-Sided Markets
Zixian Yang, Sushil Varma, Lei Ying
Non-Stationary Dueling Bandits Under a Weighted Borda Criterion
Joe Suk, Arpit Agarwal
Offline-to-Online Hyperparameter Transfer for Stochastic Bandits
Dravyansh Sharma, Arun Suggala
Online Learning in the Repeated Mediated Newsvendor Problem
Nataša Bolić, Tom Cesari, Roberto Colomboni et al.
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Pouya Hamadanian, Arash Nasr-Esfahany, Malte Schwarzkopf et al.
Online robust locally differentially private learning for nonparametric regression
Chenfei Gu, Qiangqiang Zhang, Ting Li et al.
On the Universal Near Optimality of Hedge in Combinatorial Settings
Zhiyuan Fan, Arnab Maiti, Lillian Ratliff et al.
Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization
Francesco Emanuele Stradi, Matteo Castiglioni, Alberto Marchesi et al.
PhiNets: Brain-inspired Non-contrastive Learning Based on Temporal Prediction Hypothesis
Satoki Ishikawa, Makoto Yamada, Han Bao et al.
Prediction with expert advice under additive noise
Alankrita Bhatt, Victoria Kostina
Replicable Online Learning
Saba Ahmadi, Siddharth Bhandari, Avrim Blum
ReSpec: Relevance and Specificity Grounded Online Filtering for Learning on Video-Text Data Streams
Chris Dongjoo Kim, Jihwan Moon, Sangwoo Moon et al.
Robust Contextual Pricing
Anupam Gupta, Guru Guruganesh, Renato Leme et al.
Statistical Parity with Exponential Weights
Stephen Pasteris, Chris Hicks, Vasilios Mavroudis
Toward Long-Tailed Online Anomaly Detection through Class-Agnostic Concepts
Chiao-An Yang, Kuan-Chuan Peng, Raymond A. Yeh
Tradeoffs between Mistakes and ERM Oracle Calls in Online and Transductive Online Learning
Idan Attias, Steve Hanneke, Arvind Ramaswami