2024 "online learning" Papers

32 papers found

Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search

Thomy Phan, Taoan Huang, Bistra Dilkina et al.

AAAI 2024paperarXiv:2312.16767
10
citations

Adaptive Online Experimental Design for Causal Discovery

Muhammad Qasim Elahi, Lai Wei, Murat Kocaoglu et al.

ICML 2024spotlightarXiv:2405.11548

Adaptive Robust Learning using Latent Bernoulli Variables

Aleksandr Karakulev, Dave Zachariah, Prashant Singh

ICML 2024posterarXiv:2312.00585

A General Online Algorithm for Optimizing Complex Performance Metrics

Wojciech Kotlowski, Marek Wydmuch, Erik Schultheis et al.

ICML 2024poster

Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning

Nikhil Vyas, Depen Morwani, Rosie Zhao et al.

ICML 2024spotlightarXiv:2306.08590

Conformalized Adaptive Forecasting of Heterogeneous Trajectories

Yanfei Zhou, Lars Lindemann, Matteo Sesia

ICML 2024posterarXiv:2402.09623

Designing Decision Support Systems using Counterfactual Prediction Sets

Eleni Straitouri, Manuel Gomez-Rodriguez

ICML 2024spotlightarXiv:2306.03928

Doubly Perturbed Task Free Continual Learning

Byung Hyun Lee, Min-hwan Oh, Se Young Chun

AAAI 2024paperarXiv:2312.13027
5
citations

Efficient Learning in Polyhedral Games via Best-Response Oracles

Darshan Chakrabarti, Gabriele Farina, Christian Kroer

AAAI 2024paperarXiv:2312.03696

Efficient Online Set-valued Classification with Bandit Feedback

Zhou Wang, Xingye Qiao

ICML 2024posterarXiv:2405.04393

Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing

Ioannis Maniadis Metaxas, Georgios Tzimiropoulos, ioannis Patras

ECCV 2024posterarXiv:2407.11168
2
citations

Factored-Reward Bandits with Intermediate Observations

Marco Mussi, Simone Drago, Marcello Restelli et al.

ICML 2024poster

Federated Combinatorial Multi-Agent Multi-Armed Bandits

Fares Fourati, Mohamed-Slim Alouini, Vaneet Aggarwal

ICML 2024posterarXiv:2405.05950

Graph2Tac: Online Representation Learning of Formal Math Concepts

Lasse Blaauwbroek, Mirek Olšák, Jason Rute et al.

ICML 2024posterarXiv:2401.02949

High-dimensional Linear Bandits with Knapsacks

Wanteng Ma, Dong Xia, Jiashuo Jiang

ICML 2024posterarXiv:2311.01327

Imitation Learning in Discounted Linear MDPs without exploration assumptions

Luca Viano, EFSTRATIOS PANTELEIMON SKOULAKIS, Volkan Cevher

ICML 2024posterarXiv:2405.02181

Leveraging (Biased) Information: Multi-armed Bandits with Offline Data

Wang Chi Cheung, Lixing Lyu

ICML 2024spotlight

Monotone Individual Fairness

Yahav Bechavod

ICML 2024posterarXiv:2403.06812

Nash Incentive-compatible Online Mechanism Learning via Weakly Differentially Private Online Learning

Joon Suk Huh, Kirthevasan Kandasamy

ICML 2024posterarXiv:2407.04898

Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization

Kwang-Sung Jun, Jungtaek Kim

ICML 2024posterarXiv:2402.07341

Non-exemplar Online Class-Incremental Continual Learning via Dual-Prototype Self-Augment and Refinement

Fushuo Huo, Wenchao Xu, Jingcai Guo et al.

AAAI 2024paperarXiv:2303.10891
23
citations

Online Cascade Learning for Efficient Inference over Streams

Lunyiu Nie, Zhimin Ding, Erdong Hu et al.

ICML 2024posterarXiv:2402.04513

Online Isolation Forest

Filippo Leveni, Guilherme Weigert Cassales, Bernhard Pfahringer et al.

ICML 2024posterarXiv:2505.09593

Online Learning in Betting Markets: Profit versus Prediction

Haiqing Zhu, Alexander Soen, Yun Kuen Cheung et al.

ICML 2024posterarXiv:2406.04062

Online Learning in CMDPs: Handling Stochastic and Adversarial Constraints

Francesco Emanuele Stradi, Jacopo Germano, Gianmarco Genalti et al.

ICML 2024poster

Online Learning under Budget and ROI Constraints via Weak Adaptivity

Matteo Castiglioni, Andrea Celli, Christian Kroer

ICML 2024posterarXiv:2302.01203

Online Learning with Bounded Recall

Jon Schneider, Kiran Vodrahalli

ICML 2024posterarXiv:2205.14519

Online Matrix Completion: A Collaborative Approach with Hott Items

Dheeraj Baby, Soumyabrata Pal

ICML 2024posterarXiv:2408.05843

Online Variational Sequential Monte Carlo

Alessandro Mastrototaro, Jimmy Olsson

ICML 2024posterarXiv:2312.12616

Parameterized Projected Bellman Operator

Théo Vincent, Alberto Maria Metelli, Boris Belousov et al.

AAAI 2024paperarXiv:2312.12869
4
citations

Performative Prediction with Bandit Feedback: Learning through Reparameterization

Yatong Chen, Wei Tang, Chien-Ju Ho et al.

ICML 2024posterarXiv:2305.01094

Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise

Kwangjun Ahn, Zhiyu Zhang, Yunbum Kook et al.

ICML 2024posterarXiv:2402.01567