Online Learning in the Random Order Model

2510.02820v1 cs.LG, cs.DS 2025-10-07

Authors:

Martino Bernasconi, Andrea Celli, Riccardo Colini-Baldeschi, Federico Fusco, Stefano Leonardi, Matteo Russo

Abstract

In the random-order model for online learning, the sequence of losses is chosen upfront by an adversary and presented to the learner after a random permutation. Any random-order input is asymptotically equivalent to a stochastic i.i.d. one, but, for finite times, it may exhibit significant non-stationarity, which can hinder the performance of stochastic learning algorithms. While algorithms for adversarial inputs naturally maintain their regret guarantees in random order, simple no-regret algorithms exist for the stochastic model that fail against random-order instances. In this paper, we propose a general template to adapt stochastic learning algorithms to the random-order model without substantially affecting their regret guarantees. This allows us to recover improved regret bounds for prediction with delays, online learning with constraints, and bandits with switching costs. Finally, we investigate online classification and prove that, in random order, learnability is characterized by the VC dimension rather than the Littlestone dimension, thus providing a further separation from the general adversarial model.
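
As an illustration of the input model described in the abstract, the following minimal Python sketch contrasts a random-order stream (a fixed list of loss vectors shown after a uniform shuffle) with an i.i.d. stream (the same loss vectors sampled with replacement). The function names, the two-action instance, and the use of follow-the-leader as the learner are assumptions made for illustration only; none of them is taken from the paper, and the sketch does not reproduce the paper's template or its bounds.

```python
import random


def random_order_stream(losses, rng):
    """Return the adversary's fixed loss vectors in uniformly random order."""
    stream = list(losses)
    rng.shuffle(stream)
    return stream


def iid_stream(losses, rng):
    """Return len(losses) loss vectors sampled with replacement from the same list."""
    return [rng.choice(losses) for _ in losses]


def ftl_regret(stream):
    """Regret of follow-the-leader against the best fixed action in hindsight.

    Ties are broken in favour of the lowest action index.
    """
    k = len(stream[0])
    cumulative = [0.0] * k
    learner_loss = 0.0
    for loss in stream:
        # Play the action with the smallest cumulative loss so far.
        leader = min(range(k), key=lambda a: cumulative[a])
        learner_loss += loss[leader]
        for a in range(k):
            cumulative[a] += loss[a]
    return learner_loss - min(cumulative)


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical adversarial choice: two actions, losses fixed upfront.
    losses = [(0.0, 1.0)] * 50 + [(1.0, 0.0)] * 50
    shuffled = random_order_stream(losses, rng)
    sampled = iid_stream(losses, rng)
    # A permutation preserves the multiset of losses; i.i.d. sampling need not.
    print("same multiset (random order):", sorted(shuffled) == sorted(losses))
    print("same multiset (i.i.d.):", sorted(sampled) == sorted(losses))
    print("FTL regret, random order:", ftl_regret(shuffled))
    print("FTL regret, i.i.d.:", ftl_regret(sampled))
```

The property the sketch makes visible is that a random-order stream is a sample without replacement: the total loss of every action over the full horizon is fixed in advance, so the losses observed early determine what remains, which is one way the finite-time non-stationarity mentioned above can arise.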

Related papers

Dynamic Algorithm for Explainable k-medians Clustering under lp Norm

Limitations of Membership Queries in Testable Learning

Learning-Augmented Online Bipartite Matching in the Random Arrival Order Model

Learning Intersections of Halfspaces under Factorizable Distribution

Tight Differentially Private PCA via Matrix Coherence
