A Best-of-Both-Worlds Proof for Tsallis-INF without Fenchel Conjugates

2511.11211v1 cs.LG, math.OC, stat.ML 2025-11-18

Authors:

Wei-Cheng Lee, Francesco Orabona

Abstract

In this short note, we present a simple derivation of the best-of-both-worlds guarantee for the Tsallis-INF multi-armed bandit algorithm of Zimmert and Seldin (J. Zimmert and Y. Seldin. Tsallis-INF: An optimal algorithm for stochastic and adversarial bandits. Journal of Machine Learning Research, 22(28):1-49, 2021. URL https://jmlr.csail.mit.edu/papers/volume22/19-753/19-753.pdf). In particular, the proof uses modern tools from online convex optimization and avoids the use of conjugate functions. We do not optimize the constants in the bounds, in favor of a slimmer proof.
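
For readers unfamiliar with the algorithm, the following is a minimal sketch of Tsallis-INF, not the note's proof and not the authors' code. It assumes the commonly stated form of the algorithm: follow-the-regularized-leader with the 1/2-Tsallis entropy, whose weights take the form w_i = 4 / (eta * (L_i - x))^2 with x chosen so the weights sum to one. The learning rate eta_t = 2 / sqrt(t), the importance-weighted loss estimator, the bisection search for x, and the names `tsallis_inf_weights` and `run_tsallis_inf` are all assumptions made for illustration.

```python
import math
import random


def tsallis_inf_weights(cum_loss_est, eta, iters=80):
    """Sampling distribution of the assumed form w_i = 4 / (eta * (L_i - x))^2.

    The normalizer x is found by bisection (an illustrative choice).
    """
    k = len(cum_loss_est)
    low = min(cum_loss_est)
    # The weights sum to at most 1 at `lo` and to at least 1 at `hi`,
    # and the sum is increasing in x, so the root lies in [lo, hi].
    lo = low - 2.0 * math.sqrt(k) / eta
    hi = low - 2.0 / eta
    for _ in range(iters):
        x = 0.5 * (lo + hi)
        total = sum(4.0 / (eta * (l - x)) ** 2 for l in cum_loss_est)
        if total > 1.0:
            hi = x
        else:
            lo = x
    x = 0.5 * (lo + hi)
    w = [4.0 / (eta * (l - x)) ** 2 for l in cum_loss_est]
    s = sum(w)  # renormalize to remove residual bisection error
    return [wi / s for wi in w]


def run_tsallis_inf(loss_rows, seed=0):
    """Play one arm per round on a list of per-round loss vectors in [0, 1]."""
    rng = random.Random(seed)
    k = len(loss_rows[0])
    cum_est = [0.0] * k  # importance-weighted cumulative loss estimates
    total_loss = 0.0
    w = [1.0 / k] * k
    for t, losses in enumerate(loss_rows, start=1):
        eta = 2.0 / math.sqrt(t)  # assumed learning-rate schedule
        w = tsallis_inf_weights(cum_est, eta)
        arm = rng.choices(range(k), weights=w)[0]
        total_loss += losses[arm]
        # Only the played arm's loss is observed; divide by its probability.
        cum_est[arm] += losses[arm] / w[arm]
    return total_loss, w
```

Calling `run_tsallis_inf([[0.1, 0.9]] * 200)` returns the total loss incurred and the final sampling distribution. The same update is used whether the losses are stochastic or adversarial, which is the setting the best-of-both-worlds guarantee refers to.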


