Extensions of Robbins-Siegmund Theorem with Applications in Reinforcement Learning
2509.26442v1
cs.LG, math.OC
2025-10-02
Авторы:
Xinyu Liu, Zixuan Xie, Shangtong Zhang
Abstract
The Robbins-Siegmund theorem establishes the convergence of stochastic
processes that are almost supermartingales and is foundational for analyzing a
wide range of stochastic iterative algorithms in stochastic approximation and
reinforcement learning (RL). However, its original form has a significant
limitation as it requires the zero-order term to be summable. In many important
RL applications, this summable condition, however, cannot be met. This
limitation motivates us to extend the Robbins-Siegmund theorem for almost
supermartingales where the zero-order term is not summable but only square
summable. Particularly, we introduce a novel and mild assumption on the
increments of the stochastic processes. This together with the square summable
condition enables an almost sure convergence to a bounded set. Additionally, we
further provide almost sure convergence rates, high probability concentration
bounds, and $L^p$ convergence rates. We then apply the new results in
stochastic approximation and RL. Notably, we obtain the first almost sure
convergence rate, the first high probability concentration bound, and the first
$L^p$ convergence rate for $Q$-learning with linear function approximation.
Ссылки и действия
Дополнительные ресурсы: