Risk level dependent Minimax Quantile lower bounds for Interactive Statistical Decision Making

2510.05808v1 cs.IT, cs.AI, math.IT 2025-10-09

Авторы:

Raghav Bongole, Amirreza Zamani, Tobias J. Oechtering, Mikael Skoglund

Abstract

Minimax risk and regret focus on expectation, missing rare failures critical in safety-critical bandits and reinforcement learning. Minimax quantiles capture these tails. Three strands of prior work motivate this study: minimax-quantile bounds restricted to non-interactive estimation; unified interactive analyses that focus on expected risk rather than risk level specific quantile bounds; and high-probability bandit bounds that still lack a quantile-specific toolkit for general interactive protocols. To close this gap, within the interactive statistical decision making framework, we develop high-probability Fano and Le Cam tools and derive risk level explicit minimax-quantile bounds, including a quantile-to-expectation conversion and a tight link between strict and lower minimax quantiles. Instantiating these results for the two-armed Gaussian bandit immediately recovers optimal-rate bounds.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Risk level dependent Minimax Quantile lower bounds for Interactive Statistical Decision Making

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Adaptive Cooperative Transmission Design for Ultra-Reliable Low-Latency Communic...

Fed-PELAD: Communication-Efficient Federated Learning for Massive MIMO CSI Feedb...

Spatial Computing Communications for Multi-User Virtual Reality in Distributed M...

Way to Build Native AI-driven 6G Air Interface: Principles, Roadmap, and Outlook

Large AI Models for Wireless Physical Layer

Навигация