On the Optimality of the Median-of-Means Estimator under Adversarial Contamination

2510.07867v1 stat.ML, cs.LG, math.ST, stat.TH 2025-10-11

Авторы:

Xabier de Juan, Santiago Mazuelas

Abstract

The Median-of-Means (MoM) is a robust estimator widely used in machine learning that is known to be (minimax) optimal in scenarios where samples are i.i.d. In more grave scenarios, samples are contaminated by an adversary that can inspect and modify the data. Previous work has theoretically shown the suitability of the MoM estimator in certain contaminated settings. However, the (minimax) optimality of MoM and its limitations under adversarial contamination remain unknown beyond the Gaussian case. In this paper, we present upper and lower bounds for the error of MoM under adversarial contamination for multiple classes of distributions. In particular, we show that MoM is (minimax) optimal in the class of distributions with finite variance, as well as in the class of distributions with infinite variance and finite absolute $(1+r)$-th moment. We also provide lower bounds for MoM's error that match the order of the presented upper bounds, and show that MoM is sub-optimal for light-tailed distributions.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

On the Optimality of the Median-of-Means Estimator under Adversarial Contamination

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Vector-valued self-normalized concentration inequalities beyond sub-Gaussianity

Optimal Convergence Analysis of DDPM for General Distributions

Multimodal Bandits: Regret Lower Bounds and Optimal Algorithms

Complexity Dependent Error Rates for Physics-informed Statistical Learning via t...

Learning Upper Lower Value Envelopes to Shape Online RL: A Principled Approach

Навигация