Non-Asymptotic Analysis of Data Augmentation for Precision Matrix Estimation

2510.02119v1 stat.ML, cs.LG, math.PR, math.ST, stat.TH 2025-10-04

Авторы:

Lucas Morisset, Adrien Hardy, Alain Durmus

Abstract

This paper addresses the problem of inverse covariance (also known as precision matrix) estimation in high-dimensional settings. Specifically, we focus on two classes of estimators: linear shrinkage estimators with a target proportional to the identity matrix, and estimators derived from data augmentation (DA). Here, DA refers to the common practice of enriching a dataset with artificial samples--typically generated via a generative model or through random transformations of the original data--prior to model fitting. For both classes of estimators, we derive estimators and provide concentration bounds for their quadratic error. This allows for both method comparison and hyperparameter tuning, such as selecting the optimal proportion of artificial samples. On the technical side, our analysis relies on tools from random matrix theory. We introduce a novel deterministic equivalent for generalized resolvent matrices, accommodating dependent samples with specific structure. We support our theoretical results with numerical experiments.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Non-Asymptotic Analysis of Data Augmentation for Precision Matrix Estimation

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Limit Theorems for Stochastic Gradient Descent in High-Dimensional Single-Layer ...

Dimension-Free Minimax Rates for Learning Pairwise Interactions in Attention-Sty...

Phase Transition for Stochastic Block Model with more than $\sqrt{n}$ Communitie...

Навигация