On the Benefits of Weight Normalization for Overparameterized Matrix Sensing
2510.01175v1
cs.LG, eess.SP, math.OC, stat.ML
2025-10-04
Authors:
Yudong Wei, Liang Zhang, Bingcong Li, Niao He
Abstract
While normalization techniques are widely used in deep learning, their
theoretical understanding remains relatively limited. In this work, we
establish the benefits of (generalized) weight normalization (WN) applied to
the overparameterized matrix sensing problem. We prove that WN with Riemannian
optimization achieves linear convergence, yielding an exponential speedup over
standard methods that do not use WN. Our analysis further demonstrates that
both iteration and sample complexity improve polynomially as the level of
overparameterization increases. To the best of our knowledge, this work
provides the first characterization of how WN leverages overparameterization
for faster convergence in matrix sensing.
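
To make the setting concrete, here is a minimal sketch of the overparameterized matrix sensing objective, using NumPy. It is not the paper's algorithm: the abstract does not specify the generalized WN or the Riemannian update, so `weight_normalize` shows only the classical column-wise reparameterization (direction times scale). The function names, the symmetric Gaussian measurement model, and the 1/(2m) loss scaling are assumptions made for illustration.

```python
import numpy as np

def make_problem(n=8, true_rank=2, m=200, seed=0):
    """Build a synthetic sensing instance (assumed model: symmetric Gaussian A_k)."""
    rng = np.random.default_rng(seed)
    B = rng.standard_normal((n, true_rank))
    X_star = B @ B.T                              # low-rank PSD ground truth
    G = rng.standard_normal((m, n, n))
    A = (G + G.transpose(0, 2, 1)) / 2.0          # symmetrize each sensing matrix
    y = np.einsum("kij,ij->k", A, X_star)         # y_k = <A_k, X_star>
    return A, y, X_star

def loss_and_grad(U, A, y):
    """f(U) = 1/(2m) * sum_k (<A_k, U U^T> - y_k)^2 and its gradient in U.

    U has shape (n, r); choosing r larger than the true rank is the
    overparameterized regime.
    """
    m = y.shape[0]
    residual = np.einsum("kij,ij->k", A, U @ U.T) - y
    loss = 0.5 * np.mean(residual ** 2)
    # For symmetric A_k: grad = (2/m) * sum_k residual_k * A_k @ U
    grad = (2.0 / m) * np.einsum("k,kij->ij", residual, A) @ U
    return loss, grad

def weight_normalize(V, g):
    """Classical column-wise WN: column j equals g[j] * V[:, j] / ||V[:, j]||."""
    return V / np.linalg.norm(V, axis=0, keepdims=True) * g
```

In this sketch, `U = weight_normalize(V, g)` decouples the direction of each factor column from its scale, which is the basic idea behind WN; how the paper generalizes it and optimizes over the resulting manifold is described in the paper itself, not here.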