Revisiting Zeroth-Order Optimization: Minimum-Variance Two-Point Estimators and Directionally Aligned Perturbations

2510.19975v1 cs.LG, cs.AI, math.OC 2025-10-25

Авторы:

Shaocong Ma, Heng Huang

Abstract

In this paper, we explore the two-point zeroth-order gradient estimator and identify the distribution of random perturbations that minimizes the estimator's asymptotic variance as the perturbation stepsize tends to zero. We formulate it as a constrained functional optimization problem over the space of perturbation distributions. Our findings reveal that such desired perturbations can align directionally with the true gradient, instead of maintaining a fixed length. While existing research has largely focused on fixed-length perturbations, the potential advantages of directional alignment have been overlooked. To address this gap, we delve into the theoretical and empirical properties of the directionally aligned perturbation (DAP) scheme, which adaptively offers higher accuracy along critical directions. Additionally, we provide a convergence analysis for stochastic gradient descent using $\delta$-unbiased random perturbations, extending existing complexity bounds to a wider range of perturbations. Through empirical evaluations on both synthetic problems and practical tasks, we demonstrate that DAPs outperform traditional methods under specific conditions.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Revisiting Zeroth-Order Optimization: Minimum-Variance Two-Point Estimators and Directionally Aligned Perturbations

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Frugality in second-order optimization: floating-point approximations for Newton...

Learning Branching Policies for MILPs with Proximal Policy Optimization

SMiLE: Provably Enforcing Global Relational Properties in Neural Networks

Q3R: Quadratic Reweighted Rank Regularizer for Effective Low-Rank Training

A Convexity-dependent Two-Phase Training Algorithm for Deep Neural Networks

Навигация