Problem-Parameter-Free Decentralized Bilevel Optimization

2510.24288v1 math.OC, cs.LG, stat.ML 2025-10-30

Authors:

Zhiwei Zhai, Wenjing Yan, Ying-Jun Angela Zhang

Abstract

Decentralized bilevel optimization has garnered significant attention due to its critical role in solving large-scale machine learning problems. However, existing methods often rely on prior knowledge of problem parameters (such as smoothness, convexity, or communication network topologies) to determine appropriate stepsizes. In practice, these problem parameters are typically unavailable, leading to substantial manual effort for hyperparameter tuning. In this paper, we propose AdaSDBO, a fully problem-parameter-free algorithm for decentralized bilevel optimization with a single-loop structure. AdaSDBO leverages adaptive stepsizes based on cumulative gradient norms to update all variables simultaneously, dynamically adjusting its progress and eliminating the need for problem-specific hyperparameter tuning. Through rigorous theoretical analysis, we establish that AdaSDBO achieves a convergence rate of $\widetilde{\mathcal{O}}\left(\frac{1}{T}\right)$, matching the performance of well-tuned state-of-the-art methods up to polylogarithmic factors. Extensive numerical experiments demonstrate that AdaSDBO delivers competitive performance compared to existing decentralized bilevel optimization methods while exhibiting remarkable robustness across diverse stepsize configurations.
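
To make the phrase "adaptive stepsizes based on cumulative gradient norms" concrete, the sketch below shows a generic AdaGrad-Norm-style update on a single node and a single-level problem. It is not AdaSDBO: it has no bilevel structure and no communication between agents, and the names `adagrad_norm` and `make_quadratic_grad` are hypothetical and used only for illustration. The point it illustrates is that a stepsize of the form 1 / sqrt(sum of squared gradient norms) needs no knowledge of the curvature of the objective.

```python
import math


def adagrad_norm(grad, x0, steps=200, eps=1e-12):
    """Gradient descent with a stepsize set by cumulative gradient norms.

    Illustrative only: single node, single-level objective.
    """
    x = list(x0)
    acc = eps  # running sum of squared gradient norms (eps avoids division by zero)
    for _ in range(steps):
        g = grad(x)
        acc += sum(gi * gi for gi in g)
        eta = 1.0 / math.sqrt(acc)  # stepsize shrinks as gradient norms accumulate
        x = [xi - eta * gi for xi, gi in zip(x, g)]
    return x


def make_quadratic_grad(curvature, target):
    """Gradient of f(x) = 0.5 * curvature * ||x - target||^2 (toy objective)."""
    def grad(x):
        return [curvature * (xi - ti) for xi, ti in zip(x, target)]
    return grad


if __name__ == "__main__":
    target = [3.0, -1.0]
    # The same code is run for very different curvatures without any tuning.
    for curvature in (0.1, 10.0, 1000.0):
        x = adagrad_norm(make_quadratic_grad(curvature, target), [0.0, 0.0])
        print(curvature, x)
```

On this toy quadratic the update is essentially invariant to the scale of the gradient, so the iterates approach the minimizer for each curvature value without any stepsize being chosen by hand. That is the kind of robustness the abstract refers to, shown here in a far simpler setting.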

