Communication-aware Wide-Area Damping Control using Risk-Constrained Reinforcement Learning

2509.23620v1 eess.SY, cs.LG, cs.SY 2025-10-01

Авторы:

Kyung-bin Kwon, Lintao Ye, Vijay Gupta, Hao Zhu

Abstract

Non-ideal communication links, especially delays, critically affect fast networked controls in power systems, such as the wide-area damping control (WADC). Traditionally, a delay estimation and compensation approach is adopted to address this cyber-physical coupling, but it demands very high accuracy for the fast WADC and cannot handle other cyber concerns like link failures or {cyber perturbations}. Hence, we propose a new risk-constrained framework that can target the communication delays, yet amenable to general uncertainty under the cyber-physical couplings. Our WADC model includes the synchronous generators (SGs), and also voltage source converters (VSCs) for additional damping capabilities. To mitigate uncertainty, a mean-variance risk constraint is introduced to the classical optimal control cost of the linear quadratic regulator (LQR). Unlike estimating delays, our approach can effectively mitigate large communication delays by improving the worst-case performance. A reinforcement learning (RL)-based algorithm, namely, stochastic gradient-descent with max-oracle (SGDmax), is developed to solve the risk-constrained problem. We further show its guaranteed convergence to stationarity at a high probability, even using the simple zero-order policy gradient (ZOPG). Numerical tests on the IEEE 68-bus system not only verify SGDmax's convergence and VSCs' damping capabilities, but also demonstrate that our approach outperforms conventional delay compensator-based methods under estimation error. While focusing on performance improvement under large delays, our proposed risk-constrained design can effectively mitigate the worst-case oscillations, making it equally effective for addressing other communication issues and cyber perturbations.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Communication-aware Wide-Area Damping Control using Risk-Constrained Reinforcement Learning

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Deep Learning Prediction of Beam Coherence Time for Near-FieldTeraHertz Networks

Bridging Earth and Space: A Survey on HAPS for Non-Terrestrial Networks

A Deep State-Space Model Compression Method using Upper Bound on Output Error

MAKO: Meta-Adaptive Koopman Operators for Learning-based Model Predictive Contro...

Falsification-Driven Reinforcement Learning for Maritime Motion Planning

Навигация