Safe Deep Reinforcement Learning for Real-time AC Optimal Power Flow: A Near-optimal Solution

Feng, Bin; Zhao, Jiayue; Huang, Gang; Hu, Yijie; Xu, Huating; Guo, Changxin; Chen, Zhe

doi:10.17775/cseejpes.2023.02070

articleCSEE Journal of Power and Energy SystemsJan 1, 2026DIAMOND OA

Safe Deep Reinforcement Learning for Real-time AC Optimal Power Flow: A Near-optimal Solution

BFBin Feng JZJiayue Zhao GHGang Huang YHYijie Hu HXHuating Xu

Zhejiang University · Aalborg University

Indexed incrossrefdoaj

Abstract

The real-time AC optimal power flow (OPF) problem is a key issue in making fast and accurate decisions to ensure the safety and economy of power systems. With the rapid development of renewable energies, the fluctuation has grown more vibrant, thus a novel approach called safe deep reinforcement learning is proposed in this paper. Herein, the real-time ACOPF problem is modeled as a constrained Markov decision process, and primal-dual optimization (PDO) based proximal policy optimization (PPO) is used to learn the optimal generator outputs in the primal domain and security constraints in the dual domain, which avoids manually selecting a tradeoff between penalties for constraint violations and rewards for the…

Citation impact

6

total citations

FWCI: 0.00
Percentile: 97%
References: 0

Citations per year

Authors

7

Topics & keywords

Topics

Keywords

Reinforcement learning
Key (lock)
Constraint (computer-aided design)
Markov decision process
Dual (grammatical number)
Generator (circuit theory)
Economic dispatch
Domain (mathematical analysis)

UN Sustainable Development Goals

Affordable and clean energy

No related works found for this paper.