- The paper presents a novel model-free multi-agent deep reinforcement learning (MADRL) approach for Volt-VAR Optimization (VVO) in unbalanced distribution systems.
- Utilizing deep Q-networks (DQN), the method acts on the statuses and set-points of devices such as switchable capacitors, voltage regulators, and smart inverters, with a reward function that jointly achieves voltage regulation and power loss reduction.
- Testing on the IEEE 13-bus and 123-bus systems showed over 99.8% success in voltage regulation, significant power loss reductions, and millisecond-scale execution times.
Deep Reinforcement Learning for Volt-VAR Optimization in Unbalanced Distribution Systems
The paper, "Deep Reinforcement Learning Based Volt-VAR Optimization in Smart Distribution Systems", presents a novel approach by integrating multi-agent deep reinforcement learning (MADRL) for Volt-VAR Optimization (VVO) in unbalanced distribution networks. The central focus of this research is the development of a model-free algorithm that efficiently addresses the challenges inherent in the dynamic operating conditions of modern power systems.
Overview of Methodology
This paper introduces a framework built on deep Q-networks (DQN) that bypasses the complex optimization models traditionally required for VVO. Acting on the statuses and tap ratios of power regulation devices such as switchable capacitors, automatic voltage regulators (AVRs), and smart inverters, the DQN agents learn to guide system operation through a carefully designed reward function. The environment evaluates each action with a forward-backward sweep power flow for radial, three-phase distribution systems, and the resulting voltages and losses feed a reward that jointly targets voltage regulation and power loss reduction.
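To make this setup concrete, the following is a minimal sketch of the two environment pieces named above: a forward-backward sweep power flow and a reward that trades off voltage-band compliance against loss reduction. It is a single-phase simplification with illustrative constants (the voltage band, the penalty weight), not the paper's implementation; the paper solves the full unbalanced three-phase case, and its exact reward is not reproduced here.

```python
import numpy as np

def forward_backward_sweep(parent, z, s, v0=1.0 + 0j, tol=1e-8, max_iter=100):
    """Single-phase forward-backward sweep for a radial feeder.

    parent[i] : upstream bus of bus i (buses numbered so parent[i] < i)
    z[i]      : series impedance of the branch feeding bus i (p.u.)
    s[i]      : complex constant-power load at bus i (p.u.; s[0] = 0 at slack)
    Returns (complex bus voltages, total real power loss), both in p.u.
    """
    n = len(s)
    v = np.full(n, v0, dtype=complex)
    i_branch = np.zeros(n, dtype=complex)
    for _ in range(max_iter):
        # Backward sweep: accumulate load currents from leaves toward the root.
        i_branch = np.conj(s / v)
        for i in range(n - 1, 0, -1):
            i_branch[parent[i]] += i_branch[i]
        # Forward sweep: update voltages from the substation outward.
        v_new = v.copy()
        for i in range(1, n):
            v_new[i] = v_new[parent[i]] - z[i] * i_branch[i]
        converged = np.max(np.abs(v_new - v)) < tol
        v = v_new
        if converged:
            break
    loss = float(np.sum(np.abs(i_branch[1:]) ** 2 * z[1:].real))
    return v, loss

def vvo_reward(v, loss, base_loss, v_min=0.95, v_max=1.05, penalty=10.0):
    """Reward relative loss reduction, penalize total depth of band violations."""
    vm = np.abs(v)
    violation = (np.clip(v_min - vm, 0.0, None).sum()
                 + np.clip(vm - v_max, 0.0, None).sum())
    return (base_loss - loss) / max(base_loss, 1e-9) - penalty * violation

# Toy feeder: substation (0) -> bus 1 -> bus 2, with a lateral 1 -> 3.
parent = [0, 0, 1, 1]
z = np.array([0, 0.01 + 0.02j, 0.02 + 0.04j, 0.015 + 0.03j])
s = np.array([0, 0.3 + 0.1j, 0.2 + 0.05j, 0.1 + 0.03j])
v, loss = forward_backward_sweep(parent, z, s)
print(vvo_reward(v, loss, base_loss=loss))  # 0.0: no improvement, no violations
```

Numbering buses so that each parent index precedes its children keeps both sweeps to a single pass per iteration, which is part of what makes this power flow method attractive for radial feeders.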
Numerical Results and Analysis
The efficacy of the proposed method is substantiated through testing on the IEEE 13-bus and 123-bus systems. The success rate in achieving voltage regulation without violations, even under challenging load variations, consistently exceeded 99.8% in the reported trials. Substantial power loss reductions were also observed, confirming that the model-free approach can navigate complex, high-dimensional control spaces efficiently.
The method is also computationally efficient, with average execution times of 21.7 ms for the 13-bus system and 39.2 ms for the 123-bus system. Such millisecond-scale response underscores its potential for real-time applications.
Implications and Future Directions
This work carries several implications for the field of power systems operation and control:
- Practical Application: The method's model-free nature suggests scalability and adaptability to various system architectures and conditions, promising a significant enhancement in operational efficiency for real-world distribution networks.
- Theoretical Contribution: By circumventing conventional NP-hard optimization through MADRL, the research offers a fresh perspective on managing the complexity and scalability challenges of integrated smart grid systems, particularly under high penetration of distributed generation.
Looking ahead, future studies could extend the MADRL approach to broader grid applications such as load frequency control, potentially adopting reinforcement learning algorithms that natively handle continuous rather than discretized actions; the sketch below illustrates how quickly a discretized action space grows.
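To illustrate why discretization becomes a bottleneck for a value-based method like DQN, the sketch below enumerates a joint action space for three hypothetical devices; the device set, tap range, and VAR levels are assumptions for illustration, not the paper's configuration.

```python
from itertools import product

# Hypothetical device discretizations (not the paper's configuration):
cap_states = [0, 1]                          # switchable capacitor: off / on
tap_positions = list(range(-16, 17))         # AVR taps: -16 .. +16
inverter_q = [-0.5, -0.25, 0.0, 0.25, 0.5]   # smart-inverter VAR ratio levels

# A single-agent DQN over the joint space needs one Q-value per combination.
joint_actions = list(product(cap_states, tap_positions, inverter_q))
print(len(joint_actions))   # 2 * 33 * 5 = 330 discrete joint actions

# In a multi-agent setting, each device's agent only ranks its own options.
per_agent_sizes = [len(cap_states), len(tap_positions), len(inverter_q)]
print(sum(per_agent_sizes))  # 40 outputs in total instead of 330
```

Multi-agent decomposition sidesteps this multiplicative growth by letting each agent choose from its own device's small option set, but genuinely continuous set-points, such as an inverter's VAR output, must still be binned unless a continuous-action algorithm is used.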
Conclusion
This paper makes a substantial contribution to intelligent power systems management by leveraging multi-agent deep reinforcement learning. The model-free approach is poised to transform traditional VVO strategies, enabling robust, adaptive control that aligns with the evolving demands of modern distribution systems. As distributed generation continues to reshape power system dynamics, approaches like MADRL present a viable pathway toward more integrated, efficient grid operations.