中国电机工程学报2025,Vol.45Issue(19):7493-7509,中插19,18.DOI:10.13334/j.0258-8013.pcsee.240635
考虑参数共享的深度强化学习双阶段分布式电源优化
Two-stage Distributed Generators Optimization Based on Deep Reinforcement Learning With Parameter Sharing
摘要
Abstract
As renewable energy sources such as solar and wind are increasingly integrated into the grid at high proportions,the optimization and scheduling of distributed power generation face challenges due to frequent changes in system topology,affecting the stability and economic operation of the distribution network.Existing methods,designed for systems with fixed topology,rely on precise models and are time-consuming,making real-time control difficult.Current deep reinforcement learning approaches struggle to balance distributed training and mixed discrete-continuous action spaces.This study introduces a distributed power optimization strategy based on multi-agent deep reinforcement learning with two stages and parameter sharing.Initially,the problem is vertically decoupled,constructing a dynamic distribution network reconfiguration model with distributed generation using mixed-integer second-order cone programming to determine the topology.Subsequently,the distribution network environment is horizontally decoupled into several regions.In the second stage,a centralized training with decentralized execution framework that incorporates parameter sharing is proposed.This framework incorporates a multi-agent prioritized double-delay deep deterministic policy gradient algorithm with a priority experience replay mechanism.Topology information is embedded into the distribution network environment,mapped to agents through power flow calculations to minimize network active power loss in the optimization scheduling model.Case studies demonstrate that the proposed algorithm,by considering changes in the distribution network topology and enhancing learning efficiency through strategy and experience sharing among agents,as well as priority experience replay,meets the efficiency requirements of real-time online decision-making and shows superior voltage stability and loss reduction performance compared to other strategies.关键词
深度强化学习/参数共享/动态重构/拓扑变化/分布式电源优化调度Key words
deep reinforcement learning/parameter sharing/dynamic reconfiguration/topological changes/distributed generation optimization scheduling分类
动力与电气工程引用本文复制引用
高放,姚浩天,高庆,殷林飞,蔡运翔,金岩,潘宇..考虑参数共享的深度强化学习双阶段分布式电源优化[J].中国电机工程学报,2025,45(19):7493-7509,中插19,18.基金项目
国家重点研发计划项目(2022YFB3304700) (2022YFB3304700)
国家自然科学基金项目(62463001) (62463001)
工业控制技术全国重点实验室开放课题(ICT2024B21).National Key R&D Program of China(2022YFB3304700) (ICT2024B21)
The Project Supported by National Natural Science Foundation of China(62463001) (62463001)
The Open Research Project of the State Key Laboratory of Industrial Control Technology(China)(ICT2024B21). (China)