首页|期刊导航|电力建设|基于安全深度强化学习的电力-交通耦合网络韧性提升策略

基于安全深度强化学习的电力-交通耦合网络韧性提升策略

徐鼎杨祺铭吴明明傅超然邢强张国立王明深

电力建设2026，Vol.47Issue(3)：24-38,15.

电力建设2026，Vol.47Issue(3)：24-38,15.DOI:10.12204/j.issn.1000-7229.2026.03.003

基于安全深度强化学习的电力-交通耦合网络韧性提升策略

Resilience Improvement Strategy for the Electrification-Transportation Coupling Network Based on Safe Deep Reinforcement Learning

徐鼎 ¹杨祺铭 ¹吴明明 ¹傅超然 ¹邢强 ²张国立 ²王明深³

作者信息

1. 国网上海市电力公司浦东供电公司,上海市 200122
2. 南京邮电大学自动化学院,南京市 210023
3. 国网江苏省电力有限公司电力科学研究院,南京市 211103
折叠

摘要

Abstract

[Objective]To address the problems that when large-scale electrification-transportation coupling network(ETCN)encounters sudden resilience faults,traditional schemes have slow generation speed,are difficult to respond to dynamic information interaction in real time,and artificial intelligence algorithms are prone to cause safety accidents such as voltage over-limit due to the lack of security mechanisms in application,this paper proposes a resilience improvement strategy for ETCN based on safe deep reinforcement learning(SDRL).[Methods]First,the paper establishes a two-stage electrification-transportation optimization framework:the first stage prioritizes the protection of high-value loads with minimum reconfiguration cost,while the second stage optimizes electric vehicle(EV)routing with minimum traffic dispatch cost.Second,a hierarchical decision-making model based on a modified Rainbow algorithm is designed.The upper layer outputs the action plan of the power grid interconnection switch and inputs the reconstructed power grid state to the lower layer.The lower layer integrates grid reconfiguration state with real-time traffic information to optimize EV routing selection,with the objective to ensure that EV routing optimization can real-time adapt to the power grid's recovery needs.In addition,the Lagrange multiplier safety mechanism is embedded,and an objective function with risk penalties is constructed to achieve dynamic penalties for risk behaviors such as voltage over-limit and current over-limit.[Results]Finally,the simulation based on the actual road network in Shanghai and the IEEE123-node distribution network shows that the proposed strategy can significantly enhance the resilience and operational safety of the system in fault scenarios.Compared with the mixed integer programming and particle swarm optimization methods,the method proposed in this paper demonstrates superior comprehensive performance in terms of load recovery rate,recovery speed,voltage stability and strategy security.[Conclusions]This paper verifies the effectiveness of hierarchical safe deep reinforcement learning in improving the resilience of ETCN.This method solves the problem of the separation of electrification-transportation targets through a two-stage architecture,achieving a balanced synergy among computing efficiency,load recovery rate and operational safety.

关键词

电力-交通耦合网络/韧性提升策略/安全深度强化学习(SDRL)/分层决策模型/改进彩虹算法

Key words

electrification-transportation coupling network/resilience improvement strategy/safe deep reinforcement learning(SDRL)/hierarchical decision-making model/modified Rainbow algorithm

分类

信息技术与安全科学

引用本文复制引用

徐鼎,杨祺铭,吴明明,傅超然,邢强,张国立,王明深..基于安全深度强化学习的电力-交通耦合网络韧性提升策略[J].电力建设,2026,47(3):24-38,15.

基金项目

国家自然科学基金面上项目(52477101) （52477101）

国家电网有限公司科技项目(52092125000A) This work is supported by National Natural Science Foundation of China(No.52477101)and the Science and Technology Project of State Grid Corporation of China(No.52092125000A). （52092125000A）

电力建设

ISSN：1000-7229

访问量0

下载量0

段落导航