浙江电力2024,Vol.43Issue(2):126-136,11.DOI:10.19585/j.zjdl.202402014
基于深度强化学习的多能流楼宇低碳调度方法
A low-carbon scheduling method for multi-energy flow buildings based on deep rein-forcement learning
摘要
Abstract
Building emissions reduction has become a crucial pathway for China to achieve its'dual-carbon'goals.As an integrated energy entity coupled with multi-energy flow networks,smart buildings face challenges such as high carbon emissions,a high degree of coupling in multi-energy flow networks,and distinct dynamic characteris-tics in load energy consumption behavior.In response to these challenges,a low-carbon scheduling method for multi-energy flow buildings based on deep reinforcement learning(deep RL)is proposed.Firstly,a reward and punish-ment ladder-type carbon emissions trading mechanism is established based on the actual carbon emissions of smart buildings.Secondly,targeting the carbon market and multi-energy flow coupling networks,a low-carbon scheduling model for multi-energy flow buildings is developed,aiming to minimize operating costs as the objective function,and the scheduling is transformed into a Markov decision process(MDP).Subsequently,the Rainbow algorithm is employed to solve the optimal scheduling.Finally,the feasibility and effectiveness of the optimal scheduling model are verified through simulation analysis.关键词
"双碳"目标/多能流/低碳调度/深度强化学习Key words
'dual-carbon'goals/multi-energy flow/low-carbon scheduling/deep RL引用本文复制引用
胥栋,李逸超,李赟,徐刚,杜佳玮..基于深度强化学习的多能流楼宇低碳调度方法[J].浙江电力,2024,43(2):126-136,11.基金项目
国网上海市电力公司浦东供电公司营销项目(640921220001) (640921220001)