|国家科技期刊平台
首页|期刊导航|电子科技|Q学习差分进化算法求解热电动态经济排放调度

Q学习差分进化算法求解热电动态经济排放调度OA

A Q-Learning Differential Evolution Algorithm for Combined Heat and Power Dynamic Economic Emission Dispatch

中文摘要英文摘要

热电联产动态经济排放调度同时考虑了燃料成本花费和污染气体排放两个目标值,且下一时间段的热电产量受当前时间段热电产量的影响,这是近年来电力系统运行中的一个重要问题.文中提出一种基于Q学习强化多目标差分进化(Q Learning Multi-Objective Differential Evolution,QLMODE)算法,以此求解热电联产动态经济排放调度(Combined Heat and Power Dynamic Economic Emission Dispatch,CHPDEED)问题.在QLMODE中,采用Q学习技术调整算法的比例因子参数,即在迭代过程中利用子代解和父代解之间的支配关系确定动作奖励和惩罚,并通过Q学习调整参数值,以获得最适合环境模型的算法参数.文中将所提QLMODE用于求解11 机组和33 机组的热电联产动态经济排放调度问题.仿真结果表明,与4 种成熟的多目标优化算法相比,QLMODE算法燃料成本最小,污染气体排放最少,收敛性和多样性指标优于其他4 种算法,且QLMODE在两组问题上都获得了更好的Pareto最优前沿.

The dynamic economic emission scheduling of cogeneration takes into account both fuel cost and pol-lution gas emission,and the thermoelectricity output in the next period is affected by the thermoelectricity output in the current period,which is an important problem in power system operation in recent years.In this study,a new QLMODE(Q-Learning Multi-Objective Differential Evolution)algorithm is proposed to solve the CHPDEED(Combined Heat and Power Dynamic Economic Emission Dispatch)problem.In QLMODE,the Q-learning tech-nique is used to adjust the scale factor parameters of the algorithm,that is,in the iterative process,the action reward and punishment are determined by using the dominant relationship between the child solution and the parent solution,and the parameter values are adjusted by Q-learning to obtain the most suitable algorithm parameters for the environ-mental model.The proposed QLMODE is used to solve the CHPDEED with 11 units and 33 units.The simulation re-sults show that compared with four mature multi-objective optimization algorithms,the QLMODE algorithm has the least fuel cost and the least pollution gas emission,the convergence and diversity index of QLMODE algorithm is bet-ter than the other four algorithms,and QLMODE has a better Pareto optimal frontier on both sets of problems.

方帅;陈旭;李康吉

江苏大学 电气信息工程工程学院,江苏 镇江 212013

计算机与自动化

Q学习强化学习多目标算法差分进化热电联产经济排放调度动态调度电力系统

Q learningreinforcement learningmulti-objective algorithmdifferential evolutioncogenera-tion combined heat and powereconomic emission dispatchdynamic dispatchpower system

《电子科技》 2024 (005)

9-17 / 9

国家自然科学基金(61873114);江苏大学农业装备学部青年计划项目(NZXB20210211). National Natural Science Foundation of China(61873114);Youth Program of Faculty of Agricultural Equipment Jiangsu University(NZXB20210211)

10.16180/j.cnki.issn1007-7820.2024.05.002

评论