首页|期刊导航|铁道科学与工程学报|基于深度强化学习DDDQN的高速列车智能调度调整方法

基于深度强化学习DDDQN的高速列车智能调度调整方法

吴卫阴佳腾陈照森唐涛

铁道科学与工程学报2024，Vol.21Issue(4)：1298-1308,11.

铁道科学与工程学报2024，Vol.21Issue(4)：1298-1308,11.DOI:10.19713/j.cnki.43-1423/u.T20230864

基于深度强化学习DDDQN的高速列车智能调度调整方法

Intelligent rescheduling optimization method of high-speed railway based on deep reinforcement learning DDDQN

吴卫 ¹阴佳腾 ²陈照森 ²唐涛²

作者信息

1. 北京和利时系统工程有限公司,北京 100176
2. 北京交通大学轨道交通控制与安全国家重点实验室,北京 100044
折叠

摘要

Abstract

In the daily operation of the high-speed railway system,trains are often disturbed by various emergencies leading to delays,which seriously affects passengers'travel experience.In order to work out train rescheduling scheme in a short time and reduce train delay time as much as possible,a train rescheduling optimization method DDDQN combining deep reinforcement learning and an programming model was proposed.First,the track was divided into multiple sections connected.An integer programming model was constructed to describe the train operation process to minimize the total delay time of all trains based on the job-shop scheduling problem.Then,each train was regarded as an agent,and the state,action and reward functions of multiple agents were defined according to the actual operation requirements.Two deep neural networks were constructed to approximate the functions.Finally,combined with the above integer programming model,the training method of DDDQN was designed.In this model,the feasible solution to the problem was explored by the agent in the simulation environment,and the parameters of the neural network were updated by the"mutual feed"mechanism between the two neural networks.On this basis,the optimal solution to the problem can be obtained in a short time by solving the integer programming model.The actual track data and operation data of the Beijing-Zhangjiakou high-speed railway were used for simulation experiments,and the total train delay time and solution time obtained by three different solution methods under 10 different emergency scenarios were compared,which verified that the proposed DDDQN model could obtain the optimal solution of the problem in a short time.DDDQN can reduce train delay time by up to 30.43%and solution time by up to 68.33%.DDDQN provides an intelligent method and reference for improving the emergency handling ability and transportation organization efficiency of high-speed railway systems under emergencies.

关键词

列车智能调度调整/列车晚点时间/深度强化学习/整数规划模型/神经网络

Key words

intelligent train rescheduling/train delay time/deep reinforcement learning/integer programming/neural network

分类

交通工程

引用本文复制引用

吴卫,阴佳腾,陈照森,唐涛..基于深度强化学习DDDQN的高速列车智能调度调整方法[J].铁道科学与工程学报,2024,21(4):1298-1308,11.

基金项目

国家自然科学基金"基础科学中心项目"(72288101) （72288101）

国家自然科学基金"优青"项目(72322022) （72322022）

先进轨道交通自主运行全国重点实验室项目(RAO2023ZZ001) （RAO2023ZZ001）

铁道科学与工程学报

OA北大核心CSTPCDEI

ISSN：1672-7029

访问量0

下载量0

段落导航