| 注册
首页|期刊导航|控制与信息技术|基于DQN算法的货运列车长大下坡运行优化算法研究

基于DQN算法的货运列车长大下坡运行优化算法研究

何之煜 李一楠 李辉 吉志军

控制与信息技术Issue(4):19-27,9.
控制与信息技术Issue(4):19-27,9.DOI:10.13889/j.issn.2096-5427.2024.04.003

基于DQN算法的货运列车长大下坡运行优化算法研究

A DQN-Based Algorithm for Operational Optimization of Freight Trains in Long Steep Downhill Sections

何之煜 1李一楠 1李辉 1吉志军1

作者信息

  • 1. 中国铁道科学研究院集团有限公司 通信信号研究所,北京 100081
  • 折叠

摘要

Abstract

Freight trains running in long steep downhill sections require speed regulation through cycle braking. However,improper braking application and release timing can pose significant safety risks in train operation. Taking SS6B electric locomotive pulling C80 freight car as the research object,the train dynamics model based on mass belt is established. This study proposes a deep Q-network (DQN) based intelligent curve generation algorithm for operational optimization in these sections. This algorithm incorporates train operational efficiency,safety,and brake shoe wear as optimization objectives,and considers speed limits and charging time constraints for brake cylinders,enabling the search for optimal transition points in cycle braking conditions through interactions with the environment. The study employed the batch collection of training samples utilizing experience replay and a double-network mechanism,along with the preprocessing of neural network state inputs,and the investigation into feasible regions within the action space using a variable ε-greedy strategy. A loss function based on the value function was then constructed,and network parameters were updated iteratively by a batch gradient descent method. Results from simulations conducted in environments set up using Matlab showed that in the task training of train operation on long steep downhill with randomly generated entry speeds,cumulative rewards gradually converged over training runs,which verified the convergence and generalization of the proposed algorithm. The optimized operational curves generated with various entry speeds at the completion of training,effectively controlled the trains to apply air braking before reaching the speed limits and to release braking at the end of air charging,which verified the efficacy of the algorithm in ensuring the safety and efficiency of train operation. In addition,by comparing average cumulative reward curves for different learning rates and distribution ranges after preprocessing of different network inputs,the algorithm was further verified capable in improving convergence speeds and stability. The research results provide a reference for further optimizing the generation of operational curves for freight trains running in long steep downhill sections,thereby ensuring both train operational efficiency and safety.

关键词

货运列车/长大下坡/运行曲线/深度Q网络/神经网络/输入预处理

Key words

freight train/long steep downhill/operational curve/deep Q-network (DQN)/neural network/input pretreatment

分类

交通运输

引用本文复制引用

何之煜,李一楠,李辉,吉志军..基于DQN算法的货运列车长大下坡运行优化算法研究[J].控制与信息技术,2024,(4):19-27,9.

基金项目

中国铁道科学研究院集团有限公司基金项目(2022HT15,2023YJ273) (2022HT15,2023YJ273)

控制与信息技术

2096-5427

访问量0
|
下载量0
段落导航相关论文