
Analysis of the Decision-Making Mechanism of Deep Reinforcement Learning in Typical Ramp Metering Scenarios

Liu Bing (刘冰), Tang Yu (唐钰), Ji Yuxiong (暨育雄), Shen Yu (沈煜), Du Yuchuan (杜豫川)

Journal of Tongji University (Natural Science), 2024, Vol. 52, Issue 6: 928-934, 981, 8. DOI: 10.11908/j.issn.0253-374x.22418

Understanding Deep Reinforcement Learning Algorithm in Typical Ramp Metering Scenarios

Liu Bing [1], Tang Yu [2], Ji Yuxiong [1], Shen Yu [1], Du Yuchuan [1]

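Author information

  • 1. Key Laboratory of Road and Traffic Engineering of the Ministry of Education, Tongji University, Shanghai 201804
  • 2. Tandon School of Engineering, New York University, New York 11201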

Abstract

This paper analyzes the control mechanism of deep reinforcement learning (DRL) in a typical ramp metering scenario. The state value function is used to evaluate whether the DRL model can distinguish changes in the traffic state. Saliency maps are used to identify the key state features and the control pattern of the DRL model under specific traffic states. Using input perturbation, the action match ratio and the control performance under perturbed data are analyzed to identify the key areas for control. The results show that the DRL model can evaluate the traffic state accurately, distinguish the key features, and make reasonable decisions.
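The abstract names three diagnostics (state value, saliency, and action match ratio under input perturbation) without giving their definitions on this page. The sketch below illustrates one plausible reading of each on a toy model. Everything in it is an assumption for illustration: the linear Q-function `W`, the three state features, the finite-difference form of saliency, and the Gaussian noise used for perturbation are hypothetical and are not taken from the paper.

```python
import numpy as np

# Hypothetical stand-in for a trained DRL critic: a linear Q-function over
# three made-up state features (upstream, merge-area, downstream density).
# By construction, only the merge-area feature (index 1) affects the actions.
W = np.array([[0.0, 1.0, 0.0],
              [0.0, -1.0, 0.0]])

def q_values(state):
    """Q-value of each of the two toy actions for one state vector."""
    return W @ state

def greedy_action(state):
    """Action the toy policy would take: the one with the largest Q-value."""
    return int(np.argmax(q_values(state)))

def state_value(state):
    """State value under the greedy policy, V(s) = max_a Q(s, a)."""
    return float(np.max(q_values(state)))

def perturbation_saliency(state, delta=0.1):
    """Assumed saliency: |V(s + delta * e_i) - V(s)| for each feature i."""
    base = state_value(state)
    saliency = np.zeros(state.shape)
    for i in range(state.size):
        perturbed = state.copy()
        perturbed[i] += delta
        saliency[i] = abs(state_value(perturbed) - base)
    return saliency

def action_match_ratio(states, mask, noise_std, rng):
    """Share of states whose greedy action is unchanged after adding
    Gaussian noise to the features selected by `mask` (1 = perturb)."""
    matches = 0
    for s in states:
        noisy = s + mask * rng.normal(0.0, noise_std, size=s.shape)
        matches += int(greedy_action(s) == greedy_action(noisy))
    return matches / len(states)

rng = np.random.default_rng(0)
states = rng.uniform(-1.0, 1.0, size=(200, 3))

saliency = perturbation_saliency(np.array([0.3, 0.5, 0.2]))
ratio_off_key = action_match_ratio(states, np.array([1.0, 0.0, 1.0]), 5.0, rng)
ratio_on_key = action_match_ratio(states, np.array([0.0, 1.0, 0.0]), 5.0, rng)

print("saliency per feature:", saliency)
print("match ratio, noise outside key feature:", ratio_off_key)
print("match ratio, noise on key feature:", ratio_on_key)
```

In this toy setting, noise on the features the policy ignores leaves every action unchanged, while noise on the merge-area feature lowers the match ratio. That contrast is the kind of signal an input perturbation analysis uses to locate the key areas of control.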

Key words

traffic engineering / deep reinforcement learning (DRL) / explainable machine learning / ramp metering

Classification

Traffic engineering

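Cite this article

Liu Bing, Tang Yu, Ji Yuxiong, Shen Yu, Du Yuchuan. Understanding Deep Reinforcement Learning Algorithm in Typical Ramp Metering Scenarios [J]. Journal of Tongji University (Natural Science), 2024, 52(6): 928-934, 981, 8.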

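Funding

Scientific Research Program of the Science and Technology Commission of Shanghai Municipality (19DZ1209100)

Key Research and Development Program of Zhejiang Province (2021C01011)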

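Journal of Tongji University (Natural Science)

Open access (OA); indexed in the Peking University Core Journals list (北大核心) and CSTPCD

ISSN 0253-374X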
