| 注册
首页|期刊导航|交通运输工程与信息学报|基于深度强化学习PPO的匝道混合交通流合流控制方法

基于深度强化学习PPO的匝道混合交通流合流控制方法

张浩然 王嘉文 周丽萍

交通运输工程与信息学报2026,Vol.24Issue(1):116-130,15.
交通运输工程与信息学报2026,Vol.24Issue(1):116-130,15.DOI:10.19961/j.cnki.1672-4747.2025.10.003

基于深度强化学习PPO的匝道混合交通流合流控制方法

Deep reinforcement learning PPO-based ramp merging control approach for mixed traffic flow

张浩然 1王嘉文 1周丽萍1

作者信息

  • 1. 上海理工大学,管理学院,上海 200093
  • 折叠

摘要

Abstract

[Background]The gradual integration of autonomous vehicles(AVs)into traffic systems has increased the prevalence of mixed traffic flows,which comprise human-driven vehicles(HDVs)and AVs in on-ramp merging areas.However,single-vehicle intelligence approaches exhibit limita-tions in terms of real-time responsiveness and system-wide coordination.Moreover,the overall char-acteristics and comprehensive effects of mixed traffic flows require further in-depth investigation.[Objective]This study aims to address control challenges posed by mixed traffic flows in on-ramp merging areas to enhance the overall traffic efficiency and safety.[Method]A control method for AVs is proposed by leveraging the proximal policy optimization(PPO)deep reinforcement-learning(DRL)algorithm to execute car-following and lane-changing behaviors.To mitigate frequent lane changes and prevent overly conservative driving,this strategy incorporates a lane-changing penalty and a fixed-step negative reward mechanism.Simulations are conducted to evaluate the effect of DRL-based AVs on mixed traffic flows.[Result]As the AV penetration rate increases,the proposed strategy significantly enhances both traffic efficiency and safety.Compared with a DRL baseline(DRL-B)and a rule-based(RB)strategy,this method improves the overall traffic efficiency and two safety indicators,TTC and DRAC,by 2.31%,17.3%,3.1%,and 4.57%,10.7%,0.34%,respectively.The most significant improvement in operational efficiency is observed under moderate arrival rates when compared with the RB strategy,and under high arrival rates when compared with the DRL-B strategy.The most substantial safety improvements are observed at moderate AV penetration rates.[Application]This study validates the effectiveness of the DRL approach in enhancing the efficiency and safety of mixed traffic flows in on-ramp merging areas,as well as offer insights for the deploy-ment and application of AVs in such scenarios.

关键词

智能交通/自动驾驶/深度强化学习/匝道合流区/单车智能

Key words

intelligent transportation/autonomous vehicles/deep reinforcement learning/express-way merging areas/single vehicle intelligence

分类

交通工程

引用本文复制引用

张浩然,王嘉文,周丽萍..基于深度强化学习PPO的匝道混合交通流合流控制方法[J].交通运输工程与信息学报,2026,24(1):116-130,15.

基金项目

上海市科技创新行动计划项目(25692117700,25692107000) (25692117700,25692107000)

交通运输工程与信息学报

1672-4747

访问量1
|
下载量0
段落导航相关论文