首页|期刊导航|交通运输工程与信息学报|基于深度强化学习PPO的匝道混合交通流合流控制方法

基于深度强化学习PPO的匝道混合交通流合流控制方法

张浩然王嘉文周丽萍

交通运输工程与信息学报2026，Vol.24Issue(1)：116-130,15.

交通运输工程与信息学报2026，Vol.24Issue(1)：116-130,15.DOI:10.19961/j.cnki.1672-4747.2025.10.003

基于深度强化学习PPO的匝道混合交通流合流控制方法

Deep reinforcement learning PPO-based ramp merging control approach for mixed traffic flow

张浩然 ¹王嘉文 ¹周丽萍¹

作者信息

1. 上海理工大学,管理学院,上海 200093
折叠

摘要

Abstract

[Background]The gradual integration of autonomous vehicles(AVs)into traffic systems has increased the prevalence of mixed traffic flows,which comprise human-driven vehicles(HDVs)and AVs in on-ramp merging areas.However,single-vehicle intelligence approaches exhibit limita-tions in terms of real-time responsiveness and system-wide coordination.Moreover,the overall char-acteristics and comprehensive effects of mixed traffic flows require further in-depth investigation.[Objective]This study aims to address control challenges posed by mixed traffic flows in on-ramp merging areas to enhance the overall traffic efficiency and safety.[Method]A control method for AVs is proposed by leveraging the proximal policy optimization(PPO)deep reinforcement-learning(DRL)algorithm to execute car-following and lane-changing behaviors.To mitigate frequent lane changes and prevent overly conservative driving,this strategy incorporates a lane-changing penalty and a fixed-step negative reward mechanism.Simulations are conducted to evaluate the effect of DRL-based AVs on mixed traffic flows.[Result]As the AV penetration rate increases,the proposed strategy significantly enhances both traffic efficiency and safety.Compared with a DRL baseline(DRL-B)and a rule-based(RB)strategy,this method improves the overall traffic efficiency and two safety indicators,TTC and DRAC,by 2.31％,17.3％,3.1％,and 4.57％,10.7％,0.34％,respectively.The most significant improvement in operational efficiency is observed under moderate arrival rates when compared with the RB strategy,and under high arrival rates when compared with the DRL-B strategy.The most substantial safety improvements are observed at moderate AV penetration rates.[Application]This study validates the effectiveness of the DRL approach in enhancing the efficiency and safety of mixed traffic flows in on-ramp merging areas,as well as offer insights for the deploy-ment and application of AVs in such scenarios.

关键词

智能交通/自动驾驶/深度强化学习/匝道合流区/单车智能

Key words

intelligent transportation/autonomous vehicles/deep reinforcement learning/express-way merging areas/single vehicle intelligence

分类

交通工程

引用本文复制引用

张浩然,王嘉文,周丽萍..基于深度强化学习PPO的匝道混合交通流合流控制方法[J].交通运输工程与信息学报,2026,24(1):116-130,15.

基金项目

上海市科技创新行动计划项目(25692117700,25692107000) （25692117700,25692107000）

交通运输工程与信息学报

ISSN：1672-4747

访问量0

下载量0

段落导航