Safe Efficient Policy Optimization Algorithm for Unsignalized Intersection NavigationOACSTPCDEI
Safe Efficient Policy Optimization Algorithm for Unsignalized Intersection Navigation
Unsignalized intersections pose a challenge for autonomous vehicles that must decide how to navigate them safely and efficiently.This paper proposes a reinforcement learn-ing(RL)method for autonomous vehicles to navigate unsignal-ized intersections safely and efficiently.The method uses a semantic scene representation to handle variable numbers of vehicles and a universal reward function to facilitate stable learn-ing.A collision risk function is designed to penalize unsafe actions and guide the agent to avoid them.A scalable policy optimization algorithm is introduced to improve data efficiency and safety for vehicle learning at intersections.The algorithm employs experi-ence replay to overcome the on-policy limitation of proximal pol-icy optimization and incorporates the collision risk constraint into the policy optimization problem.The proposed safe RL algo-rithm can balance the trade-off between vehicle traffic safety and policy learning efficiency.Simulated intersection scenarios with different traffic situations are used to test the algorithm and demonstrate its high success rates and low collision rates under different traffic conditions.The algorithm shows the potential of RL for enhancing the safety and reliability of autonomous driv-ing systems at unsignalized intersections.
Xiaolong Chen;Biao Xu;Manjiang Hu;Yougang Bian;Yang Li;Xin Xu
State Key Laboratory of Advanced Design and Manufacturing for Vehicle Body,the College of Mechanical and Vehicle Engineering,Hunan University,Changsha 410082,ChinaState Key Laboratory of Advanced Design and Manufacturing for Vehicle Body,the College of Mechanical and Vehicle Engineering,Hunan University,Changsha 410082||Wuxi Intelligent Control Research Institute(WICRI)of Hunan University,Wuxi 214115,ChinaCollege of Intelligence Science and Technology,Institute of Unmanned Systems,National University of Defense Technology,Changsha 410073,China
Autonomous drivingdecision-makingreinforce-ment learning(RL)unsignalized intersection
《自动化学报(英文版)》 2024 (009)
2011-2026 / 16
This work was supported by the National Natural Science Foundation of China(52102394,52172384),Hunan Provincial Natural Science Foundation of China(2023JJ10008),and Young Elite Scientists Sponsorship Program by CAST(2022QNRC001).
评论