首页|期刊导航|自动化学报|基于滚动时域强化学习的智能车辆侧向控制算法

基于滚动时域强化学习的智能车辆侧向控制算法

张兴龙陆阳李文璋徐昕

自动化学报2023，Vol.49Issue(12)：2481-2492,12.

自动化学报2023，Vol.49Issue(12)：2481-2492,12.DOI:10.16383/j.aas.c210555

基于滚动时域强化学习的智能车辆侧向控制算法

Receding Horizon Reinforcement Learning Algorithm for Lateral Control of Intelligent Vehicles

张兴龙 ¹陆阳 ¹李文璋 ¹徐昕¹

作者信息

1. 国防科技大学智能科学学院长沙 410073
折叠

摘要

Abstract

This paper presents a receding horizon reinforcement learning(RHRL)algorithm for realizing high-accur-acy lateral control of intelligent vehicles.The overall lateral control is composed of a feedforward control term that is directly computed using the curvature of the reference path and the dynamic model,and a feedback control term that is generated by solving an optimal control problem using the proposed RHRL algorithm.The proposed RHRL adopts a receding horizon optimization mechanism,and decomposes the infinite-horizon optimal control problem in-to several finite-horizon ones to be solved.Different from existing finite-horizon actor-critic learning algorithms,in each prediction horizon of RHRL,a time-independent actor-critic structure is utilized to learn the optimal value function and control policy.Also,compared with model predictive control(MPC),the control learned by RHRL is an explicit state-feedback control policy,which can be deployed directly offline or learned and deployed synchron-ously online.Moreover,the convergence of the proposed RHRL algorithm in each prediction horizon is proven and the stability analysis of the closed-loop system is peroformed.Simulation studies on a structural road show that,the proposed RHRL algorithm performs better than current state-of-the-art methods.The experimental studies on an intelligent driving platform built with a Hongqi E-HS3 electric car show that RHRL performs better than the pure pursuit method in the adopted structural city road scenario,and exhibits strong adaptability to road conditions and satisfactory control performance in the country road scenario.

关键词

滚动时域/强化学习/智能汽车/侧向控制

Key words

Receding horizon/reinforcement learning/intelligent vehicles/lateral control

引用本文复制引用

张兴龙,陆阳,李文璋,徐昕..基于滚动时域强化学习的智能车辆侧向控制算法[J].自动化学报,2023,49(12):2481-2492,12.

基金项目

国家重点研究发展计划(2018YFB1305105),国家自然科学基金(62003361,61825305)资助Supported by National Key Research and Development Pro-gram of China(2018YFB1305105)and National Natural Science Foundation of China(62003361,61825305) （2018YFB1305105）

自动化学报

OA北大核心CSCDCSTPCD

ISSN：0254-4156

访问量0

下载量0

段落导航