计算机工程与应用2024,Vol.60Issue(10):341-352,12.DOI:10.3778/j.issn.1002-8331.2302-0033
基于碰撞预测的强化模仿学习机器人导航方法
Reinforcement Imitation Learning Method Based on Collision Prediction for Robots Navigation
摘要
Abstract
The learning-based robot navigation methods have high dependence on the dataset and imperfect performance in some specific environments,for example,agents cannot run towards its goal through a wide-open space and have high collision rate in space with dense obstacles.In order to improve the navigation performance of robots in multi-obstacle scenarios,a reinforcement imitation learning navigation method based on collision prediction is proposed.Firstly,the state space,action space,and reward function are built for the Markov decision process(MDP)based on the performance of the robot without model.The model is trained in simulation environment based on reinforcement learning to allow the robot to acquire navigation and obstacle avoidance abilities in sparse obstacle environments.To improve the shortcomings of reinforcement learning in terms of imperfect performance in specific environments,imitation learning is used to train the policy.Finally,a collision prediction model is designed to combine traditional control with deep learning to make the robot adaptively select the appropriate control policy in different environments based on the prediction results,which greatly improves the safety of navigation.The navigation performance and generalization capability of the proposed method are experimentally verified in a large number of never-before-encountered scenarios.关键词
导航/强化学习/模仿学习/碰撞预测/混合控制Key words
navigation/reinforcement learning/imitation learning/collision prediction/hybrid control分类
信息技术与安全科学引用本文复制引用
王浩杰,陶冶,鲁超峰..基于碰撞预测的强化模仿学习机器人导航方法[J].计算机工程与应用,2024,60(10):341-352,12.基金项目
国家重点研发计划(2018YFB1702902) (2018YFB1702902)
山东省高等学校青创科技支持计划(2019KJN047). (2019KJN047)