首页|期刊导航|计算机工程与应用|基于碰撞预测的强化模仿学习机器人导航方法

基于碰撞预测的强化模仿学习机器人导航方法

王浩杰陶冶鲁超峰

计算机工程与应用2024，Vol.60Issue(10)：341-352,12.

计算机工程与应用2024，Vol.60Issue(10)：341-352,12.DOI:10.3778/j.issn.1002-8331.2302-0033

基于碰撞预测的强化模仿学习机器人导航方法

Reinforcement Imitation Learning Method Based on Collision Prediction for Robots Navigation

王浩杰 ¹陶冶 ¹鲁超峰¹

作者信息

1. 青岛科技大学信息科学技术学院,山东青岛 266100
折叠

摘要

Abstract

The learning-based robot navigation methods have high dependence on the dataset and imperfect performance in some specific environments,for example,agents cannot run towards its goal through a wide-open space and have high collision rate in space with dense obstacles.In order to improve the navigation performance of robots in multi-obstacle scenarios,a reinforcement imitation learning navigation method based on collision prediction is proposed.Firstly,the state space,action space,and reward function are built for the Markov decision process(MDP)based on the performance of the robot without model.The model is trained in simulation environment based on reinforcement learning to allow the robot to acquire navigation and obstacle avoidance abilities in sparse obstacle environments.To improve the shortcomings of reinforcement learning in terms of imperfect performance in specific environments,imitation learning is used to train the policy.Finally,a collision prediction model is designed to combine traditional control with deep learning to make the robot adaptively select the appropriate control policy in different environments based on the prediction results,which greatly improves the safety of navigation.The navigation performance and generalization capability of the proposed method are experimentally verified in a large number of never-before-encountered scenarios.

关键词

导航/强化学习/模仿学习/碰撞预测/混合控制

Key words

navigation/reinforcement learning/imitation learning/collision prediction/hybrid control

分类

信息技术与安全科学

引用本文复制引用

王浩杰,陶冶,鲁超峰..基于碰撞预测的强化模仿学习机器人导航方法[J].计算机工程与应用,2024,60(10):341-352,12.

基金项目

国家重点研发计划(2018YFB1702902) （2018YFB1702902）

山东省高等学校青创科技支持计划(2019KJN047). （2019KJN047）

计算机工程与应用

OA北大核心CSTPCD

ISSN：1002-8331

访问量7

下载量0

段落导航