首页|期刊导航|西安电子科技大学学报（自然科学版）|一种改进dueling网络的机器人避障方法

一种改进dueling网络的机器人避障方法

周翼陈渤

西安电子科技大学学报（自然科学版）2019，Vol.46Issue(1)：46-50,63,6.

西安电子科技大学学报（自然科学版）2019，Vol.46Issue(1)：46-50,63,6.DOI:10.19665/j.issn1001-2400.2019.01.008

一种改进dueling网络的机器人避障方法

Method for robot obstacle avoidance based on the improved dueling network

周翼 ¹陈渤²

作者信息

1. 西安电子科技大学雷达信号处理国家重点实验室,陕西西安 710071
2. 西安电子科技大学信息感知技术协同创新中心,陕西西安 710071
折叠

摘要

Abstract

In view of the disadvantages of traditional reinforcement learning methods in motion planning, especially the problem of robot obstacle avoidance,it is easy to have overestimation and difficult to adapt to complex environment.A new model based on deep reinforcement learning is proposed to improve the obstacle avoidance performance of robots.The model combines dueling networks with Q-learning which is the traditional reinforcement learning method,and using two independent trained dueling networks to deal with environmental data and predict the action value.In the output layer,the state value and the action advantage are output respectively,with both values combined as the final action value.The model can process high dimension data to adapt to complex and changeable environment,and output advantageous actions for robot selection to get a higher accumulative reward.It can effectively improve the obstacle avoidance performance of a robot.

关键词

机器人避障/深度增强学习/dueling网络/独立训练

Key words

robot obstacle avoidance/deep reinforcement learning/dueling networks/independent trained

分类

信息技术与安全科学

引用本文复制引用

周翼,陈渤..一种改进dueling网络的机器人避障方法[J].西安电子科技大学学报（自然科学版）,2019,46(1):46-50,63,6.

基金项目

国家自然科学基金(61771361) （61771361）

国家自然科学基金杰出青年基金(61525105) （61525105）

西安电子科技大学学报（自然科学版）

OA北大核心CSCDCSTPCD

ISSN：1001-2400

访问量0

下载量0

段落导航