| 注册
首页|期刊导航|计算机工程与应用|自注意力机制结合DDPG的机器人路径规划研究

自注意力机制结合DDPG的机器人路径规划研究

王凤英 陈莹 袁帅 杜利明

计算机工程与应用2024,Vol.60Issue(19):158-166,9.
计算机工程与应用2024,Vol.60Issue(19):158-166,9.DOI:10.3778/j.issn.1002-8331.2307-0009

自注意力机制结合DDPG的机器人路径规划研究

Robot Path Planning Based on Self-Attention Mechanism Combined with DDPG

王凤英 1陈莹 2袁帅 2杜利明1

作者信息

  • 1. 沈阳建筑大学计算机科学与工程学院,沈阳 110168||宿迁学院信息工程学院,江苏 宿迁 223800
  • 2. 沈阳建筑大学计算机科学与工程学院,沈阳 110168
  • 折叠

摘要

Abstract

In order to better solve the problems of low sample utilization,sparse reward and slow stability of network model in path planning of depth deterministic strategy gradient algorithm,an improved DDPG algorithm is proposed.By incorporating a self-attention mechanism into the image information obtained from robot camera sensors and using the Dot-product method to calculate the correlation between images,high weights can be accurately focused on obstacle infor-mation.In complex environments,it is difficult for robots to obtain positive feedback rewards due to their lack of experi-ence,which affects their exploration ability.Combining DDPG algorithm with HER,a DDPG-HER algorithm is pro-posed,which effectively utilizes positive and negative feedback to enable robots to learn appropriate rewards from both successful and failed experiences.A static and dynamic simulation environment is built by Gazebo for training and test-ing.The experimental results show that the proposed algorithm can significantly improve the sample utilization rate,accel-erate network model stability,and solve the problem of sparse reward,so that the robot can efficiently avoid obstacles and reach the target point in the path planning with unknown environment.

关键词

深度强化学习/深度确定性策略梯度算法(DDPG)/后见经验算法(HER)/自注意力机制/机器人路径规划

Key words

deep reinforcement learning/deep deterministic policy gradient(DDPG)/hindsight experience replay(HER)/self-attention mechanism,robot path planning

分类

信息技术与安全科学

引用本文复制引用

王凤英,陈莹,袁帅,杜利明..自注意力机制结合DDPG的机器人路径规划研究[J].计算机工程与应用,2024,60(19):158-166,9.

基金项目

辽宁省应用基础研究计划(2023JH2/101300212) (2023JH2/101300212)

宿迁学院人才引进科研启动基金(校2022XRC091). (校2022XRC091)

计算机工程与应用

OA北大核心CSTPCD

1002-8331

访问量0
|
下载量0
段落导航相关论文