Computer Engineering (计算机工程), 2024, Vol. 50, Issue 2: 105-112, 8. DOI: 10.19678/j.issn.1000-3428.0066799
Dynamic Obstacle Avoidance for Service Robots Based on Spatio-Temporal Graph Attention Network
Abstract
To address the collisions, freezing, and unnatural paths that service robots exhibit in dense crowds whose members make autonomous decisions, this study proposes a dynamic obstacle avoidance algorithm for service robots based on a spatio-temporal graph attention network within a Deep Reinforcement Learning (DRL) framework. The spatio-temporal graph attention network serves as the decision function of the Proximal Policy Optimization (PPO) algorithm. First, the algorithm uses a Gated Recurrent Unit (GRU) to control how much the robot remembers and forgets about its environment and thereby extracts the temporal features of that environment. This gives the robot some ability to predict the movement trends of pedestrians. Second, the algorithm uses graph attention networks to obtain the implicit spatial interaction features between the robot and pedestrians, enabling the robot to find collision-free paths. Finally, the spatio-temporal graph attention network is trained with the PPO algorithm, which enables the robot to complete collision-free navigation tasks in a crowd. The algorithm is verified by simulation experiments in a dynamic closed environment with 2.5 m² per person. Compared with the non-learning Dynamic Window Approach (DWA), the proposed algorithm improves the navigation success rate by 71 percentage points. Compared with the learning-based DSRNN-RL algorithm, it improves the navigation success rate by 3 percentage points and produces shorter navigation paths. Finally, a real-time navigation test in the Gazebo environment shows that the average inference time of the algorithm is 21.90 ms, which meets the requirements of real-time navigation.
Key words
service robot / dynamic obstacle avoidance / Deep Reinforcement Learning (DRL) / spatio-temporal graph attention network / real-time navigation
Classification
Information Technology and Security Science
Cite this article
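DU Haijun, YU Su. Dynamic Obstacle Avoidance for Service Robots Based on Spatio-Temporal Graph Attention Network [J]. Computer Engineering, 2024, 50(2): 105-112, 8.
Funding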
Scientific Research Program of the Science and Technology Commission of Shanghai Municipality (17511110204).
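
Illustrative sketch

The abstract describes two building blocks: a GRU that summarizes each agent's recent observations over time, and graph attention that weights pedestrians by their relevance to the robot. The following is a minimal sketch of those two steps only, in pure Python, and is not the authors' architecture. The function names, the layer sizes, the random weights, the single attention head, and the random observations are all assumptions made for illustration; the GRU follows one common gate convention, and PPO training is omitted entirely.

```python
import math
import random


def matvec(M, v):
    """Multiply matrix M (list of rows) by vector v."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def gru_step(h, x, p):
    """One GRU update. z is the update gate, r is the reset gate."""
    z = [sigmoid(a + b + c)
         for a, b, c in zip(matvec(p["Wz"], x), matvec(p["Uz"], h), p["bz"])]
    r = [sigmoid(a + b + c)
         for a, b, c in zip(matvec(p["Wr"], x), matvec(p["Ur"], h), p["br"])]
    rh = [ri * hi for ri, hi in zip(r, h)]
    cand = [math.tanh(a + b + c)
            for a, b, c in zip(matvec(p["Wh"], x), matvec(p["Uh"], rh), p["bh"])]
    # Blend the old state with the candidate state (one common convention).
    return [(1 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h, cand)]


def attention_over_pedestrians(robot, peds, W, a, slope=0.2):
    """GAT-style single-head attention from the robot node to each pedestrian."""
    wr = matvec(W, robot)
    wp = [matvec(W, ped) for ped in peds]
    scores = []
    for w in wp:
        # Score the concatenated pair [W*robot || W*pedestrian], then LeakyReLU.
        e = sum(ai * vi for ai, vi in zip(a, wr + w))
        scores.append(e if e > 0 else slope * e)
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]  # numerically stable softmax
    total = sum(exps)
    alpha = [e / total for e in exps]
    # Attention-weighted sum of the transformed pedestrian features.
    context = [sum(al * w[k] for al, w in zip(alpha, wp)) for k in range(len(wr))]
    return alpha, context


def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def demo(seed=0, hidden=4, obs=2, n_peds=3, steps=5):
    """Hypothetical setup: random weights and random observations."""
    rng = random.Random(seed)
    p = {}
    for g in "zrh":
        p["W" + g] = rand_matrix(hidden, obs, rng)
        p["U" + g] = rand_matrix(hidden, hidden, rng)
        p["b" + g] = [0.0] * hidden
    W = rand_matrix(hidden, hidden, rng)
    a = [rng.uniform(-0.5, 0.5) for _ in range(2 * hidden)]
    # Agent 0 is the robot; the remaining agents are pedestrians.
    states = [[0.0] * hidden for _ in range(n_peds + 1)]
    for _ in range(steps):
        for i in range(n_peds + 1):
            x = [rng.uniform(-1.0, 1.0) for _ in range(obs)]
            states[i] = gru_step(states[i], x, p)
    return attention_over_pedestrians(states[0], states[1:], W, a)
```

Calling `demo()` returns one attention weight per pedestrian (the weights sum to 1) together with a context vector that a policy network could consume. In a real system the weights would be learned, for example under PPO as the abstract describes, rather than drawn at random.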