| 注册
首页|期刊导航|计算机工程与应用|动态环境下共融机器人深度强化学习导航算法

动态环境下共融机器人深度强化学习导航算法

顾金浩 况立群 韩慧妍 曹亚明 焦世超

计算机工程与应用2025,Vol.61Issue(4):90-98,9.
计算机工程与应用2025,Vol.61Issue(4):90-98,9.DOI:10.3778/j.issn.1002-8331.2405-0088

动态环境下共融机器人深度强化学习导航算法

Deep Reinforcement Learning Navigation Algorithm for Coexisting-Cooperative-Cognitive Robots in Dynamic Environment

顾金浩 1况立群 1韩慧妍 1曹亚明 1焦世超1

作者信息

  • 1. 中北大学 计算机科学与技术学院,太原 030051||机器视觉与虚拟现实山西省重点实验室,太原 030051||山西省视觉信息处理及智能机器人工程研究中心,太原 030051
  • 折叠

摘要

Abstract

In the past few decades,navigation algorithms for mobile service robots have been extensively studied,but intelli-gent agents still lack the complexity and cooperation exhibited by humans in crowded environments.With the continuous expansion of human-machine integration applications,collaboration between robots and humans in shared workspaces will become increasingly important.Therefore,the next generation of mobile service robots needs to meet social require-ments in order to be accepted by humans.In order to enhance the autonomous navigation ability of multi-agent systems in dynamic scenarios,a deep reinforcement learning obstacle avoidance algorithm for coexisting-cooperative-cognitive robots in dynamic environments is proposed to address the issues of low social adaptability and finding the optimal value func-tion in multi-agent navigation.A motion model that is more closely related to human behavior is established and added to the deep reinforcement learning framework to improve the cooperation of coexisting-cooperative-cognitive robots.In order to enhance the perceived safety of pedestrians based on their physical safety,a reward function is redefined.Non-linear deep neural networks are used instead of traditional value functions to solve the problem of finding the optimal value function.Simulation experiments show that compared to the latest deep reinforcement learning navigation methods,the proposed method achieves a 100%navigation success rate without increasing navigation time and without any collisions.The results indicate that this method maximizes the satisfaction of human social principles for the fusion robot,while effectively moving towards the goal and improving the perceived safety of pedestrians.

关键词

服务机器人/避障算法/深度强化学习/最优值函数/奖励函数

Key words

service robot/obstacle avoidance/deep reinforcement learning/optimal value function/reward function

分类

信息技术与安全科学

引用本文复制引用

顾金浩,况立群,韩慧妍,曹亚明,焦世超..动态环境下共融机器人深度强化学习导航算法[J].计算机工程与应用,2025,61(4):90-98,9.

基金项目

国家自然科学基金(62272426) (62272426)

山西省科技重大专项计划"揭榜挂帅"项目(202201150401021) (202201150401021)

山西省自然科学基金(202303021211153,202203021222027,202303021212189,202203021212138) (202303021211153,202203021222027,202303021212189,202203021212138)

山西省科技成果转化引导专项(202104021301055). (202104021301055)

计算机工程与应用

OA北大核心

1002-8331

访问量2
|
下载量0
段落导航相关论文