计算机科学与探索2024,Vol.18Issue(8):2080-2090,11.DOI:10.3778/j.issn.1673-9418.2310040
改进MADDPG算法的非凸环境下多智能体自组织协同围捕
Multi-agent Self-organizing Cooperative Hunting in Non-convex Environment with Improved MADDPG Algorithm
摘要
Abstract
A multi-agent reinforcement learning algorithm based on improved experience playback is proposed to solve the trapping efficiency problem of multi-agent in non-convex environment.The residual network(ResNet)is used to improve the network degradation problem,and the RW-MADDPG algorithm combined with the multi-agent depth deterministic strategy gradient algorithm(MADDPG)is proposed.In order to solve the problem of low utilization of experience pool data during multi-agent training,two methods to improve the utilization of experience pool data are proposed.In order to solve the problem that multiple agents are trapped inside obstacles such as unreachable target in non-convex obstacle environment,a reasonable trapping reward function is designed to make intelligent agents complete the trapping task in non-convex obstacle environment.Simulation experiments are designed based on this algorithm.Experimental results show that the algorithm increases the reward faster in the training stage and can complete the rounding task faster.Compared with MADDPG algorithm,the training time is shortened by 18.5%under static rounding environment and 49.5%under dynamic environment.Moreover,the global average reward of the rounding agent trained by this algorithm is higher in the non-convex obstacle environment.关键词
深度强化学习/RW-MADDPG/残差网络/经验池/围捕奖励函数Key words
deep reinforcement learning/RW-MADDPG/residual network/experience pool/rounding reward function分类
信息技术与安全科学引用本文复制引用
张红强,石佳航,吴亮红,王汐,左词立,陈祖国,刘朝华,陈磊..改进MADDPG算法的非凸环境下多智能体自组织协同围捕[J].计算机科学与探索,2024,18(8):2080-2090,11.基金项目
国家自然科学基金(52104192,62271199) (52104192,62271199)
湖南省自然科学基金(2021JJ30280,2022JJ30265) (2021JJ30280,2022JJ30265)
湖南省教育厅重点项目(23A0382) (23A0382)
湖南省教育厅优秀青年项目(22B0476,21B0456) (22B0476,21B0456)
湖南省科技托举工程青年英才项目(2022TJ-Q03). This work was supported by the National Natural Science Foundation of China(52104192,62271199),the Natural Science Foundation of Hunan Province(2021JJ30280,2022JJ30265),the Key Project of Hunan Provincial Department of Education(23A0382),the Excellent Youth Project of Hunan Provincial Department of Education(22B0476,21B0456),and the Young Talents Project of Science and Technology Lifting Project of Hunan Province(2022TJ-Q03). (2022TJ-Q03)