首页|期刊导航|计算机工程|基于融合课程思想MADDPG的无人机编队控制

基于融合课程思想MADDPG的无人机编队控制

吴凯峰刘磊刘晨梁成庆

计算机工程2025，Vol.51Issue(5)：73-82,10.

计算机工程2025，Vol.51Issue(5)：73-82,10.DOI:10.19678/j.issn.1000-3428.0069850

基于融合课程思想MADDPG的无人机编队控制

Unmanned Aerial Vehicle Formation Control Based on MADDPG with Integrated Curriculum Learning

吴凯峰 ¹刘磊 ¹刘晨 ¹梁成庆¹

作者信息

1. 河海大学数学学院,江苏南京 211100
折叠

摘要

Abstract

The Multi-Agent Deep Deterministic Policy Gradient(MADDPG)algorithm is an extension of the Deep Deterministic Policy Gradient(DDPG)algorithm,specifically designed for multi-agent environments.In the MADDPG algorithm,each agent considers not only its own observations and actions but also the strategies of other agents to make more accurate collective decisions.This design significantly improves performance and stability in complex and changing environments.Based on the MADDPG algorithm framework,this study addressed the problem of Unmanned Aerial Vehicle(UAV)formation control.To overcome the challenge of convergence difficulty in multi-agent algorithms,a curriculum reinforcement learning approach was employed to train tasks in a stagewise manner.Progressively enhanced reward functions were designed for different tasks of each stage,and dense rewards were devised using the artificial potential field concept to significantly reduce the training difficulty.The effectiveness and stability of the MADDPG algorithm in multi-agent environments were demonstrated through ablation and control experiments performed in a self-built Software in the Loop(SITL)simulation environment.Furthermore,real-world experiments were conducted to verify the practicality of the designed algorithm.

关键词

无人机编队/深度强化学习/多智能体深度确定性策略梯度/课程学习/神经网络

Key words

Unmanned Aerial Vehicle(UAV)formation/deep reinforcement learning/Multi-Agent Deep Deterministic Policy Gradient(MADDPG)/curriculum learning/neural network

分类

信息技术与安全科学

引用本文复制引用

吴凯峰,刘磊,刘晨,梁成庆..基于融合课程思想MADDPG的无人机编队控制[J].计算机工程,2025,51(5):73-82,10.

基金项目

河北省自然科学基金面上项目(A2023209002). （A2023209002）

计算机工程

OA北大核心

ISSN：1000-3428

访问量0

下载量0

段落导航