空天防御2025,Vol.8Issue(3):73-85,13.
基于强化学习的无人机协作防守策略设计与验证
Design and Verification of UAV Cooperative Defense Strategy Based on Reinforcement Learning
摘要
Abstract
The drone swarm confrontation is built based on the OODA decision loop and employs multi-agent deep reinforcement learning for algorithm design to find the optimal collaborative defence strategy for drone swarm.Specifically,a QMIX-based single-layer decision algorithm is developed to tackle contribution allocation and high-dimensional space challenges in drone cooperation.In this paper,a hierarchical decision-making model integrating rule-based methods and reinforcement learning was proposed.This model first adopted a decision layer with rule-based or HMM intention recognition to analyze combat scenarios and schedule drones,followed by an action layer utilizing the QMIX algorithm to output actions.To verify the performance of the proposed algorithms,this study established a controllable and observable simulation platform using Python and Unity and produced a challenging defensive game problem.Experiments quantitatively evaluated defence strategies in perspectives of cooperation effectiveness,resource efficiency,and generalisation.The results show that each index of hierarchical decision-making is significantly better than that of single-layer decision making,and the winning rate has been dramatically improved.The HMM-based hierarchical strategy shows the best performance,offering a promising new approach to drone swarm defence.关键词
无人机集群/协作防守/多智能体强化学习/仿真平台/分层决策Key words
drone swarm/cooperative defence/multi-agent reinforcement learning/simulation platform/hierarchical decision分类
航空航天引用本文复制引用
李奕佳,李嘉诺,柯良军..基于强化学习的无人机协作防守策略设计与验证[J].空天防御,2025,8(3):73-85,13.基金项目
国家自然科学基金项目(72001214) (72001214)