Design and Verification of UAV Cooperative Defense Strategy Based on Reinforcement Learning

Li Yijia (李奕佳), Li Jianuo (李嘉诺), Ke Liangjun (柯良军)

Air & Space Defense (空天防御), 2025, Vol. 8, Issue 3: 73-85, 13 pages.

Design and Verification of UAV Cooperative Defense Strategy Based on Reinforcement Learning

Li Yijia¹, Li Jianuo¹, Ke Liangjun¹

Author information

1. School of Automation Science and Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China

Abstract

A drone swarm confrontation scenario is built on the observe-orient-decide-act (OODA) decision loop, and multi-agent deep reinforcement learning is used to find the optimal cooperative defence strategy for the swarm. Specifically, a QMIX-based single-layer decision algorithm is developed to tackle the contribution allocation (credit assignment) and high-dimensional space challenges in drone cooperation. A hierarchical decision-making model that integrates rule-based methods with reinforcement learning is then proposed: a decision layer first uses rules or hidden Markov model (HMM) intention recognition to analyze the combat scenario and schedule drones, and an action layer then uses the QMIX algorithm to output actions. To verify the performance of the proposed algorithms, a controllable and observable simulation platform is built with Python and Unity, and a challenging defensive game problem is constructed on it. Experiments quantitatively evaluate the defence strategies in terms of cooperation effectiveness, resource efficiency, and generalisation. The results show that hierarchical decision-making is significantly better than single-layer decision-making on every index, with a markedly higher win rate. The HMM-based hierarchical strategy performs best, offering a promising new approach to drone swarm defence.
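
The decision layer described above can be illustrated with a minimal Python sketch of HMM-based intention recognition followed by a toy scheduling rule. This is not the authors' implementation: the intent states (`attack`, `feint`), the observation symbols, every probability, the `threshold` parameter, and the function names `filter_intent` and `schedule_defenders` are hypothetical and chosen only for illustration. The QMIX action layer is not shown.

```python
# Hypothetical two-intention HMM; all names and numbers are illustrative
# assumptions, not values taken from the paper.
STATES = ("attack", "feint")
PRIOR = {"attack": 0.5, "feint": 0.5}
TRANS = {
    "attack": {"attack": 0.9, "feint": 0.1},
    "feint": {"attack": 0.2, "feint": 0.8},
}
EMIT = {
    "attack": {"approach": 0.8, "hold": 0.15, "retreat": 0.05},
    "feint": {"approach": 0.3, "hold": 0.3, "retreat": 0.4},
}


def filter_intent(observations):
    """Forward algorithm: P(intent at the last step | observations so far)."""
    belief = dict(PRIOR)
    for step, obs in enumerate(observations):
        if step > 0:
            # Predict: propagate the belief through the transition model.
            belief = {
                s: sum(belief[p] * TRANS[p][s] for p in STATES) for s in STATES
            }
        # Update: weight by the observation likelihood, then normalise.
        belief = {s: belief[s] * EMIT[s][obs] for s in STATES}
        total = sum(belief.values())
        belief = {s: v / total for s, v in belief.items()}
    return belief


def schedule_defenders(belief, n_drones, threshold=0.7):
    """Toy scheduling rule: commit more drones when an attack looks likely."""
    if belief["attack"] >= threshold:
        return n_drones  # commit every defender
    return max(1, n_drones // 2)  # otherwise keep half in reserve


belief = filter_intent(["approach", "approach", "hold"])
print(belief, schedule_defenders(belief, n_drones=4))
```

In a hierarchical setup of this kind, the number of scheduled drones (or a richer task assignment) would be passed down to the action layer, which selects the low-level actions for each scheduled drone.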

Key words

drone swarm/cooperative defence/multi-agent reinforcement learning/simulation platform/hierarchical decision

Classification

Aerospace

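Cite this article

Li Yijia, Li Jianuo, Ke Liangjun. Design and Verification of UAV Cooperative Defense Strategy Based on Reinforcement Learning[J]. Air & Space Defense (空天防御), 2025, 8(3): 73-85, 13.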

Funding

National Natural Science Foundation of China (72001214)

Air & Space Defense (空天防御)

ISSN: 2096-4641
