首页|期刊导航|航空兵器|基于多智能体深度强化学习的高炮反无人机算法

基于多智能体深度强化学习的高炮反无人机算法

胡家威代昌华祁万龙陈志恒王震樊浡昊郑欣磊唐杰

航空兵器2025，Vol.32Issue(5)：64-71,8.

航空兵器2025，Vol.32Issue(5)：64-71,8.DOI:10.12132/ISSN.1673-5048.2025.0073

基于多智能体深度强化学习的高炮反无人机算法

Multi-Agent Deep Reinforcement Learning for Anti-Aircraft Artillery Systems in Counter-UAV Operations

胡家威 ¹代昌华 ¹祁万龙 ¹陈志恒 ¹王震 ¹樊浡昊 ¹郑欣磊 ¹唐杰¹

作者信息

1. 西北机电工程研究所,陕西咸阳 712099
折叠

摘要

Abstract

To address the issues of low engagement efficiency and insufficient adaptability in current coun-ter unmanned aerial vehicle(UAV)artillery systems,this paper proposes a situational-fused hierarchical multi-objective multi-agent reinforcement learning for counter-UAV systems.Firstly,within the context of counter-UAV artillery fire engagement systems,the problem of the counter-UAV defense scenario is formally defined.Secondly,the counter-UAV artillery task is formulated as a Markov decision process.Decision-making agents are constructed,and their state space,action space,and reward function are defined.Specifically,to enhance the agents' global situational awareness,multi-source situational information is fused into the state space.Addi-tionally,considering the characteristics of artillery counter-UAV engagement and fire control delay,a hierarchi-cal multi-objective reward mechanism is designed to guide the agents' decision-making process.Finally,the a-gents are trained using a deep multi-agent reinforcement learning approach based on monotonic value function factorization and validated within a simulation environment.Experimental results demonstrate that across three distinct counter-UAV scenarios,the proposed algorithm achieves task completion rates of 86％,88％,and 78％,respectively.This represents an average improvement of 48.9％compared to other prevalent MADRL al-gorithms.The results significantly enhance the engagement efficiency and battlefield adaptability of the counter-UAV artillery system,providing an effective AI-driven solution for countering UAV threats with artillery.

关键词

高炮反无人机/态势信息/层次化多目标奖励/多智能体/深度强化学习/马尔可夫决策

Key words

anti-aircraft artillery counter-UAV/situational information/hierarchical multi-objective re-ward/multi-agent/deep reinforcement learning/Markov decision

分类

军事科技

引用本文复制引用

胡家威,代昌华,祁万龙,陈志恒,王震,樊浡昊,郑欣磊,唐杰..基于多智能体深度强化学习的高炮反无人机算法[J].航空兵器,2025,32(5):64-71,8.

基金项目

智能基础算法与技术应用团队计划(L2024-CXNL-KJRCTD-KJTD-0006) （L2024-CXNL-KJRCTD-KJTD-0006）

航空兵器

OA北大核心

ISSN：1673-5048

访问量0

下载量0

段落导航