| 注册
首页|期刊导航|航空兵器|基于多智能体深度强化学习的高炮反无人机算法

基于多智能体深度强化学习的高炮反无人机算法

胡家威 代昌华 祁万龙 陈志恒 王震 樊浡昊 郑欣磊 唐杰

航空兵器2025,Vol.32Issue(5):64-71,8.
航空兵器2025,Vol.32Issue(5):64-71,8.DOI:10.12132/ISSN.1673-5048.2025.0073

基于多智能体深度强化学习的高炮反无人机算法

Multi-Agent Deep Reinforcement Learning for Anti-Aircraft Artillery Systems in Counter-UAV Operations

胡家威 1代昌华 1祁万龙 1陈志恒 1王震 1樊浡昊 1郑欣磊 1唐杰1

作者信息

  • 1. 西北机电工程研究所,陕西咸阳 712099
  • 折叠

摘要

Abstract

To address the issues of low engagement efficiency and insufficient adaptability in current coun-ter unmanned aerial vehicle(UAV)artillery systems,this paper proposes a situational-fused hierarchical multi-objective multi-agent reinforcement learning for counter-UAV systems.Firstly,within the context of counter-UAV artillery fire engagement systems,the problem of the counter-UAV defense scenario is formally defined.Secondly,the counter-UAV artillery task is formulated as a Markov decision process.Decision-making agents are constructed,and their state space,action space,and reward function are defined.Specifically,to enhance the agents' global situational awareness,multi-source situational information is fused into the state space.Addi-tionally,considering the characteristics of artillery counter-UAV engagement and fire control delay,a hierarchi-cal multi-objective reward mechanism is designed to guide the agents' decision-making process.Finally,the a-gents are trained using a deep multi-agent reinforcement learning approach based on monotonic value function factorization and validated within a simulation environment.Experimental results demonstrate that across three distinct counter-UAV scenarios,the proposed algorithm achieves task completion rates of 86%,88%,and 78%,respectively.This represents an average improvement of 48.9%compared to other prevalent MADRL al-gorithms.The results significantly enhance the engagement efficiency and battlefield adaptability of the counter-UAV artillery system,providing an effective AI-driven solution for countering UAV threats with artillery.

关键词

高炮反无人机/态势信息/层次化多目标奖励/多智能体/深度强化学习/马尔可夫决策

Key words

anti-aircraft artillery counter-UAV/situational information/hierarchical multi-objective re-ward/multi-agent/deep reinforcement learning/Markov decision

分类

武器工业

引用本文复制引用

胡家威,代昌华,祁万龙,陈志恒,王震,樊浡昊,郑欣磊,唐杰..基于多智能体深度强化学习的高炮反无人机算法[J].航空兵器,2025,32(5):64-71,8.

基金项目

智能基础算法与技术应用团队计划(L2024-CXNL-KJRCTD-KJTD-0006) (L2024-CXNL-KJRCTD-KJTD-0006)

航空兵器

OA北大核心

1673-5048

访问量0
|
下载量0
段落导航相关论文