基于人工势场法的无人机集群突防博弈研究

王瑞昌 石琛 张科 呼卫军 马先龙

空天防御, 2026, Vol. 9, Issue 2: 8-17, 10.

Research on Penetration Games of UAV Swarms Based on the Artificial Potential Field Method

王瑞昌 1, 石琛 2, 张科 1, 呼卫军 1, 马先龙 1

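Author information

  • 1. School of Astronautics, Northwestern Polytechnical University, Xi'an 710072, Shaanxi, China
  • 2. Shanghai Electro-Mechanical Engineering Institute, Shanghai 201109, China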

Abstract

To address non-convergent high-dimensional continuous decision-making, low exploration efficiency, and insufficient policy robustness in many-versus-many zero-sum penetration games between red and blue quadrotor swarms in three-dimensional airspace, this paper proposes a Multi-Agent Deep Deterministic Policy Gradient (MADDPG) framework that integrates artificial potential field priors with opponent strategy prediction. First, three-degree-of-freedom UAV kinematics are embedded in a complete-information differential game, a three-tier "mission-threat-cooperation" reward structure is designed, and a differentiable potential field energy is introduced to turn sparse terminal rewards into dense gradient signals, which makes the prior of seeking advantage and avoiding disadvantage explicit. Second, a potential-field-guided hybrid exploration mechanism is constructed: online, the potential energy direction modulates Ornstein-Uhlenbeck (OU) process noise; offline, potential field regularization smooths the target Q-values. Together these improve sample utilization and suppress overestimation. Third, a lightweight opponent strategy predictor is integrated, which introduces a meta-game term into the Actor gradient so that red-team policy updates also minimize the opponent's expected payoff, proactively disrupting the consistency of enemy decisions and accelerating convergence to a Nash equilibrium. Simulation results show that the proposed method achieves stable win rates above 90% in 2v2 and 4v4 dense confrontations, systematically induces the blue team to produce redundant accelerations and dissipate energy, and continuously creates spatio-temporal gaps that allow collision-free penetration. It significantly outperforms MADDPG without prediction, which supports the framework's scalability, real-time performance, and robustness in many-versus-many zero-sum games.
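
The abstract does not state the functional form of the potential field energy, the shaping term, or the noise modulation. As a rough illustration only, the sketch below assumes a classic attractive/repulsive potential, standard potential-based reward shaping with Φ = -U, and an OU process whose mean is shifted along the potential descent direction. Every function name, gain, and parameter here (`k_att`, `k_rep`, `d0`, `beta`, and so on) is hypothetical and is not taken from the paper.

```python
import math
import random


def potential(pos, goal, threats, k_att=1.0, k_rep=1.0, d0=2.0):
    """Assumed APF energy: quadratic attraction to the goal plus
    repulsion from every threat closer than the influence radius d0."""
    u = 0.5 * k_att * sum((p - g) ** 2 for p, g in zip(pos, goal))
    for t in threats:
        d = math.dist(pos, t)
        if d < d0:
            d = max(d, 1e-6)  # avoid division by zero at the threat itself
            u += 0.5 * k_rep * (1.0 / d - 1.0 / d0) ** 2
    return u


def shaping_reward(pos, next_pos, goal, threats, gamma=0.99):
    """Dense shaping term gamma * Phi(s') - Phi(s) with Phi = -U,
    so moving to lower potential energy yields a positive reward."""
    return potential(pos, goal, threats) - gamma * potential(next_pos, goal, threats)


def descent_direction(pos, goal, threats, eps=1e-4):
    """Unit vector along -grad U, estimated by central finite differences."""
    grad = []
    for i in range(len(pos)):
        hi, lo = list(pos), list(pos)
        hi[i] += eps
        lo[i] -= eps
        grad.append((potential(hi, goal, threats) - potential(lo, goal, threats)) / (2 * eps))
    norm = math.sqrt(sum(g * g for g in grad))
    if norm < 1e-12:
        return [0.0] * len(pos)
    return [-g / norm for g in grad]


def ou_step(noise, direction, rng, theta=0.15, sigma=0.2, beta=0.1, dt=1.0):
    """One Euler step of an OU process that reverts toward beta * direction
    instead of zero, biasing exploration along the potential descent."""
    return [
        n + theta * (beta * d - n) * dt + sigma * math.sqrt(dt) * rng.gauss(0.0, 1.0)
        for n, d in zip(noise, direction)
    ]


if __name__ == "__main__":
    rng = random.Random(0)
    pos, goal, threats = (0.0, 0.0, 0.0), (10.0, 0.0, 0.0), [(1.0, 1.0, 0.0)]
    direction = descent_direction(pos, goal, threats)
    noise = ou_step([0.0, 0.0, 0.0], direction, rng)
    print("descent direction:", direction)
    print("exploration noise:", noise)
    print("shaping reward:", shaping_reward(pos, (1.0, 0.0, 0.0), goal, threats))
```

In a MADDPG-style loop, the shaping term would be added to the environment reward and the noise added to the actor's output before the action is applied; how the paper actually combines these is not specified in the abstract.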

Key words

artificial potential field (人工势场) / many-versus-many zero-sum penetration game (多对多零和突防博弈) / reinforcement learning (强化学习) / policy prediction (策略预测)

Classification

Aerospace

Cite this article

王瑞昌, 石琛, 张科, 呼卫军, 马先龙. 基于人工势场法的无人机集群突防博弈研究[J]. 空天防御, 2026, 9(2): 8-17, 10.

Funding

Shanghai Aerospace Science and Technology Innovation Fund of China Aerospace Science and Technology Corporation (SAST2022-006)

空天防御

ISSN 2096-4641
