| 注册
首页|期刊导航|舰船电子工程|基于改进深度Q网络的多UAV队形变换策略

基于改进深度Q网络的多UAV队形变换策略

李洁菁 卢宗汉 高恒杰 戴邵武

舰船电子工程2024,Vol.44Issue(6):22-29,8.
舰船电子工程2024,Vol.44Issue(6):22-29,8.DOI:10.3969/j.issn.1672-9730.2024.06.006

基于改进深度Q网络的多UAV队形变换策略

Formation Change Strategy of Multiple UAV Based on Improved DQN

李洁菁 1卢宗汉 2高恒杰 3戴邵武4

作者信息

  • 1. 中国人民解放军第四八零一工厂虎门军械修理厂 东莞 523938
  • 2. 山东科技大学 青岛 266400
  • 3. 中国人民解放军第91729部队 陵水 572400
  • 4. 海军航空大学 烟台 264001
  • 折叠

摘要

Abstract

Aiming at the problems of complex system structure and large amount of calculation in traditional multiple UAV for-mation transformation methods,a multiple UAV formation transformation method based on improved DQN algorithm is proposed.First,a DQN-based multiple UAV formation transformation method is proposed,and the multiple UAV formation transformation problem is modeled as a Markov decision process.Secondly,the dynamic target point optimal allocation algorithm(DTA)is used to target formation of members in the formation.The optimal node allocation of the formation can improve the efficiency of the formation transformation and accelerate the convergence speed of the DQN algorithm.Again,the introduction of the Reciprocal Velocity Obsta-cle Method(RVO)to guide the formation members to avoid collisions during the formation transformation,thereby improving the performance of the DQN algorithm learning efficiency.Finally,for the unsmooth route planned by the algorithm,the UAV cannot fly,and the three uniform B-spline interpolation algorithm is introduced to smooth the route.In the formation transformation experi-ment,compared with the DQN algorithm based on dynamic target allocation(DTA-DQN),the DQN algorithm based on the recipro-cal speed obstacle method(RVO-DQN),and the traditional DQN algorithm,the convergence speed of the proposed algorithm is in-creased by 39.26%,40.31%,50.77%respectively,and the average range is shorter.The simulation results show that the proposed algorithm can effectively improve the efficiency of multiple UAV formation transformation,and the algorithm has good generalization and practicability.

关键词

DQN算法/动态目标点分配/互惠速度障碍法/无人飞行器/队形变换

Key words

DQN algorithm/dynamic target allocation/reciprocal velocity obstacle/unmanned aerial vehicle/formation change

分类

航空航天

引用本文复制引用

李洁菁,卢宗汉,高恒杰,戴邵武..基于改进深度Q网络的多UAV队形变换策略[J].舰船电子工程,2024,44(6):22-29,8.

舰船电子工程

OACSTPCD

1672-9730

访问量4
|
下载量0
段落导航相关论文