火力与指挥控制2024,Vol.49Issue(3):25-34,10.DOI:10.3969/j.issn.1002-0640.2024.03.004
面向无人机集群察打场景的PPO算法设计
Research on PPO Algorithm Design for UAV Swarm Reconnaissance and Strike Scenarios
李俊慧 1张振华 1边疆 1聂天常 1车博山1
作者信息
- 1. 北方自动控制技术研究所,太原 030006
- 折叠
摘要
Abstract
The problem of d ecision-making of UAV swarms is an important research direction of intelligent warfare.Taking the mission scenario of the constructed typical UAV swarm reconnaissance and strike as an example,the task allocation and motion planning of UAV swarm under the complex and uncertain conditions are studied.In order to solve this problem,the complexity of mission decision-making and the uncertainty of battlefield environment are first elaborated from the perspective of parametric design of battlefield environment model and typical swarm reconnaissance and strike mission.Then,a state space,reward function,action space and strategy network with strong versatility is designed.First,types of features are designed and processed as state space in order to capture multiple situation information.Multiple types of rewards closely related to the reconnaissance and strike task are designed at the same time.Moreover,the output of action strategy takes the form of subject-verb-object to better express the complex operations.The encoder-time series aggregation-attention mechanism-decoder structure is designed for the strategy network,which fully integrates the feature information and promotes the training effects.Then it is solved by Deep Reinforcement Learning(DRL)based on Proximal Policy Optimization(PPO).Finally,the feasibility and effectiveness of UAV swarm to realize reconnaissanc e and strike mission decision-making under the complex and uncertain conditions are verified through simulation environment experiments,meanwhile the intelligence of swarm task allocation and motion planning is demonstrated.关键词
近端策略优化算法设计/任务分配/运动规划/侦察打击/决策Key words
proximal policy optimization design/task assignment/motion planning/reconnaissance and strike/collaborative decision-making分类
军事科技引用本文复制引用
李俊慧,张振华,边疆,聂天常,车博山..面向无人机集群察打场景的PPO算法设计[J].火力与指挥控制,2024,49(3):25-34,10.