首页|期刊导航|航空兵器|融合DDQN与示教学习的高超声速飞行器智能姿态控制方法研究

融合DDQN与示教学习的高超声速飞行器智能姿态控制方法研究

刘静文蔡光斌凡永华樊红东吴彤尚逸鸣

航空兵器2024，Vol.31Issue(6)：50-56,7.

航空兵器2024，Vol.31Issue(6)：50-56,7.DOI:10.12132/ISSN.1673-5048.2024.0130

融合DDQN与示教学习的高超声速飞行器智能姿态控制方法研究

Intelligent Attitude Control of Hypersonic Vehicle Based on DDQN and Deep Q-Learning from Demonstrations

刘静文 ¹蔡光斌 ¹凡永华 ²樊红东 ¹吴彤 ¹尚逸鸣¹

作者信息

1. 火箭军工程大学导弹工程学院,西安 710025
2. 西北工业大学航天学院,西安 710072
折叠

摘要

Abstract

In order to improve the speed and accuracy of solving the attitude control problem of hypersonic vehi-cle,an intelligent attitude control method of hypersonic vehicle based on demonstration learning is proposed.Firstly,the control model of hypersonic vehicle is established,and the appropriate action is selected as the attitude control out-put.Secondly,an algorithm based on DDQN(Double Deep Q-Network)and DQfD(Deep Q-learning from Demonstra-tions)is designed,which divides the training of agents into two stages:pre-training and formal training.In the pre-training stage,the agent extracts small batch data from the demonstration data,and applies four loss functions to update the neural network.In the formal training phase,samples are taken from the data generated by its own training and demonstration data,and the proportion of two types of data in each small batch is controlled through priority experience replay buffer.Learning through interaction with the environment,so that the hypersonic vehicle can adaptively adjust its attitude according to changes in the flight environment.The simulation results show that the reinforcement learning method based on demonstration data can track control command,realize the attitude control of hypersonic vehicle,and improve the performance of neural network in the early stage of training,with a higher average reward.

关键词

高超声速飞行器/姿态控制/强化学习/示教学习/DDQN

Key words

hypersonic vehicle/attitude control/reinforcement learning/learning from demonstrations/DDQN

分类

军事科技

引用本文复制引用

刘静文,蔡光斌,凡永华,樊红东,吴彤,尚逸鸣..融合DDQN与示教学习的高超声速飞行器智能姿态控制方法研究[J].航空兵器,2024,31(6):50-56,7.

基金项目

国家自然科学基金项目(61773387) （61773387）

航空兵器

OA北大核心CSTPCD

ISSN：1673-5048

访问量0

下载量0

段落导航