Guidance Law Design Based on Proximal Policy Optimization (基于近端策略优化的制导律设计)

LI Mengxuan (李梦璇)¹, GUO Jianguo (郭建国)¹, XU Xinpeng (许新鹏)², SHEN Yuheng (沈昱恒)²

Air & Space Defense (空天防御), 2023, Vol. 6, Issue 4: 51-57 (7 pages).

Guidance Law Based on Proximal Policy Optimization

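Author information

1. Institute of Precision Guidance and Control, Northwestern Polytechnical University, Xi'an 710072, Shaanxi, China
2. Shanghai Electro-Mechanical Engineering Institute, Shanghai 201109, China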

Abstract

Guidance law design is critical to an interception system. The accuracy of the commonly used variable structure guidance law degrades when intercepting targets with complex manoeuvres, and chattering occurs frequently. This paper proposes a guidance law design method based on proximal policy optimization (PPO). The guidance problem of intercepting a manoeuvring target is formulated as a Markov decision process, with a reward function that evaluates miss distance and line-of-sight angular rate chattering. Comparative experiments show that the PPO-based guidance law with continuous output intercepts more effectively and suppresses the chattering seen in the sliding mode guidance law, indicating research promise and potential application value.
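
As a rough illustration of the two ingredients named in the abstract, the sketch below shows the standard PPO clipped surrogate term for a single sample, alongside one possible reward shaping that penalises line-of-sight (LOS) rate chattering and rewards a small miss distance. The abstract does not give the paper's actual reward, so `step_reward`, `terminal_reward`, and all weights and scales here are hypothetical assumptions, not the authors' formulation.

```python
import math


def ppo_clip_objective(ratio, advantage, eps=0.2):
    """Standard PPO clipped surrogate for one sample:
    min(r * A, clip(r, 1 - eps, 1 + eps) * A)."""
    clipped = max(1.0 - eps, min(1.0 + eps, ratio))
    return min(ratio * advantage, clipped * advantage)


def step_reward(los_rate, prev_los_rate, w_rate=1.0, w_chatter=0.5):
    """Hypothetical per-step reward (assumption, not from the paper):
    penalise the LOS rate magnitude and its step-to-step change,
    the latter serving as a simple proxy for chattering."""
    return -w_rate * abs(los_rate) - w_chatter * abs(los_rate - prev_los_rate)


def terminal_reward(miss_distance, scale=10.0):
    """Hypothetical terminal reward (assumption, not from the paper):
    equals 1 at zero miss and decays exponentially with miss distance."""
    return math.exp(-miss_distance / scale)
```

With these assumed forms, an LOS rate that flips sign between steps is rewarded less than one that holds steady at the same magnitude, which is the behaviour a chattering penalty is meant to encourage.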

Keywords

guidance law; reinforcement learning; sliding mode control; proximal policy optimization

Classification

Information Technology and Security Science

Cite this article

LI Mengxuan, GUO Jianguo, XU Xinpeng, SHEN Yuheng. Guidance Law Based on Proximal Policy Optimization[J]. Air & Space Defense, 2023, 6(4): 51-57.

Funding

National Natural Science Foundation of China (61973254)

Practice and Innovation Funds for Graduate Students of Northwestern Polytechnical University (PF2023044)

Air & Space Defense (空天防御)

ISSN: 2096-4641
