| 注册
首页|期刊导航|空天防御|基于分层强化学习的低过载比拦截制导律

基于分层强化学习的低过载比拦截制导律

王旭 蔡远利 张学成 张荣良 韩成龙

空天防御2024,Vol.7Issue(1):40-47,8.
空天防御2024,Vol.7Issue(1):40-47,8.

基于分层强化学习的低过载比拦截制导律

Intercept Guidance Law with a Low Acceleration Ratio Based on Hierarchical Reinforcement Learning

王旭 1蔡远利 1张学成 2张荣良 3韩成龙3

作者信息

  • 1. 西安交通大学 电子与信息学部,陕西 西安 710049
  • 2. 陆军装备部驻上海地区第三军事代表室,上海 200031
  • 3. 上海机电工程研究所,上海 201109
  • 折叠

摘要

Abstract

This paper has proposed an intercept guidance law based on hierarchical reinforcement learning to solve the three-dimensional maneuvering target intercept guidance problem with constraints of low acceleration ratio and bearings-only measurement.The aforementioned problem was initially modelled using a Markov decision process model,where a heuristic reward function was applied considering both the energy consumption and the missile-to-target line of sight(LOS)angular rate.Besides,the policy of two levels was built up with the lower-level policy generating the required guidance command and being supervised by subgoals that were instructed by the higher levels,allowing the convergence of the LOS angular rate and guaranteeing the successful interception against a maneuvering target.Simulation results have validated the superiority of the proposed method compared with the augmented proportional navigation guidance law in terms of intercept accuracy and hit probability,and its required acceleration ratio is much lower.

关键词

末制导/机动目标拦截/低过载比/分层强化学习

Key words

guidance law/maneuvering target intercept/low acceleration ratio/hierarchical reinforcement learning

分类

军事科技

引用本文复制引用

王旭,蔡远利,张学成,张荣良,韩成龙..基于分层强化学习的低过载比拦截制导律[J].空天防御,2024,7(1):40-47,8.

基金项目

国家自然科学基金项目(62203349,12302061) (62203349,12302061)

空天防御

2096-4641

访问量6
|
下载量0
段落导航相关论文