| 注册
首页|期刊导航|南京航空航天大学学报(英文版)|基于策略迭代的空间系绳载荷捕获自适应最优控制

基于策略迭代的空间系绳载荷捕获自适应最优控制

冯毅庭 张鸣 郭闻昊 王长青

南京航空航天大学学报(英文版)2021,Vol.38Issue(4):560-570,11.
南京航空航天大学学报(英文版)2021,Vol.38Issue(4):560-570,11.

基于策略迭代的空间系绳载荷捕获自适应最优控制

Adaptive Optimal Control of Space Tether System for Payload Capture via Policy Iteration

冯毅庭 1张鸣 2郭闻昊 2王长青1

作者信息

  • 1. 西北工业大学自动化学院,西安 710129,中国
  • 2. 北京宇航系统工程研究所,北京 100076,中国
  • 折叠

摘要

Abstract

The libration control problem of space tether system(STS) for post-capture of payload is studied. The process of payload capture will cause tether swing and deviation from the nominal position,resulting in the failure of capture mission. Due to unknown inertial parameters after capturing the payload,an adaptive optimal control based on policy iteration is developed to stabilize the uncertain dynamic system in the post-capture phase. By introducing integral reinforcement learning(IRL)scheme,the algebraic Riccati equation(ARE)can be online solved without known dynamics. To avoid computational burden from iteration equations,the online implementation of policy iteration algorithm is provided by the least-squares solution method. Finally,the effectiveness of the algorithm is validated by numerical simulations.

关键词

空间系绳系统/载荷捕获/策略迭代/积分强化学习/状态反馈

Key words

space tether system(STS)/payload capture/policy iteration/integral reinforcement learning(IRL)/state feedback

分类

航空航天

引用本文复制引用

冯毅庭,张鸣,郭闻昊,王长青..基于策略迭代的空间系绳载荷捕获自适应最优控制[J].南京航空航天大学学报(英文版),2021,38(4):560-570,11.

基金项目

This work was supported by the Nation-al Natural Science Foundation of China(No.62111530051),the Fundamental Research Funds for the Central Universi-ties(No.3102017JC06002)and the Shaanxi Science and Technology Program,China(No.2017KW-ZD-04). (No.62111530051)

南京航空航天大学学报(英文版)

OACSCDCSTPCD

1005-1120

访问量0
|
下载量0
段落导航相关论文