南京航空航天大学学报(英文版)2021,Vol.38Issue(4):560-570,11.
基于策略迭代的空间系绳载荷捕获自适应最优控制
Adaptive Optimal Control of Space Tether System for Payload Capture via Policy Iteration
摘要
Abstract
The libration control problem of space tether system(STS) for post-capture of payload is studied. The process of payload capture will cause tether swing and deviation from the nominal position,resulting in the failure of capture mission. Due to unknown inertial parameters after capturing the payload,an adaptive optimal control based on policy iteration is developed to stabilize the uncertain dynamic system in the post-capture phase. By introducing integral reinforcement learning(IRL)scheme,the algebraic Riccati equation(ARE)can be online solved without known dynamics. To avoid computational burden from iteration equations,the online implementation of policy iteration algorithm is provided by the least-squares solution method. Finally,the effectiveness of the algorithm is validated by numerical simulations.关键词
空间系绳系统/载荷捕获/策略迭代/积分强化学习/状态反馈Key words
space tether system(STS)/payload capture/policy iteration/integral reinforcement learning(IRL)/state feedback分类
航空航天引用本文复制引用
冯毅庭,张鸣,郭闻昊,王长青..基于策略迭代的空间系绳载荷捕获自适应最优控制[J].南京航空航天大学学报(英文版),2021,38(4):560-570,11.基金项目
This work was supported by the Nation-al Natural Science Foundation of China(No.62111530051),the Fundamental Research Funds for the Central Universi-ties(No.3102017JC06002)and the Shaanxi Science and Technology Program,China(No.2017KW-ZD-04). (No.62111530051)