首页|期刊导航|控制理论与应用|基于输出反馈逆强化Q学习的线性二次型最优控制方法

基于输出反馈逆强化Q学习的线性二次型最优控制方法

刘文范家璐薛文倩

控制理论与应用2024，Vol.41Issue(8)：1469-1479,11.

控制理论与应用2024，Vol.41Issue(8)：1469-1479,11.DOI:10.7641/CTA.2023.20551

基于输出反馈逆强化Q学习的线性二次型最优控制方法

Linear quadratic optimal control method based on output feedback inverse reinforcement Q-learning

刘文 ¹范家璐 ¹薛文倩¹

作者信息

1. 东北大学流程工业综合自动化国家重点实验室,辽宁沈阳 110819
折叠

摘要

Abstract

In this paper,a data-driven output feedback optimal control method using inverse reinforcement Q-learning for linear quadratic optimal control problem of linear discrete-time systems with unknown model parameters and unmea-surable states is proposed.Only input and output data are used to adaptively determine the values of appropriate quadratic performance index weights and optimal control law,so that the system exhibits the same trajectories as the reference tra-jectories.Firstly,an equation for parameter correction is proposed,by combining which with inverse optimal control,a model-based inverse reinforcement learning based optimal control method framework is proposed to compute the cor-rection of the output feedback control law and performance index weights.On this basis,this paper introduces the idea of reinforcement Q-learning and a data-driven output feedback inverse reinforcement Q-learning optimal control method is eventually proposed,which does not require system model parameters,but uses only historical input and output data to solve output feedback control law parameter and performance index weights.The theoretical analysis and simulation experiments are provided to verify the effectiveness of the proposed method.

关键词

逆强化学习/Q学习/输出反馈/数据驱动最优控制

Key words

inverse reinforcement learning/Q-learning/output feedback/data-driven optimal control

引用本文复制引用

刘文,范家璐,薛文倩..基于输出反馈逆强化Q学习的线性二次型最优控制方法[J].控制理论与应用,2024,41(8):1469-1479,11.

基金项目

国家自然科学基金重大项目(61991400),辽宁省"兴辽英才计"项目(XLYC2007135)资助.Supported by the National Natural Science Foundation of China(61991400)and the Liaoning Revitalization Talents Program(XLYC2007135). （61991400）

控制理论与应用

OA北大核心CSTPCD

ISSN：1000-8152

访问量0

下载量0

段落导航