自动化学报2023,Vol.49Issue(9):1813-1835,23.DOI:10.16383/j.aas.c220631
安全强化学习综述
Safe Reinforcement Learning:A Survey
摘要
关键词
安全强化学习/约束马尔科夫决策过程/学习过程/学习目标/离线强化学习Key words
Safe reinforcement learning(SRL)/constrained Markov decision process(CMDP)/learning process/learning objective/offline reinforcement learning引用本文复制引用
王雪松,王荣荣,程玉虎..安全强化学习综述[J].自动化学报,2023,49(9):1813-1835,23.基金项目
国家自然科学基金(62176259,61976215),江苏省重点研发计划项目(BE2022095)资助Supported by National Natural Science Foundation of China(62176259,61976215)and Key Research and Development Pro-gram of Jiangsu Province(BE2022095) (62176259,61976215)