自动化学报2023,Vol.49Issue(11):2237-2256,20.DOI:10.16383/j.aas.c220648
异策略深度强化学习中的经验回放研究综述
Research on Experience Replay of Off-policy Deep Reinforcement Learning:A Review
摘要
关键词
深度强化学习/异策略/经验回放/人工智能Key words
Deep reinforcement learning(DRL)/off-policy/experience replay(ER)/artificial intelligence引用本文复制引用
胡子剑,高晓光,万开方,张乐天,汪强龙,NERETIN Evgeny..异策略深度强化学习中的经验回放研究综述[J].自动化学报,2023,49(11):2237-2256,20.基金项目
国家自然科学基金(62003267,61573285),中央高校基本科研业务费专项资金(G2022KY0602),电磁空间作战与应用重点实验室(2022ZX0090),西安市科技计划项目——关键核心技术攻关工程项目计划(21RGZN0016),陕西省重点研发计划项目(2023-GHZD-33)资助 Supported by National Natural Science Foundation of China(62003267,61573285),the Fundamental Research Funds for the Central Universities(G2022KY0602),the Technology on Electro-magnetic Space Operations and Applications Laboratory(2022ZX0090),the Key Core Technology Research Plan of Xi'an(21RGZN0016),and the Key Research and Development Pro-gram of Shaanxi Province(2023-GHZD-33) (62003267,61573285)