自动化学报(英文版)2021,Vol.8Issue(2):249-272,24.DOI:10.1109/JAS.2021.1003814
Multiagent Reinforcement Learning:Rollout and Policy Iteration
Multiagent Reinforcement Learning:Rollout and Policy Iteration
Dimitri Bertsekas1
作者信息
- 1. Arizona State University (ASU), Tempe, AZ 85281 USA, and also with Massachusetts Institute of Technology (MIT), Cambridge, MA 02139 USA
- 折叠
摘要
关键词
Dynamic programming/multiagent problems/neuro-dynamic programming/policy iteration/reinforcement learning/rolloutKey words
Dynamic programming/multiagent problems/neuro-dynamic programming/policy iteration/reinforcement learning/rollout引用本文复制引用
Dimitri Bertsekas..Multiagent Reinforcement Learning:Rollout and Policy Iteration[J].自动化学报(英文版),2021,8(2):249-272,24.