Multiagent Reinforcement Learning:Rollout and Policy IterationOACSCD
Multiagent Reinforcement Learning:Rollout and Policy Iteration
Dimitri Bertsekas
Arizona State University (ASU), Tempe, AZ 85281 USA, and also with Massachusetts Institute of Technology (MIT), Cambridge, MA 02139 USA
Dynamic programmingmultiagent problemsneuro-dynamic programmingpolicy iterationreinforcement learningrollout
Dynamic programmingmultiagent problemsneuro-dynamic programmingpolicy iterationreinforcement learningrollout
《自动化学报(英文版)》 2021 (2)
249-272,24