Multiagent Reinforcement Learning: Rollout and Policy Iteration

Dimitri Bertsekas

Arizona State University (ASU), Tempe, AZ 85281 USA, and also with Massachusetts Institute of Technology (MIT), Cambridge, MA 02139 USA

Dynamic programming; multiagent problems; neuro-dynamic programming; policy iteration; reinforcement learning; rollout

IEEE/CAA Journal of Automatica Sinica (Acta Automatica Sinica, English edition), 2021 (2)

pp. 249-272 (24 pages)

10.1109/JAS.2021.1003814