Discovering Latent Variables for the Tasks With Confounders in Multi-Agent Reinforcement LearningOACSTPCDEI
Discovering Latent Variables for the Tasks With Confounders in Multi-Agent Reinforcement Learning
Kun Jiang;Wenzhang Liu;Yuanda Wang;Lu Dong;Changyin Sun
School of Automation,Southeast University,Nanjing 210096,ChinaSchool of Artificial Intelligence,Anhui University,Hefei 230601,ChinaSchool of Automation,Southeast University,Nanjing 210096,ChinaSchool of Cyber Science and Engineering,Southeast University,Nanjing 211189,ChinaSchool of Automation,Southeast University,Nanjing 210096||Engineering Research Center of Autonomous Unmanned System Technology,Ministry of Education,Hefei 230601,China
Latent variable modelmaximum entropymulti-agent reinforcement learning(MARL)multi-agent system
Latent variable modelmaximum entropymulti-agent reinforcement learning(MARL)multi-agent system
《自动化学报(英文版)》 2024 (7)
1591-1604,14
This work was supported in part by the National Natural Science Foundation of China(62136008,62236002,61921004,62173251,62103104),the"Zhishan"Scholars Programs of Southeast University,and the Fundamental Research Funds for the Central Universities(2242023K30034).
评论