多智能体强化学习算法研究综述

李明阳 许可儿 宋志强 夏庆锋 周鹏

计算机科学与探索 (Journal of Frontiers of Computer Science and Technology), 2024, Vol. 18, Issue 8: 1979-1997 (19 pages). DOI: 10.3778/j.issn.1673-9418.2401020

Review of Research on Multi-agent Reinforcement Learning Algorithms

李明阳¹, 许可儿¹, 宋志强², 夏庆锋², 周鹏¹

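Author information

  • 1. School of Automation, Nanjing University of Information Science and Technology, Nanjing 210044
  • 2. School of Automation, Wuxi University, Wuxi, Jiangsu 214105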

Abstract

In recent years, multi-agent reinforcement learning algorithms have been widely used in the field of artificial intelligence. This paper systematically analyzes multi-agent reinforcement learning algorithms, examines their application and progress in multi-agent systems, and explores the relevant research results in depth. First, it introduces the research background and development history of multi-agent reinforcement learning and summarizes existing research results. Second, it briefly reviews the application of traditional reinforcement learning algorithms to different tasks. Then, it highlights the classification of multi-agent reinforcement learning algorithms and their application in multi-agent systems according to three main types of tasks (path planning, pursuit-evasion games, and task allocation), together with the associated challenges and solutions. Finally, it surveys existing algorithm training environments in the multi-agent field, summarizes the improvements that deep learning brings to multi-agent reinforcement learning algorithms, identifies open challenges, and looks ahead to future research directions in this field.

Key words

agent/reinforcement learning/multi-agent reinforcement learning/multi-agent systems

Classification

Information Technology and Security Science

Cite this article

李明阳, 许可儿, 宋志强, 夏庆锋, 周鹏. 多智能体强化学习算法研究综述[J]. 计算机科学与探索, 2024, 18(8): 1979-1997, 19.

Funding

This work was supported by the Industry-University-Research Cooperation Project of Jiangsu Province (BY2021238) and the Talent Start-up Funding Project of Wuxi University (2021r001).

计算机科学与探索

Open access; indexed in 北大核心 (Peking University Core Journals list) and CSTPCD

ISSN: 1673-9418
