强化学习中的注意力机制研究综述OA北大核心CSTPCD

Review of Attention Mechanisms in Reinforcement Learning

中文摘要

英文摘要

近年来,强化学习与注意力机制的结合在算法研究领域备受瞩目.在强化学习算法中,注意力机制的应用在提高算法性能方面发挥了重要作用.重点聚焦于注意力机制在深度强化学习中的发展,审视了其在多智能体强化学习领域的应用,并对相关研究成果进行调研.首先,介绍了注意力机制和强化学习的研究背景与发展历程,并调研了该领域中的相关实验平台;然后,回顾了强化学习与注意力机制的经典算法,并从不同角度对注意力机制进行分类;接着,对注意力机制在强化学习领域的应用进行了梳理,根据三种任务类型(完全合作型、完全竞争型和混合合作竞争型)进行分类分析,重点关注了多智能体领域的应用情况;最后,总结了注意力机制对强化学习算法的改进作用,并展望了该领域所面临的挑战和未来的研究前景.

In recent years,the combination of reinforcement learning and attention mechanisms has attracted an increasing attention in algorithmic research field.Attention mechanisms play an important role in improving the performance of algorithms in reinforcement learning.This paper mainly focuses on the development of attention mechanisms in deep reinforcement learning and examining their applications in the multi-agent reinforcement learning domain.Relevant researches are conducted accordingly.Firstly,the background and development of attention mechanisms and reinforcement learning are introduced,and relevant experimental platforms in this field are also presented.Secondly,classical algorithms of reinforcement learning and attention mechanisms are reviewed and attention mechanism is categorized from different perspectives.Thirdly,practical applications of attention mechanisms in the reinforcement field are sorted out based on three types of tasks including fully cooperative,fully competitive and mixed,with focus on the application in the field of multi-agent.Finally,the improvement of attention mechanisms on reinforcement learning algorithms is summarized.The challenges and future prospects in this field are discussed.

作者：夏庆锋;许可儿;李明阳;胡凯;宋利鹏;宋志强;孙宁

作者单位：无锡学院自动化学院,江苏无锡 214105南京信息工程大学自动化学院,南京 210044

分类：计算机与自动化

中文关键词：强化学习注意力机制多智能体系统

英文关键词：reinforcement learningattention mechanismmulti-agent system

刊名：《计算机科学与探索》 2024 (006)

页码/页数：1457-1475 / 19

基金：辽宁省科技厅机器人学国家重点实验室联合开放基金(2021-KF-22-19).This work was supported by the Joint Fund of Science&Technology Department of Liaoning Province and State Key Laboratory of Robotics(2021-KF-22-19).

DOI：10.3778/j.issn.1673-9418.2312006

强化学习中的注意力机制研究综述OA北大核心CSTPCD

Review of Attention Mechanisms in Reinforcement Learning

评论