|国家科技期刊平台
首页|期刊导航|信息与控制|基于不完备信息预测的多智能体分布式协同

基于不完备信息预测的多智能体分布式协同OA北大核心CSTPCD

Multi-agent Distributed Cooperation Based on Incomplete Information Prediction

中文摘要英文摘要

为了解决部分可观对抗环境中多智能体协同决策难题,受人大脑皮层通过记忆进行学习和推理功能启发,提出一种新的部分可观对抗环境下基于不完备信息预测的多智能体分布式协同决策框架.该框架可采用支持向量回归等多种预测方法通过历史记忆和当前观察信息对环境中不可见信息进行预测,并将预测信息和观察到的信息融合,作为协同决策的依据;再通过分布式多智能体强化学习进行协同策略学习得到团队中每个智能体的决策模型.使用该框架结合多种预测算法在典型的部分可观对抗环境中进行了多智能体协同决策的验证.结果表明,提出的框架对多种预测算法具有普适性,且在保证对不可见部分高预测精度时能将多智能体协同决策水平提升23.4%.

To solve the problem of multi-agent cooperative decision-making in a partially observable ad-versarial environment inspired by the learning and reasoning functions of the human cerebral cortex through memory,a new multi-agent distributed cooperative decision-making framework based on incomplete information prediction in a partially observable adversarial environment is proposed.The framework can use support vector regression and other prediction methods to predict invisible information in the environment through historical memory and current observed information and fuse the predicted information and the observed information as a basis of cooperative decision-making;Then,cooperative strategy learning is performed through distributed multi-agent reinforcement learning to obtain the decision-making model of each agent in the team.Multi-agent cooperative de-cision-making is verified in a typical partially observable adversarial environment using this framework and various prediction algorithms.The results show that the proposed framework is universal to va-rious prediction algorithms and can improve the level of multi-agent cooperative decision-making by 23.4%while ensuring high prediction accuracy for invisible parts.

张宏达;李德才;何玉庆

中国科学院沈阳自动化研究所机器人学国家重点实验室,辽宁沈阳 110016||中国科学院机器人与智能制造创新研究院,辽宁沈阳 110169||中国科学院大学,北京 100049中国科学院沈阳自动化研究所机器人学国家重点实验室,辽宁沈阳 110016||中国科学院机器人与智能制造创新研究院,辽宁沈阳 110169

计算机与自动化

多智能体协同部分可观信息预测分布式协同决策对抗环境

multi-agent cooperationpartially observableinformation predictiondistributed cooperative deci-sion-makingadversarial environment

《信息与控制》 2024 (001)

多无人机以及人-多机协调合作方法研究与验证

86-97 / 12

国家自然科学基金项目(61991413,91948202,91848203,91948303);辽宁省中央引导地方科技发展专项项目(2022JH6/100100009)

10.13976/j.cnki.xk.2023.2506

评论