Multi-agent Distributed Cooperation Based on Incomplete Information Prediction (基于不完备信息预测的多智能体分布式协同)

Information and Control (信息与控制), 2024, Vol. 53, Issue 1: 86-97, 12. DOI: 10.13976/j.cnki.xk.2023.2506

Zhang Hongda (张宏达) 1, Li Decai (李德才) 2, He Yuqing (何玉庆) 2

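Author information

  • 1. State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, Liaoning 110016; Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang, Liaoning 110169; University of Chinese Academy of Sciences, Beijing 100049
  • 2. State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, Liaoning 110016; Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang, Liaoning 110169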

Abstract

To address multi-agent cooperative decision-making in partially observable adversarial environments, and inspired by the way the human cerebral cortex learns and reasons through memory, a new multi-agent distributed cooperative decision-making framework based on incomplete information prediction is proposed. The framework uses support vector regression and other prediction methods to predict the invisible information in the environment from historical memory and currently observed information, and fuses the predicted and observed information as the basis for cooperative decision-making. Cooperative strategy learning is then performed through distributed multi-agent reinforcement learning to obtain the decision-making model of each agent in the team. The framework is verified on multi-agent cooperative decision-making in a typical partially observable adversarial environment using various prediction algorithms. The results show that the proposed framework is applicable to a variety of prediction algorithms and can improve the level of multi-agent cooperative decision-making by 23.4% while ensuring high prediction accuracy for the invisible parts.
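The predict-then-fuse step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the class `MemoryPredictor`, the function `fuse`, and the dictionary-based observation format are all hypothetical, and a simple constant-velocity extrapolation stands in for support vector regression so that the example needs only the Python standard library. The fused output is the kind of input a per-agent policy might consume.

```python
from collections import deque
from typing import Deque, Dict, Optional, Tuple

Position = Tuple[float, float]


class MemoryPredictor:
    """Hypothetical helper: keeps a short history per entity and
    extrapolates positions that are currently not observable.

    Assumption: linear (constant-velocity) extrapolation replaces the
    support vector regression mentioned in the abstract.
    """

    def __init__(self, horizon: int = 4) -> None:
        self.horizon = horizon
        self.history: Dict[str, Deque[Position]] = {}

    def update(self, entity: str, position: Position) -> None:
        # Append to a bounded history so old samples are dropped.
        track = self.history.setdefault(entity, deque(maxlen=self.horizon))
        track.append(position)

    def predict(self, entity: str) -> Optional[Position]:
        track = self.history.get(entity)
        if not track:
            return None  # never seen: nothing to predict from
        if len(track) == 1:
            return track[-1]  # single sample: assume it has not moved
        (x0, y0), (x1, y1) = track[-2], track[-1]
        return (2 * x1 - x0, 2 * y1 - y0)


def fuse(
    observation: Dict[str, Optional[Position]],
    predictor: MemoryPredictor,
) -> Dict[str, Tuple[Position, bool]]:
    """Merge observed and predicted positions.

    Each value is (position, was_observed). Entities that are invisible
    and have no history are omitted.
    """
    fused: Dict[str, Tuple[Position, bool]] = {}
    for entity, position in observation.items():
        if position is not None:
            fused[entity] = (position, True)
        else:
            guess = predictor.predict(entity)
            if guess is None:
                continue
            fused[entity] = (guess, False)
        # Store the value used this step so later predictions build on it.
        predictor.update(entity, fused[entity][0])
    return fused


if __name__ == "__main__":
    predictor = MemoryPredictor()
    steps = [{"enemy": (0.0, 0.0)}, {"enemy": (1.0, 0.5)}, {"enemy": None}]
    for obs in steps:
        print(fuse(obs, predictor))
```

The boolean flag lets a downstream policy distinguish measured values from predicted ones; whether the paper's framework exposes such a flag is not stated in the abstract.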

Key words

multi-agent cooperation/partially observable/information prediction/distributed cooperative decision-making/adversarial environment

Classification

Information technology and security science

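Cite this article

Zhang Hongda, Li Decai, He Yuqing. Multi-agent distributed cooperation based on incomplete information prediction [J]. Information and Control, 2024, 53(1): 86-97, 12.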

Funding

National Natural Science Foundation of China (61991413, 91948202, 91848203, 91948303)

Liaoning Province Special Project of Central Government Guiding Local Science and Technology Development (2022JH6/100100009)

Information and Control (信息与控制)

OA; Peking University Core Journal (北大核心); CSTPCD

ISSN 1002-0411
