| 注册
首页|期刊导航|计算机应用与软件|基于MAPPO-RDL的多专家协同评估规则决策框架

基于MAPPO-RDL的多专家协同评估规则决策框架

朱玺 江杨靖 何茂贤

计算机应用与软件2026,Vol.43Issue(4):225-231,7.
计算机应用与软件2026,Vol.43Issue(4):225-231,7.DOI:10.3969/j.issn.1000-386x.2026.04.032

基于MAPPO-RDL的多专家协同评估规则决策框架

A RULE-BASED DECISION FRAMEWORK FOR MULTI-EXPERT COLLABORATIVE EVALUATION BASED ON MAPPO

朱玺 1江杨靖 2何茂贤2

作者信息

  • 1. 上海市公安局 上海 200042
  • 2. 上海计算机软件技术开发中心 上海 201112
  • 折叠

摘要

Abstract

In the context of digital transformation,project evaluation tasks face challenges such as rapidly growing data scales and significantly increased content complexity.Retrieval-augmented generation(RAG)introduces an external knowledge base for vector retrieval before generating model responses.However,to achieve high accuracy,the RAG process incorporates complex branches,multi-layer filtering,and rule matching,resulting in cumbersome workflows and high deployment barriers.To address these issues,this paper proposes a rule-based decision framework for multi-expert collaborative evaluation based on MAPPO(Multi-Agent Proximal Policy Optimization).This framework included modeling the project evaluation process and implementing a multi-dimensional reward mechanism to dynamically optimize rule selection and retrieval strategies.An attention-based decision layer was designed to enhance the model's focus on key rules and knowledge fragments.Experimental results show that the proposed method demonstrates stable convergence trends across rule sets of various scales.Visualization analyses reveal that in both early and later stages,different experts gradually develop complementary and differentiated choices,highlighting significant semantic division of labor and interpretability.

关键词

项目评估/强化学习/MAPPO/注意力机制

Key words

Project evaluation/Reinforcement learning/MAPPO/Attention mechanism

分类

信息技术与安全科学

引用本文复制引用

朱玺,江杨靖,何茂贤..基于MAPPO-RDL的多专家协同评估规则决策框架[J].计算机应用与软件,2026,43(4):225-231,7.

计算机应用与软件

1000-386X

访问量0
|
下载量0
段落导航相关论文