QuadQ: Quadratic-Based Value Decomposition for Cooperative Policy Optimization in Multi-Agent Reinforcement Learning

IEEE/CAA Journal of Automatica Sinica, 2026, Vol. 13, Issue 3: 728-730 (3 pages). DOI: 10.1109/JAS.2025.125666

Siying Wang¹, Ruoning Zhang², Yang Zhou², Jinliang Shao³, Yuhua Cheng¹

Author information

1. School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
2. School of Computer Science and Engineering (School of Cyber Security), University of Electronic Science and Technology of China, Chengdu 611731, China
3. School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu 611731, and also with the Tianfu Jiangxi Laboratory, Sichuan 641419, China
Cite this article

Siying Wang, Ruoning Zhang, Yang Zhou, Jinliang Shao, Yuhua Cheng. QuadQ: Quadratic-Based Value Decomposition for Cooperative Policy Optimization in Multi-Agent Reinforcement Learning [J]. IEEE/CAA Journal of Automatica Sinica, 2026, 13(3): 728-730.

Funding

This work was supported in part by the National Natural Science Foundation of China (62273077), the Natural Science Foundation of Sichuan Province (2024NSFJQ0013), and the Sichuan Science and Technology Program (2025ZDZX0006).

IEEE/CAA Journal of Automatica Sinica

ISSN: 2329-9266
