首页|期刊导航|自动化学报(英文版)|QuadQ:Quadratic-Based Value Decomposition for Cooperative Policy Optimization in Multi-Agent Reinforcement Learning
自动化学报(英文版)2026,Vol.13Issue(3):728-730,3.DOI:10.1109/JAS.2025.125666
QuadQ:Quadratic-Based Value Decomposition for Cooperative Policy Optimization in Multi-Agent Reinforcement Learning
QuadQ:Quadratic-Based Value Decomposition for Cooperative Policy Optimization in Multi-Agent Reinforcement Learning
摘要
引用本文复制引用
Siying Wang,Ruoning Zhang,Yang Zhou,Jinliang Shao,Yuhua Cheng..QuadQ:Quadratic-Based Value Decomposition for Cooperative Policy Optimization in Multi-Agent Reinforcement Learning[J].自动化学报(英文版),2026,13(3):728-730,3.基金项目
This work was supported in part by the National Natural Science Foundation of China(62273077),the Natu-ral Science Foundation of Sichuan Province(2024NSFJQ0013),and the Sichuan Science and Technology Program(2025ZDZX0006). (62273077)