| 注册
首页|期刊导航|自动化学报(英文版)|QuadQ:Quadratic-Based Value Decomposition for Cooperative Policy Optimization in Multi-Agent Reinforcement Learning

QuadQ:Quadratic-Based Value Decomposition for Cooperative Policy Optimization in Multi-Agent Reinforcement Learning

Siying Wang Ruoning Zhang Yang Zhou Jinliang Shao Yuhua Cheng

自动化学报(英文版)2026,Vol.13Issue(3):728-730,3.
自动化学报(英文版)2026,Vol.13Issue(3):728-730,3.DOI:10.1109/JAS.2025.125666

QuadQ:Quadratic-Based Value Decomposition for Cooperative Policy Optimization in Multi-Agent Reinforcement Learning

QuadQ:Quadratic-Based Value Decomposition for Cooperative Policy Optimization in Multi-Agent Reinforcement Learning

Siying Wang 1Ruoning Zhang 2Yang Zhou 2Jinliang Shao 3Yuhua Cheng1

作者信息

  • 1. School of Automation Engineering,University of Electronic Science and Technology of China,Chengdu 611731 China
  • 2. School of Computer Science and Engineering (School of Cyber Security),University of Electronic Science and Technology of China,Chengdu 611731,China
  • 3. School of Automation Engineering,University of Electronic Science and Technology of China,Chengdu 611731 and also with the Tianfu Jiangxi Laboratory,Sichuan 641419,China
  • 折叠

摘要

引用本文复制引用

Siying Wang,Ruoning Zhang,Yang Zhou,Jinliang Shao,Yuhua Cheng..QuadQ:Quadratic-Based Value Decomposition for Cooperative Policy Optimization in Multi-Agent Reinforcement Learning[J].自动化学报(英文版),2026,13(3):728-730,3.

基金项目

This work was supported in part by the National Natural Science Foundation of China(62273077),the Natu-ral Science Foundation of Sichuan Province(2024NSFJQ0013),and the Sichuan Science and Technology Program(2025ZDZX0006). (62273077)

自动化学报(英文版)

2329-9266

访问量0
|
下载量0
段落导航相关论文