| 注册
首页|期刊导航|自动化学报|基于分层策略强化学习的多类型流量差异化路由优化

基于分层策略强化学习的多类型流量差异化路由优化

赵之栩 刘坤 王璐瑶 夏元清

自动化学报2026,Vol.52Issue(4):709-723,15.
自动化学报2026,Vol.52Issue(4):709-723,15.DOI:10.16383/j.aas.c250413

基于分层策略强化学习的多类型流量差异化路由优化

Differentiated Routing Optimization for Multi-type Traffic Based on Hierarchical Policy Reinforcement Learning

赵之栩 1刘坤 1王璐瑶 1夏元清1

作者信息

  • 1. 北京理工大学自动化学院自主智能无人系统全国重点实验室 北京 100081
  • 折叠

摘要

Abstract

Routing is an important method for optimizing network resource allocation.However,traditional rout-ing algorithms rely on static strategies to optimize single quality of service metrics,making it difficult to address the differentiated requirements of explosive growth in multi-type traffic.Although deep reinforcement learning has provided new ideas for routing optimization in dynamic network environments,existing methods still lack fine-grained perception of traffic types and cannot flexibly adjust routing strategies.To this end,this paper designs a traffic-aware routing algorithm based on hierarchical policy reinforcement learning for the differentiated routing re-quirements of different traffic types.First,a traffic classification module is introduced to achieve fine-grained per-ception of the differentiated service requirements of different traffic.Second,graph convolutional networks are used to efficiently model the network topology,based on which a hierarchical decision network and a differentiated re-ward function are designed to guide the agent to generate adaptive routing decisions and realize dynamic adjust-ment of routing strategies for each traffic category.Meanwhile,a global attention mechanism is introduced into the actor-critic framework to enhance the agent's ability to model the spatio-temporal dependency of network states,and the training efficiency and stability are improved through generalized advantage estimation and proximal policy optimization algorithms.Finally,the effectiveness of the proposed algorithm is verified on various network topologies.

关键词

多类型流量/深度强化学习/注意力机制/差异化路由/QoS优化

Key words

multi-type traffic/deep reinforcement learning/attention mechanism/differentiated routing/quality of service optimization

引用本文复制引用

赵之栩,刘坤,王璐瑶,夏元清..基于分层策略强化学习的多类型流量差异化路由优化[J].自动化学报,2026,52(4):709-723,15.

自动化学报

0254-4156

访问量0
|
下载量0
段落导航相关论文