| 注册
首页|期刊导航|计算机应用研究|基于端到端深度强化学习求解有能力约束的车辆路径问题

基于端到端深度强化学习求解有能力约束的车辆路径问题

葛斌 田文智 夏晨星 秦望博

计算机应用研究2024,Vol.41Issue(11):3245-3250,6.
计算机应用研究2024,Vol.41Issue(11):3245-3250,6.DOI:10.19734/j.issn.1001-3695.2024.03.0101

基于端到端深度强化学习求解有能力约束的车辆路径问题

Solving capacitated vehicle routing problems based on end to end deep reinforcement learning

葛斌 1田文智 1夏晨星 2秦望博1

作者信息

  • 1. 安徽理工大学计算机科学与工程学院,安徽淮南 232001
  • 2. 安徽理工大学计算机科学与工程学院,安徽淮南 232001||合肥综合性国家科学中心能源研究院,合肥 230031
  • 折叠

摘要

Abstract

The capacitated vehicle routing problem(CVRP)is the most prevalent problem model in supply chain applications at present,and researchers often use heuristic algorithms to solve it,but the solution speed is slow and the quality of the solu-tion cannot be guaranteed.This paper proposed an end-to-end deep reinforcement learning(DRL)network framework to study the CVRP problem.Firstly,it used the edge graph attention network encoder(EGATE)to perform feature embedding enco-ding on the graph representation of VRP.Then,it designed a multi-head attention decoder(MAD)to decode the encoded graph representation.Additionally,it proposed a multi-decoding strategy to enhance the spatial diversity of the solutions.Con-tinuing with the training of the end-to-end network model using the baseline REINFORCE algorithm with a rollout baseline,the adaptive updating of the baseline was employed to enhance the effectiveness of model training.Additionally,reward function normalization and optimization using Adam optimizer were utilized to further improve the algorithm.Finally,this paper valida-ted the feasibility and effectiveness of the proposed end-to-end DRL framework through experiments on problems of different scales,comparing its performance against other algorithms.The average solution time of the trained model on the CVRPLIB public dataset is only 0.189 s to obtain a better solution.

关键词

车辆路径问题/路径规划/端到端模型/深度强化学习/基线REINFORCE算法

Key words

vehicle routing problem(VRP)/path planning/end-to-end model/deep reinforcement learning/baseline REIN-FORCE algorithm

分类

信息技术与安全科学

引用本文复制引用

葛斌,田文智,夏晨星,秦望博..基于端到端深度强化学习求解有能力约束的车辆路径问题[J].计算机应用研究,2024,41(11):3245-3250,6.

基金项目

国家重点研发计划资助项目(2020YFB1314103) (2020YFB1314103)

计算机应用研究

OA北大核心CSTPCD

1001-3695

访问量6
|
下载量0
段落导航相关论文