首页|期刊导航|计算技术与自动化|基于强化学习的带软时间窗多行程绿色车辆路径优化研究

基于强化学习的带软时间窗多行程绿色车辆路径优化研究

姚利军王可君

计算技术与自动化2024，Vol.43Issue(4)：153-160,8.

计算技术与自动化2024，Vol.43Issue(4)：153-160,8.DOI:10.16339/j.cnki.jsjsyzdh.202404025

基于强化学习的带软时间窗多行程绿色车辆路径优化研究

Reinforcement Learning Based Path Optimization for Multi-Trip Green Vehicles with Soft Time Window

姚利军 ¹王可君¹

作者信息

1. 湖南省长株潭烟草物流有限责任公司,湖南长沙 410004
折叠

摘要

Abstract

In order to assist the logistics industry in achieving its goal of peak carbon dioxide emissions and carbon neu-trality,construction must be facilitated,and a green logistics industry must be rapidly established and developed.Firstly,a multi-trip green vehicle path optimization model with soft time window constraints is constructed by comprehensively consid-ering the factors of fuel consumption,carbon emission,manpower,vehicles,and user experience.Subsequently,the Pin-SAGE graph network,TRPO,and GAE methods are considered collectively to enhance the deep reinforcement learning opti-mization algorithm of Actor-Critic.Ultimately,the Actor-Critic algorithm is employed to address the model for the green multi-trip vehicle path scheme.Experimental evidence indicates that the solution method proposed in the paper is an effective means of planning green vehicle routes,which in turn has the potential to significantly reduce logistics costs and realise the dual optimisation of economic and environmental benefits for logistics enterprises.

关键词

绿色物流/软时间窗/深度强化学习/Actor-Critic框架

Key words

green logistics/soft time window/deep reinforcement learning/Actor-Critic framework

分类

管理科学

引用本文复制引用

姚利军,王可君..基于强化学习的带软时间窗多行程绿色车辆路径优化研究[J].计算技术与自动化,2024,43(4):153-160,8.

基金项目

湖南省烟草公司长沙市公司科技项目(CS2023KJ09) （CS2023KJ09）

计算技术与自动化

OACSTPCD

ISSN：1003-6199

访问量0

下载量0

段落导航