首页|期刊导航|控制理论与应用|深度强化学习算法求解动态流水车间实时调度问题

深度强化学习算法求解动态流水车间实时调度问题

杨媛媛胡蓉钱斌张长胜金怀平

控制理论与应用2024，Vol.41Issue(6)：1047-1055,9.

控制理论与应用2024，Vol.41Issue(6)：1047-1055,9.DOI:10.7641/CTA.2023.20916

深度强化学习算法求解动态流水车间实时调度问题

Deep reinforcement learning algorithm for dynamic flow shop real-time scheduling problem

杨媛媛 ¹胡蓉 ¹钱斌 ¹张长胜 ²金怀平²

作者信息

1. 昆明理工大学信息工程与自动化学院,云南昆明 650500||昆明理工大学云南省人工智能重点实验室,云南昆明 650500
2. 昆明理工大学信息工程与自动化学院,云南昆明 650500
折叠

摘要

Abstract

This paper aims at the dynamic flow shop scheduling problem(DFSP),an adaptive deep reinforcement learning algorithm(ADRLA)is proposed to minimize the maximum completion time of DFSP.Firstly,the solving process of DFSP is described by the Markov decision process(MDP),so as to transform the DFSP into a sequential decision problem that can be solved by reinforcement learning.Then,according to the characteristics of DFSP scheduling model,the state representation vector with good state feature discrimination and generalization is designed,and five specific actions are proposed(i.e.reward value).Furthermore,the deep double Q network(DDQN)is used as the agent in ADRLA to make scheduling decisions.After training with the data set determined by a small number of small-scale DFSPs(i.e.the data of three basic elements on different problems),the agent can accurately describe the nonlinear relationship between the state representation vector and the Q-value vector(composed of the Q-value of each action)of different scale DFSPs,so as to carry out adaptive real-time scheduling for various scale DFSPs.Finally,simulation experiments on different test problems and comparison with the algorithm verify the effectiveness and real-time performance of the proposed ADRLA in solving DFSP.

关键词

流水车间调度/新工件到达/深度强化学习/动态实时调度/智能调度

Key words

flow shop scheduling/arrival of new jobs/deep reinforcement learning/dynamic real-time scheduling/intelligent scheduling

引用本文复制引用

杨媛媛,胡蓉,钱斌,张长胜,金怀平..深度强化学习算法求解动态流水车间实时调度问题[J].控制理论与应用,2024,41(6):1047-1055,9.

基金项目

国家自然科学基金项目(62173169,61963022),云南省基础研究重点项目(202201AS070030)资助.Supported by the National Natural Science Foundation of China(62173169,61963022)and the Basic Research Key Project of Yunnan Province Province(202201AS070030). （62173169,61963022）

控制理论与应用

OA北大核心CSTPCD

ISSN：1000-8152

访问量0

下载量0

段落导航