| 注册
首页|期刊导航|工业工程|基于近端策略优化算法的带批处理机的混合流水车间在线调度方法

基于近端策略优化算法的带批处理机的混合流水车间在线调度方法

柳再为 王明伟 袁媛 刘齐浩 李新宇

工业工程2025,Vol.28Issue(2):78-90,13.
工业工程2025,Vol.28Issue(2):78-90,13.DOI:10.3969/j.issn.1007-7375.240122

基于近端策略优化算法的带批处理机的混合流水车间在线调度方法

An Online Scheduling Method for Hybrid Flow Shops with Batch Machines Based on Proximal Policy Optimization Algorithm

柳再为 1王明伟 2袁媛 2刘齐浩 1李新宇1

作者信息

  • 1. 华中科技大学 机械工程与科学学院,湖北 武汉 430074
  • 2. 北京遥感设备研究所,北京 100071
  • 折叠

摘要

Abstract

Batch processing machines enable continuous overlapping operations,which is important for shortening production cycle time,reducing unnecessary waiting time,and increasing productivity.However,when faced with dynamic shop-floor events,the selection of workpiece types for batch processing machines may lead to unavoidable variations in the completion time of each workpiece.To this end,our study focuses on adaptively selecting appropriate workpiece types for batch processing machines based on real-time shop floor production and machining characteristics to minimize the total delay cost of all workpieces.A hybrid flow shop scheduling problem with batch processing machines is studied and modeled as a Markov decision process.Multiple real-time features of workpiece resource are designed,which integrate job processing information with workshop resource information.Furthermore,job selection rules and batch processing selection rules for batch processing machines are formulated.An intelligent agent decides the workpieces to be processed by the machine and the type of workpieces to be batch processed based on real-time characteristics of decision points through the integrated scheduling rules,while a reward function based on total delay cost of workpieces is formulated to guide the decisions of the agent.The network of the agent is trained through the proximal policy optimization algorithm.Numerical experiments are conducted on a large number of instances with different production configurations.Results demonstrate the superiority and generalizability of the proposed algorithm compared to heuristic methods.

关键词

混合流水车间调度/近端策略优化算法/批处理机/马尔科夫决策

Key words

hybrid flow shop scheduling/proximal policy optimization algorithm/batch machines/Markov decision process

分类

管理科学

引用本文复制引用

柳再为,王明伟,袁媛,刘齐浩,李新宇..基于近端策略优化算法的带批处理机的混合流水车间在线调度方法[J].工业工程,2025,28(2):78-90,13.

工业工程

1007-7375

访问量3
|
下载量0
段落导航相关论文