重庆科技学院学报(自然科学版)2024,Vol.26Issue(3):42-48,7.DOI:10.19406/j.issn.1673-1980.2024.03.007
基于改进Sarsa算法的拖轮动态调度方法
Tugboat Dynamic Scheduling Method Based on Improved Sarsa Algorithm
摘要
Abstract
Aiming at the shortcomings of the traditional Sarsa algorithm,the optimization of tugboat dynamic schedu-ling method is studied.Based on the reinforcement learning framework and the state and environment information of tugboats,the state-action function is established to search the optimal strategy of tugboats scheduling decision.The update method of Q function in Sarsa algorithm is changed to overcome the problem of slow convergence.At the same time,according to the learning rate and action selection mode,the exploration strategy and utilization strategy are balanced to improve the convergence speed and performance of the algorithm.The simulation results show that the algorithm can effectively shorten the waiting time and improve the utilization efficiency of tugboat resources.关键词
Sarsa算法/拖轮/自适应调度/强化学习/算法策略Key words
Sarsa algorithm/tugboats/adaptive scheduling/reinforcement learning/algorithm strategy分类
信息技术与安全科学引用本文复制引用
李佳琛,段兴锋..基于改进Sarsa算法的拖轮动态调度方法[J].重庆科技学院学报(自然科学版),2024,26(3):42-48,7.基金项目
福建省自然科学基金项目"海上智能调度"(2019J01325) (2019J01325)