首页|期刊导航|机电工程技术|基于冲突搜索增强深度强化学习的多AGV路径规划方法

基于冲突搜索增强深度强化学习的多AGV路径规划方法

王亦晨刘雪梅

机电工程技术2024，Vol.53Issue(8)：23-27,88,6.

机电工程技术2024，Vol.53Issue(8)：23-27,88,6.DOI:10.3969/j.issn.1009-9492.2024.00089

基于冲突搜索增强深度强化学习的多AGV路径规划方法

Conflict-based Search Enhanced Deep Reinforcement Learning Approach for Multi-AGV Path Planning

王亦晨 ¹刘雪梅¹

作者信息

1. 同济大学机械与能源工程学院,上海 201804
折叠

摘要

Abstract

To address the path planning problem for automated guided vehicles(AGVs)with the objective of minimizing total travel time,an approach that enhances distributed independent Q-learning(IQL)with the conflict-based search(CBS)algorithm is proposed.Initially,a grid map method is employed to construct the environmental map,and the multi-AGV path planning problem is mathematically described,including assumptions about types of collisions between AGVs.Subsequently,the problem is transformed into a partially observable Markov decision process(POMDP),with detailed definitions of the observation space,action space,and reward function.Further,by utilizing an asynchronous priority experience replay architecture,the IQL method is extended to a distributed environment,and guided by the CBS algorithm to refine the Q-network,enhancing decision-making processes for AGVs in congested environments.Finally,comparative experiments with other deep reinforcement learning algorithms are designed based on varying numbers of AGVs.The results demonstrate that the proposed method outperforms the control algorithms in key performance indicators such as success rate and average step length,thereby validating the effectiveness and feasibility of the proposed approach.

关键词

基于冲突搜索/深度强化学习/多AGV/路径规划

Key words

conflict-based search/deep reinforcement learning/multi-AGV/path planning

分类

信息技术与安全科学

引用本文复制引用

王亦晨,刘雪梅..基于冲突搜索增强深度强化学习的多AGV路径规划方法[J].机电工程技术,2024,53(8):23-27,88,6.

基金项目

西门子(中国)有限公司合作项目(kz0100020210591) （中国）

机电工程技术

ISSN：1009-9492

访问量0

下载量0

段落导航