面向煤矿巡检机器人的高能效路径规划方法OA北大核心CSTPCD
Energy efficient path planning method for coal mine patrol robot
针对现有矿用机器人路径路规划方法存在的效率低、收敛速度慢、易陷入局部最优等不足,提出了一种基于Actor-Critic算法的路径规划方法.首先根据巡检目标和障碍物的实时位置信息,计算巡检机器人的转向角,确定行进方向,可显著提高路径规划的效率;以能量消耗最小化和避免碰撞为目标,巡检机器人根据动态随机变化的矿山环境,学习巡检的目标顺序和行进速度;因为矿山环境动态连续变化,导致较高的状态维度,因此采用深度学习网络估计连续状态产生的动作和奖赏;为了提高学习效率,采用策略网络和价值网络 2 个网络,实现实时更新策略和价值.仿真结果表明:采用所提方法,巡检机器人可以在动态环境中规划出安全合理的巡检路线,能够以 98%的成功概率和更低的能量消耗完成巡检作业.
In order to solve the shortcomings of the existing mining robot path planning methods,such as low efficiency,slow con-vergence speed,and easy to fall into local optimum,a path planning method based on Actor-Critic algorithm is proposed.Firstly,ac-cording to the real-time position information of the inspection target and the obstacles,the steering angle of the patrol robot is calcu-lated and the forward direction is determined,which can significantly improve the efficiency of path planning.With the goal of min-imizing energy consumption and avoiding collisions,the patrol robot learns the target inspection sequence and forward speed accord-ing to the dynamically changing mining environment.Because the dynamic and continuous changes of the mine environment lead to a high state dimension,the action and reward generated by the continuous state are estimated by the deep learning networks.In order to improve the efficiency of learning,two networks are adopted,namely the Actor network and the Critic network,to achieve real-time update of strategy and value.The simulation results show that the proposed method can design a safe and reasonable patrol route in a dynamic environment,and can complete the patrol task with a 98%success probability and lower energy consumption.
陈骋;苏成杰
中煤科工集团沈阳研究院有限公司,辽宁 抚顺 113122||煤矿安全技术国家重点实验室,辽宁 抚顺 113122
矿山工程
巡检机器人路径规划深度强化学习避障能量消耗
patrol robotpath planningdeep reinforcement learningcollision avoidanceenergy consumption
《煤矿安全》 2024 (006)
211-216 / 6
中煤科工集团沈阳研究院有限公司产品升级改造资助项目(CSJ-2022-009)
评论