| 注册
首页|期刊导航|东华大学学报(英文版)|融合RRT∗与TD3深度强化学习的无人机三维部分未知环境路径规划算法

融合RRT∗与TD3深度强化学习的无人机三维部分未知环境路径规划算法

何彦熹 齐洁 吴乃龙

东华大学学报(英文版)2025,Vol.42Issue(6):639-649,11.
东华大学学报(英文版)2025,Vol.42Issue(6):639-649,11.DOI:10.19884/j.1672-5220.202407004

融合RRT∗与TD3深度强化学习的无人机三维部分未知环境路径规划算法

A Hybrid of RRT∗ and TD3 Deep Reinforcement Learning Algorithm for UAV Path Planning in 3D Partially Unknown Environments

何彦熹 1齐洁 2吴乃龙2

作者信息

  • 1. 东华大学 信息科学与技术学院,上海 201620
  • 2. 东华大学 信息科学与技术学院,上海 201620||东华大学 数字化纺织服装技术教育部工程研究中心,上海 201620
  • 折叠

摘要

Abstract

To guide an unmanned aerial vehicle(UAV)flying in complex three-dimensional(3D)environments with unknown obstacles,a novel UAV path planning algorithm named IRRT∗-C2TD3 is proposed.The algorithm combines the rapidly-exploring random tree star(RRT∗)algorithm with the twin delayed deep deterministic policy gradients(TD3)algorithm(a deep reinforcement learning algorithm).By employing exploration strategies from reinforcement learning,IRRT∗-C2TD3 improves the RRT∗ algorithm.IRRT∗-C2TD3 is a two-stage path planning algorithm comprising pre-planning and real-time planning.It performs pre-planning of paths by generating paths based on geometric connections toward the goal and smoothing them using cubic B-spline curves.By designing the network architecture and reward function of the TD3 algorithm,real-time planning in unknown environments is achieved based on the pre-planned path from the first stage.Simulation results show that IRRT∗-C2TD3 demonstrates better path planning performance in 3D partially unknown environments than RRT∗-C2TD3,M-C2TD3 and MOD-RRT∗ algorithms.

关键词

三维路径规划/深度强化学习/快速探索随机树/无人机

Key words

3D path planning/deep reinforcement learning/rapidly-exploring random tree(RRT)/UAV

分类

信息技术与安全科学

引用本文复制引用

何彦熹,齐洁,吴乃龙..融合RRT∗与TD3深度强化学习的无人机三维部分未知环境路径规划算法[J].东华大学学报(英文版),2025,42(6):639-649,11.

基金项目

National Natural Science Foundation of China(No.62173084) (No.62173084)

Foundation of Shanghai Committee of Science and Technology,China(Nos.23ZR1401800 and 22JC1401403) (Nos.23ZR1401800 and 22JC1401403)

东华大学学报(英文版)

1672-5220

访问量0
|
下载量0
段落导航相关论文