| 注册
首页|期刊导航|火力与指挥控制|基于深度强化学习的无人机实时航迹规划

基于深度强化学习的无人机实时航迹规划

舒健生 周于翔 郑晓龙 赖晓昌 陶大甜

火力与指挥控制2023,Vol.48Issue(12):133-141,9.
火力与指挥控制2023,Vol.48Issue(12):133-141,9.DOI:10.3969/j.issn.1002-0640.2023.12.020

基于深度强化学习的无人机实时航迹规划

Deep Reinforcement Learning-based UAV Real-time Trajectory Planning

舒健生 1周于翔 1郑晓龙 1赖晓昌 1陶大甜2

作者信息

  • 1. 火箭军工程大学,西安 710025
  • 2. 武汉理工大学信息工程学院,武汉 430070
  • 折叠

摘要

Abstract

With the application and development of UAV technology,the flight environment of UAV is becoming more and more complex and changeable,the higher requirements for the obstacle avoidance ability and real-time path planning of UAVs are proposed.Based on the deep reinforcement learning algorithm with good generalization and weak dependence on the environment,the real-time path planning is carried out based on the obstacle map information obtained by radar in real time.The continuous reward function is designed according to the characteristics of the two-dimensional path planning problem,the problem of sparse reward in the two-dimensional plane path planning of the reinforcement learning algorithm is solved.Based on the idea of transfer learning,multiple training environments are designed and trained step by step according to the difficulty of the task,which reduces the training difficulty of the algorithm,improves the training effects and makes the convergence effects of the algorithm more stable.Finally,the SAC algorithm is compared with the current mainstream PPO and TD3 algorithms in the experiment.The experimental results show that the SAC algorithm has fast convergence speed,good real-time performance and better track smoothness.

关键词

无人机/SAC算法/二维平面规划/实时航迹规划

Key words

UAV/SAC algorithm/2D plane planning/real-time route planning

分类

信息技术与安全科学

引用本文复制引用

舒健生,周于翔,郑晓龙,赖晓昌,陶大甜..基于深度强化学习的无人机实时航迹规划[J].火力与指挥控制,2023,48(12):133-141,9.

火力与指挥控制

OA北大核心CSCDCSTPCD

1002-0640

访问量2
|
下载量0
段落导航相关论文