首页|期刊导航|电子科技|基于IL-TD3的轮式机器人无地图导航研究

基于IL-TD3的轮式机器人无地图导航研究

秦源赛牟海明刘元基李清都

电子科技2026，Vol.39Issue(1)：57-63,7.

电子科技2026，Vol.39Issue(1)：57-63,7.DOI:10.16180/j.cnki.issn1007-7820.2026.01.008

基于IL-TD3的轮式机器人无地图导航研究

Research on Mapless Navigation of Wheeled Robot Based on IL-TD3

秦源赛 ¹牟海明 ²刘元基 ²李清都²

作者信息

1. 上海理工大学机器智能研究院,上海 200093||上海理工大学健康科学与工程学院,上海 200093
2. 上海理工大学机器智能研究院,上海 200093
折叠

摘要

Abstract

In view of the dependence of the wheeled robot navigation method on high-precision maps and its a-daptability issues in dynamic and complex scenarios,this study proposes a mapless navigation method based on IL-TD3(Imitation Learning Enhanced Twin Delayed Deep Deterministic Policy Gradient).The navigation task is modeled as a POMDP(Partially Observable Markov Decision Process),and a LSTM(Long Short-Term Memory)network is combined to process historical information for improving the environmental state modeling.The robot quickly acquires limited mapless navigation capabilities through imitation learning,and continuously explores and trains the TD3 deep reinforcement learning network to enhance its navigation skills.The simulation experiment results show that the naviga-tion trajectory of IL-TD3 in unknown dynamic environments is stable,continuous,and safe,demonstrating good navi-gation performance.The Sim2Real(Simulation to Reality)test results indicate that the unadjusted IL-TD3 model per-forms well in real-world navigation tasks,which proves the robustness and generalization ability of the proposed model.

关键词

机器人/无地图导航/POMDP/LSTM/模仿学习/深度强化学习/未知动态环境/Sim2Real

Key words

robot/mapless navigation/POMDP/LSTM/imitation learning/deep reinforcement learning/unknown dyna-mic environments/Sim2Real

分类

信息技术与安全科学

引用本文复制引用

秦源赛,牟海明,刘元基,李清都..基于IL-TD3的轮式机器人无地图导航研究[J].电子科技,2026,39(1):57-63,7.

基金项目

东方学者计划(TP2019064)Oriental Scholars Program(TP2019064) （TP2019064）

电子科技

ISSN：1007-7820

访问量0

下载量0

段落导航