Imitation Learning in Nursing Robots Based on Oriented Diversity

Robot, 2026, Vol. 48, Issue (1): 38-47, 54, 11. DOI: 10.13973/j.cnki.robot.240269

Xie Jiexin¹, Chen Minxuan¹, Wu Xiru¹, Li Yang², Guo Shijie²

Author information

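1. School of Electronic Engineering and Automation, Guilin University of Electronic Technology, Guilin 541004, Guangxi, China
2. School of Mechanical Engineering, Hebei University of Technology, Tianjin 300131, China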

Abstract

Traditional robot imitation learning methods generally suffer from low imitation success rates and heavy reliance on the quantity of expert samples, which makes them unsuitable for highly time-varying and unstructured nursing scenarios. To solve these problems, TGOD-SD, an imitation learning method based on oriented diversity, is proposed for nursing robots. Firstly, a TGOD (trajectory generation with oriented diversity) paradigm is constructed, which can be implemented in reinforcement-learning-based imitation learning approaches. TGOD guides the agent to generate diverse imitation trajectories around the expert demonstration trajectory without constructing reward functions. Next, a trajectory matching method based on the Sinkhorn distance (SD) is proposed, which helps the agent search for the best-matching trajectory as the output of imitation learning. Finally, a sim-to-real transfer method based on joint angles is constructed to execute the imitated trajectory on the real nursing robot. Extensive imitation learning experiments on the nursing robot show that the proposed TGOD-SD method effectively improves the success rate of robot imitation learning, achieving an average improvement of 64.6% over state-of-the-art (SOTA) methods. The quality of successfully imitated trajectories also improves, with an average increase of 32.61% in the correlation coefficient with expert demonstration trajectories. Additionally, the expected time to a successful imitation is reduced to 62.5% or less of that of SOTA methods. Most importantly, TGOD-SD accomplishes robot imitation learning from a single expert demonstration sample, which reduces the dependence on the quantity of expert demonstration samples and effectively improves the practicality of robot imitation learning.
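The abstract does not give the formulation of the Sinkhorn-distance matching step, so the following is only a minimal sketch of the general idea: score each generated trajectory against the single expert trajectory with an entropy-regularised optimal transport cost and keep the closest one. The function names `sinkhorn_distance` and `best_match`, the squared Euclidean ground cost, the uniform weights over trajectory points, the cost normalisation, and the values of `epsilon` and `n_iters` are all assumptions made for illustration, not details from the paper.

```python
import math


def sinkhorn_distance(traj_a, traj_b, epsilon=0.1, n_iters=200):
    """Entropy-regularised transport cost between two point sequences.

    Assumptions (not from the paper): squared Euclidean ground cost,
    uniform weights over points, cost matrix scaled by its maximum so
    that exp(-cost / epsilon) stays well away from underflow.
    """
    n, m = len(traj_a), len(traj_b)
    cost = [[sum((x - y) ** 2 for x, y in zip(p, q)) for q in traj_b]
            for p in traj_a]
    max_cost = max(max(row) for row in cost)
    if max_cost == 0.0:
        return 0.0  # all points coincide, nothing to transport
    # Gibbs kernel on the normalised cost matrix.
    kernel = [[math.exp(-(c / max_cost) / epsilon) for c in row]
              for row in cost]
    a = [1.0 / n] * n  # uniform weights on traj_a
    b = [1.0 / m] * m  # uniform weights on traj_b
    u = [1.0] * n
    v = [1.0] * m
    # Sinkhorn iterations: alternately rescale rows and columns so the
    # transport plan u_i * K_ij * v_j matches both marginals.
    for _ in range(n_iters):
        u = [a[i] / sum(kernel[i][j] * v[j] for j in range(m))
             for i in range(n)]
        v = [b[j] / sum(kernel[i][j] * u[i] for i in range(n))
             for j in range(m)]
    # Transport cost of the resulting plan, in the original cost units.
    return sum(u[i] * kernel[i][j] * v[j] * cost[i][j]
               for i in range(n) for j in range(m))


def best_match(expert, candidates, epsilon=0.1):
    """Return the index of the candidate trajectory closest to the expert."""
    distances = [sinkhorn_distance(expert, c, epsilon) for c in candidates]
    return min(range(len(candidates)), key=distances.__getitem__)


# Toy 2-D trajectories; real trajectories would be joint-angle sequences.
expert = [(0.0, 0.0), (0.5, 0.5), (1.0, 1.0)]
near = [(0.0, 0.1), (0.5, 0.6), (1.0, 1.1)]
far = [(0.0, 1.0), (0.5, 0.0), (1.0, -1.0)]
chosen = best_match(expert, [far, near])
```

Because of the entropic regularisation, the value returned for two identical trajectories is small but generally not exactly zero. Note also that this uniform-weight form treats each trajectory as an unordered set of points; a practical matcher would likely need to encode temporal order, for example by appending a time coordinate to each point.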

Key words

nursing robot/robot imitation learning/reinforcement learning

Cite this article

Xie Jiexin, Chen Minxuan, Wu Xiru, Li Yang, Guo Shijie. Imitation Learning in Nursing Robots Based on Oriented Diversity[J]. Robot, 2026, 48(1): 38-47, 54, 11.

Funding

National Natural Science Foundation of China (62303154).

Robot

ISSN: 1002-0446
