Robot, 2026, Vol. 48, Issue (1): 38-47, 54, 11. DOI: 10.13973/j.cnki.robot.240269
Imitation Learning in Nursing Robots Based on Oriented Diversity
Abstract
Traditional robot imitation learning methods generally suffer from poor imitation success rates and a heavy reliance on the quantity of expert samples, which makes them unsuitable for highly time-varying and unstructured nursing scenarios. To solve these problems, TGOD-SD, an imitation learning method based on oriented diversity, is proposed for nursing robots. Firstly, a TGOD (trajectory generation with oriented diversity) paradigm is constructed, which can be implemented in reinforcement learning based imitation learning approaches. TGOD guides the agent to generate diverse imitation trajectories around the trajectory from expert demonstrations without constructing reward functions. Next, a trajectory matching method based on Sinkhorn distance (SD) is proposed, which helps the agent search for the best matching trajectory as the output of imitation learning. Finally, a sim-to-real transfer method based on joint angles is constructed to implement the imitated trajectory on the real nursing robot. A large number of imitation learning experiments on the nursing robot show that the proposed TGOD-SD method effectively improves the success rate of robot imitation learning, achieving an average improvement of 64.6% compared to the state-of-the-art (SOTA) methods. The quality of successfully imitated trajectories is also improved, with an average increase of 32.61% in the correlation coefficient with expert demonstration trajectories. Additionally, the expected time of successful imitation is reduced to 62.5% at least compared with SOTA methods. Most importantly, TGOD-SD accomplishes robot imitation learning from a single expert demonstration sample, which reduces the dependence on the quantity of expert demonstration samples and effectively improves the practicality of robot imitation learning.
Keywords
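nursing robot / robot imitation learning / reinforcement learning
Cite this article
XIE Jiexin, CHEN Minxuan, WU Xiru, LI Yang, GUO Shijie. Imitation Learning in Nursing Robots Based on Oriented Diversity[J]. Robot, 2026, 48(1): 38-47, 54, 11.
Funding
National Natural Science Foundation of China (62303154)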
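Illustrative sketch
The abstract describes selecting, among diverse generated trajectories, the one that best matches the expert demonstration under the Sinkhorn distance. The following is a minimal sketch of that idea, not the authors' implementation: the abstract does not specify the cost function, the regularization strength, or the trajectory representation, so the squared Euclidean cost, uniform point weights, the parameters `eps` and `n_iters`, and the function names `sinkhorn_distance` and `best_match` are all assumptions made for illustration.

```python
import numpy as np


def sinkhorn_distance(x, y, eps=0.1, n_iters=200):
    """Entropy-regularized optimal transport cost between two trajectories.

    x: (n, d) array, y: (m, d) array. Uniform weights on the points and a
    squared Euclidean ground cost are assumptions, not taken from the paper.
    """
    n, m = len(x), len(y)
    a = np.full(n, 1.0 / n)  # uniform mass on the points of x
    b = np.full(m, 1.0 / m)  # uniform mass on the points of y

    # Pairwise squared Euclidean cost matrix of shape (n, m).
    cost = ((x[:, None, :] - y[None, :, :]) ** 2).sum(axis=-1)

    # Rescale the cost inside the kernel so exp() does not underflow.
    scale = cost.max() if cost.max() > 0 else 1.0
    kernel = np.exp(-cost / (eps * scale))

    # Sinkhorn iterations: alternately rescale to match both marginals.
    u = np.ones(n)
    for _ in range(n_iters):
        v = b / (kernel.T @ u)
        u = a / (kernel @ v)

    plan = u[:, None] * kernel * v[None, :]  # approximate transport plan
    return float((plan * cost).sum())


def best_match(expert, candidates, **kwargs):
    """Return the index of the candidate closest to the expert, plus all distances."""
    dists = [sinkhorn_distance(expert, c, **kwargs) for c in candidates]
    return int(np.argmin(dists)), dists


# Hypothetical data: a 2-D expert trajectory and two generated candidates.
t = np.linspace(0.0, 1.0, 20)
expert = np.stack([t, t ** 2], axis=1)
candidates = [expert + 0.01, expert + 2.0]  # a near copy and a far copy
best, dists = best_match(expert, candidates)
```

Because of the entropic regularization, the distance between two identical trajectories is small but not exactly zero, so the sketch is only meaningful for ranking candidates against each other, which is how the abstract describes the matching step being used.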