| 注册
首页|期刊导航|国防科技大学学报|面向长序列自主作业的非对称Actor-Critic强化学习方法

面向长序列自主作业的非对称Actor-Critic强化学习方法

任君凯 瞿宇珂 罗嘉威 倪子淇 卢惠民 叶益聪

国防科技大学学报2025,Vol.47Issue(4):111-122,12.
国防科技大学学报2025,Vol.47Issue(4):111-122,12.DOI:10.11887/j.issn.1001-2486.24120032

面向长序列自主作业的非对称Actor-Critic强化学习方法

Asymmetric Actor-Critic reinforcement learning for long-sequence autonomous manipulation

任君凯 1瞿宇珂 1罗嘉威 1倪子淇 2卢惠民 1叶益聪2

作者信息

  • 1. 国防科技大学智能科学学院,湖南长沙 410073||装备状态感知与敏捷保障全国重点实验室,湖南长沙 410073
  • 2. 国防科技大学空天科学学院,湖南长沙 410073
  • 折叠

摘要

Abstract

Long-sequence autonomous manipulation capability becomes one of the bottlenecks hindering the practical application of intelligent robots.To address the diverse long-sequence operation skill requirements faced by robots in complex scenarios,an efficient and robust asymmetric Actor-Critic reinforcement learning method was proposed.This approach aims to solve the challenges of high learning difficulty and complex reward function design in long-sequence tasks.By integrating multiple Critic networks to collaboratively train a single Actor network,and introducing GAIL(generative adversarial imitation learning)to generate intrinsic rewards for the Critic network,the learning difficulty of long-sequence tasks was reduced.On this basis,a two-stage learning method was designed,utilizing imitation learning to provide high-quality pre-trained behavior policies for reinforcement learning,which not only improves learning efficiency but also enhances the generalization performance of the policy.Simulation results for long-sequence autonomous task execution in a chemical laboratory demonstrate that the proposed method significantly improves the learning efficiency of robot long-sequence skills and the robustness of behavior policies.

关键词

自主作业机器人/强化学习/Actor-Critic/长序列操作

Key words

autonomous manipulation robot/reinforcement learning/Actor-Critic/long-sequence operation

分类

信息技术与安全科学

引用本文复制引用

任君凯,瞿宇珂,罗嘉威,倪子淇,卢惠民,叶益聪..面向长序列自主作业的非对称Actor-Critic强化学习方法[J].国防科技大学学报,2025,47(4):111-122,12.

基金项目

国家自然科学基金资助项目(62373201) (62373201)

国防科技大学自主创新科学基金资助项目(ZK2023-30,24-ZZCX-GZZ-11) (ZK2023-30,24-ZZCX-GZZ-11)

国防科技大学学报

OA北大核心

1001-2486

访问量0
|
下载量0
段落导航相关论文