Experimental Technology and Management, 2025, Vol. 42, Issue 10: 48-53, 6. DOI: 10.16791/j.cnki.sjg.2025.10.006
Comprehensive experimental design for industrial robot intelligent tasks based on large language models
Abstract
[Objective] With the increasing integration of large language models (LLMs) and robotics, industrial robots are playing pivotal roles in smart manufacturing, particularly in responding to growing demands for flexibility and customization in modern manufacturing. This transition toward intelligent industrial robots represents an inevitable trend in industrial development. This paper explores the combination of LLM applications, machine vision, and industrial robot programming, and proposes a new experimental platform design to facilitate the intelligent operation of industrial robots in various scenarios. The study focuses on developing a comprehensive experimental approach that enhances the understanding of industrial robot intelligence and provides new insights for talent cultivation in the rapidly evolving field of smart manufacturing.

[Methods] The study designs an innovative "virtual-real integration" experimental platform for industrial robots that integrates hardware and software components. The platform comprises a computer, the Yuejie E6 collaborative robotic arm, an Intel D435i depth camera, and a simulated work environment created using the Gazebo platform. The experimental system is divided into three core modules: decision-making, perception, and execution. The decision-making module employs an LLM to process voice commands and plan tasks. The perception module uses machine vision for object recognition and precise positioning. The execution module controls the modular motion units to ensure reliable execution of assigned tasks. Voice recognition and task decision-making experiments were conducted to evaluate the effectiveness of the proposed task planning model in detail. Data processing for these modules was conducted using Python, with experimental environments set up under the Windows and Ubuntu operating systems.

[Results] Experimental validation showed the effectiveness of the platform, yielding the following notable results. First, regarding voice recognition, the Whisper-1 and Qianfan models achieved recognition accuracy of over 95%, with Qianfan delivering faster response times. Second, regarding LLM task planning, the hierarchical prompt system was highly effective in parsing complex instructions. The LLMs generated valid high-level action sequences and handled ambiguous commands by returning empty action sets. Third, regarding visual perception, hand-eye calibration achieved sufficient accuracy. Notably, traditional image processing provided stable and accurate target localization (mean error < 7 mm), making it suitable for coordinate transformation. Meanwhile, vision foundation models showed better semantic understanding but exhibited larger and less stable localization errors, making them unsuitable for precise positioning. Fourth, regarding integrated performance, in 10 bin-picking trials, voice commands were recognized correctly (>95%), LLMs generated accurate action sequences, and the vision module accurately located targets. The end-effector positioning error, attributed to calibration residuals, was consistently below 6 mm and was deemed acceptable for the task.

[Conclusions] This study successfully designed and implemented a comprehensive "virtual-real integration" experimental platform for intelligent industrial robot operations that integrates LLM-based decision-making, machine vision, and modular execution. The hierarchical architecture and innovative prompt engineering strategy provided a robust approach to translating natural language into reliable robot actions. Traditional image processing outperformed vision foundation models in precise localization tasks. The proposed platform provides a practical and accessible tool for students to understand the integration of LLMs, vision, and robotics in intelligent industrial operations, and offers a valuable resource for cultivating talent in the emerging field of smart manufacturing. The core design principles, particularly the layered architecture and prompt engineering, present a transferable framework for real-world "AI+" industrial applications.
Keywords
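large language model / industrial robots / prompt engineering / comprehensive experimental design
Classification
Information Technology and Security Science
Cite this article
Su Yu, Xue Bin, Huang Mingshi. Comprehensive experimental design for industrial robot intelligent tasks based on large language models [J]. Experimental Technology and Management, 2025, 42(10): 48-53, 6.
Funding
Guangxi Higher Education Teaching Reform Project (2024JGB254)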