Big Data Research (大数据), 2025, Vol. 11, Issue (3): 108-138, 31. DOI: 10.11959/j.issn.2096-0271.2025035
基于多模态大模型的具身智能体研究进展与展望
Review and emerging trends of embodied agent based on multimodal large language models
Abstract
Embodied agents are intelligent entities that can complete one or more tasks based on instructions and interact with the physical environment. These agents have broad potential applications in fields such as service robotics, intelligent education, and assistive healthcare, and represent a crucial pathway toward general-purpose robots. With the advancement of multimodal large language models, embodied agents have gained stronger natural language understanding, reasoning, and environmental perception, which has significantly accelerated progress in this domain. Although many outstanding works have emerged in recent years, the field still lacks comprehensive surveys and targeted evaluations. To help researchers quickly and thoroughly understand developments in this area, an in-depth review and analysis is conducted. Multimodal large language models are introduced first, followed by datasets and the physical carriers used for constructing embodied agents. Then, three key research directions are analyzed: embodied large models, high-level task planning, and low-level action control. Finally, the challenges and limitations of embodied agents are summarized and potential future directions are explored. This review is intended to serve as a foundational reference for the research community and to foster further development and innovation in the field.
Key words
embodied agent / multimodal large language model / robot / vision-language model / embodied intelligence
Classification
Computer and Automation
Cite this article
Zhao Botao, Kang Zuheng, Qu Xiaoyang, Peng Junqing, Zhang Xulong, Wang Jianzong. Review and emerging trends of embodied agent based on multimodal large language models [J]. Big Data Research, 2025, 11(3): 108-138, 31.
Funding
Guangdong Province Key Field R&D Program "New Generation Artificial Intelligence" Major Special Project (No. 2021B0101400003)