| 注册
首页|期刊导航|大数据|基于多模态大模型的具身智能体研究进展与展望

基于多模态大模型的具身智能体研究进展与展望

赵博涛 亢祖衡 瞿晓阳 彭俊清 张旭龙 王健宗

大数据2025,Vol.11Issue(3):108-138,31.
大数据2025,Vol.11Issue(3):108-138,31.DOI:10.11959/j.issn.2096-0271.2025035

基于多模态大模型的具身智能体研究进展与展望

Review and emerging trends of embodied agent based on multimodal large language models

赵博涛 1亢祖衡 1瞿晓阳 1彭俊清 1张旭龙 1王健宗1

作者信息

  • 1. 平安科技(深圳)有限公司,广东 深圳 518063
  • 折叠

摘要

Abstract

Embodied agents refer to intelligent entities capable of completing one or multiple tasks based on instructions and possessing the ability to interact with the physical environment.These agents have immense potential applications across various fields,such as service robotics,intelligent education,and assistive healthcare,and represent a crucial pathway toward realizing general-purpose robots.With the advancement of multimodal large language models,embodied agents possess enhanced abilities in natural language understanding,reasoning,and environmental perception,significantly accelerating progress in this domain.Although many outstanding works have emerged in recent years,the field still lacks comprehensive surveys and targeted evaluations.To help researchers quickly and thoroughly know the developments in this area,in-depth review and analysis were conducted.Multimodal large language models were introducted,followed by datasets and a review of the physical carriers used for constructing embodied intelligent agents.Then,three key research directions are analyzed,including embodied large models,high-level task planning,and low-level action control.Finally,the challenges and limitations of embodied agents were summarized and potential future directions were explored.This review serves as a foundational reference for the research community and fosters further development and innovation in the field.

关键词

具身智能体/多模态大模型/机器人/视觉语言模型/具身智能

Key words

embodied agent/multimodal large language model/robot/vision-language model/embodied intelligence

分类

计算机与自动化

引用本文复制引用

赵博涛,亢祖衡,瞿晓阳,彭俊清,张旭龙,王健宗..基于多模态大模型的具身智能体研究进展与展望[J].大数据,2025,11(3):108-138,31.

基金项目

广东省重点领域研发计划"新一代人工智能"重大专项(No.2021B0101400003) Guangdong Province Key Field R&D Program"New Generation Artificial Intelligence"Major Special Project(No.2021B0101400003) (No.2021B0101400003)

大数据

2096-0271

访问量0
|
下载量0
段落导航相关论文