| 注册
首页|期刊导航|数据与计算发展前沿|多模态交互:从人机协同迈向人智协同

多模态交互:从人机协同迈向人智协同

王镇远 田东 董禹 乔娜 单桂华

数据与计算发展前沿2025,Vol.7Issue(3):81-93,13.
数据与计算发展前沿2025,Vol.7Issue(3):81-93,13.DOI:10.11871/jfdc.issn.2096-742X.2025.03.007

多模态交互:从人机协同迈向人智协同

Multimodal Interaction:From Human-Computer Collaboration to Human-Intelligence Collaboration

王镇远 1田东 1董禹 1乔娜 2单桂华1

作者信息

  • 1. 中国科学院计算机网络信息中心,北京 100083||中国科学院大学,北京 100049
  • 2. 中国人与生物圈国家委员会秘书处,北京 100864
  • 折叠

摘要

Abstract

[Objective]This paper explores the paradigm shift of multimodal interaction technology from"human-computer collaboration"to"human-intelligence collaboration".[Coverage]The refer-ences include 70 relevant works from recent domestic and international journals or conferences on multimodal human-intelligent interaction.[Methods]First,it introduces the development of multimodal interaction technology from traditional methods(speech,gestures,eye movements)to the human-intelligence interaction paradigm integrated with large models(LLMs,VLMs).Secondly,it focuses on the cutting-edge methods of interaction context awareness and user intention understanding,and presents cas-es of multimodal human-intelligence interaction technology in healthcare,education,creation,and daily life.[Re-sults]Multimodal human-intelligence interaction has been explored in various vertical and general fields,but still faces core technical challenges such as lack of compensation mechanisms and appropriate handling of erroneous or ambiguous intents.[Limitations]Due to the scope of available literature,only typical types of interaction mo-dalities are listed,and the coverage of application scenarios is limited.[Conclusions]Multimodal interaction is progressing towards greater intelligence.Future research should focus on optimizing compensation mechanisms,improving intent understanding accuracy,enhancing dynamic balancing mechanisms,and offering embodied de-sign to better support diverse intent tasks and facilitate scalable applications.

关键词

多模态交互/智能体/智能交互/扩展现实

Key words

multimodal interaction/agent/intelligent interaction/extended reality

引用本文复制引用

王镇远,田东,董禹,乔娜,单桂华..多模态交互:从人机协同迈向人智协同[J].数据与计算发展前沿,2025,7(3):81-93,13.

基金项目

中华人民共和国人与生物圈国家委员会项目"候鸟迁徙可视分析关键技术研究"(MAB-CN-2023-HNQX) (MAB-CN-2023-HNQX)

中国科学院计算机网络信息中心青年基金"面向科学数据分析的多模态空间智能交互"(25YF08) (25YF08)

数据与计算发展前沿

2096-742X

访问量5
|
下载量0
段落导航相关论文