Software Guide (软件导刊), 2025, Vol. 24, Issue 8: 1-10, 10. DOI: 10.11907/rjdk.251140
GUI-Agent: Research, Application and Prospect
Abstract
GUI-Agent, as a new paradigm of human-computer interaction, plays an increasingly important role in daily life. Drawing on the powerful interaction capabilities of multimodal large models, it can simulate the way human users interact with digital systems, performing clicks, swipes, text input, and more complex tasks. Starting from the development history of GUI-Agent, this paper analyzes in depth its architectural foundations, optimization strategies, test benchmarks, and evaluation metrics, and collects practical application cases from common application scenarios. It aims to give researchers a finer-grained review that helps them quickly and intuitively grasp current research progress on GUI-Agent, summarizes the problems and challenges it faces, and discusses future development directions to promote further application and development in related fields.
Keywords
GUI-Agent / large model / human-computer interaction / intelligent decision-making
Classification
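Information Technology and Security Science
Cite this article
塔杰, 王亮, 刘进, 黄勃, 王一鹤. GUI-Agent: Research, Application and Prospect [J]. Software Guide (软件导刊), 2025, 24(8): 1-10, 10.
Funding
Natural Science Foundation of the Tibet Autonomous Region (XZ202401ZR0119)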