
Research advances in explainable visual question answering (可解释的视觉问答研究进展)

张一飞 孟春运 蒋洲 栾力 Ernest Domanaanmwi Ganaa

计算机应用研究 (Application Research of Computers), 2024, Vol. 41, Issue 1: 10-20, 11. DOI: 10.19734/j.issn.1001-3695.2023.05.0181

Research advances in explainable visual question answering

张一飞¹, 孟春运¹, 蒋洲², 栾力³, Ernest Domanaanmwi Ganaa⁴

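Author information

  • 1. School of Economics and Management, Jiangsu University of Science and Technology, Zhenjiang, Jiangsu 212100
  • 2. School of Computer Science and Communication Engineering, Jiangsu University, Zhenjiang, Jiangsu 212013
  • 3. School of Public Affairs, University of Science and Technology of China, Hefei 230026
  • 4. School of Applied Science and Technology, Hilla Limann Technical University, Wa, Ghana 00233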

Abstract

In visual question answering (VQA) tasks, "explainability" refers to the various ways in which researchers can explain why a model works on a given task. Because some existing VQA models lack explainability, there is no assurance that they can be used safely in real-life applications, especially in fields such as autonomous driving and healthcare. This raises ethical and moral issues that hinder their adoption in industry. This paper introduces various approaches to enhancing explainability in VQA tasks and groups them into five main categories: image interpretation, text interpretation, multi-modal interpretation, modular interpretation, and graph interpretation. It discusses the characteristics of each approach and presents further subdivisions for some of them. It also presents several VQA datasets designed to enhance explainability, which primarily do so by incorporating external knowledge bases and annotating image information. In summary, this paper provides an overview of commonly used interpretable methods for VQA tasks and proposes future research directions based on the identified shortcomings of current approaches.

Key words

visual question answering/visual reasoning/explainability/artificial intelligence/natural language processing/computer vision

Classification

Information technology and security science

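Cite this article

张一飞, 孟春运, 蒋洲, 栾力, Ernest Domanaanmwi Ganaa. Research advances in explainable visual question answering [J]. 计算机应用研究 (Application Research of Computers), 2024, 41(1): 10-20, 11.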

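Funding

National Social Science Fund of China, Key Project (16AJL008)

Jiangsu Provincial Social Science Fund, Youth Project (22EYC001)

General Project of Philosophy and Social Science Research in Jiangsu Universities (2019SJA1927)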

计算机应用研究 (Application Research of Computers)

Open access; indexed in 北大核心 (PKU Core) and CSTPCD

ISSN 1001-3695
