首页|期刊导航|计算机工程与应用|视觉Transformer在低级视觉领域的研究综述

视觉Transformer在低级视觉领域的研究综述

朱凯李理张彤江晟别一鸣

计算机工程与应用2024，Vol.60Issue(4)：39-56,18.

计算机工程与应用2024，Vol.60Issue(4)：39-56,18.DOI:10.3778/j.issn.1002-8331.2304-0139

视觉Transformer在低级视觉领域的研究综述

Survey of Vision Transformer in Low-Level Computer Vision

朱凯 ¹李理 ²张彤 ³江晟 ³别一鸣⁴

作者信息

1. 长春理工大学物理学院,长春 130022||长春理工大学中山研究院光电/生物纳米检测与制造中心,广东中山 528437
2. 长春理工大学物理学院,长春 130022||长春理工大学中山研究院光电/生物纳米检测与制造中心,广东中山 528437||长春理工大学电子信息工程学院,长春 130022
3. 长春理工大学物理学院,长春 130022
4. 吉林大学交通学院,长春 130012
折叠

摘要

Abstract

Transformer is a revolutionary neural network architecture initially designed for natural language processing.However,its outstanding performance and versatility have led to widespread applications in the field of computer vision.While there is a wealth of research and literature on Transformer applications in natural language processing,there re-mains a relative scarcity of specialized reviews focusing on low-level visual tasks.In light of this,this paper begins by pro-viding a brief introduction to the principles of Transformer and analyzing several variants.Subsequently,the focus shifts to the application of Transformer in low-level visual tasks,specifically in the key areas of image restoration,image en-hancement,and image generation.Through a detailed analysis of the performance of different models in these tasks,this paper explores the variations in their effectiveness on commonly used datasets.This includes achievements in restoring damaged images,improving image quality,and generating realistic images.Finally,this paper summarizes and forecasts the development trends of Transformer in the field of low-level visual tasks.It suggests directions for future research to further drive innovation and advancement in Transformer applications.The rapid progress in this field promises break-throughs for computer vision and image processing,providing more powerful and efficient solutions for practical applica-tions.

关键词

Transformer/深度学习/注意力机制/计算机视觉/低级视觉任务

Key words

Transformer/deep learning/attention mechanism/computer vision/low-level vision task

分类

信息技术与安全科学

引用本文复制引用

朱凯,李理,张彤,江晟,别一鸣..视觉Transformer在低级视觉领域的研究综述[J].计算机工程与应用,2024,60(4):39-56,18.

基金项目

吉林省科技发展计划重点研发项目(20210203214SF). （20210203214SF）

计算机工程与应用

OA北大核心CSTPCD

ISSN：1002-8331

访问量5

下载量0

段落导航