Image Classification Algorithm Based on Hierarchical Feature Fusion Transformer

DUAN Shixi¹, WANG Bo¹

Electronic Science and Technology (电子科技), 2026, Vol. 39, Issue 2: pp. 72-78 (7 pages). DOI: 10.16180/j.cnki.issn1007-7820.2026.02.009

Author information

  • 1. School of Science, Shenyang University of Technology, Shenyang 110870, China

Abstract

To address the difficulty that the traditional Vision Transformer (ViT) model has with multi-level image classification, this study proposes HICViT (Hierarchical Feature Fusion Vision Transformer), a ViT-based model for image classification. The input data is processed by the ViT extraction module to generate multiple feature maps at different levels, each containing abstract feature representations for its level. According to the hierarchical labels, the features extracted by ViT are mapped into features at different levels, and an HIC method is used to fuse these features, thereby improving the classification performance of the model. The proposed model is compared with a variety of advanced deep learning models on three datasets: CIFAR-10, CIFAR-100, and CUB-200-2011. On the CIFAR-10 dataset, the classification accuracies of the proposed method at the first, second, and third levels are 99.70%, 98.80%, and 97.80%, respectively. On the CIFAR-100 dataset, the accuracies at the first, second, and third levels are 95.23%, 93.54%, and 90.12%, respectively. On the CUB-200-2011 dataset, the accuracies at the first and second levels are 98.09% and 93.66%, respectively. The results indicate that the proposed model achieves higher classification accuracy than the other models in the comparison.
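The per-level accuracies reported in the abstract presuppose that every image carries one label per hierarchy level and that the model emits one prediction per level. The following is a minimal sketch of how such figures could be computed. It assumes a hypothetical three-level grouping of a few CIFAR-10 classes, since the abstract does not give the paper's actual grouping. The names `HIERARCHY`, `expand`, and `per_level_accuracy` are illustrative and are not the authors' code.

```python
# Hypothetical hierarchy: fine class -> (level-1 label, level-2 label).
# This grouping is an assumption for illustration only.
HIERARCHY = {
    "airplane": ("transport", "air"),
    "ship": ("transport", "water"),
    "automobile": ("transport", "road"),
    "truck": ("transport", "road"),
    "cat": ("animal", "pet"),
    "dog": ("animal", "pet"),
    "deer": ("animal", "hoofed"),
    "horse": ("animal", "hoofed"),
}


def expand(fine_label):
    """Return the (level 1, level 2, level 3) labels for a fine class name."""
    level1, level2 = HIERARCHY[fine_label]
    return (level1, level2, fine_label)


def per_level_accuracy(true_fine, predicted):
    """Compute the accuracy at each of the three levels.

    true_fine: list of ground-truth fine class names.
    predicted: list of (level 1, level 2, level 3) predictions, one per sample.
    """
    if len(true_fine) != len(predicted):
        raise ValueError("true_fine and predicted must have the same length")
    if not true_fine:
        raise ValueError("at least one sample is required")
    correct = [0, 0, 0]
    for fine, pred in zip(true_fine, predicted):
        target = expand(fine)
        for level in range(3):
            if pred[level] == target[level]:
                correct[level] += 1
    return [count / len(true_fine) for count in correct]


true_fine = ["cat", "truck", "deer", "ship"]
predicted = [
    ("animal", "pet", "dog"),          # levels 1 and 2 correct, level 3 wrong
    ("transport", "road", "truck"),    # all levels correct
    ("animal", "pet", "cat"),          # only level 1 correct
    ("transport", "water", "ship"),    # all levels correct
]
accuracies = per_level_accuracy(true_fine, predicted)
print(accuracies)  # [1.0, 0.75, 0.5]
```

In this toy example accuracy falls from the coarsest level to the finest, the same ordering as the figures reported in the abstract, because an error at a coarse level generally implies an error at every finer level beneath it.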

Key words

deep learning/convolutional neural networks/Transformer/image classification/hierarchical characteristics/feature fusion/multi-head attention/Vision Transformer

Classification

Information Technology and Security Science

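Cite this article

DUAN Shixi, WANG Bo. Image Classification Algorithm Based on Hierarchical Feature Fusion Transformer[J]. Electronic Science and Technology (电子科技), 2026, 39(2): 72-78.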

Funding

National Natural Science Foundation of China (62103289)

Electronic Science and Technology (电子科技), ISSN 1007-7820
