首页|期刊导航|电子学报|M3 Res-Transformer:新冠肺炎胸部X-ray图像识别模型

M3 Res-Transformer:新冠肺炎胸部X-ray图像识别模型

周涛刘赟璨侯森宝常晓玉叶鑫宇陆惠玲

电子学报2024，Vol.52Issue(2)：589-601,13.

电子学报2024，Vol.52Issue(2)：589-601,13.DOI:10.12263/DZXB.20220999

M3 Res-Transformer:新冠肺炎胸部X-ray图像识别模型

M3 Res-Transformer:Chest X-ray Image Recognition Model of COVID-19

周涛 ¹刘赟璨 ²侯森宝 ²常晓玉 ²叶鑫宇 ²陆惠玲³

作者信息

1. 北方民族大学计算机科学与工程学院,宁夏银川 750021||北方民族大学图像图形智能处理国家民委重点实验室,宁夏银川 750021
2. 北方民族大学计算机科学与工程学院,宁夏银川 750021
3. 宁夏医科大学医学信息与工程学院,宁夏银川 750004
折叠

摘要

Abstract

COVID-19 has seriously affected human life and health since its outbreak.In recent years,residual neural network has been widely used in COVID-19 recognition task to assist doctors to quickly diagnose COVID-19 patients.However,the shape of COVID-19 image lesion regions is complex,the size is different,and the boundary with surrounding tissues is blurred,which make it difficult for the network to extract effective features.Aiming at the above problems,a M3 Res-Transformer model for COVID-19 Chest X-ray image recognition is proposed.Res-Transformer is used as the back-bone network of the model,combining ResNet and ViT to effectively integrate local lesion features and global features;A mixed residual attention module(mraM)is designed to enhance the feature expression ability of the network by considering the interdependence of channels and spatial locations;In order to increase the receptive field and extract multi-scale fea-tures,the multi-scale dilated residual module(mdrM)is constructed by superimposing dilated convolution with different di-lation rates,and three mdrM with gradually shrinking scales are used for multi-scale feature extraction according to the dif-ference of feature scales at different layers;The contextual cross-awareness module(ccaM)is proposed,which uses the se-mantic information of deep features to guide shallow features,then embeds the spatial information of shallow features into deep features,and uses the cross-weighted attention mechanism to efficiently aggregate deep and shallow features to obtain richer contextual information.In order to verify the effectiveness of the model in this paper,experiments were conducted on the Chest X-ray image dataset of COVID-19,and through comparison with advanced CNN classification models,com-parison with ResNet50 models fusing different attention mechanisms,comparison with Transformer-based classification models and ablation experiment,the results showed that the Acc,Pre,Rec,F1-Score and Spe indexes of the proposed model are 96.33%,96.36%,96.33%,96.35%and 96.26%respectively,which effectively improves the recognition accuracy in CO-VID-19 Chest X-ray image recognition task,then it is further verified by visualization method,which provides important reference value for COVID-19 aided diagnosis.

关键词

COVID-19/胸部X-ray图像/残差神经网络/vision transformer/注意力机制

Key words

COVID-19/chest X-ray image/residual neural network/vision transformer/attention mechanism

分类

信息技术与安全科学

引用本文复制引用

周涛,刘赟璨,侯森宝,常晓玉,叶鑫宇,陆惠玲..M3 Res-Transformer:新冠肺炎胸部X-ray图像识别模型[J].电子学报,2024,52(2):589-601,13.

基金项目

国家自然科学基金(No.62062003) （No.62062003）

宁夏自治区重点研发计划(No.2020BEB04022) （No.2020BEB04022）

北方民族大学研究生创新项目(No.YCX22198,No.YCX22190) National Natural Science Foundation of China(No.62062003) （No.YCX22198,No.YCX22190）

Key Research and Development Plan of Ningxia Autonomous Region(No.2020BEB04022) （No.2020BEB04022）

Graduate Innovation Project of North Minzu Univer-sity(No.YCX22198,No.YCX22190) （No.YCX22198,No.YCX22190）

电子学报

OA北大核心CSTPCD

ISSN：0372-2112

访问量0

下载量0

段落导航