电子学报2024,Vol.52Issue(2):589-601,13.DOI:10.12263/DZXB.20220999
M3 Res-Transformer:新冠肺炎胸部X-ray图像识别模型
M3 Res-Transformer:Chest X-ray Image Recognition Model of COVID-19
摘要
Abstract
COVID-19 has seriously affected human life and health since its outbreak.In recent years,residual neural network has been widely used in COVID-19 recognition task to assist doctors to quickly diagnose COVID-19 patients.However,the shape of COVID-19 image lesion regions is complex,the size is different,and the boundary with surrounding tissues is blurred,which make it difficult for the network to extract effective features.Aiming at the above problems,a M3 Res-Transformer model for COVID-19 Chest X-ray image recognition is proposed.Res-Transformer is used as the back-bone network of the model,combining ResNet and ViT to effectively integrate local lesion features and global features;A mixed residual attention module(mraM)is designed to enhance the feature expression ability of the network by considering the interdependence of channels and spatial locations;In order to increase the receptive field and extract multi-scale fea-tures,the multi-scale dilated residual module(mdrM)is constructed by superimposing dilated convolution with different di-lation rates,and three mdrM with gradually shrinking scales are used for multi-scale feature extraction according to the dif-ference of feature scales at different layers;The contextual cross-awareness module(ccaM)is proposed,which uses the se-mantic information of deep features to guide shallow features,then embeds the spatial information of shallow features into deep features,and uses the cross-weighted attention mechanism to efficiently aggregate deep and shallow features to obtain richer contextual information.In order to verify the effectiveness of the model in this paper,experiments were conducted on the Chest X-ray image dataset of COVID-19,and through comparison with advanced CNN classification models,com-parison with ResNet50 models fusing different attention mechanisms,comparison with Transformer-based classification models and ablation experiment,the results showed that the Acc,Pre,Rec,F1-Score and Spe indexes of the proposed model are 96.33%,96.36%,96.33%,96.35%and 96.26%respectively,which effectively improves the recognition accuracy in CO-VID-19 Chest X-ray image recognition task,then it is further verified by visualization method,which provides important reference value for COVID-19 aided diagnosis.关键词
COVID-19/胸部X-ray图像/残差神经网络/vision transformer/注意力机制Key words
COVID-19/chest X-ray image/residual neural network/vision transformer/attention mechanism分类
信息技术与安全科学引用本文复制引用
周涛,刘赟璨,侯森宝,常晓玉,叶鑫宇,陆惠玲..M3 Res-Transformer:新冠肺炎胸部X-ray图像识别模型[J].电子学报,2024,52(2):589-601,13.基金项目
国家自然科学基金(No.62062003) (No.62062003)
宁夏自治区重点研发计划(No.2020BEB04022) (No.2020BEB04022)
北方民族大学研究生创新项目(No.YCX22198,No.YCX22190) National Natural Science Foundation of China(No.62062003) (No.YCX22198,No.YCX22190)
Key Research and Development Plan of Ningxia Autonomous Region(No.2020BEB04022) (No.2020BEB04022)
Graduate Innovation Project of North Minzu Univer-sity(No.YCX22198,No.YCX22190) (No.YCX22198,No.YCX22190)