中国医疗设备2025,Vol.40Issue(10):44-50,7.DOI:10.3969/j.issn.1674-1633.20241568
融合跨视角特征和注意力机制的医学影像报告生成方法
Medical Image Report Generation Method Fusing Cross-View Features and Attention Mechanism
摘要
Abstract
Objective To explore a cross-perspective strategy for aligning medical images and medical reports to optimize the quality of medical image reports automatically generated by deep learning models.Methods A generative model combining cross-view features and attention mechanism was designed.Firstly,different angles were taken by the medical image and the pathological description features of the report were encoded based on the pre-trained model.Then,multiple attention mechanisms were utilized to complete the fusion calculation of features.Finally,the joint features were decoded into case reports using the decoder.Results It could be known from multiple rounds of tests on the authoritative and publicly available chest X-Ray dataset IU X-ray that the average performance of this model in the evaluation metrics of BLEU-1,BLEU-2,BLEU-3,BLEU-4,METEOR,ROUGE_L and P_MEAN was higher than that of the previously proposed methods by 8.34%,14.20%,10.90%,6.14%,1.70%and 5.50%respectively,and the comprehensive performance was improved by 7.79%.Conclusion This model performs well in terms of the accuracy and fluency of report generation,indicating that this fusion strategy can better capture the potential connection between images and reports,and improve the performance of the model in generating reports.关键词
医学影像/编解码器/跨视角特征/注意力机制/报告自动生成/特征融合Key words
medical imaging/codec/cross-perspective features/attention mechanism/automatic report generation/feature fusion分类
医药卫生引用本文复制引用
董雅儒,周霏,王亚如,张东琦..融合跨视角特征和注意力机制的医学影像报告生成方法[J].中国医疗设备,2025,40(10):44-50,7.基金项目
天津市医学重点学科(专科)建设项目(TJYXZDXK-009A). (专科)