Journal of Guangdong University of Technology, 2026, Vol. 43, Issue 1: 61-70 (10 pages). DOI: 10.12052/gdutxb.240146
Medical Report Generation Based on Prior Prompt-Driven Semantic Consistency
Abstract
Automated radiology report generation is crucial for reducing radiologists' workload and minimizing diagnostic errors. Although existing studies have examined lesion regions in depth, there is still room for improvement in generating detailed descriptions. Current methods tend to lose sensitivity to the semantic information of visual lesions and weaken the critical association between visual and textual semantics. To address these limitations, this paper introduces a novel Prior Prompt-Driven Semantic Consistency Model (PPD-SCM). Its Prompt-Lesion Enhancement module systematically integrates both normal and abnormal diagnostic descriptions from radiological chest X-ray images to construct prior prompts. By employing a prompt attention mechanism that fuses visual features with textual prompts, this module enhances the model's ability to perceive potential lesion features. Furthermore, a Visual-Textual Semantic Consistency (VTSC) module employs contrastive learning to deeply align visual and textual semantics. By leveraging prompt tokens to guide the model in generating enriched contextual information, the VTSC module optimizes the subsequent report generation process. It effectively reduces the semantic gap between medical images and the generated reports, thereby enhancing the accuracy and reliability of report generation. Extensive experiments on the IU X-Ray and MIMIC-MV datasets demonstrate that the proposed method significantly outperforms existing approaches in generating high-quality radiology reports.
Key words
medical report generation / semantic consistency / attention mechanism
Classification
Information Technology and Security Science
Cite this article
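Tan Zhe, Huang Guoheng, Zhang Jing, Yu Yumian. Medical report generation based on prior prompt-driven semantic consistency [J]. Journal of Guangdong University of Technology, 2026, 43(1): 61-70.
Funding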
Key-Area Research and Development Program of Guangdong Province (2018B010109007)
Key-Area Research and Development Program of Guangzhou (2023B01J0029)