Medical Report Generation Based on Prior Prompt-Driven Semantic Consistency (基于先验提示驱动语义一致的医学报告生成)

Tan Zhe (谭喆), Huang Guoheng (黄国恒), Zhang Jing (张静), Yu Yumian (余玉绵)

Journal of Guangdong University of Technology, 2026, Vol. 43, Issue 1: 61-70, 10. DOI: 10.12052/gdutxb.240146

Medical Report Generation Based on Prior Prompt Driving Semantically Consistent

Tan Zhe¹, Huang Guoheng¹, Zhang Jing¹, Yu Yumian¹

Author information

  • 1. School of Computer Science, Guangdong University of Technology, Guangzhou, Guangdong 510006

Abstract

Automated radiology report generation is crucial for reducing radiologist workload and minimizing diagnostic errors. Although existing studies have examined lesion regions in depth, the detailed descriptions they generate still leave room for improvement. Current methods tend to lose sensitivity to the semantic information of visual lesions and to weaken the critical association between visual and textual semantics. To address these limitations, this paper introduces a Prior Prompt-Driven Semantic Consistency Model (PPD-SCM). Its Prompt-Lesion Enhancement module systematically integrates both normal and abnormal diagnostic descriptions of chest X-ray images to construct prior prompts. By employing a prompt attention mechanism that fuses visual features with textual prompts, this module enhances the model's ability to perceive potential lesion features. In addition, a Visual-Textual Semantic Consistency (VTSC) module employs contrastive learning to deeply align visual and textual semantics. By leveraging prompt tokens to guide the model in generating enriched contextual information, the VTSC module optimizes the subsequent report generation process. It reduces the semantic gap between medical images and the generated reports, thereby improving the accuracy and reliability of report generation. Extensive experiments on the IU X-Ray and MIMIC-MV datasets demonstrate that the proposed method significantly outperforms existing approaches in generating high-quality radiology reports.
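
The abstract names two mechanisms without giving their formulas: a prompt attention step that fuses visual features with textual prompts, and a contrastive objective that aligns visual and textual semantics. The sketch below is only an illustration of these two ideas under stated assumptions, not the PPD-SCM implementation. It assumes scaled dot-product cross-attention with a residual connection for the fusion step and a symmetric InfoNCE loss for the alignment step; the function names, the temperature value, and the requirement that visual and prompt vectors share one dimension are all hypothetical choices made for the example.

```python
import math


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def l2_normalize(v):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(dot(v, v)) or 1.0
    return [x / norm for x in v]


def log_softmax(scores):
    """Numerically stable log-softmax over a list of scores."""
    m = max(scores)
    log_z = m + math.log(sum(math.exp(s - m) for s in scores))
    return [s - log_z for s in scores]


def prompt_attention(visual_feats, prompt_embs):
    """Assumed fusion step: each visual feature (query) attends over the
    prompt embeddings (keys and values), and the attended prompt context
    is added back to the visual feature as a residual."""
    d = len(prompt_embs[0])
    fused = []
    for q in visual_feats:
        scores = [dot(q, k) / math.sqrt(d) for k in prompt_embs]
        weights = [math.exp(s) for s in log_softmax(scores)]
        context = [sum(w * p[i] for w, p in zip(weights, prompt_embs))
                   for i in range(d)]
        fused.append([qi + ci for qi, ci in zip(q, context)])
    return fused


def contrastive_loss(image_embs, text_embs, temperature=0.07):
    """Assumed alignment step: symmetric InfoNCE, where the image and the
    report embedding at the same index form the positive pair."""
    img = [l2_normalize(v) for v in image_embs]
    txt = [l2_normalize(v) for v in text_embs]
    n = len(img)
    logits = [[dot(i, t) / temperature for t in txt] for i in img]
    # Image-to-text direction: softmax over each row.
    i2t = -sum(log_softmax(logits[k])[k] for k in range(n)) / n
    # Text-to-image direction: softmax over each column.
    t2i = -sum(log_softmax([logits[r][k] for r in range(n)])[k]
               for k in range(n)) / n
    return 0.5 * (i2t + t2i)


if __name__ == "__main__":
    # Toy data: two image embeddings and two report embeddings.
    images = [[1.0, 0.0], [0.0, 1.0]]
    reports = [[1.0, 0.0], [0.0, 1.0]]
    prompts = [[1.0, 0.0], [0.0, 1.0]]
    print(prompt_attention(images, prompts))
    print(contrastive_loss(images, reports))
```

In this toy setting the loss is close to zero when each image embedding matches its own report embedding and grows when the pairs are shuffled, which is the behavior a semantic consistency objective of this kind relies on.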

Key words

medical report generation/semantic consistency/attention mechanism

Classification

Information Technology and Safety Science

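Cite this article

Tan Zhe, Huang Guoheng, Zhang Jing, Yu Yumian. Medical Report Generation Based on Prior Prompt Driving Semantically Consistent [J]. Journal of Guangdong University of Technology, 2026, 43(1): 61-70, 10.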

Funding

Key-Area Research and Development Program of Guangdong Province (2018B010109007)

Key-Area Research and Development Program of Guangzhou (2023B01J0029)

Journal of Guangdong University of Technology

ISSN 1007-7162
