| 注册
首页|期刊导航|川北医学院学报|XGBoost-SHAP肺结节早期识别可解释性框架构建

XGBoost-SHAP肺结节早期识别可解释性框架构建

易付良 邹雪娜 李刚 刘昕 向茹梅 骆长玲 邓丽春 余秀莲 周厚容 高扬

川北医学院学报2026,Vol.41Issue(4):422-427,6.
川北医学院学报2026,Vol.41Issue(4):422-427,6.DOI:10.3969/j.issn.1005-3697.2026.04.005

XGBoost-SHAP肺结节早期识别可解释性框架构建

XGBoost-SHAP-based interpretable framework for the early identification of pulmonary nodules

易付良 1邹雪娜 1李刚 2刘昕 3向茹梅 1骆长玲 1邓丽春 1余秀莲 1周厚容 1高扬1

作者信息

  • 1. 自贡市第四人民医院公共卫生科,四川 自贡 643000
  • 2. 自流井区疾病预防控制中心慢病所,四川 自贡 643036
  • 3. 自贡市第一人民医院健康管理中心,四川 自贡 643000
  • 折叠

摘要

Abstract

Objective:To achieve early identification of pulmonary nodules and visual interpretation of key variables through interpretable machine learning,and to facilitate precise prevention,control,early diagnosis and treatment of lung cancer.Methods:This study enrolled individuals at high risk of lung cancer and completed clinical screening.Their high-risk assessment data and imaging results were extracted.Participants were divided into high-risk and low-risk groups for pulmonary nodules based on China's Lung Cancer Screening Standard(T/CPMA 013-2020).Variables with differences identified by univariate analysis were used as predictors,with pulmonary nodule grouping as the dependent variable,to construct an interpretable XGBoost-SHAP identification framework for early nodule detection and visual result interpretation.Results:A total of 644 high-risk individuals were included,with 199(30.9%)in the high-risk pulmonary nodule group.The XGBoost model achieved an accuracy of 0.914 6,sensitivity of 0.758 7,specificity of 0.984 3,F1-score of 0.845 8,and AUC of 0.974 1 for nodule grouping.SHAP analysis revealed that higher SHAP values-and thus increased risk of nodule enlargement-were associated with greater smoking intensity,exposure to secondhand smoke from colleagues/family,infrequent kitchen ventilation during cooking,excessive intake of processed foods,occupational exposure to asbestos/radon,insufficient intake of protein,fruits and vegetables,and manual labor occupation.Conclusion:The constructed interpretable framework performs well in early pulmonary nodule identification.Changes in nodule size are associated not only with traditional risk factors(e.g.,smoking habits,secondhand smoke exposure,cooking fume exposure,occupational asbestos/radon exposure)but also with the participants'dietary habits.

关键词

肺结节/早期识别/XGBoost/SHAP/可解释性框架

Key words

Pulmonary nodules/Early identification/XGBoost/SHAP/Interpretable framework

分类

医药卫生

引用本文复制引用

易付良,邹雪娜,李刚,刘昕,向茹梅,骆长玲,邓丽春,余秀莲,周厚容,高扬..XGBoost-SHAP肺结节早期识别可解释性框架构建[J].川北医学院学报,2026,41(4):422-427,6.

基金项目

成都医学院教育发展基金会科研专项项目(25LHZG-12) (25LHZG-12)

四川省自贡市重点科技计划项目(2024-YGY-01-04) (2024-YGY-01-04)

川北医学院学报

1005-3697

访问量0
|
下载量0
段落导航相关论文