| 注册
首页|期刊导航|新发传染病电子杂志|基于可解释机器学习的中青年初治肺结核患者治疗失败预测模型构建与验证

基于可解释机器学习的中青年初治肺结核患者治疗失败预测模型构建与验证

刘慧梅 张硕 许昌伟 韩溪溪 席向宇

新发传染病电子杂志2026,Vol.11Issue(1):44-49,6.
新发传染病电子杂志2026,Vol.11Issue(1):44-49,6.DOI:10.19871/j.cnki.xfcrbzz.2026.01.007

基于可解释机器学习的中青年初治肺结核患者治疗失败预测模型构建与验证

Construction and validation of a predictive model for treatment failure in young and middle-aged patients with initial treatment of pulmonary tuberculosis based on interpretable machine learning

刘慧梅 1张硕 1许昌伟 1韩溪溪 1席向宇1

作者信息

  • 1. 首都医科大学附属北京地坛医院徐州医院/徐州市第七人民医院结核三科,江苏 徐州 221000
  • 折叠

摘要

Abstract

Objective To construct treatment failure prediction model for young and middle-aged patients with initially treated pulmonary tuberculosis based on machine learning,and to provide a reference for the early clinical identification of high-risk patients with treatment failure and the formulation of personalized intervention strategies.Method A total of 760 young and middle-aged patients with initially treated pulmonary tuberculosis admitted to Beijing Ditan Hospital,Xuzhou Hospital,Capital Medical University,Xuzhou Seventh People's Hospital from January 2022 to June 2024 were selected as the research subjects.Using SPSS software to generate random numbers,according to 7:3,the patients were randomly divided into modeling group(n=532)and validation group(n=228).Based on modeling group data,the least absolute shrinkage and selection operator(LASSO)regression was used to screen the characteristic variables of patient treatment failure.Six machine learning methods,including random forest(RF),support vector machine(SVM),logistic regression(LR),naive bayes(NB),artificial neural network(ANN)and gradient boosting machine(GBM)were selected to construct treatment failure prediction model.The effectiveness of model was evaluated by AUC of ROC,calibration curve and decision curve.In addition,the shapley additive interpretation(SHAP)method was used to explain the contributions of each variable to the results.Result LASSO regression screened 5 non-zero coefficient indicators,including residence,smoking history,sputum positive,pulmonary cavity and controlling nutritional status(CONUT)score.Among the 6 machine learning models,the GBM model had the highest AUC and F1 scores.The ROC curve showed that the AUC of GBM model in modeling group and validation group were 0.881 and 0.865,respectively.The calibration curve showed that GBM model had a good calibration degree in two groups(χ2=8.638,P=0.374;χ2=4.786,P=0.780).The decision curve showed that GBM model had wide range of clinical net benefits in two groups.SHAP analysis showed that the importance ranking of contribution to prediction results of GBM model were CONUT score,residence,pulmonary cavity,sputum positivity and smoking history.Among them,high CONUT scores,rural residence,combined pulmonary cavities,positive sputum results and history of smoking can increase the risk of treatment failure for patients.Conclusion In this study,a predictive model for treatment failure in young and middle-aged patients with newly diagnosed pulmonary tuberculosis was constructed and validated based on six machine learning algorithms,among which the GBM model exhibited the optimal predictive performance and clinical application value.This study demonstrated the key risk factors for treatment failure in patients including high CONUT score,rural residence,presence of pulmonary cavities,sputum positivity and smoking history.This model can provide a basis for clinical prediction and identification of high-risk patients with treatment failure,as well as for developing intervention plans.

关键词

中青年/肺结核/初治/治疗失败/危险因素/机器学习

Key words

Young and middle-aged/Pulmonary tuberculosis/Initially treated/Treatment failure/Risk factor/Machine learning

分类

医药卫生

引用本文复制引用

刘慧梅,张硕,许昌伟,韩溪溪,席向宇..基于可解释机器学习的中青年初治肺结核患者治疗失败预测模型构建与验证[J].新发传染病电子杂志,2026,11(1):44-49,6.

基金项目

1.中国公共卫生联盟课题(GWLM202011) (GWLM202011)

2.徐州市科技计划项目(KC23205) (KC23205)

3.徐州市第七人民医院专病队列项目(KCDL202401) (KCDL202401)

新发传染病电子杂志

2096-2738

访问量1
|
下载量0
段落导航相关论文