南方医科大学学报2026,Vol.46Issue(5):967-976,10.DOI:10.12122/j.issn.1673-4254.2026.05.01
绝经后女性冠心病发生风险评估模型的建立与外部验证:一项双中心回顾性研究
Development and validation of a machine learning-based model for assessing coronary artery disease risk in postmenopausal women:a dual-center retrospective study
摘要
Abstract
Objective To investigate the risk factors of coronary heart disease(CHD)and develop a risk assessment model for CHD in postmenopausal women.Methods General information,medical history,and laboratory test results of the patients were collected from postmenopausal women with CHD admitted to two medical centers in Yangzhou(Jiangsu Province,China)from November,2018 to November,2023.After excluding cases with incomplete medical records,1197 patients were included,who were divided into the training cohort(n=821)and validation cohort(n=376)based on the hospital of admission.In the training cohort,the risk factors for CHD in postmenopausal women were identified using Lasso regression,multivariate logistic regression analysis,and machine learning algorithms including Light Gradient Boosting Machine(LGBM),Random Forest(RF),Decision Tree(DT),Support Vector Machine(SVM),Extreme Gradient Boosting(XGBoost),K-Nearest Neighbors(KNN),and Naive Bayes(NB).Risk assessment models were constructed using these algorithms,and their performance was evaluated using ROC curves,decision curve analysis(DCA),and calibration curves.Results Lasso regression suggested body mass index(BMI)classification and glycated hemoglobin were independent risk factors for CHD in postmenopausal women,whereas age at menopause and high-density lipoprotein cholesterol(HDL-C)were independent protective factors(P<0.05).Among the machine learning models,XGBoost demonstrated the best assessment performance in both the training set(AUC:0.912;sensitivity:0.892;specificity:0.766;recall:0.892;F1-score:0.899)and the validation set(AUC:0.891;sensitivity:0.836;specificity:0.921;recall:0.837;F1-score:0.877).Calibration curve and DCA curve analyses indicated good consistency between the predicted and actual outcomes.A nomogram and SHAP summary plot were used to visualize and interpret the logistic regression model and the XGBoost model,respectively.Conclusion The risk assessment model for CHD in Chinese postmenopausal women established in this study demonstrates good accuracy and applicability to allow early identification of high-risk patients.关键词
绝经后女性/冠心病/列线图Key words
postmenopausal women/coronary heart disease/nomogram引用本文复制引用
邓毅凡,潘俊梅,何胜虎,周玮,张晶..绝经后女性冠心病发生风险评估模型的建立与外部验证:一项双中心回顾性研究[J].南方医科大学学报,2026,46(5):967-976,10.基金项目
Supported by Postgraduate Research&Practice Innovation Program of Jiangsu Province(SJCX24-2350),and Clinical Trials Fund of Nor thern Jiangsu People's Hospital(SBLC25008). 江苏省研究生科研与实践创新计划资助项目(SJCX24-2350) (SJCX24-2350)
苏北人民医院临床研究专项资金(SBLC25008) (SBLC25008)