医学新知2024,Vol.34Issue(8):871-879,9.DOI:10.12173/j.issn.1004-5511.202405116
基于机器学习算法构建中重度围绝经期综合征预测模型
Construction of a prediction model for moderate to severe perimenopausal syndrome based on machine learning algorithms
摘要
Abstract
Objective To identify risk factors for perimenopausal syndrome(PMS)among perimenopausal women using machine learning algorithms,and to construct a predictive model for the risk of developing moderate to severe PMS in perimenopausal women.Methods Perimenopausal women from 48 communites in Pudong New Area,Shanghai were selected as the study subjects.Based on the Kupperman index,participants were divided into the normal or mild PMS group and the moderate to severe PMS group.The data was randomly divided into training set and testing set,and feature selection was performed using the Boruta algorithm and SHAP algorithm.Logistic regression(LR),random forest(RF),support vector machine(SVM),and gradient boosting decision tree(GBDT)were constructed,and model performances were evaluated using accuracy,precision,recall,area under curve(AUC)of the receiver operating characteristic curve,and F1-score.Results A total of 856 perimenopausal women were included in the study,of which 557 were in the normal or mild PMS group and 299 were in the moderate to severe PMS group;599 were in the training set and 257 were in the testing set.9 features(employment status,exercise,age,menstrual condition,medical history,obesity,residence area,history of health education,household register)were selected as predictors for the final model using the Boruta algorithm and SHAP analysis.After parameter tuning,the 10-fold cross-validation AUC of LR,RF,SVM,and GBDT models based on the training set were 0.64,0.77,0.74,and 0.77,respectively.The AUC of the LR,RF,SVM,and GBDT models based on the testing set were 0.63,0.69,0.69,and 0.73,respectively,with recall rates of 0.59,0.55,0.55,and 0.62.Conclusion Among the constructed predictive models for the risk of developing moderate to severe PMS in perimenopausal women,the GBDT model demonstrated the best predictive performance and has potential clinical value.This study provides a new approach for the early identification and intervention of moderate to severe PMS in perimenopausal women.关键词
围绝经期综合征/围绝经期妇女/机器学习/预测Key words
Perimenopausal syndrome/Perimenopausal women/Machine learning/Prediction引用本文复制引用
张敏,顾婷婷,关蔚,刘想想,施君瑶..基于机器学习算法构建中重度围绝经期综合征预测模型[J].医学新知,2024,34(8):871-879,9.基金项目
上海市浦东新区卫生健康委员会面上项目(PW2020A-79) (PW2020A-79)
上海市浦东新区第二轮医学学科建设项目(PWYgts2021-02) (PWYgts2021-02)