| 注册
首页|期刊导航|实用心脑肺血管病杂志|中国老年高血压患者抑郁发生风险的可解释机器学习预测模型构建:基于CHARLS数据的队列研究

中国老年高血压患者抑郁发生风险的可解释机器学习预测模型构建:基于CHARLS数据的队列研究

林博 袁杭滔 刘迪 胡慧 郑入文

实用心脑肺血管病杂志2026,Vol.34Issue(5):16-21,6.
实用心脑肺血管病杂志2026,Vol.34Issue(5):16-21,6.DOI:10.12114/j.issn.1008-5971.2026.00.114

中国老年高血压患者抑郁发生风险的可解释机器学习预测模型构建:基于CHARLS数据的队列研究

Construction of an Interpretable Machine Learning Prediction Model for the Risk of Depression in Elderly Hypertensive Patients in China:a Cohort Study Based on CHARLS Data

林博 1袁杭滔 1刘迪 2胡慧 2郑入文2

作者信息

  • 1. 100029 北京市,北京中医药大学第二临床医学院
  • 2. 100078 北京市,北京中医药大学东方医院针灸科
  • 折叠

摘要

Abstract

Objective To construct prediction models for the risk of depression in elderly hypertensive patients in China by machine learning algorithms,and screen and explain the best machine learning model.Methods The data of 1 692 elderly patients with hypertension were from the China Health and Retirement Longitudinal Study(CHARLS)database in 2018 and 2020.In the 2020 database,the Center for Epidemiologic Studies Depression Scale-10(CESD-10)scores of patients were collected.Patients with a CESD-10 score≥10 were classified as having depression,and were divided into the depression group(n=515)and the non-depression group(n=1 177)according to whether depression occurred.In the 2018 CHARLS database,the socio-demographic characteristics,behavioral factors and clinical characteristics of patients were collected;LASSO regression analysis was used to screen the characteristic variables.A total of 1 692 elderly patients with hypertension were randomly divided into the training set(n=1 184)and the validation set(n=508)according to the ratio of 7∶3.In the training set,decision tree,random forest,extreme gradient boosting tree,K nearest neighbor algorithm,support vector machine,Logistic regression and naive Bayesian models were constructed respectively,and then the optimal machine learning model was selected in the validation set.The interpretability of the optimal machine learning model was analyzed by Shapley additive explanations(SHAP).Results The results of LASSO regression analysis showed that 9 characteristic variables were finally selected,namely,place of residence,education level,retirement,self-perceived health status,Activity of Daily Living(ADL)Scale score,lung disease,headache,low back pain,and life satisfaction.Based on the above nine characteristic variables,the decision tree,random forest,extreme gradient boosting tree,K-nearest neighbor algorithm,support vector machine,Logistic regression and naive Bayesian models were constructed in the training set.In the validation set,the accuracy,sensitivity,accuracy,AUC of the Logistic regression model were the highest,the specificity of the support vector machine model was the highest,the F1 score of the naive Bayesian model was the highest,and the calibration AUC of the K-nearest neighbor algorithm model was the highest.Comprehensive evaluation,the Logistic regression model was the optimal machine learning model.The results of SHAP analysis showed that in the Logistic regression model,the characteristic variables with SHAP values from high to low were self-perceived health status(-0.201),life satisfaction(-0.194),education level(-0.173),low back pain(0.150),place of residence(0.134),retirement(-0.119),headache(0.096),ADL Scale score(0.093),lung disease(0.076).Conclusion In this study,seven machine learning prediction models for the risk of depression in elderly patients with hypertension in China were constructed based on self-perceived health status,life satisfaction,education level,low back pain,place of residence,retirement,headache,ADL Scale score,and lung disease,among which the Logistic regression model is optimal.

关键词

高血压/抑郁/老年人/中国/机器学习/Shapley加法解释

Key words

Hypertension/Depression/Elderly/China/Machine learning/Shapley additive explanation

分类

医药卫生

引用本文复制引用

林博,袁杭滔,刘迪,胡慧,郑入文..中国老年高血压患者抑郁发生风险的可解释机器学习预测模型构建:基于CHARLS数据的队列研究[J].实用心脑肺血管病杂志,2026,34(5):16-21,6.

基金项目

北京中医药大学2023年度揭榜挂帅项目(2023-JYB-JBZD-025) (2023-JYB-JBZD-025)

北京中医药大学东方医院高水平能力建设项目"国自然培育计划"——国家级人才培育项目(DFGZRA-2024GJRC007) (DFGZRA-2024GJRC007)

实用心脑肺血管病杂志

1008-5971

访问量0
|
下载量0
段落导航相关论文