| 注册
首页|期刊导航|国际神经病学神经外科学杂志|基于可解释机器学习的卒中患者康复相关出院预测模型研究

基于可解释机器学习的卒中患者康复相关出院预测模型研究

王平 刘爱贤 王丹

国际神经病学神经外科学杂志2026,Vol.53Issue(1):20-27,8.
国际神经病学神经外科学杂志2026,Vol.53Issue(1):20-27,8.DOI:10.16636/j.cnki.jinn.1673-2642.2026.01.003

基于可解释机器学习的卒中患者康复相关出院预测模型研究

Construction of an explainable machine learning-based model for predicting rehabilitation-related discharge in stroke patients

王平 1刘爱贤 1王丹2

作者信息

  • 1. 首都医科大学附属北京康复医院神经康复中心,北京 100144
  • 2. 首都医科大学附属北京康复医院中医康复中心,北京 100144
  • 折叠

摘要

Abstract

Objective To construct an explainable machine learning(XML)-based model for predicting rehabilitation-related discharge in stroke patients,to identify the key influencing factors for rehabilitation-related discharge,and to provide data support for rehabilitation assessment and healthcare resource allocation.Methods Data were extracted from the MIMIC-Ⅳ v3.1 database,and a total of 14 824 stroke patients were identified based on the ICD-9/10 codes.Structured variables including demographic data,admission features,and hospitalization data were extracted to construct predictive models for rehabilitation-related discharge.Logistic regression and extreme gradient boosting(XGBoost)algorithms were used to construct models,and these models were compared in terms of discriminatory ability,calibration performance,and net clinical benefit.The SHapley Additive exPlanations(SHAP)algorithm was used to evaluate the contribution of each feature to the results of prediction,and partial dependence and individual conditional expectation plots were used for interpretability analysis.Results In the overall samples,the rate of rehabilitation-related discharge was 36.9%.The logistic and XGBoost models had an area under the curve of 0.637(95%CI:0.620-0.653)and 0.630(95%CI:0.613-0.647),respectively,indicating a level of moderate discriminatory performance.Both models achieved an average precision of 0.473,with a Brier score of 0.220 and 0.223,respectively,suggesting that both models had good calibration.The decision curve analysis showed that the logistic model provided the greatest net benefit within the threshold range of 30%-40%(P<0.05).The SHAP analysis showed that age(mean SHAP=0.280),insurance type(mean SHAP=0.266),and admission route(mean SHAP=0.237)were the main influencing factors.The partial dependence analysis showed the highest probability of rehabilitation-related discharge around the age of 40 years,which then decreased with the increase in age(P<0.001);the patients admitted through emergency or transfer had a significantly higher probability of rehabilitation-related discharge than those admitted through outpatient service or self-admissions(P<0.001);there was no significant difference between the patients with different subtypes of stroke(P=0.236).The AUC of the model remained stable(0.60-0.70)across 2008-2019,suggesting that the model had good robustness.The high predicted risk group had a significantly higher rate of rehabilitation-related discharge than the low predicted risk group(P<0.001),and the predicted probability was weakly negatively correlated with the length of hospital stay.Conclusions The predictive model for rehabilitation-related discharge in stroke patients is constructed based on XML and SHAP,and the results show that age,insurance type,and admission route are the main characteristic variables affecting rehabilitation-related discharge.The model shows favorable calibration and robustness in real-world data,providing a feasible quantitative tool for early identification of rehabilitation in stroke patients and optimization of resource allocation.

关键词

卒中/康复出院/机器学习/极端梯度提升算法/Logistic回归/可解释机器学习

Key words

stroke/rehabilitation discharge/machine learning/extreme gradient boosting/Logistic regression/explainable machine learning

分类

医药卫生

引用本文复制引用

王平,刘爱贤,王丹..基于可解释机器学习的卒中患者康复相关出院预测模型研究[J].国际神经病学神经外科学杂志,2026,53(1):20-27,8.

基金项目

首都卫生发展科研专项项目(2022-3-2254). (2022-3-2254)

国际神经病学神经外科学杂志

1673-2642

访问量0
|
下载量0
段落导航相关论文