| 注册
首页|期刊导航|南京大学学报(自然科学版)|基于可解释多任务学习模型揭示糖尿病联合并发症的关键特征及预测建模

基于可解释多任务学习模型揭示糖尿病联合并发症的关键特征及预测建模

王志轩 罗冬梅

南京大学学报(自然科学版)2026,Vol.62Issue(2):285-296,12.
南京大学学报(自然科学版)2026,Vol.62Issue(2):285-296,12.DOI:10.13232/j.cnki.jnju.2026.02.011

基于可解释多任务学习模型揭示糖尿病联合并发症的关键特征及预测建模

Interpretable multitask learning-based model reveals key features and predictive modelling of joint complications in diabetes mellitus

王志轩 1罗冬梅1

作者信息

  • 1. 安徽工业大学微电子与数据科学学院,马鞍山,243002
  • 折叠

摘要

Abstract

Complications of diabetes mellitus are important factors in patient mortality,and revealing their key features can effectively help physicians develop targeted intervention strategies to reduce the risk of death in comorbid conditions.However,most previous studies have focused on identifying risk factors for a single complication of diabetes,ignoring potential associations between complications.Therefore,based on the Diabetes Complications Early Warning Dataset provided by the National Population Health Sciences Data Centre,we used Pearson's correlation coefficient and the chi-square test to screen out significantly associated diabetic complications and incorporated them into a multi-task learning model for joint modeling.Then the importance of each feature was assessed using SHAP(SHapley Additive exPlanations),and 11 features with SHAP values higher than the 75%quartile were screened as significant risk factors for diabetes co-morbidities.A predictive model for diabetes-related complications was constructed using random forest,logistic regression,gradient boosting,extreme gradient boosting,adaptive boosting,and categorical feature gradient boosting.Input variables comprised features with SHAP values exceeding the 25th percentile.Optimal parameter combinations were selected via grid search,with model predictive performance evaluated using metrics including accuracy,precision,F1-score,and AUC.Results indicated that features selected through the interpretable multi-task learning model constituted key predictors,with all six predictive models achieving AUC values approaching 0.90.Finally,LIME(Local Interpretable Model-Agnostic Explanations)was introduced to interpret the model outcomes,thereby further validating the effectiveness and reliability of the constructed interpretable multi-task learning model for screening key features.The interpretable multi-task learning model comprehensively accounts for the underlying relationships between complications,enabling the precise identification of key risk factors for concurrent diabetic complications.This assists clinicians in formulating targeted intervention strategies,thereby helping to reduce patient mortality attributable to complications.

关键词

糖尿病/可解释技术/多任务学习/联合并发症

Key words

diabetes mellitus/interpretable technique/multi-task learning/joint complications

分类

医药卫生

引用本文复制引用

王志轩,罗冬梅..基于可解释多任务学习模型揭示糖尿病联合并发症的关键特征及预测建模[J].南京大学学报(自然科学版),2026,62(2):285-296,12.

基金项目

安徽省教育教学改革研究项目(2024sx047),安徽省高校自然科学基金重点研究项目(2022AH050328) (2024sx047)

南京大学学报(自然科学版)

0469-5097

访问量0
|
下载量0
段落导航相关论文