| 注册
首页|期刊导航|山西大学学报(自然科学版)|一种面向能源贫困识别的轻量可解释梯度提升树

一种面向能源贫困识别的轻量可解释梯度提升树

王政 裔扬 史颖 赵兴旺 吴晨旭

山西大学学报(自然科学版)2024,Vol.47Issue(6):1190-1200,11.
山西大学学报(自然科学版)2024,Vol.47Issue(6):1190-1200,11.DOI:10.13451/j.sxu.ns.2024119

一种面向能源贫困识别的轻量可解释梯度提升树

A Lightweight Interpretable Gradient Boosting Tree for Energy Poverty Identification

王政 1裔扬 2史颖 3赵兴旺 4吴晨旭4

作者信息

  • 1. 太原师范学院 智能优化计算与区块链技术山西省重点实验室,山西 晋中 030619
  • 2. 扬州大学 信息工程学院,江苏 扬州 225127
  • 3. 太原师范学院 计算机科学与技术学院,山西 晋中 030619||山西大学 计算机与信息技术学院,山西 太原 030006
  • 4. 山西大学 计算机与信息技术学院,山西 太原 030006
  • 折叠

摘要

Abstract

In order to solve the problems of insufficient training,overfitting,and poor interpretability of the traditional gradient boost-ing tree method in identifying energy poverty,this paper designs a lightweight and interpretable gradient boosting tree for energy poverty identification.First,the noise samples such as missing values and outliers in the original data are eliminated,and the sample gradients after feature correlation analysis are sorted to realize the segmentation of internal nodes of the gradient boosting tree and to achieve the lightweight of the model.Then,the feature binding technology is used to accelerate the training process.Second,the model interpretation method is introduced to analyze the influencing factors to quantify the impact of different features on energy poverty identification,which enhances the interpretability of the model.Experimental results on a typical energy poverty identifica-tion dataset show that compared with other methods[LR(Logistic Regression),KNN(K-Nearest Neighbor),SVM(Support Vector Machine),RF(Random Forest),CART(Classification and Regression Tree),XGBoost(eXtreme Gradient Boosting),GradientBoost-ing],the lightweight interpretable model proposed in this paper achieves an AUC(Area Under Curve)value of 99.61%,showing an improvement of 0.2%to 17.8%,and thus shows a more obvious advantage.

关键词

LightGBM(Light Gradient Boosting Machine)模型/能源贫困预测/特征关联分析/模型解释方法

Key words

LightGBM model/energy poverty prediction/feature correlation analysis/model interpretation method

分类

信息技术与安全科学

引用本文复制引用

王政,裔扬,史颖,赵兴旺,吴晨旭..一种面向能源贫困识别的轻量可解释梯度提升树[J].山西大学学报(自然科学版),2024,47(6):1190-1200,11.

基金项目

国家自然科学基金(92371116) (92371116)

山西大学学报(自然科学版)

OA北大核心CSTPCD

0253-2395

访问量0
|
下载量0
段落导航相关论文