首页|期刊导航|北京师范大学学报（自然科学版）|融合特征选择与可解释性机器学习的地下水位动态预测模型框架研究

融合特征选择与可解释性机器学习的地下水位动态预测模型框架研究

孙康宁谭倩胡立堂黄秋森

北京师范大学学报（自然科学版）2026，Vol.62Issue(1)：110-121,12.

北京师范大学学报（自然科学版）2026，Vol.62Issue(1)：110-121,12.DOI:10.12202/j.0476-0301.2025181

融合特征选择与可解释性机器学习的地下水位动态预测模型框架研究

A dynamic prediction framework for groundwater level integrating feature selection and interpretable machine learning

孙康宁 ¹谭倩 ²胡立堂 ³黄秋森⁴

作者信息

1. 广东工业大学生态环境与资源学院,广东广州||广东省地质调查研究院,广东广州||北京师范大学地下水污染控制与修复教育部工程研究中心,北京
2. 广东工业大学生态环境与资源学院,广东广州
3. 北京师范大学地下水污染控制与修复教育部工程研究中心,北京
4. 广东省环境科学研究院,广东广州
折叠

摘要

Abstract

Accurate groundwater level(GWL)prediction is key to achieving precise management and scientific decision-making of groundwater resources.GWL prediction based on machine learning faces the dual challenges of unclear physical significance of input variables and the"black box"nature of models,which often leads to poor interpretability.To address this problem,in this study we innovatively construct a GWL prediction framework integrating K-means clustering,LASSO-CV variable screening,and machine learning models.Data from 36 observation wells in the Yongding River alluvial fan are used to identify seven distinct driving factor combination patterns with clear physical significance.Spatial analysis reveals that the impact of human activities(such as water supply)on groundwater level change(GWLC)increases significantly from the middle and upper reaches to the lower reaches of the alluvial fan.Comparing four machine learning models,it is found that the long short-term memory(LSTM)and support vector regression(SVR)models are more suitable;averaged Nash efficiency coefficients(NSE)for GWLC prediction during the validation phase are-0.07 and-0.03,respectively,with 13 and 15 wells achieving acceptable results(NSE＞0).After coupling variable screening mechanism,the prediction accuracy of LSTM and SVR models(K-LSTM,K-SVR)is significantly improved,with averaged NSE during validation phase increasing by 0.13 and 0.04 compared to original models,respectively.To further enhance model interpretability,the shapley additive explanations(SHAP)method is used to quantitatively reveal contribution mechanisms of precipitation(P),temperature(T),and water supply(Qws)in different types of wells.P generally has a significant positive impact.Effect of T varies regionally.Qws mainly has a negative impact.The integrated process framework of"feature screening-model selection-method establishment-result interpretation"proposed in this study has collaboratively improved the accuracy and physical interpretability of GWL prediction,providing a novel methodology and reliable basis for decision-making to address similar complex environmental system prediction and attribution problems.

关键词

地下水位预测/特征筛选/模型解释/机器学习

Key words

groundwater level prediction/feature selection/model interpretation/machine learning

分类

天文与地球科学

引用本文复制引用

孙康宁,谭倩,胡立堂,黄秋森..融合特征选择与可解释性机器学习的地下水位动态预测模型框架研究[J].北京师范大学学报（自然科学版）,2026,62(1):110-121,12.

基金项目

国家自然科学基金杰出青年科学基金资助项目(52125902) （52125902）

地下水污染控制与修复教育部工程研究中心开放资助项目(GW202304) （GW202304）

国家资助博士后研究人员计划资助项目(GZC20230557) （GZC20230557）

北京师范大学学报（自然科学版）

ISSN：0476-0301

访问量0

下载量0

段落导航