|国家科技期刊平台
首页|期刊导航|湖泊科学|可解释性长短期记忆模型用于预测湖泊总磷浓度变化

可解释性长短期记忆模型用于预测湖泊总磷浓度变化OA北大核心CSTPCD

The interpretable long-term and short-term memory model was used to predict the change of total phosphorus concentration in lakes

中文摘要英文摘要

对湖泊总磷的变化预测和来源识别对水资源调度和流域生态治理有着重要的意义,然而复杂的生化反应和水动力条件导致的非平稳性给湖泊总磷浓度的准确预测带来极大的困难.为克服这一挑战,本文引入了基于加权回归的季节趋势分解(seasonal and trend decomposition using Loess,STL)技术和夏普利加法(SHapley additive exPlanations,SHAP)结合长短期记忆网络(long short-term memory neural network,LSTM)和门控循环单元(gated recurrent unit,GRU)构建了一个可解释的预测框架,以增强对湖泊总磷浓度演变的预测并提高其可解释性.研究表明:(1)在骆马湖总磷浓度的预测中,该框架拥有较好的预报精度(R2=0.878),优于LSTM和卷积长短期记忆模型(convolutional neural networks and long short term memory network,CNN-LSTM).当预测时间步长增加到8 h时,该框架有效提高了总磷浓度的预测精度,平均相对误差和均方根误差分别降低了 47.1%和33.3%.从预测趋势来看,骆马湖在汛期的总磷平均浓度为0.158 mg/L,相较于非汛期的平均浓度,增加了 202.1%.(2)运河来水是骆马湖总磷浓度最重要的影响因素,贡献权重为60.0%,并且不同断面(三湾、三场)的污染源受水动力、气象等因素的影响存在显著的时空差异.本文凸显了神经网络模型在预警水体污染方面的可实施性,并且为提高传统神经网络的学习能力和可解释性的开发与验证提供了重要方向.

The prediction and source identification of total phosphorus(TP)in lakes is critical for the management of water re-source and watershed ecology.However,non-stationarity caused by inconstant hydrodynamic conditions and the complex biochemi-cal reactions pose significant challenges in accurate forecast of lake TP concentrations.To address this challenge,this study intro-duced the Seasonal and Trend decomposition using Loess(STL)technique and SHapley additive exPlanations(SHAP),and com-bined them with Long Short-Term Memory(LSTM)and Gated Recurrent Unit(GRU)to develop an interpretable prediction frame-work.The framework was applied to enhance the prediction of lake TP concentrations and improving their interpretability.The study achieved the following results.(1)In the prediction of TP concentrations in Lake Luoma,this framework achieved a higher model fit with an R2 value of 0.878,outperforming LSTM and CNN-LSTM.By increasing the prediction time step to 8 hours,the frame-work achieved a better model fit with a decrease of MRE and RMSE by 47.1%and 33.3%,respectively.An analysis of the predic-tion trend for Lake Luoma revealed that the average TP concentration is 0.158 mg/L during the flood season,202.1%higher than that during non-flood seasons.(2)Canal inflow was the most influential factor on TP concentrations,with a contribution of 60%.Different sections(Sanwan and Sanchang)had large spatiotemporal variations in phosphorus sources influenced by hydrodynamics and meteorological factors.This study highlighted the potential of neural network models in predicting water pollution,and offered valuable insights into enhancing the learning capabilities and interpretability of traditional neural networks.

丁艺鼎;范宏翔;徐力刚;蒋名亮;吕海深;朱永华;程俊翔

河海大学水文与水资源学院,南京 210098||中国科学院南京地理与湖泊研究所,南京 210018中国科学院南京地理与湖泊研究所,南京 210018中国科学院南京地理与湖泊研究所,南京 210018||中国科学院大学南京学院,南京 211135||江西省鄱阳湖流域生态水利技术创新中心,南昌 330029河海大学水文与水资源学院,南京 210098

深度学习LSTMSHAP总磷可解释性研究骆马湖

Deep learningLSTMSHAPtotal phosphorusinterpretabilityLake Luoma

《湖泊科学》 2024 (004)

1046-1059,中插7 / 15

国家自然科学基金项目(42307106,U2240224,42071033)、江西省科技计划项目(20232BAB213053,20213AAG01012,20222BCD46002,20224BAB213035)、江西省水利厅科技项目(202325ZDKT08)和长春市科技发展计划项目(23SH03)联合资助.

10.18307/2024.0415

评论