| 注册
首页|期刊导航|灾害学|基于随机森林与K最近邻模型的澜沧江中游GX水电站库区滑坡易发性评价

基于随机森林与K最近邻模型的澜沧江中游GX水电站库区滑坡易发性评价

孙宁 曾伟 余政兴 汤冠雄 钟辉亚 王龙 戴福初 柯尊弘 张志红

灾害学2026,Vol.41Issue(3):39-48,10.
灾害学2026,Vol.41Issue(3):39-48,10.DOI:10.3969/j.issn.1000-811X.2026.03.005

基于随机森林与K最近邻模型的澜沧江中游GX水电站库区滑坡易发性评价

Landslide Susceptibility Assessment of the GX Hydropower Station Reservoir Area in the Middle Reach of the Lancang River Based on Random Forest and K-Nearest Neighbor Models

孙宁 1曾伟 2余政兴 1汤冠雄 1钟辉亚 1王龙 2戴福初 3柯尊弘 3张志红3

作者信息

  • 1. 中国电建集团中南勘测设计研究院有限公司,湖南 长沙 410014
  • 2. 华能澜沧江水电股份有限公司,云南 昆明 650214||西藏自治区澜沧江清洁能源安全绿色智能建设技术创新中心,云南 昆明 650214
  • 3. 北京工业大学 建筑工程学院,北京 100124
  • 折叠

摘要

Abstract

Landslides represent one of the most significant geological hazards threatening infrastructure safety and human settlements in mountainous regions worldwide.In the context of large-scale hydropower development in tecton-ically active and topographically complex alpine environments,accurate landslide susceptibility assessment is essential for risk mitigation and sustainable engineering planning.This study is focused on the reservoir area of the GX hydro-power station located in the middle reach of the Lancang River,a region characterized by extreme relief,active fault-ing,and diverse lithological assemblages within the southeastern margin of the Tibetan Plateau.Covering an area of ap-proximately 980 square kilometers,the study zone exhibits pronounced spatial heterogeneity in landslide distribution,with a notable concentration along the left bank of the Lancang River valley.To address the challenges posed by such complex terrain,the aim of this research is set to evaluate and compare the performance of two machine learning mod-els,Random Forest(RF)and K Nearest Neighbor(KNN),in mapping landslide susceptibility and identifying domi-nant controlling factors.Based on remote sensing imagery from Google Earth Pro and field validation,a comprehen-sive inventory of 980 landslides is established.To ensure balanced model training,an equal number of non-landslide samples are randomly generated outside a 100-meter buffer around all identified landslides.Eleven environmental fac-tors are initially considered,including elevation,slope,aspect,plan curvature,profile curvature,roughness,relative re-lief,lithology,distance to faults,normalized difference vegetation index(NDVI),and distance to rivers.After assess-ing multicollinearity using Pearson correlation coefficients,roughness is excluded due to its high correlation with slope,resulting in a final set of 10 independent predictors.All spatial analyses and modeling are implemented within a GIS environment using a 30-meter resolution NASADEM dataset.The RF and KNN models are trained on 70 percent of the samples and validated on the remaining 30 percent.Model performance is evaluated using the Area Under the Re-ceiver Operating Characteristic Curve(AUC).The RF model achieved an AUC of 0.776 9,slightly outperforming the KNN model with an AUC of 0.766 4,indicating superior discriminative capability in this geologically intricate set-ting.Spatially,both models have identified high susceptibility zones primarily along valley flanks.However,the RF derived map exhibited more coherent,moderately distributed high risk areas with strong spatial continuity,whereas the KNN result showed noticeable overgeneralization and fragmentation of high susceptibility patches.Factor importance analysis revealed that the RF model effectively integrated multidimensional controls,with slope,curvature-related met-rics,NDVI,relative relief,and lithology emerging as the top six contributors.In contrast,the KNN model placed great-er emphasis on topographic proximity metrics such as relative relief,distance to rivers,and elevation,while assigning minimal weight to lithology and vegetation,reflecting its reliance on local spatial similarity rather than complex factor interactions.Using the natural breaks classification method on RF predicted probabilities,the study area is divided into five susceptibility levels.Statistical validation confirmed that the majority of actual landslides fell within the high and very high susceptibility zones,while non landslide points are predominantly located in low risk areas,demonstrating the model's reliability.Notably,the pronounced asymmetry in landslide density between the left and right banks corre-lates with differences in lithology,where weak clastic sedimentary rocks dominate the left bank,and proximity to major fault systems.A systematic comparison of RF and KNN in a high-relief is provided in this study,tectonically active reservoir setting,highlighting the advantages of RF in capturing nonlinear relationships and producing spatially coher-ent landslide susceptibility patterns.The results further clarify the roles of lithology and fault proximity in controlling the spatial heterogeneity and left-right bank asymmetry of landslide occurrence in the GX reservoir area,offering practical support for landslide risk management during hydropower construction and operation.

关键词

滑坡易发性评价/随机森林模型/K最近邻模型/澜沧江/GX水电站

Key words

landslide susceptibility assessment/Random Forest model/K-Nearest Neighbor model/Lancang River/GX hydropower station

分类

资源环境

引用本文复制引用

孙宁,曾伟,余政兴,汤冠雄,钟辉亚,王龙,戴福初,柯尊弘,张志红..基于随机森林与K最近邻模型的澜沧江中游GX水电站库区滑坡易发性评价[J].灾害学,2026,41(3):39-48,10.

基金项目

中国华能集团有限公司科技项目"高寒强震区高混凝土拱坝重大技术及生态保护技术研究"(HNKJ22-H109) (HNKJ22-H109)

云南省马洪琪院士工作站项目(202305AF150207) (202305AF150207)

灾害学

1000-811X

访问量1
|
下载量0
段落导航相关论文