基于梯度提升的优化集成机器学习算法对滑坡易发性评价:以雅鲁藏布江与尼洋河两岸为例

林琴, 郭永刚, 吴升杰, 臧烨祺, 王国闻. 2024. 基于梯度提升的优化集成机器学习算法对滑坡易发性评价:以雅鲁藏布江与尼洋河两岸为例. 西北地质, 57(1): 12-22. doi: 10.12401/j.nwg.2023031
引用本文: 林琴, 郭永刚, 吴升杰, 臧烨祺, 王国闻. 2024. 基于梯度提升的优化集成机器学习算法对滑坡易发性评价:以雅鲁藏布江与尼洋河两岸为例. 西北地质, 57(1): 12-22. doi: 10.12401/j.nwg.2023031
LIN Qin, GUO Yonggang, WU Shengjie, ZANG Yeqi, WANG Guowen. 2024. Evaluation of Landslide Susceptibility by Optimization Integrated Machine Learning Algorithm Based on Gradient Boosting: Take Both Banks of Yarlung Zangbo River and Niyang River as Examples. Northwestern Geology, 57(1): 12-22. doi: 10.12401/j.nwg.2023031
Citation: LIN Qin, GUO Yonggang, WU Shengjie, ZANG Yeqi, WANG Guowen. 2024. Evaluation of Landslide Susceptibility by Optimization Integrated Machine Learning Algorithm Based on Gradient Boosting: Take Both Banks of Yarlung Zangbo River and Niyang River as Examples. Northwestern Geology, 57(1): 12-22. doi: 10.12401/j.nwg.2023031

基于梯度提升的优化集成机器学习算法对滑坡易发性评价:以雅鲁藏布江与尼洋河两岸为例

  • 基金项目: 国家自然科学基金重点支持项目“高原重大工程地质灾害监测与分析”(U21A20158),西藏农牧学院研究生创新计划“基于层次分析法的林芝地区滑坡灾害稳定性模糊综合评价”(YJS2022-25),西藏自治区科技重点研发计划项目“基于大数据下西藏重大水电工程强震监测关键技术”(XZ202201ZY0034G)联合资助。
详细信息
    作者简介: 林琴(1997−),女,硕士研究生,从事西藏重大地质灾害监测。E−mail:qinaiyisheng@foxmail.com
    通讯作者: 郭永刚(1966−)男,教授,从事水利水电工程强震安全监测和高原地质灾害监测与分析。E−mail:1960373107@qq.com
  • 中图分类号: P642.22

Evaluation of Landslide Susceptibility by Optimization Integrated Machine Learning Algorithm Based on Gradient Boosting: Take Both Banks of Yarlung Zangbo River and Niyang River as Examples

More Information
  • 雅鲁藏布江与尼洋河两岸地质构造活跃,山体滑坡时常发生,滑坡易发性评价能有效的减少因灾害发生所造成的对人类生命和财产的伤害。笔者基于基尼系数的加权随机森林、XGBoost和LightGBM算法在滑坡易发性中的性能。选取188个滑坡样本和7个影响因素,应用五折交叉验证法训练模型,训练过程中同时考虑特征选择算法、运用贝叶斯方法优化超参数后,采用precision、recall、F1、Accuracy指标对各个级别的预测结果进行分析。结果表明:在高程为32~1 544 m与2 722~3 752 m、坡度为30°~40°、距断裂带、河流与道路200 m以内的区域最容易发生滑坡;滑坡极高与高易发性分布为12.14%和12.41%,低和极低易发性占比分别为26.47%与29.55%,区内一半以上的地区不容易发生滑坡灾害;LightGBM模型在所有模型中表现最好,AUC值为0.843 2,准确度为0.853 1,F1分数为0.834 5;墨脱县的达木乡、帮辛乡,林芝县的丹娘、里龙、扎西饶登乡,朗县的陇村,工布江达的江达乡位于极高易发区,发生滑坡概率极大,在这些地区应采取相应的地质灾害防治措施。

  • 加载中
  • 图 1  研究区地理位置及滑坡分布

    Figure 1. 

    图 2  流程图

    Figure 2. 

    图 3  各评价因子分级图

    Figure 3. 

    图 4  各评价因子与灾害点的关系

    Figure 4. 

    图 5  Gini–RF模型滑坡易发性分区图

    Figure 5. 

    图 6  基于XGBoost的滑坡易发性图

    Figure 6. 

    图 7  XGBoost 五折交叉验证结果

    Figure 7. 

    图 8  基于LightGBM的滑坡易发性图

    Figure 8. 

    图 9  LightGBM学习曲线

    Figure 9. 

    图 10  机器学习模型ROC曲线

    Figure 10. 

    图 11  典型滑坡验证

    Figure 11. 

    表 1  因子间皮尔逊相关系数表

    Table 1.  Pearson correlation coefficient between factors

    因子高程道路河流坡度断裂带与断层地层岩性土地利用类型
    高程1.000 0−0.162 40.155 4−0.170 80.231 7−0.256 4−0.029 8
    道路−0.162 41.000 00.140 50.349 3−0.207 6−0.093 00.002 5
    河流0.155 40.140 51.000 00.126 9−0.067 20.301 10.012 2
    坡度−0.170 80.349 30.126 91.000 0−0.237 1−0.051 0−0.064 9
    断裂带与断层0.231 7−0.207 6−0.067 2−0.237 11.000 0−0.196 0−0.265 4
    地层岩性−0.256 4−0.093 00.301 1−0.051 0−0.196 01.000 00.072 5
    土地利用类型−0.029 80.002 50.012 2−0.064 9−0.265 40.072 51.000 0
    下载: 导出CSV

    表 2  机器学习模型易发性分区对比

    Table 2.  Comparison of machine learning model vulnerability zones

    类别机器学习模型
    Gini–RFXGBoostLightGBM
    栅格
    个数
    栅格
    占比
    滑坡
    点个数
    滑坡
    占比
    栅格
    个数
    栅格
    占比
    滑坡
    点个数
    滑坡
    占比
    栅格
    个数
    栅格
    占比
    滑坡
    点个数
    滑坡
    占比
    极高 14766439 11.99% 44 23.40% 14840333 12.05% 52 27.66% 14951174 12.14% 56 29.79%
    15554640 12.63% 68 36.17% 15394537 12.50% 72 38.30% 15283696 12.41% 75 39.89%
    24114003 19.58% 38 20.21% 24163265 19.62% 40 21.28% 23929268 19.43% 42 22.34%
    32968940 26.77% 22 11.70% 32981256 26.78% 10 5.32% 32599471 26.47% 8 4.26%
    极低 35752274 29.03% 16 8.51% 35776905 29.05% 14 7.45% 36392714 29.55% 7 3.72%
    下载: 导出CSV

    表 3  各机器学习模型准确率

    Table 3.  Accuracy of each machine learning model

    机器学习模型Gini–RFXGBoostLightGBM
    AUC0.752 40.803 50.825 6
    5–fold0.822 50.835 80.843 2
    ACC0.723 40.814 80.825 6
    5–fold0.753 40.835 90.853 1
    F1-score0.775 20.786 70.802 1
    5-fold0.802 60.825 60.834 5
    Precesion0.783 40.796 80.804 5
    5–fold0.802 60.813 20.825 1
    下载: 导出CSV

    表 4  近几年以来滑坡事件

    Table 4.  Landslide events in recent years

    地区位置发生时间来源易发性分区
    林芝市加拉村E 94°54′04″,N 29°41′45″2018.10.29新华社
    林芝市加拉村下游7公里处E 94°54′24″,N 29°41′27″2022.01.22中国青年网
    林芝市波密县古乡索通村羌纳自然村E 95°27′41″,N 30°00′21″2017.8.24中国军视网
    林芝市朗县辖区560国道K80处E 92°49′24″,N 29°04′03″2022.7.22朗县公安局
    林芝市米林县派镇加拉村E 94°54′04″,N 29°41′45″2018.10.17西藏之声
    林芝市朗县E 93°00′48″,N 29°04′42″2022.7.23朗县住建局
    林芝市墨脱县达木乡E 95°27′46″,N 29°29′35″2021.7.4中国自然资源报极高
    国道559线波密至墨脱路段E 97°02′03″,N 29°19′14″2019.5.16西藏自治区交通运输厅极高
    林芝市墨脱县达木珞巴民族乡小学E 95°27′52″,N 29°29′46″2020.8.26新京报极高
    下载: 导出CSV
  • [1]

    贾俊, 毛伊敏, 孟晓捷, 等. 深度随机森林和随机森林算法的滑坡易发性评价对比—以汉中市略阳县为例[J]. 西北地质, 2023, 56(3): 239−249. doi: 10.12401/j.nwg.2023084

    JIA Jun, MAO Yimin, MENG Xiaojie, et al. Comparison of Landslide Susceptibility Evaluation by Deep Random Forest and Random Forest Model: A Case Study of Lueyang County, Hanzhong City[J]. Northwestern Geology, 2023, 56(3): 239−249. doi: 10.12401/j.nwg.2023084

    [2]

    康孟羽, 朱月琴, 陈晨, 等. 基于多元非线性回归和BP神经网络的滑坡滑动距离预测模型研究[J]. 地质通报, 2022, 41(12): 2281−2289.

    KANG Mengyu, ZHU Yueqin, CHEN Chen, et al. Research on landslide sliding distance prediction model based on multiple nonlinear regression and BP neural network[J]. Geological Bulletin of China, 2022, 41(12): 2281−2289.

    [3]

    孟晓捷, 张新社, 曾庆铭, 等. 基于加权信息量法的黄土滑坡易发性评价——以1: 5万天水市麦积幅为例[J]. 西北地质, 2022, 55(2): 249−259.

    MENG Xiaojie, ZHANG Xinshe, ZENG Qingming, et al. The Susceptibility Evaluation of Loess Landslide Based on Weighted Information Value Method: Taking 1: 50 000 Map of Maiji District of Tianshui City As an Example[J]. Northwestern Geology, 2022, 55(2): 249−259.

    [4]

    乔德京, 王念秦, 郭有金, 杨盼盼. 加权确定性系数模型的滑坡易发性评价[J]. 西安科技大学学报, 2020, 40(02): 259-267

    QIAO Dejing, WANG Nianqin, GUO Youjin, et al, Landslide susceptibility assessment based on weighted certainty factor model[J]. Journal of Xi'an University of Science and Technology, 2020, 40(02): 259-267.

    [5]

    沈玲玲, 刘连友, 许冲, 王静璞. 基于多模型的滑坡易发性评价——以甘肃岷县地震滑坡为例[J]. 工程地质学报, 2016, 24(01): 19-28 doi: 10.13544/j.cnki.jeg.2016.01.003

    SHEN Lingling, LIU Lianyou, XU Chong, et al. Multi-models based landslide susceptibility evaluation—illustrated with landslides triggered by minxian earthquake[J]. Journal of Engineering Geology, 2016, 24(01): 19-28. doi: 10.13544/j.cnki.jeg.2016.01.003

    [6]

    苏立彬, 郭永刚, 吴悦, 杨永涛. 基于DEM的尼洋河流域地貌形态分析[J]. 中国水土保持科学, 2020, 18(03): 12-21 doi: 10.16843/j.sswc.2020.03.002

    SU Libin, GUO Yonggang, WU Yue, et al. Analysis of geomorphology of Niyang River Basin based on digital elevation model[J]. soil and water conservation science, 2020, 18(03): 12-21. doi: 10.16843/j.sswc.2020.03.002

    [7]

    王瑞琪, 王学良, 刘海洋, 等. 基于精细 DEM 的崩塌滑坡灾害识别及主控因素分析——以雅鲁藏布江缝合带加查-朗县段为例[J]. 工程地质学报, 2019, 27(5): 1146-1152

    WANG Ruiqi, WANG Xueliang, LIU Haiyang, et al. Identification and main controlling factor analysis of collapse and landslide based on fine dem——taking jiacha-langxian section of yarlung zangbo suture zone as an example[J]. Journal of Engineering Geology, 2019, 27(5): 1146-1152.

    [8]

    武辰爽. 基于GIS的川藏铁路林芝段地质灾害危险性评价[D]. 拉萨: 西藏大学, 2021

    WU Chenshuang. Evaluation of Geological Hazard Risk Based on Geological Information System in Nyingchi of Sichuan-Tibet Railway[D]. Lasa: Tibet University, 2021.

    [9]

    杨创奇, 陶攀, 杨正. 基于逻辑回归树耦合熵指数模型的滑坡易发性分区——以陕西省延安市吴起县滑坡为例[J]. 人民长江, 2022, 53(05): 128-134

    YANG Chuangqi, TAO Pan, YANG Zheng. Landslide susceptibility zoning based on logistic regression tree coupled entropy index model: case of landslide in Wuqi County, Yan'an City, Shaanxi Province[J]. People's Yangtze River, 2022, 53(05): 128-134.

    [10]

    张玘恺, 凌斯祥, 李晓宁, 等. 九寨沟县滑坡灾害易发性快速评估模型对比研究[J]. 岩石力学与工程学报, 2020, 39(8): 1595-1610 doi: 10.13722/j.cnki.jrme.2020.0029

    ZHANG Qikai, LING Sixiang, LI Xiaoning, et al. Comparison of landslide susceptibility mapping rapid assessment models in Jiuzhaigou County, Sichuan province, China[J]. Chinese Journal of Rock Mechanics and Engineering, 2020, 39(8): 1595-1610. doi: 10.13722/j.cnki.jrme.2020.0029

    [11]

    张林梵, 王佳运, 张茂省, 等. 基于BP神经网络的区域滑坡易发性评价[J]. 西北地质, 2022, 55(02): 260-270

    ZHANG Linfan, WANG Jiayun, ZHANG Maosheng, et al. Evaluation of Regional Landslide Susceptibility Assessment Based on BP Neural Network[J]. Northwest Geology, 2022, 55(02): 260-270.

    [12]

    张琪, 巨能攀, 张成强, 等. 库水位变化时陡倾软弱顺层岩质滑坡变形机制[J]. 成都理工大学学报(自然科学版), 2023, 50(2): 206−217.

    ZHANG Qi, JU Nengpan, ZHANG Chengqiang, et al. Landslide deformation mechanism of steep weak bedding rock under the variation of reservoir water level[J], Journal of Chengdu University of Technology (Science & Technology Edition), 2023, 50(2): 206−217.

    [13]

    张文龙, 张振凯, 杨帅. 勉略宁地区地质灾害危险性智能评价和区划研究[J]. 西北地质, 2023, 56(1): 276−283.

    ZHANG Wenlong, ZHANG Zhenkai, YANG Shuai. Study on Intelligent Evaluation and Zoning of Geohazards Risk in Mianluening Area[J]. Northwestern Geology, 2023, 56(1): 276−283.

    [14]

    赵永辉. 雅鲁藏布江流域嘎贡沟巨型滑坡变形破坏模式及演化过程研究[J]. 防灾科技学院学报, 2019, 21(04): 1-7 doi: 10.3969/j.issn.1673-8047.2019.04.001

    ZHAO Yonghui. Deformation and Failure Model and Evolution Process of Giant Landslides in Gagong Valley in the Yarlung Zangbo River Basin[J]. Journal of Institute of Disaster Prevention, 2019, 21(04): 1-7. doi: 10.3969/j.issn.1673-8047.2019.04.001

    [15]

    赵永辉. 雅鲁藏布江公路滑坡发育特征及破坏机理研究[J]. 公路, 2021, 66(4): 6 -10.

    ZHAO Yonghui. Research on Development Characteristics and Failure Process of Highway Landslide along the Yarlung Zangbo River[J]. Highway, 2021, 66(4): 6-10.

    [16]

    周硼焜, 张洪波, 赵伟华, 等. 基于Massflow的西南山区某大型岩质滑坡-碎屑流运动模拟研究[J]. 成都理工大学学报(自然科学版), 2023, 50(3): 361−368.

    ZHOU Pengkun, ZHANG Hongbo, ZHAO Weihua, et al. Study of a large-scale rock landslide-debris flow in the southwest mountainous region of China based on Massflow numerical simulation[J]. Journal of Chengdu University of Technology (Science & Technology Edition), 2023, 50(3): 361−368.

    [17]

    Alsahaf A, Azzopardi G, Ducro B, et al. Predicting Slaughter Weight in Pigs with Regression Tree Ensembles[C]. APPIS, 2018: 1−9.

    [18]

    Arabameri A, Pradhan B, Rezaei K, et al. Assessment of landslide susceptibility using statistical-and artificial intelligence-based FR–RF integrated model and multiresolution DEMs[J]. Remote Sensing, 2019, 11(9): 999. doi: 10.3390/rs11090999

    [19]

    Batar A K, Watanabe T. Landslide susceptibility mapping and assessment using geospatial platforms and weights of evidence (WoE) method in the Indian Himalayan region: Recent developments, gaps, and future directions[J]. ISPRS International Journal of Geo-Information, 2021, 10(3): 114. doi: 10.3390/ijgi10030114

    [20]

    Disha R A, Waheed S. Performance analysis of machine learning models for intrusion detection system using Gini Impurity-based Weighted Random Forest (GIWRF) feature selection technique[J]. Cybersecurity, 2022, 5(1): 1-22. doi: 10.1186/s42400-021-00103-8

    [21]

    Hong H, Liu J, Bui D T, et al. Landslide susceptibility mapping using J48 Decision Tree with AdaBoost, Bagging and Rotation Forest ensembles in the Guangchang area (China)[J]. Catena, 2018, 163: 399-413. doi: 10.1016/j.catena.2018.01.005

    [22]

    Inan M S K, Ulfath R E, Alam F I, et al. Improved sampling and feature selection to support extreme gradient boosting for PCOS diagnosis[C]. 2021 IEEE 11th Annual Computing and Communication Workshop and Conference (CCWC), IEEE, 2021: 1046−1050.

    [23]

    Khan H, Shafique M, Khan M A, et al. Landslide susceptibility assessment using Frequency Ratio, a case study of northern Pakistan[J]. The Egyptian Journal of Remote Sensing and Space Science, 2019, 22(1): 11-24. doi: 10.1016/j.ejrs.2018.03.004

    [24]

    Kouhartsiouk D, Perdikou S. The application of DInSAR and Bayesian statistics for the assessment of landslide susceptibility[J]. Natural Hazards, 2021, 105(3): 2957-2985. doi: 10.1007/s11069-020-04433-7

    [25]

    Lee D H, Kim Y T, Lee S R. Shallow landslide susceptibility models based on artificial neural networks considering the factor selection method and various non-linear activation functions[J]. Remote Sensing, 2020, 12(7): 1194. doi: 10.3390/rs12071194

    [26]

    Polykretis C, Chalkias C. Comparison and evaluation of landslide susceptibility maps obtained from weight of evidence, logistic regression, and artificial neural network models[J]. Natural hazards, 2018, 93(1): 249-274. doi: 10.1007/s11069-018-3299-7

    [27]

    Rehman A, Song J, Haq F, et al. Multi-Hazard Susceptibility Assessment Using the Analytical Hierarchy Process and Frequency Ratio Techniques in the Northwest Himalayas, Pakistan[J]. Remote Sensing, 2022, 14(3): 554. doi: 10.3390/rs14030554

    [28]

    Taalab K, Cheng T, Zhang Y. Mapping landslide susceptibility and types using Random Forest[J]. Big Earth Data, 2018, 2(2): 159-178. doi: 10.1080/20964471.2018.1472392

    [29]

    Tanyas H, Rossi M, Alvioli M, et al. A global slope unit-based method for the near real-time prediction of earthquake-induced landslides[J]. Geomorphology, 2019, 327: 126-146. doi: 10.1016/j.geomorph.2018.10.022

    [30]

    Tien Bui D, Shahabi H, Shirzadi A, et al. Landslide detection and susceptibility mapping by airsar data using support vector machine and index of entropy models in cameron highlands, malaysia[J]. Remote Sensing, 2018, 10(10): 1527. doi: 10.3390/rs10101527

    [31]

    Zeng H, Yang C, Zhang H, et al. A lightGBM-based EEG analysis method for driver mental states classification[J]. Computational Intelligence and Neuroscience, 2019.

    [32]

    Zweifel L, Samarin M, Meusburger K, et al. Investigating causal factors of shallow landslides in grassland regions of Switzerland[J]. Natural Hazards and Earth System Sciences, 2021, 21(11): 3421-3437. doi: 10.5194/nhess-21-3421-2021

  • 加载中

(11)

(4)

计量
  • 文章访问数:  1119
  • PDF下载数:  47
  • 施引文献:  0
出版历程
收稿日期:  2022-10-17
修回日期:  2023-10-21
录用日期:  2023-10-26
刊出日期:  2024-02-20

目录