Decision tree based ensemble machine learning approaches for landslide susceptibility mapping

Hybrid ensemble machine learning approaches for landslide susceptibility mapping using different sampling ratios at East Sikkim Himalayan, India

Advances in Space Research ◽

10.1016/j.asr.2021.05.018 ◽

2021 ◽

Author(s):

Sunil Saha ◽

Jagabandhu Roy ◽

Biswajeet Pradhan ◽

Tusar Kanti Hembram

Keyword(s):

Machine Learning ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Learning Approaches ◽

Ensemble Machine Learning

Download Full-text

Comparison between Deep Learning and Tree-Based Machine Learning Approaches for Landslide Susceptibility Mapping

Water ◽

10.3390/w13192664 ◽

2021 ◽

Vol 13 (19) ◽

pp. 2664

Author(s):

Sunil Saha ◽

Jagabandhu Roy ◽

Tusar Kanti Hembram ◽

Biswajeet Pradhan ◽

Abhirup Dikshit ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Learning ◽

Landslide Susceptibility ◽

Learning Model ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Learning Approaches ◽

Statistical Measures ◽

Deep Learning Model

The efficiency of deep learning and tree-based machine learning approaches has gained immense popularity in various fields. One deep learning model viz. convolution neural network (CNN), artificial neural network (ANN) and four tree-based machine learning models, namely, alternative decision tree (ADTree), classification and regression tree (CART), functional tree and logistic model tree (LMT), were used for landslide susceptibility mapping in the East Sikkim Himalaya region of India, and the results were compared. Landslide areas were delimited and mapped as landslide inventory (LIM) after gathering information from historical records and periodic field investigations. In LIM, 91 landslides were plotted and classified into training (64 landslides) and testing (27 landslides) subsets randomly to train and validate the models. A total of 21 landslide conditioning factors (LCFs) were considered as model inputs, and the results of each model were categorised under five susceptibility classes. The receiver operating characteristics curve and 21 statistical measures were used to evaluate and prioritise the models. The CNN deep learning model achieved the priority rank 1 with area under the curve of 0.918 and 0.933 by using the training and testing data, quantifying 23.02% and 14.40% area as very high and highly susceptible followed by ANN, ADtree, CART, FTree and LMT models. This research might be useful in landslide studies, especially in locations with comparable geophysical and climatological characteristics, to aid in decision making for land use planning.

Download Full-text

Landslide Susceptibility Mapping Using the Stacking Ensemble Machine Learning Method in Lushui, Southwest China

Applied Sciences ◽

10.3390/app10114016 ◽

2020 ◽

Vol 10 (11) ◽

pp. 4016 ◽

Cited By ~ 3

Author(s):

Xudong Hu ◽

Han Zhang ◽

Hongbo Mei ◽

Dunhui Xiao ◽

Yuanyuan Li ◽

...

Keyword(s):

Machine Learning ◽

Landslide Susceptibility ◽

Southwest China ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Support Vector ◽

Machine Learning Method ◽

Learning Method ◽

Statistical Measures ◽

Ensemble Machine Learning

Landslide susceptibility mapping is considered to be a prerequisite for landslide prevention and mitigation. However, delineating the spatial occurrence pattern of the landslide remains a challenge. This study investigates the potential application of the stacking ensemble learning technique for landslide susceptibility assessment. In particular, support vector machine (SVM), artificial neural network (ANN), logical regression (LR), and naive Bayes (NB) were selected as base learners for the stacking ensemble method. The resampling scheme and Pearson’s correlation analysis were jointly used to evaluate the importance level of these base learners. A total of 388 landslides and 12 conditioning factors in the Lushui area (Southwest China) were used as the dataset to develop landslide modeling. The landslides were randomly separated into two parts, with 70% used for model training and 30% used for model validation. The models’ performance was evaluated using the area under the receiver operating characteristic (ROC) curve (AUC) and statistical measures. The results showed that the stacking-based ensemble model achieved an improved predictive accuracy as compared to the single algorithms, while the SVM-ANN-NB-LR (SANL) model, the SVM-ANN-NB (SAN) model, and the ANN-NB-LR (ANL) models performed equally well, with AUC values of 0.931, 0.940, and 0.932, respectively, for validation stage. The correlation coefficient between the LR and SVM was the highest for all resampling rounds, with a value of 0.72 on average. This connotes that LR and SVM played an almost equal role when the ensemble of SANL was applied for landslide susceptibility analysis. Therefore, it is feasible to use the SAN model or the ANL model for the study area. The finding from this study suggests that the stacking ensemble machine learning method is promising for landslide susceptibility mapping in the Lushui area and is capable of targeting areas prone to landslides.

Download Full-text

Factors Affecting Landslide Susceptibility Mapping: Assessing the Influence of Different Machine Learning Approaches, Sampling Strategies and Data Splitting

Land ◽

10.3390/land10090989 ◽

2021 ◽

Vol 10 (9) ◽

pp. 989

Author(s):

Minu Treesa Abraham ◽

Neelima Satyam ◽

Revuri Lokesh ◽

Biswajeet Pradhan ◽

Abdullah Alamri

Keyword(s):

Machine Learning ◽

Landslide Susceptibility ◽

Sampling Strategy ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Support Vector ◽

Susceptibility Map ◽

Learning Approaches ◽

Sampling Strategies ◽

Data Splitting

Data driven methods are widely used for the development of Landslide Susceptibility Mapping (LSM). The results of these methods are sensitive to different factors, such as the quality of input data, choice of algorithm, sampling strategies, and data splitting ratios. In this study, five different Machine Learning (ML) algorithms are used for LSM for the Wayanad district in Kerala, India, using two different sampling strategies and nine different train to test ratios in cross validation. The results show that Random Forest (RF), K Nearest Neighbors (KNN), and Support Vector Machine (SVM) algorithms provide better results than Naïve Bayes (NB) and Logistic Regression (LR) for the study area. NB and LR algorithms are less sensitive to the sampling strategy and data splitting, while the performance of the other three algorithms is considerably influenced by the sampling strategy. From the results, both the choice of algorithm and sampling strategy are critical in obtaining the best suited landslide susceptibility map for a region. The accuracies of KNN, RF, and SVM algorithms have increased by 10.51%, 10.02%, and 4.98% with the use of polygon landslide inventory data, while for NB and LR algorithms, the performance was slightly reduced with the use of polygon data. Thus, the sampling strategy and data splitting ratio are less consequential with NB and algorithms, while more data points provide better results for KNN, RF, and SVM algorithms.

Download Full-text

A Modelling Tool for Rainfall-triggered Landslide Susceptibility Mapping and Hazard Warning based on GIS and Machine Learning

IOP Conference Series Earth and Environmental Science ◽

10.1088/1755-1315/783/1/012074 ◽

2021 ◽

Vol 783 (1) ◽

pp. 012074

Author(s):

Haiwei Zhou ◽

Jianjun Yu ◽

Hangjian Feng ◽

Jie Huang

Keyword(s):

Machine Learning ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Hazard Warning

Download Full-text

Machine learning in earthquake- and typhoon-triggered landslide susceptibility mapping and critical factor identification

Environmental Earth Sciences ◽

10.1007/s12665-021-09510-z ◽

2021 ◽

Vol 80 (6) ◽

Author(s):

Muhammad Zeeshan Ali ◽

Hone-Jay Chu ◽

Yi-Chin Chen ◽

Saleem Ullah

Keyword(s):

Machine Learning ◽

Landslide Susceptibility ◽

Critical Factor ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping

Download Full-text

A comparative study of the bivariate, multivariate and machine-learning-based statistical models for landslide susceptibility mapping in a seismic-prone region in China

Arabian Journal of Geosciences ◽

10.1007/s12517-021-06630-5 ◽

2021 ◽

Vol 14 (6) ◽

Author(s):

Suhua Zhou ◽

Yunqiang Zhang ◽

Xin Tan ◽

Syed Muntazir Abbas

Keyword(s):

Machine Learning ◽

Comparative Study ◽

Statistical Models ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping

Download Full-text

A comparison among fuzzy multi-criteria decision making, bivariate, multivariate and machine learning models in landslide susceptibility mapping

Geomatics Natural Hazards and Risk ◽

10.1080/19475705.2021.1944330 ◽

2021 ◽

Vol 12 (1) ◽

pp. 1741-1777

Author(s):

Quoc Bao Pham ◽

Yacine Achour ◽

Sk Ajim Ali ◽

Farhana Parvin ◽

Matej Vojtek ◽

...

Keyword(s):

Machine Learning ◽

Decision Making ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Multi Criteria Decision Making ◽

Learning Models ◽

Machine Learning Models

Download Full-text

A Novel Hybrid Method for Landslide Susceptibility Mapping-Based GeoDetector and Machine Learning Cluster: A Case of Xiaojin County, China

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10020093 ◽

2021 ◽

Vol 10 (2) ◽

pp. 93

Author(s):

Wei Xie ◽

Xiaoshuang Li ◽

Wenbin Jian ◽

Yang Yang ◽

Hongwei Liu ◽

...

Keyword(s):

Machine Learning ◽

Hybrid Method ◽

Roc Curve ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Assessment Model ◽

Landslide Susceptibility Mapping ◽

Support Vector ◽

Susceptibility Map ◽

Area Index

Landslide susceptibility mapping (LSM) could be an effective way to prevent landslide hazards and mitigate losses. The choice of conditional factors is crucial to the results of LSM, and the selection of models also plays an important role. In this study, a hybrid method including GeoDetector and machine learning cluster was developed to provide a new perspective on how to address these two issues. We defined redundant factors by quantitatively analyzing the single impact and interactive impact of the factors, which was analyzed by GeoDetector, the effect of this step was examined using mean absolute error (MAE). The machine learning cluster contains four models (artificial neural network (ANN), Bayesian network (BN), logistic regression (LR), and support vector machines (SVM)) and automatically selects the best one for generating LSM. The receiver operating characteristic (ROC) curve, prediction accuracy, and the seed cell area index (SCAI) methods were used to evaluate these methods. The results show that the SVM model had the best performance in the machine learning cluster with the area under the ROC curve of 0.928 and with an accuracy of 83.86%. Therefore, SVM was chosen as the assessment model to map the landslide susceptibility of the study area. The landslide susceptibility map demonstrated fit with landslide inventory, indicated the hybrid method is effective in screening landslide influences and assessing landslide susceptibility.

Download Full-text

Landslide susceptibility mapping based on convolutional neural network and conventional machine learning methods

10.21203/rs.3.rs-190195/v1 ◽

2021 ◽

Author(s):

Rui Liu ◽

Xin Yang ◽

Chong Xu ◽

Luyao Li ◽

Xiangqiang Zeng

Keyword(s):

Neural Network ◽

Machine Learning ◽

Convolutional Neural Network ◽

Landslide Susceptibility ◽

Susceptibility Mapping ◽

Landslide Susceptibility Mapping ◽

Support Vector ◽

Learning Methods ◽

Machine Learning Methods ◽

Conventional Machine

Abstract Landslide susceptibility mapping (LSM) is a useful tool to estimate the probability of landslide occurrence, providing a scientific basis for natural hazards prevention, land use planning, and economic development in landslide-prone areas. To date, a large number of machine learning methods have been applied to LSM, and recently the advanced Convolutional Neural Network (CNN) has been gradually adopted to enhance the prediction accuracy of LSM. The objective of this study is to introduce a CNN based model in LSM and systematically compare its overall performance with the conventional machine learning models of random forest, logistic regression, and support vector machine. Herein, we selected the Jiuzhaigou region in Sichuan Province, China as the study area. A total number of 710 landslides and 12 predisposing factors were stacked to form spatial datasets for LSM. The ROC analysis and several statistical metrics, such as accuracy, root mean square error (RMSE), Kappa coefficient, sensitivity, and specificity were used to evaluate the performance of the models in the training and validation datasets. Finally, the trained models were calculated and the landslide susceptibility zones were mapped. Results suggest that both CNN and conventional machine-learning based models have a satisfactory performance (AUC: 85.72% − 90.17%). The CNN based model exhibits excellent good-of-fit and prediction capability, and achieves the highest performance (AUC: 90.17%) but also significantly reduces the salt-of-pepper effect, which indicates its great potential of application to LSM.

Download Full-text