Abstract
Landslide susceptibility assessment is crucial to the development of appropriate strategies to mitigate the risk of landslide fatalities and economic losses. The selection of spatial extent for non-landslide samples has an important role in the statistical-based landslide susceptibility modelling (LSM). In this study, two different non-landslides sampling areas (the entire area and the mountainous area of Anhui Province, China) were designed to explore the influences of the different spatial extent for non-landslides sampling on LSM. Six categories of influencing factors including climatic, morphological, geological, hydrological, vegetation, and human activities were considered. The dominant influencing factors that are more closely related to the distribution of historical landslides were selected based on the GeoDetector. The landslide inventory samples and the non-landslide samples generated on two selected areas were divided into a training set (70%) and a validation set (30%) for establishing the entire area LR model (EaeraLR) and the mountainous area LR model (MareaLR) based on the logistic regression (LR) model. The performance of the models was evaluated by the confusion matrix and the area under the receiver operating characteristic curve (AUROC). The results showed that the EareaLR model outperformed the MareaLR model by various evaluation metrics and the appearance of the final landslide susceptibility map. Hence, we conclude that the potential influence of the spatial extent of the non-landslide sample selection needs to be taken into account while comparing the reliability of different data-driven landslide susceptibility models.
Similar content being viewed by others
References
Brenning A (2012) Spatial cross-validation and bootstrap for the assessment of prediction rules in remote sensing: the R package sperrorest. In: 2012 IEEE international geoscience and remote sensing symposium, pp 5372-5375. https://doi.org/10.1109/IGARSS.2012.6352393
Brenning A, Schwinn M, Ruiz-Páez AP, Muenchow J (2015) Landslide susceptibility near highways is increased by 1 order of magnitude in the andes of southern ecuador, loja province. Nat Hazards Earth Syst Sci 15(1):45–57. https://doi.org/10.5194/nhess-15-45-2015
Choi J, Oh HJ, Lee HJ, Lee C, Lee S (2012) Combining landslide susceptibility maps obtained from frequency ratio, logistic regression, and artificial neural network models using ASTER images and GIS. Eng Geol 124:12–23. https://doi.org/10.1016/j.enggeo.2011.09.011
Chung CJF, Fabbri AG (2003) Validation of spatial prediction models for landslide hazard map**. Nat Hazards 30(3):451–472. https://doi.org/10.1023/B:NHAZ.0000007172.62651.2b
Devkota KC, Regmi AD, Pourghasemi HR, Yoshida K, Pradhan B, Ryu IC, Dhital MR, Althuwaynee OF (2013) Landslide susceptibility map** using certainty factor, index of entropy and logistic regression models in GIS and their comparison at Mugling–Narayanghat road section in Nepal Himalaya. Nat Hazards 65(1):135–165. https://doi.org/10.1007/s11069-012-0347-6
Dou J, Yunus AP, Merghadi A, Shirzadi A, Nguyen H, Hussain Y, Avtar R, Chen Y, Pham BT, Yamagishi H (2020) Different sampling strategies for predicting landslide susceptibilities are deemed less consequential with deep learning. Sci Total Environ 720:137320. https://doi.org/10.1016/j.scitotenv.2020.137320
Erener A, Sivas AA, Selcuk-Kestel AS, Düzgün HS (2017) Analysis of training sample selection strategies for regression-based quantitative landslide susceptibility map** methods. Comput Geosci 104:62–74. https://doi.org/10.1016/j.cageo.2017.03.022
Frattini P, Crosta G, Carrara A (2010) Techniques for evaluating the performance of landslide susceptibility models. Eng Geol 111(1–4):62–72. https://doi.org/10.1016/j.enggeo.2009.12.004
Froude MJ, Petley DN (2018) Global fatal landslide occurrence from 2004 to 2016. Nat Hazards Earth Syst Sci 18(8):2161–2181. https://doi.org/10.5194/nhess-18-2161-2018
Gariano SL, Guzzetti F (2016) Landslides in a changing climate. Earth Sci Rev 162:227–252. https://doi.org/10.1016/j.earscirev.2016.08.011
Goetz JN, Brenning A, Petschko H, Leopold P (2015) Evaluating machine learning and statistical prediction techniques for landslide susceptibility modeling. Comput Geosci 81:1–11. https://doi.org/10.1016/j.cageo.2015.04.007
Guzzetti F, Carrara A, Cardinali M, Reichenbach P (1999) Landslide hazard evaluation: a review of current techniques and their application in a multi-scale study. Central Italy Geomorphology 31(1–4):181–216
Guzzetti F, Reichenbach P, Cardinali M, Galli M, Ardizzone F (2005) Probabilistic landslide hazard assessment at the basin scale. Geomorphology 72(1–4):272–299. https://doi.org/10.1016/j.geomorph.2005.06.002
Guzzetti F, Reichenbach P, Ardizzone F, Cardinali M, Galli M (2006) Estimating the quality of landslide susceptibility models. Geomorphology 81(1–2):166–184. https://doi.org/10.1016/j.geomorph.2006.04.007
Hartmann J, Moosdorf N (2012) Global lithological map database v1. 0
He H, Hu D, Sun Q, Zhu L, Liu Y (2019) A landslide susceptibility assessment method based on GIS technology and an AHP-weighted information content method: a case study of southern Anhui, China. ISPRS Int J Geo Inf 8(6):266. https://doi.org/10.3390/ijgi8060266
Hussin HY, Zumpano V, Reichenbach P, Sterlacchini S, Micu M, van Westen C, Bălteanu D (2016) Different landslide sampling strategies in a grid-based bi-variate statistical susceptibility model. Geomorphology 253:508–523. https://doi.org/10.1016/j.geomorph.2015.10.030
Iovine GG, Greco R, Gariano SL, Pellegrino AD, Terranova OG (2014) Shallow-landslide susceptibility in the Costa Viola mountain ridge (southern Calabria, Italy) with considerations on the role of causal factors. Nat Hazards 73(1):111–136. https://doi.org/10.1007/s11069-014-1129-0
Jebur MN, Pradhan B, Tehrany MS, He H, Hu D, Sun Q, Zhu L, Liu Y (2014) 2019. A landslide susceptibility assessment method based on GIS technology and an AHP-weighted information content method: a case study of southern Anhui, China. ISPRS Int J Geo Inf 8(6): 266. https://doi.org/10.3390/ijgi8060266
Kalantar B, Pradhan B, Naghibi SA, Motevalli A, Mansor S (2018) Assessment of the effects of training data selection on the landslide susceptibility map**: a comparison between support vector machine (SVM), logistic regression (LR) and artificial neural networks (ANN). Geomat Nat Hazards Risk 9(1):49–69. https://doi.org/10.1080/19475705.2017.1407368
Kavzoglu T, Sahin EK, Colkesen I (2014) Landslide susceptibility map** using GIS-based multi-criteria decision analysis, support vector machines, and logistic regression. Landslides 11(3):425–439. https://doi.org/10.1007/s10346-013-0391-7
Lin Q, Wang Y (2018) Spatial and temporal analysis of a fatal landslide inventory in China from 1950 to 2016. Landslides 15:2357–2372. https://doi.org/10.1007/s10346-018-1037-6
Lin L, Lin Q, Wang Y (2017a) Landslide susceptibility map** on a global scale using the method of logistic regression. Natural Hazards and Earth System Sciences 17(8):1411–1424. https://doi.org/10.5194/nhess-17-1411-2017
Lin Q, Wang Y, Liu T, Zhu Y, Sui Q (2017b) The vulnerability of people to landslides: a case study on the relationship between the casualties and volume of landslides in China. Int J Environ Res Public Health 14(2):212. https://doi.org/10.3390/ijerph14020212
Lin Q, Wang Y, Glade T, Zhang J, Zhang Y (2020) Assessing the spatiotemporal impact of climate change on event rainfall characteristics influencing landslide occurrences based on multiple GCM projections in China. Clim Change 162(2):761–779. https://doi.org/10.1007/s10584-020-02750-1
Lin Q, Lima P, Steger S, Glade T, Jiang T, Zhang J, Liu T, Wang Y (2021) National-scale data-driven rainfall induced landslide susceptibility map** for China by accounting for incomplete landslide data. Geosci Front 12(6):101248. https://doi.org/10.1016/j.gsf.2021.101248
Linke S, Lehner B, Dallaire CO, Ariwi J, Grill G, Anand M, Beames P, Burchard-levine V, Maxwell S, Moidu H, Tan F, Thieme M (2019) Global hydro-environmental sub-basin and river reach characteristics at high spatial resolution. Sci Data 6(1):1–15. https://doi.org/10.6084/m9.figshare.10059551
Ma M, Liu C, Zhao G, **e H, Jia P, Wang D, Wang H, Hong Y (2019) Flash flood risk analysis based on machine learning techniques in the Yunnan Province, China. Remote Sens 11(2):170. https://doi.org/10.3390/rs11020170
National Bureau of Statistics (2020) China statistical yearbook. China Statistics Press, Bei**g
Nefeslioglu HA, Gokceoglu C, Sonmez H (2008) An assessment on the use of logistic regression and artificial neural networks with different sampling strategies for the preparation of landslide susceptibility maps. Eng Geol 97(3–4):171–191. https://doi.org/10.1016/j.enggeo.2008.01.004
Nhu VH, Hoang ND, Nguyen H, Ngo PTT, Bui TT, Hoa PV, Samui P, Bui DT (2020) Effectiveness assessment of Keras based deep learning with different robust optimization algorithms for shallow landslide susceptibility map** at tropical area. Catena 188:104458. https://doi.org/10.1016/j.catena.2020.104458
Peng W, Hongwei D (2020) Study on flood hazard risk zoning based on GIS and logistic regression model. Adv Earth Sci 35(10):1064–1072. https://doi.org/10.11867/j.issn.1001-8166.2020.082
Petley D (2012) Global patterns of loss of life from landslides. Geology 40(10):927–930. https://doi.org/10.1130/G33217.1
Pourghasemi HR, Kornejady A, Kerle N, Shabani F (2020) Investigating the effects of different landslide positioning techniques, landslide partitioning approaches, and presence-absence balances on landslide susceptibility map**. Catena 187:104364. https://doi.org/10.1016/j.catena.2019.104364
R Core Team (2020) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/
Regmi NR, Giardino JR, McDonald EV, Vitek JD (2014) A comparison of logistic regression-based models of susceptibility to landslides in western Colorado. USA Landslides 11(2):247–262. https://doi.org/10.1007/s10346-012-0380-2
Reichenbach P, Rossi M, Malamud DB, Mihir M, Guzzetti F (2018) A review of statistically-based landslide susceptibility models. Earth Sci Rev 180:60–91. https://doi.org/10.1016/j.earscirev.2018.03.001
Rong G, Li K, Han L, Alu S, Zhang J, Zhang Y (2020) Hazard map** of the rainfall–landslides disaster Chain based on GeoDetector and Bayesian network models in Shuicheng County, China. Water 12(9):2572. https://doi.org/10.3390/w12092572
Sameen MI, Pradhan B, Bui DT, Alamri AM (2020) Systematic sample subdividing strategy for training landslide susceptibility models. Catena 187:104358. https://doi.org/10.1016/j.catena.2019.104358
Shao X, Ma S, Xu C, Zhou Q (2020) Effects of sampling intensity and non-slide/slide sample ratio on the occurrence probability of coseismic landslides. Geomorphology 363:107222. https://doi.org/10.1016/j.geomorph.2020.107222
Song Y, Wang J, Ge Y, Xu C (2020) An optimal parameters-based geographical detector model enhances geographic characteristics of explanatory variables for spatial heterogeneity analysis: cases with different types of spatial data. GIScience Remote Sens 57(5):593–610. https://doi.org/10.1080/15481603.2020.1760434
Steger S, Brenning A, Bell R, Petschko H, Glade T (2016) Exploring discrepancies between quantitative validation results and the geomorphic plausibility of statistical landslide susceptibility maps. Geomorphology 262:8–23. https://doi.org/10.1016/j.geomorph.2016.03.015
Steger S, Glade T (2017) The challenge of “trivial areas” in statistical landslide susceptibility modelling. Workshop on World Landslide Forum. pp 803–808. https://doi.org/10.1007/978-3-319-53498-5_92
Steger S, Mair V, Kofler C, Pittore M, Zebisch M, Schneiderbauer S (2021) Correlation does not imply geomorphic causation in data-driven landslide susceptibility modelling–Benefits of exploring landslide data collection effects. Sci Total Environ 776:145935. https://doi.org/10.1016/j.scitotenv.2021.145935
Sun D, Shi S, Wen H, Xu J, Zhou X, Wu J (2021) A hybrid optimization method of factor screening predicated on GeoDetector and Random Forest for Landslide Susceptibility Map**. Geomorphology 379:107623. https://doi.org/10.1016/j.geomorph.2021.107623
USGS (2015) Shuttle Radar Topography Mission (SRTM) 1 Arc-Second Global. US Geological Survey. https://doi.org/10.5066/F7PR7TFT
Varnes DJ (1984) Landslilde hazard zonation: a review of principles and practice. United Nations Educational, Scientific and Cultural Organization, Paris
Wang JF, Xu CD (2017) Geodetector: principle and prospective. Acta Geogr Sin 72(1):116–134. https://doi.org/10.11821/dlxb201701010
Wang JF, Li XH, Christakos G, Liao YL, Zhang T, Gu X, Zheng XY (2010) Geographical detectors-based health risk assessment and its application in the neural tube defects study of the Heshun Region, China. Int J Geogr Inf Sci 24(1):107–127. https://doi.org/10.1080/13658810802443457
Wang Y, Song C, Lin Q, Li J (2016) Occurrence probability assessment of earthquake-triggered landslides with Newmark displacement values and logistic regression: the Wenchuan earthquake, China. Geomorphology 258:108–119. https://doi.org/10.1016/j.geomorph.2016.01.004
Wang Y, Lin Q, Shi P (2018) Spatial pattern and influencing factors of landslide casualty events. J Geog Sci 28(3):259–274. https://doi.org/10.6038/cjg20130406
Wu J, Gao XJ (2013) A gridded daily observation dataset over China region and comparison with the other datasets. Chin J Geophys 56(4):1102–1111. https://doi.org/10.6038/cjg20130406
**ao T, Segoni S, Chen L, Yin K, Casagli N (2020) A step beyond landslide susceptibility maps: a simple method to investigate and explain the different outcomes obtained by different approaches. Landslides 17(3):627–640. https://doi.org/10.1007/s10346-019-01299-0
** City. China Nat Hazards. https://doi.org/10.1007/s11069-021-04862-y
**-based geodetector and machine learning cluster: a case of **ao** County, China. ISPRS Int J Geo Inf 10(2):93. https://doi.org/10.3390/ijgi10020093
**ong J, Li J, Cheng W, Zhou C, Guo L, Zhang X, Wang N, Li W (2019) Spatial-temporal distribution and the influencing factors of mountain flood disaster in southwest China. Acta Geogr Sin 74:1374–1391. https://doi.org/10.11821/dlxb201907008
Yang J, Song C, Yang Y, Xu C, Guo F, ** supported by spatial logistic regression and GeoDetector: a case study of Duwen Highway Basin, Sichuan Province, China. Geomorphology 324:62–71. https://doi.org/10.1016/j.geomorph.2018.09.019
Yi Y, Zhang Z, Zhang W, Jia H, Zhang J (2020) Landslide susceptibility map** using multiscale sampling strategy and convolutional neural network: a case study in Jiuzhaigou region. Catena 195:104851. https://doi.org/10.1016/j.catena.2020.104851
Zhang Y, Wang Y, Chen Y, Xu Y, Zhang G, Lin Q, Luo R (2021) Projection of changes in flash flood occurrence under climate change at tourist attractions. J Hydrol 595:126039. https://doi.org/10.1016/j.jhydrol.2021.126039
Zhao G, Pang B, Xu Z, Yue J, Tu T (2018) Map** flood susceptibility in mountainous areas on a national scale in China. Sci Total Environ 615:1133–1142. https://doi.org/10.1016/j.scitotenv.2017.10.037
Zhou X, Wen H, Zhang Y, Xu J, Zhang W (2021) Landslide susceptibility map** using hybrid random forest with GeoDetector and RFE for factor optimization. Geosci Front 12(5):101211. https://doi.org/10.1016/j.gsf.2021.101211
Zhu AX, Miao Y, Liu J, Bai S, Zeng C, Ma T, Hong H (2019) A similarity-based approach to sampling absence data for landslide susceptibility map** using data-driven methods. Catena 183:104188. https://doi.org/10.1016/j.catena.2019.104188
Funding
This research was supported by the Second Tibetan Plateau Scientific Expedition and Research Program (Grant Nos. 2019QZKK0906, 2019QZKK0606), the Project from Anhui Climate Center “Risk assessment technique of meteorological disaster in Anhui Province”, the Startup Foundation for Introducing Talent of NUIST.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
Information provided in this manuscript does not have any conflict or competing interests.
Consent for publication:
All authors consent to participate in this publication.
Consent to participate
All authors consent to participate in this research.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Wang, C., Lin, Q., Wang, L. et al. The influences of the spatial extent selection for non-landslide samples on statistical-based landslide susceptibility modelling: a case study of Anhui Province in China. Nat Hazards 112, 1967–1988 (2022). https://doi.org/10.1007/s11069-022-05252-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11069-022-05252-8