Introduction

Landslides, which can be triggered by events such as rainfall, earthquakes and freeze–thaw cycles, occur frequently in mountainous areas1,2,3,4. Climate change is driving glacier retreat, permafrost degradation and lake shrinkage in high-altitude regions5,6,7, which often leads to slope instability and an increase in the occurrence of deep-seated landslides8.

Climate change has an indirect effect on landslides occurring at high altitudes by degrading permafrost9 and melting glaciers10,11, which may increase the magnitude and frequency of landslides12. Sediments, which tend to accumulate at the receding edge of a glacier, may become unstable as the glacier recedes13. Glacial melting can also change precipitation patterns22. Some scholars have conducted field surveys in the HMA and compiled landslide inventories23. However, these databases consist mostly of historical data without detailed and reliable timestamps. Remote sensing offers the possibility of addressing this data gap by building detailed time series of landslide inventories.

Landslides can be detected in satellite image series from changes in land cover (vegetation distribution, rock outcropping exposure or soil degradation)15,24. Remote sensing satellite data with high resolution, an extensive archive of historical acquisitions, and a generally vertical field of view that provides an almost planimetric perspective are particularly valuable for generating time-series data. Sentinel and Landsat have both been used by scholars to study landslides25,26. Although the spatial resolution of Landsat is slightly coarser than that of Sentinel, Landsat provides a longer observation record. To avoid biasing our time series toward recent years, in which more landslides would be identified from the higher-resolution imagery, we use only Landsat imagery.

For our study, Landsat imagery was obtained from the U.S. Geological Survey (USGS) EarthExplorer website. Images were obtained for each year for the period from 1998 to 2018. The Landsat data include images from the Thematic Mapper (TM), Enhanced Thematic Mapper Plus (ETM+) and Operational Land Imager (OLI). The spatial resolution of these data is 30 m. The entire study area is covered by 17 Landsat path/row scenes. For each scene, cloud-free images from the end of summer (September through October) were selected. Additional images were selected from the winter season of each year so that each year's imagery includes one image from the beginning or end of the year.

Table 1 Earthquakes that potentially triggered landslides in the study area.

Landslides were manually identified by photo-interpretation of the scenes and by comparing imagery between dates. Landslide scars and deposits can be identified on Landsat imagery from contrasts in landform and land cover with the surrounding areas, and confirmed by the lack of these distinctive features in the summer image of the previous year. The multi-temporal database was used to constrain the date of each landslide, which was labeled with the year in which the landslide is first visible. The landslide polygons were digitized in ENVI software. SRTM3 digital elevation data were used to calculate the aspect and slope of each landslide.

In mapping landslides, we tried to map the entire affected area of each landslide. However, the failed areas of some landslides were not clearly visible because of shadows or a lack of contrast between ground disturbed by the landslide and undisturbed snow/ice or bare rock. As a result, a mapped landslide area may include both the failure area and the runout area, or only the runout area or the accumulation area.

We classified the types of landslides according to the criteria of Varnes (1998)27. Due to the limitation of Landsat image resolution, it is difficult to distinguish the type of movement. Therefore, we also used higher spatial resolution imagery available in Google Earth to supplement our Landsat interpretation. The landslide material types were rock, debris and moraine, as illustrated in Fig. 1.

Figure 1

Comparison of three types of landslide source area. (A–C) show landslides of the rock, debris, and moraine material types (Figures created using ArcGIS 10.2 https://www.esri.com/en-us/arcgis/products/arcgis-desktop).

Because we focus on the possible effects of glacier retreat on landslides, we removed from our inventory landslides that could potentially have been triggered by earthquakes and that were more than 50 km from glaciers. Keefer28 proposed a typical maximum distance from the epicenter for the occurrence of seismically induced landslides; this distance varies as a function of earthquake magnitude (Fig. 2). We therefore used the USGS earthquake hazards website to identify all major earthquakes near or within the study area during the 20-year interval studied, and then removed from our inventory any landslide that could have occurred at the time of an earthquake and was located within the distance threshold. We used the threshold for disrupted landslides because this type is the most sensitive to earthquakes, meaning that an earthquake can trigger such landslides at the greatest distances. We found four earthquakes (Table 1) and four associated landslides that met the magnitude and distance criteria shown in Fig. 2. These landslides were removed from the inventory.
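The earthquake screening step can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the record fields (`year`, `lat`, `lon`, `mag`) and the function names are hypothetical, the distance threshold is passed in as a caller-supplied function rather than taken from Keefer's curve, and matching on calendar year is a simplification of "could have occurred at the time of the earthquake".

```python
import math


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km on a sphere of radius 6371 km."""
    radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * radius_km * math.asin(math.sqrt(a))


def screen_landslides(landslides, earthquakes, max_distance_km):
    """Return the landslides that cannot be attributed to any listed earthquake.

    landslides: dicts with hypothetical keys "year", "lat", "lon".
    earthquakes: dicts with hypothetical keys "year", "lat", "lon", "mag".
    max_distance_km: caller-supplied function mapping magnitude to a
        maximum epicentral distance in km (an assumption of this sketch).
    """
    kept = []
    for ls in landslides:
        possibly_seismic = any(
            ls["year"] == eq["year"]
            and haversine_km(ls["lat"], ls["lon"], eq["lat"], eq["lon"])
            <= max_distance_km(eq["mag"])
            for eq in earthquakes
        )
        if not possibly_seismic:
            kept.append(ls)
    return kept
```

Passing the threshold as a function keeps the sketch independent of any particular magnitude–distance curve; values read from Fig. 2 could be supplied through a small lookup or interpolation.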

Figure 2

Variation in earthquake epicentral distance from disrupted landslides for earthquakes of varying magnitudes. The extent indicated by the black arrow is the distance range of disrupted landslides caused by the earthquake.

Glacier mapping

The second major dataset that we generated for this study is a 20-year time series of maps of glacial extent for the study area. As with the landslide inventory, we used Landsat remotely sensed data. The Landsat sensors TM, ETM+ and OLI capture the distinctive spectral reflectance of snow and ice in the VNIR (visible and near-infrared) and SWIR (shortwave infrared) bands, which is used to separate snow and ice from the surrounding terrain. Typical Landsat spectral measures include spectral band ratios such as TM3/TM5 and TM4/TM5, and the NDSI (normalized difference snow index). In this study, we used the Landsat NDSI to obtain glacial boundaries. The NDSI is defined as:

$$NDSI = \frac{{\left( {{\text{Green}} - SWIR} \right)}}{{\left( {{\text{Green}} + SWIR} \right)}}$$
(1)

where Green and SWIR represent the values of the green and shortwave infrared bands for Landsat series images. We set 0.4 as the extraction threshold, which is commonly used by scholars to extract ice from remote sensing images29.
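As a minimal sketch of Eq. (1) and the 0.4 threshold, the following Python computes the NDSI per pixel and builds a boolean ice mask. The function names, the nested-list band layout, and the choice to treat values equal to the threshold as ice are assumptions made for illustration; real workflows would operate on raster arrays read from the Landsat scenes.

```python
def ndsi(green, swir):
    """Normalized difference snow index for one pixel (Eq. 1)."""
    total = green + swir
    if total == 0:
        return float("nan")  # undefined where both bands are zero
    return (green - swir) / total


def ice_mask(green_band, swir_band, threshold=0.4):
    """Boolean mask that is True where NDSI >= threshold.

    green_band and swir_band are equally sized lists of rows of
    reflectance values (a hypothetical layout for this sketch).
    Pixels with undefined NDSI evaluate to False.
    """
    return [
        [ndsi(g, s) >= threshold for g, s in zip(green_row, swir_row)]
        for green_row, swir_row in zip(green_band, swir_band)
    ]
```

The resulting mask corresponds only to the first, automatic classification step; the clipping and manual editing steps are not represented here.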

Spectral ratio methods are effective in delineating clean ice. However, seasonal snow, cloud and frozen lakes may be misclassified as glaciers and require manual editing of the classification. We therefore first classified ice using the NDSI. Then, we clipped our data based on the Randolph Glacier Inventory30 to reduce errors caused by snowfall and cloud. Finally, manual interpretation was used to remove some errors associated with glacial lakes and the edges of images.

Probability-area distribution of landslides

A number of complex natural phenomena exhibit power-law frequency–area relationships, including earthquakes, which are considered a classic example of such phenomena. Landslides are thought to be another natural hazard that exhibits power-law frequency–area relationships under a wide variety of circumstances31. Some scholars question whether small landslides follow the power-law distribution, because small landslides sometimes evolve into larger landslides. A further problem is that smaller landslides may be difficult to identify, or many individual small slides may appear to be one large slide32. The main focus of this paper is medium to large landslides, so these concerns are not directly relevant to our work.

The probability-size distribution of landslides in the study area is given by

$$p\left( {A_{L} } \right) = \frac{1}{{N_{LT} }}\frac{{\delta N_{L} }}{{\delta A_{L} }}$$
(2)

where \(p\left( {A_{L} } \right)\) is a probability density function, \(\delta N_{L}\) is the number of landslides with areas between \(A_{L}\) (km2) and \(A_{L} + \delta A_{L}\), the bin width \(\delta A_{L}\) (km2) is based on a log scale, and \(N_{LT}\) is the total number of landslides in an inventory. The frequency density of landslides, \(f\left( {A_{L} } \right)\), is given by:

$$f\left( {A_{L} } \right) = N_{LT} p\left( {A_{L} } \right)$$
(3)
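Eqs. (2) and (3) can be illustrated with a short Python sketch that bins landslide areas on a logarithmic scale and divides each count by the bin width. The function name, the dictionary keys, and the default of five bins per decade are assumptions for illustration and are not taken from the study.

```python
import math


def log_binned_density(areas_km2, bins_per_decade=5):
    """Estimate p(A_L) (Eq. 2) and f(A_L) (Eq. 3) with log-spaced bins.

    Returns one dict per non-empty bin with the bin edges in km2,
    the landslide count, the probability density p and the
    frequency density f = N_LT * p.
    """
    if not areas_km2 or any(a <= 0 for a in areas_km2):
        raise ValueError("areas must be a non-empty list of positive values")
    n_total = len(areas_km2)
    counts = {}
    for area in areas_km2:
        # Index of the log-spaced bin that contains this area.
        k = math.floor(math.log10(area) * bins_per_decade)
        counts[k] = counts.get(k, 0) + 1
    rows = []
    for k in sorted(counts):
        left = 10 ** (k / bins_per_decade)
        right = 10 ** ((k + 1) / bins_per_decade)
        p = counts[k] / (n_total * (right - left))
        rows.append({"left": left, "right": right, "count": counts[k],
                     "p": p, "f": n_total * p})
    return rows
```

Because each count is divided by the linear width of its bin, the densities integrate to one over the occupied bins even though the bins themselves grow geometrically.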

The frequency-size distribution of landslides in the study area was compared with those for various event magnitudes proposed by Malamud et al. (2004)33 to assess the nature of a landslide event. Malamud et al. describe the probability density of landslides with area \(A_{L}\) (km2) using a three-parameter inverse-gamma distribution:

$$p\left( {A_{L} :\rho ,a,s} \right) = \frac{1}{a\Gamma \left( \rho \right)}\left[ {\frac{a}{{A_{L} - s}}} \right]^{\rho + 1} \exp\left[ { - \frac{a}{{A_{L} - s}}} \right]$$
(4)

where \(\rho\) is a parameter controlling the power-law decay for medium and large landslides, \(a\) (km2) is a parameter controlling the location of the maximum probability distribution, \(s\) (km2) is a parameter controlling the exponential rollover for small landslides, and \(\Gamma \left( \rho \right)\) is the gamma function of \(\rho\). Malamud et al.33 also proposed a magnitude scale for a landslide event, \(m_{L}\), as follows:

$$m_{L} = \log_{10} \left( {N_{LT} } \right)$$
(5)

The combination of Eqs. (2)–(5) provides the frequency density of landslides linked to the magnitude scale of a landslide event.
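Eqs. (4) and (5) translate directly into code. The sketch below evaluates the three-parameter inverse-gamma density and the landslide-event magnitude using only the Python standard library; the function names are illustrative, and no parameter values from the study are assumed.

```python
import math


def inverse_gamma_pdf(area_km2, rho, a, s):
    """Probability density p(A_L; rho, a, s) from Eq. (4).

    rho controls the power-law decay, a (km2) the position of the
    maximum, and s (km2) the rollover for small landslides.
    """
    shifted = area_km2 - s
    if shifted <= 0:
        raise ValueError("area_km2 must be greater than s")
    x = a / shifted
    return x ** (rho + 1) * math.exp(-x) / (a * math.gamma(rho))


def landslide_magnitude(n_total):
    """Landslide-event magnitude m_L = log10(N_LT) from Eq. (5)."""
    return math.log10(n_total)
```

For areas much larger than `a`, the exponential term approaches one and the density decays as a power law with exponent -(rho + 1), which is why rho is read as the decay factor for medium and large landslides. Applied to the 127 landslides in this inventory, Eq. (5) gives a magnitude of about 2.1.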

For our study, the power-law distribution was fitted using the R language.

Results and discussion

Factors influencing identification of landslides

Landslides can be identified in alpine and glacial regions by (1) high contrast compared to surrounding snow and ice, (2) the influence of the landslide on a river, and (3) lobate forms typical of rock-avalanche deposits15,34. Local variations in tone, texture or pattern, and the presence of lineaments can also be used to infer slope instabilities35. However, several factors limit our ability to identify a given landslide. These factors include landslide characteristics, snow cover, and the quality of the remote sensing images.

A landslide is a complex movement process that comprises a failure area, a transport area and an accumulation area. Because of image quality (resolution, clouds, etc.) and the geological characteristics of the alpine region (glacier development, lack of vegetation), it is difficult to distinguish slides that did not change the optical characteristics of the image. These include small-area debris flows and moraine failures, and rock avalanches that did not form a significant accumulation. In our identification process, we found that the influence of a landslide on a river aids identification.

The main external factor is snow cover. In our experience, even if a landslide occurs in winter and is covered with snow (Fig. 3A,B), it can be seen after the snow melts, provided that its area or accumulation is large (Fig. 3C). Coe et al.15 noted that the accumulation of landslides greater than 0.5 km2 would be obvious.

Figure 3

Comparison of landslides before and after being covered by snow. (A) is the scene before the landslide failed, (B) is the scene after the landslide failed with snow cover, (C) is the scene after the landslide failed without snow cover (Figures created using ArcGIS 10.2 https://www.esri.com/en-us/arcgis/products/arcgis-desktop).

Image-quality problems in the Landsat series arise mainly from cloud cover and seasonal snow cover. We tried to ensure a cloud-free, late-summer image for every year to identify landslides, but sometimes both conditions could not be met. Hence, we selected multiple images from the same year to identify landslides. In particular, Landsat 7 images contain black stripe errors after the sensor's scan line corrector failed in 2003. We consider it inevitable that we will miss some landslides that lie mostly or entirely within these stripes. At the same time, we are confident that large landslides can still be identified and dated even where they are affected by the stripes.

Activity of landslides and climate change in the HMA

A total of 127 landslides were detected in the Landsat images of the study area, covering the period from 1999 to 2018 (Fig. 4 and Supplementary Information). The landslides are mainly concentrated in the Karakoram Mountains, the eastern part of the Pamir Mountains, the western Himalayas and the area south of the Hindu Kush. Based on the source-area material, the landslides were divided into three categories: 72 rock landslides, 45 debris landslides and 10 moraine landslides. Table 2 and Fig. 5 summarize the landslide characteristics. The average area of the 127 landslides was approximately 58 ha, and an average of 6.35 landslides occurred per year. The average elevation of the landslides is 2,966 m, and the average slope is 10.88° at the center point of the failure area. Most of the mapped landslides were at elevations of more than 2,000 m above sea level (Fig. 5A). Landslide source areas most commonly had slope directions (aspects) between 90° and 270° (i.e., southwest- to southeast-facing slopes, Fig. 5B).

Figure 4

Study area. The gray rectangles represent the coverage of the Landsat images. Triangles, pentagons, and circles represent three types of landslide source area. The blue filled areas are glacier-covered areas (Figures created using ArcGIS 10.2 https://www.esri.com/en-us/arcgis/products/arcgis-desktop).

Table 2 Landslide inventory summary characteristics.

Figure 5

Variation in (A) elevations and (B) aspects of landslides derived from a 30-m DEM. Aspects are shown on a 360° rose diagram. Numbers on circles are counts within that range.

Rock landslides are widely distributed in the HMA and concentrated in the western Kunlun Mountains, although they also occur frequently in the Hindu Kush and the western Himalayas. Debris landslides are mainly distributed in the western Kunlun Mountains and the Hindu Kush, with additional occurrences in the Pamirs and western Himalayas. As expected, moraine landslides occur at higher elevations, including in the Karakoram Mountains, the western Himalayas, the Kunlun Mountains and the Pamirs, at an average elevation of 4,139 m. We also analyzed the distribution of large landslides, defined as landslides that affected areas greater than 2 km2. Three large landslides were found in the Pamirs, four in the western Himalayas and two in the Karakoram.

A graph of annual landslide area is shown in Fig. 6. The Thompson Tau method36 was used to find outliers at a significance level (alpha) of 0.01. The result indicates that the landslide areas in 2003, 2010 and 2016 are outliers, as shown in Fig. 6. After removing the outliers, a linear fit (R2 = 0.26) shows that the annual landslide area in the HMA is increasing (Fig. 6).
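The outlier screening can be sketched with the widely used modified Thompson tau procedure, which repeatedly tests the value farthest from the mean against a rejection threshold of tau times the sample standard deviation. This is an illustrative version that may differ in detail from the implementation used for Fig. 6. The function names are hypothetical, and because the Python standard library has no Student t quantile, the critical value is supplied by the caller, for example from a t table at alpha/2 with n - 2 degrees of freedom.

```python
import math
import statistics


def thompson_tau_threshold(n, t_crit):
    """Rejection threshold tau for n samples.

    t_crit is the two-tailed Student t critical value at the chosen
    alpha with n - 2 degrees of freedom (supplied by the caller).
    """
    return t_crit * (n - 1) / (math.sqrt(n) * math.sqrt(n - 2 + t_crit ** 2))


def thompson_tau_outliers(values, t_crit_for_df):
    """Iteratively remove outliers; return (outliers, remaining values).

    t_crit_for_df is a caller-supplied function mapping degrees of
    freedom to the critical t value (an assumption of this sketch).
    """
    data = list(values)
    outliers = []
    while len(data) >= 3:
        mean = statistics.mean(data)
        sd = statistics.stdev(data)
        if sd == 0:
            break
        # Only the single most extreme value is tested in each pass.
        suspect = max(data, key=lambda v: abs(v - mean))
        tau = thompson_tau_threshold(len(data), t_crit_for_df(len(data) - 2))
        if abs(suspect - mean) > tau * sd:
            outliers.append(suspect)
            data.remove(suspect)
        else:
            break
    return outliers, data
```

Testing one value per pass and recomputing the mean and standard deviation afterwards keeps a single extreme year from masking a second, smaller anomaly.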

Figure 6

The orange scatterplot shows the temporal distribution of landslide area; blue shaded areas are developing El Niño years38. Red dots are outliers based on Thompson’s Tau37. The black dotted line is a linear trend line of landslide area without outliers.

The area of landslides was greatest in 2016. The average annual temperature was unusually high during this year, and the glacial area had notably declined in the previous 3 years (Fig. 8E). To provide additional insight into the impact of climate change on landslides, we compared the annual landslide area data with annual El Niño data for the study years37. We found that the landslide area outliers tend to coincide with El Niño years. The one exception is 2006, an El Niño year with no associated peak in landslide area; 2006 was, however, associated with generally lower average temperatures (Fig. 8E). Overall, the time series of landslide area, and particularly the outliers, appears to show periodicity, and the trend in landslide area is increasing. Scientists have hypothesized that as El Niño becomes more frequent and stronger, alpine glaciers are also being affected38. Warmer temperatures caused by El Niño can accelerate the melting of glaciers, and the associated increase in rainfall also favors the occurrence of landslides.

Shifts in landslide frequency–area distribution

To investigate whether landslide characteristics have changed over the past 20 years, we divided the landslide data set into two subsets: landslides that occurred before 2009, and those that occurred in 2009 or later. Table 3 shows that both the number of landslides and the average area of landslides increased in the second 10-year period. The difference in the average slope of the landslides was not significant.

Table 3 Comparison of landslide inventory characteristics.

The power-law distribution of landslide data in the study area over the entire 20 years of the landslide inventory has a decay factor ρ of approximately 1.14, which is in the normal range of previous studies. Although there is some variability in the findings of previous research, most prior landslide data sets follow noncumulative power-law frequency statistics, and the range of ρ values32 is approximately 1.5 ± 0.5. However, it is important to note that previous studies have obtained different power-law distributions with different tail decay coefficients. Furthermore, the data sets used in these studies come from different environments, and the associated landslides are assumed to have been triggered by different factors, including rainfall, earthquakes and snowmelt.

The power-law distribution for the landslides separated into the two 10-year periods is shown in Fig. 7. Both periods show a distinct power-law distribution. The decay factor in the second 10-year period (ρ = 0.85, 95%CI: 0.70–1.01) is smaller than that of the first 10-year period (ρ = 1.32, 95%CI: 1.02–1.61). This indicates that the probability of large and medium-sized landslides in the second 10-year period is higher than in the first period. The confidence intervals of the two values of ρ do not overlap, and thus the difference is statistically significant.

Figure 7

Comparison of frequency–area distribution for the 10-year period before 2009 and for 2009 and later. (A) is the frequency–area distribution for the 10-year period before 2009 with the decay factor ρ (1.32, 95%CI: 1.02–1.61). (B) is the frequency–area distribution for 2009 and later with the decay factor ρ (0.85, 95%CI: 0.70–1.01).

Relationship between landslides and glaciation

For the entire study area, we used the Randolph Glacier Inventory outlines as a mask to minimize misclassification due to factors such as seasonal snow. However, seasonal snow can still lead to errors in the calculations. To evaluate the error associated with the glacier delineation, we compared automatic extraction of glacial area with a semi-automatic approach that scholars regard as relatively accurate39. The semi-automatic method uses the ratio threshold method to generate approximate results, which are then modified by hand through visual interpretation by an expert interpreter. We selected data from 3 regions for data verification (Table 4). The calculated difference is less than 10%, and the average difference is about ± 6.3%. For our study, we consider this error in the glacier area results acceptable.

Table 4 Comparison of frequency–area distribution characteristics based on semi-automatic and automatic glacial mapping.

The glacier area in the HMA has shown a downward trend over the past 20 years. Before 2005, the glacier area decreased and increased repeatedly, which may be related to the monsoon climate. After 2005, the area of glaciers decreased significantly. The trend of glacial degradation over the past decade was significantly stronger than that in the previous decade.

For a comparison of the glacial extent between years, we chose the remote sensing imagery from 2002 as the reference because the acquisition times of the images in the 2002 mosaic were close together. We selected the glacier area in four subsequent years (2006, 2009, 2013 and 2017) to compare with the reference and analyze the change in glaciers in the study area (Fig. 8A–D). We found that glacial retreat was extensive everywhere except in the Karakoram Mountains, which exhibited only small areas of retreat over the past 2 decades. This Karakoram anomaly has been attributed to the relatively high elevation of the range40. In addition, climate anomalies and the strong summer monsoon climate in the region have brought additional moisture to the Karakoram Mountains, leading to increased snow41.

Figure 8

Comparison of glacial area in the study region for selected years. Panels (A–D) compare glacial extent in 2002 with that in four other years (2006, 2009, 2013 and 2017, respectively). In each case, red indicates glacial extent lost during the time indicated, and blue indicates glaciers present at both times. Panel E summarizes the overall trends in glacial area (GA) and mean annual temperature (MAT) during the 20 years studied (Figures created using ArcGIS 10.2 https://www.esri.com/en-us/arcgis/products/arcgis-desktop).

Figure 8E summarizes the total glacial area over the 20-year period studied. The figure also shows the mean annual temperature (MAT), averaged over the region. MAT was calculated based on the MODIS surface temperature product MOD11A2, which has a spatial resolution of 1000 m and a temporal resolution of 8 days.

The HMA glacial area has had a downward trend over the past 20 years (Fig. 8E). Looking at the record in more detail, the glacial area decreased prior to 2005 and then increased, possibly in response to variations in the region’s monsoon climate. After 2012, the area of glaciers decreased notably. The glacial area showed little to no overall change in the first decade and a notable decline in the second decade. The temperature record does not show evidence of warming over the period 2000 to 2018. The year 2000 was unusually warm compared to the subsequent 18 years. Even excluding 2000, there is no clear evidence of a warming trend over the remaining years.

However, glacial area can be assumed to be an environmental proxy that integrates climate over time and space. Glacial area could therefore be a useful predictor of slope stability. Glacier area does indeed show a negative association with landslide area (Fig. 9), indicating landslide area is increasing as glacial area declines. Glacial retreat is an indicator of climate change, and our results support the hypothesis that warming of the region is associated with an increase in landslide occurrence. However, the small number of samples and the scatter indicate notable uncertainty, and the association therefore requires further research.
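The association between glacier area and landslide area can be quantified with an ordinary least-squares fit on natural-log-transformed values, the transformation used for the axes of Fig. 9. The sketch below is a plain-Python illustration; the function name is hypothetical, and it assumes every year has a strictly positive landslide area, since the logarithm is otherwise undefined.

```python
import math


def log_log_fit(glacier_area, landslide_area):
    """Least-squares fit of ln(LA) against ln(GA).

    Returns (slope, intercept, r), where r is the Pearson
    correlation coefficient of the log-transformed values.
    """
    if len(glacier_area) != len(landslide_area) or len(glacier_area) < 2:
        raise ValueError("need two equally long series with at least 2 points")
    x = [math.log(g) for g in glacier_area]
    y = [math.log(a) for a in landslide_area]
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)
    return slope, intercept, r
```

A negative slope together with a negative r would correspond to the pattern described above, in which landslide area rises as glacier area falls; with roughly 20 annual points, the uncertainty on both values remains large.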

Figure 9

Graph of landslide area and glacier area. GA represents glacier area, and LA represents landslide area, and ln represents the natural log. The black dotted line is the linear trend line.

Conclusion

In this work, we generated an inventory of 127 landslides in the HMA using Landsat data covering the 2 decades between 1999 and 2018. Annual glacial area maps for the same 20 years were also generated. In addition to the impact of image resolution, the rock surface and ice cover make landslide identification difficult in the HMA, which may lead to the number of landslides being underestimated. Nevertheless, the landslide and glacial area maps provide detailed spatial and temporal information. Based on our custom inventories, we studied the interaction between landslides and glaciers in the study area. The following major conclusions can be drawn from this work:

  1. In the HMA, landslides are widespread. Over the past 2 decades, the area affected by landslide disasters has increased. Both the number of landslides and the affected area were larger during the most recent 10 years (2009–2018) than during the previous 10 years (1999–2008). The area of landslides was anomalously high in three out of four El Niño years.

  2. Landslides in the HMA follow power-law (fractal) frequency statistics. The decay factor for the 10 years from 2009 onwards decreased compared to the previous 10 years, indicating that the probability of occurrence of large landslides has increased.

  3. Glacial area in the HMA has shown an overall downward trend over the past 20 years. Except for those in the Karakoram Mountains, most glaciers throughout the study area have retreated. As the glacial area has decreased, the annual landslide area has increased.

In summary, the retreat of glaciers in the HMA over the past 20 years appears to be associated with more frequent and larger landslides. A weak negative correlation between annual landslide area and glacial area is evident. Although local influences may have triggered landslides in the HMA, glacial retreat may be a useful proxy for aspects of climate that control slope stability. If climate warming continues, the area of glaciers will decrease further. As a result, the probability of large landslides in the HMA may continue to increase.