Articles | Volume 22, issue 3
Research article
01 Apr 2022
Research article |  | 01 Apr 2022

Insights from the topographic characteristics of a large global catalog of rainfall-induced landslide event inventories

Robert Emberson, Dalia B. Kirschbaum, Pukar Amatya, Hakan Tanyas, and Odin Marc

Landslides are a key hazard in high-relief areas around the world and pose a risk to populations and infrastructure. It is important to understand where landslides are likely to occur in the landscape to inform local analyses of exposure and potential impacts. Large triggering events such as earthquakes or major rain storms often cause hundreds or thousands of landslides, and mapping the landslide populations generated by these events can provide extensive datasets of landslide locations. Previous work has explored the characteristic locations of landslides triggered by seismic shaking, but rainfall-induced landslides are likely to occur in different parts of a given landscape when compared to seismically induced failures. Here we show measurements of a range of topographic parameters associated with rainfall-induced landslides inventories, including a number of previously unpublished inventories which we also present here. We find that the average upstream angle and compound topographic index are strong predictors of landslide scar location, while the local relief and topographic position index provide a stronger sense of where landslide material may end up (and thus where hazard may be highest). By providing a large compilation of inventory data for open use by the landslide community, we suggest that this work could be useful for other regional and global landslide modeling studies and local calibration of landslide susceptibility assessment, as well as hazard mitigation studies.

1 Introduction

The impact of natural hazards on populations and infrastructure is most acute where the footprints of these hazards intersect the locations where people live and buildings are situated. For some hazards like earthquakes and cyclones, the footprints of the hazard can be distributed across wide regions, but for other hazards like landslides the footprint may be significantly more localized. Although the impacts of individual landslides may be localized, large triggering events such as intense rainfall or seismic activity can cause large numbers of landslides across a wide region, the extent of which often mirrors the extent of the intense rainfall and seismic shaking (Marc et al., 2017, 2018; Tanyaš and Lombardo, 2019). The individual landslides triggered during these extreme events occur in specific parts of the landscape that are most susceptible to failure. These slopes become critically unstable due to both preconditioning factors like slope and internal frictional strength and triggering factors like change in fluid pore pressure or seismic acceleration.

A range of studies from around the world have assessed the locations of landslides and used them to construct susceptibility models for local settings (e.g., Emberson et al., 2021; Goetz et al., 2015; Broeckx et al., 2019), across larger regions (e.g., Van Den Eeckhaut and Hervás, 2012; Van Den Eeckhaut et al., 2012), and globally (e.g., Stanley and Kirschbaum, 2017; Nowicki Jesse et al., 2018; Tanyaş et al., 2019). Comprehensive reviews of landslide susceptibility models (Budimir et al., 2015; Reichenbach et al., 2018) highlight a number of factors that are often considered to be generally important for landslide susceptibility. These include morphological (slope, aspect, roughness), geological (e.g., lithology), land cover, seismic and hydrological factors. Naturally, to study the importance of each of these factors, information on landslide location is essential to both calibrate and validate any susceptibility model that is produced.

Landslide location data can come in different forms, and landslide inventory maps are the most useful data source in which the extent of landslide phenomena are systematically documented in a region (Guzzetti et al., 2012). Unfortunately, the number of digitally available landslide inventories is still rather limited (Wasowski et al., 2011; Guzzetti et al., 2012; Tanyaş et al., 2017; Mirus et al., 2020). As a result, landslide locations in global catalogs are often based on media reports (e.g., Kirschbaum et al., 2015; Froude and Petley, 2018), which can limit the accuracy of the defined locations. A review of data in the NASA Global Landslide Catalog (Kirschbaum et al., 2015) suggests that only 33 % of landslides have a location known to within a 1 km resolution, which does not permit assessment of the specific locations where landslides occur within a landscape (e.g., at a hillslope scale). Additionally, global landslide catalogs generally do not include the entire landslide population for a given area. While they may capture many of the landslides that cause damage or fatalities (Petley, 2012; Froude and Petley, 2018), underestimation of landslide susceptibility may result if systematic biases in reporting are found for certain geographies or terrain parameters.

Landslide inventories are the ideal data source not only to better understand the spatial, temporal and size distribution of landslides but also to conduct more accurate susceptibility, hazard and risk assessments (Guzzetti et al., 2012). Overall, landslide inventories are categorized as historical and event inventories (Malamud et al., 2004). Historical landslide inventories include many landslide events over time in a given region. Landslide event inventories, on the other hand, contain landslides triggered by a specific trigger (e.g., earthquake, rainfall or snowmelt) of a known date. In other words, the time of landslide occurrence is unknown in historical landslide inventories, and therefore, landslide susceptibility models developed based on historical inventories are time-invariant products solely representing geomorphologically landslide-prone hillslopes (Lombardo and Tanyas, 2020). Historic inventories are by definition biased toward frequent climatic triggers and are not representative of the long-term average susceptibility to triggers including earthquakes, whereas landslide event inventories are more suitable data sources to develop near-real-time products to predict the spatial distribution of landslides triggered by a specific event (e.g., Nowicki Jessee et al., 2018).

For specific large triggering events such as an earthquake or an episode of extreme rainfall, it is possible to relatively accurately define the timing of the event, and if high-resolution imagery is found that brackets the dates in question, it is also possible to systematically map the landslides generated by such a trigger (Guzzetti et al., 2012). Mapping landslides following extreme events has become common, and inventories exist for a large number of earthquakes (Tanyaş et al., 2017). A smaller number of intense rainfall events have also been mapped (Marc et al., 2018), but unlike for earthquakes (Schmitt et al., 2017) no centralized repository of these data exists at present. Location data for landslides triggered by intense rainfall are vitally important to calibrate and validate existing susceptibility models since the datasets produced are generally considered to be nearly complete. It is also useful to characterize the rainfall required to trigger landslides and thus help inform local and global hazard models (Emberson et al., 2021; Kirschbaum and Stanley, 2018). These can then be used to inform exposure and risk assessment estimates (Emberson et al., 2020).

It is important to note that the positions where earthquake-triggered landslides occur on a given hillslope are not necessarily applicable to rainfall-triggered landslides. As shown by previous research (Densmore and Hovius, 2000; Meunier et al., 2008), the higher peak ground acceleration in earthquakes at the top of ridges tends to increase landslides in those locations, while increasing water saturation at the base of slopes by intense rain tends to increase landslides lower down the slope (e.g., Rault et al., 2019). As such, it is imperative to use the appropriate type of landslide inventory to calibrate any model. Finally, recent studies have sought to derive underlying simple topographic rules to understand hazard associated with earthquake-triggered landslides (e.g., Milledge et al., 2019), and it is important that we extend this kind of analysis to rainfall-triggered events to provide comparative data.

In this study, we combine 10 existing inventories of landslides triggered by intense rain storms with 6 new inventories mapped using high-resolution data for this study. Assessing these landslide event inventories both individually and in combination, we assess the local topographic characteristics that are most strongly related to where landslides are initiated, as well as local forest loss that can be calculated from satellite data. We suggest that these inventories and the associated parameters can be used to calibrate and validate other models of susceptibility and hazard and will provide valuable information to authors seeking landslide data with high spatial accuracy, as well as supporting characterization of rainfall thresholds for landslide impacts (e.g., Conrad et al., 2021). Moreover, with a set of simplified rules for landslide hazard, researchers can support hazard assessment in areas where more detailed models may be unavailable. The inventories described here will be available on the NASA Landslide Viewer app (, last access: 6 January 2022) for open access by other researchers.

2 Context and methods

2.1 Landslide inventories

It has become common practice to map areas affected by landslide-triggering earthquakes to build a spatially complete picture of landslide impacts (Tanyaş et al., 2017), and the inventories that are generated have been used to produce hazard maps (Jibson et al., 2000; Harp et al., 2011), susceptibility models (García-Rodríguez et al., 2008; Xu et al., 2012), guidelines for hazard zonation (Milledge et al., 2019) and global alerting systems (Nowicki Jessee et al., 2018). Landslide event inventories are also required to explore the landscape response to tectonic and climatic forcings (e.g., Malamud et al., 2004; Korup et al., 2012; Marc et al., 2016, 2019). Mapping of landslides in the aftermath of major rainfall events is somewhat less common, since cloud cover is often a significant impediment in the impacted areas, which may limit clear views from satellites. However, an increasing number of intense rainfall events have now had landslides mapped, with extensive examples in Taiwan (Lin et al., 2011; Chen et al., 2013), Japan and Brazil (Marc et al., 2018), and the Caribbean (van Westen and Zhang, 2018).

Several methods exist to generate event-specific landslide inventories. The robustness and accuracy of the final inventory depend on the type and quality of imagery and data available, as well as the method chosen. Synthetic aperture radar (SAR) data have been employed to generate inventories of slow-moving landslides (Handwerger et al., 2019; Bekaert et al., 2020), to focus on the kinematics of single slow-moving slides (Hu et al., 2019) and to map landslides occurring in the aftermath of major triggering events (Mondini et al., 2019; Handwerger et al., 2019; Adriano et al., 2020; Burrows et al., 2020; Jung and Yun, 2020). The most widely used technique is to map landslides directly from optical imagery, from unmanned aerial vehicle (UAV) imagery (Casagli et al., 2017; Rossi et al., 2018), aerial photography (Harp et al., 2004) or satellite observations (Casagli et al., 2017; Martha et al., 2012; Behling et al., 2014). While satellite observations generally have the lowest spatial resolution and may be impinged by cloud cover, these satellites offer near-global coverage and frequent return intervals that generally allow for imagery that brackets the event in question. This is particularly the case for some of the newer commercial satellite constellations. Some rainfall-triggered events may occur in locations where cloud cover is so prevalent that it precludes anything other than seasonal assessment of landslide occurrence, such as the Himalayas during the monsoon. However, an increasing number of satellite-generated inventories now exist. The methods used to delineate landslides from optical imagery include manual mapping, where a human determines what is and is not a landslide, or semi-automatic/automatic mapping, where detection algorithms are used to determine landslide locations.

2.2 Methodology

Summarizing various previous work, five mapping criteria appear essential for landslide inventories (see Guzzetti et al., 2012; Marc and Hovius, 2015; Tanyaş et al., 2017): (i) manual mapping (or correction) to reduce errors and avoid amalgamation, (ii) a high enough imagery resolution for completeness and to avoid amalgamation, (iii) mapping landslides as polygons to allow maximum scientific usage (e.g., area affected, volume of sediment mobilized, frequency–size distributions), (iv) mapping with pre- and post-event imagery to focus on landslides with a known trigger, and (v) defined mapping boundary to clarify inventory completeness. For the purposes of this study, we have tried to obtain as many inventories as possible for comparison, while generally satisfying these five essential criteria. Nevertheless, due to varying imagery and mapping techniques, criteria (i) and (ii) are fulfilled with variable quality for the studied inventories (Table 1). More detailed inventories have differentiated the source and deposit areas of landslides, but this often requires field validation. The locations of the inventories are shown in Fig. 1. It is important to note that although high-resolution imagery can provide more accurate mapping in some cases, it can also be more challenging to ortho-rectify, which can limit the quality of landslide inventories generated (Williams et al., 2018).

Figure 1Locations of landslide inventories considered in this study. Locations labeled in red have been published previously, while those in blue are presented for the first time here. Satellite images of the newly mapped landslide inventories can be found in the Supplement. Table 1 contains the details of each of the inventories, organized in alphabetical order.

Table 1Details of landslide inventories analyzed in this study.

Download Print Version | Download XLSX

As such, we incorporate 10 existing inventories and supplement them with 6 further inventories that we have produced for this study. The details of each of the inventories are described in Table 1. For several of the newly produced inventories, we have utilized high-resolution imagery from Planet Dove satellites (Planet Team, 2017) available through the Commercial Smallsat Data Acquisition (CSDA) Program (, last access: 12 January 2022). Planet imagery represents an important step change for landslide mapping procedures, since the high spatial resolution of approximately 3 m of the images is combined with a rapid return time of the satellites (of the order of 1 image per day). For several of the newly generated inventories, we have mapped the landslides manually using GIS software. This has resulted in three inventories; two in the Philippines and one in Thrissur, India (Table 1). The remaining new inventories were generated using the semi-automatic object-based methods of Amatya et al. (2019, 2021). The algorithmic method was used to reduce the overall time spent mapping some of the larger new inventories. Since algorithmic methods are known to produce artifacts when broadly applied (Pawluszek et al., 2018) and can lead to amalgamation of individual landslides into larger polygons (Marc and Hovius, 2015), each of the automatically generated inventories was additionally corrected by manual comparison with pre- and post-event high-resolution imagery in GIS software.

Beyond the five essential mapping criteria, additional criteria include the differentiation of scar and deposit areas and the classifications of landslides according to their type/mechanisms. However, these criteria are difficult to fulfill for large event inventories (see in Tanyaş et al., 2017), especially when based on various sources of optical imagery, limiting our ability to differentiate between scar and deposit areas (Casagli et al., 2017).

The mapped inventories combine scars and deposits in the polygon delineation, although in the analysis discussed below we have sought to differentiate these areas. In terms of landslide type we could not systematically classify each landslide polygon. However, we have removed debris flows from the analysis where possible by removing long-runout landslide polygons from each mapped inventory. In general, this mapping identifies rockslides, rock avalanches, shallow soil toppling and slumping failures but does not capture slow-moving landslides where surface changes may be less evident. A focus on these kinds of landslides is warranted since the volume of material mobilized during large storms from such landslides can lead to damaging debris flows and bedload transport impacts (Badoux et al., 2014). Removing debris flows from the analysis allows us to provide consistent landslide maps that can be used to estimate volumes of mobilized landslide material, for example using global scaling relationships like those defined by Larsen et al. (2010), and permits a focus solely on the topographic characteristics of landslide source regions, rather than on the characteristics of preferential runout paths.

Figure 2Probability density for landslides in each event inventory, obtained as the number of landslides with areas falling into logarithmic bins (consistent bins for all inventories), N[A:A+dA], normalized by the bin width, dA, and the total number of slides for the event, Ntot (see Malamud et al., 2004).


Table 2Macro-level characteristics for events discussed, including rainfall statistics. Note that median slope values have been calculated by excluding very low slope values (<1) to remove lakes and oceans.

Download Print Version | Download XLSX

We contrast each of the inventories mapped here by comparing the size–frequency distributions of each dataset, shown in Fig. 2. For each of the inventories, we show the probability of a landslide within a given area interval, as a way to assess the frequency of small and large landslides across the different datasets. Each of these landslide events was triggered by extreme rainfall, and although it is not our intention to examine the triggering rainfall in detail in this study, it is useful to briefly discuss the characteristics of the rainfall events in question. It is important to note that the date of the triggering rainfall is not identical to the dates on which the imagery used to map the landslides was obtained. Although we have selected events where the triggering rainfall significantly exceeds historical peak rainfall (and therefore is likely to be the dominant trigger for landslides), some events may have occurred as a result of lesser rainfall before or after. While the new inventories generated for this study utilize Planet imagery that closely brackets the rainfall events (within 1 week either side), the older inventories may be more subject to this challenge. A detailed analysis of the triggering rainfall associated with several of these inventories is described by Marc et al. (2018), who used local gauge data to characterize the rainfall intensities. We were unable to find consistent local gauge data for several of the more recent events that are published here for the first time (events in Zimbabwe, Burundi and Kenya and the two events in the Philippines). We can still use satellite rainfall data as a consistent source of rainfall for each of the events, however. To assess these, we utilize the reprocessed IMERG (Integrated Multi-satellitE Retrievals for GPM) version 6B rainfall product (Huffman et al., 2020), which merges and homogenizes data from NASA's Global Precipitation Measurement (GPM) mission with its predecessor Tropical Rainfall Measuring Mission (TRMM). All of the events considered occurred within the period during which GPM IMERG v06B rainfall data are available (2001–present). Because the satellite rainfall data spatial resolution is relatively coarse, it is not possible to effectively draw comparisons between the landslide polygons and the surrounding data in the same manner as the topographic data. However, we can still characterize the rainfall occurring during each event. We have analyzed the total rainfall occurring during each of the events by accumulating the rainfall data over the period of each event indicated in Table 1 and compared this with the calculated historical 99th percentile of daily rainfall as a way to normalize each event to the historical trends. The 99th percentile is calculated empirically based on the GPM IMERG v06B record (2001–2020). Since the length of rainfall period associated with each inventory varies, normalizing by the 99th percentile for a single day provides a consistent normalizing factor for each inventory. Additionally, in Table 2 we show the maximum 3 h rainfall intensity for each of the events, normalized by the historical 99th percentile of daily rainfall. The normalized total event rainfall and normalized 3 h rainfall provide a side-by-side comparison of the overall rainfall accumulation and the maximum intensity. The values for both total event rainfall and maximum 3 h rainfall are calculated as the average across all IMERG grid cells in the area of the inventory. Table 2 summarizes information on the landslide inventory characteristics including the total landslide area, the density of landslides in the mapped area, satellite rainfall and average slope in the mapped area. Despite other studies suggesting links between event total rainfall and the density of landsliding (Chen et al., 2013; Marc et al., 2018), we do not observe clear links between the measured rainfall data and the macro-scale statistics of each landslide inventory. Relations between landslide density and rainfall can be obscured by variations in climatic and/or hydromechanical properties with each study area (Marc et al., 2019). We suggest that exploring the links between rainfall intensity as characterized by satellite measurements and the density of landsliding that results is an important topic for future research.

2.3 Topographic analysis

We have analyzed the topographic characteristics of landslide locations for the event inventories, using global satellite datasets to ensure consistency across each site. These datasets are also openly available, which supports replication of these methods and findings by other authors. In Table 3, we show the datasets we have used.

Table 3Analysis datasets. Explanation of each of the variables is found in the accompanying text.

Download Print Version | Download XLSX

The DEM and forest loss data are both provided at approximately a 1 arcsec resolution, which means we do not have to resample either dataset when conducting a raster-based analysis at this scale. While this resolution is not as fine as some of the imagery used to map the landslides, which can be 3 m or finer, it represents the finest resolution at which these two datasets can be analyzed using non-commercial, open datasets at a global extent. We utilize forest loss data derived from Landsat imagery spanning the years 2000–2018. Cells where forest loss is observed in any year from 2000 until the year in which the landslide event occurred are considered a binary “true” value for forest loss. This does not consider regrowth of vegetation in places where forest loss was observed many years prior to the event, and as such it is a relatively blunt tool to assess the importance of vegetation to landslide location.

Not all of the topographic parameters are universally used in landslide analysis. Slope is almost universally considered for landslide modeling, but the use of others (in particular the topographic position index) is less common (Reichenbach et al., 2018). The topographic position index (TPI) (Weiss, 2001) is a quantification of the relative position of a cell within the landscape. It is calculated as the difference in elevation of each cell in a DEM from the mean elevation of a specified neighborhood around that cell, with the radius of the neighborhood chosen beforehand (in this case, 300 m). Negative values indicate the cell is in a topographic hollow, and positive values suggest that it is elevated above its surroundings. The distance over which the neighborhood comparison is made (TPI wavelength) determines the scale of the features resolved; negative values at long wavelengths indicate a position in a wider valley, while at short wavelengths this would indicate steep narrow gorges. In this study, we focus on short-wavelength TPI values since this aligns more closely with the scale of the landslide features. Relief indicates the difference between minimum and maximum elevation in a given window. It is a proxy for both slope and the size of hillslopes; higher-relief zones have been shown to be associated with landslides in many locations (Reichenbach et al., 2018). The compound topographic index (CTI) is a measure of both slope and the upstream contributing area. It is calculated by the formula ln(a/tanb), where a is the flow accumulation area per pixel and b is the local slope in radians. In some locations the CTI is correlated with soil parameters such as thickness (Liang and Chan, 2017). We also calculate the topographic ruggedness index (TRI), a measure of the local surface roughness. It is defined as the root-mean-squared difference in elevation between a central pixel and each of its 8 neighboring pixels.

Finally, we also analyze the average upstream angle – this is the average angle from the pixel location to every cell that drains into that pixel. It provides a measure of how steep the areas are that feed into each pixel. There is a significant degree of overlap between how some of these parameters are calculated, and we recognize the importance of considering co-linearity.

In order to assess the co-linearity of the variables, we have compared each pair of variables. Pair plots are shown in the supplementary material (Figs. S1 and S2 in the Supplement). Unsurprisingly, strong correlations are observed between slope and the average upstream angle, as well as between topographic ruggedness and relief. It is also important to note the negative relationship observed between the TPI and CTI, confirming that the hollows in the landscape are also locations where the saturation state is likely to be higher. Considering these co-linear relationships, it is important to ask which variables are the most effective predictors of landslide locations for the analyzed inventories. To analyze the importance of the input variables, we first perform analysis of the influence of each individual variable as a bivariate analysis (Sect. 3.1) and then use a generalized linear model to explore the effect of co-linearity (Sect. 3.2).

For the assessment of each parameter by itself, we calculate the relative ratios of the distributions for each variable for the topography and the landslide populations. The topography values are calculated for all pixels within the area in which landslides were mapped. Since we lack data on the mapped areas for all of the inventories, we assume that the convex hull (minimum bounding polygon) for the landslide polygons represents the mapping area. This follows the example of other recent studies (Marc et al., 2018; Milledge et al., 2019). For both the landslide parameter distribution and the parameter distribution for the topography, we divide the values into bins, normalizing by the total size of the distribution. This essentially represents a value–frequency distribution. For each of the bins, we then divide the landslide probability by that of the topography to obtain a ratio. Using slope as an example, this provides an estimate of the probability of a landslide occurrence at a given slope value compared to the occurrence of that slope in the landscape. This step is meant to explore the significance of each variable in a bivariate structure (Fig. 3).

Figure 3Example of landslide : topography ratio comparison for landslides in Zimbabwe triggered by Cyclone Idai. This shows the distribution of values for landslides (a) and topography (b) for slope. The lower part of the figure (c) shows the relative ratio of the two distributions for the parameter. The black lines in (a) and (b) illustrate the mean value of the data. In the lower figure (c), the size of the points depends on whether the difference between landslide and topography exceeds the calculated confidence interval for that bin interval; larger points indicate significant deviation (p>0.95), and small points indicate the difference is not significant at that probability level.


We characterize the landslides in two ways – first, by calculating the parameter value for the scar area of the landslide and, secondly, by calculating the raster values for the entire landslide body. We lack consistent data on the scar area for the landslide inventories in question, so instead we calculate an approximation of the scar area based on the geometry of each individual landslide. We utilize the method of Marc et al. (2018) to extract the scar areas, which uses the perimeter and area (A) of landslide polygons to calculate the aspect ratio of an equivalent ellipse, K (Marc and Hovius 2015), and the associated width (W), according to the formula W(4A/πK) (Marc et al., 2018). The scar area is defined as ∼1.5 W2 based on a global database of the scar aspect ratio (Domej et al., 2017). For each scar, we calculate the average value within the polygon area of each parameter. This avoids bias toward landslides with long runouts and effectively removes the lowest portion of the polygons which may not have the same topographic signal as the source areas. Even if the scar areas are only approximate, focusing on the upper part of the landslides is warranted for an improved understanding of the topographic control on landslide susceptibility.

The second way of characterizing the landslide – assessing the overall area of the landslide – allows us to focus on areas in the landscape that are likely to be hazardous, including areas where landslide material may end up.

To calculate the parameter distribution for the whole landslide body, we first rasterize the polygons of landslide locations to the resolution of the SRTM DEM (1 arcsec). This provides a binary raster of landslide presence. We then assess the parameter values for each of the pixels where landslides are present. This does mean that the largest landslides are most strongly represented in the distribution, but this is intentional as it permits us to focus on all of the areas affected in the landscape. It is important to note this approach – counting all pixels – is not appropriate for statistical susceptibility analysis, since it could lead to highly dependent datasets. For hazard analysis purposes, we feel it is appropriate to consider all pixels, since larger landslides are consistently more damaging, and we seek to capture the entire footprint.

In Fig. 3, an example of the comparison is shown for the landslides occurring in Zimbabwe as a result of Cyclone Idai. We show upstream slope as an illustrative parameter. To compare the landslide data with the topography, we split the data into bins, using the same bins for both landslide and topography. The probabilities of the landslide and topography values are then compared with one another. To allow for more consistent comparison of inventories with diverse topography, we normalize the landslide and topography data by the median value for the parameter in question prior to splitting the data into bins (Marc et al., 2018; Milledge et al., 2019). This specifically results in the normalized conditional probability (Milledge et al., 2019). For example, in the case of slope, we calculate the median slope value for all pixels within the mapped area for each inventory and divide each binned interval by the median slope value calculated across the mapped area. Finally, for each bin, we calculate the confidence interval for the comparison of topography to landslides using the method of Rault et al. (2019).

We have generated these estimates for each of the variables listed in Table 3 and for each of the landslide inventories listed in Table 1. One of the variables – forest loss – is a binary variable – it is calculated as forest is either lost or not. As such, we can only compare the average value for landslides and for topography at large to obtain a relative difference in the average forest loss value.

Because landslides are triggered as a result of a complex interaction between various factors, we also analyze the inventories using a multivariate regression scheme to consider the interactions between the topographic factors. We do so by fitting a binomial generalized linear model (GLM) for each landslide inventory. We also apply a feature selection algorithm to identify the significant and irrelevant variables to feed the GLM. For this purpose, we use the least absolute shrinkage and selection operation (LASSO) technique (Tibshirani, 1996). This method is particularly suggested for landslide susceptibility assessment to reduce the large number of highly correlated predictors without losing parameter interpretability (e.g., Camilo et al., 2017). GLM fitting with a LASSO implementation is carried out by using the R (R Core Team, 2018) “glmnet” library, which was made available by Friedman et al. (2021). We apply this method and couple it with the 10-fold cross-validation to remove non-informative covariates and to assess the modeling performance based on the area under the receiver operating characteristic curve (AUC) calculated for each landslide inventory (Hosmer and Lemeshow, 2000). From each model we built, we store the information related to the regression coefficients. Before fitting the regression model, we apply a mean zero and unit variance normalization to all variables (e.g., Lombardo et al., 2018), which are expressed in different ranges and on different scales. This normalization allows us to better examine the modeling results in terms of the contribution of each variable. In this scheme, larger absolute values of the regression coefficients refer to a relatively large contribution of variables.

We have combined the results from each individual inventory into a single figure for each of the variables to assess relative differences, as well as which variables are most strongly associated with where landslides are mapped.

3 Results

3.1 Bivariate analysis

The bivariate analyses show that several of the parameters are strong predictors of the location of both the scars and the overall area of landslides, and while there is significant variability between the different inventories, there are consistent patterns that emerge across all events. In broad terms, we find that rainfall-triggered landslides occur more often in rough, steep terrain (Figs. 4, 5 and 7). Results from the compound topographic index (CTI) and short-wavelength topographic position suggest that these parameters can be used to effectively distinguish between scars and the entire landslide area, with a high probability of scars at low CTI values and at more positive TPI values (i.e., landscape convexities). For all metrics, we find that all studied inventories have approximately equal sampling at the median landscape value. This can be observed in Figs. 4–8, where the probability ratio of 1 for almost all inventories occurs at approximately the median value of the parameter for the entire landscape. In other words, the transition from low to high landslide probability is relative to the local landscape median value and not to an absolute value of the considered metric.

Figure 4Landslide probability ratio against the slope normalized by the median of the local landscape, for the scar area (a) and the whole landslide area (b). The size of the points depends on whether the difference between the landslide and topography exceeds the calculated confidence interval for that bin interval; larger points indicate significant deviation (p>0.95), and small points indicate the difference is not significant at that probability level.


For all of the events, there is a general increase in landslide probability at higher slope values (Fig. 4). Similarly, a strong increase in landslide probability is observed for the average upstream slope angle (Fig. 5). The distributions of the different inventories are slightly tighter than for slope, indicating that this may be a more consistently applicable variable. Specifically, we note that both scars and whole landslides are at an equal sampling level at the median landscape upstream angle, strongly undersampling and oversampling the gentler and steeper slopes, respectively (i.e., proportionately more landsliding at higher slope values and less at lower slopes). This can be observed in Fig. 5. Consistent trends emerge where results are within a 95th-percentile confidence interval, although there is a greater spread of data values where results are not considered statistically significant.

Figure 5Landslide probability ratio against the average upstream angle normalized by the median of the local landscape, for the scar area (a) and the whole landslide area (b). The size of the points depends on whether the difference between the landslide and topography exceeds the calculated confidence interval for that bin interval; larger points indicate significant deviation (p>0.95), and small points indicate the difference is not significant at that probability level.


Figure 6Same as Fig. 4 but for the compound topographic index (CTI). Note that the x axis is a logarithmic scale.


Figure 7Same as Fig. 4 but for the topographic ruggedness index (TRI).


The compound topographic index (sometimes referred to as the wetness index), when tested for landslide scars, shows higher probability of landslides for lower CTI values (Fig. 6). This trend is negligible for whole landslide areas, with the statistically significant points showing almost no variation in landslide probability with a changing CTI. This suggests broadly that the CTI is a poor predictor of the areas where landslide hazard may be increased but a better predictor of the source locations. The relationship between scar probability and the CTI is not strongly linked with flow accumulation, despite the role that flow accumulation plays in setting the CTI. In Fig. S3 we show the probability ratios for flow accumulation values, and no clear relationship emerges between the probability ratio and flow accumulation. This suggests that the slope component of the CTI is more important when considering scar locations, while the flow accumulation factor (observed to be correlated with an increase in probability of whole landslide areas) may be offset by the slope in the areas where landslides run out, leading to no overall correlation with the CTI and whole landslide area.

Additionally, we observe that the CTI value where landslide scars and topography are equally sampled is approximately the median value, and the fit for each inventory is relatively consistent.

Figure 8Same as Fig. 4 but for relief in a 1 km radius of each cell.


We find that the topographic ruggedness index is also a relatively strong predictor for landslide probability, with increases in the TRI correlated with increases in the landslide probability ratio for almost all events (Fig. 7), and statistically significant results are observed for several of the inventories. For several events, the results are in line with prior work that has shown roughness and related metrics to be correlated with landslide occurrence (Costanzo et al., 2012; Reichenbach et al., 2018). While the point of equal sampling of landslides and topography is approximately the median value for the inventories analyzed, the slope of the relationship diverges somewhat above and below this point. This suggests that the most heterogeneous parts of the landscape may not be as strong a predictor for landslide occurrence as areas of high slope.

There are not strong systematically consistent relationships between relief at a 1 km scale and the probability of landslide scars and total areas. Some increase in probability is observed with increasing relief, although this is saturated at relief values above the median relief. This suggests that relief alone is a relatively poor predictor of the source areas of landslides or that the resolution of relief may be too coarse. In a few cases – Burundi, Typhoon Morakot in Taiwan and Kii Province in Japan – there is a slightly clearer increasing relationship.

We observe a link between the short-wavelength topographic position index (300 m assessment radius) and landslide probability ratios for landslide scars in several, but not all (i.e., not Kalmaegi, Morakot and Kii), of the events (Fig. 9). Specifically, landslide scars are significantly more likely at positive TPI values (short-wavelength landscape convexities like ridges). Although this is the case for the majority of inventories, results from both inventories in Taiwan and Dominica do not exhibit this tendency. This parameter also shows the clearest distinction between the scar areas and the whole landslide areas. The entire landslide area is more likely to be found at negative TPI values (landscape concavities like valley bottoms). While the TPI at the 300 m wavelength does not demonstrate quite as consistent relationships as slope or the CTI, it remains a strong predictor. In particular, the larger variation between scars and overall landslide areas suggests that short wavelengths may be a valuable way to distinguish between scarps and deposits in a preliminary assessment.

Figure 9Same as Fig. 4 but for the topographic position index (however, here we omit normalization given that the TPI is a zero-centered variable) with an analysis window radius of 300 m.


Since forest loss is a binary variable, we do not plot this across multiple bins. However, we calculate the average ratio of forest loss in landslide zones to the overall topography, and we find across all events that the value is 1.98±1.38 (1 SD – standard deviation). This implies that forest loss zones are correlated with higher probability of landsliding. We see the largest differences between landslide zones and non-landslide zones in Salgar, Colombia, and Hiroshima, Japan, and the landslides triggered by Typhoon Morakot in Taiwan (forest loss is more than 3.5 times more likely in landslides in these cases). In four cases, forest loss was observed as less likely in landslide pixels: in Blumenau, Brazil; Lanao del Norte, the Philippines; Thrissur, India; and West Pokot, Kenya. We suggest that while forest loss prior to landslide events is generally higher in landslide locations than in the rest of the landscape, this relationship is not consistent enough to necessarily be a good predictor of landslide location by itself.

3.2 Multivariate analysis

We have used the LASSO method to quantify the importance of the different predictors for both scars and whole landslides while reducing the influence of co-linearity (Camilo et al., 2017) (Fig. 10).

Figure 10Figure showing regression coefficients and corresponding AUC values calculated from the fitted GLM for each landslide inventory, considering scars (a) or the whole landslide area (b). The error bar corresponds to the variation in the AUC distribution across 10-fold cross-validation replicates. Regression coefficients are not shown for predictors considered not significant by the LASSO methods. The AUC values shown in the legend are for scars (prior to slash) and entire landslides (second value).


Figure 10a and b also show the modeling performances, which are represented by AUC values, varying from “reasonable” (0.6<AUC<0.7) to “adequate” (0.7<AUC<0.8) and “outstanding” (0.8<AUC<0.9) based on AUC classes defined by Hosmer and Lemeshow (2000). The lower AUC values for some events may indicate that we lack some key explanatory variables; crucially, this may include rainfall intensity and duration, although as discussed above we lack consistent high-resolution data to assess this within the spatial domain of the considered inventories. Other relevant parameters may include land cover, lithology and geo-structural parameters, and anthropogenic influence (Reichenbach et al., 2018).

Our findings show that slope, on one hand, is the factor that most frequently appears as significant in the GLM run for both the landslide scars and the whole areas, with landslides favoring steeper locations. On the other hand, the average upstream angle and CTI in the GLM of the scars and the average upstream angle, CTI and topographic position index (with a 300 m radius) in the GLM run for the whole area appear as the least commonly observed significant variables. The non-significant or low impact of most of these topographic variables is likely due to the co-linearity existing between these variables (e.g., slope and average upstream slope, slope and CTI, CTI and TPI – Figs. S1 and S2).

The results indicate that except for the CTI, all variables have a positive weight on classifying a given grid cell as “landslide presence” instead of “landslide absence” given the choice of predictors. There are only two cases that do not follow this general trend (i.e., the two Taiwanese inventories).

The regression coefficient of the TPI calculated for the landslide scar areas for Typhoon Kalmaegi, Taiwan, has a negative sign, unlike all other cases. Overall, as we explained above, different TPI values refer to different subsections of a hillslope. Specifically, positive values refer to ridges or hillslopes, whereas negative values correspond to the valley. As a result, the negative weight of the TPI on classifying the landslide presence and absence condition in the GLM is difficult to interpret. This could be caused by interactions between variables. Although we run a variable selection method (i.e., LASSO), the TPI could be still interacting with the others. If it shares a similar signal to at least one other variable, the sign of the regression coefficient can be influenced by the interaction.

The negative regression coefficient of the TRI obtained for the whole landslide areas for Typhoon Morakot, Taiwan, is the other case where the response of the covariate is different from the other examples. Similarly to the Kalmaegi inventory case we presented above, the negative sign could also be caused by the interactions between variables. However, this could also be associated with the physical properties characterizing whole landslide areas. The TRI in the case of Typhoon Morakot, Taiwan, in particular, might correspond to smooth topography. In either case, the TRI does not appear as a significant variable other than in case of Typhoon Morakot, Taiwan.

The two inventories where negative CTI values are most significantly associated with landslide incidence – Morakot and Hiroshima – have few commonalities; their lithologies differ, and the mean slope of the affected area in Hiroshima is markedly lower than for Morakot. Perhaps the most significant commonality is that the triggering rainfall exceeded the historical maxima by a significant degree (Table 3), which may increase the likelihood of failure resulting from local transient pore pressure increases, rather than due to saturated flow at the base of hillslopes. Excepting these two examples, the results show that the signal of the CTI does not contribute to the GLM while classifying landslide presence or absence.

4 Discussion

The primary intention of this study is to assess the critical topographic parameters associated with rainfall-triggered landslides, using a large dataset of landslide inventories that includes six newly mapped events. Our results are comparable with existing studies (e.g., Marc et al., 2018; Milledge et al., 2019) exploring landslide locations using inventories while adding more detail and assessment of global variability, which we discuss below. First, however, we explore some of the limitations and assumptions that go into the mapping and analysis of the landslides.

4.1 Uncertainty in landslide mapping and DEM metrics' extraction

Firstly, it is important to consider how representative the landslide inventories are. We have attempted to include landslide maps from a diverse set of locations around the world, but this is still only a fraction of landslides that have occurred in the last 2 decades. Some of the inventories, like the landslides occurring due to Typhoon Morakot in Taiwan, are driven by such huge rainfall events that the overall area of landsliding greatly exceeds other examples with lower rainfall. This is one of the key reasons why we have used probability ratios as a metric to assess landslide locations, since they do not consider the overall area of landslides triggered by a given rainfall event. Most of the inventories are drawn from locations with tropical climates (Geiger, 1954), and although the pairs from Brazil and Japan sample areas of humid subtropical climates, this is not a true representative sampling of climatic regimes. One exception is the landslide inventory from Zimbabwe, which is in a semi-arid climatic region. Although our examples disproportionately sample tropical and subtropical areas, these areas generally experience the highest erosion rates (Milliman and Syvitski, 1991), driven by increasingly intense rainfall (e.g., Bookhagen and Strecker, 2012). Similarly, while the inventories are drawn from places with diverse lithologies, we do not have datasets from a fully representative set of lithological locations (Hartmann and Moosdorf, 2012; Table 1).

Although we have used hand corrections to reduce the impact of polygon amalgamation from algorithmic mapping methods, some inconsistencies may still exist. One important consideration is that the datasets used here do not distinguish between different landslide types or distinguish between scar and deposit. While this is consistent across inventories, it is important when considering the results. In particular, since we do not have constraints on whether mapped landslides are purely shallow soil slides or whether they incorporate deeper bedrock, we cannot determine differences in topographic characteristics associated with each. The change in material properties from soil to bedrock can lead to changes in overall volume mobilized for a given landslide area (Larsen et al., 2010), so inventories where smaller shallow landslides are a larger proportion of the mapped inventory may have different characteristics. For example, the landslides mapped using aerial photography around Hiroshima, Japan, do not show particularly high probability ratios at very high relief or TRI values, suggesting these landslides generally occur on smaller less rough hillslopes.

Hand mapping and correcting will help reduce the potential for landslide amalgamation (Marc and Hovius, 2015), which is essential in order to estimate width and scar areas from landslide polygon geometry (Marc et al., 2018). Since we do not have access to the imagery for all of the previously published events, we are not able to correct these events and thus must rely on prior mapping being consistent with our own efforts. Part of the challenge of compiling different events is the different sources of imagery used to create each inventory. For most of the inventories we have mapped as part of this study, the imagery is consistent in terms of resolution, and we have benefited from the rapid return time of Planet Dove satellites to ensure that cloud cover does not mask any of the areas mapped. However, without imagery to clarify, it may be possible that parts of the previously published inventories are masked by cloud cover. In addition, the inventories mapped using coarser-resolution satellite imagery, such as the part of Taiwan impacted by Typhoon Kalmaegi, may not capture the smallest landslides that resulted. If smaller landslides are preferentially found in certain parts of the landscape, this may introduce systematic biases in observed probability ratios.

While the imagery used to compile the different inventories varies in resolution, there do not seem to be consistent, systematic differences between probability ratios that can be explained as a result of small landslides that systematically bias the events. For example, landslides in the Dominica events have similar probability ratios for each parameter compared to the landslides from Typhoon Kalmaegi in Taiwan, despite these datasets having the largest difference in effective imagery resolution.

When considering the whole landslide area, we pixelate the landslides to the resolution of the DEM to highlight the most hazardous parts of the landscape. This pixelation process can introduce a source of systematic error, since if less than half of a cell area is occupied by a landslide polygon, it is still considered to be a “landslide pixel”. Some landslides may be significantly smaller than the SRTM cell resolution if they are mapped using high-resolution imagery, but they will still count as a full-size pixel for the purposes of analysis as a result of the rasterization process. This introduces a potential source of bias as smaller landslides may make up a larger proportion of analyzed pixels than their actual area would represent. Landslides below the approximate area of half an SRTM cell (450 m3) make up an appreciable proportion of the total landslide area in Hiroshima and Dominica (∼30 %) (Fig. S12). In these settings, we would expect that the influence of smaller landslides may be over-estimated at the 30 m resolution of analysis, although in the majority of other inventories landslides below this cutoff represent less than 5 %–10 % of the total landslide area (and therefore the analytical dataset of landslide pixels).

To address the potential for bias due to oversampling of small landslides with a coarse-resolution DEM, we have resampled the DEM for the events in Hiroshima and Dominica to a resolution of 10 m, at which nearly all landslides are captured without size exaggeration. We then recalculate the probability ratios for the landslides and compare the resampled DEM results with those from the original DEM (Fig. S5). For the slope, average upstream angle, CTI, relief and TRI, only minor differences are observed between the results for a resampled DEM and the original DEM. There are some differences between the TPI at a 300 m resolution, but no consistent relationship seems to emerge. Thus we do not think our results are affected by the coarse rasterization process, although it is likely that accessing a higher-resolution DEM may alter the result depending on the local variable, like the slope, CTI or TRI.

Other parameters are often incorporated into landslide susceptibility such as geological factors like soil characteristics or lithology, local land cover type, or climatic metrics. Although global data for rainfall, soil type and geological parameters exist, the resolution of these datasets is too low to allow for consistent comparison of landslide and non-landslide areas at the scale of the analysis described here (∼100 m). As such, we have chosen to focus exclusively on topographic factors within our assessment.

Figure 11Probability ratio of landslide scar areas compared with the entire landslide area. It is important to note that while Figs. 3–9 contrast the landslide areas (scar or entire mapped area) with the topography, this figure shows the ratio of probabilities for the scar and whole landslide area. Specifically, this shows the probability of a scar at a given CTI value, normalized by the probability of that CTI value, divided by the probability of the entire landslide at a given CTI value, normalized by the probability of that CTI value. Higher values indicate that scars are more likely than whole landslide areas at that parameter value. Panel (a) shows the result for the CTI and panel (b) for the short-wavelength TPI.


4.2 Differences between scar and overall landslide area

While for several parameters, scars and the entire landslide area are similarly sampled at a range of values, for the TPI and CTI we see significant differences across a large number of the inventories (Fig. 11). Scars are more likely at lower CTI values and at more positive TPI values. A positive TPI implies scars are more likely at concave locations in the landscape, while a lower CTI value indicates areas with lower flow accumulation and saturation. This describes parts of the landscape that sit closer to ridges. This broadly supports the assessment above that higher TPI and CTI values may be a way to distinguish between scar and deposit areas. No systematic differences are observed with respect to the TRI or average upstream angle (Fig. S4).

By comparing scars and the overall landslide area, the observations we have made provide informative contrasts with prior work. Similar recent work exploring the characteristics of earthquake-induced landslide inventories suggests that the slope angle and upslope contributing area are key determinants of hazard, defined by the entire landslide areas (Milledge et al., 2019). Our findings are consistent for entire landslide areas but differ for the scar area, which is poorly determined by flow accumulation (Fig. S3). One may be surprised by the fact that landslides triggered by intense rainfall have scars uncorrelated with drainage area, while both earthquake- and rainfall-induced landslides have whole areas strongly related to it. We propose that the whole-area relationship mainly reflects runout paths and not hydrological processes and that the initiation of rainfall-induced landslides poorly relates to the surface-parallel hydrological flow. This is discussed in more detail below. Our results for drainage are of the whole landslides are quite different from the ones of Milledge et al. (2019). We suggest that the variability in scar location (higher for earthquake; see Rault et al., 2019) may explain more diverse behavior in normalized drainage below the median, while the propensity for longer runout (more likely for rainfall-induced landslides) may explain that some (not all) cases have a probability ratio increasing until reaching very large drainage levels (Fig. S3).

The observed differences between the scar and overall landslide area can be exploited to refine our understanding of susceptibility and hazard modeling by focusing on parameters controlling scar areas where landslides are initiated (e.g., slope, CTI) and the entire landslide area where landslides impact (e.g., drainage area, TPI), respectively. Such a focus can help support both sets of applications for more comprehensive landslide hazard information and emphasizes the need to distinguish diverse portions of mapped landslides depending on the study objective.

4.3 Implications for triggering mechanisms and landscape evolution

One of the most consistent observations that emerges from this study is that for several parameters (slope, average upstream slope, TRI, CTI), the critical point where landslides and topography are equally sampled is approximately the median value for the inventory in question. This is consistent with previous observations on rainfall- and earthquake-induced landsliding (Marc et al., 2018; Milledge et al., 2019). For the average upstream slope and CTI, the relationships for different inventories are in fact very similar; this is despite a large variation in the median slope for each of the inventories (Table 2). This suggests that landslide probability is strongly dependent on the median topography, rather than on a specific critical angle. This implies two important points: first, that despite important differences in the landscapes observed, consistent hazard relationships can be defined based upon median landscape values and, second, that these diverse landscapes may be in a form of long-term equilibrium with respect to their landslide behavior.

We suggest that each of the considered landscapes, each with its own lithology, vegetation, climate and tectonic forcing, may have evolved such that local hillslopes have slope gradients and hydromechanical properties that set the possibility for landslides on the upper half of the distribution. The evolution of the hillslopes' regolith state, which acts as an important control on landslide susceptibility, under climatic forcing is predicted by geomorphological models of hillslope stability coupled with stochastic rainfall forcing (Dietrich et al., 1995; Iida, 1999, 2004). Landscape evolution toward a critical state was also inferred to explain why landsliding in the Kii peninsula better matched the relative rainfall anomaly than absolute rainfall patterns (Marc et al., 2019).

Alongside the implications for landscape evolution and how to derive susceptibility metrics, our results also offer insight into the mechanisms of landslide triggering in extreme rainfall events. By focusing on the scar areas, we observe that landslides are more likely in locations with lower CTI and higher TPI values – parts of a landscape near ridges with a generally lower propensity for water saturation. This is somewhat in contrast to studies that suggest rainfall-triggered landslides are more likely to occur in areas lower down hillslopes where fluid saturation is greater (Densmore and Hovius, 2000; Meunier et al., 2008), possibly because these studies did not clearly differentiate the scar from whole landslide area, which are very different relative to these two metrics (Fig. 11). The relationship with the CTI and drainage area also suggests that modeling landslides under the assumption of regolith saturation due to slope-parallel, steady-state flow (e.g., Montgomery and Dietrich, 1994) may be inadequate. Instead, the pore pressure triggering landslides in extreme rainfall events may rather be controlled by transient, vertical infiltration and/or preferential flow paths (Iverson, 2000; Montgomery et al., 2009; Hencher, 2010; Bogaard and Greco, 2016). This recalls the essential challenge for developing modeling approaches that can account for such complex hillslope hydrology as well as highly variable hydromechanical properties of the regolith. Nevertheless, we also suggest that future studies should compare the results from this analysis of landslides triggered by extreme rainfall with landslide inventories resulting from longer-duration, lower-intensity rainfall events to assess whether the relationship with the CTI and TPI changes. Indeed, we might expect that lower-intensity rainfall would trigger landslides in parts of the landscape with higher CTI values as steady-state saturation may be more widespread. In any future comparative study of low-intensity and high-intensity rainfall events, it will be necessary to carefully select landslide inventories where the imagery used to generate them closely brackets the start and end of the rainfall events to ensure only landslides triggered by an individual event are analyzed.

Finally, for some events, including Typhoon Morakot in Taiwan and Cyclone Idai in Zimbabwe, there is a small decline in relative landslide probability at very high slope values (>35–40) (Fig. 4). It is possible that these slopes, which are generally above the angles considered to be critically unstable (Selby, 1982; Roering et al., 2001), may represent areas where landslide probability is reduced as a result of non-topographic factors, such as local lithological bedding plane angles (Guzzetti et al., 1996; Santangelo et al., 2015) or thinner soils limiting the availability of material that can be mobilized as landslides (Prancevic et al., 2020). The decline in landslide probability at the highest slope values may represent the limit to which local pore pressure as a result of extreme rainfall can influence triggering.

5 Conclusions

In this study we have combined 10 existing rainfall-induced landslide inventories from a range of mountainous regions with 6 new inventories mapped as part of this study. We suggest that providing newly mapped inventories is a valuable service for the landslide community at large, and we anticipate that these inventories can provide data to calibrate and validate susceptibility and hazard models both in the specific locations where landslides occurred and also further afield. In addition, we have used moderate-resolution open-source satellite data to assess the parameters that characterize the location of landslides in these inventories. We find that alongside the previously documented importance of slope and topographic ruggedness, the average upstream angle and topographic position are also determinants of landslide probability in a given location. After normalizing the topographic variables by the local landscape median, we find consistent relationships across the different inventories despite the variety of lithological and topographic settings. This suggests that relative metrics should be considered to perform landslide susceptibility analysis and that different landscapes can be at a state of equilibrium with respect to the probability of landsliding. The importance of multiple topographic factors to determine the local landslide probability highlights the value of high-resolution DEM data. While we have used the 1 arcsec resolution SRTM data, higher-resolution DEM data are increasingly available. Given that we are able to map landslides at finer and finer resolutions as very high resolution satellite imagery becomes available, combining these new detailed inventories with DEMs of similar resolutions is likely to provide further insights about landslide location within the landscape.

While we have not undertaken a detailed assessment of the rainfall that triggered these landslides, we emphasize that variability in rainfall is likely to explain a significant degree of variability in where landslides occur (e.g., Marc et al., 2019). Future work should assess each of these inventories with respect to the rainfall that triggered the significant landsliding, which could yield important insights into the relationship between intense rainfall and landslide occurrence.

Data availability

All data used in this study are provided in the Supplement, and all methods are detailed above.


The supplement related to this article is available online at:

Author contributions

All authors were involved in study conceptualization and writing of the manuscript. RE, PA and OM conducted landslide mapping. RE, HT and OM conducted data analysis.

Competing interests

The contact author has declared that neither they nor their co-authors have any competing interests.


Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.


Robert Emberson, Pukar Amatya and Dalia B. Kirschbaum are supported by a NASA Disasters program grant, 18-DISASTER18-0022.

Financial support

This research has been supported by the Science Mission Directorate (grant no. 18-DISASTER18-0022). This work utilized data made available through the NASA Commercial Smallsat Data Acquisition (CSDA) Program.

Review statement

This paper was edited by Paola Reichenbach and reviewed by Alexander Densmore and one anonymous referee.


Adriano, B., Yokoya, N., Miura, H., Matsuoka, M., and Koshimura, S.: A semiautomatic pixel-object method for detecting landslides using multitemporal ALOS-2 intensity images, Remote Sens., 12, 561,, 2020. 

Amatya, P., Kirschbaum, D., and Stanley, T.: Use of very high-resolution optical data for landslide mapping and susceptibility analysis along the Karnali highway, Nepal, Remote Sens., 11, 2284,, 2019. 

Amatya, P., Kirschbaum, D., Stanley, T., and Tanyas, H.: Landslide mapping using object-based image analysis and open source tools, Eng. Geol., 282, 106000,, 2021. 

Badoux, A., Andres, N., and Turowski, J. M.: Damage costs due to bedload transport processes in Switzerland, Nat. Hazards Earth Syst. Sci., 14, 279–294,, 2014. 

Behling, R., Roessner, S., Segl, K., Kleinschmit, B., and Kaufmann, H.: Robust automated image co-registration of optical multi-sensor time series data: Database generation for multi-temporal landslide detection, Remote Sens., 6, 2572–2600,, 2014. 

Bekaert, D. P., Handwerger, A. L., Agram, P., and Kirschbaum, D. B.: InSAR-based detection method for mapping and monitoring slow-moving landslides in remote regions with steep and mountainous terrain: An application to Nepal, Remote Sens. Environ., 249, 111983,, 2020. 

Beven, K. J. and Kirkby, M. J.: A physically based, variable contributing area model of basin hydrology/Un modèle à base physique de zone d'appel variable de l'hydrologie du bassin versant, Hydrolog. Sci. J., 24, 43–69,, 1979. 

Bogaard, T. A. and Greco, R.: Landslide hydrology: from hydrology to pore pressure, Wiley Interdisciplin. Rev.: Water, 3, 439–459,, 2016. 

Bookhagen, B. and Strecker, M. R.: Spatiotemporal trends in erosion rates across a pronounced rainfall gradient: Examples from the southern Central Andes, Earth Planet. Sc. Lett., 327–328, 97–110,, 2012. 

Broeckx, J., Maertens, M., Isabirye, M., Vanmaercke, M., Namazzi, B., Deckers, J., Tamale, J., Jacobs, L., Thiery, W., Kervyn, M., Vranken, L., and Poesen, J.: Landslide susceptibility and mobilization rates in the Mount Elgon region, Uganda, (October 2018), Landslides, 16, 571–584,, 2019. 

Budimir, M. E. A., Atkinson, P. M., and Lewis, H. G.: A systematic review of landslide probability mapping using logistic regression, Landslides, 12, 419–436,, 2015. 

Burrows, K., Walters, R. J., Milledge, D., and Densmore, A. L.: A systematic exploration of satellite radar coherence methods for rapid landslide detection, Nat. Hazards Earth Syst. Sci., 20, 3197–3214,, 2020. 

Camilo, D. C., Lombardo, L., Mai, P. M., Dou, J., and Huser, R.: Environmental Modelling & Software Handling high predictor dimensionality in slope-unit-based landslide susceptibility models through LASSO-penalized Generalized Linear Model, Environ. Model. Softw., 97, 145–156,, 2017. 

Casagli, N., Frodella, W., Morelli, S., Tofani, V., Ciampalini, A., Interieri, C., Raspini, F., Rossi, G., Tanteri, L., and Lu, P.: Spaceborne, UAV and ground-based remote sensing techniques for landslide mapping, monitoring and early warning, Geoenviron. Disast., 4, 1–23,, 2017. 

Chang, K., Chiang, S., Chen, Y., and Mondini, A. C.: Modeling the spatial occurrence of shallow landslides triggered by typhoons, Geomorphology, 208, 137–148,, 2014. 

Chen, Y., Chang, K., Chiu, Y., Lau, S., Lee, H., and County, T.: Quantifying rainfall controls on catchment-scale landslide erosion in Taiwan, Earth Surf. Proc. Land., 382, 372–382,, 2013. 

Conrad, J. L., Morphew, M. D., Baum, R. L., and Mirus, B. B.: HydroMet: A New Code for Automated Objective Optimization of Hydrometeorological Thresholds for Landslide Initiation, Water, 13, 1752,, 2021. 

Costanzo, D., Rotigliano, E., Irigaray, C., Jiménez-Perálvarez, J. D., and Chacón, J.: Factors selection in landslide susceptibility modelling on large scale following the gis matrix method: Application to the river Beiro basin (Spain), Nat. Hazards Earth Syst. Sci., 12, 327–340,, 2012. 

Densmore, A. L. and Hovius, N.: Topographic fingerprints of bedrock landslides, Geology, 28, 371–374,<371:TFOBL>2.0.CO;2, 2000. 

Dietrich, W. E., Reiss, R., Hsu, M. L., and Montgomery, D. R.: A process-based model for colluvial soil depth and shallow landsliding using digital elevation data, Hydrol. Process., 9, 383–400,, 1995. 

Domej, G., Bourdeau, C., Lenti, L., Martino, S., and Piuta, K.: Mean landslide geometries inferred from a global database of earthquake-and non-earthquake-triggered landslides, Ital. J. Eng. Geol. Environ., 17, 87–107,, 2017. 

Emberson, R., Kirschbaum, D., and Stanley, T.: New global characterisation of landslide exposure, Nat. Hazards Earth Syst. Sci., 20, 3413–3424,, 2020. 

Emberson, R. A., Kirschbaum, D. B., and Stanley, T.: Landslide hazard and exposure modelling in data-poor regions: the example of the Rohingya refugee camps in Bangladesh, Earth's Future, 9, e2020EF001666,, 2021. 

Friedman, J., Hastie, T., Tibshirani, R., Narasimhan, B., Tay, K., Simon, N., and Qian, J.: Lasso and Elastic-Net Regularized Generalized Linear Models, CRAN,, 2021. 

Froude, M. J. and Petley, D. N.: Global fatal landslide occurrence from 2004 to 2016, Nat. Hazards Earth Syst. Sci., 18, 2161–2181,, 2018. 

García-Rodríguez, M. J., Malpica, J. A., Benito, B., and Díaz, M.: Susceptibility assessment of earthquake-triggered landslides in El Salvador using logistic regression, Geomorphology, 95, 172–191,, 2008. 

Geiger, R.: Klassifikation der Klimate nach W. Köppen, in: Landolt-Börnstein – Zahlenwerte und Funktionen aus Physik, Chemie, Astronomie, Geophysik und Technik, alte Serie, Springer, Berlin, 603–607, 1954. 

Goetz, J. N., Brenning, A., Petschko, H., and Leopold, P.: Evaluating machine learning and statistical prediction techniques for landslide susceptibility modeling, Comput. Geosci., 81, 1–11,, 2015. 

Guzzetti, F., Cardinali, M., and Reichenbach, P.: The Influence of Structural Setting and Lithology on Landslide Type and Pattern, Environ. Eng. Geosci., II, 531–555,, 1996. 

Guzzetti, F., Cesare, A., Cardinali, M., Fiorucci, F., Santangelo, M., and Chang, K.: Landslide inventory maps: New tools for an old problem, Earth Sci. Rev., 112, 42–66,, 2012. 

Handwerger, A. L., Fielding, E. J., Huang, M., Bennett, G., Liang, C., and Schulz, W. H.: Widespread Initiation, Reactivation , and Acceleration of Landslides in the Northern California Coast Ranges due to Extreme Rainfall, J. Geophys. Res.-Earth, 124, 1782–1797,, 2019. 

Hansen, M. C., Potapov, P. V., Moore, R., Hancher, M., Turubanova, S. A., Tyukavina, A., Thau, D., Stehman, S. V., Goetz, S. J., Loveland, T. R., Kommareddy, A., Egorov, A., Chini, L., Justice, C. O., and Townshend, J. R. G.: High-Resolution Global Maps of 21st-Century Forest Cover Change, Science, 342, 850–853,, 2013. 

Harp, B. E. L., Reid, M. E., and Michael, J. A.: Hazard Analysis of Landslides Triggered by Typhoon Chata'an on July 2, 2002, in Chuuk State, Federated States of Micronesia, USGS Open-File Report 2004-1348, USGS,, 2004. 

Harp, E. L., Keefer, D. K., Sato, H. P., and Yagi, H.: Landslide inventories: The essential part of seismic landslide hazard analyses, Eng. Geol., 122, 9–21,, 2011. 

Hartmann, J. and Moosdorf, N.: The new global lithological map database GLiM: A representation of rock properties at the Earth surface, Geochem. Geophy. Geosy., 13, 1–37,, 2012. 

Hencher, S. R.: Preferential flow paths through soil and rock and their association with landslides, Hydrol. Process., 24, 1610–1630,, 2010. 

Hosmer, D. and Lemeshow, S: Applied Logistic Regression, 2nd Edn., Wiley, New York, ISBN 978-0-470-58247-3, 2000. 

Hu, X., Bürgmann, R., Lu, Z., Handwerger, A. L., Wang, T., and Miao, R.: Mobility, Thickness, and Hydraulic Diffusivity of the Slow – Moving Monroe Landslide in California Revealed by L – Band Satellite Radar Interferometry J. Geophys. Res.-Solid, 124, 7504–7518,, 2019. 

Huffman, G. J., Bolvin, D. T., Braithwaite, D., Hsu, K.-L., Joyce, R. J., Kidd, C., Nelkin, E. J., Sorooshian, S., Stocker, E. F., Tan, J., Wolff, D. B., and Xie, P.: Integrated Multi-Satellite Retrievals for the Global Precipitation Measurement (GPM) Mission (IMERG), in: Advances in Global Change Research, Springer, 343–353,, 2020. 

Iida, T.: A stochastic hydro-geomorphological model for shallow landsliding due to rainstorm, Catena, 34, 293–313,, 1999. 

Iida, T.: Theoretical research on the relationship between return period of rainfall and shallow landslides, Hydrol. Process., 18, 739–756,, 2004. 

Iverson, R. M.: Landslide triggering by rain infiltration, Water Resour. Res., 36, 1897–1910,, 2000. 

Jibson, R. W., Harp, E. L., and Michael, J. A.: A method for producing digital probabilistic seismic landslide hazard maps, Eng. Geol., 58, 271–289,, 2000. 

Jung, J. and Yun, S. H.: Evaluation of coherent and incoherent landslide detection methods based on synthetic aperture radar for rapid response: A case study for the 2018 Hokkaido landslides, Remote Sens., 12, 265,, 2020. 

Kirschbaum, D. B. and Stanley, T.: Satellite-Based Assessment of Rainfall-Triggered Landslide Hazard for Situational Awareness, Earth's Future, 6, 505–523,, 2018. 

Kirschbaum, D. B., Stanley, T., and Zhou, Y.: Spatial and temporal analysis of a global landslide catalog, Geomorphology, 249, 4–15,, 2015. 

Korup, O., Görüm, T., and Hayakawa, Y.: Without power? Landslide inventories in the face of climate change, Earth Surf. Proc. Land., 37, 92–99,, 2012. 

Larsen, I. J., Montgomery, D. R., and Korup, O.: Landslide erosion controlled by hillslope material, Nat. Geosci., 3, 247–251,, 2010. 

Liang, W. and Chan, M.: Spatial and temporal variations in the effects of soil depth and topographic wetness index of bedrock topography on subsurface saturation generation in a steep natural forested headwater catchment, J. Hydrol., 546, 405–418,, 2017. 

Lin, C.-W., Chang, W.-S., Liu, S.-H., Tsai, T.-T., Lee, S.-P., Tsang, Y.-C., Shieh, C.-L., and Tseng, C.-M.: Landslides triggered by the 7 August 2009 Typhoon Morakot in southern Taiwan, Eng. Geol., 123, 3–12,, 2011. 

Lombardo, L. and Tanyas, H.: Chrono-validation of near-real-time landslide susceptibility models via plug-in statistical simulations, Eng. Geol., 278, 105818,, 2020. 

Lombardo, L., Optiz, T., and Huser, R.: Point process-based modeling of multiple debris flow landslides using INLA: an application to the 2009 Messina disaster, Stoch. Environ. Res. Risk Assess., 32, 2179–2198,, 2018. 

Malamud, B. D., Turcotte, D. L., Guzzetti, F., and Reichenbach, P.: Landslide inventories and their statistical properties, Earth Surf. Proc. Land., 29, 687–711,, 2004. 

Marc, O. and Hovius, N.: Amalgamation in landslide maps: effects and automatic detection, Nat. Hazards Earth Syst. Sci., 15, 723–733,, 2015. 

Marc, O., Hovius, N., Meunier, P., Gorum, T., and Uchida, T.: A seismologically consistent expression for the total area and volume of earthquake-triggered landsliding, J. Geophys. Res.-Earth, 121, 640–663,, 2016. 

Marc, O., Meunier, P., and Hovius, N.: Prediction of the area affected by earthquake-induced landsliding based on seismological parameters, Nat. Hazards Earth Syst. Sci., 17, 1159–1175,, 2017. 

Marc, O., Stumpf, A., Malet, J.-P., Gosset, M., Uchida, T., and Chiang, S.-H.: Initial insights from a global database of rainfall-induced landslide inventories: the weak influence of slope and strong influence of total storm rainfall, Earth Surf. Dynam., 6, 903–922,, 2018. 

Marc, O., Gosset, M., Saito, H., Uchida, T., and Malet, J.-P.: Spatial Patterns of Storm-Induced Landslides and Their Relation to Rainfall Anomaly Maps, Geophys. Res. Lett., 46, 11167–11177,, 2019. 

Martha, T. R., Kerle, N., Van Westen, C. J., Jetten, V., and Kumar, K. V.: Object-oriented analysis of multi-temporal panchromatic images for creation of historical landslide inventories, ISPRS J. Photogram. Remote Sens., 67, 105–119,, 2012. 

Meunier, P., Hovius, N., and Haines, J. A.: Topographic site effects and the location of earthquake induced landslides, Earth Planet. Sc. Lett., 275, 221–232,, 2008. 

Milledge, D. G., Densmore, A. L., Bellugi, D., Rosser, N. J., Watt, J., Li, G., and Oven, K. J.: Simple rules to minimise exposure to coseismic landslide hazard, Nat. Hazards Earth Syst. Sci., 19, 837–856,, 2019. 

Milliman, J. and Syvitski, J. P. M.: Geomorphic Tectonic Control of Sediment Discharge to Ocean – The Importance of Small Mountainous Rivers Geomorphic/Tectonic Control of Sediment Discharge to the Ocean: The Importance of Small, J. Geol., 100, 525–544,, 1991. 

Mirus, B., Jones, E. S., Baum, R. L., Godt, J. W., Slaughter, S., Crawford, M. M., Lancaster, J., Stanley, T., Kirschbaum, D. B., Burns, W. J., Schmitt, R. G., Lindsey, K. O., and McCoy, K. M.: Landslides across the USA: occurrence, susceptibility, and data limitations, Landslides, 17, 2271–2285, 2020. 

Mondini, A. C., Santangelo, M., Rocchetti, M., Rossetto, E., Manconi, A., and Monserrat, O.: Sentinel-1 SAR amplitude imagery for rapid landslide detection, Remote Sens., 11, 760,, 2019. 

Montgomery, D. R. and Dietrich, W. E.: A physically based model for the topographic control on shallow landsliding, Water Resour. Res., 30, 1153–1171,, 1994. 

Montgomery, D. R., Schmidt, K. M., Dietrich, W. E., and McKean, J.: Instrumental record of debris flow initiation during natural rainfall: Implications for modeling slope stability, J. Geophys. Res.-Earth, 114, F01031,, 2009. 

Nowicki Jessee, M. A., Hamburger, M. W., Allstadt, K., Wald, D. J., Robeson, S. M., Tanyas, H., Hearne, M., and Thompson, E. M.: A Global Empirical Model for Near-Real-Time Assessment of Seismically Induced Landslides, J. Geophys. Res.-Earth, 123, 1835–1859,, 2018. 

Pawluszek, K., Borkowski, A., and Tarolli, P.: Sensitivity analysis of automatic landslide mapping: numerical experiments towards the best solution, Landslides, 15, 1851–1865,, 2018. 

Petley, D. Global patterns of loss of life from landslides, Geology, 40, 927–930,, 2012. 

Planet Team: Planet Application Program Interface: In Space for Life on Earth, Planet Team, San Francisco, CA, (last access: 31 March 2022), 2017. 

Prancevic, J. P., Lamb, M. P., McArdell, B. W., Rickli, C., and Kirchner, J. W.: Decreasing landslide erosion on steeper slopes in soil-mantled landscapes, Geophys. Res. Lett., 47, e2020GL087505,, 2020. 

Rault, C., Robert, A., Marc, O., Hovius, N., and Meunier, P.: Seismic and geologic controls on spatial clustering of landslides in three large earthquakes, Earth Surf. Dynam., 7, 829–839,, 2019. 

R Core Team: R: A language and environment for statistical computing, R Foundation for Statistical Computing, Vienna, Austria, (last access: 31 March 2022), 2018. 

Reichenbach, P., Rossi, M., Malamud, B. D., Mihir, M., and Guzzetti, F.: A review of statistically-based landslide susceptibility models, Earth-Sci. Rev., 180, 60–91,, 2018. 

Riley, S. J., DeGloria, S. D., and Elliot, R.: A Terrain Ruggedness Index that Quantifies Topographic Heterogeneity, Intermount. J. Sci., 5, 23–27, 1999. 

Roering, J. J., Kirchner, J. W., Sklar, L. S., and Dietrich, W. E.: Hillslope evolution by nonlinear creep and landsliding: An experimental study, Geology, 29, 143–146,<0143:HEBNCA>2.0.CO;2, 2001. 

Rossi, G., Tanteri, L., Tofani, V., Vannocci, P., Moretti, S., and Casagli, N.: Multitemporal UAV surveys for landslide mapping and characterization, Landslides, 15, 1045–1052,, 2018. 

Santangelo, M., Marchesini, I., Cardinali, M., Fiorucci, F., Rossi, M., Bucci, F., and Guzzetti, F.: A method for the assessment of the influence of bedding on landslide abundance and types, Landslides, 12, 295–309,, 2015. 

Schmitt, R. G., Tanyas, H., Nowicki Jessee, M. A., Zhu, J., Biegel, K. M., Allstadt, K. E., Jibson, R. W., Thompson, E. M., van Westen, C. J., Sato, H. P., Wald, D. J., Godt, J. W., Gorum, T., Xu, C., Rathje, E. M., and Knudsen, K. L.: An open repository of earthquake-triggered ground-failure inventories, Data Series, USGS, Reston, VA,, 2017. 

Selby, M. J.: Controls on the Stability and Inclinations of Hillslopes formed on hard rock, Earth Surf. Proc. Land., 7, 449–467,, 1982. 

Sörensen, R., Zinko, U., and Seibert, J.: On the calculation of the topographic wetness index: evaluation of different methods based on field observations, Hydrol. Earth Syst. Sci., 10, 101–112,, 2006. 

Stanley, T. A. and Kirschbaum, D. B.: A heuristic approach to global landslide susceptibility mapping, Nat. Hazards, 87, 145–164,, 2017. 

Tanyaş, H. and Lombardo, L.: Variation in landslide-affected area under the control of ground motion and topography, Eng. Geol., 260, 105229,, 2019.  

Tanyaş, H., van Westen, C. J., Allstadt, K. E., Nowicki Jessee, M. A., Gorum, T., Jibson, R. W., Godt, J. W., Sato, H. P., Schmitt, R. G., Marc, O., and Hovius, N.: Presentation and Analysis of Earthquake-Induced Landslide Inventories, J. Geophys. Res.-Earth, 122, 1991–2015,, 2017. 

Tanyaş, H., Rossi, M., Alvioli, M., van Westen, C. J., and Marchesini, I.: A global slope unit-based method for the near real-time prediction of earthquake-induced landslides, Geomorphology, 327, 126–146,, 2019. 

The Association of Japanese Geographers: The 2018 July Heavy rain in West Japan,, last access: 1 November 2019. 

Tibshirani, R.: Regression Shrinkage and Selection via the Lasso, J. Roy. Stat. Soc. Ser. B, 58, 267–288,, 1996. 

Van Den Eeckhaut, M. and Hervás, J.: Geomorphology State of the art of national landslide databases in Europe and their potential for assessing landslide susceptibility, Hazard Risk, 140, 545–558,, 2012. 

Van Den Eeckhaut, M., Hervás, J., Jaedicke, C., Malet, J.-P., Montanarella, L., and Nadim, F.: Statistical modelling of Europe-wide landslide susceptibility using limited landslide inventory data, Landslides, 9, 357–369,, 2012. 

van Westen, C. J. and Zhang, J.: Landslides and floods triggered by Hurricane Maria (18 September, 2017) in Dominica, Digital or Visual Products, UNITAR-UNOSAT, (last access: 31 March 2022), 2018. 

van Westen, C., Jetten, V., and Alkema, D.: Validating national landslide susceptibility and hazard maps for Caribbean island countries: the case of Dominica and tropical storm Erika, in: EGU General Assembly Conference Abstracts, April 2016, EPSC2016-4334, 2016. 

Wasowski, J., Keefer, D. K., and Lee, C. T.: Toward the next generation of research on earthquake-induced landslides: current issues and future challenges, Eng. Geol., 122, 1–8,, 2011. 

Weiss, A.: Topographic Position and Landforms Analysis, in: ESRI User Conference, San Diego, CA, (last access: 31 March 2022), 2001. 

Williams, J. G., Rosser, N. J., Kincey, M. E., Benjamin, J., Oven, K. J., Densmore, A. L., Milledge, D. G., Robinson, T. R., Jordan, C. A., and Dijkstra, T. A.: Satellite-based emergency mapping using optical imagery: experience and reflections from the 2015 Nepal earthquakes, Nat. Hazards Earth Syst. Sci., 18, 185–205,, 2018. 

Xu, C., Dai, F., Xu, X., and Hsi, Y.: GIS-based support vector machine modeling of earthquake-triggered landslide susceptibility in the Jianjiang River watershed, China, Geomorphology, 145–146, 70–80,, 2012. 

Short summary
Understanding where landslides occur in mountainous areas is critical to support hazard analysis as well as understand landscape evolution. In this study, we present a large compilation of inventories of landslides triggered by rainfall, including several that are described here for the first time. We analyze the topographic characteristics of the landslides, finding consistent relationships for landslide source and deposition areas, despite differences in the inventories' locations.
Final-revised paper