the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Use of delayed ERA5-Land soil moisture products for improving landslide early warning
Nunziarita Palazzolo
Antonino Cancelliere
Robert D. Zofei
David J. Peres
Previous studies have demonstrated that incorporating soil moisture information in landslide-triggering thresholds can improve their predictive performance. ECMWF ERA5-Land reanalysis soil moisture information has proven to have this potential. However, ERA5-Land data are released with a latency of usually 5 d, which limits their immediate operational use in Landslide Early Warning Systems (LEWSs). In this study, we investigate whether delayed soil moisture data – ranging from 0 to 15 d prior to rainfall events – can still effectively inform landslide-triggering conditions. Specifically, we develop artificial neural networks (ANNs) trained on various delay times and evaluate how detection performances vary with increasing lag. We measure performances by ROC-based indices, such as the True Skill Statistic (TSS). Focusing on Sicily, Italy, our results show that even delayed soil moisture data consistently outperform models based solely on rainfall (TSS=0.68 vs. 0.59). Notably, TSS reduces only marginally, from 0.78 with no delay to 0.72 with 5 d delay, and 0.67 with 15 d delay. This performance remains higher than that obtained using only soil moisture data (without precipitation and no delay, TSS=0.53), as well as those achieved with a traditional power-law threshold based on rainfall intensity and duration (TSS=0.50) and also through ANN model using rainfall intensity and duration (TSS=0.59). These findings are, thus, promising for an operational use of ERA5-Land soil moisture products in LEWSs.
- Article
(2441 KB) - Full-text XML
- BibTeX
- EndNote
Rainfall-induced landslides rank among the most devastating natural hazards, causing extensive loss of life and severe impacts on the environment, infrastructure, and the economy (Winter et al., 2016). Between 2004 and 2016, 4862 fatal landslides were recorded worldwide, leading to an estimated 55 997 fatalities (Froude and Petley, 2018). Territorial Landslide Early Warning Systems (LEWSs) aim to promptly alert at-risk populations, thereby significantly reducing casualties and injuries. Identifying the conditions that trigger landslides is crucial for developing effective LEWSs. Early efforts primarily focused on rainfall intensity–duration thresholds (Caine, 1980; Guzzetti et al., 2007), but more recent studies highlight the value of incorporating additional hydrological variables – particularly soil moisture – to better capture the mechanisms driving slope failure (Uwihirwe et al., 2022; Mirus et al., 2018a, b; Thomas et al., 2018; Segoni et al., 2018; Wicki et al., 2021; Bogaard and Greco, 2016, 2018; Reder and Rianna, 2021; Marino et al., 2020; Conrad et al., 2021; Palazzolo et al., 2023; Meneses et al., 2019).
Machine learning methods, particularly Artificial Neural Networks (ANNs), have proven effective in handling multiple predictors for landslide forecasting, often going beyond classical parametric threshold models (e.g., power-law relationships).
Distefano et al. (2022), focusing on Sicily, showed that precipitation thresholds derived via Artificial Neural Networks (ANNs) outperform traditional power-law formulations. This evidence was further supported at broader scales. Mondini et al. (2023) demonstrated that a simple deep learning framework, based on a feed-forward network with two hidden layers of four neurons and bagging for robustness, could reliably forecast rainfall-induced landslides across Italy, achieving AUC values between 0.88 and 0.92. Alongside, Peng and Wu (2024) applied multilayer perceptron (MLP) regression to extend classical effective rainfall–duration thresholds by including daily rainfall and showed that the proportion of hazardous conditions classified as warning or severe warning increased markedly, from 41.5 % to 76.8 %. Expanding the comparison across algorithms, Harsa et al. (2023) tested Random Forest (RF), Extreme Gradient Boosting (XGBoost), Generalized Boosting Machine (GBM), Generalized Linear Model (GLM), and a feed-forward Deep Learning network with satellite-derived rainfall in Central Java (Indonesia) to build landslide event prediction models using as input of the algorithms precipitation data obtained from the global satellite mapping. They found that the GLM performed best on raw data (AUC=0.828), while the Deep Learning model achieved the best results after log-transformation (AUC=0.836). More recently, the focus shifted toward both predictive skill and interpretability. Shao et al. (2025) employed XGBoost together with SHAP (SHapley Additive exPlanations) to integrate rainfall, soil moisture, and environmental factors in Italy, reporting excellent performance (, sensitivity=0.79, specificity=0.81), and outperforming traditional E–D curves. Similarly, Wen et al. (2025) used CatBoost regression with SHAP in Fengjie County (China), highlighting daily rainfall (DR) and early effective rainfall (ER) as key variables, and identifying critical values around DR≈5 mm and ER≈15 mm above which landslide probability increases significantly (R2=0.987; RMSE=0.075). Finally, Lee et al. (2025) developed a nationwide framework for South Korea using AutoML-optimized RF models with SHAP interpretation, obtaining significant high predictive skill (AUC=0.981; accuracy=0.93), and showing that 88 % of landslides in 2023 were classified as high or very high susceptibility, supporting the model's use in daily warning up to 72 h in advance.
Together, these studies illustrate a progression from traditional rainfall thresholds toward increasingly complex machine learning frameworks for operational landslide early warning. Meanwhile, other studies have explored the benefit from the incorporation of soil-hydrologic monitoring data into ML-based models to simulate changes in soil moisture and pore pressure that predispose triggering mechanisms. Orland et al. (2020), for instance, developed a deep learning model that combined soil moisture, pore pressure, and rainfall monitoring data from landslide-prone hillslopes in the USA to predict both the timing and magnitude of hydrologic responses at various soil depths. Their findings suggest that machine learning can provide an accurate and computationally efficient alternative to empirical approaches and physical models for landslide hazard warning. Along the same line, Distefano et al. (2023) showed that incorporating ERA5-Land reanalysis soil moisture products (Muñoz-Sabater et al., 2021) – specifically the soil moisture at the onset of rainfall events – can significantly improve ANN-based landslide predictions. However, since ERA5-Land data (provided by the European Centre for Medium-Range Weather Forecasts, ECMWF) are released with a 5 d delay, their direct use in operational real-time LEWSs is constrained. Indeed, such a latency issues affects a broader range of Earth observation datasets. For instance, the NASA GPM (Global Precipitation Measurement) mission provides datasets with varying latencies, between the observation time and the publication one, varying from 4 h to 3.5 months (Huffman et al., 2019). Similarly, NASA's SMAP (Soil Moisture Active Passive) soil moisture retrievals typically have a latency of 1–3 d, with latency increasing for higher-level or gap-filled products (Dashtian et al., 2024). Against this backdrop, the present study investigates the feasibility of using specifically delayed ERA5-Land soil moisture data for real-time landslide forecasting, recognizing that data latency is a common challenge shared by other sources potentially valuable for landslide prediction studies. Within this framework, we specifically assess the extent to which different publication lags (up to 15 d) affect the performance of ANN-based landslide prediction. Indeed, although the current latency of ERA5-Land soil moisture data is approximately 5 d, this study explores a broader latency range to evaluate the sensitivity of model performance to delayed information. This choice serves a dual purpose: first, to account for potential improvements in future reanalysis products, e.g., ERA6 (https://climate.copernicus.eu/sites/default/files/2022-09/S3_Hans_Hersbach_v1.pdf, last access: 4 December 2025) as well as temporary disruptions in data availability due to maintenance or access issues; second, to investigate how the timeliness of soil moisture data affects landslide prediction capability. Understanding the extent to which performance decreases as latency increases allows us to better define the operational value of reanalysis data and identify time thresholds within which delayed soil moisture information remains effective for early warning applications. The models are evaluated using receiver operating characteristic (ROC) statistics, and the methodology is tested on the island of Sicily (Italy).
2.1 Overview and dataset creation
The proposed methodology is briefly schematized in Fig. 1.
Figure 1Scheme of the methodology applied to explore the potential use of delayed ERA5-Land soil moisture products for the identification of landslide triggering conditions.
As illustrated, the first step of the methodology involves collecting rainfall and landslide data. Specifically, the FraneItalia inventory (Calvello and Pecoraro, 2018) is employed to gather information on observed landslides. This inventory records historical landslides throughout Italy since 2010, encompassing both fatal and non-fatal events. This detailed dataset enables the identification and selection of landslide events whose main cause is intense and/or prolonged rainfall. The CTRL-T (Calculation of Thresholds for Rainfall-Induced Landslides-Tool) code (Melillo et al., 2018) has then been applied to identify rainfall events most likely responsible for the observed slope failures. Briefly, the tool comprises different modules, each designed for a specific function. One of these modules reconstructs rainfall events in terms of duration D (h) and cumulative event rainfall E (mm) by analyzing a continuous rainfall time series, along with various climate and spatial parameters. Another module, on the other hand, selects the most representative rain gauge for each landslide event. This tool allows, thus, the reconstruction of both triggering and non-triggering rainfall events. On the other side, ERA5-Land reanalysis dataset provides soil moisture values (ϑ [m3 m−3]) at four distinct depths (i.e., 0–7, 7–28, 28–100, and 100–289 cm), at the hourly scale and at a high spatial resolution. In general terms, ERA5-Land is designed to improve the representation of land surface processes. Its land surface module is natively produced at ∼9 km resolution, whereas the atmospheric forcing fields (e.g., precipitation, temperature) are statistically downscaled from the coarser ERA5 reanalysis (31 km) (Hersbach et al., 2020). This structure ensures consistency with ERA5 while enhancing the detail of land surface variables, making ERA5-Land particularly suitable for hydrological and landslide applications. Thus, multi-layered soil moisture data, from the ERA5-Land reanalysis project, at the onset of all rainfall events are then retrieved, together with the preceding values up to 15 d before. Regarding landslide information, in the present study, solely the FraneItalia inventory has been employed, in order to objectively compare the performance of the models using delayed soil moisture with our previous study utilizing non-delayed soil moisture (Distefano et al., 2023). Nonetheless, the proposed approach is fully transferable to other landslide datasets, such as ITALICA (ITAlian rainfall-induced LandslIdes CAtalogue) (Peruccacci et al., 2023), as far as the necessary information is available.
2.2 Preliminary correlation analysis
An exploratory analysis is firstly conducted, in order to understand from a purely statistical standpoint, how delay affects the information content of soil moisture data from the ERA5-Land Reanalysis. To this aim, we computed a conditional autocorrelation, i.e. the autocorrelation between the soil moisture at the beginning of triggering and non-triggering rainfall events and its values in the preceding days, with time lags ranging from 0 to 15 d with respect to the rainfall event's timing. Specifically, for each cell corresponding to a rainfall station soil moisture at each depth and at onset of rainfall events is compared with values from preceding days. With reference to the generic cell corresponding to the rainfall station, let St(i) with represent the soil moisture at the onset time t(i) of the generic rainfall event i and k the lag time (in days). We have then computed the linear correlation coefficient ρ(k) between soil moisture values at lag k and lag 0 as follows (Eq. 1):
where St(i)−k denotes soil moisture lagged by k days relative to the onset time t(i) and is the mean of St(i)−k. Equation (1) differs from the so-called autocorrelation for being conditioned to the beginning time of rainfall events. For this reason, we refer to ρ(k) as a conditional autocorrelation. Equation (1) has been applied with reference to all rainfall events, as well as to the triggering and non-triggering events.
2.3 Artificial Neural Network models (ANNs)
ANNs are a class of machine learning models designed to mimic the structure and functioning of neural systems, particularly how the human brain processes information (Haykin, 1994). They consist of numerous interconnected processing elements (i.e., neurons) that work together to solve complex tasks. Similar to human learning, ANNs improve their performance through experience, adjusting the strength of connections between neurons (synapses) to improve their ability to recognize patterns and make predictions (Grosan and Abraham, 2011; Haykin, 1994). In this study, the following input variables were considered: (i) precipitation duration D, (ii) cumulative precipitation E, (iii) volumetric soil water content S at different time-lag relatively to the initial instant of rainfall events. Regarding the lagged soil water content, we have chosen to consider the data relative only to a given lag. In other words, the ANNs for a given lag of k days consider as input soil moisture data only for that lag, and not for larger lags. All input variables considered have been transformed to their natural logarithms – this has been empirically proven to improve the efficiency of ANN calibration (Distefano et al., 2022). Figure 2 shows the architecture of the implemented ANN model and the input variables considered in this study.
Figure 2Sketch of the implemented ANN model. Soil moisture subscripts (S) indicate the soil layer (depth range) of the ERA5-Land reanalysis dataset. (a) Soil moisture in the 7–28 cm layer at increasing lags; (b) Soil moisture in the 7–28 cm layer at increasing lags together with cumulated rainfall and rainfall duration; (c) Multilayer soil moisture at increasing lags together with cumulated rainfall and rainfall duration. All variables are used as inputs to the ANN.
The ANN models were developed using the MATLAB® Deep Learning Toolbox. Training was performed with the scaled conjugate gradient backpropagation algorithm (Møller, 1993), and model performance was evaluated using the cross-entropy loss function (Kline and Berardi, 2005), which is widely adopted in classification tasks and is known to be robust to class imbalance. This ensures a reliable training process even under unbalanced conditions, where the number of reconstructed non-triggering rainfall events could exceed that of triggering events by multiple orders of magnitude. Indeed, within a study currently under preparation, we have investigated the influence of different proportions of non-triggering and triggering events. The outcome of these tests suggests that it not necessary to balance the datasets with our model setting and that even unbalanced datasets lead to more robust models, as, while using the entire information available, they reflect the actual likelihood of triggering and non-triggering events. Hence, the entire dataset, comprising both triggering and non-triggering rainfall events, was randomly split into three subsets: 70 % for training, 15 % for validation, and 15 % for testing. The allocation was performed randomly according to a uniform distribution over the dataset indices, meaning that each observation had the same probability of being assigned to any of the three subsets. To account for the variability introduced by this random sampling process, each neural network configuration was trained and evaluated 30 times, each with a different random split. This approach allows for a robust assessment of model performance and simulates realistic operational conditions, where landslide-triggering events are relatively rare. Furthermore, for each of these training sessions the number of neurons has been varied from 5 to 20, choosing at each session the number that corresponds to the highest performances.
The following simulations using different input datasets: (i) soil moisture at the second-layer depth only; (ii) rainfall duration (D, h), rainfall depth (E, mm), and soil moisture at the second-layer depth; and (iii) rainfall duration (D), rainfall depth (E), and soil moisture at all depth layers. These input combinations were selected based on the findings of Distefano et al. (2023), who reported that the highest performance was achieved when considering soil moisture from the second layer, with only slightly higher performances with all four layers.
To assess the accuracy of the ANNs' output, hence the performance of the model's prediction, ROCs (Receiver Operating Characteristics) analysis is carried out. Specifically, the True Skill Statistic (TSS) index is computed for each simulation, corresponding to every investigated combination of input variables. TSS is a measure of how much better a model performs compared to random guessing, making it a reliable measure of the model's ability, i.e., ANNs, to correctly classify both positive and negative cases, namely triggering and non-triggering rainfall events. Its computation is based on the confusion matrix which provides a detailed breakdown of the model's predictions compared to actual outcomes (Fawcett, 2006). It is a square matrix where rows represent the actual classes and columns represent the predicted classes. Thus, for a binary classification problem, like the discrimination between triggering and non-triggering conditions, the matrix distinguishes: (i) True Positives (TP) as triggering events correctly identified as triggering events; (ii) True Negatives (TN) as non-triggering events correctly identified as non-triggering events; (iii) False Positives (FP) as non-triggering events incorrectly classified as triggering events (false alarms); and (iv) False Negatives (FN) as triggering events incorrectly classified as non-triggering events (missed alarms). Therefore, TSS (Eq. 4) index is defined as the difference between the True Positive Rate or Sensitivity (TPR) (Eq. 2) and the False Positive Rate (FPR) (Eq. 3). The highest performances correspond to TSS=1 when the model correctly predicts for all the analysed data.
Thus, the network outputs, originally ranging from 0 to 1, were converted into binary predictions through a thresholding procedure, whereby an event is classified as a landslide (i.e., output = 1) if the model output exceeds a predefined threshold, and as a non-triggering event otherwise. This threshold was selected by maximizing the TSS, which ensures an optimal trade-off between TPR and FPR. While TSS is widely adopted in the literature for evaluating binary classifiers (Frattini et al., 2010), other metrics can also be employed to assess model performance and complexity. For example, information criteria such as the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC) have been effectively applied in studies comparing heterogeneous or hybrid models (Dutta et al., 2025; Patton et al., 2023; Quraishi and Choudhury, 2023) to balance model fit and parametrization. In the case of neural networks, however, the most common strategies rely on the use of independent training, validation, and test datasets, combined with early stopping, which – as in the present study – effectively control overfitting and ensure generalization.
Sicily (Southern Italy) is selected as study area to perform our analyses (Fig. 3). Annual rainfall in the region ranges on average from 700 to 800 mm, with the majority occurring during the autumn and winter months. Red triangles, within the study area map, represent the 207 landslides retrieved by FraneItalia database (Calvello and Pecoraro, 2018) from 2010 to 2018 along with their longitude–latitude coordinates (WGS84 datum) and the triggering time information, if the latter is available (otherwise, it is assumed at the end of the day). Furthermore, the mentioned landslide database distinguishes between two categories of events: single landslide events (SLEs) and areal landslide events (ALEs). SLEs generally provide more precise temporal and spatial information on the failure, whereas ALEs consist of multiple landslides within a defined area and are typically associated with lower spatial resolution – often limited to administrative units such as municipalities. Despite the potential uncertainties in the location and data for both SLEs and ALEs, we have chosen to retain all events in order to have statistically representative subsamples for training, validating and testing the ANNs. More specifically, only those landslides linked to rainfall as the main triggering factor were considered. This includes events where the mobilized material was consistent with rainfall-induced landslides, while other types (e.g., rockfalls) were excluded. The final dataset consisted of 207 rainfall-related landslides. The CTRL-T software enabled the reconstruction of triggering conditions for 144 of these events. Among them, movement type information is unavailable for 126 events (87.5 %). Of the remaining cases, 10 were identified as areal rockfalls (6.9 %), and both flows and slides accounted for 4 events each (5.6 %). Regarding temporal resolution, for 103 landslides only the day of occurrence was known, while for the remaining cases more detailed information – such as the hour or time of the day (e.g., morning, afternoon, or evening) – was available. For events with daily resolution, the failure was assumed to occur at the end of the day. When more precise timing was reported, the failure was assumed to coincide with the peak rainfall within the reported time window. However, as discussed by Peres et al. (2018), even small shifts in the assumed timing of landslide occurrence – whether anticipating or delaying the event – can affect the estimation of rainfall thresholds. Despite this limitation, the lack of precise temporal information cannot currently be resolved. Nonetheless, the consistency of the dataset across all models ensures an objective comparison of their performance.
Figure 3Study area map (credit to https://www.sitr.regione.sicilia.it/geoportale/mobile/search.html?q=dem, last access: 4 December 2025, and ESRI, 2020). Red triangles represent the locations of the occurred landslides in the study area from 2010 to 2018 and retrieved by the FraneItalia catalogue; green dots represent the locations of the 306 gauges installed in Sicily.
The map also shows the ERA5-Land grid at which extent soil moisture data are released. Specifically, ERA5-Land reanalysis data are released with a horizontal resolution of 0.1°×0.1° corresponding to about 9 km×9 km grid at the latitudes of the case study area (https://cds.climate.copernicus.eu/datasets/reanalysis-era5-land?tab=download, last access: 4 December 2025).
Green circles, instead, represent the locations of the 306 gauges installed in Sicily and managed by the three main Sicilian gauging network, namely the Regional Water Observatory (Osservatorio delle Acque, OdA), the Sicilian Agrometeorological Information Service (Servizio Informativo Agrometeorologico Siciliano, SIAS), and the Regional Department of Civil Protection (Dipartimento Regionale di Protezione Civile, DRPC). Rainfall data are, thus, provided at the hourly scale from 2009 up to 2018 (Distefano et al., 2022).
As mentioned in the previous section, CTRL-T tool enables the identification of both triggering and non-triggering rainfall events for each rainfall station based the hourly rainfall, and by setting a combination of regional parameters. These parameters are designed to account for climatic seasonality, namely at defining the warm period CW and the cold one CC; the length of dry period exceeded which an hourly measurement is considered isolated and thus removed (P1); the time periods used to remove irrelevant amounts of rainfall between rainfall sub-events (P2), and (P3); and the minimum dry period separating two rainfall events assumed to be single or multiple sub-events (P4). Moreover, even the instrumental sensitivity of a rain gauge (GS), as well the radius of the buffer to assign each landslide to the closest rain gauge (RB), and the minimum value ER above which a rainfall measurement is classified as isolated can be set. For further details on these parameters, readers are referred to Melillo et al. (2018). Table 1 summarize the parameters' values adopted for our analysis. Specifically, the RB value of 16 km was used; however, since the mean distance between rain gauges and landslides was approximately 5 km, this maximum value was seldom reached. The warm season CW was defined as the period from April to October, and the cold season CC as the period from November to March. Accordingly, parameters sws and ews, indicating the beginning of the warm season the end of the warm season, are set equal to 4 (i.e., April) and 10 (i.e., October), respectively. These values are consistent with those proposed by Melillo et al. (2015).
First, the explanatory analysis regarding conditional autocorrelation analysis has been carried out, in order to understand how information content may decrease with lag from a statistical standpoint. For all four soil layers, the conditional autocorrelation function has a decreasing trend, even though for the deepest layer the decay is minimal (Fig. 4).
Figure 4Conditional autocorrelation function for soil moisture at the four ERA5-Land depths (time lags from 1 to 15 d) (a) 0–7 cm; (b) 7–28 cm; (c) 28–100 cm; (d) 100–239 cm.
The shallowest layers, particularly the first one, appear to be the most responsive to the effect of lag, showing a notable drop in conditional autocorrelation as early as lag=1. This is consistent with the fact that shallow layers are mostly sensitive to the forcing of even small rainfall events that induce high frequency fluctuations in soil moisture, with a consequent reduction of short short-term memory in the process. As deeper layers are considered, high frequency forcings tend to be dampened by the presence of the above layers resulting in a more stable pattern in time and stronger autocorrelation, at least for the investigated lags. For subsequent analyses, as outlined in the previous sections, we focused on layer 2 (7–28 cm), which had already been identified in Distefano et al. (2023). This because, in the same study, a systematic evaluation of alternative input combinations (D, E, S1, S2, S3, S4, and their subsets) indicates that soil moisture at the second layer depth consistently provided the best model performance.
Moving to the results of ANNs for the prediction of landslide triggering conditions, starting with those using only soil moisture data without precipitation (Table 2), a TSS of 0.53 is achieved when no lag is considered. This performance is slightly higher than that obtained with a traditional approach (i.e., the power-law threshold based on rainfall intensity and duration, which has a TSS of 0.50; Palazzolo et al. (2023) but slightly lower than that achieved by Distefano et al. (2023), where an ANN model using rainfall intensity and duration lead to significantly higher performances (TSS=0.59). When a 5 d lag is considered, TSS drops further to 0.43.
Table 2Performances of ANNs using exclusively soil moisture at the second layer (7–28 cm).Bold entries indicate the performance metrics associated with the critical lag times considered in this study: no lag and a 5 d lag.
As expected, including rainfall duration (D [h]) and rainfall depth (E [mm]) besides soil moisture leads to a significant improvement in performance (Fig. 5). In more detail, when D-E-S2 dataset is considered as input to the model, a TSS of 0.76 is achieved, while incorporating the entire D-E-Sall dataset increases the TSS to 0.78. Nevertheless, in both cases, performance reduces as the time-lag increases. Focusing on a 5 d time delay a slight drop in performance is observed, with TSS values of 0.70 and 0.72, respectively. Although these values are lower, they are still above the ANN model based solely on rainfall data. As the time delay increases, performance continues to marginally decrease, reaching a TSS of around 0.68 at the 15 d time lag in both analyzed scenarios, namely a mean decrease of just 0.09 point.
Figure 5TSS values from 30 ANN training runs, with the dashed grey line representing the mean TSS across lag time. (a) Simulation set with rainfall duration (D, [h]), rainfall depth (E, [mm]) and soil moisture at the second-layer depth as input dataset; (b) Simulation set with rainfall duration (D, [h]), rainfall depth (E, [mm]) and soil moisture at all -layer depth as input dataset.
Overall, box-plots present a narrow interquartile range, supporting the adequacy of the input dataset and confirming robustness of the results. Their width further depicts how sensitive the model performance is to the random selection of subsets, with narrow boxes indicating stable outcomes across different splits. More details about simulations carried out are summarized in Tables 3 and 4. For the E-D-S2 configuration (Table 3), short delays up to 5 d determine a modest reduction in TSS, with TPR almost unchanged (TPR from 0.89 to 0.87) and a moderate increase in the false alarms rate (FPR from 0.12 to 0.16). At longer delays, performance stabilized around TSS=0.68–0.70, still with relatively high TPR but with higher false alarm rates (up to 0.20). When soil moisture from all four layers was included (Table 4), performances improve slightly systematically; but for instance, with lags up to 5 d, TPR is maintained (TPR=0.89–0.90), and FPR increases only slightly (up to 0.18). Beyond 5 d, performance gradually declines (TSS=0.67–0.69, FPR≈0.20–0.21). Compared to the single-layer setup, the multilayer configuration consistently yielded higher TSS values and lower FPRs, confirming that the vertical soil moisture profile provides a more realistic representation of antecedent hydrological conditions. Nevertheless, the increase in performance respect to the most representative layer may not be worth the increase of input complexity.
Table 3Performances of ANNs using rainfall duration, depth and soil moisture at the second layer (7–28 cm). Bold entries indicate the performance metrics associated with the critical lag times considered in this study: no lag and a 5 d lag.
Table 4Performances of ANNs using rainfall duration, depth and soil moisture at all four layers. Bold entries indicate the performance metrics associated with the critical lag times considered in this study: no lag and a 5 d lag.
In these latter two simulation sets, the results indicate that, on average, using 15 neurons provides the best performance, providing a suitable balance between model complexity and generalization capabilities.
Notably, a lower number of neurons may lead to underfitting, limiting the network's capacity to capture the patterns in the data. Conversely, increasing the neuron beyond 15 does not yield significant improvements and may introduce unnecessary complexity. In this regard, looking at the simulation set exclusively using S2 as input, the number of hidden neurons decreases at 9, on average, reasonably due to the lower complexity of the input dataset.
As a whole, results show that delayed ERA5-Land soil moisture information can be considered a useful predictor for landslide prediction, although the performance of the prediction model slightly decreases as the time-lag increases. Indeed, the improved performance, even with lagged data, can be explained by the slow variation of soil moisture during dry periods preceding rainfall events. Consequently, soil moisture at the onset of a rainfall event does not differ significantly from that recorded 5 or more days earlier. Our results corroborate the findings of Marino et al. (2020), where, based on Monte Carlo simulations, they have shown that soil moisture estimates, even when derived from a simplified hydrological model – rather than satellite or in-situ data – can significantly enhance the robustness of forecasting thresholds under different uncertainty scenarios by reducing false alarms. These findings emphasize that high-accuracy, point-scale observations, which have proven to have more information content than satellite data (Mirus et al., 2025), are not strictly necessary to extract meaningful predictive signals. On the contrary, large-scale products, when properly interpreted, can effectively reflect the temporal dynamics of soil moisture that control slope stability. Therefore, satellite or reanalysis-based soil moisture datasets represent a valuable resource to support landslide early warning systems, especially in areas lacking soil moisture monitoring networks, as in Sicily, where there are no publicly available datasets – for instance, the closest sensors from the International Soil Moisture Network (https://ismn.earth/en/, last access: 4 December 2025) are in Calabria. This is particularly relevant in contexts where current warning procedures rely solely on rainfall thresholds, as the inclusion of soil moisture, even coarse or delayed, provides useful information on catchment predisposition and can improve the reliability of alerts.
Concerning, instead, the generalizability and broader applicability of the proposed approach, it is worth noting that the rainfall events considered for model development covered a wide range of characteristics, with durations between 1 h and 10 d and cumulative totals up to 300 mm (Distefano et al., 2023). In addition, soil moisture values ranged between 0.15 and 0.45, effectively spanning the porosity domain of most common soil types. These conditions delineate the applicability domain of the trained models and support their potential transferability to different climate and geological conditions.
This study proposed an ANN-based framework to evaluate whether ERA5-Land reanalysis soil moisture data can effectively serve as real-time data for landslide prediction. Building on the work of Distefano et al. (2023), our investigation focused on the classification capabilities of ANNs to identify triggering conditions using rainfall duration, rainfall depth, and delayed multi-layered soil moisture information. Specifically, ERA5-Land data are released with an approximately 5 d delay, limiting their immediate availability for real-time landslide prediction. The analysis was conducted with a focus on Sicily, Italy. Three simulation sets were examined, progressively increasing in complexity based on the number of variables considered, with performance evaluated in terms of TSS. Significant performance was achieved when rainfall duration, rainfall depth, and soil moisture information at the second layer depth were used as input into ANNs. In this case, results demonstrated that soil moisture lagged data did not significantly undermine landslide prediction performance, as the TSS decreased from 0.76 with lag=0 to 0.70 with a 5 d lag. Even with a lag of 15 d, the prediction performance of ANNs is still significantly higher than those models using only rainfall event characteristics, corresponding to a decrease of just 0.08 points. When all available soil moisture information is used as input, together with rainfall depth and duration, the model reaches even slightly higher predictive performance, comparable with those obtained using the most representative soil moisture layer. However, it may not be worth the increase of input complexity. These findings underscore the potential of integrating ERA5-Land multi-layered soil moisture data into LEWSs, despite their delayed release. Indeed, their regular availability in both time and space enhances their suitability as a reliable tool for these systems. Further developments of this study will consider the use of rainfall forecasts at various temporal horizons in order to assess effective the operational predictive capability of the investigated tools.
The FraneItalia landslides catalog is available at https://doi.org/10.17632/zygb8jygrw.2 (Calvello and Pecoraro, 2018). Rainfall measurements are available at the website of the Servizio Informativo Agrometeorologico Siciliano (SIAS) (http://www.sias.regione.sicilia.it/, last access: 4 December 2025; SIAS, 2025) and at the Osservatorio delle Acque (http://www.bio.isprambiente.it/annalipdf/, 4 December 2025; ISPRA, 2025). Reanalysis soil moisture data are available from https://doi.org/10.24381/cds.e2161bac (Muñoz-Sabater et al., 2021).
Conceptualization was done by DJP, AC and NP; formal analysis by DJP, AC and NP; investigation by DJP and NP; methodology by DJP and NP; coding by DJP, NP and RDZ; writing the original draft by DJP, AC, NP and RDZ All authors have read and agreed to the published version of the paper.
At least one of the (co-)authors is a member of the editorial board of Natural Hazards and Earth System Sciences. The peer-review process was guided by an independent editor, and the authors also have no other competing interests to declare.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
This work was supported by the PRIN 2022 project “ITALERT – Prediction of Rainfall-INduced landslides – Improving multi-scale TerritoriAL Early warning through aRTificial intelligence” – funded by the Italian Ministry of University and Research (CUP E53D23004150006). The authors acknowledge the support of University of Catania, through the Piano di incentivi per la ricerca di Ateneo (Pia.ce.ri.) Linea 1 – ESTREMI project. The authors would like to thank Valerio Santoro for his valuable input in the early stages of this work. Finally, the authors acknowledge Roberto Greco (handling editor) and the referees for their valuable comments.
This research has been supported by University of Catania, through the Piano di incentivi per la ricerca di Ateneo (Pia.ce.ri.) Linea 1 – ESTREMI project.
This paper was edited by Roberto Greco and reviewed by Matt Thomas, Massimo Melillo, Ben Mirus, and one anonymous referee.
Bogaard, T. and Greco, R.: Invited perspectives: Hydrological perspectives on precipitation intensity-duration thresholds for landslide initiation: proposing hydro-meteorological thresholds, Nat. Hazards Earth Syst. Sci., 18, 31–39, https://doi.org/10.5194/nhess-18-31-2018, 2018.
Bogaard, T. A. and Greco, R.: Landslide hydrology: from hydrology to pore pressure, Wiley Interdiscip. Rev. Water, 3, 439–459, https://doi.org/10.1002/WAT2.1126, 2016.
Caine, N.: The Rainfall Intensity: Duration Control of Shallow Landslides and Debris Flows, Geogr. Ann. Ser. A, Phys. Geogr., 62, 23, https://doi.org/10.2307/520449, 1980.
Calvello, M. and Pecoraro, G.: FraneItalia: a catalog of recent Italian landslides, Geoenvironmental Disasters, 5, 1–16, https://doi.org/10.1186/s40677-018-0105-5, 2018.
Conrad, J. L., Morphew, M. D., Baum, R. L., and Mirus, B. B.: HydroMet: A New Code for Automated Objective Optimization of Hydrometeorological Thresholds for Landslide Initiation, Water, 13, 1752, https://doi.org/10.3390/W13131752, 2021.
Dashtian, H., Young, M. H., Young, B. E., McKinney, T., Rateb, A. M., Niyogi, D., and Kumar, S. V.: A framework to nowcast soil moisture with NASA SMAP level 4 data using in-situ measurements and deep learning, J. Hydrol. Reg. Stud., 56, 102020, https://doi.org/10.1016/J.EJRH.2024.102020, 2024.
Distefano, P., Peres, D. J., Scandura, P., and Cancelliere, A.: Brief communication: Introducing rainfall thresholds for landslide triggering based on artificial neural networks, Nat. Hazards Earth Syst. Sci., 22, 1151–1157, https://doi.org/10.5194/nhess-22-1151-2022, 2022.
Distefano, P., Peres, D. J., Piciullo, L., Palazzolo, N., Scandura, P., and Cancelliere, A.: Hydro-meteorological landslide triggering thresholds based on artificial neural networks using observed precipitation and ERA5-Land soil moisture, Landslides, 20, 2725–2739, https://doi.org/10.1007/s10346-023-02132-5, 2023.
Dutta, K., Poddar, A., Middya, A. I., and Roy, S.: Spatial variations of landslide severity with respect to meteorological and soil related factors, Nat. Hazards, 121, 3267–3291, https://doi.org/10.1007/s11069-024-06930-5, 2025.
ESRI: “Ocean” [basemap], Scale Not Given, “World Ocean Base map”, https://services.arcgisonline.com/arcgis/rest/services/Ocean/World_Ocean_Reference/MapServer (last access: 16 January 2023), 2020.
Fawcett, T.: An introduction to ROC analysis, Pattern Recognit. Lett., 27, 861–874, https://doi.org/10.1016/J.PATREC.2005.10.010, 2006.
Frattini, P., Crosta, G., and Carrara, A.: Techniques for evaluating the performance of landslide susceptibility models, Eng. Geol., 111, 62–72, https://doi.org/10.1016/J.ENGGEO.2009.12.004, 2010.
Froude, M. J. and Petley, D. N.: Global fatal landslide occurrence from 2004 to 2016, Nat. Hazards Earth Syst. Sci., 18, 2161–2181, https://doi.org/10.5194/nhess-18-2161-2018, 2018.
Grosan, C. and Abraham, A.: Artificial Neural Networks, Intell. Syst. Ref. Libr., 17, 281–323, https://doi.org/10.1007/978-3-642-21004-4_12, 2011.
Guzzetti, F., Peruccacci, S., Rossi, M., and Stark, C. P.: Rainfall thresholds for the initiation of landslides in central and southern Europe, Meteorol. Atmos. Phys., 98, 239–267, https://doi.org/10.1007/S00703-007-0262-7, 2007.
Harsa, H., Praja, A. S., Hutapea, D., Hidayat, A., Hidayat, A. M., Mulsandi, A., Suprihadi, B., Kurniawan, R., Habibie, M. N., Daniel Hutapea, T., Swarinoto, Y. S., Eka, E., Makmur, S., Fitria, W., Sapta, R., Sudewi, S., and Sukmana Praja, A.: Machine learning and artificial intelligence models development in rainfall-induced landslide prediction, Artic. IAES Int. J. Artif. Intell., 12, 262–270, https://doi.org/10.11591/ijai.v12.i1.pp262-270, 2023.
Haykin, S.: Neural networks: a comprehensive foundation, Prentice hall PTR, ISBN 0-13-273350-1, 1994.
Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., Nicolas, J., Peubey, C., Radu, R., Schepers, D., Simmons, A., Soci, C., Abdalla, S., Abellan, X., Balsamo, G., Bechtold, P., Biavati, G., Bidlot, J., Bonavita, M., De Chiara, G., Dahlgren, P., Dee, D., Diamantakis, M., Dragani, R., Flemming, J., Forbes, R., Fuentes, M., Geer, A., Haimberger, L., Healy, S., Hogan, R. J., Hólm, E., Janisková, M., Keeley, S., Laloyaux, P., Lopez, P., Lupu, C., Radnoti, G., de Rosnay, P., Rozum, I., Vamborg, F., Villaume, S., and Thépaut, J. N.: The ERA5 global reanalysis, Q. J. R. Meteorol. Soc., 146, 1999–2049, https://doi.org/10.1002/QJ.3803, 2020.
Huffman, G. J., Bolvin, D., Braithwaite, D., Hsu, K., Joyce, R., Kidd, C., Nelkin, E., Sorooshian, S., Tan, J., and Xie, P.: NASA Global Precipitation Measurement (GPM) Integrated Multi-Satellite Retrievals for GPM (IMERG), Algorithm Theoretical Basis Document (ATBD) Version 06, vol. 30, NASA/GSFC, Greenbelt, MD, USA, 2019.
ISPRA – Istituto Superiore per la Protezione e la Ricerca Ambientale: Annali idrologici Storici, http://www.bio.isprambiente.it/annalipdf/, last access: 4 December 2025.
Kline, D. M. and Berardi, V. L.: Revisiting squared-error and cross-entropy functions for training neural network classifiers, Neural Comput. Appl., 14, 310–318, https://doi.org/10.1007/s00521-005-0467-y, 2005.
Lee, S., Roh, M., Jo, H. W., Joon, K., and Lee, W. K.: Machine learning-based rainfall-induced landslide susceptibility model and short-term early warning assessment in South Korea, Landslides, 22, 2809–2827, https://doi.org/10.1007/s10346-025-02513-y, 2025.
Marino, P., Peres, D. J., Cancelliere, A., Greco, R., and Bogaard, T. A.: Soil moisture information can improve shallow landslide forecasting using the hydrometeorological threshold approach, Landslides, 17, 2041–2054, https://doi.org/10.1007/s10346-020-01420-8, 2020.
Melillo, M., Brunetti, M. T., Peruccacci, S., Gariano, S. L., and Guzzetti, F.: An algorithm for the objective reconstruction of rainfall events responsible for landslides, Landslides, 12, 311–320, https://doi.org/10.1007/s10346-014-0471-3, 2015.
Melillo, M., Brunetti, M. T., Peruccacci, S., Gariano, S. L., Roccati, A., and Guzzetti, F.: A tool for the automatic calculation of rainfall thresholds for landslide occurrence, Environ. Model. Softw., 105, 230–243, https://doi.org/10.1016/J.ENVSOFT.2018.03.024, 2018.
Meneses, B. M., Pereira, S., and Reis, E.: Effects of different land use and land cover data on the landslide susceptibility zonation of road networks, Nat. Hazards Earth Syst. Sci., 19, 471–487, https://doi.org/10.5194/nhess-19-471-2019, 2019.
Mirus, B. B., Becker, R. E., Baum, R. L., and Smith, J. B.: Integrating real-time subsurface hydrologic monitoring with empirical rainfall thresholds to improve landslide early warning, Landslides, 15, 1909–1919, https://doi.org/10.1007/s10346-018-0995-z, 2018a.
Mirus, B. B., Morphew, M. D., and Smith, J. B.: Developing Hydro-Meteorological Thresholds for Shallow Landslide Initiation and Early Warning, Water, 10, 1274, https://doi.org/10.3390/W10091274, 2018b.
Mirus, B. B., Bogaard, T., Greco, R. and Stähli, M.: Invited perspectives: Integrating hydrologic information into the next generation of landslide early warning systems, 25, 169–182, https://doi.org/10.5194/nhess-25-169-2025, 2025.
Møller, M. F.: A scaled conjugate gradient algorithm for fast supervised learning, Neural Networks, 6, 525–533, https://doi.org/10.1016/S0893-6080(05)80056-5, 1993.
Mondini, A. C., Guzzetti, F., and Melillo, M.: Deep learning forecast of rainfall-induced shallow landslides, Nat. Commun., 14, 1–10, https://doi.org/10.1038/s41467-023-38135-y, 2023.
Muñoz-Sabater, J., Dutra, E., Agustí-Panareda, A., Albergel, C., Arduini, G., Balsamo, G., Boussetta, S., Choulga, M., Harrigan, S., Hersbach, H., Martens, B., Miralles, D. G., Piles, M., Rodríguez-Fernández, N. J., Zsoter, E., Buontempo, C., and Thépaut, J.-N.: ERA5-Land: a state-of-the-art global reanalysis dataset for land applications, Earth Syst. Sci. Data, 13, 4349–4383, https://doi.org/10.5194/essd-13-4349-2021, 2021.
Orland, E., Roering, J. J., Thomas, M. A., and Mirus, B. B.: Deep Learning as a Tool to Forecast Hydrologic Response for Landslide-Prone Hillslopes, Geophys. Res. Lett., 47, e2020GL088731, https://doi.org/10.1029/2020GL088731, 2020.
Palazzolo, N., Peres, D. J., Creaco, E., and Cancelliere, A.: Using principal component analysis to incorporate multi-layer soil moisture information in hydrometeorological thresholds for landslide prediction: an investigation based on ERA5-Land reanalysis data, Nat. Hazards Earth Syst. Sci., 23, 279–291, https://doi.org/10.5194/nhess-23-279-2023, 2023.
Patton, A. I., Luna, L. V., Roering, J. J., Jacobs, A., Korup, O., and Mirus, B. B.: Landslide initiation thresholds in data-sparse regions: application to landslide early warning criteria in Sitka, Alaska, USA, Nat. Hazards Earth Syst. Sci., 23, 3261–3284, https://doi.org/10.5194/nhess-23-3261-2023, 2023.
Peng, B. and Wu, X.: Optimizing rainfall-triggered landslide thresholds for daily landslide hazard warning in the Three Gorges Reservoir area, Nat. Hazards Earth Syst. Sci., 24, 3991–4013, https://doi.org/10.5194/nhess-24-3991-2024, 2024.
Peres, D. J., Cancelliere, A., Greco, R., and Bogaard, T. A.: Influence of uncertain identification of triggering rainfall on the assessment of landslide early warning thresholds, Nat. Hazards Earth Syst. Sci., 18, 633–646, https://doi.org/10.5194/nhess-18-633-2018, 2018.
Peruccacci, S., Gariano, S. L., Melillo, M., Solimano, M., Guzzetti, F., and Brunetti, M. T.: The ITAlian rainfall-induced LandslIdes CAtalogue, an extensive and accurate spatio-temporal catalogue of rainfall-induced landslides in Italy, Earth Syst. Sci. Data, 15, 2863–2877, https://doi.org/10.5194/essd-15-2863-2023, 2023.
Quraishi, M. I. and Choudhury, J. P.: Assessment, Categorisation and Prediction of the Landslide-Affected Regions Using Soft Computing and Clustering Techniques, J. Inst. Eng. Ser. B, 104, 579–602, https://doi.org/10.1007/s40031-023-00876-1, 2023.
Reder, A. and Rianna, G.: Exploring ERA5 reanalysis potentialities for supporting landslide investigations: a test case from Campania Region (Southern Italy), 18, 1909–1924, https://doi.org/10.1007/s10346-020-01610-4, 2021.
Segoni, S., Tofani, V., Rosi, A., Catani, F., and Casagli, N.: Combination of rainfall thresholds and susceptibility maps for dynamic landslide hazard assessment at regional scale, Front. Earth Sci., 6, 375062, https://doi.org/10.3389/feart.2018.00085, 2018.
Shao, X., Yan, W., Yan, C., Zhao, W., Wang, Y., Shi, X., Dong, H., Li, T., Yu, J., Zuo, P., Zhou, Z., and Jin, J.: Explainable Machine Learning for Mapping Rainfall-Induced Landslide Thresholds in Italy, Appl. Sci., 15, 7937, https://doi.org/10.3390/app15147937, 2025.
SIAS – Servizio Informativo Agrometeorologico Siciliano (Sicilian Agro-meteorological Information Service): Dati meteorologici (Meteorological data), SIAS [data set], http://www.sias.regione.sicilia.it/, last access: 4 December 2025.
S.I.T.R - Geoportale Regione Siciliana Infrastruttura dati territoriali [dem], https://www.sitr.regione.sicilia.it/geoportale/mobile/search.html?q=dem, last access: 4 December 2025.
Thomas, M. A., Collins, B. D., and Mirus, B. B.: Assessing the Feasibility of Satellite-Based Thresholds for Hydrologically Driven Landsliding, Water Resour. Res., 55, 9006–9023, https://doi.org/10.1029/2019WR025577, 2018.
Uwihirwe, J., Hrachowitz, M., and Bogaard, T.: Integration of observed and model-derived groundwater levels in landslide threshold models in Rwanda, Nat. Hazards Earth Syst. Sci., 22, 1723–1742, https://doi.org/10.5194/nhess-22-1723-2022, 2022.
Wen, H., Yan, F., Huang, J., and Li, Y.: Interpretable machine learning models and decision-making mechanisms for landslide hazard assessment under different rainfall conditions, Expert Syst. Appl., 270, 126582, https://doi.org/10.1016/J.ESWA.2025.126582, 2025.
Wicki, A., Jansson, P.-E., Lehmann, P., Hauck, C., and Stähli, M.: Simulated or measured soil moisture: which one is adding more value to regional landslide early warning?, Hydrol. Earth Syst. Sci., 25, 4585–4610, https://doi.org/10.5194/hess-25-4585-2021, 2021.
Winter, M. G., Shearer, B., Palmer, D., Peeling, D., Harmer, C., and Sharpe, J.: The Economic Impact of Landslides and Floods on the Road Network, Procedia Eng., 143, 1425–1434, https://doi.org/10.1016/J.PROENG.2016.06.168, 2016.