the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
FLEMOflash – Flood Loss Estimation MOdels for companies and households affected by flash floods
Apoorva Singh
Nivedita Sairam
Kasra Rafiezadeh Shahi
Anna Buch
Melanie Fischer
Chandrika Thulaseedharan Dhanya
Heidi Kreibich
In light of the increasing losses from flash floods intensified by climate change, there is a critical need for improved loss models. Loss assessments predominantly focus on fluvial flood processes, leaving a significant gap in understanding the key drivers of flash floods and the effect of emergency response on losses. To address these gaps, we introduce FLEMOflash – a novel multivariate probabilistic Flood Loss Estimation Model compilation for flash floods. The models are developed for companies and households based on survey data collected after flash flood events in 2002, 2016, and 2021 in Germany. FLEMOflash employs a data-driven feature selection approach, combining machine learning techniques (Elastic Net, Random Forest, XGBoost) to select key drivers influencing flash flood losses and Bayesian networks to model probabilistic loss estimates, including uncertainty. Model-based findings show that in extreme hazard scenarios, successful implementation of emergency measures can reduce building losses by up to 47 % for large companies. Households who knew exactly what to do during high water depth were able to reduce their building losses by 77 % and contents losses by 55 %. Thus, FLEMOflash can support risk communication and management by providing reliable estimation of flash flood losses.
- Article
(3367 KB) - Full-text XML
-
Supplement
(1280 KB) - BibTeX
- EndNote
-
Applying Machine Learning on multi-event data reveals key drivers of flash flood losses such as flow velocity and emergency response.
-
The first probabilistic flash flood loss model provides robust estimates of company and household losses, including uncertainty information.
-
High knowledge about emergency action during extreme flash floods was found to reduce building losses of households by up to 77 %.
Flash floods characterized by their rapid onset and short duration, rank amongst the most devastating natural disasters, leading to significant loss of life and property (Zain et al., 2021). The flash flood events in western Germany and neighbouring countries during July 2021 caused an estimated USD 54 billion in losses (Munich Re, 2025), marking it as the costliest natural disaster in the history of Germany to date. Other notable examples include events in the Elbe and Danube rivers in Germany in 2002, which resulted in USD 9 billion losses (Kreibich et al., 2007). The 2017 flash floods in the Houston area of Texas during Hurricane Harvey resulted in losses ranging from USD 90 to 160 billion (Rözer et al., 2021). The 2012 event in Beijing caused total losses of USD 1.86 billion (Wang et al., 2013), and the monster flood of Pakistan in 2022 caused losses worth USD 30 billion (Chughtai, 2022). With the ongoing climate change crisis and high population density, the risk of flash floods is anticipated to increase in the future; thus, emphasizing the need for flash flood loss modelling to derive quantitative estimates of expected losses in monetary terms.
While progress has been made related to fluvial flood loss models for households (Lüdtke et al., 2019; Thieken et al., 2008), there remains a limited understanding of the variables and mechanisms influencing flash flood losses. Unlike fluvial floods that have longer lead time and slower rise in water levels (Laudan et al., 2020), flash floods are characterized by rapid and unprecedented rise in water levels. Moreover, due to high flow velocities, sediment transport, and shorter lead times, flash floods often cause more losses than fluvial floods. The sudden nature of flash floods makes them extremely difficult to predict (Dougherty and Rasmussen, 2020), necessitating loss modelling tailored to these events. Unlike floods caused by slowly rising water levels, and dyke breaches, flash floods exhibit heterogeneity in hazard characteristics such as water depth, flow velocity, inundation duration, and contamination indicators (Kreibich and Dimitrova, 2010). Furthermore, earlier studies suggest significant differences in the variables and processes that influence losses in different flood types (Mohor et al., 2020). Thus, understanding the flash flood process is crucial, despite our previous understanding of the losses caused by fluvial floods. A comprehensive understanding of the complex processes behind flash floods is essential to develop sustainable and cost-efficient flash flood risk management strategies.
Traditionally, flood loss estimation relied on univariate stage-damage functions (SDF) (Middelmann-Fernandes, 2010). To improve the description of complex damage processes, the Flood Loss Estimation MOdel (FLEMOps) for the private sector, was developed as rule-based, multivariate, deterministic model (Thieken et al., 2008). Merz et al. (2013) and Sieg et al. (2017) introduced decision tree-based damage models that explicitly quantify uncertainty associated with both data variability and model structure uncertainty through bootstrap aggregation. Subsequently, Bayesian Networks were used (BN-FLEMO), enabling the modelling of complex flood loss processes through conditional probability relationships (Lüdtke et al., 2019; Schoppa et al., 2020; Schröter et al., 2014; Vogel et al., 2018).
In parallel, various machine learning approaches have also been developed for flood loss estimation, including neural networks (Salas et al., 2023), random forests (Ghaedi et al., 2022), Bayesian regression (Mohor et al., 2021). Among these, Bayesian networks are particularly advantageous due to their probabilistic representation of conditional dependencies among multiple variables, handle missing data, and model transferability (Schröter et al., 2014). Bayesian models enhance the understanding of flood loss dynamics by quantifying uncertainty and offering probabilistic estimates. For instance, Wagenaar et al. (2018) developed a regional and temporal transferable BN-FLEMO for microscale residential applications, which was later upscaled to mesoscale by Lüdtke et al. (2019). In addition to the FLEMO typology, various synthetic, multivariate, rule-based flood loss models have been proposed for fluvial flood contexts (Amadio et al., 2019; Dottori et al., 2016; Nofal et al., 2020; Sairam et al., 2020).
However, all these loss models were developed to simulate damage processes during fluvial floods. In this study, we present the first probabilistic flash flood loss model – Flood Loss Estimation Model affected by flash floods (FLEMOflash) using a BN-based approach and gain new knowledge about flash flood damage processes based on the conditional probabilities among multiple influencing variables. We use machine learning based feature importance to select the most important variables from our dataset. The performance of FLEMOflash model is compared against conventional SDF models. Finally, we illustrate the loss processes with the Conditional Probability Table (CPT) and Markov blanket in controlling the extent of loss reduction.
2.1 Multievent empirical data in different regions
FLEMOflash is built based on self-reported flash flood losses along with associated information of the affected companies and households. The data was collected through different surveys using computer-aided telephone interviews with representatives of affected companies and households following three highly damaging flash flood occurrences in Germany (Kellermann et al., 2020; Kreibich et al., 2017). These include the 2002 event in the Elbe catchment in Eastern Germany, the 2016 heavy precipitation event, and the most recent July 2021 event in Western Germany. The variables potentially influencing losses are extracted and homogenized from the datasets of the 2002, 2016, and 2021 flood surveys. The variables are grouped into five categories, as presented in Table 1 for companies and Table 2 for households, respectively. An overview of all variables, along with the corresponding survey questions, response categories, and details on variable construction, is presented in Tables S1 and S2 in the Supplement. The loss variable represented as relative loss (rloss), is defined as the ratio between the reported loss and replacement cost on the [0, 1] interval. A value of 0 indicates no loss, while 1 indicates complete loss (Kreibich and Dimitrova, 2010; Schoppa et al., 2020; Sieg et al., 2017). For companies, losses are estimated for three categories of assets – buildings, equipment, and goods and stock. For households, losses are estimated for buildings and contents. Survey responses with no data on loss were excluded from the dataset.
Table 1List of variables for companies. The variable type stands “C” for continuous, “O” for ordinal, and “N” for nominal. An overview of all variables, along with the corresponding survey questions, response categories, and details on variable construction, is presented in Table S1.
2021 data source: Survey “Flooding in Germany in July 2021: Damage of companies”, German Research Centre for Geosciences, Deutsche Rückversicherung AG, 2022. 2016 data source: Survey “Pluvial Flooding and Flash Floods in May/June 2016: Damage of companies”, University of Potsdam, German Research Centre for Geosciences, Deutsche Rückversicherung AG, 2017. 2002 data source: Survey “Flooding in Germany in August 2002: Damage of companies”, German Research Centre for Geosciences, Deutsche Rückversicherung AG, 2003.
Table 2List of variables for households. The variable type stands “C” for continuous, “O” for ordinal, and “N” for nominal. An overview of all variables, along with the corresponding survey questions, response categories, and details on variable construction, is presented in Table S2.
2021 data source: Survey “Flooding in Germany in July 2021: Damage of private households”, University of Potsdam, data collection within the KAHR-project, funded by the German Ministry of Education and Research (BMBF, contract 01LR2102I), approved by the ethical committee of the University of Potsdam (60/2022). 2016 data source: Survey “Pluvial Flooding and Flash Floods in May/June 2016: Damage of private households”, University of Potsdam, German Research Centre for Geosciences, Deutsche Rückversicherung AG, 2017. 2002 data source: Survey “Flooding in Germany in August 2002: Damage of private households”, German Research Centre for Geosciences, Deutsche Rückversicherung AG, 2003.
2.2 Terrain analysis for the identification of flash flood cases
Since the surveys covered regions affected by both flash floods and riverine floods, we applied a terrain-based filtering to exclude riverine cases. We calculated the median slope within a 10 km radius around the location of each observation using the Digital Elevation Model with a 90 m resolution from the Shuttle Radar Topographic Mission. This process was repeated for 14 reference municipalities known to have experienced flash flood events in the past or described as prone to flash floods (see Table A1). The minimum value among the median slope from these 14 municipalities was then used as the slope threshold (see Table S4). Observations with slope values equal to or above this threshold were classified as flash flood cases. Other metrics, such as river basin concentration time, may indeed provide a more process-based characterization of flash flood potential. Nevertheless, we used slope alone as a pragmatic solution that balances two competing needs: maintaining physical relevance in identifying flash flood prone companies and households and retaining a sufficient number of data points for robust model development. The cases with longer warning lead times in the sample are likely to be due to warnings of high precipitation than to flood-specific warnings.
The final database available for developing loss models consists of 241, 379, 355, 1131, 1448 observations for company building, equipment, goods and stock, household building, and contents, respectively. The percentage of missing data for different variables in each of the asset is summarized in Fig. S1 in the Supplement (companies) and Fig. S2 (households) in the supplement. To maximise the amount of training data for model building, we employed the k nearest neighbour technique to impute the missing data. We tested a range of k-neighbours for our datasets (k=1, 3, 5, 7, 9) and selected the value with best performance.
2.3 Machine Learning-based feature selection
Flood damage processes vary by region, flood type, and asset type (Mohor et al., 2020; Sairam et al., 2021; Wagenaar et al., 2018). To derive the most significant drivers of flash flood losses from our list of variables, this study adopts a data-driven feature selection approach to the empirical data. Feature selection involves identifying variables that have the highest influence on the target variable (i.e. relative loss). We train multiple models – nonlinear models: Random Forest (RF), Extreme Gradient Boosting (XGBoost), and linear model: Elastic Net (EN).
Figure 1Illustration of the feature importance in predicting the relative loss of companies categorized into (a) Building, (b) Equipment, (c) Goods and Stock. Similarly, households categorized into (d) Building and (e) Contents. x axis denotes the weighted importance derived from an ensemble approach, combining two non-linear models (Random Forest and Extreme Gradient Boosting) and one linear model (Elastic Net).
RF is an ensemble machine learning method primarily used for classification and regression tasks, developed by Breiman (2001). RF generates an ensemble of decision trees, each trained on a random subset of the data using bootstrap sampling. At each node within these trees, a random subset of features is considered for splitting. The final prediction for a given input is obtained by averaging the predictions from all individual trees. This approach helps RF reduce overfitting and enhances the model's generalization ability. XGBoost, similarly to RF, is an ensemble learning algorithm that benefits from a decision tree-based structure. However, the key difference compared to RF is that in XGBoost, each tree corrects the errors from the previous ones. The process starts with a simple model and iteratively adds trees that focus on the residuals or errors made by the existing ensemble. With its efficient implementation, XGBoost demonstrates superior performance and handles large-scale data more effectively than RF (Chen and Guestrin, 2016). While RF and XGBoost are non-linear models, EN is a regularization technique used in linear regression, combining both Lasso (L1) and Ridge (L2) regularization penalties. It effectively addresses multicollinearity in datasets by shrinking the less influential predictors toward zero (Lasso) while additionally providing some degree of regularization to prevent overfitting (Ridge). EN's ability to handle correlated features and select relevant predictors makes it a valuable tool in regression tasks (Zou and Hastie, 2005).
During training, we employed a nested cross-validation framework with 10 splits and 10 repeats, resulting in a total of 100 evaluations. We selected the best set of hyperparameters, which obtained the least mean absolute error, which was then applied to the final feature selection. From each resulting final model, we derived the feature importance. Next, we calculated each variable's weighted feature importance and overall rank. The final selection of the variables (Fig. 1) is elaborated upon in the results section.
2.4 Probabilistic FLEMOflash development
Based on the identified features, a multivariate probabilistic Flood Loss Estimation MOdel (Bayesian Network – BN) is calibrated for predicting flash flood losses (Jensen and Nielsen, 2007; Kitson et al., 2023; Scutari and Denis, 2021). BNs are probabilistic graphical models where the predictor and target variables are connected through a directed acyclic graph (DAG). Each variable is depicted as a node, and related nodes are connected through arcs. These connections allow for the estimation of conditional probability distributions, facilitating an understanding of the underlying processes and probabilistic estimation of rloss (Vogel et al., 2018). The continuous variables are discretized using an equal frequency discretization approach (Scutari and Denis, 2021), and a discrete BN is formulated. The number of bins is determined based on the model performance against different numbers of bins – the number of bins resulting in the best model performance is chosen. The specification of a discrete BN involves defining a set of variables (X1, …, Xn), constructing a DAG representing the probabilistic dependencies among variables, and obtaining the conditional probability distribution for each variable (Xi) in the DAG, where parents(Xi) denotes the parents of node Xi in the DAG. The final joint probability distribution for the set of variables connected in a discrete BN is formulated as (Pearl, 1988):
Within the predicted bins of the discrete BN (rloss bins), we fit a continuous distribution by applying weighted sampling to the empirical loss data, resulting in a smoothed representation of the loss distribution (Schoppa et al., 2020). For further details on the BN structure learning we refer to Sect. S1 in the supplement and Fig. A1.
2.4.1 Comparison to stage damage function
We compared FLEMOflash to a univariate stage-damage function (SDF), a conventional model in flood loss estimation (Gerl et al., 2016). We implemented the linear functional form of deterministic SDF (SDF-D) and probabilistic SDF (SDF-P) to assess the added value of the multivariate and probabilistic model. SDF-D is formulated as:
where rlossi is the relative loss for a given water depth (wdi). α, β and εi are the intercept, regression coefficient, and error of observation i, respectively. Further, to implement the SDF-P, we assume that the relative loss follows a zero-and-one-inflated Beta distribution (Schoppa et al., 2020):
In the above Eq. (3), we only predict the μ, whereas other distribution parameters are assumed constant across the observations. Table S3 contains further information on the prior choice for model parameters as well as specification for Markov chain Monte Carlo sampling.
2.4.2 Model validation and sensitivity
We evaluate the performance of the FLEMOflash, SDF-D, and SDF-P models individually for the different types of assets in both, companies and households. We employed 10-fold cross-validation, which was repeated (n=100) with independent random seeds to obtain robust estimates. To address parameter sensitivity of FLEMOflash model performances due to Bayesian network structure learning, we systematically evaluated three critical factors:
-
Number of predictors (f1–f5): Sect. 2.3 identifies the ensemble-based important predictors, and the top five were used to develop the BN model. We demonstrated the model performance with varying number of predictors.
-
Number of discretization bins (b3–b8): continuous variables were binned using quantile-based stratification.
-
Number of neighbours (k1–k9): missing data were imputed with k-nearest neighbours.
For each combination, we validated the model for each cross-validation fold using three performance metrics: mean absolute error (MAE), continuous ranked probability score (CRPS), and mean bias error (MBE) (Gneiting and Katzfuss, 2014; Jensen and Nielsen, 2007; Krüger et al., 2021; Schoppa et al., 2020). Detailed information on the validation procedure and three scores used to compare the models are provided in Sect. S2.
3.1 Drivers of flash flood losses
An ensemble of linear and non-linear machine learning models ensures that both linear and non-linear relationships between the predictors and flood loss are captured. Water depth emerges as the most important predictor of damage across all asset types (Fig. 1), which is consistent with previous loss models (Kreibich et al., 2010; Merz et al., 2013; Schoppa et al., 2020; Sieg et al., 2017; Thieken et al., 2008). For companies, emergency measures success and number of employees are also significant factors influencing the flood loss estimation. Among other flood characteristics, duration is ranked fourth for building (Fig. 1a), velocity is ranked third for equipment (Fig. 1b), and contamination is the fifth most significant driver for goods and stock (Fig. 1c), respectively. In case of households, human stability and contamination are most important hazard variables after water depth.
The significance of water depth and emergency measures has also been emphasized by Hasanzadeh Nafari et al. (2016) in case of fluvial flood losses. Exposure variables such as number of employees significantly influence company losses. Additionally in case of households, losses are more influenced by variables representing flood vulnerability such as knowledge about emergency action (Fig. 1d and e). These findings are in line with the previous studies (Kreibich et al., 2005; Sairam et al., 2019; Zander et al., 2023) that highlight the potential for adaptation measures to reduce flood losses. By quantitatively comparing the varied drivers of flash flood losses, these results also emphasize the importance of multivariable loss estimation models that capture the interplay across these drivers and their influence on losses.
Although flow velocity has been identified as a significant contributor to flash flood losses (Kreibich and Dimitrova, 2010), it does not appear among the most significant factors in the current study. In our analysis, we represent hydrodynamic forces using velocity (v) and human stability (hs). While velocity provides a subjective yet direct measure of local strength of flow current, human stability reflects on the perceived difficulty of standing in flood waters. As shown in Fig. 1d and e, human stability emerges as the second most influential factor affecting loss in households, indicating that the combined effect of water depth and velocity play a crucial role for the flash flood model.
3.2 Probabilistic multivariate flash flood loss model
3.2.1 FLEMOflash Bayesian network structure
The FLEMOflash models are developed using a score-based structure learning algorithm and BN models are developed to capture the multivariate dependencies among variables. Data-driven BNs with the best performance were evaluated by domain experts to ensure consistency with the existing understanding about the underlying dynamics of flood loss processes (Fig. 2). The direction of the arrow represents an association between two variables but does not necessarily represent causality (Lüdtke et al., 2019; Sairam et al., 2020). Water depth emerged as the most important predictor for loss estimation across all asset types and is directly connected to the rloss node in all BN structures (Fig. 2). In case of companies and households, the losses are influenced predominantly by water depth. Company characteristics (number of employees and size premise) significantly impact the losses for company assets (Fig. 2a–c). The measure success (ms) predictor is directly connected to rloss in case of building and equipment (Fig. 2a and b) and indirectly connected to rloss through water depth for goods and stock (Fig. 2c).
Figure 2Bayesian Network structures for (a) Companies: Building, (b) Companies: Equipment, (c) Companies: Goods and Stock, (d) Households: Building, (e) Households: Contents, obtained from score-based structure learning algorithm.
In households, building rloss is directly connected to water depth, contamination, and knowledge about emergency action (Fig. 2d). For contents, rloss is directly connected to water depth and knowledge about emergency action, with contamination, human stability, and income also playing important roles (Fig. 2e). Additionally, human stability, which is a factor of both depth and velocity, influences household losses through water depth. The direct connection of rloss with measures success for companies' assets (Fig. 2a–c), and with knowledge about emergency action for households (Fig. 2d and e) highlights the significance in mitigating flood losses. We derived joint probability distribution for all asset types and the findings reveal trade-offs between preparation strategies for different target losses.
3.2.2 Performance and comparison
The performance of the FLEMOflash BN structure was tested considering varying numbers of predictors, bins, and k-nearest neighbors. In the BN structure, null value was assigned to one predictor at a time (from lowest-5 rank to highest-1 rank based on feature importance – Fig. 1) to examine the performance using the remaining variables (Fig. 3). The performance metrics (MAE, CRPS) vary with the number of predictors. The model's performance improved with the number of predictors. For instance, the MAE values for all asset types decreased as the number of predictors increased. Based on the performance metrics, the optimum number of predictors was found to be five. However, the Markov blanket of loss consists of two predictors (water depth and knowledge of emergency measures) for households: contents loss, and three predictors otherwise. Similarly, CRPS values also showed a declining pattern with increasing number of predictors, indicating better probabilistic predictions. The MBE was relatively stable, suggesting that bias in the prediction did not significantly change with the number of predictors.
Figure 3Model sensitivity of FLEMOflash to number of predictors (f1–f5), bins (b3–b8), number of neighbours used for data imputation (k1–k9) evaluated using mean average error (MAE), continuous ranked probability score (CRPS), and mean bias error (MBE) for the five asset types (x axis). Each bocplot summarizes 100 repetitions of a fivefold cross-validation (companies) and tenfold cross-validation (households) with randomized data partitioning. Best-performing configurations were identified through a sequential tuning process: first selecting the number of predictors based on the first panel, then optimizing bin count in the second panel with predictors fixed, and finally selecting the number of neighbours in the third panel with both previous parameters fixed. Best-performing configurations are: Companies – Buildings (C:BUI) f5, b6, k7; Companies – Equipment (C:EQU) f5, b6, k5; Companies – Goods and Stock (C:GNS) f5, b6, k9; Private Households – Buildings (P:BUI) f5, b8, k1; Private Households – Contents (P:CON) f5, b8, k3.
Examining the performance with optimal predictors while modifying the number of bins, revealed significant differences for companies but not for households, which could be attributed to the fact that the number of data points for companies is relatively limited and more heterogenous (Schoppa et al., 2020) compared to households. Company buildings model (C:BUI) with too few bins tend to lose information, resulting in higher MAE and CRPS values. Conversely, the goods and stocks model (C:GNS) performed better with fewer bins. For household models, MAE was not sensitive to the choice of bins, but model performance was slightly better with 8 bins. The MBE for company models showed fluctuations around zero, indicating some sensitivity to the choice of bins and potential bias.
The number of neighbours used for data imputation does not show any significant difference in predictive performance. The MBE showed minor variations, remaining close to zero across different numbers of neighbours, suggesting that the bias in the predictions was not significantly affected by the imputation. The kNN method of imputation assumes that the missing values can be inferred based on similarity of feature space. This may not hold equally well across variables, particularly for those with weak correlation to other features. To evaluate the robustness of the imputation process, we compared the distributions of variables before and after imputation and found them to be largely consistent (not shown for brevity). Nevertheless, the imputation process may still introduce uncertainty or reduce natural variability in the data. Future studies could benefit from sensitivity testing using alternative imputation techniques and explore models that explicitly incorporate imputation uncertainty.
Overall, building assets in both company and household sectors performed better compared to other asset types. This superior performance is due to the effective inference of the relationship between water depth and building rloss (Gerl et al., 2016; Lüdtke et al., 2019; Merz et al., 2013; Vogel et al., 2018). Additionally, the emergency response of mitigation measures for companies and knowledge about emergency action for households also boosted performance.
The FLEMOflash model with the best performance, identified in Fig. 3 was compared to SDF models (Fig. 4). The FLEMOflash model consistently outperformed both the SDF-probabilistic and SDF-deterministic models across all five asset types due to its comprehensive representation of loss processes (Schröter et al., 2014). Study by Schoppa et al. (2020) also noted a similar observation reporting that probabilistic multivariate models performed better than univariate models for fluvial flood loss estimation. These findings highlight that FLEMOflash outperformed in estimating losses while also indicating the need for improvement to address biases in the predictions.
Figure 4Comparison of FLEMOflash predictive performance against SDF-probabilistic and SDF-deterministic using MAE, CRPS, and MBE for the five assets (x axis). Each boxplot summarizes 100 repetitions of a fivefold cross-validation (companies) and tenfold cross-validation (households) with randomized data partitioning.
3.3 Description of loss processes by FLEMOflash
The loss processes described by FLEMOflash is illustrated using the predictive density of predicted losses under scenarios of hazard, exposure and vulnerability. For brevity, this section primarily focusses on the FLEMOflash model for household buildings (Fig. 5), with a similar interpretation, results of other asset types are shown in Figs. S3–S6. The nodes of the model comprise of water depth, human stability, inundation duration, contamination, knowledge about emergency action, and relative losses, each with 7, 4, 7, 5, 6, and 8 classes, respectively. The CPT was populated with joint probabilities to find the predictive density of loss given the condition of other nodes. The conditional probability of rloss based only on water depth indicates a monotonic relationship. Shallow inundations are associated with very low losses, while deeper water substantially increases the probability of severe losses (Fig. 5e). The highest probabilities are concentrated along the diagonal, confirming this trend. For instance, depths < 0.28 m are most likely associated with very low losses (<0.017), whereas depths ≥ 2.3 m are strongly associated with high losses (>0.42). Similar patterns of increasing loss probability with greater water depth are observed across all asset types (Figs. S3–S6). Water depth also influences human stability: while shallow flooding results in low instability, extreme depths markedly increase the probability of instability (0.54) (Fig. 5b).
Figure 5Conditional probability table (CPT) of the Bayesian network for the residential building. Each heatmap illustrates the conditional probabilities of a child node given its parent node. Parent node states are shown along the x axis, and child node states along the y axis. Darker shades of blue indicate higher probability values. Numerical values are displayed in each cell, with an accompanying colorbar showing the probability scale. (a) Inundation duration given contamination; (b) water depth given human stability; (c) contamination given relative loss; (d) knowledge about emergency action given relative loss and (e) relative loss given water depth.
Contamination emerges as another important driver of losses. In uncontaminated conditions (class 0), the probability of very low losses (<0.01) is high (0.82). Conversely, under severe contamination (class 4), the probability of very high losses (>0.427) increases to 0.30 (Fig. 5c), reflecting the destructive impact of oils, chemicals, and sewage entering buildings (Kreibich et al., 2005; Laudan et al., 2020). Households exposed to inundation lasting [13–50) h showed a high probability of experiencing moderate contamination levels (classes 1–2). Knowledge about emergency action shows a strong mitigating effect. The CPT (Fig. 5d) demonstrates that households with low awareness (Ke≤2) face a high probability of severe losses, whereas households with very good knowledge (Ke≥5) display a substantially higher probability of reduced losses. Comparable findings are observed for household contents (Fig. S6c). This agrees with Kreibich et al. (2021), who also reported that clear awareness of emergency actions substantially reduces damages. Importantly, socioeconomic status indirectly shapes vulnerability, as higher-income groups are more likely to report very clear knowledge of emergency actions after receiving warnings (Fig. S6b).
For companies (Figs. S3–S5), the CPT results reveal consistent patterns across buildings, equipment, and goods and stock. Smaller companies (with fewer employees or smaller premises) show higher probabilities of severe losses, whereas larger firms and premises are more strongly associated with lower loss outcomes (Figs. S3b, S4d and S5c). Across all asset types, the success of emergency measures emerges as a dominant factor, as unsuccessful measures are strongly associated with a high probability of severe losses (Figs. S3d, S4b and S5a). Contamination further amplifies losses, with severe categories linked to markedly higher probabilities of loss. Together, these results emphasize that hazard intensity (water depth, velocity, contamination), exposure (number of employees, size premises) and vulnerability factors (effectiveness of emergency measures) interactively determine relative losses for companies.
Further, we analyse to what extent these adaptation strategies are helpful in reducing rloss. Through feature selection and Bayesian Networks we identified emergency measures success (ms) and knowledge about emergency action (ke) for companies and private households respectively, as the significant variables within the Markov Blanket of rloss. We applied the FLEMOflash model to derive predictive densities of rloss. Results were summarized using the median and associated uncertainty (25th and 75th percentiles) for selected combinations of hazard, exposure, and vulnerability conditions, rather than displaying the full predictive densities (Fig. 6). For clarity of interpretation, Fig. A1 illustrates step by step how predictive densities are derived from the posterior distributions using kernel density estimation based on 1000 resampled values. Figure A2 provides an overview of the posterior and predictive densities across varying levels of measure success under same conditions of water depth and number of employees.
Figure 6FLEMOflash application for company buildings, private household buildings and contents, considering relative loss Markov blankets. The first row in each panel shows the probabilistic predictive density of relative loss on the interval [0, 1] based on the specific scenarios of hazard and exposure combination. The second to fourth rows in each panel illustrate the changes in relative loss with different levels of emergency measures success (ms) and knowledge about emergency action (ke) for the given hazard and exposure combinations.
For company building, the rloss Markov blanket consists of water depth (wd), number of employees (emp), and emergency measures success (ms). The incurred loss increases with increasing water depth but combining the water depth and number of employees, the relative loss experienced by companies with 9 to 12 employees is higher compared to those with 38 or more employees. We further investigate the role of successful implementation of emergency measures in reducing loss. For example, when considering only water depth and number of employees, companies with 38 or more employees, inundated by water depths ranging from 1.85 to 2.40 m above the ground, experience a rloss of 0.38. However, with successful implementation of emergency measures (ms=3), this rloss decreases to 0.20, representing a substantial 47 % reduction. On the other hand, when emergency measures were perceived to be ineffective (ms=1), the rloss increases to 0.50, marking a 32 % increase in estimated losses.
For household buildings, considering hazard and exposure with a water depth of ≥2.3 m above ground and no contamination, the predicted rloss is 0.22. However, when residents had clarity on emergency actions (Ke≥5) this loss decreases to 0.05, reflecting a 77 % reduction. Conversely, when residents did not have clear knowledge on emergency action (Ke≤2), the rloss rises to 0.27, a 23 % increase. Similarly, for household contents, we observed that when considering only water depths (hazard) of height 2.3 m above ground, the rloss is 0.38. With (Ke≥5), this loss drops to 0.17, a 55 % reduction. In contrast, with (Ke≤2), the rloss increases to 0.44, representing a 16 % rise. These findings highlight the potential of successful implementation of emergency measures and knowledge about emergency action in reducing flash flood losses (Barendrecht et al., 2020; Berghäuser et al., 2023; Bubeck et al., 2013; Sairam et al., 2019; Surminski and Thieken, 2017). While residents with high knowledge about emergency actions are more likely to take effective emergency measures, thereby reducing the severity of flood loss (Kreibich et al., 2005; Sairam et al., 2019), the knowledge about emergency action (ke) variable as considered in this study, does not capture which exact actions respondents undertook. Though the specific actions likely varied across respondents, empirical evidence indicates that having clear knowledge of emergency action generally contributes to reduced flood losses, consistent with previous findings (Kreibich et al., 2021).
This study introduces FLEMOflash – new set of multivariate flash flood loss models for estimating relative losses for companies and households. The machine learning based feature selection highlights key loss drivers comprising hazard, company characteristics, and emergency response. Results demonstrate that successful implementation of measures for companies and knowledge about emergency action for private households can substantially reduce losses. In extreme hazard scenarios, losses were reduced by nearly half for larger companies, and households that received timely warnings and has clear knowledge were able to reduce building losses by 77 % and contents losses by 55 %. While FLEMOflash already provides a tool to support risk analyses, and impact-based forecasting, future developments could further strengthen its applicability by integrating complementary hydrological indicators (e.g., basin concentration time), incorporating building-level susceptibility factors (e.g., construction materials, structural condition, floor count), and expanding the empirical database by including high loss observations and more diverse geographic regions. It is important to note that the model's performance and predictive capacity, as presented are specific to the empirical dataset and survey variables available for FLEMOflash, and the results should be interpreted within the context and limitations of the underlying data.
Figure A1Visualizations of the prior, posterior, and predictive distributions of rloss. (a) Empirical kernel density estimate of the prior rloss based on collected data; (b) prior distribution of rloss represented as bin weights (inverse frequency) across discretized intervals; (c) posterior distribution of rloss conditioned on and emp≥38 and (d) predictive distribution of rloss generated by resampling 1000 values using the prior bin weights and the posterior probabilities. The solid vertical line indicates the median (50th percentile), while the dotted vertical lines represent the 25th and 75th percentiles, representing the predictive uncertainty. Shaded area highlights the interquartile range.
Figure A2(a) Posterior distribution and (b) predictive density of relative loss (rloss) under condition of water depth and number of employees (emp)≥38. (c–h) Posterior and predictive distributions of rloss for varying levels of measure success (ms): subplots (c), (e), (g) present posterior distributions of rloss under three ms conditions – not at all, only limited, and yes – completely/most part — with and emp≥38. Subplots (d), (f), (h) shows corresponding predictive densities, estimated using kernel density estimation from resampled values (n=1000). In each density plot, the solid vertical line marks the median (50th percentile), while dotted vertical lines indicate the 25th and 75th percentiles, with shaded regions representing the uncertainty. The sequence from top to bottom illustrates increasing levels of emergency measures success (ms).
The survey data is partly accessible at the German flood damage database, HOWAS21 (https://doi.org/10.1594/GFZ.SDDB.HOWAS21, GFZ Helmholtz Centre for Geosciences, 2025).
The supplement related to this article is available online at https://doi.org/10.5194/nhess-26-103-2026-supplement.
Conceptualization: AS, RKG, KRS, NS, HK; Methodology: AS, RKG, NS, KRS, AB, MF, HK; Data curation: AS, RKG, MF; Writing – original draft: AS, RKG; Writing – review & editing: RKG, NS, KRS, HK; Visualization: AS, RKG; Supervision: CTD, HK.
At least one of the (co-)authors is a member of the editorial board of Natural Hazards and Earth System Sciences. The peer-review process was guided by an independent editor, and the authors also have no other competing interests to declare.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
This research has been supported by the German Federal Ministry of Education and Research (BMBF) within the framework of the AVOSS project (grant no. FKZ 02WEE1629C). Apoorva Singh acknowledges Helmholtz Information & Data Science Academy (HIDA) for providing financial support for the research stay at the Section Hydrology, GFZ. Nivedita Sairam is funded by the BMBF project “HI-CliF”, FKZ: 01LN2209A. Collection of the 2021 private household data was undertaken by the Geography and Disaster Risk Research Lab, University of Potsdam within the KAHR-project, funded by BMBF (contract 01LR2102I). Collection of the 2021 company data was undertaken by Section Hydrology, GFZ and Deutsche Rückversicherung AG, funded by the GFZ-HART initiative and Deutsche Rückversicherung AG. Collection of the 2016 data was undertaken within the DFG Research Training Group “NatRiskChange” (GRK 2043/1), funding is gratefully acknowledged, as are additional funds provided by GFZ and the Deutsche Rückversicherung AG. Collection of the 2002 data was undertaken within the German Research Network Natural Disasters (DFNK), in cooperation between GFZ and the Deutsche Rückversicherung AG, we thank the Deutsche Rückversicherung AG and BMBF (01SFR9969/5) for financial support. We would like to thank the editor and anonymous reviewers for their constructive feedback on the article.
This research has been supported by the Bundesministerium für Bildung und Forschung (grant nos. FKZ 02WEE1629C, “HI-CliF”, FKZ: 01LN2209A, 01LR2102I, and 01SFR9969/5).
The article processing charges for this open-access publication were covered by the GFZ Helmholtz Centre for Geosciences.
This paper was edited by Kai Schröter and reviewed by two anonymous referees.
Amadio, M., Scorzini, A. R., Carisi, F., Essenfelder, A. H., Domeneghetti, A., Mysiak, J., and Castellarin, A.: Testing empirical and synthetic flood damage models: the case of Italy, Nat. Hazards Earth Syst. Sci., 19, 661–678, https://doi.org/10.5194/nhess-19-661-2019, 2019.
Barendrecht, M. H., Sairam, N., Cumiskey, L., Metin, A. D., Holz, F., Priest, S. J., and Kreibich, H.: Needed: A systems approach to improve flood risk mitigation through private precautionary measures, Water Secur., 11, 100080, https://doi.org/10.1016/j.wasec.2020.100080, 2020.
Berghäuser, L., Bubeck, P., Hudson, P., and Thieken, A. H.: Identifying and characterising individual flood precautionary behaviour dynamics from panel data, Int. J. Disast. Risk Reduct., 94, 103835, https://doi.org/10.1016/j.ijdrr.2023.103835, 2023.
Breiman, L.: Random Forests, Mach. Learn., 45, 5–32, https://doi.org/10.1023/A:1010933404324, 2001.
Bronstert, A., Agarwal, A., Boessenkool, B., Crisologo, I., Fischer, M., Heistermann, M., Köhn-Reich, L., López-Tarazón, J. A., Moran, T., Ozturk, U., Reinhardt-Imjela, C., and Wendi, D.: Forensic hydro-meteorological analysis of an extreme flash flood: The 2016-05-29 event in Braunsbach, SW Germany, Sci. Total Environ., 630, 977–991, https://doi.org/10.1016/j.scitotenv.2018.02.241, 2018.
Bubeck, P., Botzen, W. J. W., Kreibich, H., and Aerts, J. C. J. H.: Detailed insights into the influence of flood-coping appraisals on mitigation behaviour, Global Environ. Change, 23, 1327–1338, https://doi.org/10.1016/j.gloenvcha.2013.05.009, 2013.
Chen, T. and Guestrin, C.: XGBoost: A Scalable Tree Boosting System, in: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD'16: The 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, California, USA, 785–794, https://doi.org/10.1145/2939672.2939785, 2016.
Chughtai, A.: Mapping the scale of damage by the catastrophic Pakistan floods infographic news Al Jazeera, https://www.aljazeera.com/news/longform/2022/9/16/mapping-the-scale-of-destruction-of-the-pakistan-floods (last access: 28 March 2025), 2022.
Dottori, F., Figueiredo, R., Martina, M. L. V., Molinari, D., and Scorzini, A. R.: INSYDE: a synthetic, probabilistic flood damage model based on explicit cost analysis, Nat. Hazards Earth Syst. Sci., 16, 2577–2591, https://doi.org/10.5194/nhess-16-2577-2016, 2016.
Dougherty, E. and Rasmussen, K. L.: Changes in Future Flash Flood–Producing Storms in the United States, J. Hydrometeorol., 21, 2221–2236, https://doi.org/10.1175/JHM-D-20-0014.1, 2020.
Gerl, T., Kreibich, H., Franco, G., Marechal, D., and Schröter, K.: A Review of Flood Loss Models as Basis for Harmonization and Benchmarking, PLoS ONE, 11, e0159791, https://doi.org/10.1371/journal.pone.0159791, 2016.
GFZ Helmholtz Centre for Geosciences: HOWAS21, GFZ Helmholtz Centre for Geosciences [data set], https://doi.org/10.1594/GFZ.SDDB.HOWAS21, 2025.
Ghaedi, H., Reilly, A. C., Baroud, H., Perrucci, D. V., and Ferreira, C. M.: Predicting flood damage using the flood peak ratio and Giovanni Flooded Fraction, PLoS ONE, 17, e0271230, https://doi.org/10.1371/journal.pone.0271230, 2022.
Gneiting, T. and Katzfuss, M.: Probabilistic Forecasting, Annu. Rev. Stat. Appl., 1, 125–151, https://doi.org/10.1146/annurev-statistics-062713-085831, 2014.
Hasanzadeh Nafari, R., Ngo, T., and Mendis, P.: An Assessment of the Effectiveness of Tree-Based Models for Multi-Variate Flood Damage Assessment in Australia, Water, 8, 282, https://doi.org/10.3390/w8070282, 2016.
Hübl, J., Heiser, M., Braito, S., Tscharner, S., Kuntner, K., Schraml, K., Falkensteiner, M., and Rabanser, E.: Ereignisdokumentation und Ereignisanalyse Rottal-Inn 2016: Band 1: Ergebnisdokumentation, Universitaet fuer Bodenkultur, Vienna, Austria, https://boku.ac.at/fileadmin/data/H03000/H87000/H87100/DAN_IAN_Reports/Rep180_Band_1_mit_Anhang.pdf (last access: 28 March 2025), 2017.
Jensen, F. V. and Nielsen, T. D.: Bayesian Networks and Decision Graphs, Springer, New York, NY, https://doi.org/10.1007/978-0-387-68282-2, 2007.
Kellermann, P., Schröter, K., Thieken, A. H., Haubrock, S.-N., and Kreibich, H.: The object-specific flood damage database HOWAS 21, Nat. Hazards Earth Syst. Sci., 20, 2503–2519, https://doi.org/10.5194/nhess-20-2503-2020, 2020.
Kitson, N. K., Constantinou, A. C., Guo, Z., Liu, Y., and Chobtham, K.: A survey of Bayesian Network structure learning, Artif. Intell. Rev., 56, 8721–8814, https://doi.org/10.1007/s10462-022-10351-w, 2023.
Kreibich, H. and Dimitrova, B.: Assessment of damages caused by different flood types, FRIAR 2010, Milan, Italy, 3–11, https://doi.org/10.2495/FRIAR100011, 2010.
Kreibich, H., Thieken, A. H., Petrow, T., Müller, M., and Merz, B.: Flood loss reduction of private households due to building precautionary measures – lessons learned from the Elbe flood in August 2002, Nat. Hazards Earth Syst. Sci., 5, 117–126, https://doi.org/10.5194/nhess-5-117-2005, 2005.
Kreibich, H., Müller, M., Thieken, A. H., and Merz, B.: Flood precaution of companies and their ability to cope with the flood in August 2002 in Saxony, Germany, Water Resour. Res., 43, 2005WR004691, https://doi.org/10.1029/2005WR004691, 2007.
Kreibich, H., Seifert, I., Merz, B., and Thieken, A. H.: Development of FLEMOcs – a new model for the estimation of flood losses in the commercial sector, Hydrolog. Sci. J., 55, 1302–1314, https://doi.org/10.1080/02626667.2010.529815, 2010.
Kreibich, H., Thieken, A., Haubrock, S., and Schröter, K.: HOWAS21, the German Flood Damage Database, in: Geophysical Monograph Series, edited by: Molinari, D., Menoni, S., and Ballio, F., Wiley, 65–75, https://doi.org/10.1002/9781119217930.ch5, 2017.
Kreibich, H., Hudson, P., and Merz, B.: Knowing What to Do Substantially Improves the Effectiveness of Flood Early Warning, B. Am. Meteorol. Soc., 102, E1450–E1463, https://doi.org/10.1175/BAMS-D-20-0262.1, 2021.
Krüger, F., Lerch, S., Thorarinsdottir, T., and Gneiting, T.: Predictive Inference Based on Markov Chain Monte Carlo Output, Int. Stat. Rev., 89, 274–301, https://doi.org/10.1111/insr.12405, 2021.
Laudan, J., Zöller, G., and Thieken, A. H.: Flash floods versus river floods – a comparison of psychological impacts and implications for precautionary behaviour, Nat. Hazards Earth Syst. Sci., 20, 999–1023, https://doi.org/10.5194/nhess-20-999-2020, 2020.
LfU – Bayerisches Landesamt für Umwelt (Ed.): Sturzfluten- und Hochwasserereignisse Mai/Juni 2016 – Wasserwirtschaftlicher Bericht, https://files.hnd.bayern.de/berichte/lfu_SturzflutenMaiJuni2016.pdf (last access: 28 March 2025), 2017.
Lüdtke, S., Schröter, K., Steinhausen, M., Weise, L., Figueiredo, R., and Kreibich, H.: A Consistent Approach for Probabilistic Residential Flood Loss Modeling in Europe, Water Resour. Res., 55, 10616–10635, https://doi.org/10.1029/2019WR026213, 2019.
Merz, B., Kreibich, H., and Lall, U.: Multi-variate flood damage assessment: a tree-based data-mining approach, Nat. Hazards Earth Syst. Sci., 13, 53–64, https://doi.org/10.5194/nhess-13-53-2013, 2013.
Middelmann-Fernandes, M. H.: Flood damage estimation beyond stage–damage functions: an Australian example, J. Flood Risk Manage., 3, 88–96, https://doi.org/10.1111/j.1753-318X.2009.01058.x, 2010.
Mohor, G. S., Hudson, P., and Thieken, A. H.: A Comparison of Factors Driving Flood Losses in Households Affected by Different Flood Types, Water Resour. Res., 56, e2019WR025943, https://doi.org/10.1029/2019WR025943, 2020.
Mohor, G. S., Thieken, A. H., and Korup, O.: Residential flood loss estimated from Bayesian multilevel models, Nat. Hazards Earth Syst. Sci., 21, 1599–1614, https://doi.org/10.5194/nhess-21-1599-2021, 2021.
Mühr, B., Daniell, J., Ehmele, F., Kron, A., Dittrich, A., and Kunz, M.: Hochwasser/Überschwemmungen Süddeutschland Mai/Juni 2016, CEDIM FDA – Forensic Disaster Analysis Group, https://www.cedim.kit.edu/download/Hochwasser_S%C3%BCddeutschland_Report_1.pdf (last access: 28 March 2025), 2016.
Munich Re: Flood risks on the rise – Greater loss prevention is needed, https://www.munichre.com/en/risks/natural-disasters/floods.html (last access: 28 March 2025), 2025.
Nofal, O. M., Van De Lindt, J. W., and Do, T. Q.: Multi-variate and single-variable flood fragility and loss approaches for buildings, Reliabil. Eng. Syst. Safe., 202, 106971, https://doi.org/10.1016/j.ress.2020.106971, 2020.
Pearl, J.: Probabilistic Reasoning in Intelligent Systems, Elsevier, https://doi.org/10.1016/C2009-0-27609-4, 1988.
Plapp, S. T.: Risk perception of natural catastrophes – an empirical investigation in six endangers areas in south and west Germany, in: Vol. 2, Karlsruher Reihe II – Risikoforschung und Versicherungsmanagement, https://books.google.co.in/books?hl=en&lr=&id=6gJvbjhXV_cC&oi=fnd&pg=PA1&dq=Wahrnehmung+von+Risiken+aus+Naturkatastrophen+Eine+...&ots=AuLYOXuVUv&sig=GYvzBl_dpwRaXt653R4E-V9gLec&redir_esc=y#v=onepage&q=Wahrnehmung von Risiken aus Naturkatastrophen Eine ...&f=false (last access: 9 January 2026), 2003.
Rözer, V., Peche, A., Berkhahn, S., Feng, Y., Fuchs, L., Graf, Haberlandt, U., Kreibich, H., Sämann, R., Sester, M., Shehu, B., Wahl, J., and Neuweiler, I.: Impact-Based Forecasting for Pluvial Floods, Earth's Future, 9, 2020EF001851, https://doi.org/10.1029/2020EF001851, 2021.
Sairam, N., Schröter, K., Lüdtke, S., Merz, B., and Kreibich, H.: Quantifying Flood Vulnerability Reduction via Private Precaution, Earth's Future, 7, 235–249, https://doi.org/10.1029/2018EF000994, 2019.
Sairam, N., Schröter, K., Carisi, F., Wagenaar, D., Domeneghetti, A., Molinari, D., Brill, F., Priest, S., Viavattene, C., Merz, B., and Kreibich, H.: Bayesian Data-Driven approach enhances synthetic flood loss models, Environ. Model. Softw., 132, 104798, https://doi.org/10.1016/j.envsoft.2020.104798, 2020.
Sairam, N., Brill, F., Sieg, T., Farrag, M., Kellermann, P., Nguyen, V. D., Lüdtke, S., Merz, B., Schröter, K., Vorogushyn, S., and Kreibich, H.: Process-Based Flood Risk Assessment for Germany, Earth's Future, 9, e2021EF002259, https://doi.org/10.1029/2021EF002259, 2021.
Salas, J., Saha, A., and Ravela, S.: Learning inter-annual flood loss risk models from historical flood insurance claims, J. Environ. Manage., 347, 118862, https://doi.org/10.1016/j.jenvman.2023.118862, 2023.
Schoppa, L., Sieg, T., Vogel, K., Zöller, G., and Kreibich, H.: Probabilistic Flood Loss Models for Companies, Water Resour. Res., 56, e2020WR027649, https://doi.org/10.1029/2020WR027649, 2020.
Schröter, K., Kreibich, H., Vogel, K., Riggelsen, C., Scherbaum, F., and Merz, B.: How useful are complex flood damage models?, Water Resour. Res., 50, 3378–3395, https://doi.org/10.1002/2013WR014396, 2014.
Scutari, M. and Denis, J.-B.: Bayesian Networks: With Examples in R, in: 2nd Edn., Chapman and Hall/CRC, Boca Raton, https://doi.org/10.1201/9780429347436, 2021.
Sieg, T., Vogel, K., Merz, B., and Kreibich, H.: Tree-based flood damage modeling of companies: Damage processes and model performance, Water Resour. Res., 53, 6050–6068, https://doi.org/10.1002/2017WR020784, 2017.
Surminski, S. and Thieken, A. H.: Promoting flood risk reduction: The role of insurance in Germany and England, Earth's Future, 5, 979–1001, https://doi.org/10.1002/2017EF000587, 2017.
Thieken, A. H., Olschewski, A., Kreibich, H., Kobsch, S., and Merz, B.: Development and evaluation of FLEMOps – a new Flood Loss Estimation MOdel for the private sector, in: WIT Transactions on Ecology and the Environment, Flood recovery, innovation and response 2008, WIT Press, London, England, 315–324, https://doi.org/10.2495/friar080301, 2008.
Thieken, A. H., Samprogna Mohor, G., Kreibich, H., and Müller, M.: Compound inland flood events: different pathways, different impacts and different coping options, Nat. Hazards Earth Syst. Sci., 22, 165–185, https://doi.org/10.5194/nhess-22-165-2022, 2022.
Vogel, K., Weise, L., Schröter, K., and Thieken, A. H.: Identifying Driving Factors in Flood-Damaging Processes Using Graphical Models, Water Resour. Res., 54, 8864–8889, https://doi.org/10.1029/2018WR022858, 2018.
Wagenaar, D., Lüdtke, S., Schröter, K., Bouwer, L. M., and Kreibich, H.: Regional and Temporal Transferability of Multivariable Flood Damage Models, Water Resour. Res., 54, 3688–3703, https://doi.org/10.1029/2017WR022233, 2018.
Wang, K., Wang, L., Wei, Y.-M., and Ye, M.: Beijing storm of July 21, 2012: observations and reflections, Nat. Hazards, 67, 969–974, https://doi.org/10.1007/s11069-013-0601-6, 2013.
Zain, A., Legono, D., Rahardjo, A. P., and Jayadi, R.: Review on Co-factors Triggering Flash Flood Occurrences in Indonesian Small Catchments, IOP Conf. Ser.: Earth Environ. Sci., 930, 012087, https://doi.org/10.1088/1755-1315/930/1/012087, 2021.
Zander, K. K., Nguyen, D., Mirbabaie, M., and Garnett, S. T.: Aware but not prepared: understanding situational awareness during the century flood in Germany in 2021, Int. J. Disast. Risk Reduct., 96, 103936, https://doi.org/10.1016/j.ijdrr.2023.103936, 2023.
Zou, H. and Hastie, T.: Regularization and Variable Selection Via the Elastic Net, J. Roy. Stat. Soc. Ser. B, 67, 301–320, https://doi.org/10.1111/j.1467-9868.2005.00503.x, 2005.