the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
The value of ultra-detailed survey data for an improved flood damage modelling with explicit input data uncertainty treatment: INSYDE 2.0
Mario Di Bacco
Daniela Molinari
Anna Rita Scorzini
Abstract. Accurate flood damage modelling is essential to estimate the potential impact of floods and to develop effective mitigation strategies. However, flood damage models rely on diverse sources of hazard, exposure and vulnerability data, which are often incomplete, inconsistent, or totally missing. These issues with data quality or availability introduce uncertainties in the modelling process and affect the final risk estimations. In this study, we present INSYDE 2.0, a flood damage modelling tool that integrates ultra-detailed survey and desk-based data for an enhanced reliability and informativeness of flood damage predictions, including an explicit representation of the effect of uncertainties arising from an incomplete knowledge on the variables characterizing the system under investigation.
- Preprint
(1565 KB) - Metadata XML
-
Supplement
(1635 KB) - BibTeX
- EndNote
Mario Di Bacco et al.
Status: open (until 20 Dec 2023)
-
RC1: 'Comment on nhess-2023-179', Julius Schlumberger, 21 Nov 2023
reply
Summary
The manuscript “The value of ultra-detailed survey data for an improved flood damage modelling with explicit input data uncertainty treatment: INSYDE 2.0” proposes a tailored flood impact modeling framework INSYDE to account for the lack of information/uncertainty with regards to the required micro-scale vulnerability and exposure characterstics of buildings. In this study, the framework and process of data-preparation are discussed alongside three test cases to show the benefits.
The authors have done a good job. The authors are addressing a very relevant need for better fit tools to support decision-making acknowledging uncertainty. The authors offer a comprehensive idea how to deal with limited knowledge and give insights into the sensitivities. The manuscript is generally well written and makes particular good use of Tables. The overall structure of the manuscript could be improved alongside revising some of the figures and reflecting on their use. Strengthening the discussion of the benefits and learnings for a decision-maker from using INSYIDE 2.0 compared to other models, could make this manuscript a very strong and relevant addition to the scientific community.
General comments
- The authors do a great job in justifying the two research questions to explore in this study. While this reviewer can clearly identify the evidence presented to answer the second research question. However, the first question on arguing the added value in terms of output quality and usefulness seems to be addressed marginally and should receive more attention in the results and discussion section. What can decision-makers specifically learn from INSYDE 2.0? In that context, this reviewer observed that the Authors seem not to discuss any limitations of this approach.
- Line 95: This reviewer thinks that the description of the methodological approach could benefit from improved visualization and explanation as well as restructuring the sections. Figure 1 seems complete but complex. The reviewer cannot recognize the elements mentioned in the figure in the accompanying text (or the subsequent subsection titles). This makes it very difficult to follow. Using more descriptive subsection titles aligned with the steps in an (updated?) Figure 1 could avoid this challenge. If others were to use INSYDE 2.0, would they use this methodological approach as well? If so, making a clear distinction between describing INSYDE 2.0, Preparing data for INSYDE 2.0 and Applying INSIYDE 2.0 could be beneficial in the method section.
- This reviewer is wondering whether the title (particularly: 'The value of ultra-detailed survey data') of this manuscript captures the main purpose of this study. Firstly, because the data-sources discussed in this paper are not limited to survey data. Secondly, the paper mostly focuses on exploring the effects of uncertainty.
Specific comments
- Line 23: these references seem outdated to confirm the Author’s claim regarding development and remaining limitations of flood risk modelling over the past decade.
- Line 103 – Line 106: What benefits did the Authors see in using data from a specific region to explore the sensitivities of the impacts/feature importance? Would the findings from a sensitivity analysis not offer similar insights, perhaps even more generalizable?
- Line 108 – Line 119: Did the authors generate the EDF’s? How many data points were available for fitting the distributions mentioned in Tab.1 and Tab.2 ? Since this study is addressing effects of uncertainty, it would be interesting to know on what basis these distributions are developed . This reviewer thinks it could be a good idea to add these EDF’s + fits in the Supplemental Material.
- Line 120: How were different return periods included in the EDF’s?
- Meanwhile the current text in section 2.3 is interesting and probably relevant for the functioning of the model, this reviewer does not see the direct link between the purpose of this study (include uncertainty into INSYDE) with this fix addressing the scalability problem. Since this bias seems to be mentioned in 2.1 (line 83), this reviewer would suggest to mention the change of the model in 2.1 and/or put the detailed elaboration in the Supplementary Material. Given that the change already had been applied when using INSYDE in Belgium, this seems not innovative or relevant to report here. Instead, this reviewer was expecting an elaboration of the statement in line 100 elaborating on the process of translating mixed source data into distributions or required adjustments to the (existing) probabilistic framework.
- Line 148 – line 150: Why did the authors choose to build one dataset containing two different flood types (riverine and flash floods)? Would it not be more accurate to have two separate ones, one per flood type?
- Line 152: These distributions are generated based on the Po river modelling exercise only? Or also for the two historic case studies separately?
- Line 155 – Line 163: The study would benefit from additional elaboration how these correlations are built. This reviewer can understand why authors built a synthetic correlation between inundation depth and duration (does it work differently for riverine floods and flash floods?), but much less with regards to the flow speed and flood duration. Can the Authors elaborate on these decisions? How accurate was the fitting of the correlation function based on the sample values? In case of the Po river modelling, flow depth and flow velocity were modelled and thus directly correlated already? What does d / max(d) stand for? Why did the Authors not use Copulas to account for joint probabilities?
- Line 163 – Line 165: This is not clear. It seems that he and v are the leading parameters and d, s, q are depending on these parameters. Why do we lose information? And what can this reviewer picture under “[…] the values of d*, v* and s* were then replaced with the correspondent percentiles from the datasets of d, v and s”? What is the effect of this?
- Line 174: How did the authors end up with the number of 5000 hypothetical buildings? Did the authors explore the convergence behavior? When looking into uncertainty and Monte Carlo sampling justification of such choices should be provided.
- Line 179: In the results, readers are presented with the damage difference as the metric to explore the feature importance. Information on how this metric is calculated (e.g. aggregated vs averaged over the 5000 buildings) would clarify how to read the results.
- Line 195: per house 1000 replicates were generated to account for uncertain combination of ~ 20 parameters. Did the authors explore the convergence behavior of the results to confirm that this choice is reliable (see comment regarding line 174)?
- Line 215 – Line 220: Here the Authors mention INSYDE and its benefits. Showcasing the utility of INSYDE 2.0 would not only be towards a decision-maker but also towards the previous version INSYDE. It would be interesting to see the results of the old INSYDE alongside INSYDE 2.0 .
- Line 220: Table 3 is very helpful. This reviewer would suggest to add the details regarding the synthetic case study in that table as well for overview purposes.
- Line 224: The benefit of section 3.1 is not clear to this reviewer. It seems to focus on the pairwise occurrence of parameters. It is unclear how it offers evidence to answer the initially proposed research questions. This reviewer thus suggests to either incorporate it in the methodology section or place it in the Supplemental Material. For this reviewer, pairwise occurrence is just one of the different elements in the data sampling process. For example, the pairwise occurrence in the collected information used for the sampling might be of additional interest (to gain insight into uncertainty progression). At the same time, Figures 2 and 3 seem to have an incorrect design: the subplots on the diagonal seem to be histograms, but the y-axis labels are not correct.
- Line 253: Here, an extended dataset is mentioned the first time (Elaboration in the Method section needed!). So authors are using 4 case studies? In general, what is the justification for the three different cases? What added benefit do the authors see by adding a second stylized case here?
- Figure 4: How is the damage difference calculated? The medians are barely visible. In general this figure is very colorful, while some other elements are not visible (whiskers, box whiskers for LM to YY in upper plot). The bars seem to go beyond the chosen y-axis limits on some occasions , e.g for BE (upper plot).
- Line 260: how do the correlations between he, d, v, and s play into these results?
- Line 261: 10,000 EUR per house or averaged across the entire set of buildings? Are these higher damages or lower damages? A permutation of only 10,000 EUR for each house would lead to a difference of up to 5 million EUR, which is significant again? What are the uncertainty bounds for this difference?
- Figure 5: The left plot is very useful and clear to see. Why did the Authors choose to combine the scatter plots for the other BT’s?
- Figure 7: While Table 4 is very helpful and supports the reasoning in the text, this reviewer has doubts regarding Figure 7. First of all, the use of log-log scales makes it very difficult to interpret the results since distance is not constant at different positions in the figure. As such, diverging from the diagonal has much more severe implications towards increasing Observed Damages. Secondly, this reviewer is wondering whether making use of the damage difference as in previous figures might be more informative and supportive regarding the research questions. What is a decision-maker learning from this visualization about uncertainty in modelling?
- Table 4: What is the computed expected damage? Other point: As a decision-maker, a conclusion I would draw form this Table is that Lodi and Caldogno are both cities that are on average much more vulnerable (because of building properties) than other cities in Northern Italy. Or that INSYDE underestimates damages. Can a decision-maker get any insights into how much of the model gap with reality can be attributed to the uncertain input data vs. other sources of uncertainty/error (e.g. the hazard-damage relations of the model). What lessons can a decision-maker learn from this? Elaborating a bit more on this could benefit the significance of this work.
- Line 350-352: This reviewer has not seen any evidence that confirms this claim. The advantage can only be compared to the original INSYIDE set up, or the learnings of analysing the uncertainty bands.
Citation: https://doi.org/10.5194/nhess-2023-179-RC1
Mario Di Bacco et al.
Mario Di Bacco et al.
Viewed
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
151 | 35 | 5 | 191 | 10 | 5 | 4 |
- HTML: 151
- PDF: 35
- XML: 5
- Total: 191
- Supplement: 10
- BibTeX: 5
- EndNote: 4
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1