A performance-based approach to quantify  atmospheric river flood risk

Bowers, Corinne; Serafin, Katherine A.; Baker, Jack

doi:https://doi.org/10.5194/nhess-22-1371-2022

Articles | Volume 22, issue 4

https://doi.org/10.5194/nhess-22-1371-2022

© Author(s) 2022. This work is distributed under
the Creative Commons Attribution 4.0 License.

https://doi.org/10.5194/nhess-22-1371-2022

© Author(s) 2022. This work is distributed under
the Creative Commons Attribution 4.0 License.

Articles | Volume 22, issue 4

Research article

|

19 Apr 2022

Research article |

| 19 Apr 2022

A performance-based approach to quantify atmospheric river flood risk

Corinne Bowers, Katherine A. Serafin, and Jack Baker

Download

Final revised paper (published on 19 Apr 2022)
Preprint (discussion started on 12 Nov 2021)

Interactive discussion

Status: closed

RC1:
'Comment on nhess-2021-337', Anonymous Referee #1, 18 Dec 2021

General Comments

The authors provide a process-based probabilistic framework for predicting damages associated with ARs based on AR intensity and duration and antecedent hydrologic conditions. This is a useful tool. There are a number of technical innovations throughout the study. The 2019 Russian River case study contains several creative data sourcing and manipulation steps to overcome inherent data availability issues. The overall method has broader applications than AR damage prediction. It could be applied to any damaging hydrometeorogic events, including hurricanes and tropical storms. The AAL and loss exceedance curve calculations are compelling. A number of valuable insights are presented in the discussion section.

My only comment of substance is that the current ordering of sections makes the model difficult to follow as variables are introduced before being defined and the multivariate Monte Carlo integration framework is explained after presenting the series of integrals.

I'd put section 2 paragraph 1 first, then 3.2 paragraph 1, then 2.1 description of pinch point variables, then 2 framework description with the equations, then 3.2 paragraph 2 explanation of Monte Carlo integration. Or something along those lines. The current ordering was difficult for me to follow although it did all make sense at the end.

The rest of the comments are minor technical corrections / suggestions or requests for clarification. Overall this is a great contribution to the literature. I recommend accepting the manuscript after minor revisions.

- - - - -

Specific Comments / Technical Corrections

1 Introduction

26 California experiences ARs coming from a pathway called the Pineapple Express -> California often experiences ARs coming from a pathway called the Pineapple Express [not all ARs in CA are considered Pineapple Express storms]

31 $300 million -> $660 million

Data appendix S1 Top Counties lists damages for CA counties over 40 years at 26.53 billion of which AR damages were 24.86 billion in 2019 dollars. This translates into annual AR damages of 24.86/40 = 621.5 million. In 2021 dollars this is approximately $660 in 2021 dollars (e.g., https://www.minneapolisfed.org/about-us/monetary-policy/inflation-calculator)

1.1 Disciplinary Context

In addition to FEMA's Hazus, USACE's HEC-FIA and HEC-FDA are potential methods that can be used to convert HEC-RAS outputs to economic impacts.

2 Framework Description

109 total probability theorem -> law of total probability

116 decision variable DV appears here for the first time but is described later at 165. Either move the section on pinch point variables above the introduction of DV, etc., or note that the variables are defined in detail below.

Eq 2 consider \cdot or \times in place of asterisks.

Eq 2 consider six integrals evenly spaced rather than two sets of three integrals.

Eq 2 it would perhaps make sense to include the supports over which the pinch point variables are being integrated. But perhaps it would be a distraction.

Eq 2 ideally, the variables should be defined as they are introduced. Perhaps it is sufficient to note that the variables are defined below in Sect. 2.1.

Eq 2 An explanation of why f(Q | PRCP, HC) has two conditional variables while all other elements in the chain have only one may be useful for the reader, at some point in the text.

Maybe something like: at each point in the causal chain one pinch point variable depends on the next. Flow, Q, depends on two variables precipitation, PRCP, and antecedent hydrologic conditions, HC. Could perhaps write out the whole chain in English in the paragraph below the equation. This would be easier to follow than waiting to read the text in the next section. Or else note that all variables are defined in the following section.

Eq 2 Each pinch point variable is a scalar here?

Eq 2 I'm unclear on how \lambda(AR) works and on how \lambda(DV > x) and P(DV > x | DM) work... An additional line providing some context could be helpful.

2.1 Pinch Point Variables

(The following comments were written as I was reading through the manuscript. They could be avoided if you are clear upfront about how the integration is Monte Carlo integration and how the pinch point variables can be vectors.)

I'm a little unclear on how the causal chain works with scalar variables given that the process is spatially heterogeneous. Should I think of the PARRA process running in parallel over all locations? But what about spatial correlations?

AR is a measure of intensity, so it could be something like peak IVT or cumulative vapor transport over some time period, the duration of the AR, say. But IVT is a vector field. So you'd need to aggregate or average over time and space to get a scalar metric of intensity.

PRCP as a scalar field has the same issue. You'd need to integrate over time and space to get a scalar value. Should I think of this as some metric of precipitation over the whole watershed? Or is there a way to apply PARRA with a time series of precipitation grids as inputs?

HC, same as AR and PRCP.

Q makes more sense as a single input if you're considering a single channel, although the hydrograph is a curve which captures the duration as well as the intensity of the flow above flood stage, so I'm unclear on how this enters into the formulation.

INUN at a given location or structure is just a scalar, but over a set of n structures is an n-dimensional vector. Here, duration of inundation may also be important, in addition to depth, in terms of generating damages.

I'm unclear on how DM and DV differ. DV is a metric of impact or consequence. DM is a damage measure. So, DV could be a more broad measure of impact that is perhaps related to DM through some probabilistic relationship that is modeled using the observational record?

Ah, the variables are discussed in more detail. AR is a vector of max IVT and duration, got it.

PRCP is storm-total accumulated rainfall over the watershed. Did you experiment at all with more complex formulations for precipitation? Don't tools like HEC-RAS and LisFlood take precipitation fields as inputs?

HC watershed-average soil moisture equivalent height. There's probably some additional uncertainty introduced by averaging over the whole watershed. Upstream soil moisture may be more relevant than downstream soil moisture, for example, although these are probably highly correlated.

Q is time series of flow at inlet. This is parameterized as a 3-vector with Q_p, t_p, and m.

INUN is surface water depth at locations of interest. So this is N-dimensional.

DM is a damage ratio, expected cost to repair over the total value. Assumed to be a function of water depth.

DV actionable measure of impacts. So, it converts damage ratios into damages? So it requires observed building values then? What's the utility in splitting DM and DV? I think I can see it, but an explanation could be useful.

2.2 Component Models

This could perhaps go above the equations.

I'd put section 2 paragraph 1 first, then 3.2 paragraph 1, then 2.1 description of pinch point variables, then 2 framework description with the equations, then 3.2 paragraph 2 explanation of Monte Carlo integration. Or something along those lines. The current ordering was difficult for me to follow.

3 Case Study: Sonoma County

185 The spatially repetitive, locally severe flooding seen in Sonoma County is a signature characteristic of ARs. <- I'm not sure if I agree with this statement; I suggest removing it. The statement suggests that ARs tend to reoccur at the same locations and always generate locally severe flooding. Some ARs generate multi-basin flooding, like the 1862 event. Some locations affected by ARs flood (relatively) infrequently.

3.2.1 Precipitation Component Model

238 mixture model 90% with WLS standard errors, 10% with distribution fit to largest 10% of events. I'm familiar with WLS but not with this approach. More detail on this method, or a reference, would be helpful.

3.2.2 Precipitation 2019 Case Study

271 We note that Sonoma County is not guaranteed to see any impacts -> We note that, according to the simulated distribution, Sonoma County... (or, according to the distribution simulated from the observational record, etc.)

3.3.2 Hydrologic Conditions 2019 Case Study

289 it is interesting that soil moisture is an "input" here and not simulated, just as AR IVT and duration are "inputs." This is explicitly captured in the flow chart and in the Eq 2 multiple integral. It might be worth emphasizing this in the description of the flow chart, for example.

291 why is observed precipitation used as an input here? Shouldn't the full precipitation distribution, derived from the input AR intensity and duration, enter here? What am I missing?

3.4.1 Flow Component Model

310 - what data were you using here? The observational precipitation record? Fed into the runoff calculation? So, you have how many observations to fit the mixture OLS model?

3.4.2 Flow 2019 Case Study

Fig 5 b - any speculation on the early streamflow peak in the 2019 event? It doesn't seem to be captured within the 90% PI. A horizontal line indicating flood stage could also be informative in this figure.

3.5.1 Inundation Component Model

344 100 year peak flow -> 100-year peak flow, etc. (make this change throughout the manuscript)

367 how many buildings were there in your domain? What year were the building footprints taken from?

3.5.2 Inundation 2019 Case Study

Figure 7 in the Data Type legend it appears that Observed is dashed and Simulated in solid. Making this more clear would be helpful.

3.6.2 Damage Measure 2019 Case Study

RESA tagging is a fascinating approach.

3.7.1 Decision Variable Component Model

Interesting approach to estimating property values from tax assessments adjusted using ACS correction factors.

3.7.2 Decision Variable 2019 Case Study

451 missing comma after i.e.

Figure 9 b - it would be useful to have a high-resolution version of this figure in the appendix, or in a data appendix.

4 Results

eq 6 - consider \cdot or \times in place of asterisk, or no multiplication symbol at all. Same comment throughout equations.

4.1 AAL

487 You could note that $156m is likely to be an overestimate given that the county-wide penetration rates are lower than the penetration rates for properties at risk.

487 What is the uncertainty around the $163m estimate?

5 Discussion

There are many valuable insights in the discussion section.

Citation: https://doi.org/10.5194/nhess-2021-337-RC1
- AC1: 'Reply on RC1', Corinne Bowers, 03 Mar 2022
  
  Please see the attached document for detailed responses to each of the reviewer's comments.
  
  Citation: https://doi.org/10.5194/nhess-2021-337-AC1
RC2:
'Comment on nhess-2021-337', Anonymous Referee #2, 10 Jan 2022
This paper provides a framework (PARRA) to quantify Atmospheric River flood risk using performance-based submodules for Sonoma County, CA. The methodology is described in detail, and a case study from a 2019 AR event was investigated. The PARRA framework is very interesting and useful for providing a mean estimate of expected losses with uncertainty bounds. However, I have some concerns about the methodology and the way it was described in the manuscript. I also have some concern about the 2019 case study that was investigated – why choose a case study that the PARRA framework barely captures in the tail of its distribution? Why not show a case study that the PARRA framework captures much better? I have some comments below that can help improve the paper. I think the paper can be accepted after some minor revisions and clarifications.

General Comments

What would the PARRA framework provide a stakeholder as the estimated losses for the 2019 AR event? The results show an average loss of about $25 million (Figure 9a), but the actual cost was $91.6 million. This actual cost is covered in the tail of the distribution provided by PARRA but is far from the mean of this distribution. Do the authors consider this an accurate assessment? Some comments on how to interpret the results with their associated uncertainties, as well as how to interpret the acceptability of the results, would be helpful.

Specific Comments / Technical Corrections

Section 1 Introduction

Line 27: Pineapple express is not the only mechanism that brings ARs to California.

Line 56: “… understanding climatology of ARs”, see Espinoza et al. 2018 and Massoud et al. 2019 who aimed to understand AR climatology in a global context.

Espinoza, Vicky, Duane E. Waliser, Bin Guan, David A. Lavers, and F. Martin Ralph. "Global analysis of climate change projection effects on atmospheric rivers." Geophysical Research Letters 45, no. 9 (2018): 4299-4308.

Massoud, E. C., V. Espinoza, B. Guan, and D. E. Waliser. "Global climate model ensemble approaches for future projections of atmospheric rivers." Earth's Future 7, no. 10 (2019): 1136-1151.

Section 2 Framework Description

Line 110: Is this theorem a version of Bayes theorem? How are they related?

Line 143: Initially it seems that the AR category score (1-5) is used as input in the PARRA framework. It isn’t until later in the manuscript that it becomes clear that AR max IVT and duration are used. The authors should clarify this earlier in the paper.

Line 147: Some precipitation can be from non-AR sources. Is this considered for the calculation of the precipitation submodule in the PARRA framework?

Line 160: take out the word ‘are’

Section 3 Case Study: Sonoma County

Line 235: Is there a citation that shows why WLS can be used to express the relationship between IVT/DUR and PRCP? This seems rather simplistic and not thorough enough to capture the estimated PRCP. According to Figure 4 there seems to be significant spread in these relationships. Perhaps the authors can explain why this choice was made.

Line 260: The mean of the distribution is way off here. Should this be a reason for concern? It seems that this methodology begins to break down for extreme AR events.

Line 266: “… a suitable representation of reality”, this is a subjective acceptance criterion, and the authors should note it as so.

Line 282: Figure 6 is mentioned before Figure 5.

Line 332/Figure 5: The initial peak on Feb 26 is not captured. Can the authors provide some comments and reasoning behind this?

Line 339: This comment is applicable for this section and for other sections. There are several choices that need to be made by the user, such as the LISFLOOD parameters. This raises the question of the PARRA method's applicability to other locations. Does the whole framework need to be re-calibrated with local data for other local case studies?

Line 352: There is no information on which surrogate model was used, and what the accuracy or efficiency of that surrogate model is. In general, there is very little information on this emulation method or how it is used. How can other readers re-produce or build on this analysis if this critical information is missing?

Line 440: On correction factors - Again, this seems like a subjective fix for applying the PARRA framework in this region. How can the framework be applied elsewhere using this methodology? Although the framework seems to be useful for Sonoma County, how can the authors show that the methodology can still be efficiently applied for other locations? Some ideas that address this question can be helpful in accepting the PARRA framework as a generally usable framework.

Figure 9: The distribution just barely captures the observed event in its tail. As mentioned above, how is this result with its uncertainty reported to a manager or a stakeholder? What is the provided answer here?

Section 4 Results

Line 477: Equation 6 - Is this equation reported anywhere in the literature? Seems like another subjective criteria that the authors implement. There needs to be more information describing this choice.

Line 482: The AAL is an interesting concept to describe the average annual losses. However, it is known that this region experiences significant swings between wet and dry years. Is it feasible for the authors to calculate what the AAL is for wet vs dry years?

Line 487: What are the uncertainties around these estimates? Do the authors provide this?

Line 522: the word ‘the’ is duplicated

Line 524: ‘Expected benefits’ - See Massoud et al. 2018, who did a similar analysis for groundwater and investigated how changes to decisions in managing water resources can impact expected changes to groundwater storage. Studies like this are starting to populate the literature.

Massoud, Elias C., Adam J. Purdy, Michelle E. Miro, and James S. Famiglietti. "Projecting groundwater storage changes in California’s Central Valley." Scientific reports 8, no. 1 (2018): 1-9.

Line 529: Take out the words ‘is of the 2019’.

Section 5 Discussion

Line 535: I would argue that these insights are helpful for planners, managers, and engineers, yet not so helpful for purely scientific investigation since many choices in the framework are purely subjective. I think it is important for the authors to make this clear throughout the paper.

Line 543: Another process that can matter here is the role of sequential ARs (i.e., multiple ARs occurring sequentially), something to consider for 'future directions'.

Line 557: Yes, but what did this do to the expected accuracy of capturing the relationships? The framework is trading potential accuracy and confidence for computational efficiency. This introduces even more uncertainty. The authors should state this.

Line 582: Component Model Alternatives - This is where some of the subjective choices of the framework can be replaced with more objective choices, and therefore can make the framework more sound for scientific analysis.

Line 589: the word ‘the’ is duplicated

Line 591: the word ‘underlying’ is duplicated

Section 6 Conclusions

Line 594: Is it possible/feasible to test another case study event that the PARRA framework accurately estimates the damages for? This can help show case the value of the PARRA framework even more than just showing the one case study from 2019 that was barely captured in the tail of the distribution.

Line 608: ‘… event fell within the expected probabilistic range …’, In the tail of the distribution. It was barely captured. The authors should be careful with how they communicate the accuracy of the provided result.
Citation: https://doi.org/10.5194/nhess-2021-337-RC2
- AC2: 'Reply on RC2', Corinne Bowers, 03 Mar 2022
  
  Please see the attached document for detailed responses to each of the reviewer's comments.
  
  Citation: https://doi.org/10.5194/nhess-2021-337-AC2

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload

ED: Publish as is (08 Mar 2022) by Olga Petrucci

AR by Corinne Bowers on behalf of the Authors (15 Mar 2022) Author's response Manuscript

Short summary

Atmospheric rivers (ARs) cause significant flooding on the US west coast. We present a new Performance-based Atmospheric River Risk Analysis (PARRA) framework that connects models of atmospheric forcings, hydrologic impacts, and economic consequences to better estimate losses from AR-induced river flooding. We apply the PARRA framework to a case study in Sonoma County, CA, USA, and show that the framework can quantify the potential benefit of flood mitigation actions such as home elevation.