18 Aug 2021
Extreme Storm Surge estimation and projection through the Metastatistical Extreme Value Distribution
 Department of Civil, Architectural, and Environmental Engineering, University of Padova, 35131, Padova, Italy
Abstract. Accurate estimates of the probability of extreme sea levels are pivotal for assessing risk and for the design of coastal defense structures. This probability is typically estimated by modelling observed sea-level records using one of a few statistical approaches. In this study we comparatively apply the Generalized Extreme Value (GEV) distribution, based on Block Maxima (BM) and Peak-Over-Threshold (POT) formulations, and the recently proposed Metastatistical Extreme Value Distribution (MEVD) to four long time series of sea-level observations distributed along European coastlines. A cross-validation approach, dividing available data into separate calibration and test subsamples, is used to compare their performances in high-quantile estimation. To address the limitations posed by the length of the observational time series, we quantify the estimation uncertainty associated with different calibration sample sizes, from 5 to 30 years. Focusing on events with a high return period, we find that the GEV-based approaches and MEVD perform similarly when considering short samples (5 years), while the MEVD estimates outperform the traditional methods when longer calibration sample sizes (10-30 years) are considered. We then investigate the influence of sea-level rise through 2100 on storm surge frequencies. The projections indicate an increase in the height of storm surges for a fixed return period that is spatially heterogeneous across the coastal locations explored.
Maria Francesca Caruso and Marco Marani
Status: open (until 04 Oct 2021)

RC1: 'Comment on nhess-2021-236', Anonymous Referee #1, 10 Sep 2021
The manuscript aims at assessing the performance of the Metastatistical Extreme Value Distribution in estimating high quantiles of extreme sea level, also including future sea-level rise. The topic discussed is relevant and important to improve the resilience of coastal systems facing the effects of a changing climate. However, a few aspects discussed in the manuscript need to be revised and discussed more in depth. The terminology and notation used also need improvement to ensure consistency throughout the manuscript and to avoid confusing readers.
From the title of the manuscript, the reader expects to read a study about extreme storm surge. However, the study’s objectives (Lines 66-68) refer to extreme sea level. Later on, at Line 150, the Authors say that they will investigate the variable h(t), the sum of tide and storm surge, that is, sea level without mean sea level. I would encourage the Authors to clearly state the variable of interest and the variable used when performing the analyses; see also the other comments below.
Information regarding MEVD, which is the main method investigated in the manuscript, is limited. The Authors say that this method guarantees “the least amount of a-priori assumption” (line 56). However, the following assumptions must be made: F(x,θ) in Eq. 2, the threshold for the ordinary values, the estimation window for parameter estimation, and the time lag to ensure independence between ordinary values. How then is this method the one with the least amount of a-priori assumptions? I suggest clarifying further the advantages of the MEVD compared to the other two methods investigated. Moreover, additional information should be discussed: how the threshold for the ordinary values was selected (line 121 says “as small as possible”); how the 5-year estimation window was selected; why the 30-day lag time for the independence of the ordinary values is so different from the values found in the literature (lines 173-179); and how F(x,θ), which turns out to be a GPD (Line 267), differs from the classical GPD.
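For reference, the assumptions listed above enter the MEVD through its standard definition, in which the distribution of the annual maximum is the average, over the calibration years j, of F(x; θ_j) raised to the yearly number of ordinary events n_j. The following minimal Python sketch illustrates this with a GPD for F. The function names, parameter values, and event counts are hypothetical and are not taken from the manuscript, and x is assumed to be the excess over the ordinary-value threshold.

```python
import math

def gpd_cdf(x, scale, shape):
    """CDF of a Generalized Pareto Distribution for an excess x over the threshold."""
    if x <= 0.0:
        return 0.0
    if abs(shape) < 1e-12:            # exponential limit for shape -> 0
        return 1.0 - math.exp(-x / scale)
    base = 1.0 + shape * x / scale
    if base <= 0.0:                   # above the upper bound when shape < 0
        return 1.0
    return 1.0 - base ** (-1.0 / shape)

def mevd_cdf(x, yearly_params, yearly_counts):
    """MEVD CDF: mean over years j of F(x; theta_j) ** n_j."""
    terms = [gpd_cdf(x, scale, shape) ** n
             for (scale, shape), n in zip(yearly_params, yearly_counts)]
    return sum(terms) / len(terms)

def mevd_quantile(p, yearly_params, yearly_counts, lo=0.0, hi=100.0, tol=1e-9):
    """Invert the MEVD CDF by bisection on the (assumed) bracket [lo, hi]."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if mevd_cdf(mid, yearly_params, yearly_counts) < p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Hypothetical yearly (scale, shape) GPD parameters and ordinary-event counts
params = [(0.12, 0.05), (0.10, 0.02), (0.14, 0.08)]
counts = [40, 35, 45]

# Excess over the threshold with a 100-year return period under these assumptions
x100 = mevd_quantile(1.0 - 1.0 / 100.0, params, counts)
```

When all years share the same parameters and the same event count, `mevd_cdf` reduces to F(x)^n, which makes explicit that the choices of F, threshold, and estimation window are the points where the MEVD departs from the classical formulation.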
I do see the value in implementing the cross-validation procedure to assess the predictive power of the distribution selected as representative of the observations. At the same time, I see the cross-validation as an additional measure of goodness of fit rather than the main one. The NDE only tests whether the one quantile associated with the return period Tr of interest is well captured. What about the other quantiles? Is the distribution representative of the entire sample? Also, how is the observed quantile h(obs,p) calculated? Which sample (M, S, or V) is used? The QQ plots are mentioned only in the results section and they are only produced for the 30-year in-sample test. In my opinion, the QQ plots put the NDE into perspective and should be included as a goodness-of-fit method. Also, it would be useful to have them in the main manuscript. I do understand that space is limited; maybe the Authors could consider including in the main manuscript only the ones related to the MEVD.
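One common way to obtain an observed quantile, not necessarily the one used by the Authors, is the Weibull plotting position p_i = i/(N+1) applied to the sorted maxima of the test sample. The sketch below pairs each empirical quantile with its model counterpart (the points of a QQ plot) and computes a relative error at every quantile rather than at a single one. The data, the Gumbel quantile function standing in for a fitted model, and the relative-error definition are all assumptions for illustration; the latter may differ from the manuscript's NDE.

```python
import math

def weibull_plotting_positions(n):
    """Non-exceedance probabilities p_i = i / (n + 1) for i = 1..n."""
    return [i / (n + 1) for i in range(1, n + 1)]

def gumbel_quantile(p, loc, scale):
    """Quantile of a Gumbel distribution (stand-in for any fitted model)."""
    return loc - scale * math.log(-math.log(p))

def qq_pairs(observed_maxima, quantile_fn):
    """Return (p, observed quantile, model quantile) for each sorted observation."""
    xs = sorted(observed_maxima)
    ps = weibull_plotting_positions(len(xs))
    return [(p, x, quantile_fn(p)) for p, x in zip(ps, xs)]

def relative_errors(pairs):
    """Assumed error measure: (estimated - observed) / observed at each quantile."""
    return [(p, (est - obs) / obs) for p, obs, est in pairs]

# Hypothetical annual maxima of the test sample, in metres
observed = [0.92, 1.10, 0.85, 1.34, 1.02, 0.97, 1.21, 1.56, 0.89, 1.15]

pairs = qq_pairs(observed, lambda p: gumbel_quantile(p, loc=0.98, scale=0.18))
errors = relative_errors(pairs)
```

With N = 10 the largest observation is assigned p = 10/11, that is, an empirical return period of 11 years, which shows how strongly the observed quantile at high return periods depends on the sample length and on the sample chosen.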
In the section Return Period, the definition of Equation 4 needs to be further discussed. Even if the Authors replace (h) with (z - msl), Equation 4 is still the return period of (h), and not the return period of (z), as indicated by the Authors. Mean sea level (msl) shows a clear linear trend, and this trend is recognizable in (z). Similarly, in Equation 5, the distribution G is the distribution of the variable (h) and not of the variable (z), as reported in line 341. This has an implication for Figure 5. I assume that the y-axis label “water level” in Figure 5 refers to the variable (z). This variable (z) is time-dependent, while in Figure 5 it seems that the statistical properties of (z) are constant. I would have expected something similar to effective return level plots, to show the effect of sea-level rise. How is (msl), which is time-dependent, added to (h), which is not time-dependent, to derive Figure 5? I suggest clarifying the transition from the analysis of the variable (h), a random variable, to (z), which presents a linear trend due to (msl). I also suggest being more precise with the notation and the terms used throughout the manuscript. It is very difficult to understand which variables the Authors refer to because they are often called by many different terms, e.g., total sea level, water level, extreme sea level.
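The effective return level mentioned above can be sketched as follows, assuming that G is a stationary distribution of the annual maxima of (h), here a GEV with hypothetical parameters, and that z(t) = h + msl(t) with a hypothetical linear trend in (msl). Under these assumptions the probability that (z) exceeds a fixed level changes from year to year, and the level with annual exceedance probability 1/Tr in a given year is h_Tr + msl(t). None of the numbers below come from the manuscript.

```python
import math

def gev_quantile(p, loc, scale, shape):
    """Quantile of a GEV distribution (shape > 0 means a heavy upper tail)."""
    if abs(shape) < 1e-12:
        return loc - scale * math.log(-math.log(p))
    return loc + scale / shape * ((-math.log(p)) ** (-shape) - 1.0)

def gev_cdf(x, loc, scale, shape):
    """CDF of a GEV distribution, with the support limits handled explicitly."""
    if abs(shape) < 1e-12:
        return math.exp(-math.exp(-(x - loc) / scale))
    base = 1.0 + shape * (x - loc) / scale
    if base <= 0.0:
        return 0.0 if shape > 0 else 1.0
    return math.exp(-base ** (-1.0 / shape))

def msl(year, ref_year=2020, rate=0.004):
    """Hypothetical linear mean-sea-level trend, in metres relative to ref_year."""
    return rate * (year - ref_year)

def effective_return_level(tr, year, gev_params):
    """Level of z with annual exceedance probability 1/tr in the given year."""
    return gev_quantile(1.0 - 1.0 / tr, *gev_params) + msl(year)

def exceedance_probability(z_level, year, gev_params):
    """Annual probability that z = h + msl(year) exceeds a fixed level z_level."""
    return 1.0 - gev_cdf(z_level - msl(year), *gev_params)

h_params = (1.0, 0.15, 0.05)          # hypothetical GEV (loc, scale, shape) for h
z100_2020 = effective_return_level(100, 2020, h_params)
z100_2100 = effective_return_level(100, 2100, h_params)
p_2100 = exceedance_probability(z100_2020, 2100, h_params)
```

In this sketch the 100-year level of (z) rises by exactly the assumed mean-sea-level increment between 2020 and 2100, while the level that had a 1% annual exceedance probability in 2020 is exceeded more often in 2100. A plot of `effective_return_level` against the year is the kind of figure that would make the time dependence of (z) explicit.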
The Authors say that “MEVD proves to be a good model for the extreme sea levels” (line 288) and that “MEVD-based estimates outperform the traditional approaches” (line 301). I fail to see what the Authors describe. In the QQ plots, Figure S26, MEVD in the in-sample analysis has, in general, the highest variability, especially compared to the GEV. In the out-of-sample analysis, MEVD looks better for lower quantiles, but it has quite a large variability for higher quantiles compared to the other distributions. Overall, it is difficult to quantify which distribution performs best. This is also reflected in the NDE plots, Figure 3, where the differences between distributions are minimal.
Point-by-point comments:
 Line 92. Please revise the notation: Pr(Mn <= x) = F(x)^n, where Mn is the maximum of a sequence of n independent random variables X. See also Coles 2001 (line 415).
 Line 154. Additional discussion is needed concerning the fact that h(t) can be considered a stochastic variable even though a deterministic component is included. Also, a literature review on indirect and direct methods (Line 149) for extreme sea level is missing.
 Line 133. The Authors discuss the negligibility of tide-surge interaction. Does this condition hold in the case of Punta della Salute, which is located within the Venice Lagoon?
 How is the GPD threshold selected and tested?
 To appreciate the differences in performance among the distribution functions, it would be very interesting and useful to see the samples of maxima used for fitting the distributions.
 Lines 205-209. My suggestion is to revise this paragraph. The terminology is confusing. I believe the Authors here are discussing the variable (z), of which storm surge is a component.
 Lines 220-221. The Authors say that the tidal and storm components do not change over time as mean sea level does. How did the Authors check that no trend is detected in the variable h?
 Section 3: Was the trend test performed only on the annual maxima or also on the samples of maxima used to compute the GPD and the MEVD?
 Line 281: Storm surge or storm surge and tide?
 Line 285: what is L?
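
The identity referred to in the Line 92 comment, Pr(Mn <= x) = F(x)^n for the maximum of n independent and identically distributed variables, can be checked numerically. The sketch below is a minimal illustration that assumes a unit exponential distribution for X; the sample size, level, and number of trials are arbitrary choices, not values from the manuscript.

```python
import math
import random

def exponential_cdf(x):
    """CDF F(x) of a unit exponential variable."""
    return 1.0 - math.exp(-x) if x > 0 else 0.0

def simulated_max_cdf(x, n, trials, rng):
    """Monte Carlo estimate of Pr(Mn <= x), with Mn the maximum of n i.i.d. draws."""
    hits = 0
    for _ in range(trials):
        m_n = max(rng.expovariate(1.0) for _ in range(n))
        if m_n <= x:
            hits += 1
    return hits / trials

rng = random.Random(0)                 # fixed seed for reproducibility
n, x = 10, 3.0
theoretical = exponential_cdf(x) ** n  # F(x)^n
empirical = simulated_max_cdf(x, n, trials=20000, rng=rng)
```

The two values agree up to sampling error, which is the sense in which the block-maxima distribution follows from the parent distribution F when the n values are independent.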