the Creative Commons Attribution 4.0 License.

the Creative Commons Attribution 4.0 License.

# The use of historical information for regional frequency analysis of extreme skew surge

### Roberto Frau

### Marc Andreewsky

### Pietro Bernardara

The design of effective coastal protections requires an adequate
estimation of the annual occurrence probability of rare events associated
with
a return period up to 10^{3} years. Regional frequency analysis (RFA) has
been proven to be an applicable way to estimate extreme events by sorting
regional data
into large and spatially distributed datasets. Nowadays, historical data are
available to provide new insight on past event estimation. The utilisation
of historical information would increase the precision and the reliability of
regional extreme's quantile estimation. However, historical data are from
significant extreme events that are not recorded by tide gauge. They usually
look like isolated data and they are different from continuous data from
systematic measurements of tide gauges. This makes the definition of the
duration of our observations period complicated. However, the duration of the observation
period is crucial for the frequency estimation of extreme
occurrences. For this reason, we introduced here the concept of “credible
duration”. The proposed RFA method (hereinafter referenced as FAB, from the
name of the authors) allows the use of historical data together with
systematic data, which is a result of the use of the credible duration concept.

Flood risk is a noteworthy threat for anthropic and industrial settlement along the coast. Coastal defence constructions have to withstand extreme events. Therefore, in this context, a statistical estimation of extreme marine variables (such as sea levels or skew surges, the difference between the maximum observed sea level and the maximum predicted astronomical tide level during a tidal cycle; Simon, 2007; Weiss, 2014) occurrence probability is necessary. These variables associated with high return periods (up to 1000 years; ASN, 2013) are estimated by statistical methods linked to the extreme value theory (EVT) (Fréchet, 1928; Gnedenko, 1943; Gumbel, 1958; Picklands, 1975; Coles, 2001).

In the past, time series recorded at a single site had been used to estimate return levels for marine hazards. However, these extreme quantiles are linked to huge estimation uncertainties due to local recordings being too short (typically 30 to 50 years long). A greater amount of data is therefore necessary to obtain reliable estimations of extreme marine hazards.

For this reason and thanks to the availability of several local datasets, the regional frequency analysis (RFA) is employed to reduce statistical uncertainties, allowing the use of bigger extreme samples. The primary purpose of RFA is to cluster different locations in a region and to use all the data together (Cunnane, 1988; Hosking and Wallis, 1997; Van Gelder and Neykov, 1998; Bernardara et al., 2011; Weiss, 2014). RFA is based on the index flood principle (Dalrymple, 1960) that introduces a local index for each site in order to preserve their peculiarities in a region. A regional distribution can be defined only after checking its statistical homogeneity (Hosking and Wallis, 1997). The probability distribution of extreme values must be the same everywhere in the region to allow the fitting of a probability distribution with a big data sample. Several applications exist for marine variables, although these are mostly for skew surges (Bernardara et al., 2011; Bardet et al., 2011; Weiss et al., 2013).

Specifically, Weiss (2014) proposed a RFA approach to treat extreme waves and skew surges. According to this approach, homogeneous regions are first built relying on typical storm footprints and their statistical homogeneity is subsequently tested. The addition of physical elements allows for the inclusion of to regions that are both physically and statistically homogeneous. Furthermore, this methodology, and the pooling method in particular, yields an equivalent regional duration in years that enables the evaluation of the return periods of extreme events.

As proved in hydrology, extending datasets with historical data or paleofloods is an alternative way to reduce uncertainties in the estimation of extremes (Benson, 1950; Condie and Lee, 1982; Cohn, 1984; Stedinger and Cohn, 1986; Hosking and Wallis, 1986a, 1986b; Cohn and Stedinger, 1987; Stedinger and Baker, 1987; Ouarda et al., 1998; Benito et al., 2004; Reis and Stedinger, 2005; Payrastre et al., 2011; Sabourin and Renard, 2015) as well as representing better potential outliers. Previous works on local sea levels (Val Gelder, 1996; Bulteau et al., 2015) and on local skew surges (Hamdi et al., 2015) pointed out that the incorporation of historical data leads to a positive effect on extreme frequency analysis.

The combination of these two approaches increases the amount of available data allowing to get better extreme return levels. The use of extraordinary floods events in RFA has already been explored by Nguyen et al. (2014) for peak discharges and Hamdi et al. (2016) have explored it in the frame of oceanic and meteorological variables.

In this study, we investigate the use of historical data in RFA of systematic measurements of skew surge levels. The proposed FAB method (a declination of RFA using credible duration) allows the estimation of reliable extreme skew surge events by taking into account all the available non-systematic records of skew surge levels. Specifically, historical data are defined as all of the skew surge events that were not recorded by a tide gauge, and consequently, systematic data are all the measurements of skew surge recorded by tide gauge. One obvious and very promising way of collecting information not recorded by a tide gauge is to investigate and reconstruct historical events that occurred before the starting date of gauge observations. This activity is also known as data archaeology and has been demonstrated very promising and useful for several study in the literature. However, this definition also includes isolated data points reconstructed during gaps in the systematic measurements. This is because statistically speaking they could be treated similarly to the isolated data reconstructed from before the gauged period. To find historical events it is thus necessary to go back in time and look for information before the gauging period or during each data gap in a time series. During extreme storms, tide gauge may be affected by a partial or total failure in the sea level measurements and so this kind of event cannot always be detected. Thanks to non-systematic records, we can hold a numerical value to these ungauged extreme events. This allows to use a larger number of storms in the statistical assessment of the extreme skew surges.

Baart et al. (2011) pointed out the hitch in recovering accurate historical information. A historical event is the result of a wide investigation of several qualitative and quantitative sources. Historical data are collected from different sources as newspapers of the period, archives, water marks, palaeographic studies etc. For this reason, each historical data has its own origin and its quality has to be checked (Barriendos et al., 2003). Therefore, a critical analysis of historical data must be performed before applying a statistical approach.

Using historical data with time series data recorded by a tide gauge is not a simple process. Extreme statistical analysis requires a knowledge of the time period for which all the events above an extreme threshold are known (Leese, 1973; Payrastre et al., 2011). For the gauged, or preferably, systematic data, this time is represented by the recording period (i.e. systematic duration). As historical data are isolated data points, the duration of the observation is not defined. What happened during the time period between two or more past extreme events remains unknown. However, the coverage period estimation of historical data is a key step in extreme value statistical models (Prosdocimi, 2018). Reasonable computation of historical duration is the base of reliable statistical estimation of rare events.

In the past, to overcome these issues the perception threshold concept has been introduced. The perception threshold represents the minimum value over which all the historical data are reported and documented (Payrastre et al., 2011; Payrastre et al., 2013). The hypothesis is thus set that all the events over the perception threshold must have been recorded and the time period without records corresponds to time period without extreme events. This allows the estimation of the coverage period and the indirect assumption that the collected historical data are exhaustive throughout the coverage period assessed. So far, the perception threshold has allowed the use of historical information in extreme frequency analysis (Payrastre et al., 2011; Payrastre et al., 2013; Bulteau et al., 2015) but the hypothesis for using this approach seems sometimes difficult to prove.

Indeed, the perception threshold is usually defined equal to a historical event value, based on the principles that, if any other historical event would have occurred it would have been recorded (Payrastre et al., 2011; Bulteau et al., 2015). This is quite a sweeping hypothesis, which is difficult to verify and validate. Moreover, the duration associated to the perception threshold is usually defined as the duration between the oldest historical data and the start of the systematic series (Payrastre et al., 2011; Hamdi et al., 2015).

The main point of difference in this paper is the proposal of a new approach in order to deal with these questions in a more satisfactory way.

Note that the new approach is required for the application of the RFA, at
least in the POT approaches (Van Gelder and Neykov, 1998; Bernardara et al.,
2011; Bardet et al., 2011; Weiss et al., 2014b). Indeed, the definition of
the perception threshold as equal to a historical value precludes the
possibility of using the RFA methodology proposed by Weiss (2014).
Specifically, in Weiss regional methodology, the number of extreme events per
year *λ* must be equal in each site of a region in order to estimate a
regional distribution. This is impossible to achieve if the threshold is
linked to the characteristics of the local observations.

In addition, when historical information (i.e. non-systematic data) is
collected during a total failure of tide gauge, this influences the
frequency of the event. One could only re-estimate *λ* if the
credible duration is re-estimated (for details, see the Sect. 2).

In order to deal with these questions in this paper, a different approach is proposed to estimate the equivalent duration of historical data. This evaluation is based on the hypothesis that the frequency of the extreme events is not changing significantly in time. In other words, the average number of extreme events should be the same (obviously this has to be carefully verified and demonstrated case by case). In this way, we can assess the coverage period of historical data. Based on the frequency value of systematic storms occurrence, our hypothesis introduces a physical and tangible element in the estimation of historical duration. For this reason, we state that this advantage allows us to supplant the concept of a perception threshold providing by a credible estimation of the unknown historical duration used to estimate the extreme quantiles.

Thanks to this new concept of duration in this study, the FAB methodology is developed in order to use historical data and systematic data together in a RFA of extreme events. This methodology, described in detail in Sect. 2, is applied to the extreme skew surges given by 71 tide gauges located on the British, French and Spanish coasts and 14 historical collected skew surges is shown in Sect. 4. In addition, a functional example on the computation of credible duration is illustrated in Sect. 3.

## 2.1 The concept of credible duration

In the extreme value analysis (EVA) of natural hazards, the estimations of extreme quantiles are performed by applying statistical tools to a dataset of extreme variables. Usually, these datasets of extremes can be generated from continuous time series data or by taking extremes as the maximum value in several predefined time blocks (block maxima approach) or fixing a sampling threshold (peak-over-threshold (POT) approach).

Based on a POT approach, our study examines datasets of skew surges. From now on, we use skew surge as our chosen extreme variable but all concepts that we will describe can be extended also to other natural hazards.

In the POT framework, an extreme sampling threshold has to be considered in
order to extract the dataset of extreme skew surges from continuous time
series data of systematic skew surges recorded by tide gauge. In this way,
the extreme sample is created and the extreme quantiles computation of skew surge can be performed
according to the statistical distribution used. The relationship between
return periods *T* and quantiles *x*_{T} is expressed as follows:

where *P*(*X*>*x*_{T}) is the exceedance probability of *x*_{T} or
the probability that in every *λ**T* years an event *X*,
stronger than *x*_{T}, will occur. By using Eq. (1), the results can be illustrated
through a return level plot.

The variable *λ* represents the number of skew surge events per year
that exceed the sampling threshold. In other words, *λ* is the
frequency of storm occurrence and it can be expressed as the ratio of
the number of skew surges *n* over the sampling threshold to the recording
time of systematic skew surge measurements in years *d* (hereafter called
systematic duration).

Knowledge of *n* and *d* is necessary to compute the frequency of extreme skew
surges occurrence *λ* and to estimate extreme quantiles of skew
surges. In other words, a duration estimation is needed to transform the
probability [no units] in frequency [time^{−1} dimension].

When historical skew surges are available, the computation of *λ* is a
complex topic. In fact, although the
number of historical data *n*_{hist} over the sampling threshold is
known, we do not have any information on what happens during non-gauged
periods. It is thus impossible to define a duration over which the
*n*_{hist} data are observed. In other words,
regardless of the selection of sampling threshold value, we need this information to define the frequency of occurrence of
events over the threshold (see the Eq. 2). We called the missing information
“historical duration” *d*_{hist}, or in other words, the time period
for which historical skew surges above the sampling threshold are estimated
to be exhaustively recorded to the best of our knowledge. The inclusion of
historical data in a systematic dataset of extreme skew surge is shown as an
example in Fig. 1. It needs the formulation of a credible hypothesis in order
to estimate the duration of the whole dataset composed by systematic and
historical records.

Taking advantage of the perfect knowledge of systematic skew surges, the
basic hypothesis proposed here for the computation of historical duration is
that the frequency of storms *λ* remains unchanged over time. More
precisely, *λ*_{syst}, namely the number of systematic skew surges
per year over the sampling threshold, is supposed to be equal to *λ*_{hist}, namely the number of historical skew surges per year over the
same sampling threshold.

Equation (3) is in accordance with the lack of a significant trend on the extreme skew surges frequency during the second part of the 19th, 20th and the first part of 21st century in the regions considered in this study (Fig. 5).

According to current scientific literature, based on past observations, frequency of storms has no defined trend (Hanna et al., 2008; Barring and Fortuniak, 2009; MICORE Project, 2009). Therefore, the hypothesis that storm frequency is constant has been assumed. No significant trends are detected in terms of magnitude or frequency of storms between 1780 and 2005 in the North Atlantic and European regions since the Dalton minimum (Barring and Fortuniak, 2009). Over the past 100 years in the North Atlantic basin, no robust trends in annual numbers of tropical storms, hurricanes and major hurricanes have been identified by the last IPCC report of 2013. Hanna et al. (2008), Matulla et al. (2008) and Allan et al. (2009) state that observations have no clear trends over the past century or longer with substantial decadal and longer fluctuations with the exceptions of some regional and seasonal trends (Wang et al., 2009, 2011). Moreover, MICORE (2009) in the “Review of climate change impacts on storm occurrence” affirms that results from coastal areas do not detect any significant trend in the frequency of storms for the existing and available datasets.

Therefore, the hypothesis that storm frequency is constant may be a reasonable working hypothesis for demonstrating this approach. Obviously, it is crucial to test this hypothesis before the application of the approach in the present state to any new dataset. Note that the methodology could be adapted quickly in the future for non-stationary datasets, where the introduced concept can be adjusted. For skew surges, the validity this hypothesis is ensured.

The assumption that the annual frequency of skew surges has been constant in Europe during the last two centuries is warranted by the several references mentioned above. A solid check must be done for other marine variables to know if they can fit this hypothesis as well.

A new duration of observation *d*_{cr} is defined as “credible”
(i.e. based on credible hypothesis) considering a steady *λ*. This
novelty is expressed as follows:

Equation (4) shows how the number of extreme data is divided in two sides according
to the nature of data: *n*_{syst} is the number of extreme systematic data
above the sampling threshold *u* and *n*_{syst} is the number of extreme
historical data over the same threshold *u*. Therefore, the credible duration also depends on the sampling threshold value.

Moreover, this splitting indirectly provides the coverage period of historical skew surges in addition to the duration of recorded data. The historical side of the duration ${d}_{{\mathrm{cr}}^{\mathrm{hist}}}$ can be formulated as follows:

The coverage period on average that every historical skew surge holds is
called associated duration *d*_{cr,ass}.

Equation (6) displays how the associated duration can be evaluated just by knowing
*λ*. Therefore, every extreme data not originated from a recorded
time series has an associated duration. This is also the case for historical
events that occurred during a time gap in gauged skew surges series.

A functional example is illustrated in Sect. 3 in order to explain all of the notions put forth up to this point and their relevance to the sampling threshold choice.

## 2.2 The use of credible duration in RFA framework

Weiss (2014) defined a methodology to fit a regional probability distribution
for sea extreme variables. This approach enables the putting together of data
from different locations and the forming of regions. The probability
distribution of extreme skew surges must be the same inside a region. Thus,
the extreme data of the same region are gathered together through the use of
a normalisation parameter, which preserves the peculiarities of each site and
the regional probability distribution defined as such. The formation of
regions based on typical storm footprints (Weiss et al., 2013) is used to create physically and
statistically homogeneous regions and it leads to knowledge of a master
component of this regional process: the regional effective duration
*D*_{eff}.

The regional effective duration *D*_{eff} represents the effective duration
(in years), filtered of any intersite dependence, in which the number of
regional data *N*_{r} are observed. *D*_{eff} is defined as the product
of the degree of regional dependence *φ*[1;*N*] and the mean of
local durations:

where *λ*_{r} is the number of regional storms per year ${\mathit{\lambda}}_{\mathrm{r}}={N}_{\mathrm{r}}/\stackrel{\mathrm{\u203e}}{d}$ and $\stackrel{\mathrm{\u203e}}{d}$ is the mean of local
observations' duration of the *N* regional sites: $\stackrel{\mathrm{\u203e}}{d}=\sum _{i=\mathrm{1}}^{N}{d}_{i}/N$.

The local frequency of storms *λ* must be the same across the region.
More details about the *D*_{eff} are clarified in Weiss et al. (2014b) for the
classic RFA approach, which is set for systematic data.

The knowledge of the regional sample duration, as exposed by a generic
frequency analysis in the previous paragraph of this section, is a required
factor. In fact, the evaluation of the regional return period *T*_{r} of a
particular regional quantile ${x}_{{T}_{\mathrm{r}}}$ is expressed as in Eq. (8) as follows:

where $P\left({X}_{s}>{x}_{{T}_{\mathrm{r}}}\right)$ is the probability that a storm *s*
impacts at least one site in the region with a normalised intensity greater
than ${x}_{{T}_{\mathrm{r}}}$. Therefore, thanks to the regional distribution *F*_{r}, the
local return period of the storm *s* is calculated as follows:

In addition, Weiss et al. (2014b) use for the computation of the empirical local
return periods a Weibull plotting position. This element is correlated with
the regional duration *D*_{r}:

here the biggest storm *s*_{max} of the regional sample is linked to a local
return period thanks to *D*_{r}.

However, the need to employ historical data in the extreme analysis has been
considered as highly relevant by scientific community to get reliable
quantiles and to not miss significant information. In the RFA, the use of
historical records is allowed by the new notion of credible duration. A new
way to process historical skew surges at regional scale is introduced. The
addition of one or more historical skew surges in one or more sites of a
region requires the knowledge of the duration of each site. Thanks to the
credible duration concept applied previously to a single site, a coverage
period for both types of data can be evaluated. It allows switching from the
notion of the regional effective duration *D*_{r,eff} to the regional
credible duration *D*_{r,cr}:

Regional credible duration depends also on the degree of regional dependence
*φ* that increases when the data are independent. Moreover, Eq. (11)
shows that a local duration surplus consequently brings an excess of the
coverage period of regional sample. With some simplifications, the Eq. (11) can
thus be reformulated as follows:

The lack of a trend on storms' frequency is a necessary condition to assume a
steady *λ*. If this hypothesis is satisfied, the regional credible
duration can be computed.

The regional frequency analysis of skew surges including historical data, or FAB method, can be performed and these two combined approaches ensure better estimations of extreme values.

In order to better illustrate the concept of credible duration and how it varies according to different thresholds, a specific illustration on connection between these variables is described in this section by the example shown in Fig. 1.

Figure 1 shows the functional example of the whole skew surge series recorded at the La Rochelle tide gauge (France). Systematic skew surges are represented by the continuous series of data plotted in black from about 1940 to 2017. They are observed during a fixed period equal to the recording time period, in this case, 32.88 years.

The historical skew surges are artificial data generated by ourselves for this descriptive illustration and they are represented as green cross points. As they are not supposed to be obtained by continuous systematic records of skew surge measurements, the duration of their support is unknown.

As expressed above, *d*_{cr} and in particular ${d}_{\mathrm{cr}}^{\mathrm{hist}}$ depends on
the sampling threshold value *u*. Equations (4) and (5) show this dependence through
the *λ* value.

The application of the new proposed concept allows the computation of the historical duration of skew surges that represents a basic way to take into account the historical data on the estimation of extreme values.

Although the period of systematic observation is known and fixed by tide
gauge's records, the coverage period of historical observations is
associated with the value of sampling threshold *u*. The *λ* value is
computed from the systematic measurements (Eq. 2). As a result of the hypothesis
that the frequency of storms *λ* has no trend (Eq. 3), this *λ*
value is then used to compute the duration of the historical records.
Moreover, the hypothesis of Eq. (3) leads to different values of historical
duration for each *λ* selected value and consequently, for each
likely sampling threshold. Four cases of different values of high sampling
threshold are examined (Fig. 2 and Table 1); these particular cases are chosen
in order to let us know the effects of different sampling thresholds on the
computation of the credible duration.

Case 1 uses a sampling threshold *u*_{c1} corresponding to 0.5 extremes per
year (*λ*= 0.5) and lead to a credible duration of 42.88 years
(Table 1). A higher threshold *u*_{c2} is used in Fig. 2 for the Case 2
(*λ*= 0.23) and the credible duration grows until 54.62 years as the
number of historical data over the threshold does not change. In Case 3, the
threshold (*u*_{c3}) increases (*λ*= 0.2) but the credible duration
decreases (52.88 years) because the smallest value of historical skew surge
does not exceed the threshold. On average, the coverage period of each
historical event still rises. Case 4 displays the use of a very high
threshold *u*_{c4} (*λ*= 0.07) to define the sample of extremes. In
this case, only one historical event is bigger than threshold and its
coverage period, or associated duration, corresponds to the historical
duration. When the historical
data value is larger, the probability that the historical
event covers a longer period is higher.

As stated above, the four cases (resume in Fig. 2 and Table 1) show the behaviour of credible duration for different values of threshold. As threshold value increases, associated duration increases as well. Equation (4) expresses the credible duration as the sum of systematic duration and historical duration. Systematic duration is steady and it is independent from threshold value. For this reason, the variation of credible duration depends only on the variation of historical duration. In particular, when the threshold is extremely high, it is almost sure that no extreme skew surges occurred during the coverage period of historical data. If the threshold is lower, then it is less certain that all extreme skew surges have been observed in the historical period.

However, it is not always true that the credible duration is biggest when threshold grows (Fig. 3). In fact, the credible duration and historical duration depends not only on the threshold value but also on the number of historical events exceeding it. If the threshold is extremely high, not all historical data are taken into account in the extremes' sample. On the contrary, the associated duration of each set of historical data increases when the threshold grows for the reasons explained in the previous paragraph (shown by the blue line in Fig. 3).

In conclusion, the sampling threshold choice is an important step in the extreme quantiles estimation allowing the computation of the credible duration. Finally, with all of these elements we can deal with historical data in a regional framework.

## 4.1 Systematic skew surge database and historical data

The systematic skew surge database is originated from a time series of sea levels recorded by 71 tide gauges located along French, British and Spanish coasts of the Atlantic Ocean, the English Channel, the North Sea and the Irish Sea (Fig. 4).

French data are provided by SHOM (Service Hydrographique et Océanographique de la Marine, France), English data by BODC (British Oceanographic Data Center, UK) and Spanish data by IEO (Instituto Espanol de Oceanografia, Spain). All data except the Spanish data, are updated to January 2017. Every site has different systematic durations depending on the tide gauge records. Starting from 1846, the longest time series is recorded at Brest for a systematic period of 156.57 years. Sea levels are recorded hourly except for British ports. BODC supplies sea levels every 15 min from 1993 onwards.

Skew surge data are generated at each site from the difference between the maximum sea levels recorded around the time of the predicted astronomical high tide and the self-same predicted astronomical high tide. Before computing skew surge, sea levels must be corrected by a potentially long-term alteration of mean sea levels, or eustatism. Only sea level data with significant linear trends are edited. More details on the computation of skew surges are described by Simon (2007), Bernardara et al. (2011) and Weiss (2014). In this way, the systematic database of 71 skew surge series is created.

Finding accurate historical data is complex and difficult and at the time of writing only 14 historical skew surges are available. These historical records are located in 3 of the 71 ports: 9 at La Rochelle (Gouriou, 2012; Garnier and Surville, 2010; Breilh, 2014; Breilh et al., 2014), 4 at Dunkirk (DREAL, 2009; Parent et al., 2007; Pouvreau, 2008; Maspataud, 2011) and 1 at Dieppe (provided by the Service hydrographique et océanographique de la Marine SHOM, France). The historical data are introduced in corresponding skew surge series of the concerned sites.

This skew surge database of systematic and historical data is exploited in this study for the estimation of regional return periods.

## 4.2 Formation of regions

The method to form homogeneous regions proposed by Weiss et al. (2014a) is used to get physically and statistically homogeneous regions from the database shown above. This methodology is based on a storm propagation criterion identifying the most typical storms footprints. The configuration of parameters to detect storms is set using the same criteria as Weiss (2014). The most typical storm footprints are revealed by Ward's hierarchical classification (Ward, 1963) and correspond to four physical regions. The homogeneity test (Hosking and Wallis, 1997) is applied to check if all regions are also statistically homogeneous. Only one physical region is not statistically homogeneous, and after an inner partition and a further check, five physically and statistically homogeneous regions are obtained.

Moreover, a double threshold approach (Bernardara et al., 2014) is employed to separate the physical part from the statistical part of the used extreme variables and two different thresholds are used: the first to define physical storms (physical threshold) and the second to define regional samples (statistical sampling threshold, or more simply, sampling threshold).

Figure 5 displays the five regions founded out through the application of Weiss' methodology. Although a different input database is used, the number and the position of the regions are similar to Weiss' study.

The regionalisation process shows that only Region 1 and Region 2 hold historical data. The three sites with historical data belong to these two regions: La Rochelle to Region 1 and Dieppe and Dunkirk to Region 2.

The region with more historical information is the Region 1 (nine historical skew surges) and the results of FAB method proposed are focused on this region.

## 4.3 Focus on Region 1

### 4.3.1 Statistical threshold and credible duration

The sample of regional extremes is analysed with statistical tools in order
to estimate regional return levels associated with very extreme return
periods. A regional pooling method is used to select independent maximum
data. A regional sample is formed through normalised local data. Maximum
local data are divided by a local index, equal to the local sampling
threshold value (Weiss, 2014). Moreover, this methodology enables us to
estimate the regional distribution *F*_{r} considered as the distribution of
the normalised maximum data above the statistical sampling threshold.

This threshold corresponds to a storm's frequency *λ*, which is
assumed equal at all sites of a region. The sampling threshold value
selection, or more accurately the *λ* value selection, is a
complicated topic (Bernardara et al., 2014). There is no an unequivocal
approach for defining the exact sampling threshold. Twelve sensitivity
indicators depending on all possible *λ* values are proposed to assign
the best *λ* value for the regional analysis of extreme skew surges
using historical data: the statistical homogeneity test (Hosking and Wallis,
1997), the stationarity test applied to storm intensities of regional sample
(Hosking and Wallis, 1993), the chi-squared test for the regional
distribution (Cochran, 1952), the statistical test to detect outliers in
regional sample (Barnett and Lewis, 1994; Hubert and Van der Veeken, 2008;
Weiss, 2014), the value of regional credible duration, the number of regional
sample data, the value of the degree of regional dependence *φ*, the
visual look of the regional return plot and the stability of regional
credible duration, local return levels, shape parameter and scale parameter
of the GPD distribution (Coles, 2001). The *λ* value selection can not
be carried out through one of these methods and only the combination of all
indicators can enable us to appoint the optimal *λ*. All sensitivity
indicators have the same relevance in the proposed sensitivity analysis.

A skew surge's frequency of 0.36 is chosen for Region 1 thanks the
sensitivity analysis of the statistical factors mentioned above. Taking
*λ*= 0.36 results in having, on average, one extreme skew surge over the
sampling threshold every 2.78 years at each site. This threshold selection
permits the computation of the credible duration for every site belonging to
the Region 1. In Region 1, all ports except for La Rochelle have only
systematic skew surge and their credible duration, equal to systematic
duration, is determined and independent from the sampling threshold value.

On the contrary, the site of La Rochelle has nine historical skew surges and its credible duration depends both on systematic duration and historical duration. The systematic duration does not change and it is equal to the recording period of La Rochelle tide gauge. As shown in the fictive illustration of Sect. 3, the historical duration differs as threshold value changes.

The credible duration at La Rochelle is 57.88 years and the contribution of historical data is almost the 43 % of the total duration. Obviously, without these nine historical data, the coverage period of extreme data at La Rochelle would have been less than 25 years, equivalent to the historical duration. All the historical data are over the chosen threshold and, consequently, the associated duration of each historical data is equal to 2.77 years.

### 4.3.2 Regional credible duration

Equation (11) enables us to compute the regional credible duration for Region 1. The introduction of nine historical data in the local time series and only eight in the regional sample after the use of the regional pooling method, grants an earning of 11.11 years of regional duration. As Table 2 shows, historical data increase the credible durations for sites where there are historical records and consequently the regional credible duration likewise increases. In Region 1, only the site of La Rochelle increases its duration. This contribution allows for the increase of the regional credible duration.

Regional credible duration depends not only on the mean of local duration but
also on the degree of regional dependence *ϕ* (Eq. 11). Obviously, this
degree does not change with only nine more added
data. In conclusion, the rise of the regional
credible duration of Region 1 is produced mostly by the increase of credible
duration at La Rochelle.

Table 2 shows the difference with the FAB method that allows to take into
account historical data and a traditional method of RFA (Bernardara et al.,
2011; Weiss, 2014) that does not enable the use of historical data. Figure 6
displays how the more *λ* decreases in value, the more the difference
between regional credible duration with the nine historical data (blue line)
and the regional effective duration without historical data (green line) for
Region 1 increases (red line). This is due to the greater certainty that
historical data above the sampling threshold are the biggest over a longer
period.

### 4.3.3 Regional return levels

Regional return levels for Region 1 are obtained through the fit of a
generalised Pareto distribution (Fig. 7). Penalised maximum likelihood
estimation (PMLE) (Coles and Dixon, 1999) is used to estimate the regional
GPD parameters (*γ*, *k*). As shows Fig. 7, the regional distribution is
unbounded due to a positive shape parameter *k*. The return level plot of
Region 1 is produced with the 95 % confidence intervals generated by a
bootstrap of observed storms. A Weibull plotting position is employed for
the regional empirical return period. Return levels for each site are
achieved through multiplication between regional return levels and local
indices, equivalent to the local sampling threshold.

Red dotted points represent the eight historical data taken into account in Region 1. As it appears, the benefit of using historical data is not only an increase of the coverage period of extreme data but also to be able to consider in a frequency analysis when non-systematic storms occurred. The historical skew surges used in this study represent very big storms and for this reason it is newsworthy to take into account their large magnitude. Indeed, the biggest normalised skew surge considered in Region 1 is 3.01 compared with 2.13 of the regional sample generated without historical data (Table 2).

The application of the FAB method, based on the new concept of credible duration, allows more reliable extreme estimations of return levels to be obtained due to the inclusion of historical data to a systematic skew surge database.

The introduction of historical data on a frequency analysis of extremes is an innovative way to increase the reliability of estimations. In the past, the estimation of the period covered by the historical data collection has usually limited the use of historical data. The concept of a perception threshold was employed to use historical data even though its hypothesis is quite broad and difficult to verify. For this reason, the new concept of credible duration has been formulated that is based on physical considerations supported by several scientific studies. This leads to use historical isolated data points on the EVA.

The hypothesis, that the frequency of storms *λ* remains unchanged in
time and on which credible duration concept is based, is satisfactory for
skew surge. For other variables, a hypothesis check has to be done.
Following the hypothesis check, all the considerations that we raised in
this study can also be applied for other natural hazards.

When historical data are available, the credible duration depends on sampling threshold. This novelty allows the expansion of the credible duration concept to a regional scale. The RFA is another efficient way to reduce uncertainties on the extreme estimations. These two ways can be used together by the proposed FAB method.

The results describe the relevance of using historical records on a regional analysis both in terms of regional duration and of storm magnitude. Sometimes, these data can represent very large events and so we have to consider them during the estimation of extreme variables. Moreover, the existence of historical records enables to raise the coverage period of the extreme sample.

No extreme data can be missed in an extreme frequency analysis. It is very important to use all of the available data in order to get increasingly more reliable extreme estimations and regional analysis and historical data are two tools that can be merged to get satisfactory results.

New perspectives are opened to improve extreme quantiles estimations by using historical data that is becoming increasingly more available through different investigations on several qualitative and quantitative sources. The aggregation of all of these data in a historical database is a need to exploit the maximum amount of extremes.

Furthermore, every historical data value is associated with different uncertainties and it is challenging to be able to take into account these uncertainties in the regional analysis.

The systematic data can be obtained by request addressed to the corresponding author: SHOM (Service Hydrographique et Océanographique de la Marine) for French data, the BODC (British Oceanographic Data Centre) for British data and the IEO (Instituto Español de Oceanografía) for Spanish data.

The authors declare that they have no conflict of interest.

The permission to publish the results of this ongoing research study was
granted by the EDF (Electricité De France). The results in this paper
should, of course, be considered as R&D exercises without any significance
or embedded commitments upon the real behaviour of the EDF power facilities
or its regulatory control and licensing. The authors would like to
acknowledge the SHOM (Service Hydrographique et Océanographique de la
Marine, France), the BODC (British Oceanographic Data Centre, UK) and the IEO
(Instituto Español de Oceanografía, Spain) for providing the data
used in this study.Edited by: Ira
Didenkulova

Reviewed by: two anonymous referees

Allan, R., Tett, S., and Alexander, L.: Fluctuations in autumn-winter severe storms over the British Isles: 1920 to present, Int. J. Climatol., 29, 357–371, https://doi.org/10.1002/joc.1765, 2009.

ASN: Guide de l'ASN n^{∘} 13 : Protection des installations
nucléaires de base contre les inondations externes, 2013.

Baart, F., Bakker, M. A. J., van Dongeren, A., den Heijer, C., van Heteren, S., Smit, M. W. J., van Koningsveld, M., and Pool, A.: Using 18th century storm-surge data from the Dutch Coast to improve the confidence in flood-risk estimates, Nat. Hazards Earth Syst. Sci., 11, 2791–2801, https://doi.org/10.5194/nhess-11-2791-2011, 2011.

Bardet, L., Duluc, C.-M., Rebour, V., and L'Her, J.: Regional frequency analysis of extreme storm surges along the French coast, Nat. Hazards Earth Syst. Sci., 11, 1627–1639, https://doi.org/10.5194/nhess-11-1627-2011, 2011.

Barnett, V. and Lewis, T.: Outliers in Statistical Data, 3rd Edn., John Wiley, Chichester, 1994.

Barriendos, M., Coeur, D., Lang, M., Llasat, M. C., Naulet, R., Lemaitre, F., and Barrera, A.: Stationarity analysis of historical flood series in France and Spain (14th–20th centuries), Nat. Hazards Earth Syst. Sci., 3, 583–592, https://doi.org/10.5194/nhess-3-583-2003, 2003.

Barring, L. and Fortuniak, K.: Multi-indices analysis of southern Scandinavian storminess 1780-2005 and links to interdecadal variations in the NW Europe-North sea region, Int. J. Climatol., 29, 373–384, https://doi.org/10.1002/joc.1842, 2009.

Benito, G., Lang, M., Barriendos, M., Llasat, M., Frances, F., Ouarda, T., Thorndycraft, V., Enzel, Y., Bardossy, A., Cœur, D., and Bobée, B.: Use of Systematic, Paleoflood and Historical Data for the Improvement of Flood Risk Estimation, Review of Scientific Methods, Nat. Hazards, 31, 623–643, 2004.

Benson, M. A.: Use of historical data in flood-frequency analysis, Eos, Transactions American Geophysical Union, 31, 419–424, https://doi.org/10.1029/TR031i003p00419, 1950.

Bernardara, P., Andreewsky, M., and Benoit, M.: Application of regional frequency analysis to the estimation of extreme storm surges, J. Geophys. Res., 116, C02008, https://doi.org/10.1029/2010JC006229, 2011.

Bernardara, P., Mazas, F., Kergadallan, X., and Hamm, L.: A two-step framework for over-threshold modelling of environmental extremes, Nat. Hazards Earth Syst. Sci., 14, 635–647, https://doi.org/10.5194/nhess-14-635-2014, 2014.

Breilh, J. F.: Les surcotes et les submersions marines dans la partie centrale du Golfe de Gascogne : les enseignements de la tempete Xynthia, PhD thesis, Sciences de la Terre, Université de La Rochelle, France, available at: https://tel.archives-ouvertes.fr/tel-01174997/document (last access: 20 March 2018), 2014.

Breilh, J. F., Bertin, X., Chaumillon, E., Giloy, N., and Sauzeau, T,: How frequent is storm-induced flooding in the central part of the Bay of Biscay?, Global Planet. Change, 122, 161–175, https://doi.org/10.1016/j.gloplacha.2014.08.013, 2014.

Bulteau, T., Idier, D., Lambert, J., and Garcin, M.: How historical information can improve estimation and prediction of extreme coastal water levels: application to the Xynthia event at La Rochelle (France), Nat. Hazards Earth Syst. Sci., 15, 1135–1147, https://doi.org/10.5194/nhess-15-1135-2015, 2015.

Cochran, W. G.: The *χ*2 Test of Goodness of Fit, Ann. Math. Stat., 23,
315–345, 1952.

Cohn, T. A.: The incorporation of historical information in flood frequency analysis, MS thesis, Dep. of Environ. Eng., Cornell Univ., Ithaca, NY, USA, 1984.

Cohn, T. A. and Stedinger, J. R.: Use of historical information in a maximum-likelihood framework, J. Hydrol., 96, 215–223, https://doi.org/10.1016/0022-1694(87)90154-5, 1987.

Coles, S.: An Introduction to Statistical Modeling of Extreme Values, Springer, London, https://doi.org/10.1007/978-1-4471-3675-0, 2001.

Coles, S. and Dixon, M.: Likelihood-based inference for extreme value models, Extremes, 2, 5–23, https://doi.org/10.1023/A:1009905222644, 1999.

Condie, R. and Lee, K. A.: Flood frequency analysis with historic information, J. Hydrol., 58, 47–61, https://doi.org/10.1016/0022-1694(82)90068-3, 1982.

Cunnane, C.: Methods and merits of regional floods frequency analysis, J. Hydrol., 100, 269–290, https://doi.org/10.1016/0022-1694(88)90188-6, 1988.

Dalrymple, T.: Flood Frequency Analyses, Manual of Hydrology: Part 3, “Flood Flow Techniques” U.S. Geological Survey Water-Supply Paper, 1543-A, US GPO, 1960.

DREAL (Ministère de l'Ecologie, de l'Energie, du Développement durable, et de l'Aménagement du territoire), 59/62 SREI/DRNHM: Détermination de l'aléa de submersion marine intégrant les conséquences du changement climatique en région Nord – Pas-de-Calais – Etape 1: compréhension du fonctionnement du littoral, available at: http://www.hauts-de-france.developpement-durable.gouv.fr/IMG/pdf/Determination-alea-submersion-marine-integrant-consequences-changement-climatique-dreal-npdc.pdf (last access: 20 March 2018), 2009.

Fréchet, M.: Sur la lois de probabilité de l'écart maximum, Annales de la Société Polonaise de Mathématique, 6, 93–122, 1928.

Garnier, E. and Surville, F.: La tempête Xynthia face à l'histoire, Submersions et tsunamis sur les littoraux français du Moyen Âge à nos jours, Le croît Vif., Saintes, France, 2010.

Gnedenko, B.: Sur la distribution limite du terme maximum d'une série aléatoire, Ann. Math., 44, 423–453, 1943.

Gouriou, T.: Evolution des composantes du niveau marin à partir d'observations de marégraphe effectuées depuis la fin du 18e siècle en Charente-Maritime, Ph.D. thesis, Océanographie physique, Université de La Rochelle, France, 2012.

Gumbel, E. J.: Statistics of Extremes, Columbia University Press, New York, USA, 1958.

Hamdi, Y., Bardet, L., Duluc, C.-M., and Rebour, V.: Use of historical information in extreme-surge frequency estimation: the case of marine flooding on the La Rochelle site in France, Nat. Hazards Earth Syst. Sci., 15, 1515–1531, https://doi.org/10.5194/nhess-15-1515-2015, 2015.

Hamdi, Y., Bardet, L., Duluc, C.-M., and Rebour, V.: Development of a target-site-based regional frequency model using historical information, Geophys. Res. Abstr., EGU2016-8765, EGU General Assembly 2016, Vienna, Austria, 2016.

Hanna, E., Cappelen, J., Allan, R., Jonsson, T., Le Blancq, F., Lillington, T., and Hickey, K.: New Insights into North European and North Atlantic Surface Pressure Variability, Storminess, and Related Climatic Change since 1830, J. Climate, 21, 6739–6766, https://doi.org/10.1175/2008JCLI2296.1, 2008.

Hosking, J. R. M. and Wallis, J. R.: Paleoflood hydrology and flood frequency analysis, Water Resour. Res., 22, 543–550, https://doi.org/10.1029/WR022i004p00543, 1986a.

Hosking, J. R. M. and Wallis, J. R.: The value of historical data in flood frequency analysis, Water Resour. Res., 22, 1606–1612, https://doi.org/10.1029/WR022i011p01606, 1986b.

Hosking, J. R. M. and Wallis, J. R.: Some statistics useful in regional frequency analysis, Water Resour. Res., 29, 271–281, https://doi.org/10.1029/92WR01980, 1993.

Hosking, J. R. M. and Wallis, J. R.: Regional Frequency Analysis, An approach based on L-moments, Cambridge University Press, 1997.

Hubert, M. and Van der Veeken, S.: Outlier detection for skewed data, J. Chemometr., 22, 235–246, https://doi.org/10.1002/cem.1123, 2008.

IPCC: Climate change 2013: the physical science basis, Contribution to the IPCC fifth assessment report (AR5), Chapter 2: Observations: Atmosphere and Surface, 2013.

Leese, M. N.: Use of censored data in the estimation of Gumbel distribution parameters for annual maximum flood series, Water Resour. Res., 9, 1534–1542, https://doi.org/10.1029/WR009i006p01534, 1973.

Maspataud, A.: Impacts des tempêtes sur la morphodynamique du profil côtier en milieu macrotidal, Ph.D. thesis, Océanographie, Université du Littoral Côte d'Opale, France, available at: https://tel.archives-ouvertes.fr/tel-00658671/document (last access: 20 March 2018), 2011.

Matulla, C., Schoner, W., Alexandersson, H., von Storch, H., and Wang, X. L.: European storminess: late nineteenth century to present, Clim. Dynam., 31, 125–130, https://doi.org/10.1007/s00382-007-0333-y, 2008.

MICORE Project 202798: Review of climate change impacts on storm occurrence, edited by: Ferreira, O. (UALG), Vousdoukas, M. (UALG), and Ciavola, P. (UniFe), available at: http://www.micore.eu/file.php?id=4 (last access: 20 March 2018), 2009.

Nguyen, C. C., Gaume, E., and Payrastre, O.: Regional flood frequency analyses involving extraordinary flood events at ungauged sites: further developments and validations, J. Hydrol., 508, 385–396, https://doi.org/10.1016/j.jhydrol.2013.09.058, 2014.

Ouarda, T. B. M. J., Rasmussen, P. F., Bobée, B., and Bernier, J.: Utilisation de l'information historique en analyse hydrologique fréquentielle, Revue des sciences de l'eau, 11, 41–49, https://doi.org/10.7202/705328ar, 1998.

Parent, P., Butin, T., Vanhee, S., and Busz, N.: Un territoire soumis au risque de submersion marine. Les tempetes de 1949 et 1953 à Dunkerque, Institution départementale des Wateringues, available at: http://www.ingeo.fr/fichier/file/Wateringues/Affiche4-Dunkerque.pdf (last access: 20 March 2018), 2007.

Payrastre, O., Gaume, E., and Andrieu, H.: Usefulness of historical information for flood frequency analyses: developments based on a case study, Water Resour. Res., 47, W08511, https://doi.org/10.1029/2010WR009812, 2011.

Payrastre, O., Gaume, E., and Andrieu, H.: Historical information and flood frequency analyses: which optimal features for historical floods inventories?, La Houille Blanche – Revue internationale de l'eau, 3, 5–11, https://doi.org/10.1051/lhb/2013019, 2013.

Picklands, J.: Statistical inference using extreme order statistics, The Ann. Stat., 3, 119–131, 1975.

Pouvreau, N.: Trois cents ans de mesures marégraphiques en France : outils, méthodes et tendances des composantes du niveau de la mer au port de Brest, PhD thesis, Climatologie, Université de La Rochelle, France, available at: https://tel.archives-ouvertes.fr/tel-00353660/document (last access: 20 March 2018), 2008.

Prosdocimi, I.: German tanks and historical records: the estimation of the time coverage of ungauged extreme events, Stoch. Environ. Res.-Risk. Assess., 32, 607–622, https://doi.org/10.1007/s00477-017-1418-8, 2018.

Reis, D. S. and Stedinger, J. R.: Bayesian MCMC flood frequency analysis with historical information, J. Hydrol., 313, 97–116, https://doi.org/10.1016/j.jhydrol.2005.02.028, 2005.

Sabourin, A. and Renard, B.: Combining regional estimation and historical floods: A multivariate semiparametric peaks-over-threshold model with censored data, Water Resour. Res., 51, 9646–9664, https://doi.org/10.1002/2015WR017320, 2015.

SHOM (Service Hydrographique et Océanographique de la Marine): Références Altimétriques Maritimes, available at: http://diffusion.shom.fr/pro/risques/references-verticales/references-altimetriques-maritimes-ram.html (last access: 20 March 2018), 2017.

Simon, B.: La marée océanique côtière, Editions de l'Institut Océanographique, available at: http://diffusion.shom.fr/pro/la-maree-oceanique-cotiere.html# (last access: 20 March 2018), 2007.

Stedinger, J. R. and Baker, V. R.: Surface water hydrology: historical and paleoflood information, Rev. Geophys., 25, 119–124, https://doi.org/10.1029/RG025i002p00119, 1987.

Stedinger, J. R. and Cohn, T. A.: Flood frequency analysis with historical and paleoflood information, Water Resour. Res., 22, 785–793, https://doi.org/10.1029/WR022i005p00785, 1986.

Van Gelder, P. H. A. J. M.: A new statistical model for extreme water levels along the Dutch coast, edited by: Tickle, K. S., Goulter, I. C., Xu, C. C., Wasimi, S. A., and Bouchart, F., Stochastic Hydraulics '96, Proceedings of the 7th IAHR International Symposium, Mackay, Queensland, Australia, 243–249, 1996.

Van Gelder, P. H. A. J. M. and Neykov, N. M.: Regional frequency analysis of extreme water level along the Dutch coast using L moments: A preliminary study, Stochastic models of hydrological processes and their applications to problems of environmental preservation, NATO Advanced Research Workshop, 23–27 November 1998, Moscow, Russia, 14–20, 1998.

Wang, X. L., Zwiers F. W., Swail V. R., and Feng Y.: Trends and variability of storminess in the Northeast Atlantic region, 1874–2007, Clim. Dynam., 33, 1179–1195, https://doi.org/10.1007/s00382-008-0504-5, 2009.

Wang, X. L., Wan, H., Zwiers, F. W., Swail, V. R., Compo, G. P., Allan, R. J., Vose, R. S., Jourdain, S., and Yin, X.: Trends and low-frequency variability of storminess over western Europe, 1878–2007, Clim. Dynam., 37, 2355–2371, https://doi.org/10.1007/s00382-011-1107-0, 2011.

Ward, J. H.: Hierarchical grouping to optimize an objective function, J. Am. Stat. Assoc., 58, 236–244, 1963.

Weiss, J.: Analyse régionale des aléas maritimes extrêmes, PhD thesis, Mécanique des fluides, Université Paris-Est, France, available at: https://pastel.archives-ouvertes.fr/tel-01127291/document (last access: 20 March 2018), 2014.

Weiss, J., Bernardara, P., and Benoit, M.: A method to identify and form homogeneous regions for regional frequency analysis of extreme skew storm surges, Proceedings of the 1st International Short Conference on Advances in Extreme Value Analysis and Application to Natural Hazards, 18–20 September 2013, Siegen, Germany, 2013.

Weiss, J., Bernardara, P., and Benoit, M.: Formation of homogeneous regions for regional frequency analysis of extreme significant wave heights, J. Geophys. Res.-Oceans, 119, 2906–2922, https://doi.org/10.1002/2013JC009668, 2014a.

Weiss, J., Bernardara, P., and Benoit, M.: Modeling intersite dependence for regional frequency analysis of extreme marine events, Water Resour. Res., 50, 5926–5940, https://doi.org/10.1002/2014WR015391, 2014b.