Flash flood detection via copula-based intensity–duration–frequency curves: evidence from Jamaica

Collalti, Dino; Spencer, Nekeisha; Strobl, Eric

doi:https://doi.org/10.5194/nhess-24-873-2024

Articles | Volume 24, issue 3

https://doi.org/10.5194/nhess-24-873-2024

Articles | Volume 24, issue 3

Research article

14 Mar 2024

Research article |

| 14 Mar 2024

Flash flood detection via copula-based intensity–duration–frequency curves: evidence from Jamaica

Dino Collalti, Nekeisha Spencer, and Eric Strobl

Abstract

Extreme rainfall events frequently cause hazardous floods in many parts of the world. With growing human exposure to floods, studying conditions that trigger floods is imperative. Flash floods, in particular, require well-defined models for the timely warning of the population at risk. Intensity–duration–frequency (IDF) curves are a common way to characterize rainfall and flood events. Here, the copula method is employed to model the dependence between the intensity and duration of rainfall events flexibly and separately from their respective marginal distribution. Information about the localization of 93 flash floods in Jamaica was gathered and linked to remote-sensing rainfall data, and additional data on location-specific yearly maximum rainfall events were constructed. The estimated normal copula has Weibull and generalized extreme value (GEV) marginals for duration and intensity, respectively. Due to the two samples, it is possible to pin down above which line in the intensity duration space a rainfall event likely triggers a flash flood. The parametric IDF curve with an associated return period of $2 \frac{1}{6}$ years is determined as the optimal threshold for flash flood event classification. This methodology delivers a flexible approach to generating rainfall IDF curves that can directly be used to assess flash flood risk.

Download & links

Article (PDF, 4706 KB)

Download & links

How to cite.

Received: 05 Sep 2023 – Discussion started: 19 Sep 2023 – Revised: 26 Jan 2024 – Accepted: 26 Jan 2024 – Published: 14 Mar 2024

1 Introduction

Over the last 20 years, more people have been affected by floods than by any other natural disaster.¹ Among pluvial floods, flash floods have the highest average mortality (Jonkman, 2005; Hu et al., 2018). The Caribbean is especially at risk from flash floods. The region is particularly prone to hydro-meteorological hazards, urbanization is often unregulated, and soil degradation is common such that flash floods are frequent (Gencer, 2013; Pinos and Quesada-Román, 2021). For instance, heavy rain on 5 March 2022 in northern Hispaniola caused severe flash floods, leading to two deaths and hundreds being displaced.² Flash floods follow shortly after heavy rainfall and are highly localized phenomena that occur in basins of no more than a few hundred square kilometers and have a response time of a few hours (Amponsah et al., 2018). Steep slopes, impermeable surfaces, and saturated soils are factors that can transform a heavy rainfall event into a flash flood hazard (Silvestro et al., 2019). The high localization and multidimensionality involved in flash floods make their study particularly involved.

It has long been a primary objective of weather service providers to create a warning system that connects rainfall to floods and landslides (Alfieri et al., 2012). Warning systems typically use some lower bound or threshold above which a warning would be issued (Hapuarachchi et al., 2011). Empirical thresholds for when rainfall events become hazardous connect the intensity (I) to the duration (D) and are used for the construction of so-called intensity–duration–frequency (IDF) curves (Koutsoyiannis et al., 1998). Commonly, estimation of IDF curves requires assumptions on the marginal distribution of I and D or the two marginals were assumed to be independent.³ Using copula functions for conditional sampling allows the flexible and separate definition of marginals and dependence. Multiple studies employ the said method to estimate rainfall IDF curves for landslides and heavy rainfall events (Singh and Zhang, 2007; Ariff et al., 2012; Bezak et al., 2016; Li et al., 2019; Suresh and Pekkat, 2023). These studies often define the yearly maximum event of measurement stations by some decision rule and model the resulting time series. This allows for a good statistical fit and a well-described dependence between I and D. However, since the data do not necessarily contain hazardous events, little inference can be made about these.

The calculation of rainfall IDF curves is strongly rooted in extreme value theory (EVT) since the intensity and duration of extreme events lend themselves as a relevant application (Koutsoyiannis et al., 1998). It is a common approach to flexibly derive IDF curves that can accommodate non-linearity in their parameters, e.g., to depict climatic time trends (Hosseinzadehtalaei et al., 2020; Sam et al., 2023). More recently, the use of copula functions to extend univariate EVT to multivariate extreme value theory (MVET) has been suggested as a complement to existing approaches, including IDF curves (Renard and Lang, 2007; Salvadori and De Michele, 2010; Chen and Guo, 2019). The main difference between the two approaches is the focus of analysis. Taking a multivariate perspective with copula functions, one focuses on the potentially non-linear dependence between intensity and duration (Bezak et al., 2016). In contrast, the univariate EVT approach typically focuses on univariate characteristics such as non-stationarity in distributional components (Martel et al., 2021). In principle, a multivariate approach with copula functions could also incorporate non-stationarity in an appropriate methodology. However, such a procedure becomes increasingly complex and requires sufficient data (Li et al., 2019).

This study aims to construct IDF curves with information from confirmed flash flood events in Jamaica. This allows for inference with regard to the hazard by comparing the odds ratio of flood occurrence given a frequency, where less frequent events are more severe and vice versa. The calculation of the odds ratio requires a set of extreme but non-hazardous events as well as a set of hazardous rainfall events. Following the literature, the local yearly maximum rainfall events are defined. Additionally, a complete and confirmed list of Jamaican flash floods by the Office of Disaster and Preparedness Management (ODPEM) is utilized to define hazardous events. These observed flash flood events are linked with 11 km × 11 km cells of remote-sensing rainfall information. These remotely sensed data have several advantages compared to station data, such as consistency in sensors and resolution. While direct in situ measurements are factual, they depend on the location and continuous operation of stations. Currently, the number of modern automatic weather stations in Jamaica is well below the remote-sensing resolution, with the exception of the area around the capital Kingston.⁴ Subsequently, the IDF curve threshold, which separates the confirmed hazard events from the rest via the odds ratio, is determined. This threshold can serve as a simple decision rule for the identification of flash-flood-triggering rainfall events.

There are a number of reasons why the Caribbean and Jamaica in particular are an interesting case study. Small island states in the Caribbean have long been identified as especially vulnerable to extreme meteorological events and associated flooding (IPCC, 2012; Wilson et al., 2014). Moreover, extreme precipitation events have shown an increase since 1950 in the Caribbean region (Peterson et al., 2002; Stephenson et al., 2014). At the same time, there is little information on the local rainfall risk. In this regard, it is common practice to transfer IDF curves for some Caribbean island nations to others, despite their different rainfall characteristics (Lumbroso et al., 2011). Burgess et al. (2015) therefore developed IDF curves for Jamaica with long historical data. Linearly projecting the historical parameter estimates to 2100, they find that the intensity of a 100-year return event increases by 27 % to 59 % as a result of increasing variability due to climate change.

https://nhess.copernicus.org/articles/24/873/2024/nhess-24-873-2024-f01

Figure 1Maps of the (bilinearly interpolated) rainfall climate in Jamaica from the precipitation data as used in this study. Panel (a) shows the average yearly precipitation, panel (b) the average precipitation of events with a 72 h inter-event time definition (IETD), panel (c) the average precipitation of events with a 24 h IETD, and panel (d) the average precipitation of events with a 6 h IETD.

Quantifying extreme rainfall-induced hazards has important applications, such as for risk maps, warning systems, or re-insurance schemes, particularly for the Caribbean. For example, the Climate Risk and Early Warning Systems (CREWS) aims to strengthen hydro-meteorological and early warning services in the Caribbean, focusing on hurricanes and other hydro-meteorological hazards. Its first assessment in 2015 identified the need for increased forecasting of secondary hazards such as coastal flooding and flash floods. Currently, pilot projects to strengthen national multi-hazard early warning systems in the Caribbean community countries are devised through CREWS. The Caribbean Risk Information System (CRIS) platform, created by the Caribbean Disaster Emergency Management Agency (CDEMA), aims to support informed decision-making by providing access to information on hazards and does so via geospatial data for risk and hazard mapping, disaster preparedness, and response operations. These input data rely on research in the hazard, exposure, and vulnerability domain.⁵ Another example is the Caribbean Catastrophe Risk Insurance Facility (CCRIF), which since 2013 has provided insurance against excess rainfall to member countries (Linkin, 2014). More specifically, its CCRIF Excess Rainfall (XSR) product is a parametric insurance based on specific rainfall thresholds that determine payouts.

The remainder of the paper is organized as follows: Sect. 2 presents the study region and describes the data. Section 3 details the methodology of conditional copula modeling and how the two samples are used to determine an IDF-curve-based flash flood threshold. Section 4 then presents the results. Section 5 discusses the findings, and Sect. 6 concludes.

2 Study region and data

2.1 Study region

Jamaica is the third-largest Caribbean island by land area after Cuba and Hispaniola. The island's topography is characterized by interior mountain ranges descending to coastal plains where the eastern Blue Mountains historically experience the most rainfall (Climate Studies Group Mona, 2020). Jamaica lies in the Atlantic hurricane belt and is especially at risk of climate change (Monioudi et al., 2018). Tropical cyclones and the accompanying heavy rainfall are frequent and cause severe destruction (Spencer and Polachek, 2015; Collalti and Strobl, 2022). For instance, between June 2007 and August 2021, the CCRIF made 54 payouts for a total of USD 245 million, of which USD 135 million are for tropical cyclones, USD 60 million for excess rainfall, and USD 49 million for earthquakes (mainly the devastating 2021 Haiti earthquake). Thus, local susceptibility to floods has become vital to planning and development in Jamaica (Nandi et al., 2016).

2.2 Rainfall climate

The annual cycle of rainfall for Jamaica reflects a bimodal pattern with rainfall peaks in May and October typical for the northwestern Caribbean (Martinez et al., 2019). This pattern is a result of the interplay between the large-scale climatic modulators of the region, namely the North Atlantic high-pressure system (Azores High), the seasonal warming of the Atlantic, and the Atlantic trade winds (Climate Studies Group Mona, 2020). That is, the north-to-south movement of the North Atlantic High in boreal autumn (south-to-north movement in spring) and the Atlantic cooling in autumn (warming in spring), coupled with the trade wind inversion in boreal winter, cause boreal winter (and to a lesser degree summer) to be dry. Looking at the spatial rainfall distribution over Jamaica in Fig. 1, one sees that the spatial distribution of rainfall depends on the weather system at different temporal scales. On average, during the study period from 2000 to 2019, most precipitation falls in the central northwestern area of the island and, to a lesser degree, on the northeastern shore close to the Blue Mountains. Comparing this to the average total precipitation of events with a 6, 24, and 72 h inter-event time definition (IETD), we can conclude that the western part of the island is subject to the most heavy rainfall events with long durations, whereas the eastern part experiences the heaviest short duration events.

2.3 Flash floods

The source of flash flood information is the Office of Disaster and Preparedness Management (ODPEM), whose responsibility includes monitoring extreme weather events in Jamaica and implementing measures to mitigate their impact. From the ODPEM, shapefiles of all 48 known flood events are obtained from 2001 to 2018. Many of these events correspond to a specific meteorological event, like a tropical storm that caused flooding in more than one location in Jamaica. We treat each event location separately if it falls uniquely in a remote-sensing rainfall cell. For example, during heavy rain on 14–15 May in 2017, several places around Cave Valley (parish of St. Ann) in central Jamaica, as well as to the south around Morgans Pass in the parish of Clarendon, experienced severe flooding. These locations are approximately 20 km apart, lie on two sides of the north/south watershed, and are thus treated as two incidents in their respective rainfall cell. Some flood events in the ODPEM shapefiles could not be verified by any report and were therefore dropped, as were a few riverine floods that would require explicit hydrological modeling, which is beyond the scope of this study. Some events where the exact day (days) is (are) not included in the data are identified using local newspaper reports. A total of 93 flash flood events were localized for Jamaica with approximate timing.

2.4 Precipitation

The source for precipitation data is version 06B of the Global Precipitation Measurement (GPM) Integrated Multi-satellitE Retrievals (IMERG, Huffman et al., 2023). The satellite precipitation algorithm combines microwave and infrared precipitation measurements to produce precipitation estimates, adjusted with surface gauge data. The resulting product is a half-hourly data set with near-global coverage at a 0.1° × 0.1° resolution since June 2000. Compared to other remote-sensing or reanalysis products, the GPM IMERG has a considerably higher spatial and temporal resolution compared to its competitors. Also, the number of distinct cells and, thus, spatial resolution are considerably higher than the number of measurement stations in Jamaica. One major drawback of the GPM IMERG, its short time frame, does not apply to this study because all the ODPEM events are fully captured in the observational period since June 2000. Note that the quality of satellite rainfall data has leapfrogged in the last decade: an inter-comparison of rain-gauge, radar, and GPM IMERG for rainfall–runoff modeling by Gilewski and Nawalany (2018) in a mountainous catchment in Poland identified that radar and GPM IMERG outperform rain-gauge data. Tang et al. (2020) provide a comprehensive overview of different satellite precipitation and reanalysis products, reporting good performance for GPM IMERG, and they are continuously improving in more recent versions.

https://nhess.copernicus.org/articles/24/873/2024/nhess-24-873-2024-f02

Figure 2Flowchart illustrating the methodology. In gray are the original data inputs. Diamond squares are methods and procedures applied, whereas regular squares are outcomes of said methods and procedures. In green are methods that involve choices outside regular statistical testing such that they do not contribute to the analysis of uncertainty in Sect. 4.6, whereas methods in gold do. Blue squares are intermediary outcomes, whereas purple squares are final outcomes that are subject to discussion in Sect. 5.

Download

3 Methodology

In this methodology section, we describe how to create IDF curves from the GPM IMERG rainfall data via copula functions. Furthermore, we present a flash flood classification of extreme rainfall events by connecting the IDF curves with confirmed flash flood events. The methodology is summarized in Fig. 2. Core to the methodology are the event definition, copula selection and estimation, marginal distribution selection and estimation, conditional copula sampling, and the classification via odds ratio. Subsequently, these methods are explained and their application is presented in the results Sect. 4.

https://nhess.copernicus.org/articles/24/873/2024/nhess-24-873-2024-f03

Figure 3Illustration of the event definition in the case of a confirmed event with the reported date from ODPEM or newspapers in red. In the time frame ± 7 d around this date, three separate events are defined given an IETD and a minimum threshold of 0.1 mm h⁻¹. Three events result, where the second event is the maximum event measured by cumulative rainfall and is considered the flood-inducing rainfall event.

Download

3.1 Event definition

The data on confirmed flash flood events provide location and start date information but no sub-daily timing of rainfall onset and its ending. We thus need to find and define rainfall events that start before the flood (potentially lasting longer than the reported date). We resort to the common inter-event time definition (IETD) method to delimit the events (Ariff et al., 2012; Bezak et al., 2016). The IETD refers to the minimum duration without rain between consecutive rainfall events. An IETD of a few hours is typically selected for floods, while for landslides the IETD is longer, i.e., up to several days (Melillo et al., 2015). For confirmed events, the event definition starts with a window of ± 7 d around the date given by the ODPEM or newspapers. Within that window, the event with the maximum cumulative rainfall is regarded as the flood-inducing rainfall event. Figure 3 illustrates the procedure. The yearly maximum events are constructed the same way, though for each cell each year is considered separately. Note that a minimum threshold of 0.1 mm h⁻¹ for a given observation to start an event is imposed to reduce the number of events.

3.2 Conditional copula modeling

Informally, copulas can be described as “functions that join or couple multivariate distribution functions to their one-dimensional marginal distribution functions” (Nelsen, 2007). More formally, given a two-dimensional (joint) distribution function H with univariate margins F₁ and F₂, there exists, by the first part of Sklar's theorem, a two-dimensional copula C such that

\begin{matrix} (1) & H (x) = C (F_{1} (x_{1}), F_{2} (x_{2})), x \in R^{2} . \end{matrix}

The copula C is uniquely defined on $\prod_{j = j}^{2} ran F_{j}$ and is there given by

\begin{matrix} (2) & C (u) = H (F_{1}^{\leftarrow} (u_{1}), F_{2}^{\leftarrow} (u_{2})), u \in \prod_{j = 1}^{d} ran F_{j}, \end{matrix}

where F^← denotes the generalized inverse, which equals the regular inverse F⁻¹ for continuous and strictly increasing distribution functions (dfs). By the definition of a cumulative distribution function, $ran F_{j} \in (0, 1)$ such that the copulas univariate margins are standard uniform U(0,1) (Hofert et al., 2018). Three attributes follow: the copula function (1) uniquely specifies the dependence for the whole distribution, (2) can be recovered from data on joint and marginal distribution, and (3) imposes no constraint on the shape of the dependence.

The conditional copula method has previously been used to estimate IDF curves (Singh and Zhang, 2007; Ariff et al., 2012; Bezak et al., 2016; Li et al., 2019). Let C be the two-dimensional copula and let U∼C, $u_{1} \in (0, 1)$ and $u_{2} \in [0, 1]$ , then

\begin{matrix} (3) & C_{2 | 1} (u_{2} | u_{1}) = P (U_{j} \leq u_{j} | U_{1} = u_{1}) . \end{matrix}

If one fixes for some value of $u_{1} \in (0, 1)$ , the conditional copula function $C_{2 | 1} (u_{2} | u_{1})$ is a distribution function on [0,1] and can be used for conditional sampling. The evaluation of $C_{2 | 1} (u_{2} | u_{1})$ , however, involves the evaluation of partial derivatives instead of densities (Hofert et al., 2012, 2018). The theoretical basis for the process is the inverse Rosenblatt transformation, also known as the conditional distribution method.

Consider that $C_{U, V} (u, v)$ is the copula function of interest and let intensity I=i and duration D=d have marginal distribution functions V=F_I(i) and U=F_D(d). For a known value of U=u, $C_{V | U = u}$ gives realizations of marginal V. The corresponding value of u can be obtained by the marginal distribution function. From u and v, the respective i and d can be recovered easily since $d = F_{D}^{- 1} (u)$ and $i = F_{I}^{- 1} (v)$ . The conditional copula function can be written as

\begin{matrix} (4) & C_{V | U = u} (v | U = u) = \frac{\partial}{\partial u} C_{U, V} (u, v) |_{U = u .} \end{matrix}

The conditional copula, which is a conditional bivariate distribution, relates to the return period T as follows

\begin{matrix} (5) & C_{V | U = v} (v | U = u) = 1 - \frac{1}{T} . \end{matrix}

For a given value of u and a return period T, solving Eqs. (4) and (5) simultaneously yields the corresponding v. Via the marginal distribution function, the respective values of i and d are recovered and represent a point on the IDF curve for a return period T. For every return period T, many values of u are chosen to get an approximately smooth IDF curve. That process is repeated for other T to construct IDF curves which are increasing in severity with T.

3.3 Two-sample approach

Rainfall events of interest are those that lead to flash floods. However, a block maxima approach, partitioning the data into yearly blocks, allows a direct relation with return periods and is thus often chosen (Ariff et al., 2012). The proposed methodology uses information from block maxima as well as confirmed flood events. There are m=93 confirmed flood events and, at these locations, n=1120 yearly cell-wise maxima. The yearly maximum events serve to estimate the copula function and the marginal distributions of intensity and duration for these extreme rainfall events. Conditional sampling from the copula enables the construction of IDF curves with T-year return periods. One can then derive the IDF curve associated with a certain return period above which the likelihood of flash flood occurrence is maximized. For every return period the IDF curve is recovered and the ratio R of confirmed flash flood events m against the number of yearly maximum rainfall events n that lies above that curve is calculated:

\begin{matrix} (6) & R = \frac{\sum_{i = 1}^{m} I (d_{i} \geq (\tilde{d} | U = u_{i}))}{\sum_{j = 1}^{n} I (d_{j} \geq (\tilde{d} | U = u_{j}))}, \end{matrix}

where $(\tilde{d} | U = u_{i})$ is the estimated duration via conditional copula sampling and marginal transformation $\tilde{d} = F_{D}^{- 1} (\tilde{u})$ . The IDF curve with a return period associated with the highest ratio R_r:max is the one that separates the events from non-events best. This constitutes a so-called critical layer $(d, i) \in L^{2} : 1 - H ((d, i) = P (D > d, I > i) = t$ , where all combinations of i and d ∈L² have the same probability $1 - H ((d, i) = t$ (Salvadori et al., 2016). The critical region, which corresponds to a flash flood classification, is defined as $L_{t}^{>} = {(i, d) \in L^{2} : 1 - H (i, d) < t}$ (De Michele et al., 2013). Subsequently, the return period T^> of an event in the critical region is defined by the inverse probability of falling into the critical region (Zscheischler et al., 2017):

\begin{matrix} (7) & T^{>} = \frac{μ}{P ((D, I) \in L_{t}^{>})}, \end{matrix}

where μ denotes the average time unit, which is 1 year in the case of yearly maxima.

Table 1Candidate copula families.

Overview of candidate copula families with respect to their attributes.

Download Print Version | Download XLSX

3.4 Candidate copulas

The selection of appropriate copula is carried out in two steps. First, a set of candidate copulas is defined. Second, the candidate copula is compared on the basis of fit, for both the event and the yearly maxima data. The first restriction on candidate copulas is that a conditional sampling algorithm exists. This is the case for the families of Archimedean and elliptical copulas (Hofert et al., 2018). The literature on landslides and flash flood IDF curves has further established the negative relation between an event's duration and its intensity, which is the second restriction on candidates (Aleotti, 2004; Piciullo et al., 2017). Table 1 shows the copulas for which conditional sampling algorithms exist and some of their properties. The two restrictions leave one with three potential copula classes, normal, Frank, and t copula. Note that these copulas are all radially symmetric and exchangeable. Geometrically, radial symmetry is the symmetry of the density with respect to the point $1 / 2 = (1 / 2, \dots, 1 / 2)$ . Exchangeability is the symmetry of the density with respect to the main diagonal. Given a negative dependence, a copula that is not radial symmetric is one whose lower tail dependence is different from its upper tail dependence, whereas a copula that is not exchangeable is one whose dependence changes with the order of the marginals. The best candidate copula is selected on the basis of the cross-validation copula criterion (CIC) by Grønneberg and Hjort (2014), which is an Akaike information criterion (AIC)-like criterion on a maximum-pseudo-likelihood estimate (MPLE) of semi-parametrically (i.e., with non-parametric estimated margins) estimated copula. The methodology is implemented in the R package “copula”, with which all the subsequent copula modeling is carried out (Kojadinovic and Yan, 2010; Hofert et al., 2018).

3.5 Selection of marginals

The IDF curve construction via conditional copula in Sect. 3.2 requires the estimation of marginals for duration D and intensity I. Candidate marginal distributions are the Weibull, gamma, lognormal, and generalized extreme value (GEV) distributions and are all estimated via maximum likelihood and assessed by AIC. The empirical probability density function is also inspected graphically against the marginal distributions' estimated probability density to assert its fit.

https://nhess.copernicus.org/articles/24/873/2024/nhess-24-873-2024-f04

Figure 4(a) Location of the confirmed flash flood events, (b) cell-wise mean intensity of locations' maximum events, (c) cell-wise mean total rainfall of locations' maximum events, and (d) cell-wise mean duration of locations' maximum events.

4 Results

4.1 Event definition

The event definition in time requires an appropriate IETD. Values of IETD between 4 and 24 h are considered. Graphical assessment of mean event intensity and duration revealed that an IETD of 12 h best delimits the rainfall events. See Appendix Sect. A for a discussion and graphical examples for various IETDs. Figure 4 shows the locations of confirmed flash flood events in Jamaica and the average intensity, duration, and total rainfall of the yearly maximum events. Average intensity is highest inland and to the west, but it is fairly evenly distributed. Average total rainfall is highest at the eastern shore north of the Blue Mountains, with a second agglomeration of high total rainfall cells in the west. Duration exhibits a similar pattern to total rainfall, with the longest events in the west. Since the confirmed flash flood events are evenly distributed across the island, variation in local conditions is expected to be captured well. Table A1 in the Appendix further provides summary statistics for both yearly maximum and confirmed flash flood events.

4.2 Copula selection

The shape of dependence can be assessed via pseudo-observations. Pseudo-observations are obtained by first estimating the empirical distribution functions F_n(n,j) for $j \in (I, D)$ ,

\begin{matrix} (8) & F_{n, j} = \frac{1}{n + 1} \sum_{i = 1}^{n} 1 (X_{i, j} < x), x \in R, \end{matrix}

where 1(⋅) is the indicator function. These estimated margins can then be used to form the sample:

\begin{matrix} (9) & U_{i, n} = (F_{n, D} (X_{i, D}), F_{n, I} (X_{i, I})), i \in {1, \dots, n} . \end{matrix}

https://nhess.copernicus.org/articles/24/873/2024/nhess-24-873-2024-f05

Figure 5(a) Pseudo-observation of the yearly maximum rainfall events (YMEs) and (b) of the confirmed flash flood events (FFEs). Ties in the duration variable due to the measurement scale are randomly split.

Download

Figure 5 displays the pseudo-observations and demonstrates a strong negative dependence in both samples. This limits the set of potential copulas to the normal, Frank, and t copula. Note that these copulas are all radially symmetric and exchangeable.

Estimates of the copula information criterion (CIC) are shown in Table 2. Selecting the Frank copula for the yearly maximum events leads to a higher CIC than the normal or t copula. For the confirmed events, selecting the Frank copula leads to a lower CIC than the normal or t copula.

https://nhess.copernicus.org/articles/24/873/2024/nhess-24-873-2024-f06

Figure 6(a) Random sample (n=1120) of pseudo-observation of the yearly maximum rainfall events (YMEs) under the assumption of Frank copula, (b) under the assumption of normal copula, and (c) the true pseudo-observation. (d) Random sample (n=93) of pseudo-observation of the confirmed flash flood events (FFEs) under the assumption of Frank copula, (e) under the assumption of normal copula, and (f) the true pseudo-observation.

Download

Table 2Cross-validation copula criterion.

Cross-validation copula criterion (CIC) by Grønneberg and Hjort (2014) for both data samples. The t copula assumes 10=v degrees of freedom.

Download Print Version | Download XLSX

Figure 6 displays pseudo-observations for both samples as well as a random sample of pseudo-observation under Frank and normal copula. The sample of confirmed flash floods is too small to draw conclusive evidence regarding the optimal copula. There is also no clear visual indication around the locus of points for the Frank copula over the normal copula or vice versa. However, the normal copula exhibits tail dependence similar to the data, while the Frank copula is tail quadrant independent (Joe, 2014).

In summary, the normal copula is better suited for the data and thus chosen for the analysis. It is more appropriate for yearly maximum events, which are the data on which the IDF curves are generated, as outlined in Sect. 3.3 concerning the two-sample approach. Additionally, the normal copula is suitable for the confirmed events, as the CIC and graphical evidence show.

4.3 Estimation of marginals

Table 3 reports the Akaike information criterion for both samples, where the parametric distributions are fitted via maximum likelihood estimation (MLE) to the marginals. For both samples' intensity, the generalized extreme value (GEV) distribution results in the lowest information loss. Similarly, for the duration, the Weibull distribution yields, in both instances, the lowest AIC.

Table 3Akaike information criterion.

Akaike information criterion (AIC) for both samples. A parametric distribution was fitted via MLE to the marginals, intensity, and duration.

Download Print Version | Download XLSX

https://nhess.copernicus.org/articles/24/873/2024/nhess-24-873-2024-f07

Figure 7Comparison of estimated probability density function for (a) intensity of yearly maximum events (YMEs), (b) duration of yearly maximum events, (c) intensity of flash flood events, and (d) duration of flash flood events (FFEs).

Download

Figure 7 compares the estimated distributions and the empirical probability density. In all cases the two match well. The confirmed flood events are on average slightly longer (11.4 h versus 9 h) and less intense (5.2 mm h⁻¹ versus 9.9 mm h⁻¹) compared to the yearly maximum events. Notably, the flood events are not as smoothly distributed due to the smaller sample size. Both samples yield similar distributions and agree on the shape. Subsequent conditional copula modeling focuses on the more precisely estimated distributions from the large sample of yearly maximum events.

4.4 Conditional copula IDF curves

Given the conditional copula modeling presented in Sect. 3.2 and the estimates for the GEV distribution of intensity, the Weibull distribution of duration, and the normal copula for dependence, the following procedure can be employed similarly from a conditional copula. Let's create a vector of quantiles $u \in [0, 1]$ and set a return period T relating to the conditional copula as in Eq. (5), $C_{V | U = u} (v | U = u) = 1 - \frac{1}{T}$ , where C(⋅) is the normal copula function estimated in Sect. 4.2. Then for every quantile of u and for every return period, the corresponding (conditional) quantile of v results, thus giving a triplet of $(u_{q}, v_{q}, T)$ for any quantile q of u and return period T. Via the inverse distribution transformations $d = F_{D}^{- 1} (u_{q})$ and $i = F_{I}^{- 1} (v_{q})$ and the estimated marginal distributions from Sect. 4.3, that triplet becomes $(d, i, T)$ . Connecting all points in the (d,i) space for some T then gives us an IDF curve for return period T from conditional copula modeling.

https://nhess.copernicus.org/articles/24/873/2024/nhess-24-873-2024-f08

Figure 8Intensity–duration curves for frequencies corresponding to a return period of 2, 5, 10, 20, and 40 years. Panel (a) shows these IDF curves for the normal copula and marginals estimated from the yearly maximum events (YMEs), in logs in panel (b), and panel (c) shows the normal copula from YMEs and marginals from flash flood events (FFEs), in logs in panel (d).

Download

IDF curves corresponding to return periods between 2 and 40 years are shown in Fig. 8. The curves are all convex, such that shorter events have a disproportionately higher intensity. Visually, the choice of marginals has little impact on the IDF curves. Higher return periods shift the IDF curve outwards to higher intensities for all durations. Interestingly, convexity decreases with higher return periods. Even though longer return periods might be of interest, we are cautious that long return periods might not be appropriate given the available data. Coles et al. (2001) for instance argue that increasing extrapolation to upper quantiles of the distribution and longer return periods is increasingly at risk of unverifiable assumptions and uncertainty. Therefore, the maximum return period here of 40 years is twice the length of the data used in model estimation, which is more than sufficient for the extreme rainfall event classification.

4.5 Best IDF curve

IDF curves generated with the normal copula, a generalized extreme value distribution for intensity and a Weibull distribution for duration, reliably quantify the joint severity of an event by linking a return period to it. The next step is to find the IDF curve above which the probability of a flood event is maximized. The highest odds ratio (0.66) is reached with a return period of 2 years and 2 months. Rainfall events that potentially trigger flash floods are thus expected to be at least as severe as a 2.17-year return period event.⁶

4.6 Uncertainty

Each method and procedure in the methodology comes with uncertainty. In the case of the green-labeled procedure in Fig. 2, this uncertainty is in terms of modeling choice, i.e., which approach to event definition or functional form of copula and marginal distribution is appropriate. Uncertainty from such modeling choices is difficult to quantify since there is no clear framework to define appropriate alternative hypotheses. In other words, the functions that have to be evaluated are not necessarily nested models that can be compared directly with a statistical test. In contrast, the gold-labeled methods in Fig. 2 by design give measures of uncertainty or are nested to this uncertainty. Therefore, the uncertainty of the copula estimation, marginal distribution estimation, conditional copula sampling, and odds ratio classification can be assessed, for instance, with bootstrapping.

https://nhess.copernicus.org/articles/24/873/2024/nhess-24-873-2024-f09

Figure 9IDF curves with a return period of (a) 2 years, (b) 10 years, (c) 20 years, and (d) 40 years with a 99 % confidence band from bootstrapping over the estimation of marginals and the copula function.

Download

Bootstrapping is a method commonly used to create confidence intervals for complex problems related to other resampling methods such as cross-validation (Hesterberg, 2011). The approach has been employed to quantify the uncertainty of IDF curves (Sane et al., 2018). In our case, we use the standard bootstrap with replacement but randomly draw a sample of both the confirmed events and the yearly maximum rainfall events from the original data. We repeat that process many times and calculate the values of interest for the IDF curve and the ratio classification each time. This yields a distribution of these that mimics the real sampling procedure in the data. We repeat that process 1000 times and calculate confidence intervals as the bootstrapped quantiles of the quantities of interest.

https://nhess.copernicus.org/articles/24/873/2024/nhess-24-873-2024-f10

Figure 10Panel (a) shows the ratio R of confirmed flood events versus yearly maximum rainfall events for different return periods with a 95 % confidence band from bootstrapping over the estimation of marginals and the copula function. Panel (b) shows the IDF curve with a return period of 2.17 years and how this separates the yearly maximum rainfall events.

Download

Figure 9 shows IDF curves for return periods of 2, 10, 20, and 40 years with a 99 % confidence band from bootstrapping. There is very little uncertainty coming from the estimation of distribution and copula. We suspect that there could be more uncertainty from the modeling choices, albeit these choices are herein not discretionary but firmly grounded in statistical theory. Figure 10 shows the uncertainty of the return period classification ratio and the classification of yearly maximum rainfall events with the optimal IDF curve. The uncertainty is much larger when classifying the events compared to the IDF curves. Not only is the uncertainty from the estimation of distribution and copula relevant, but many events are close to the optimal IDF curve itself and thus within the range of estimation uncertainty. The sampling of confirmed and maximum rainfall events also directly affects the classification ratio by changing the composition of events that are evaluated.

5 Discussion

It is insightful to compare the IDF curves from this study with those otherwise obtained for Jamaica. Burgess et al. (2015) provide Jamaica's most recent IDF curves using long-time-series data from two stations in Jamaica and extending existing annual maximum records back to 1895. With a return period of 5 years and a duration of 12 h, they estimate intensities of around 7.2–11.4 mm h⁻¹, depending on the configuration. For a duration of 2 h and again a return period of 5 years, intensities are between 32–33 mm h⁻¹. For a return period of 5 years, the results from the current study suggest an intensity of 7.14 mm h⁻¹ for 10 h and 22.8 mm h⁻¹ for 3 h. The corresponding IDF curves are thus in a similar range but are more strongly convex than those in Burgess et al. (2015), where dependence is not explicitly addressed. This might be caused by the choice of the normal copula, which is well suited to depict convex dependence. It might also be caused by the type of data input in that Burgess et al. (2015) use data from stations in the two largest cities in Jamaica, namely Kingston and Montego Bay, spanning back to 1895. In contrast, the remote-sensing data employed in the current study cover the whole island but only since 2000. Given the large difference in the investigated time period, climatic factors likely impact the results. Considering the spatial rainfall climate in Fig. 1, there is a strong variation across the islands for different meteorological scales that changes with the duration of events in the satellite data. It is arguably advantageous for some applications to represent the island on average instead of in two specific locations, whereas the longer time series in Burgess et al. (2015) are preferred in other applications.

Another kind of comparison can be made concerning the marginal distributions and studies that employ extreme value modeling for flood risk prediction. The intensity of confirmed flash flood events and extreme rainfall events in Jamaica is GEV distributed. The shape parameter around 0.59 implies that intensity is Fréchet extreme value distributed and has a lower limit. The duration of confirmed flash flood events and extreme rainfall events in Jamaica is Weibull distributed and has an upper limit. The lower limit of intensity and upper limit of duration are consistent with the IETD definition, which forces events to be relatively short and have a minimum intensity to be considered. Several studies also find evidence for Fréchet-distributed intensity when constructing IDF curves with extreme value analysis, though the shape of the distribution appears to be highly case-sensitive (Sane et al., 2018; Bonaccorso et al., 2020; Yeo et al., 2021).

The quantification of extreme rainfall hazards through the IDF curve classification has direct applications for policymakers. One may first consider the case of the CCRIF XSR parametric insurance against excess rainfall that is based on specific rainfall thresholds for payouts. The most recent version, XSR 2.5, utilizes separate exposure, vulnerability, and hazard modules for each member. For the Caribbean, the module is triggered by rainfall events that exceed some country-specific average intensity threshold for 12 h (short events) or 48 h (long events). These country-specific thresholds are optimized to increase the likelihood of detecting severe events while not capturing false positives. The results from the current study aim at a threshold identification similar to the XSR but differ in methodology. However, the flash flood identification from conditional copula modeling could be coupled with a module of exposure and a specific vulnerability function. The IDF curve can also provide thresholds for events shorter than 12 h. For instance, a 6 h long rainfall event with an average intensity above 8.4 mm h⁻¹ is potentially flash flood inducing. Such an integrated model based on the IDF curves would be an alternative verification to the CCRIF XSR and reduce model uncertainty.

The methodology proposed here could also be employed for hazard warning services. The Climate Risk and Early Warning Systems (CREWS) Caribbean project aims at strengthening such services. One of the three project components is the institutional strengthening and capacity building of hydro-meteorological services and early warning systems. The simple decision rule within the intensity duration space derived in the current study could be adapted for such purposes. More precisely, given a local weather forecast for the next day and corresponding uncertainty, the risk of a potential flash flood event can be deduced. After the initial parameterization, a direct implementation into the forecasting routine comes at virtually no cost. Again, even if there exist other systems, introducing another model based on a different methodology can greatly reduce model uncertainty.

It must be pointed out that the proposed methodology suffers from some shortcomings. The focus on rainfall events as measured at a certain location ignores general meteorological conditions as well as conditions on the ground. Additional information such as antecedent rainfall and soil moisture, soil type, or slope gradients can be employed to get a more precise decision rule. Likely, these factors play a crucial role in the actual development of a hazard given a specific rainfall event. The current methodology with a bivariate copula at its core is not directly suited for additional variables. While trivariate and higher-dimension copulas do exist, they are much less understood. Trivariate copulas also impose some limits on the attainable negative dependence. Furthermore, adding another variable to the copula requires a disproportionately larger sample, where the sample density decreases exponentially with the number of dimensions. One should note that several of these shortcomings such as the sample density apply to other methodologies as well. Another potentially more fruitful route might be to consider separate copula functions for different topography classes or meteorological conditions instead of a unified model that explicitly accounts for these interdependencies.

The procedure also omits the role of tropical cyclones (TCs). It has long been recognized that in the Caribbean many instances of extreme rainfall and consequential flooding are due to TCs (Laing, 2004). Ideally, a classification scheme would take into account synoptic-scale weather events. Suppose the proposed classification scheme for flash flood incidents will be used to estimate the effect of extreme rainfall on the economy or for insurance schemes. In that case, evidence is necessary to distinguish it from TCs (Czajkowski et al., 2017). While the current study did verify via newspaper articles that the flash flood incidents are largely non-TC events, additional care is necessary for applications. For instance, Collalti and Strobl (2022) study the economic impacts of flooding during tropical storms in Jamaica and find that only a minor number of heavy rainfall events occur during tropical storms of hurricane strength compared to the number of flash flood incidents discussed in this study.

Still, the analysis has to be geographically interpreted in terms of the whole island of Jamaica since the cell-wise annual maxima are likely not independent of each other due to weather systems such as TCs affecting more than one cell. This type of dependence due to the simultaneous occurrences of extreme events at multiple stations is also known as spatial coherence or storm dependence and can lead to an underestimation of the risks associated with extreme precipitation at each specific location (Zhang et al., 2022). However, it is particularly challenging to model such a dependence adequately in a statistical analysis. This is the main reason why we refrain from making statements concerning a single location which would require the explicit modeling of dependence, for instance, via single-site conditioning (Wadsworth and Tawn, 2022). In other words, by not exploiting the timing or location relative to other observations in the data of yearly maximum events and thus focusing the analysis on the island level, the results are unbiased concerning the spatial dependence of single observations but do not provide any location-specific insights.

Another shortcoming in the generalization of the methodology concerns climate change and the stationarity of the purported relationship. There is a strong consensus that climate change will influence extreme precipitation and consequentially flood risk in many parts of the world, including the Caribbean (IPCC, 2012, 2023). One possibility would be to incorporate non-stationarity in time in the marginal extreme value distributions (Sam et al., 2023). This could be extended to the copula function, as Yin et al. (2018) demonstrate. However, this approach is not feasible in the current application due to the, in climatic terms, short time series of the data. Most climatic variation in a 20-year time series is likely internal climate oscillation like ENSO (Bedoya-Soto et al., 2019; Cai et al., 2020). There could, however, be an alternative route to tackle climate change in settings where the data length is short in the form of (external) climate change allowance (Kay et al., 2021). This would entail resampling and adjusting the satellite precipitation data by a climate change allowance factor derived from other studies. Running the methodology of conditional copula modeling with these new data, one could explicitly model the effect of climate change on flood risk and classification in a comparison.

6 Conclusions

This study uses 93 confirmed flash flood events in Jamaica from 2001 to 2018 to estimate intensity–duration–frequency (IDF) curves via conditional copula sampling. Rainfall information of flash flood events is taken from remote-sensing data, and additional data on location-specific yearly maximum rainfall events were constructed. This considerably larger sample of statistically similar events allows for higher robustness in the estimation. It further enables one to find an IDF curve threshold above which flash flood events become likely. This threshold corresponds to a return period of $2 \frac{1}{6}$ years. A comparison with IDF curves for Jamaica in Burgess et al. (2015) yields similar results in terms of absolute level, but these are less convex concerning extremely intense or long events. The simple nature of connecting the copula method for IDF curves with a classification for flash floods potentially opens up many applications in parametric insurance programs, regional risk mapping, and hazard warning systems. The current study abstracts from event-determining conditions other than local rainfall intensity and duration. Future research should, therefore, include other factors, such as soil type and terrain ruggedness in the conditional copula modeling, as well as incorporate synoptic-scale meteorological conditions and climate change scenarios.

Appendix A: Event definition

Depending on the IETD, the statistical properties of the events change. Values of IETD between 4 and 24 h are considered. Figure A1 shows how the average intensity and duration changes for the flash flood event data with different values of the IETD. There is a relatively sudden drop in mean intensity for IETDs above 21 h. Also, the mean duration increases one to one up until an IETD of 12 h, after which the slope becomes flatter. Both indicate that the IETD above 12 and 21 h results in imprecisely delimited events with regard to duration and intensity, respectively. Figure A2 shows the probability density function of duration and intensity for the confirmed flash flood event data (green) and maximum rainfall events (blue). In order to apply the two-sample approach, they should follow the same marginal distribution. The maximum rainfall events are more intense but shorter compared to the confirmed flash flood events. The resemblance between duration increases with higher IETD while there is no clear pattern for intensity. In conclusion, an IETD of 12 h is best suitable for the data at hand. Note that in Sect. 4.3, where marginals are estimated for the conditional copula modeling, Kolmogorov–Smirnov tests indicate that the marginal distributions of the maximum rainfall event data are suitable for the smaller-confirmed-event data as well.

https://nhess.copernicus.org/articles/24/873/2024/nhess-24-873-2024-f11

Figure A1Mean event intensity and duration for different values of IETD.

Download

https://nhess.copernicus.org/articles/24/873/2024/nhess-24-873-2024-f12

Figure A2Probability density plots for duration and intensity for an IETD of 6, 12, and 24 h.

Download

Table A1Summary statistics.

Summary table of events with IETD of 12 h for all locations with a confirmed flash flood event.

Download Print Version | Download XLSX

Code and data availability

Shapefiles of all flood events in the study region and period are available from the Jamaican Office of Disaster and Preparedness Management (ODPEM). The Integrated Multi-satellitE Retrievals for GPM (IMERG) data are freely available from NASA. Cleaned flash flood event data derived from the ODPEM shapefiles are available under https://doi.org/10.48620/364 (Collalti, 2024) together with code that merges them with the GPM (IMERG), performs the main analysis, and creates figures.

Author contributions

ES and DC conceived the research framework and developed the methodology. DC was responsible for the code compilation, data analysis, graphic visualization, and first-draft writing. NS and ES participated in the data collection of this study. All authors discussed the results and contributed to the final version of the paper.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

We would like to thank the participants of the Workshop on Compound Weather and Climate Events 2021 (Bern, Online) and the IPWSD 2021 (Columbia University, Online) for their valuable comments.

Review statement

This paper was edited by Olga Petrucci and reviewed by Ify Nwaogazie and one anonymous referee.

References

Aleotti, P.: A warning system for rainfall-induced shallow failures, Eng. Geol., 73, 247–265, 2004. a

Alfieri, L., Salamon, P., Pappenberger, F., Wetterhall, F., and Thielen, J.: Operational early warning systems for water-related hazards in Europe, Environ. Sci. Policy, 21, 35–49, 2012. a

Amponsah, W., Ayral, P.-A., Boudevillain, B., Bouvier, C., Braud, I., Brunet, P., Delrieu, G., Didon-Lescot, J.-F., Gaume, E., Lebouc, L., Marchi, L., Marra, F., Morin, E., Nord, G., Payrastre, O., Zoccatelli, D., and Borga, M.: Integrated high-resolution dataset of high-intensity European and Mediterranean flash floods, Earth Syst. Sci. Data, 10, 1783–1794, https://doi.org/10.5194/essd-10-1783-2018, 2018. a

Ariff, N., Jemain, A., Ibrahim, K., and Zin, W. W.: IDF relationships using bivariate copula for storm events in Peninsular Malaysia, J. Hydrol., 470, 158–171, 2012. a, b, c, d

Bedoya-Soto, J. M., Poveda, G., Trenberth, K. E., and Vélez-Upegui, J. J.: Interannual hydroclimatic variability and the 2009–2011 extreme ENSO phases in Colombia: from Andean glaciers to Caribbean lowlands, Theor. Appl. Climatol., 135, 1531–1544, 2019. a

Bezak, N., Šraj, M., and Mikoš, M.: Copula-based IDF curves and empirical rainfall thresholds for flash floods and rainfall-induced landslides, J. Hydrol., 541, 272–284, 2016. a, b, c, d

Bonaccorso, B., Brigandì, G., and Aronica, G. T.: Regional sub-hourly extreme rainfall estimates in Sicily under a scale invariance framework, Water Resour. Manag., 34, 4363–4380, 2020. a

Burgess, C. P., Taylor, M. A., Stephenson, T., and Mandal, A.: Frequency analysis, infilling and trends for extreme precipitation for Jamaica (1895–2100), Journal of Hydrology: Regional Studies, 3, 424–443, 2015. a, b, c, d, e, f

Cai, W., McPhaden, M. J., Grimm, A. M., Rodrigues, R. R., Taschetto, A. S., Garreaud, R. D., Dewitte, B., Poveda, G., Ham, Y.-G., Santoso, A., and Ng, B.: Climate impacts of the El Niño–southern oscillation on South America, Nature Reviews Earth & Environment, 1, 215–231, 2020. a

Chen, L., and Guo, S.: Copulas and its application in hydrology and water resources, Springer, https://doi.org/10.1007/978-981-13-0574-0, 2019. a

Climate Studies Group Mona: The State of the Jamaican Climate 2019: Historical and Future Climate Changes for Jamaica, Produced for the Planning Institute of Jamaica (PIOJ), Kingston Jamaica, https://www.pioj.gov.jm/product/the-state-of-the-jamaican-climate-2019-historical-and-future-climate-changes-for-jamaica/ (last access: 26 January 2024), 2020. a, b

Coles, S., Bawa, J., Trenner, L., and Dorazio, P.: An introduction to statistical modeling of extreme values, vol. 208, Springer, London, ISBN 1-85233-459-2, 2001. a

Collalti, D.: Flash Flood Detection via Copula-based IDF Curves: Evidence from Jamaica, Bern Open Repository and Information System [code and data set], https://doi.org/10.48620/364, 2024. a

Collalti, D. and Strobl, E.: Economic damages due to extreme precipitation during tropical storms: evidence from Jamaica, Nat. Hazards, 110, 2059–2086, 2022. a, b

Czajkowski, J., Villarini, G., Montgomery, M., Michel-Kerjan, E., and Goska, R.: Assessing current and future freshwater flood risk from North Atlantic tropical cyclones via insurance claims, Sci. Rep.-UK, 7, 41609, 2017. a

De Michele, C., Salvadori, G., Vezzoli, R., and Pecora, S.: Multivariate assessment of droughts: Frequency analysis and dynamic return period, Water Resour. Res., 49, 6985–6994, 2013. a

Gencer, E.: An Overview of Urban Vulnerability to Natural Disasters and Climate Change in Central America & the Caribbean Region, FEEM Working Paper No. 78, 2013, https://doi.org/10.2139/ssrn.2334068, 2013. a

Gilewski, P. and Nawalany, M.: Inter-comparison of rain-gauge, radar, and satellite (IMERG GPM) precipitation estimates performance for rainfall-runoff modeling in a mountainous catchment in Poland, Water, 10, 1665, https://doi.org/10.3390/w10111665, 2018. a

Grønneberg, S. and Hjort, N. L.: The copula information criteria, Scand. J. Stat., 41, 436–459, 2014. a, b

Hapuarachchi, H., Wang, Q., and Pagano, T.: A review of advances in flash flood forecasting, Hydrol. Process., 25, 2771–2784, 2011. a

Hesterberg, T.: Bootstrap, WIREs Computational Statistics, 3, 497–526, 2011. a

Hofert, M., Mächler, M., and McNeil, A. J.: Likelihood inference for Archimedean copulas in high dimensions under known margins, J. Multivariate Anal., 110, 133–150, 2012. a

Hofert, M., Kojadinovic, I., Mächler, M., and Yan, J.: Elements of copula modeling with R, Springer, Switzerland, https://doi.org/10.1007/978-3-319-89635-9, 2018. a, b, c, d

Hosseinzadehtalaei, P., Tabari, H., and Willems, P.: Climate change impact on short-duration extreme precipitation and intensity–duration–frequency curves over Europe, J. Hydrol., 590, 125249, https://doi.org/10.1016/j.jhydrol.2020.125249, 2020. a

Hu, P., Zhang, Q., Shi, P., Chen, B., and Fang, J.: Flood-induced mortality across the globe: Spatiotemporal pattern and influencing factors, Sci. Total Environ., 643, 171–182, 2018. a

Huffman, G. J., Bolvin, D. T., Joyce, R., Kelly, O. A., Nelkin, E. J., Tan, J, Watters, D. C., and, West, J.: Integrated Multi-satellitE Retrievals for GPM (IMERG) Technical Documentation, NASA/GSFC Code, Maryland, https://gpm.nasa.gov/sites/default/files/2023-07/IMERG_TechnicalDocumentation_final_230713.pdf (last access: 26 January 2024), 2023. a

IPCC: Summary for Policymakers, in: Managing the Risks of Extreme Events and Disasters to Advance Climate Change Adaptation, edited by: Field, C. B., Barros, V., Stocker, T. F., Qin, D., Dokken, D. J., Ebi, K. L., Mastrandrea, M. D., Mach, K. J., Plattner, G.-K., Allen, S. K., Tignor, M., and Midgley, P. M., A Special Report of Working Groups I and II of the Intergovernmental Panel on Climate Change. Cambridge University Press, Cambridge, UK, and New York, NY, USA, 1–19, 2012. a, b

IPCC: Climate Change 2023: Synthesis Report. Summary for Policymakers. Contribution of Working Groups I, II and III to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change, Core Writing Team, edited by: Lee, H. and Romero, J., IPCC, Geneva, Switzerland, 1–34, https://doi.org/10.59327/IPCC/AR6-9789291691647.001, 2023. a

Joe, H.: Dependence modeling with copulas, Chapman & Hall/CRC Monographs on Statistics & Applied Probability, Boca Raton, USA, ISBN 978-1-4665-8322-1, 2014. a

Jonkman, S. N.: Global perspectives on loss of human life caused by floods, Nat. Hazards, 34, 151–175, 2005. a

Kay, A., Rudd, A., Fry, M., Nash, G., and Allen, S.: Climate change impacts on peak river flows: Combining national-scale hydrological modelling and probabilistic projections, Climate Risk Management, 31, 100263, https://doi.org/10.1016/j.crm.2020.100263, 2021. a

Kojadinovic, I. and Yan, J.: Modeling multivariate distributions with continuous margins using the copula R package, J. Stat. Softw., 34, 1–20, 2010. a

Koutsoyiannis, D., Kozonis, D., and Manetas, A.: A mathematical framework for studying rainfall intensity–duration–frequency relationships, J. Hydrol., 206, 118–135, 1998. a, b, c

Laing, A. G.: Cases of heavy precipitation and flash floods in the Caribbean during El Niño winters, J. Hydrometeorol., 5, 577–594, 2004. a

Li, H., Wang, D., Singh, V. P., Wang, Y., Wu, J., Wu, J., Liu, J., Zou, Y., He, R., and Zhang, J.: Non-stationary frequency analysis of annual extreme rainfall volume and intensity using Archimedean copulas: A case study in eastern China, J. Hydrol., 571, 114–131, 2019. a, b, c

Linkin, M. E.: Excess Rainfall Product for the Caribbean Region-Developed by The CCRIF and Swiss Re, in: AGU Fall Meeting Abstracts, vol. 2014, pp. NH51C–04, 2014. a

Lumbroso, D., Boyce, S., Bast, H., and Walmsley, N.: The challenges of developing rainfall intensity–duration–frequency curves and national flood hazard maps for the Caribbean, J. Flood Risk Manag., 4, 42–52, 2011. a

Martel, J.-L., Brissette, F. P., Lucas-Picher, P., Troin, M., and Arsenault, R.: Climate change and rainfall intensity–duration–frequency curves: Overview of science and guidelines for adaptation, J. Hydrol. Eng., 26, 03121001, https://doi.org/10.1061/(ASCE)HE.1943-5584.0002122, 2021. a

Martinez, C., Goddard, L., Kushnir, Y., and Ting, M.: Seasonal climatology and dynamical mechanisms of rainfall in the Caribbean, Clim. Dynam., 53, 825–846, 2019. a

Melillo, M., Brunetti, M. T., Peruccacci, S., Gariano, S. L., and Guzzetti, F.: An algorithm for the objective reconstruction of rainfall events responsible for landslides, Landslides, 12, 311–320, 2015. a

Monioudi, I. N., Asariotis, R., Becker, A., Bhat, C., Dowding-Gooden, D., Esteban, M., Feyen, L., Mentaschi, L., Nikolaou, A., Nurse, L., and Phillips, W.: Climate change impacts on critical international transportation assets of Caribbean Small Island Developing States (SIDS): the case of Jamaica and Saint Lucia, Reg. Environ. Change, 18, 2211–2225, 2018. a

Nandi, A., Mandal, A., Wilson, M., and Smith, D.: Flood hazard mapping in Jamaica using principal component analysis and logistic regression, Environ. Earth Sci., 75, 1–16, 2016. a

Nelsen, R. B.: An introduction to copulas, Springer Science & Business Media, New York, USA, ISBN 978-0387-28659-4, 2007. a

Peterson, T. C., Taylor, M. A., Demeritte, R., Duncombe, D. L., Burton, S., Thompson, F., Porter, A., Mercedes, M., Villegas, E., Semexant Fils, R., and Klein Tank, A.: Recent changes in climate extremes in the Caribbean region, J. Geophys. Res.-Atmos., 107, 4601, https://doi.org/10.1029/2002JD002251, 2002. a

Piciullo, L., Gariano, S. L., Melillo, M., Brunetti, M. T., Peruccacci, S., Guzzetti, F., and Calvello, M.: Definition and performance of a threshold-based regional early warning model for rainfall-induced landslides, Landslides, 14, 995–1008, 2017. a

Pinos, J. and Quesada-Román, A.: Flood risk-related research trends in Latin America and the Caribbean, Water, 14, 10, https://doi.org/10.3390/w14010010, 2021. a

Renard, B. and Lang, M.: Use of a Gaussian copula for multivariate extreme value analysis: Some case studies in hydrology, Adv. Water Resour., 30, 897–912, 2007. a

Salvadori, G. and De Michele, C.: Multivariate multiparameter extreme value models and return periods: A copula approach, Water Resour. Res., 46, W10501, https://doi.org/10.1029/2009WR009040, 2010. a

Salvadori, G., Durante, F., De Michele, C., Bernardi, M., and Petrella, L.: A multivariate copula-based framework for dealing with hazard scenarios and failure probabilities, Water Resour. Res., 52, 3701–3721, 2016. a

Sam, M. G., Nwaogazie, I. L., Ikebude, C., Inyang, U. J., and Irokwe, J. O.: Modeling Rainfall Intensity-Duration-Frequency (IDF) and Establishing Climate Change Existence in Uyo-Nigeria Using Non-Stationary Approach, Journal of Water Resource and Protection, 15, 194–214, 2023. a, b

Sane, Y., Panthou, G., Bodian, A., Vischel, T., Lebel, T., Dacosta, H., Quantin, G., Wilcox, C., Ndiaye, O., Diongue-Niang, A., and Diop Kane, M.: Intensity–duration–frequency (IDF) rainfall curves in Senegal, Nat. Hazards Earth Syst. Sci., 18, 1849–1866, https://doi.org/10.5194/nhess-18-1849-2018, 2018. a, b

Silvestro, F., Rossi, L., Campo, L., Parodi, A., Fiori, E., Rudari, R., and Ferraris, L.: Impact-based flash-flood forecasting system: Sensitivity to high resolution numerical weather prediction systems and soil moisture, J. Hydrol., 572, 388–402, 2019. a

Singh, V. P. and Zhang, L.: IDF curves using the Frank Archimedean copula, J. Hydrol. Eng., 12, 651–662, 2007. a, b

Spencer, N. and Polachek, S.: Hurricane watch: Battening down the effects of the storm on local crop production, Ecol. Econ., 120, 234–240, 2015. a

Stephenson, T. S., Vincent, L. A., Allen, T., Van Meerbeeck, C. J., McLean, N., Peterson, T. C., Taylor, M. A., Aaron-Morrison, A. P., Auguste, T., Bernard, D., and Boekhoudt, J. R.: Changes in extreme temperature and precipitation in the Caribbean region, 1961–2010, Int. J. Climatol., 34, 2957–2971, 2014. a

Suresh, A. and Pekkat, S.: Importance of Copula-Based Bivariate Rainfall Intensity–Duration–Frequency Curves for an Urbanized Catchment Incorporating Climate Change, J. Hydrol. Eng., 28, 05023012, 2023. a

Tang, G., Clark, M. P., Papalexiou, S. M., Ma, Z., and Hong, Y.: Have satellite precipitation products improved over last two decades? A comprehensive comparison of GPM IMERG with nine satellite and reanalysis datasets, Remote Sens. Environ., 240, 111697, https://doi.org/10.1016/j.rse.2020.111697, 2020. a

Wadsworth, J. L. and Tawn, J.: Higher-dimensional spatial extremes via single-site conditioning, Spat. Stat.-Neth., 51, 100677, https://doi.org/10.1016/j.spasta.2022.100677, 2022. a

Wilson, M., Mandal, A., Taylor, M., Burgess, C., Campbell, J., and Stepphenson, T.: Flood Risk and Climate Change in Negril, Jamaica: An Assessment of Combined Terrestrial and Coastal Flood Risk Driven by Projections of Future Climate, in: WCrP Conference for Latin America and the Caribbean: Developing, Linking and Applying Climate knowledge, Montevideo, Uruguay, 17–21 March 2014, 17–21, 2014. a

Yeo, M.-H., Nguyen, V.-T.-V., and Kpodonu, T. A.: Characterizing extreme rainfalls and constructing confidence intervals for IDF curves using Scaling-GEV distribution model, Int. J. Climatol., 41, 456–468, 2021. a

Yin, J., Guo, S., He, S., Guo, J., Hong, X., and Liu, Z.: A copula-based analysis of projected climate changes to bivariate flood quantiles, J. Hydrol., 566, 23–42, 2018. a

Zhang, L., Risser, M. D., Molter, E. M., Wehner, M. F., and O'Brien, T. A.: Accounting for the spatial structure of weather systems in detected changes in precipitation extremes, Weather and Climate Extremes, 38, 100499, https://doi.org/10.1016/j.wace.2022.100499, 2022. a

Zscheischler, J., Orth, R., and Seneviratne, S. I.: Bivariate return periods of temperature and precipitation explain a large fraction of European crop yields, Biogeosciences, 14, 3309–3320, https://doi.org/10.5194/bg-14-3309-2017, 2017. a

The authors' calculation is performed using the EM-DAT database. Since 2000, 1.7 billion people have been affected by floods, followed by droughts (1.4 billion), storms (0.8 billion), and earthquakes (0.12 billion).

https://floodlist.com/, last access: 11 January 2023.

There are also some instances where a specific dependence has been assumed from theoretical considerations; see Koutsoyiannis et al. (1998)

⁴

http://metservice.gov.jm/aws/, last access 3 January 2023.

⁵

Most commonly, risk is defined as the combination of the three components hazard, exposure, and vulnerability. Hazard relates to the physical phenomenon – in this case, flash floods. Exposure could be in terms of people, buildings, or economic assets at risk of the hazard. Vulnerability then links the hazard to the exposure and translates it to risk. For instance, given a flash flood hazard, the vulnerability of urban or agricultural settlements (exposure) is different, and as such, the risk is different as well.

⁶

This threshold is naturally higher than the simple empirical analog of the 93 confirmed events in Jamaica during the 18-year period because the geographical resolution is higher: looking at smaller-scale areas, each of these areas' flood probability has to be lower than that of the whole island.

Articles

Short summary

The risk of extreme rainfall events causing floods is likely increasing with climate change. Flash floods, which follow immediately after extreme rainfall, are particularly difficult to forecast and assess. We develop a decision rule for flash flood classification with data on all incidents between 2001 and 2018 in Jamaica with the statistical copula method. This decision rule tells us for any rainfall event of a certain duration how intense it has to be to likely trigger a flash flood.