the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.

The dynamics of peak head responses at Dutch canal dikes and the impact of climate change
Matthijs Kok
Managing the water and flood risk in low-lying polder regions depends on the performance of canal dikes. This performance is influenced by hydraulic heads, which can peak due to heavy rainfall, affecting their stability and potentially inducing dike breaches. Variations in head responses and head statistics are relevant for regional flood risk analysis of canal dike systems. This study examined the dynamics of peak heads in canal dikes on a national scale using time series models calibrated on a unique dataset of head observations across the dike system. Various model structures were evaluated, and a non-linear model performed the best. These models were used to simulate 30 years of head time series representing current and future climate scenarios. Subsequently, dike clusters were identified based on the coincidence of peak heads, allowing for the identification of dikes where peaks are caused by (dis)similar types of rainfall events. The differences and similarities in peak head response between dikes and identified clusters were related to physical dike characteristics. While the subsurface material and dike width appeared to influence the head response variation of clusters, their presence across multiple clusters indicates that they do not yield a definitive outcome. Moreover, peak head statistics across various dikes indicated that extreme and yearly occurring load conditions are relatively close to each other, with a median decimate height of only 15 cm. With climate change driving higher winter precipitation and summer evaporation, head statistics are changing. By 2100, extreme peak heads are expected to occur between 3 times less and 8 times more frequently, depending on the climate scenario and the type of canal dike.
- Article
(16410 KB) - Full-text XML
- BibTeX
- EndNote
Catastrophic dike failures have occurred throughout history due to various causes, such as storm surges, extreme river discharges, ice drifts, and extreme weather conditions like heavy rainfall or drought. Several failure mechanisms were involved, including overflow and overtopping, external erosion, piping, and inner slope instability (Van Baars and Van Kempen, 2009; Özer et al., 2019). For many dikes along rivers and coasts, inner slope instability occurs due to the infiltration of water into the dike body and its foundation, leading to higher head levels and pore-water pressures, reducing effective stresses and shear strength of the soil (Frank et al., 2004; Ridley et al., 2004; Sharp et al., 2013; van Woerkom et al., 2021; Van der Krogt et al., 2022). The infiltration of water can be caused by high water levels against the dike as well as heavy rainfall (Rikkert, 2022; Van Baars and Van Kempen, 2009). For dikes with controlled water levels that show little fluctuations, such as canal dikes, the infiltration of water caused by heavy rainfall can be significant and is considered a primary mechanism of dike failure. Canal dikes are among others present in polders, which can be found in coastal and alluvial lowlands all over the world, like the Netherlands, Bangladesh, Vietnam, England and China (Martín-Antón et al., 2016; Morton and Olson, 2018; Lendering et al., 2018; Tran and Weger, 2018; Triet et al., 2017; Manh et al., 2013; Warner et al., 2018). The water levels in these reclaimed areas are artificially regulated by an internal drainage system with canals, where water levels can reach several metres above the surrounding terrain (see Fig. 1). This makes these low-lying areas vulnerable to floods in the event of a canal dike breach. These canal dikes are found not only in polders but also along internal waterways and irrigation canals worldwide, where major dike failures have occurred throughout history (Gildeh et al., 2019).

Figure 1Two examples of canal dikes in the Netherlands: Duifpolder is shown on the left and the Drooggemaakte Geer- en Kleine Blankaardpolder is depicted on the right. In both images, the canal has permanent high water levels close to the crest level (on the left side) and the low-lying polder is mainly used for agriculture (on the right side). The head differences between the canal water levels and water levels in the polders are 2.7 m (Duifpolder) and 4.3 m (Drooggemaakte Geer- en Kleine Blankaardpolder). The red tubes protect the measuring equipment and the piezometers used to measure the hydraulic head. Photos by EURECO/Cyril Liebrand (2022).
To manage flood risks in an embanked area, the performance of dikes plays a crucial role. The failure probabilities of the individual elements or dike stretches contribute to the reliability of the canal dike system and the flood risk level in an area (Vanmarcke, 1977; Kanning, 2012; Jongejan et al., 2020). The Netherlands has an extensive system of canal dikes critical for managing its low-lying land and protecting it against flooding, with a total length of more than 10 000 km (Pleijster et al., 2015). In general, the failure probability of a dike system increases with the total length of the dike system, due to partial correlation or independency between different individual dike stretches (Kanning, 2012; Vrijling and van Gelder, 2002). This phenomenon is known as the length effect, which is caused by both spatial dependencies in the resistance and loading conditions and differs for each failure mechanism. Inner slope instability is a failure mechanism of the Dutch canal dikes that contributes significantly to the calculated failure probability and flood risks in the polders, where the load primarily consists of high hydraulic head peaks (Lendering et al., 2018; Rikkert et al., 2022; Van Baars and Van Kempen, 2009). The variations of the canal water levels in the drainage systems of Dutch polders are small (typically up to tens of centimetres), while the observed hydraulic head fluctuations are an order of magnitude larger than water level fluctuations. Therefore, the fluctuations of the hydraulic heads are primarily driven by rainfall and evaporation. Whether two nearby canal dikes both experience extreme load conditions after a heavy rainfall event depends on the head response of the dikes. Variations in the head response can cause extreme loads to occur after different weather events and influence the system's reliability. Furthermore, these variations in response also help water authorities identify threatening situations, as weather forecasts can be translated to potential peak head levels in dikes.
To calculate the failure probability of individual canal dikes and dike systems, information about peak head responses is essential. Currently, there is limited understanding regarding the spatial variability in head responses and head statistics in canal dikes, which is partly due to the lack of measurements and the extensive computation time required for groundwater models. Multiple studies have modelled the effects of rainfall and evaporation on the phreatic surface in dikes using different approaches (Rikkert, 2022; Jamalinia et al., 2020; van Esch, 2012). Multi-year measurements of hydraulic head levels are often lacking in dikes, which makes modelling exercises difficult to validate. Furthermore, the validation of models is hindered by the heterogeneity of dikes and the unknown field hydraulic conductivities, potentially influenced by burrowing animals, plant roots or cracks, and resulting flow paths. This means that there is also little known about the effects of climate change on head statistics and failure probabilities of canal dikes. Future climate projections indicate increasing temperatures with summers becoming hotter and drier and winters becoming warmer and wetter. This is expected to affect the stability of slopes. Although the impact has been studied for both natural slopes (e.g. Moore et al., 2010) and earthworks (e.g. Huang et al., 2024; Rouainia et al., 2020) for canal dikes with different boundary conditions, subsurface materials and resulting head dynamics, the effects are studied to a limited extent. This study aims to assess the dynamics of peak hydraulic heads in canal dikes on a national level, caused by heavy rainfall events, by analysing the variation in head responses and head statistics. It also seeks to understand why differences in head dynamics occur by relating these variations to the physical properties of the dikes. Furthermore, the potential impact of climate change on the head statistics is quantified, indicating how flood risks in Dutch polders are expected to change in the future.
2.1 Dutch canal dike system
In the Netherlands, the threat of flooding is controlled by a system of flood defences, where distinction can be made between primary and regional flood defences. The primary flood defences are located along major bodies of water, such as the sea, the major rivers and large lakes, often referred to as outside waters, while regional defences are found along inland waters, including drainage canals, artificial lakes and smaller rivers. In general, a breach in regional defences will have a smaller impact than a breach in the primary defences, though it can still have considerable consequences, as shown in Fig. 2. This study focuses on a subset of the regional flood defences, namely the canal dikes. The canal dikes are primarily located in the western and northern parts of the Netherlands, where the polders are located (see Fig. 2). These cultivated lowland areas serve as agricultural land as well as for human settlement. Many cities, villages and small communities are situated throughout the polders. The water inside the polder is separated from the outside water by the primary flood defences, and the polder drainage systems manage the water inside the primary flood defences. The water is managed by discharging or pumping the polder water into canals (also called the boezem in Dutch), after which the canals release the water into the outside water, either naturally or using pumps (Steenbergen et al., 2009). Water levels in the canals are higher than the polder levels, resulting in a flood hazard for the polders that are protected by the canal dikes. The subsurface of canal dikes is characterized by low-permeability soils that mainly consist of clay and peat, and in the past many canal dikes breached (Van Baars and van Kempen, 2009).

Figure 2Overview of the study area with the canal dike system (subset within the regional flood defence system for the provinces of South Holland, North Holland, Friesland, Groningen and Utrecht). Monitoring sites are indicated by circle markers, with orange and green denoting collected data and the data checked and utilized for further analysis, respectively. Maximum inundation depths are depicted to illustrate potential flood impact in the polders.
2.2 Data collection
2.2.1 Head observations and preparation checks
To set up and calibrate groundwater models, head observations in canal dikes were collected and received from seven Dutch regional water authorities, namely Hoogheemraadschap Schieland & de Krimpenerwaard (HHSK), Hoogheemraadschap Delfland (DL), Hoogheemraadschap Rijnland (RL), Hoogheemraadschap Hollands Noorderkwartier (HHNK), Weterksip Fryslân (WF), Waterschap Amstel, Gooi and Vecht (AGV) and Hoogheemraadschap De Stichtse Rijnlanden (HDSR). The crest levels of the dikes with head observations vary from around NAP+1.5 m to NAP−2.5 m (all elevations are relative to the Dutch reference level called Normaal Amsterdams Peil (NAP), which is approximately mean sea level) and polder water levels ranging from NAP−2 m to NAP−6.5 m, highlighting that many canal dikes lie below sea level.
In total, 258 head observations at 91 monitoring sites were collected in this study (see Fig. 2). Multiple head observations can be located at one monitoring site, where piezometers are aligned within a dike cross-section, e.g. measurements in the crest, inner slope and toe of the dike. At each site, up to five piezometers were installed with filter depths reaching up to approximately 5 m below the surface to measure phreatic head levels. Monitoring sites can be located near each other, e.g. 20 m, while still exhibiting varying responses due to differences in subsurface materials, highlighting the large spatial heterogeneity. Consequently, the distance between monitoring sites was not used to exclude any observations. The heads were measured with automatic pressure loggers, with hourly measurement intervals in the period between 2006 and 2023 and were resampled to daily-mean values for the analysis. For further analysis, only time series that are longer than 2.5 years were selected. Additionally, any time series exhibiting visual anomalies attributed to failing measurement devices or odd behaviour such as pronounced drift, absence of fluctuations or inexplicable jumps were removed from the dataset. The monitoring sites were also checked for whether the head observations are located in a dike, since sometimes the dike is more like a quay without a slope. In these situations, the dike was not considered for further analyses. The head dynamics in a dike are complex and also location-dependent within the dike profile, since the head in the outer crest can respond differently from head levels in the inner slope. Only head observations that are located in the talud zone or mid-slope of the dike were used (see Fig. 3). This is the area between the inner crest (the top of the dike at the polder side) and the toe of the dike, where the most variations in groundwater levels are expected. This is because it is farthest from the regulated water levels in the canal and polder, which are maintained at the target levels.
The resulting dataset consists of 108 head time series at 48 monitoring sites, consisting of phreatic head levels measured in the dike body. The length of the head time series varies between 3 and 9 years, with an average length of 5 years.
2.2.2 Precipitation and evaporation
The Royal Netherlands Meteorological Institute (KNMI) provides several data products about weather and climate. In this study, two data products for rainfall and evaporation (on daily basis) were used, serving the purpose of (1) getting the best estimate of the historic local weather conditions at the head observation sites (for the calibration of models) and (2) getting long term time series of the weather representing the current and future climate situation in the Netherlands (for extending the head time series). First, the local historic weather conditions were extracted from rainfall and potential evaporation maps of the Netherlands, namely the radar-derived precipitation amounts (Wolters et al., 2013) and inverse distance weighting (IDW) interpolated potential evaporation amounts based on the KNMI ground stations. The evaporation maps give estimates of the daily Makkink reference evapotranspiration derived from ground observations of the global radiation and the average daily temperature (De Bruin, 1987). Secondly, to derive extended head time series encompassing more extreme events, 30 years of precipitation and evaporation time series corresponding to different (current and future) climate scenarios were used (Van Dorland et al., 2023). These time series are a representation of the climate and not an estimate of the actual weather. Since this study focuses on variations in peak heads caused by different head responses, rather than from spatial variations in the nature of the load (dimensions of weather events), the 30-year time series at one location was used for all sites. The station Aalsmeer, close to Amsterdam, was used, which lies rather central in the western Netherlands (see Fig. 2). In total, nine 30-year time series were used. One time series corresponds to the current climate and eight to future climate scenarios with combinations of two time horizons (2050 and 2100), two greenhouse gas emission pathways and two types of regional climate responses. The emission scenarios include SSP1-2.6 (a low-emission scenario that assumes sustainable development) and SSP5-8.5 (a high-emission scenario that assumes fossil-fuel-intensive development). Each emission scenario is split into a wet-trending and a dry-trending regional response, reflecting the uncertainty in how precipitation patterns may shift in the Netherlands. All future scenarios with different emission levels and regional climate responses predict an increase in winter precipitation and drier summers, accompanied by increased evaporation and reduced precipitation. Although these trends occur across all future scenarios, their intensity varies across different scenarios. These combinations capture a wide range of future scenarios, allowing us to assess the sensitivity of head statistics to climate change across time, emission pathways and regional climate responses.

Figure 3A simplified cross-sectional profile of a canal dike with regulated water levels (TL = target level) on both sides (canal and polder) and the variation in the phreatic surface (blue dashed lines). In this study, the groundwater levels in the talud zone of the dike are analysed. The conceptualization of several dike characteristics is highlighted in red, as discussed in Sect. 3.3. The 90th and 10th percentile profile points were derived from the elevation profile as an approximation of the dike slope.
This study takes a novel approach by combining a unique nationwide dataset of head observations in canal dikes with time series modelling to investigate how canal dikes respond to heavy rainfall. The approach developed for assessing the dynamics of hydraulic heads is shown in Fig. 4. After collecting head observations in canal dikes, time series models were set up to simulate hydraulic heads in canal dikes, using precipitation and potential evaporation as the explanatory time series. Several model structures were evaluated, and the one with the overall best performance was selected. Only models that meet the specified reliability criteria (minimum goodness of fit and sufficient time series length compared to model parameters) were selected, resulting in a set of models that explain the fluctuations of the observed head levels in different dikes with a variety of head responses (step 1). These models were forced with 30 years of precipitation and evaporation time series, corresponding to different climate scenarios (current and future conditions). This was done to obtain extended head time series that encompass more extreme events (step 2), facilitating the analysis of both the dynamics and statistics associated with extreme occurrences. The variation in head responses was quantified by analysing the coincidence of the head peaks across canal dikes, selected using the peaks-over-threshold method and classifying different clusters of dikes with similar head responses. The variation and similarity of head responses were related to several physical dike characteristics, like subsurface material and dike profile, to search for explanations of the differences found (step 3). Finally, a generalized Pareto distribution (GPD) is fitted to the head peaks that describe the probability of occurrence of peak values. The variations in Wand properties of the head statistics are analysed, as well as the impact of climate change (step 4).
3.1 Groundwater modelling in dikes
Several modelling approaches can be used to model the hydraulic head in canal dikes. Commonly used approaches comprise numerical groundwater models, like Hydrus-2D, PlaxFlow and MODFLOW, that are solutions to (systems of) differential equations that describe the flow of groundwater (Šimůnek et al., 1999; McDonald and Harbaugh, 2003). These approaches need detailed information on material behaviour for both unsaturated and saturated soils. In the case of canal dikes, Van Esch (2012) showed that it remains difficult to reproduce observed hydraulic heads in dikes because of the uncertain conceptualization of the subsurface, spatial heterogeneity and applied boundary conditions. Time series modelling is a simplified and abstract representation of head fluctuations at one point resulting from the complex 3D movement of water in the dike (Bakker and Schaars, 2019). It is a data-driven approach that can estimate the contribution of independent drivers (rainfall, evaporation, water levels, etc.) on the observed head levels derived exclusively from observed data. This approach is used in this study.
3.1.1 Time series models
The basic principles of time series analysis come from the statistical sciences (Box and Jenkins, 1970). Transfer function noise (TFN) modelling is a subfield within time series analysis that aims to convert one or more input series into an output series using a statistical model. Von Asmuth et al. (2002) presented a novel form of TFN models that relies on the concepts of convolution and predefined impulse response functions and is used for many applications within groundwater science. Predefined response functions are used to estimate the effect of a unit pulse of a driver, like precipitation, on the head response. The head response is simulated through the convolution of various drivers with their response functions. The basic model structure of a TFN model to simulate heads may be written as
where h(t) is the observed heads, hm(t) is the contribution of drivers m to the head, d is the base elevation of the model and r(t) represents the residuals. The number of drivers M in each model varies based on the selected model structure. The contribution of driver m to the head is computed through convolution:
where Sm(τ) is a time series of driver m at preceding time τ, and θm(t−τ) is the associated impulse response function determining how much of that past driver still influences the head at time t. A variety of impulse response functions can be used to simulate the effects of certain drivers, where commonly used impulse response functions are the scaled gamma and the exponential response functions (Collenteur et al., 2019). The exponential response function is the simplest response function with only two parameters and may be used for drivers that have an immediate effect on the head, like the shallow head levels in the canal dikes (up to a few metres below ground level). Together with the small geohydrological dimensions (the dimensions of the dike are typically only a few tens of metres), a relatively rapid response is expected, which is confirmed by measurements where head observations respond quickly to rainfall, despite the presence of low-permeability soils.
3.1.2 Various time series model structures
The head fluctuations are explained by the contribution of various hydrological drivers that are convoluted with a response function. Various model structures can be chosen, incorporating different drivers with different response functions. A driver can also be the combination of multiple drivers. The net groundwater recharge R(t) is frequently used as a driver that is derived from rainfall P(t) and Makkink reference evaporation Ep(t) as inputs (e.g. von Asmuth et al., 2008):
where the parameter f is the so-called evaporation factor used to scale the reference evaporation. This model is referred to as a linear recharge model, after which Sm is substituted by the net recharge R(t) and then convoluted with a response function to determine the impact of recharge on the head. In this formulation, processes such as surface runoff are not accounted for but may be relevant for canal dikes and can be incorporated using non-linear models.
Non-linear recharge models, such as those based on soil-water balance concepts, like FlexModel or Berendrecht, offer a way to incorporate additional hydrological processes, like surface runoff. They introduce more complexity and typically require more model parameters (Collenteur et al., 2021; Berendrecht et al., 2006). Furthermore, these models can account for the non-linear response of the head to precipitation and evaporation by using connecting reservoirs, such as interception and root zone reservoirs, which can include short-term water retention in the soil. Another non-linear model structure is the threshold autoregressive self-exciting open-loop (TARSO) model (Knotters and Gooijer, 1999). This structure consists of two regimes (upper and lower), which are separated by a threshold. Each regime has its own exponential response function with corresponding drainage levels, but only when the head reaches the upper drainage level will the upper response function become active. Therefore, this model can be useful when the head response is different above a certain head level. This threshold value is not fixed but is estimated during the calibration process, just like other parameters.
The head time series were modelled using time series models as implemented in the Python open-source package Pastas (version 1.6.0) (Collenteur et al., 2019). Time series models were set up, assuming that the head dynamics in canal dikes are primarily influenced by rainfall and evaporation, and only these two drivers are included in the model. This assumption is supported by the observation that canal water levels fluctuated minimally (on the order of tens of centimetres) during the measurement period, while the observed average head range was more than 1 m. Additionally, the models demonstrated an overall good fit. In total, four different model structures were employed (see Table 1). The model structure with the highest averaged goodness of fit across all models was used for further analysis. Selecting a single model structure ensures consistent comparison across different locations and simplifies the interpretation of results.
3.1.3 Model calibration and selection
The time series models were used to characterize and simulate the heads of canal dikes with a single deterministic parameter set. For every head time series, time series models were set up, where the full series were used for calibration to maximize data utilization. This was done because the length of the available time series was limited and to avoid the missing information of the head response when splitting up the data for model calibration and model validation. Although validation tests of the models can indicate that the models are performing well and are adequate to achieve good quality model predictions in post-validation model application, previous studies showed that the most robust models are achieved when all data are used for calibration (Shen et al., 2022; Arsenault et al., 2018), which is in line with the goal of this study. Overfitting is mitigated by employing time series models with up to seven parameters and using head calibration data with more than 1000 data points. The model parameters were estimated using the least squares method, employing a warmup period of 10 years and without incorporating a noise model to represent the residuals. After choosing the best model structure, the calibrated models of that structure were evaluated using two criteria. These two criteria were used to determine whether a model is reliable for further analysis:
- -
Goodness of fit: the model's goodness of fit, measured by the R squared (r2), must be equal to or greater than 0.7 in the calibration period, indicating a minimum acceptable level of fit. This is also known as the coefficient of determination, which is a measure of how well observed outcomes are explained by the model. The sensitivity to threshold selection was tested using threshold values of 0.6 and 0.8. Although the number of reliable models changed, both thresholds resulted in models with similar peak head responses within comparable limits. Therefore, our findings are robust to the exact choice of threshold.
- -
Response time: the 95 % response time, the time it takes for 95 % of the influence of an impulse (groundwater recharge) to dissipate, must not exceed the length of the measurement series. Time series should be long enough to cover the head response in order to estimate parameters accurately (Knotters and van Walsum, 1997). This criterion eliminates models for which the time series data are not long enough considering the estimated model parameters.
When there are multiple reliable models of head time series available at one monitoring location, the model that provided the best fit was selected as the representative model for that location.
3.2 Peak selection and extreme value analysis
Peaks in hydraulic heads often occur in groups over time: an extremely high hydraulic head is likely to be followed by another since the groundwater system within dikes contains autocorrelation or memory. For extreme value analyses, we are interested in independent peaks to avoid biases and underestimation of the variability of extremes. Therefore, peaks were filtered out of the time series such that the peaks were mutually independent from each other in time by using the peaks-over-threshold (POT) method in combination with a time window. The POT method was applied with a threshold set at the 90th percentile of the analysed head series, and a time window of 30 d was chosen to guarantee the selection of independent peaks. Afterwards, only the n highest peaks were selected, where n corresponds to the length of the time series in years, ensuring that only the most extreme values were included.
Next, a generalized Pareto distribution (GPD) was fitted to the peaks that describes the annual probability of occurrence of peak values. The GPD has three main forms, Type I, Type II and Type III distributions, which differ in the number of parameters and the flexibility of the tail behaviour. The cumulative distribution functions of the GPD are defined by
where x is the hydraulic head, and the three parameters of the GPD are called the scale (σ), shape (ξ) and location (μ) parameters. When ξ = 0, the GPD is equivalent to the exponential distribution. All generalized Pareto distributions were explored in a sensitivity analysis. In the case of peak hydraulic heads, the exponential distribution was preferred due to its relative simplicity and its suitability for the process, as suggested by visual inspections of the tail of the distribution and the peak values. One key characteristic of the extreme value distribution is the decimate height. It is defined as the increase in head level that occurs when the return period increases by a factor of 10 and is a relevant characteristic in load and dike safety analyses (Wojciechowska, 2015; Schweckendiek, 2014).
The effects of climate change were quantified by estimating the extreme value distribution, or head statistics, of simulated head time series for different time horizons and climate scenarios (including emission scenario and regional response). To compare future scenarios with the current climate, the increases or decreases in head levels at a certain return period were compared, and the so-called probability factor was used. This metric expresses how the frequency of a head level with a 100-year return period is expected to change under climate scenarios. A probability factor of five implies that a head level that occurs, on average, once every 100 years in the current climate occurs once every 20 years under future climate conditions.
3.3 Clustering head responses and relating to physical dike characteristics
The relationships between the head responses and several physical dike characteristics were examined. These characteristics include the subsurface material of the dike body, the head difference (water level difference between the target levels of the canal and polder), the dike slope and the equivalent drainage length (see Fig. 3). Dike slopes can be seen as a measure of the hydraulic gradient within the dike body, which influences the horizontal groundwater flow and the head response. In addition, steep dike slopes can reduce recharge, as they can increase surface runoff that limits the possibility for water to infiltrate into the dike. The slope of a canal dike was not always obvious, since the dikes do not always have a typical geometry, where the profile is irregular and the crest, the berm and the toe of the dike are not clearly recognizable. The dike slope was obtained from an elevation map of the Netherlands with a horizontal resolution of 0.5 m (Actueel Hoogtebestand Nederland, 2022) and was calculated by referencing two percentile points on the elevation profile (10th and 90th percentile), as an approximation of the slope between the crest and the toe of the dike. The target levels of the canal and polder were obtained from the local water authorities, which were used to calculate the head difference. The equivalent drainage length was determined by dividing the head difference by the estimated dike slope, providing an estimate of the horizontal distance over which water is effectively drained from the dike. The subsurface of the dike was based on borehole descriptions or cone penetration tests (CPTs) at the monitoring sites. In the absence of soil investigation, a detailed three-dimensional model (GeoTOP) of the upper 30 m of the subsurface of the Netherlands was used (Stafleu et al., 2012).
3.3.1 Clustering head responses
Differences in peak head behaviours were examined by analysing the coincidence of selected head peaks across all dikes in the 30-year simulated head time series. These simulations were based on 30 years of rainfall and evaporation at the same KNMI station to isolate the effect of differences in peak heads from the spatial variability of weather events. If peak heads at two different dikes occurred on the same day, they were assumed to coincide. By calculating the percentage of coinciding peak levels for each dike pair, a coincidence matrix was formed. This matrix provided a quantitative measure of how often peak heads align across different dikes, indicating their response to similar weather events. Based on this matrix, dike clusters were identified. These clusters consist of dikes where peak heads were driven by similar weather events, while dikes in different clusters experienced peak heads caused by distinct weather events. The clusters are estimated using the k-means clustering algorithm (Hartigan and Wong, 1979), where the number of clusters (k) has to be given beforehand and is based on the mean Silhouette score of all samples (Rousseeuw, 1987) and the “elbow method”, as implemented by Yellowbrick (Bengfort and Bilbro, 2019).
3.3.2 Statistical tests
The relationships between these physical dike characteristics and characteristics of the impulse response functions, as well as clusters of dikes, were examined. Various statistical tests were employed to assess these relationships by calculating the p value for different types of variables, both categorical and continuous: the Wald test for comparing two continuous variables, the Wald chi-squared test used for two categorical variables, and the Kruskal–Wallis test for one continuous and one categorical variable. The analyses of relationships with subsurface materials were limited to clayey and peaty dikes, as the dataset includes only one sand dike, and Dutch canal dikes are generally composed of these materials.
4.1 Modelled head responses in canal dikes
4.1.1 Selecting reliable time series models
For each of the 108 head time series across 48 monitoring sites, time series models with various model structures were created, calibrated and evaluated. To illustrate the performance of different model structures, Fig. 5 gives an example for the monitoring site at Molenlaan (site DL4). The linear models (exponential and gamma response functions) are not able to model the head response for the full range of head levels, as can be seen in the scatterplot. For this location, it appeared that the head response was non-linear. Both the TARSO and Flex models provide a better fit across the entire range of head levels. Notably, the TARSO model shows improved performance in capturing head levels at the most extreme ends of the range.

Figure 5The performance of different model structures at Molenlaan (DL4). Panel (a) shows the simulated and observed head levels, and (b–e) show four scatterplots of the model structures showing the observed and simulated daily head levels (the black line indicates the 1:1 line).
The TARSO model demonstrated the best performance among all the calibrated models. It has the highest average goodness of fit (an average r2 of 0.74) and performed as the best structure for 81 % of the models. The second-best model was the linear recharge model with the gamma response function (an average r2 of 0.68), while the Flex model structure performed, on average, the worst with an average r2 of 0.63. However, the Flex model still performed best for 12 % of the models, indicating that in some cases a more detailed non-linear representation of recharge processes can be beneficial. Overall, the head dynamics in Dutch canal dikes can be best modelled with a non-linear structure that incorporates two regimes, which are separated by a threshold. This non-linear behaviour can be the result of various soil layers in the dike body, each with distinct hydraulic properties, and changes in infiltration rates or non-constant storage capacities of the unsaturated zone during the dry season (Knotters and de Gooijer, 1999). Next, the calibrated TARSO models were evaluated using the reliability criteria (goodness of fit and response time), and for every monitoring site, the model that met the reliability criteria and had the highest goodness-of-fit score was selected. At 38 out of 48 monitoring sites (79 %), models were developed with r2 of 0.7 or higher. The other reliability criterion, that the 95 % response time should not exceed the length of the measurement series, reduced the number of sites with reliable models to 35 (73 %). In Fig. A1, detailed information is provided on the best-performing TARSO models at each monitoring site, including the r2 values and corresponding response times. In addition, plots of the measured and simulated heads during the measuring period for all selected monitoring sites are shown in Fig. A2. This set of models was used for further analysis.
The model results of one of the selected time series models are shown in Fig. 6. The simulated heads closely match the observations with an r2 of 0.84, indicating a close one-to-one relationship between simulations and observations. The model overestimates the heads in the summer of 2019, which was particularly dry in terms of head levels. These differences may be due to inaccurate precipitation data used in the model (as summer precipitation can be highly localized) or disturbances during installation, affecting the head levels in 2019. The other dry summer in 2022 was captured more accurately. The calibrated block response functions for both the upper and lower regimes are shown in the lower right graph in Fig. 6. These functions are dimensionless and represent the unit head response to a 1 d recharge event of 1 mm. The actual head change is obtained by scaling this response with the recharge input. Two key characteristics of these functions are (1) the peak of the block response and (2) the 95 % response time. The peak of the block response represents the maximum increase in head level that would occur. The 95 % response time, further on referred to as the response time, is a measure of the memory of the groundwater system and, in this case, represents the time it takes for 95 % of the influence of an impulse (groundwater recharge) to dissipate. In the TARSO models, the upper and lower regimes are separated by a fitted threshold parameter, with head responses differing in each regime. For this model, the head response in the upper regime reacts more strongly to recharge events (higher peak block response) and dissipates more quickly (lower response time) than the lower regime. Whether this behaviour is consistent across all dikes is examined in the next subsection.

Figure 6Illustration of the simulated head levels with the time series model in comparison with the observations for dike AGV10. Panel (a) shows the observations and model series where also the upper and lower regime are indicated with the black dotted line, and panel (b) shows the scatterplot of observed and simulated heads. Panel (c) shows the impulse response functions of the upper and lower regimes, including the parameters of the peak of the block response (A) and the response time (t95) in the legend.
4.1.2 Characteristics of impulse response functions
The head dynamics of various dikes were quantified by examining the two key characteristics of the impulse response functions, namely the peaks of the block responses and the response times. Figure 7 shows these characteristics for both the upper and lower regime of the TARSO model, where the colours indicate in which water authority region the dikes are located. The colours initially appear random, indicating no clear spatial pattern in the variation of head responses across regions, likely due to the heterogeneous subsoil conditions and other dike characteristics within the canal dike system. The response times of the upper regimes of the dikes are generally short, mostly ranging from about 2 to 50 d, with two exceptions where response times exceed 250 d. These two exceptions have a very small peak of the block response compared to other dikes, which have a large variation in the peak response of the upper regime, with values reaching up to nearly 15. This variation may result from different soil storage capacities and the redistribution of infiltrated water within the dike, which can accumulate in the talud zone causing large head responses. The response times of the lower regime show more variation than those of the upper regime, with most dikes ranging between 100 and 600 d. Meanwhile, the peak block response for the majority is below 5.
Two key patterns of non-linearity appear in nearly all locations when analysing the response functions. First, the response time is longer for the lower regime than for the upper regime (34 out of 35 sites). Second, the peak of the block response is higher in the upper regime than in the lower regime (32 out of 35 sites). These observed differences may be explained by underlying physical processes. For example, longer response times in the lower regime can be caused by head gradients (as a driver of groundwater flow) that depend on the head level itself, the presence of various soil layers with different permeabilities, or the fact that head levels closer to the surface increase the degree of water saturation which affects the hydraulic conductivity and response times in a non-linear way. The lower peak of the block response for the lower regime can be caused by non-linear processes in the unsaturated zone, where lower head levels generally allow more water storage, and root water uptake further increases storage capacity (Berendrecht et al., 2006). In contrast, when head levels are higher, capillary action draws water upward from the saturated zone, increasing moisture in the unsaturated zone and reducing the amount of air-filled pore space. This limits the potential for additional water storage.

Figure 7The characteristics of the impulse response functions at 33 canal dikes (peak of the block response and 95 % response time) for the upper and lower regimes, where the colours indicate the various water authority regions where the dikes are located. The inset for the upper regime shows a zoomed-in view.
4.2 Variation in peak head responses
4.2.1 Coincidence of head peaks and dike clusters
The coincidence matrix, shown in the left graph in Fig. 8, describes how often peak heads occurred on the same day across different dikes in the 30-year simulated head time series. This illustrates whether peak heads tend to result from the same or different weather events. Based on this matrix, the elbow method identified an optimal number of clusters at k = 4, while the Silhouette score indicates that either k = 2 or k = 4 could be optimal, with scores of 0.442 and 0.439, respectively. Therefore, the number of clusters that are identified using the k-means clustering algorithm was set at four. The resulting clusters of dikes are called clusters A, B, C and D and are indicated within the coincidence matrix; see Fig. 8. While the three clusters A, B and C are less distinct from each other with still moderately high percentages of coincident peaks, sometimes exceeding 50 %, cluster D has a very distinct peak behaviour. This cluster consists of only two dikes of which the time series models have deviant impulse response functions, longer response times and smaller peak block responses in the upper regime compared to the other clusters (upper right graph in Fig. 8). In general, the distinctive peak behaviour between clusters is strongly influenced by the response time of the upper regime, with average response times of 16, 32 and 77 d for clusters A, B and C, respectively. This twofold increase in response times for each cluster results in distinct rainfall events leading to peak heads. These longer response times cause peak heads to be driven by more prolonged rainfall events, resulting in peaks that typically occur later in the winter. More details on the differences in seasonality of peak heads can be found in Appendix A2, including analyses on the average timing and distribution of peak heads throughout the year.

Figure 8The coincidence matrix of the head peaks at the canal dikes, including four identified clusters (a). Within every cluster, the locations are ranked based on the 95 % response time. The characteristics of the impulse response functions of the dikes within every cluster are shown in panels (b) and (c).
The dike clusters do not exhibit a clear spatial pattern, as shown in Fig. A4. In some regions, such as within the water authorities AGV and HDSR, dikes appear to be mainly in clusters A and B, respectively, while in other regions no clear spatial pattern is observed. Although canal dikes within the same polder may have similar dike characteristics, there can still be large spatial variabilities of those characteristics across a region (see Figs. A6 to A9). Moreover, even dikes that initially appear to have similar characteristics can exhibit different head responses, which are examined in more detail in the following paragraph.
4.2.2 Clusters in relation to physical dike characteristics
Can differences in peak head responses be explained by physical dike characteristics? Table 2 shows the p values for the relationships between physical dike characteristics and both the clusters and the characteristics of the impulse response functions (upper and lower regimes), indicating that most relationships were not statistically significant (p values > 0.05). However, the subsurface material of the dike appears to play an important role as a distinguishing characteristic in the dike clusters, with a p value of 0.03. Statistical tests used to calculate the p values can be inappropriate due to violations of test assumptions, small sample sizes, multiple comparisons and data dependency, all of which can lead to misleading statistical conclusions (Greenland et al., 2016). Therefore, the physical dike characteristics within every cluster are also visually analysed, and, as expected from the statistical tests, there is a large variation of dike characteristics within every cluster; see Fig. 9. The clayey and peaty dikes appear in all dike clusters, with most clayey dikes in cluster B (see also Fig. A10). Furthermore, small dikes, with drainage lengths less than 20 m and often associated with steep slopes, are found only in clusters A and B, which are clusters with the smallest response times. Furthermore, the median equivalent drainage length of the clusters increases from clusters A to D. This indicates the importance of dike geometry, where shorter distances from a drain (the canal or ditch) lead to faster dike drainage and smaller response times. This can be explained by the fact that the hydraulic gradient, which drives water towards the drain, increases with shorter distances. Yet, determining the cluster to which dikes with certain dike characteristics belong is not straightforward, since all clusters include dikes with various characteristics.
Table 2The p values of the relationships between the considered physical dike characteristics (rows) and the clusters of dikes, as well as for the 95 % response time and peak block response for both the upper and lower regimes (columns).

4.3 Statistics of head peaks
4.3.1 Head statistics and decimate height
The selected hydraulic head peaks in the 30-year simulated head time series are used to estimate the extreme value distribution of peak head levels by fitting an exponential distribution. The statistics of hydraulic head levels at a dike along the Beemster Polder (HHNK3) are used as an illustration; see Fig. 10. The decimate height at HHNK3 is approximately 7 cm, while across various dikes the values range from around 5 to 50 cm with a median decimate height of 15 cm (as seen in the right graph in Fig. 10). Lower decimate heights are found at dikes with smaller peak block responses in the upper regime, in combination with shorter response times. Since these characteristics of impulse response functions do not exhibit a clear relationship with dike characteristics (see Table 2) or a distinct spatial pattern, the decimate height also does not follow a spatial pattern, as shown in Fig. A11.
4.3.2 Impact of climate change
For one location, the resulting frequency lines of various climate scenarios in the year 2100 are shown in Fig. 11 (left graph). For this location, all climate scenarios result in more extreme head levels or head levels to occur more frequently. In the Hw scenario (high emissions and wet regional climate response), which has the largest increase in winter precipitation according to Van Dorland et al. (2023), the head level at a return period of 100 years increases by 10 cm. Due to the small decimate height, the original head level, occurring once every 1000 years, is expected to happen once every 15 years, indicating a sixfold increase in frequency or a probability factor of six. The probability factors across all dikes, shown in the right graph in Fig. 11, range from about three times less frequent to seven times more frequent across all climate scenarios in 2100. This variation could not be directly linked to the clusters of dikes; however, it was found that dikes with longer response times in the lower regime appear to be less impacted by climate change. This can be explained by the fact that these dikes dry out more during drier summers. As a result, a more dried-out dike allows for greater water storage when rainfall returns, causing head peaks to occur less frequently. However, the impact of drier summers on the occurrence of extreme head levels is counterbalanced by wetter winters, which both occur in all climate scenarios. Therefore, the characteristics of both the head response to changing winter precipitation and summer evaporation determine the overall impact of climate change on extreme heads. Under the low-emission scenarios, changes in the frequency of extreme head levels remain small in both 2050 and 2100. This is due to relatively moderate increases in winter precipitation and summer evaporation, which appear to balance each other. Under high-emission scenarios, the impact of climate change in 2050 is, on average, moderate. However, at some dikes, under the dry regional response, the frequency of extreme heads reduced by more than a factor of three. By 2100, however, high-emission scenarios indicate an increase in the frequency of extreme head levels, caused by wetter winters, with the most significant impact observed in the wetting regional response.

Figure 11(a) The frequency lines of head levels for various climate scenarios in 2100 (High/Low emissions; dry/wet regional climate response) at the dike in a polder south of Amsterdam. (b) The probability factor of all dikes (•) and the median value (▪) for different time horizons and climate scenarios.
5.1 Limitations and recommendations
Given the extensive canal dike system with thousands of kilometres of canal dikes and the heterogeneity of dike bodies, the number of available observations, both in terms of locations and measurement duration, is limited. As a result, it remains uncertain to what extent this set of 35 reliable models used in this study is representative of the head responses in Dutch canal dikes. In addition to the limited number of observations, two factors further affect representativity. First, the chosen TARSO model structure influences which head responses are included in the dataset. The reliability criteria applied in this study filtered out observations and associated head responses that could not be modelled adequately, meaning that certain head responses that did not fit the selected models were also excluded. As a consequence, the selected models may not fully capture all relevant processes influencing the head response of canal dikes. For instance, changes in hydraulic conductivity and additional non-linear processes, such as surface runoff from excessive rainfall, are not explicitly accounted for. This limitation also affects the reliability of hydraulic head estimates beyond the measured range of head variations, making them more uncertain. The model uncertainties associated with the time series models were not considered in this study. Furthermore, hypotheses and possible explanations of why the TARSO model fits the head response of canal dikes best require further research. Field measurements and 2D numerical modelling could help improve the understanding of water flow and validate the suitability of TARSO models. Second, it was assumed that each dike has a single head response, as only the best-fitting model within the talud zone was selected to represent that location. However, variations in head responses can exist within the talud zone of a single dike. At 14 monitoring sites, multiple reliable models were available within the talud zone, allowing for an assessment of this variability. At 10 of these sites, the response times of the upper regime differed by only 10 d, indicating similar peak behaviour. Despite the uncertainties surrounding the representativity of this model set, this study demonstrates the value of observations and makes the data publicly available. We hope this will encourage further long-term measurement campaigns to extend the available data and improve the understanding of head responses in canal dikes.
In this study, the k-means clustering algorithm was used to cluster dikes based on the coincidence of peak heads. However, two key factors influence clustering outcomes. First, the choice of clustering method affects the results. Various clustering methods exist, and different methods may lead to different clustering results (Everitt et al., 2011). Second, the input data used for clustering determines the outcome. In this study, clustering was based on the percentages of coinciding peak heads across dikes, as this parameter best aligns with our objective of evaluating peak head variability in canal dikes. However, for different purposes, alternative parameters or datasets can be chosen to cluster dikes. While clustering provides a practical way to manage variability, in reality there is no clear distinction or a clear cut between different dikes in terms of head responses and peak behaviour. Instead, it is a gradual shift across a spectrum of head responses.
Next, the variation in head responses and the found clusters were linked to dike characteristics. These dike characteristics themselves were difficult to determine clearly, leading to uncertainty. For instance, the profile of the canal dike can be very irregular and in many cases there is no singular, defined slope. Additionally, canal dikes are typically made up of multiple soil types, making it difficult to classify a peaty or clayey dike. With all these uncertainties, it is challenging to define clear characteristics for dikes and identify patterns, if not impossible. The uncertainty of the subsurface material of dikes is especially large for those based on the subsurface model GeoTOP rather than borehole descriptions or CPTs. To evaluate the impact of GeoTOP data on the findings, a sensitivity analysis was conducted by testing the effect of excluding this data. While the p values changed slightly, the overall results remained consistent, with the p value for the relationship between subsurface material and clusters increasing to 0.05, highlighting the importance of subsurface material in the analysis. Furthermore, we examined the relationship between the head response and various dike characteristics, looking at each characteristic separately. It is possible that considering the combination of various characteristics might reveal a clearer pattern. However, it is important to keep in mind that even if there is a relationship between head responses in canal dikes and dike characteristics, it is challenging to determine the local dike characteristics of each individual dike stretch because of the heterogeneous nature of the dike system. This heterogeneity exists in both longitudinal and cross-sectional directions. As a result, it can be expected that the head responses of the canal dikes have large spatial variations and can even differ for dikes that are close to each other.
Lastly, there are limitations to how the impacts of climate change on peak heads were modelled. This study considered the impact of climate change effects only in terms of changes in precipitation and potential evaporation, assuming that the head response itself remains unchanged. However, head responses can change over time due to shifts in hydraulic conductivity and water retention capacity of soils. These changes may result from cyclic wetting and drying of soils, leading to swelling, shrinkage, soil consolidation and alterations in dike structures over time (Stirling et al., 2021; Azizi et al., 2020). As climate change is expected to intensify dry–wet cycles, these processes may become more pronounced, potentially affecting the stability of dikes. Additionally, dike resistance can be further influenced by factors such as shear-strength reduction, soil compaction and peat decomposition. Quantifying all the effects of future climate scenarios is challenging, as both the hydraulic and mechanical behaviours of soils are intertwined and impacted. Translating these effects into slope instability is even more complex, which is beyond the scope of this study. To better understand and quantify changes in head responses within dikes, continuous long-term monitoring, preferably exceeding 10 years, is essential.
5.2 Implications for dike safety
This study quantified variations in peak head responses of canal dikes, which are relevant for estimating regional or national flood risk levels in polders. Consider an imaginary canal dike ring along a small polder where each individual dike section is assumed to have a failure probability of per year, with fully spatially correlated load and strength characteristics. The weather conditions across this polder are uniform, with equal precipitation and evaporation everywhere. Under these conditions, the probability of flooding in the polder equals the highest failure probability of the individual dike sections. In contrast, suppose there were four different types of dikes, each with its own peak head responses, resulting in four statistically independent load conditions. If their strength characteristics remain fully spatially correlated, the flood probability in the polder increases to approximately per year, which is a factor of four higher than with fully spatial correlated load characteristics. In general, this can be calculated by Pf,sys = , where Pf,sys is the failure probability of the dike ring (or flood probability of the polder), Pf,sect is the failure probability of the individual dike sections and n is the number of statistically independent dike sections. The spatial correlations in loads and strength are crucial for accurately estimating flood risk levels in regions, and this study provides valuable insights into the variation of loadings in dikes by considering the different head responses that affect these estimates. Besides variations in head responses, regional risk levels are also affected by natural spatial variability in weather events. This spatial variability can increase the number of statistically independent load events, depending on the scale considered.
Gariano and Guzzetti (2016) reviewed the literature about the impact of climate change on landslides, both in natural and engineered slopes, and concluded that the risk of shallow landslides can increase (triggered by short and intense rainfall events), while the risk of deep-seated landslides may decrease or show no significant change (related to long rainfall periods). This is primarily due to changing meteorological conditions that lead to higher head levels, reducing the shear strength, soil suction and cohesion and increasing the weight (wet density) of slope materials, all of which contribute to increasing slope instability. Deep-seated landslides appear to decrease or show no significant change, because these types of landslides depend on monthly and/or seasonal rainfall amounts. These more prolonged rainfall events are expected to decrease in regions like the Alps (Rianna et al., 2014; Gariano and Guzzetti, 2016). Canal dikes show similar behaviour to shallow landslides, with, on average, an increased risk of instability in the future. However, the impact of climate change on canal dikes can still vary considerably, with extreme head levels projected to occur either more or less frequently depending on the specific head response. While this study could not identify definitive explanations for these differences, the results suggest that both the head response to changing winter precipitation and summer evaporation play a role in the overall impact of climate change. These insights are relevant for assessing dike safety over time and enhancing adequate dike design.
Another interesting finding for dike safety assessments is the small decimate height in the canal dike head statistics. For dikes with a decimate height of only 5 cm, yearly occurring head levels (T = 1 year) are only 15 cm lower than extreme head levels that occur on average once every 1000 years. Measurements in these dikes are particularly valuable for improving safety assessments, as observed heads are closer to extreme levels, allowing for more accurate extrapolation with fewer uncertainties in modelling extreme levels (Wojciechowska, 2015). Furthermore, short-term measurements can already capture relatively high observed loads, providing valuable insights into the actual dike strength through reliability-updating techniques (Schweckendiek, 2014). Nevertheless, for dikes that are marginally stable, this small increase in head level can still be a trigger for dike failure.
This study aimed to assess the dynamics of peak heads in Dutch canal dikes at a national level by analysing variations in head responses, head statistics and the impact of climate change. This was done using time series models calibrated on a unique dataset of head observations in dike systems, consisting of 108 head time series across 48 monitoring sites. Various model structures were evaluated, and it was found that the non-linear TARSO model outperformed the other (non-)linear models. This model consists of two regimes (upper and lower), separated by a threshold, each with its own exponential response function and drainage levels. This threshold non-linear behaviour in canal dikes can be attributed to several factors. Groundwater flow is driven by head gradients and hydraulic conductivities, which both can vary vertically and depend on the head level itself, while rising head levels near the surface non-linearly affect hydraulic conductivity and water storage in the unsaturated zone. The TARSO model was selected to calibrate time series models for all head time series, which were then evaluated using two reliability criteria (goodness of fit and response time), resulting in a set of 35 reliable models.
Differences in peak head behaviour between various canal dikes were examined by analysing the coincidence of head peaks across all dikes in 30-year simulated head time series. Four clusters of dikes were identified, consisting of dikes where peak heads were driven by similar weather events, while dikes in different clusters experienced peak heads caused by distinct weather events. The differentiating factor was the response times of the upper regime of these dikes, where longer response times caused peak heads to be driven by more prolonged rainfall events. The identified dike clusters do not exhibit a clear spatial pattern. The reasons are the large spatial variability of dike characteristics and the fact that even dikes with similar characteristics can exhibit different head responses. While the subsurface material and dike width appeared to be important factors influencing the variations in head responses, their presence in multiple clusters indicates that these characteristics alone do not definitively determine the head response.
Next, peak head statistics were derived across the canal dikes, revealing that the median decimate height is only 15 cm, ranging between 5 and 50 cm. This indicates that yearly occurring head levels are, on average, relatively close to extreme events. Dikes with lower decimate heights were associated with smaller peak block responses and shorter response times in the upper regime. Since these characteristics of impulse response functions do not exhibit a clear relationship with dike characteristics or a distinct spatial pattern, decimate heights also do not follow a spatial pattern. With climate change driving higher winter precipitation and summer evaporation, head statistics are changing. Results showed that head levels with a return period of 100 years are expected to occur about three times less frequently to seven times more frequently by 2100, depending on the climate scenario and the type of canal dike. Dryer summers can reduce the frequency of extreme peak heads by lowering head levels during summer, which increases dike water storage capacity when rainfall returns. However, most climate scenarios project a higher frequency of extreme head levels by 2100, caused by a wetter winter trend. The varying impact of climate change on dikes is largely attributed to the response times of the lower regime. Dikes with longer response times seem to be less affected by climate change, as they experience greater drying during drier summers. However, this increased drying during summer can have other negative consequences, as climate change is expected to intensify dry–wet cycles, potentially leading to soil degradation (Stirling et al., 2021; Azizi et al., 2020).
A1 Time series models
A2 Seasonality
The dynamics of the peak heads were analysed by quantifying the seasonality of the dikes, which is measured by using the average timing and temporal concentration of the selected head peaks. The method to determine the average timing of head peaks involves circular statistics, and it is extensively described in Hall and Blöschl (2018). The average timing of the head peaks is the average date on which peaks have occurred during the time series. The average timing of head peaks can be the result of a wide range of peak dates during the year; therefore, the temporal concentration of peaks occurrence within the year is considered using the concentration index. The concentration index of peak dates around the average timing serves as a measure of how well the seasonality is defined for a specific dike, with 0 indicating evenly distributed peaks during the year and 1 indicating that all peaks occur on the same date.
Seasonality varies across the dikes, but the average timing of these peaks occurs in the winter half-year, as shown in the right graph in Fig. A2. The average timing shifts further into the winter from cluster A to cluster D, with increasing temporal concentrations. This behaviour is also illustrated by the probability density distributions of the peaks of four dikes in the left graph in Fig. A2. The vertical dashed line indicates the average timing, which moves further into the winter, while also the density functions become narrower.

Figure A3(b) The average timing and temporal concentrations for the considered dikes, with colours representing the dike clusters (refer to the legend in the left graph). (a) The probability density distribution of the peaks throughout the year for four selected dikes from different clusters, with the vertical dashed line indicating the average timing. These four dikes are highlighted with a black edge around the circle in the right graph.
A3 Spatial patterns in dike clusters

Figure A4Mapping of clustered canal dikes across the Netherlands. The left panel shows the full study area, with dikes grouped into four clusters (A–D) based on their peak head response to weather events. The right panel provides a zoomed-in view of the highlighted region, offering a more detailed look at the cluster distribution.
A4 Physical dike characteristics and their relation to head responses

Figure A5The relationships between the considered physical dike characteristics are illustrated by scatterplots, with the subsurface material of the dike indicated by colour. The Pearson correlation coefficients between the variables are displayed in the bottom right corner. The diagonal plots show the univariate distributions, highlighting the marginal distribution of each variable, with distinctions made based on soil type.

Figure A6Mapping of subsurface material (peat, clay and sand) in the 35 canal dikes for which reliable models were developed across the Netherlands.

Figure A7Mapping of the equivalent drainage length in the 35 canal dikes for which reliable models were developed across the Netherlands.

Figure A8Mapping of the head differences in the 35 canal dikes for which reliable models were developed across the Netherlands.

Figure A9Mapping of the dike slopes in the 35 canal dikes for which reliable models were developed across the Netherlands.
A5 Spatial variation in decimate heights

Figure A11Spatial variation of decimate height across canal dikes in the Netherlands. The left panel shows the full study area, with different water authorities labelled. The right panel provides a zoomed-in view of the highlighted region. The colour scale represents decimate height values. No clear spatial pattern is observed across different regions.
The measurement data used to establish the model set, consisting of 108 head time series across 48 monitoring sites and the local historic rainfall and potential evaporation, are available from 4TU.ResearchData at https://doi.org/10.4121/4004f445-b71b-4996-bd7d-10b1fafbc86b (Strijker, 2024).
This dataset contains the following:
- -
An overview of the monitoring sites and piezometers, along with their geographic locations (CSV1 file and shapefile).
- -
Hydraulic head time series from the piezometers (CSV file)
- -
Time series of local precipitation and potential evaporation (CSV file).
Furthermore, the script, output data (models and figures) and other relevant data (e.g. dike characteristics and meteorological time-series data representing future climate scenarios) are shared to enhance the reproducibility of this study.
A readme file is added that describes the files in the dataset. The data source has a CC0 licence, which entails the waiver of all copyright and related rights, enabling unrestricted use of the data for any purpose. Authors appreciate being informed when using the data by contacting the corresponding author.
BS: writing (review and editing and original draft), visualization, methodology, formal analysis, data curation, conceptualization. MK: writing (review and editing), supervision, resources, project administration, methodology, conceptualization.
The contact author has declared that neither of the authors has any competing interests.
Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.
This research was supported by the STOWA and Rijkswaterstaat. Furthermore, the authors express their gratitude to all the water authorities who provided the necessary data: Hoogheemraadschap Schieland & de Krimpenerwaard (HHSK), Hoogheemraadschap Delfland (DL), Hoogheemraadschap Rijnland (RL), Hoogheemraadschap Hollands Noorderkwartier (HHNK), Weterksip Fryslân (WF), Waterschap Amstel, Gooi and Vecht (AGV) and Hoogheemraadschap De Stichtse Rijnlanden (HDSR).
This research has been financially supported by the Stichting Toegepast Onderzoek Waterbeheer.
This paper was edited by Paolo Tarolli and reviewed by Hans Middelkoop and two anonymous referees.
Actueel Hoogtebestand Nederland: http://www.ahn.nl (last access: 22 November 2023), 2022.
Arsenault, R., Brissette, F., and Martel, J. -L.: The hazards of split-sample validation in hydrological model calibration, Journal of Hydrology, 566, 346–362, https://doi.org/10.1016/j.jhydrol.2018.09.027, 2018.
Azizi, A., Musso, G., and Jommi, C.: Effects of repeated hydraulic loads on microstructure and hydraulic behaviour of a compacted clayey silt, Can. Geotech. J., 57, 100–114, https://doi.org/10.1139/cgj-2018-0505, 2020.
Bakker, M. and Schaars, F.: Solving groundwater flow problems with time series analysis: you may not even need another model, Groundwater, 57, 826–833, https://doi.org/10.1111/gwat.12927, 2019.
Bengfort, B. and Bilbro, R.: Yellowbrick: Visualizing the Scikit-Learn Model Selection Process, Journal of Open Source Software, 4, 1075, https://doi.org/10.21105/joss.01075, 2019.
Berendrecht, W. L., Heemink, A. W., Van Geer, F. C., and Gehrels, J. C.: A non-linear state space approach to model groundwater fluctuations, Adv. Water Resour., 29, 959–973, https://doi.org/10.1016/j.advwatres.2005.08.009, 2006.
Box, G. E. P. and Jenkins, G. M.: Time Series Analysis: Forecasting and Control, Holden-Day, ISBN 0816210942, 1970.
Collenteur, R. A., Bakker, M., Caljé, R., Klop, S. A., and Schaars, F.: Pastas: open source software for the analysis of groundwater time series, Groundwater, 57, 877–885, https://doi.org/10.1111/gwat.12925, 2019.
Collenteur, R. A., Bakker, M., Klammler, G., and Birk, S.: Estimation of groundwater recharge from groundwater levels using nonlinear transfer function noise models and comparison to lysimeter data, Hydrol. Earth Syst. Sci., 25, 2931–2949, https://doi.org/10.5194/hess-25-2931-2021, 2021.
De Bruin, H. A. R.: From penman to makkink, in: Evaporation and Weather, Technical Meeting 44, 5–31, 1987.
Everitt, B. S., Landau, S., Leese, M., and Stahl, D.: Cluster Analysis, Wiley Online Library, https://doi.org/10.1002/9780470977811, 2011.
Frank, R., Bauduin, C., and Driscoll, R. M.: Designers' Guide to Eurocode 7: Geotechnical Design: Designers' Guide to EN 1997-1, Eurocode, 7: Geotechnical Design-General Rules, Thomas Telford, ISBN 978-0-7277-3154-8, 2004.
Gariano, S. L. and Guzzetti, F.: Landslides in a changing climate, Earth-Sci. Rev., 162, 227–252, https://doi.org/10.1016/j.earscirev.2016.08.011, 2016.
Gildeh, H. K., Hosseini, P., Zhang, H., Riaz, M., and Acharya, M.: Canal embankment failure mechanism, breach parameters and outflow predictions, in: Sustainable and Safe Dams Around the World/Un monde de barrages durables et sécuritaires, CRC Press, 61–74, https://doi.org/10.1201/9780429319778-6, 2019.
Greenland, S., Senn, S. J., Rothman, K. J., Carlin, J. B., Poole, C., Goodman, S. N., and Altman, D. G.: Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations, Eur. J. Epidemiol., 31, 337–350, https://doi.org/10.1007/s10654-016-0149-3, 2016.
Hall, J. and Blöschl, G.: Spatial patterns and characteristics of flood seasonality in Europe, Hydrol. Earth Syst. Sci., 22, 3883–3901, https://doi.org/10.5194/hess-22-3883-2018, 2018.
Hartigan, J. A. and Wong, M. A.: Algorithm AS 136: A k-means clustering algorithm, J. Roy. Stat. Soc. C-App., 28, 100–108, https://doi.org/10.2307/2346830, 1979.
Huang, W., Loveridge, F. A., Briggs, K. M., Smethurst, J. A., Saffari, N., and Thomson, F.: Forecast climate change impact on porewater pressure regimes for the design and assessment of clay earthworks, Q. J. Eng. Geol. Hydroge., 57, qjegh2023-015, https://doi.org/10.1144/qjegh2023-015, 2024.
Jamalinia, E., Vardon, P. J., and Steele-Dunne, S. C.: The impact of evaporation induced cracks and precipitation on temporal slope stability, Comput. Geotech., 122, 103506, https://doi.org/10.1016/j.compgeo.2020.103506, 2020.
Jongejan, R. B., Diermanse, F., Kanning, W., and Bottema, M.: Reliability-based partial factors for flood defenses, Reliab. Eng. Syst. Safe., 193, 106589, https://doi.org/10.1016/j.ress.2019.106589, 2020.
Kanning, W.: The Weakest Link - Length Effects for Piping, PhD thesis, Delft University of Technology, Delft, The Netherlands, https://doi.org/10.4233/uuid:5fb7b121-dc00-48aa-bda2-b163f10513bf, 2012.
Knotters, M. and De Gooijer, J. G.: TARSO modeling of water table depths, Water Resour. Res., 35, 695–705, https://doi.org/10.1029/1998WR900049, 1999.
Knotters, M. and Van Walsum, P. E. V.: Estimating fluctuation quantities from time series of water-table depths using models with a stochastic component, J. Hydrology, 197, 25–46, https://doi.org/10.1016/s0022-1694(96)03278-7, 1997.
Lendering, K., Schweckendiek, T., and Kok, M.: Quantifying the failure probability of a canal levee, Georisk: Assessment and Management of Risk for Engineered Systems and Geohazards, 12, 203–217, https://doi.org/10.1080/17499518.2018.1426865, 2018.
Manh, N. V., Merz, B., and Apel, H.: Sedimentation monitoring including uncertainty analysis in complex floodplains: a case study in the Mekong Delta, Hydrol. Earth Syst. Sci., 17, 3039–3057, https://doi.org/10.5194/hess-17-3039-2013, 2013.
Martín-Antón,, M., Negro, V., del Campo, J. M., López-Gutiérrez, J. S., and Esteban, M. D.: Review of coastal land reclamation situation in the world, J. Coastal Res., 75, 667–671, https://doi.org/10.2112/si75-133.1, 2016.
McDonald, M. G. and Harbaugh, A. W.: The history of MODFLOW, Groundwater, 41, 280–283, https://doi.org/10.1111/j.1745-6584.2003.tb02591.x, 2003.
Moore, R., Carey, J. M., and McInnes, R. G.: Landslide behaviour and climate change: predictable consequences for the Ventnor Undercliff, Isle of Wight, Q. J. Eng. Geol. Hydroge., 43, 447–460, https://doi.org/10.1144/1470-9236/08-086, 2010.
Morton, L. W. and Olson, K. R.: The pulses of the Mekong River Basin: Rivers and the livelihoods of farmers and fishers, Journal of Environmental Protection, 9, 431, https://doi.org/10.4236/jep.2018.94027, 2018.
Özer, I. E., van Damme, M., and Jonkman, S. N.: Towards an international levee performance database (ILPD) and its use for macro-scale analysis of levee breaches and failures, Water, 12, 119, https://doi.org/10.3390/w12010119, 2019.
Pleijster, E. J., van der Veeken, C., Jongerius, R., and Luiten, E.: Dijken van Nederland, Nai010 uitgevers, ISBN 978-94-6208-150-5, 2015.
Rianna, G., Zollo, A., Tommasi, P., Paciucci, M., Comegna, L., and Mercogliano, P.: Evaluation of the effects of climate changes on landslide activity of Orvieto clayey slope, Proced. Earth Plan. Sc., 9, 54–63, 2014.
Ridley, A., McGinnity, B., and Vaughan, P.: Role of pore water pressures in embankment stability, P. I. Civil Eng.-Geotec., 157, 193–198, 2004.
Rikkert, S. J. H.: A system perspective on flood risk in polder drainage canal systems, PhD thesis, Delft University of Technology, https://doi.org/10.4233/uuid:61e6645a-f275-4b69-aa29-1150d3cda2b1, 2022.
Rikkert, S. J. H., Kok, M., Lendering, K., and Jongejan, R.: A pragmatic, performance-based approach to levee safety assessments, J. Flood Risk Manag., 15, e12836, https://doi.org/10.1111/jfr3.12836, 2022.
Rouainia, M., Helm, P., Davies, O., and Glendinning, S.: Deterioration of an infrastructure cutting subjected to climate change, Acta Geotech., 15, 2997–3016, 2020.
Rousseeuw, P. J.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis, J. Comput. Appl. Math., 20, 53–65, 1987.
Schweckendiek, T.: On reducing piping uncertainties: A Bayesian decision approach, PhD thesis, Delft University of Technology, https://doi.org/10.4233/uuid:f9be2f7e-7009-4c73-afe5-8b4bb16e956f, 2014.
Sharp, M., Wallis, M., Deniaud, F., Hersch-Burdick, R., Tourment, R., Matheu, E., Seda-Sanabria, Y., Wersching, S., Veylon, G., Durand, E., and others: The international levee handbook, CIRIA, ISBN 978-0-86017-734-0, 2013.
Shen, H., Tolson, B. A., and Mai, J.: Time to update the split-sample approach in hydrological model calibration, Water Resour. Res., 58, e2021WR031523, https://doi.org/10.1029/2021WR031523, 2022.
Šimůnek, J., Šejna, M., and Van Genuchten, M. T.: The HYDRUS-2D software package for simulating the two-dimensional movement of water, heat, and multiple solutes in variably-saturated media: Version 2.0, US Salinity Laboratory, Agricultural Research Service, US Department Agriculture, 1999.
Stafleu, J., Maljers, D., Busschers, F. S., Gunnink, J. L., Schokker, J., Dambrink, R. M., Hummelman, H. J., and Schijf, M. L.: GeoTop modellering, TNO report, 10991, 2012.
Steenbergen, C. M., Reh, W., Nijhuis, S., and Pouderoijen, M. T.: The Polder Atlas of The Netherlands: Pantheon of the Lowlands, Uitgeverij Thoth, ISBN 978-9-06868-519-0, 2009.
Stirling, R. A., Toll, D. G., Glendinning, S., Helm, P. R., Yildiz, A., Hughes, P. N., and Asquith, J. D.: Weather-driven deterioration processes affecting the performance of embankment slopes, Géotechnique, 71, 957–969, 2021.
Strijker, B.: Dataset in support of the dynamics of peak head responses at Dutch canal dikes and the impact of climate change, 4TU.ResearchData [data set], https://doi.org/10.4121/4004f445-b71b-4996-bd7d-10b1fafbc86b, 2024.
Tran, D. D. and Weger, J.: Barriers to implementing irrigation and drainage policies in An Giang Province, Mekong Delta, Vietnam, Irrig. Drain., 67, 81–95, 2018.
Triet, N. V. K., Dung, N. V., Fujii, H., Kummu, M., Merz, B., and Apel, H.: Has dyke development in the Vietnamese Mekong Delta shifted flood hazard downstream?, Hydrol. Earth Syst. Sci., 21, 3991–4010, https://doi.org/10.5194/hess-21-3991-2017, 2017.
Van Baars, S. and Van Kempen, I. M.: The causes and mechanisms of historical dike failures in the Netherlands, E-Water Official Publication, European Water Association, 1–14, https://resolver.tudelft.nl/uuid:a44dbb70-8116-4dec-bf9d-e5879df3f678 (last access: 20 March 2024), 2009.
Van der Krogt, M. G., Klerk, W. J., Kanning, W., Schweckendiek, T., and Kok, M.: Value of information of combinations of proof loading and pore pressure monitoring for flood defences, Struct. Infrastruct. E., 18, 505–520, 2022.
Van Dorland, R., Beersma, J., Bessembinder, J., Bloemendaal, N., Van Den Brink, H., Brotons Blanes, H., Drijfhout, S., Groenland, R., Haarsma, R., Homan, C., Keizer, I., Krikken, F., Le Bars, D., Lenderink, G., van Meijgaard, E., Meirink, J. F., Overbeek, B., Reerink, T., Selten, F., Severijns, C., Siegmund, P., Sterl, A., de Valk, C., van Velthoven, P., de Vries, H., van Weele, M., Wichers Schreur, B. and Van Der Wiel, K.: Knmi national climate scenarios 2023 for The Netherlands, Scientific report WR-23-02, Royal Netherlands Meteorological Institute (KNMI), 2023.
Van Esch, J. M.: Modeling groundwater flow through embankments for climate change impact assessment, in: Proceedings, 19th International Conference on Computational Methods in Water Resources (CMWR 2012), 17–22 June 2022, University of Illinois at Urbana-Champaign, USA, 2012.
Vanmarcke, E. H.: Reliability of earth slopes, J. Geotech. Eng.-ASCE, 103, 1247–1265, 1977.
van Woerkom, T., van Beek, R., Middelkoop, H., and Bierkens, M. F.: Global sensitivity analysis of groundwater related dike stability under extreme loading conditions, Water, 13, 3041, https://doi.org/10.3390/w13213041, 2021.
Von Asmuth, J. R., Bierkens, M. F., and Maas, K.: Transfer function-noise modeling in continuous time using predefined impulse response functions, Water Resour. Res., 38, 1287, https://doi.org/10.1029/2001WR001136, 2002.
Von Asmuth, J. R., Maas, K., Bakker, M., and Petersen, J.: Modeling time series of ground water head fluctuations subjected to multiple stresses, Groundwater, 46, 30–40, 2008.
Vrijling, J. K. and Van Gelder, P. H. A. J. M.: Probabilistic design in hydraulic engineering, https://resolver.tudelft.nl/uuid:0ba44475-700e-436a-8a2d-7df89b85e19a (last access: 11 September 2025), 2002.
Warner, J. F., van Staveren, M. F., and van Tatenhove, J.: Cutting dikes, cutting ties? Reintroducing flood dynamics in coastal polders in Bangladesh and the netherlands, Int. J. Disast. Risk Re., 32, 106–112, 2018.
Wojciechowska, K. A.: Advances in operational flood risk management in the Netherlands, PhD thesis, Dissertation (TU Delft), Delft University of Technology, https://doi.org/10.4233/uuid:5d719beb-bbbf-4fde-8e10-526785315fd1, 2015.
Wolters, E., Hakvoort, H., Bosch, S., Versteeg, R., Bakker, M., Heijkers, J., Talsme, M., and Peerdeman, K.: Meteobase: Online neerslag-en referentiegewasver-dampingsdatabase voor het Nederlandse waterbeheer, Meteorologica, 1, 15–18, 2013.
comma separated values