<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">NHESS</journal-id><journal-title-group>
    <journal-title>Natural Hazards and Earth System Sciences</journal-title>
    <abbrev-journal-title abbrev-type="publisher">NHESS</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Nat. Hazards Earth Syst. Sci.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1684-9981</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/nhess-26-2189-2026</article-id><title-group><article-title>Considering rainfall events from a neighborhood improves local flood frequency analysis</article-title><alt-title>Counterfactuals improve local flood frequency analysis</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Voit</surname><given-names>Paul</given-names></name>
          <email>voit@uni-potsdam.de</email>
        <ext-link>https://orcid.org/0000-0003-1005-0979</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Fauer</surname><given-names>Felix</given-names></name>
          
        <ext-link>https://orcid.org/0000-0001-7638-829X</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Heistermann</surname><given-names>Maik</given-names></name>
          
        <ext-link>https://orcid.org/0000-0001-9354-1532</ext-link></contrib>
        <aff id="aff1"><label>1</label><institution>Institute for Environmental Sciences and Geography, University of Potsdam, Potsdam, Germany</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Institute for Meteorology, Freie Universität Berlin, Berlin, Germany</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Paul Voit (voit@uni-potsdam.de)</corresp></author-notes><pub-date><day>11</day><month>May</month><year>2026</year></pub-date>
      
      <volume>26</volume>
      <issue>5</issue>
      <fpage>2189</fpage><lpage>2201</lpage>
      <history>
        <date date-type="received"><day>8</day><month>October</month><year>2025</year></date>
           <date date-type="rev-request"><day>11</day><month>November</month><year>2025</year></date>
           <date date-type="rev-recd"><day>4</day><month>February</month><year>2026</year></date>
           <date date-type="accepted"><day>16</day><month>April</month><year>2026</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2026 Paul Voit et al.</copyright-statement>
        <copyright-year>2026</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://nhess.copernicus.org/articles/26/2189/2026/nhess-26-2189-2026.html">This article is available from https://nhess.copernicus.org/articles/26/2189/2026/nhess-26-2189-2026.html</self-uri><self-uri xlink:href="https://nhess.copernicus.org/articles/26/2189/2026/nhess-26-2189-2026.pdf">The full text article is available as a PDF file from https://nhess.copernicus.org/articles/26/2189/2026/nhess-26-2189-2026.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e104">Many aspects of flood risk management require flood frequency analysis (FFA) which is, however, often limited by short observational records  –  especially for flash floods in small basins. In order to address this issue, we propose to extend the underlying data by local counterfactual scenarios. To that end, heavy precipitation events (HPEs) from nearby, hydrologically similar catchments are used to simulate flood peaks which are then included in the FFA for the catchment of interest. In order to demonstrate the added value of this approach, we used 23 <inline-formula><mml:math id="M1" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">years</mml:mi></mml:mrow></mml:math></inline-formula> of radar-based precipitation and a hydrological model, fitted the Generalized Extreme Value (GEV) distribution to three different datasets  –  observed peaks, counterfactual peaks, and their combination –, and evaluated the resulting three GEV fits by means of the quantile skill score (QSS). For a sample of more than 13 000 German headwater catchments, we could show that local counterfactuals improved quantile estimation, with the level of improvement increasing with return period. The improvement declines when the radius of the transposition domain is extended beyond 30 <inline-formula><mml:math id="M2" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>. Overall, our results provide a tangible perspective to enhance traditional FFA, producing narrower confidence intervals and more robust estimates for design floods and risk assessments.</p>
  </abstract>
    
<funding-group>
<award-group id="gs1">
<funding-source>Bundesministerium für Forschung und Technologie</funding-source>
<award-id>ClimXtreme</award-id>
</award-group>
</funding-group>
</article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e132">Flood frequency analysis (FFA) addresses the probability of floods of a given magnitude and their expected recurrence. It provides the statistical basis for defining extreme events and is, since decades, fundamental for various aspects of flood risk management <xref ref-type="bibr" rid="bib1.bibx32 bib1.bibx39" id="paren.1"/>, such as the design of hydraulic infrastructure, landscape planning, flood insurance and many more.</p>
      <p id="d2e138">Typically, extremes are extracted from observations (of, e.g. discharge or precipitation accumulated over specific durations) via the block maxima or peak-over-threshold method, and a probability distribution is fitted <xref ref-type="bibr" rid="bib1.bibx28" id="paren.2"/>, most commonly the Generalized Extreme Value (GEV) or Generalized Pareto (GP) distribution. This allows extrapolation of the tail beyond observed records <xref ref-type="bibr" rid="bib1.bibx13" id="paren.3"/> and an estimation of occurrence probabilities (return periods) for unobserved events.</p>
      <p id="d2e147">However, extreme floods occur rarely (by definition), resulting in limited sample sizes for FFA. Short observational records can increase the sampling error and thus the uncertainty of distribution fitting. Consequently, estimates of occurrence probability are often highly uncertain, which can lead to a severe misrepresentation of risk. This problem is amplified in the case of flash floods: these floods are characterized by a rapid onset and carry high sediment and debris loads, which makes them a highly destructive natural hazard <xref ref-type="bibr" rid="bib1.bibx4 bib1.bibx34 bib1.bibx44 bib1.bibx14" id="paren.4"/>. The corresponding rainfall events are typically brief and intense and occur at small spatial scales. They only trigger a flash flood in case they coincide with basins that are able to convert rainfall into runoff, and rapidly propagate that runoff towards the outlet. These processes are governed by topography, geomorphology, soils, and land use. Furthermore, flash-flood-prone basins are generally small to medium sized (<inline-formula><mml:math id="M3" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">1000</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M4" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">km</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>). The local scarcity of extreme floods and the limited availability of corresponding streamflow records challenge conventional FFA for flash-flood risk management.</p>
      <p id="d2e174">Several approaches have been proposed to address data scarcity and the limitations of FFA. Their central idea is to augment the data basis, either by incorporating information from hydrologically similar sites or by forcing hydrological models with hypothetical heavy precipitation events (HPEs). Following we will give a brief overview over existing concepts.</p>
      <p id="d2e178"><list list-type="bullet">
          <list-item>

      <p id="d2e183"><italic>Regionalization</italic>: Data from hydrologically similar catchments are incorporated into the estimation of distribution parameters to enhance the robustness of extreme value analysis (EVA) <xref ref-type="bibr" rid="bib1.bibx25 bib1.bibx29 bib1.bibx43 bib1.bibx30" id="paren.5"><named-content content-type="pre">e.g.</named-content></xref>.</p>
          </list-item>
          <list-item>

      <p id="d2e196"><italic>Probable maximum precipitation</italic> (PMP): rainfall events from a “meteorological homogeneous” transposition domain are included in the analysis to increase the robustness <xref ref-type="bibr" rid="bib1.bibx23 bib1.bibx15" id="paren.6"/> and to estimate the PMP <xref ref-type="bibr" rid="bib1.bibx31 bib1.bibx55" id="paren.7"/>. Instead of exceedance probabilities this method only yields upper and lower bounds of precipitation. PMP can be used to estimate the upper bounds of a probable maximum flood (PMF), if used as forcing for a hydrological model. While PMP is widely applied in North America and Australia for designing high-risk infrastructure (e.g. dams and nuclear power plants), it is not used in Europe. However, in recent years various studies regarding flood risk management have proposed and investigated different concepts of storm transposition, referring to the idea as “spatial counterfactuals” <xref ref-type="bibr" rid="bib1.bibx41 bib1.bibx38 bib1.bibx51 bib1.bibx52 bib1.bibx46" id="paren.8"/>.</p>
          </list-item>
          <list-item>

      <p id="d2e213"><italic>Stochastic storm transposition</italic>: Building on the PMP/PMF concept, historical precipitation events (HPEs) from the transposition domain are sampled using a Poisson distribution and randomly assigned (uniform distribution) within the domain, potentially affecting the catchment of interest (CoI). For flood frequency analysis, the resulting runoff in the CoI is simulated <xref ref-type="bibr" rid="bib1.bibx56" id="paren.9"><named-content content-type="pre">e.g.</named-content></xref>. This approach allows for the calculation of occurrence probabilities. For a detailed description see <xref ref-type="bibr" rid="bib1.bibx57" id="text.10"/>. Globally, stochastic storm transposition (SST) remains rarely applied in practice <xref ref-type="bibr" rid="bib1.bibx58" id="paren.11"/> but it will form the core of the US Federal Emergency Management Agency's “Future of Flood Risk Data” initiative, aimed at remapping the nation's floodplains <xref ref-type="bibr" rid="bib1.bibx1" id="paren.12"/>.</p>
          </list-item>
          <list-item>

      <p id="d2e235"><italic>Random weather generators</italic> are statistical models that simulate sequences of weather variables, such as temperature and precipitation, by randomly generating data based on observed patterns. They can be used to generate very long time series of meteorological forcings for a hydrological model <xref ref-type="bibr" rid="bib1.bibx19 bib1.bibx3" id="paren.13"><named-content content-type="pre">e.g.</named-content></xref>.</p>
          </list-item>
        </list></p>
      <p id="d2e247">The central issue with these approaches is the plausibility of counterfactuals. Hazard assessments based on such methods are only meaningful if the counterfactuals are considered realistic. Due to the large variability in terminology with regard to the aforementioned concepts, we now clarify and define,  for the sake of consistency, the following terms and acronyms for use throughout this paper:</p>
      <p id="d2e250"><list list-type="bullet">
          <list-item>

      <p id="d2e255"><italic>HPE</italic>: Heavy precipitation event. While sometimes termed “storms”, we adopt the more precise designation HPE.</p>
          </list-item>
          <list-item>

      <p id="d2e263"><italic>TD</italic>: Transposition domain; a region that is assumed to be meteorologically homogeneous (with respect to the features of heavy rainfall). The basic idea is, that an HPE observed within the TD, could have also happened at any other location within the TD.</p>
          </list-item>
          <list-item>

      <p id="d2e271"><italic>storm transposition</italic>: Spatial transposition (relocation) of an observed HPE within a TD.</p>
          </list-item>
          <list-item>

      <p id="d2e279"><italic>CoI</italic>: Catchment of interest, the catchment that is the subject of a FFA.</p>
          </list-item>
          <list-item>

      <p id="d2e287"><italic>NC</italic>: Neighboring catchment. A catchment in proximity to the CoI, typically within a TD.</p>
          </list-item>
          <list-item>

      <p id="d2e296"><italic>counterfactual</italic>: A hypothetical realization of an event under alternative conditions, e.g. an HPE occurring at a different location.</p>
          </list-item>
          <list-item>

      <p id="d2e304"><italic>factual peak</italic>. Flood peak observed in the CoI or modelled with observed rainfall.</p>
          </list-item>
          <list-item>

      <p id="d2e312"><italic>counterfactual peak</italic>: Flood peak simulated by a hydrological model forced with a transposed (counterfactual) HPE.</p>
          </list-item>
        </list></p>
      <p id="d2e319">Previous studies have proposed different methods to define a TD from which HPEs are sampled. <xref ref-type="bibr" rid="bib1.bibx22" id="text.14"/> described it as a region where “significant storms are uniformly distributed in space,” while <xref ref-type="bibr" rid="bib1.bibx59" id="text.15"/> suggested using cloud-to-ground lightning analyses. However, the outcome of these methods also depends on the length of the observed data. Instead of defining a TD based on a complex analysis, <xref ref-type="bibr" rid="bib1.bibx50" id="text.16"/> introduced the concept of “local counterfactuals”: they selected HPEs that had caused high runoff peaks in basins from a close (i.e. “local”) neighborhood around the CoI (more specifically, a 20 <inline-formula><mml:math id="M5" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> radius), transposed these events to the CoI and used it to force a rainfall-runoff model that would than return the counterfactual flood peak. The approach was based on the assumption that if an HPE were sampled from a local neighborhood, it would be more representative for HPEs that are “typical” for the CoI. Even with this local TD, local counterfactuals produced flood peaks comparable to a 200 <inline-formula><mml:math id="M6" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">year</mml:mi></mml:mrow></mml:math></inline-formula> return level flood. By using the runoff reaction of nearby catchments as a filter to sample hydrologically meaningful HPEs for transposition, no previous detection and compilation of HPEs is required with this method.</p>
      <p id="d2e347">But how can counterfactuals be incorporated into FFA? For numerous application contexts, return periods and design levels remain essential for stakeholders. SST provides one way to statistically assess the occurrence of hypothetical flood scenarios, but it requires both the definition of the TD and the selection of the most relevant rainfall duration for sampling events. The latter is not trivial, as the duration of extreme rainfall that generates the highest flood peak may vary between catchments and is often difficult to determine in advance. For this reason <xref ref-type="bibr" rid="bib1.bibx50" id="text.17"/> proposed a bottom-up approach by selecting the HPEs which caused high flood peaks nearby catchments, irrespective of rainfall duration.</p>
      <p id="d2e353">In this study, we propose to extend the concept of local counterfactuals in order to formally integrate counterfactual flood peaks into flood frequency analysis (FFA). This is demonstrated in a case study on the basis of 23 <inline-formula><mml:math id="M7" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">years</mml:mi></mml:mrow></mml:math></inline-formula> of radar-based precipitation records in Germany, in combination with a Germany-wide flash flood model as introduced by <xref ref-type="bibr" rid="bib1.bibx51" id="text.18"/>: for each of 13 452 headwater catchments (<inline-formula><mml:math id="M8" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">750</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M9" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">km</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>) in Germany, we fit three GEV distributions: (i) from the 23 annual flood peak maxima modelled in the basin of interest (our reference), (ii) from 230 counterfactual peaks derived by spatially transposing HPEs which caused 23 annual maximum peak discharge values in 10 hydrologically and topographically similar neighboring basins, and (iii) from the combined dataset. A 30 <inline-formula><mml:math id="M10" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> radius neighborhood (transposition domain) can be still be considered as local and small, compared to the domain sizes in other studies <xref ref-type="bibr" rid="bib1.bibx51 bib1.bibx1" id="paren.19"><named-content content-type="pre">e.g.</named-content></xref>, and it is our prime filter to make sure we sample storms from an atmospheric environment that is governed by similar mechanisms as the CoI. Yet, sampling storms that caused annual maxima in similar catchments should ensure that the transposed rainfall has spatio-temporal characteristics that make them representative for the CoI (e.g. similar catchment size) and that could also occur over the CoI given potential orographic effects (e.g. similar catchment elevation). The number of 10 neighboring catchments was chosen mainly due to limited computational resources.</p>
      <p id="d2e403">The quantile score (QS) <xref ref-type="bibr" rid="bib1.bibx5" id="paren.20"/> is then used to analyze whether counterfactual information improves the representation of extremes beyond the limited observational record. The QS can evaluate improvement for each quantile of interest. We repeat this procedure for four differently sized and shaped TDs and show how the design of the TD affects the results of the FFA. Finally, we discuss the return levels derived from the different GEV distributions and examine the corresponding confidence intervals.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Data</title>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>RADKLIM</title>
      <p id="d2e424">We used the radar climatology product RADKLIM v2017.002 (2001–2023) to compute local counterfactuals and to drive continuous runoff modeling across Germany. RADKLIM is provided by Germany's national meteorological service (Deutscher Wetterdienst, DWD) and represents a reprocessed version <xref ref-type="bibr" rid="bib1.bibx33" id="paren.21"/> of DWD's operational radar-based quantitative precipitation estimation product, RADOLAN <xref ref-type="bibr" rid="bib1.bibx53" id="paren.22"/>. The dataset has a spatial resolution of <inline-formula><mml:math id="M11" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow><mml:mo>×</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>, an hourly temporal resolution, and is openly available via the DWD open data server <xref ref-type="bibr" rid="bib1.bibx54" id="paren.23"/>.</p>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>DEM</title>
      <p id="d2e464">For catchment delineation and runoff analysis, we used the EU-DEM <xref ref-type="bibr" rid="bib1.bibx17" id="paren.24"/>, which has a 25 <inline-formula><mml:math id="M12" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula> resolution and combines SRTM (Shuttle Radar Topography Mission) and ASTER GDEM (Advanced Spaceborne Thermal Emission and Reflection Radiometer Global Digital Elevation Model).</p>
</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Land use and soil data</title>
      <p id="d2e486">Information on land cover was obtained from CORINE CLC5-2018 <xref ref-type="bibr" rid="bib1.bibx7" id="paren.25"/>, which classifies high-resolution satellite imagery into 37 land cover classes for Germany following the European Environmental Agency (EEA) nomenclature. The classification considers objects with a minimum size of 5 <inline-formula><mml:math id="M13" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">ha</mml:mi></mml:mrow></mml:math></inline-formula> and is updated every three years. Soil data were derived from the BUEK 200 national soil survey <xref ref-type="bibr" rid="bib1.bibx6" id="paren.26"><named-content content-type="pre">scale <inline-formula><mml:math id="M14" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">200</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">000</mml:mn></mml:mrow></mml:math></inline-formula>;</named-content></xref>, compiled from federal state surveys by the Federal Institute for Geosciences and Natural Resources (BGR) in cooperation with the National Geological Services (Staatliche Geologische Dienste). For each mapping unit, BUEK 200 provides areal fractions of dominant soil types along with detailed profile information, including texture, bulk density, and other key properties.</p>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Methods</title>
      <p id="d2e529">Much of the data and methodology for this study are detailed in <xref ref-type="bibr" rid="bib1.bibx51" id="text.27"/>. Here, we briefly describe the hydrological model, further explain the flood frequency analysis, and the selection of local counterfactuals.</p>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Modelling surface runoff</title>
      <p id="d2e542">The hydrological model <xref ref-type="bibr" rid="bib1.bibx49" id="paren.28"/> was specifically tailored to simulate flash flood events in small- to medium-sized basins. A detailed model description is provided in <xref ref-type="bibr" rid="bib1.bibx51" id="text.29"/>. During flash floods, surface runoff dominates <xref ref-type="bibr" rid="bib1.bibx36 bib1.bibx27" id="paren.30"/>, while evaporation and groundwater dynamics are negligible. Accordingly, the model comprises two modules. First, effective rainfall is estimated for each catchment and timestep (hourly) using the SCS-CN method <xref ref-type="bibr" rid="bib1.bibx47" id="paren.31"/>, which is widely applied in flash flood modeling <xref ref-type="bibr" rid="bib1.bibx24 bib1.bibx8 bib1.bibx16" id="paren.32"/>. Since flash flood events predominantly occur during the summer months, we slightly adjusted the CN values for agricultural areas to account for the effects of summer crops <xref ref-type="bibr" rid="bib1.bibx45" id="paren.33"><named-content content-type="pre">based on</named-content></xref>. A single CN value for each subbasin was then derived using an area-weighted average. Second, the geomorphological instantaneous unit hydrograph (GIUH), derived from the DEM, represents the concentration of quick runoff from effective rainfall. The flow velocities were computed with the method of Maidment <xref ref-type="bibr" rid="bib1.bibx35" id="paren.34"/>. This approach accounts for the increase in hydraulic radius with rising flow volumes, as described by Manning's equation, thereby capturing the downstream acceleration of flow without requiring the estimation of roughness coefficients for individual grid cells. In addition, it removes the need to distinguish between hillslope and channel grid cells within the catchment. The method assumes a velocity field that is invariant in both time and discharge, enabling the convolution of GIUHs to simulate the catchment response to the effective rainfall of an HPE. When two subcatchments converge, the hydrograph of the upstream basin is superimposed on that of the downstream basin with an appropriate time lag. This delay is defined by the travel time from the downstream basin's inlet to its outlet.</p>
      <p id="d2e569">The model's lightweight design allows the computation of large numbers of counterfactual scenarios. As it does not account for channel hydraulics or engineered structures, the analysis is restricted to headwater catchments smaller than 750 <inline-formula><mml:math id="M15" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">km</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>. Because of the lumped nature of the model it is crucial that the catchments are small enough to account for the spatial variability of rainfall. In our analysis, this corresponds to 13 452 sub-catchments with an mean area of 15.7 <inline-formula><mml:math id="M16" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">km</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> and a maximum headwater catchment size of 163 <inline-formula><mml:math id="M17" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">km</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>.</p>
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Design of local counterfactuals</title>
      <p id="d2e613">Local counterfactuals are HPEs drawn from a neighborhood (TD) of a given catchment of interest (CoI) – the catchment to which the counterfactual scenarios are applied, and transposed to the CoI. In this study, all aforementioned 13 452 headwater catchments smaller than 750 <inline-formula><mml:math id="M18" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">km</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> are individually treated as a CoI, meaning that the following procedure is applied to each of these catchments (see also Fig. <xref ref-type="fig" rid="F1"/> for illustration): <list list-type="order"><list-item>
      <p id="d2e631">For each CoI, we identified the ten most similar catchments located entirely within a 30 <inline-formula><mml:math id="M19" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> buffer around the CoI. We based similarity mostly on descriptors of topography, land use and soil which should (i) strongly govern the formation and concentration of surface runoff and (ii) ensure that potential orographic effects could occur both in the CoI and the NCs. Following descriptors were chosen: <list list-type="bullet"><list-item>
      <p id="d2e644">Peak [<inline-formula><mml:math id="M20" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>], time to peak [<inline-formula><mml:math id="M21" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">s</mml:mi></mml:mrow></mml:math></inline-formula>] and standard deviation [<inline-formula><mml:math id="M22" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>] of the unit hydrograph: The unit hydrograph is derived directly from the DEM, similar hydrographs imply, to a certain degree, similar topography.</p></list-item><list-item>
      <p id="d2e696">Total catchment area including upstream basins.</p></list-item><list-item>
      <p id="d2e700">Curve number (soil moisture class 2): The curve number represents soils and land use in our model. A similar curve number would lead to a similar runoff generation in our model.</p></list-item><list-item>
      <p id="d2e704">Mean and standard elevation of the DEM and mean slope. With this descriptor we try to avoid sampling rainfall events from catchments which are e.g. situated at a substantially different elevation. If the CoI was e.g. close to a mountain range, rainfall events should not be sampled from this mountainous area, because they might not be representative for the rainfall events occuring in the CoI.</p></list-item><list-item>
      <p id="d2e708">Unit Peak Discharge: The peak of the unit hydrograph divided by the catchment area is yet another descriptor of the hydrological character of the catchment.</p></list-item></list></p>
      <p id="d2e711">We used the KDTree-algorithm from the Python library “SciKit-Learn” and scaled all catchment descriptors with the “StandardScaler” from this library to ensure that none of this descriptors dominates the decision for similarity. However, we acknowledge that some descriptors are correlated.</p></list-item><list-item>
      <p id="d2e715">For each of these NCs, we model the quick runoff from 2001 until 2023 (Fig. <xref ref-type="fig" rid="F1"/>b). We then identify the annual maximum peak discharge for each of the 23 <inline-formula><mml:math id="M23" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">years</mml:mi></mml:mrow></mml:math></inline-formula> (Fig. <xref ref-type="fig" rid="F1"/>c).</p></list-item><list-item id="Ch1.I3.i3">
      <p id="d2e731">From RADKLIM, we extract the data for the 23 HPEs which caused the annual maximum peaks in the NC (Fig. <xref ref-type="fig" rid="F1"/>b) and transpose them from their original spatial position from the centroid of the NC  to the centroid of the CoI, thereby creating spatial counterfactuals (Fig. <xref ref-type="fig" rid="F1"/>d). We ensure that the CoI and all its upstream catchments will be completely covered by the HPE, by adding a 70 <inline-formula><mml:math id="M24" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> buffer on each side of the RADKLIM subset (for better visualization we do not show the buffer in Fig. <xref ref-type="fig" rid="F1"/>). To ensure a consistent soil moisture state we add a 14 <inline-formula><mml:math id="M25" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">d</mml:mi></mml:mrow></mml:math></inline-formula> temporal buffer before the actual event. If the CoI consists of various subbasin, we additionally transpose the HPEs to the centroid of every upstream subbasin.</p></list-item><list-item id="Ch1.I3.i4">
      <p id="d2e757">We model the surface runoff that these counterfactual HPEs would have caused in the CoI (Fig. <xref ref-type="fig" rid="F1"/>e) and record the maximum counterfactual peak discharge values for each year. We repeat steps 3 and 4 for all NCs.</p></list-item></list></p>

      <fig id="F1" specific-use="star"><label>Figure 1</label><caption><p id="d2e764">Development of local counterfactuals: <bold>(a)</bold> Catchment of Interest (CoI, green) and its 10 neighbor catchments (NCs, dark blue) in a 30 <inline-formula><mml:math id="M26" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> neighborhood (light blue). <bold>(b)</bold> Selecting the HPE which caused the highest annual runoff peak (red dot, <bold>c</bold>) in the NC (red box). <bold>(d)</bold> Transposing the HPE from the NC to the CoI and modelling the resulting runoff <bold>(e)</bold>. This procedure is repeated for each NC and steps <bold>(c–e)</bold> are repeated for each year.</p></caption>
          <graphic xlink:href="https://nhess.copernicus.org/articles/26/2189/2026/nhess-26-2189-2026-f01.png"/>

        </fig>

      <p id="d2e800">We hypothesize that the representativeness of counterfactuals for the meteorological processes governing the CoI generally decreases with the distance between the corresponding NC and the CoI. To test this hypothesis, we compared four transposition domains (TDs): a 10 <inline-formula><mml:math id="M27" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> buffer, a 30 <inline-formula><mml:math id="M28" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> buffer, and two ring-shaped TDs with inner–outer radii of 30–60 and 60–90 <inline-formula><mml:math id="M29" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> around the CoI, respectively.</p>
</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>GEV distribution</title>
      <p id="d2e835">Under certain conditions, block maxima are GEV-distributed <xref ref-type="bibr" rid="bib1.bibx21 bib1.bibx26" id="paren.35"/>. These conditions are met for precipitation <xref ref-type="bibr" rid="bib1.bibx12" id="paren.36"/> and discharge. The cumulative distribution function (CDF) of the GEV is defined

                <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M30" display="block"><mml:mstyle class="stylechange" displaystyle="true"/><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi>G</mml:mi><mml:mo>(</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mfenced open="{" close=""><mml:mtable rowspacing="0.2ex" class="cases" columnspacing="1em" columnalign="left left" framespacing="0em"><mml:mtr><mml:mtd><mml:mrow><mml:mi>exp⁡</mml:mi><mml:mo mathsize="2.0em" mathvariant="italic">{</mml:mo><mml:mo>-</mml:mo><mml:msup><mml:mfenced open="[" close="]"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:mi mathvariant="italic">ξ</mml:mi><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:mi>x</mml:mi><mml:mo>-</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mi mathvariant="italic">σ</mml:mi></mml:mfrac></mml:mstyle></mml:mfenced></mml:mrow></mml:mfenced><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>/</mml:mo><mml:mi mathvariant="italic">ξ</mml:mi></mml:mrow></mml:msup><mml:mo mathsize="2.0em" mathvariant="italic">}</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo>≠</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mi>exp⁡</mml:mi><mml:mo mathvariant="italic" mathsize="2.0em">{</mml:mo><mml:mo>-</mml:mo><mml:mfenced close=")" open="("><mml:mrow><mml:msup><mml:mi>exp⁡</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mi>z</mml:mi><mml:mo>-</mml:mo><mml:mi mathvariant="italic">μ</mml:mi><mml:mo>)</mml:mo><mml:mo>/</mml:mo><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:mfenced><mml:mo mathvariant="italic" mathsize="2.0em">}</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi mathvariant="italic">ξ</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced></mml:mrow></mml:math></disp-formula>

          with location <inline-formula><mml:math id="M31" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula>, scale <inline-formula><mml:math id="M32" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> and shape <inline-formula><mml:math id="M33" display="inline"><mml:mi mathvariant="italic">ξ</mml:mi></mml:math></inline-formula>.</p>
      <p id="d2e980">From the GEV distribution, return levels can be obtained for return periods that are even longer than the length of record. However, this extrapolation is uncertain with limited sample size <xref ref-type="bibr" rid="bib1.bibx12" id="paren.37"/>. Our suggestion is, hence, to increase the sample size with local counterfactuals. To fulfill the requirements of the Fisher–Tippet-Theorem, all block maxima have to be drawn from the same statistical distribution. We choose a very small area as TD and then select HPEs within this TD based on the streamflow response of neighboring catchments that are very similar regarding slope, elevation, land use and the unit hydrograph (see Sect. <xref ref-type="sec" rid="Ch1.S3.SS2"/>). Based on this similarity of catchments and the small TD, we regard this first requirement as fulfilled. Since the peaks of factual and counterfactual HPEs are determined with the same method, both can be pooled to fit a GEV, given all assumptions above. To fulfill the requirements of the Fisher–Tippet-Theorem, it is also paramount not to arbitrarily discard subsets of the data. More specifically, this mandates to include <bold>all</bold> annual maximum peak discharge values from an NC (instead of,  e.g. just the highest one) to keep a consistent effective block size.</p>
      <p id="d2e991">In FFA, special attention is given to the shape parameter <inline-formula><mml:math id="M34" display="inline"><mml:mi mathvariant="italic">ξ</mml:mi></mml:math></inline-formula>: a large shape parameter indicates a heavy tailed distribution where extreme events with high magnitude can occur. Especially when fitted to limited data points, the GEV distribution can produce implausible parameter estimates or “poor fits”. For this reason we disregard catchments where one of the previous GEV distributions has a shape <inline-formula><mml:math id="M35" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> or shape <inline-formula><mml:math id="M36" display="inline"><mml:mrow><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">0.5</mml:mn><mml:mo>.</mml:mo></mml:mrow></mml:math></inline-formula> These thresholds are a compromise of values for <inline-formula><mml:math id="M37" display="inline"><mml:mi mathvariant="italic">ξ</mml:mi></mml:math></inline-formula> that are considered to be hydrologically plausible <xref ref-type="bibr" rid="bib1.bibx42 bib1.bibx37" id="paren.38"/>. We used the Python package “scipy” <xref ref-type="bibr" rid="bib1.bibx48" id="paren.39"/> with a maximum likelihood estimator to fit the GEV distribution.</p>
</sec>
<sec id="Ch1.S3.SS4">
  <label>3.4</label><title>Quantile skill score</title>
      <p id="d2e1046">We utilize the quantile score (QS) <xref ref-type="bibr" rid="bib1.bibx5" id="paren.40"/> to quantify how a well a GEV distribution represents the quantiles of the series of annual block maxima from the CoI. First, the tilted check-function <inline-formula><mml:math id="M38" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>p</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is used to compute a penalty for estimated quantiles in comparison to the data points <inline-formula><mml:math id="M39" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mi>n</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>.

                <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M40" display="block"><mml:mstyle displaystyle="true" class="stylechange"/><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>p</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>u</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mfenced open="{" close=""><mml:mtable rowspacing="0.2ex" class="cases" columnspacing="1em" columnalign="left left" framespacing="0em"><mml:mtr><mml:mtd><mml:mrow><mml:mi>p</mml:mi><mml:mi>u</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi>u</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mo>(</mml:mo><mml:mi>p</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo><mml:mi>u</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi>u</mml:mi><mml:mo>≤</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced></mml:mrow></mml:math></disp-formula></p>
      <p id="d2e1144">With <inline-formula><mml:math id="M41" display="inline"><mml:mrow><mml:mi>u</mml:mi><mml:mo>=</mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mi>q</mml:mi></mml:mrow></mml:math></inline-formula>. For high non-exceedance probabilities <inline-formula><mml:math id="M42" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> (which corresponds to a return period <inline-formula><mml:math id="M43" display="inline"><mml:mrow><mml:mi>T</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mi>p</mml:mi></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></inline-formula>), it leads to a strong penalty for data points that are still higher than the modeled quantile (<inline-formula><mml:math id="M44" display="inline"><mml:mrow><mml:msub><mml:mi>z</mml:mi><mml:mi>n</mml:mi></mml:msub><mml:mo>&gt;</mml:mo><mml:mi>q</mml:mi></mml:mrow></mml:math></inline-formula>).</p>
      <p id="d2e1209">The quantile score is then computed for each non-exceedance probability <inline-formula><mml:math id="M45" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>:

                <disp-formula id="Ch1.E3" content-type="numbered"><label>3</label><mml:math id="M46" display="block"><mml:mstyle displaystyle="true" class="stylechange"/><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtext>QS</mml:mtext><mml:mo>(</mml:mo><mml:mi>y</mml:mi><mml:mo>,</mml:mo><mml:mi>q</mml:mi><mml:mo>;</mml:mo><mml:mi>p</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:munderover><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>p</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mi>q</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>

          with p-quantile <inline-formula><mml:math id="M47" display="inline"><mml:mi>q</mml:mi></mml:math></inline-formula> (a return level corresponding to <inline-formula><mml:math id="M48" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>), tilted check-function <inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ρ</mml:mi><mml:mi>p</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mo>⋅</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> and block maximum <inline-formula><mml:math id="M50" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> obtained from the <inline-formula><mml:math id="M51" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> factual peaks in the CoI.</p>
      <p id="d2e1321">We estimate the parameters of three GEV distributions that are fitted on different subsets of data and refer to them as follows:</p>
      <p id="d2e1325"><list list-type="bullet">
            <list-item>

      <p id="d2e1330"><inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">GEV</mml:mi><mml:mi mathvariant="italic">CoI</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>: fitted only to the factual peaks from the CoI.</p>
            </list-item>
            <list-item>

      <p id="d2e1346"><inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">GEV</mml:mi><mml:mi mathvariant="italic">NCs</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>: fitted only to the counterfactual peaks from the NCs.</p>
            </list-item>
            <list-item>

      <p id="d2e1362"><inline-formula><mml:math id="M54" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">GEV</mml:mi><mml:mi mathvariant="italic">all</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>: fitted to both factual peaks from the CoI and the counterfactual peaks from the NCs.</p>
            </list-item>
          </list></p>
      <p id="d2e1377">Essentially, <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>all</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> are the GEV variants that we introduce as competitors against the conventional <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> which is exclusively based on information obtained in the CoI. In order to verify the added value of the new GEV variants, <inline-formula><mml:math id="M58" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> serves as a reference. For this purpose, we use the quantile skill score (QSS) which compares the QS of <inline-formula><mml:math id="M59" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M60" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>all</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> (denoted <inline-formula><mml:math id="M61" display="inline"><mml:mrow><mml:msub><mml:mtext>QS</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M62" display="inline"><mml:mrow><mml:msub><mml:mtext>QS</mml:mtext><mml:mtext>all</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>, respectively) to the QS of our reference <inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> (<inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:msub><mml:mtext>QS</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>) as follows:

                <disp-formula id="Ch1.E4" content-type="numbered"><label>4</label><mml:math id="M65" display="block"><mml:mstyle class="stylechange" displaystyle="true"/><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mtext>QSS</mml:mtext><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mtext>QS</mml:mtext><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mrow><mml:msub><mml:mtext>QS</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:mfrac></mml:mstyle><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mtext> with </mml:mtext><mml:mi>i</mml:mi><mml:mo>∈</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:mtext>NCs, all</mml:mtext><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></disp-formula></p>
      <p id="d2e1538">The QSS can take values between minus infinity and 1. Positive values indicate that the competing GEV (<inline-formula><mml:math id="M66" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> or <inline-formula><mml:math id="M67" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>all</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>) is superior to the reference.</p>
      <p id="d2e1563">As the quantile score (Eq. <xref ref-type="disp-formula" rid="Ch1.E3"/>) is always computed for a specific return period <inline-formula><mml:math id="M68" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula> (or non-exceedance probability <inline-formula><mml:math id="M69" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula>), the QSS itself is obtained for specific values of T, too (20, 50, 100, and 200 <inline-formula><mml:math id="M70" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">years</mml:mi></mml:mrow></mml:math></inline-formula> in this study), similar to <xref ref-type="bibr" rid="bib1.bibx20" id="text.41"><named-content content-type="post">Fig. 4</named-content></xref>. Note that for very high return periods the QS might become unreliable, since only few or low observations are higher than the evaluated quantile. Then, the QS might just reward the model that predicts the lowest quantile.</p>
      <p id="d2e1595">The reference <inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:msub><mml:mtext>QS</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> itself is obtained by means of a leave-one-out cross-validation: to that end, one year <inline-formula><mml:math id="M72" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> is excluded from the CoI's series of factual annual maxima and the <inline-formula><mml:math id="M73" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> is estimated from the remaining training years. From the fitted <inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI,i</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>, a return level (p-quantile) is calculated and a quantile score <inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:msub><mml:mtext>QS</mml:mtext><mml:mtext>CoI,i</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> is determined from this return level and the annual maximum value for year <inline-formula><mml:math id="M76" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>. This is repeated for all years in the CoI series. <inline-formula><mml:math id="M77" display="inline"><mml:mrow><mml:msub><mml:mtext>QS</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> is then obtained as the average of all <inline-formula><mml:math id="M78" display="inline"><mml:mrow><mml:msub><mml:mtext>QS</mml:mtext><mml:mtext>CoI,i</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>.</p>
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Results and Discussion</title>
<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>Verifying the added value of counterfactual peaks on GEV estimation</title>
      <p id="d2e1695">For each small-scale basin in Germany, we created local counterfactuals (Sect. <xref ref-type="sec" rid="Ch1.S3.SS2"/>) and used these counterfactual flood peaks to fit a GEV distribution for the CoI. To validate how well local counterfactuals are able to represent the quantiles in the data (the factual flood peaks in the CoI), we performed an out-of-sample-test by comparing the <inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> with the <inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e1722">The inspection of the GEV parameters for each catchment reveals a large number of implausible shape parameters, especially for <inline-formula><mml:math id="M81" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>, which is fitted to only 23 <inline-formula><mml:math id="M82" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">year</mml:mi></mml:mrow></mml:math></inline-formula>-maxima. Because the number of data points increases by adding counterfactual peaks, the fits for <inline-formula><mml:math id="M83" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> improve: about 29 <inline-formula><mml:math id="M84" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> of the basins cannot be included in the analysis because the implausible fit of <inline-formula><mml:math id="M85" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> whereas 5 <inline-formula><mml:math id="M86" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> of the catchments have to be excluded because of an implausible fit of <inline-formula><mml:math id="M87" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> (see Sect. <xref ref-type="sec" rid="Ch1.S3.SS3"/>). In total this leads to an exclusion of <inline-formula><mml:math id="M88" display="inline"><mml:mo>≈</mml:mo></mml:math></inline-formula> 33 <inline-formula><mml:math id="M89" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> of the basins.</p>
      <p id="d2e1811">Figure <xref ref-type="fig" rid="F2"/> shows the results for all TDs and for four different return periods (20, 50, 100, and 200 <inline-formula><mml:math id="M90" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">years</mml:mi></mml:mrow></mml:math></inline-formula>). Since negative values of the QSS are harder to interpret, we show only <inline-formula><mml:math id="M91" display="inline"><mml:mrow><mml:mtext>QSS</mml:mtext><mml:mo>≥</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula>. We will discuss the differences between the different TDs in the following section. For now, we focus on the TD with a radius of 30 <inline-formula><mml:math id="M92" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>. The main result is that the <inline-formula><mml:math id="M93" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>  –  which has never seen any information from the CoI  –  clearly outperforms the <inline-formula><mml:math id="M94" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>: across all return periods and transposition domains, the majority of catchments have positive QSS values. E.g., for the TD with a 30 <inline-formula><mml:math id="M95" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> buffer, the percentage of catchments with positive <inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:msub><mml:mtext>QSS</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> values (see intercept on the <inline-formula><mml:math id="M97" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> axis) is 87 <inline-formula><mml:math id="M98" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> for <inline-formula><mml:math id="M99" display="inline"><mml:mrow><mml:mi>T</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M100" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">a</mml:mi></mml:mrow></mml:math></inline-formula>, 78 <inline-formula><mml:math id="M101" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> for <inline-formula><mml:math id="M102" display="inline"><mml:mrow><mml:mi>T</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M103" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">a</mml:mi></mml:mrow></mml:math></inline-formula>, 73 <inline-formula><mml:math id="M104" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> for <inline-formula><mml:math id="M105" display="inline"><mml:mrow><mml:mi>T</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M106" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">a</mml:mi></mml:mrow></mml:math></inline-formula>, and 69 <inline-formula><mml:math id="M107" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> for <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:mi>T</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">200</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M109" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">a</mml:mi></mml:mrow></mml:math></inline-formula>. Evidently, <inline-formula><mml:math id="M110" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> performs worse for the corresponding remainder to 100 <inline-formula><mml:math id="M111" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula>.</p>

      <fig id="F2"><label>Figure 2</label><caption><p id="d2e2029">Cumulative distributions showing the quantile skill scores for <inline-formula><mml:math id="M112" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> in reference to <inline-formula><mml:math id="M113" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>, for all subbasins and for four different transposition domains (10 <inline-formula><mml:math id="M114" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> buffer: black, 30 <inline-formula><mml:math id="M115" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> buffer: blue, 30–60 <inline-formula><mml:math id="M116" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> ring: yellow, 60–90 <inline-formula><mml:math id="M117" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> ring: pink. Subplots <bold>(a–d)</bold> show different quantiles that relate to the <bold>(a)</bold> 20 <inline-formula><mml:math id="M118" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">year</mml:mi></mml:mrow></mml:math></inline-formula>, <bold>(b)</bold> 50 <inline-formula><mml:math id="M119" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">year</mml:mi></mml:mrow></mml:math></inline-formula>, <bold>(c)</bold> 100 <inline-formula><mml:math id="M120" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">year</mml:mi></mml:mrow></mml:math></inline-formula>, and <bold>(d)</bold> 200 <inline-formula><mml:math id="M121" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">year</mml:mi></mml:mrow></mml:math></inline-formula> flood. A quantile score <inline-formula><mml:math id="M122" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> indicates the superiority of the  <inline-formula><mml:math id="M123" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>. The median QSS of the 30 <inline-formula><mml:math id="M124" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> buffer is indicated with the vertical blue dashed line.</p></caption>
          <graphic xlink:href="https://nhess.copernicus.org/articles/26/2189/2026/nhess-26-2189-2026-f02.png"/>

        </fig>

      <p id="d2e2171">We would like to take a closer look at the differences between the return periods. Increasing return periods lead to a decreasing fraction of catchments with positive <inline-formula><mml:math id="M125" display="inline"><mml:mrow><mml:msub><mml:mtext>QSS</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> values  –  obviously not desirable -, but also to a desirable increase of catchments with very high QSS values (for <inline-formula><mml:math id="M126" display="inline"><mml:mrow><mml:mi>T</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M127" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">a</mml:mi></mml:mrow></mml:math></inline-formula>, 0.2 <inline-formula><mml:math id="M128" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> of the catchments have a <inline-formula><mml:math id="M129" display="inline"><mml:mrow><mml:mtext>QSS</mml:mtext><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0.5</mml:mn></mml:mrow></mml:math></inline-formula>, while this fraction grows to 28 <inline-formula><mml:math id="M130" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> for <inline-formula><mml:math id="M131" display="inline"><mml:mrow><mml:mi>T</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">200</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M132" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">a</mml:mi></mml:mrow></mml:math></inline-formula>). Altogether, the median QSS continuously grows from a value of 0.16 for <inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:mi>T</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M134" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">a</mml:mi></mml:mrow></mml:math></inline-formula> to a value of 0.27 for <inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:mi>T</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">200</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M136" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">a</mml:mi></mml:mrow></mml:math></inline-formula>, suggesting that the value added by using <inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> increases with the return period. This is plausible, since return levels for low return periods can be estimated more robustly from short time series (for <inline-formula><mml:math id="M138" display="inline"><mml:mrow><mml:mi>T</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M139" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">a</mml:mi></mml:mrow></mml:math></inline-formula>, the estimation of a return level from an annual series of 23 <inline-formula><mml:math id="M140" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">years</mml:mi></mml:mrow></mml:math></inline-formula> does not even imply extrapolation). The uncertainty increases the more we extrapolate beyond the length of the annual series. Especially for high return periods the benefit of an increased data basis is visible in these results.</p>
      <p id="d2e2334">These results serve as a proof of concept: for the majority of cases, we are able to better represent the quantiles in the data of the CoI by using a GEV distribution fitted exclusively to the counterfactual peaks (<inline-formula><mml:math id="M141" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>). Besides the fact, that the counterfactual peaks represent the distribution of CoI peaks well, the <inline-formula><mml:math id="M142" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> is also more robust because it is fitted to 230 values, instead of the 23 values used for <inline-formula><mml:math id="M143" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>. The improvement is more pronounced for higher quantiles (or return periods). In practice the GEV would be fitted to both factual <italic>and</italic> counterfactual peaks together (<inline-formula><mml:math id="M144" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>all</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>), which only marginally increases the robustness of the return level estimates. The QSS for <inline-formula><mml:math id="M145" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>all</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> is shown in Fig. S1 in the Supplement.</p>
</sec>
<sec id="Ch1.S4.SS2">
  <label>4.2</label><title>Effect of different transposition domains</title>
      <p id="d2e2404">We calculated the <inline-formula><mml:math id="M146" display="inline"><mml:mrow><mml:msub><mml:mtext>QSS</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> for four different TDs. This way, we want to investigate whether HPEs transposed from larger distances are less “typical” for the CoI and will therefore result in less representative GEV fits with lower values of <inline-formula><mml:math id="M147" display="inline"><mml:mrow><mml:msub><mml:mtext>QSS</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>. This effect can be observed in Fig. <xref ref-type="fig" rid="F2"/>. For each return period, the intercepts of the QSS distributions on the <inline-formula><mml:math id="M148" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> axis are higher for the ring-shaped TDs (30–60 and 60–90 <inline-formula><mml:math id="M149" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>) than the intercepts of the TDs with a 10- or 30 <inline-formula><mml:math id="M150" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> buffer. This effect is less pronounced with increasing quantiles. The differences between the 10- and 30 <inline-formula><mml:math id="M151" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>-buffers are very small. These results support the hypothesis that HPEs transposed over short distances are more representative for the HPEs occurring directly over the CoI. Nevertheless, the sampling process of the NCs can also have an impact on the results. Within the TD we sample ten catchments which are most similar to the CoI (Sect. <xref ref-type="sec" rid="Ch1.S3.SS2"/>). If the TD is very small, there are less basins to sample from so that the representativeness of the sampled HPEs for the CoI might suffer. Likewise it could  also be possible that basins are less similar to each other with increasing distance. Due to the complex topography around every catchment, we think that there can be hardly a generalized solution for the “perfect” transposition domain. However, our results show, that there is, for most small-scale basins in Germany, no large difference whether the TD is a 10-, or 30 <inline-formula><mml:math id="M152" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> buffer. Providing large computational resources this could be systematically investigated further by increasing the size of the TD step by step and evaluating the QSS.</p>
</sec>
<sec id="Ch1.S4.SS3">
  <label>4.3</label><title>Return levels</title>
      <p id="d2e2481">We would now like to demonstrate how the use of local counterfactuals affects return levels, in comparison to the conventional use of factual discharge peaks in the CoI. While GEV<sub>NCs</sub> was used for verification in Sect. 4.1, we will now use GEV<sub>all</sub> because there is no reason to entirely discard the data from the CoI for GEV fitting. For the 200 <inline-formula><mml:math id="M155" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">year</mml:mi></mml:mrow></mml:math></inline-formula> return period, Fig. <xref ref-type="fig" rid="F3"/> shows the ratio between the return level obtained from <inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>all</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> and from <inline-formula><mml:math id="M157" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> (as a histogram over all analysed catchments). For all TDs, the median ratio is very close to one, so using local counterfactuals results in lower return levels for half of the basins and to higher return level for the other half. In our view, this is an important insight: in contrast to our intuitive expectation, the use of local counterfactuals for GEV fitting does not systematically increase the resulting return levels, but simply reduces the estimation error over all CoIs (based on the higher QSS and the narrower confidence intervals, see below). However, this improvement of the GEV estimation is still based on the inclusion of higher discharge maxima via counterfactuals. This is illustrated by the gray histograms in Fig. <xref ref-type="fig" rid="F3"/> which show, for each catchment, the ratio between the highest value in the annual maximum series of counterfactual <italic>and</italic> factual peaks and the highest value in the annual maximum series of just the factual peaks. The gray histograms clearly show that counterfactuals increase the maximum of the complete series of annual maxima, leading to more robust GEV fits. The medians for all TDs are between 1.43–1.6. It is important to note that the four TDs cover very different spatial extents: the 30–60 <inline-formula><mml:math id="M158" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>-ring has an area of 14,137 <inline-formula><mml:math id="M159" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">km</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, while the 10 <inline-formula><mml:math id="M160" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> buffer has a size of <inline-formula><mml:math id="M161" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">466</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M162" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">km</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> (for a circular basin with an area of 15 <inline-formula><mml:math id="M163" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">km</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>). The larger the TD we sample HPEs from, the more options we have to find catchments which are very similar to the CoI. Thus, it is also more likely that we are sampling HPEs that matter regarding the formation of an extreme flood peak in the CoI. This absolute maximum peak could serve as reference for the probable maximum flood (PMF) and is automatically included in the results of the analysis. Yet, remember that sampling from such more distant and larger neighborhoods does not improve the GEV estimation, as was shown in Sect. <xref ref-type="sec" rid="Ch1.S4.SS2"/>.</p>

      <fig id="F3"><label>Figure 3</label><caption><p id="d2e2604">Histogram of the ratio between the 200 <inline-formula><mml:math id="M164" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">year</mml:mi></mml:mrow></mml:math></inline-formula> return level from <inline-formula><mml:math id="M165" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>all</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> and from <inline-formula><mml:math id="M166" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> for four TDs. Gray histograms indicate the ratio of the highest peaks in the respective datasets used for fitting (see main text for further explanation). The median ratio of the return levels ratio is marked in black, and the median ratio of maximum peak discharge values in red.</p></caption>
          <graphic xlink:href="https://nhess.copernicus.org/articles/26/2189/2026/nhess-26-2189-2026-f03.png"/>

        </fig>

      <p id="d2e2643">The difference between the two medians shown in Fig. <xref ref-type="fig" rid="F3"/> may appear counterintuitive. However, two factors account for this observation. First, although the counterfactual dataset exhibits some higher peaks, these peaks occur jointly with the entire set of annual maxima from this NC (23 values for each NC). In many cases these high peaks have little impact on the GEV fit due to the amount of data points that are just “average” peaks. Figure <xref ref-type="fig" rid="F4"/> shows an example of this case. The sampling error with small sample sizes as the 23 annual maxima for the <inline-formula><mml:math id="M167" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> can lead to very heavy tailed GEV fits, high return level estimates and very wide confidence intervals. Even though the data pool for <inline-formula><mml:math id="M168" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>all</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> (253 values) contains more extreme peaks (max. CoI: 43.2 <inline-formula><mml:math id="M169" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, max. all: 87.3 <inline-formula><mml:math id="M170" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>), the fit is still mainly influenced by the larger amount of moderate peaks and results in lower return level estimates.</p>

      <fig id="F4"><label>Figure 4</label><caption><p id="d2e2716">Comparison of two <inline-formula><mml:math id="M171" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M172" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>all</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> for one exemplary basin. <bold>(a)</bold> Return levels estimated by <inline-formula><mml:math id="M173" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>all</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> (orange) are lower than by <inline-formula><mml:math id="M174" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> (purple). The shaded areas mark the 95 <inline-formula><mml:math id="M175" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> confidence interval estimated with boot strapping (<inline-formula><mml:math id="M176" display="inline"><mml:mrow><mml:mi>n</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">500</mml:mn></mml:mrow></mml:math></inline-formula>). The empirical return periods were estimated with the Weibull plotting position and are indicated with the semi-transparent dots. <bold>(b)</bold> Density histogram of the annual maxima and fitted GEV distribution.</p></caption>
          <graphic xlink:href="https://nhess.copernicus.org/articles/26/2189/2026/nhess-26-2189-2026-f04.png"/>

        </fig>

      <p id="d2e2796">Secondly, local counterfactuals also induce spatial smoothing (which is desired): each catchment is a CoI once, but serves as neighbor for many other CoIs. As a result, nearby and hydrologically similar catchments often share almost identical sets of peaks. When a counterfactual peak increases the return level estimate for one CoI, the peaks from that CoI will also enter the NC data pool once their roles are reversed. In this case, the inclusion of the peak can reduce the return level estimate for the neighboring catchment.</p>
      <p id="d2e2799">The estimation of return levels beyond the observational period comes with large uncertainties in the case of <inline-formula><mml:math id="M177" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> in the example in Fig. <xref ref-type="fig" rid="F4"/>: the 200 <inline-formula><mml:math id="M178" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">year</mml:mi></mml:mrow></mml:math></inline-formula> return level in is between 34–325 <inline-formula><mml:math id="M179" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msup><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (95 <inline-formula><mml:math id="M180" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> confidence interval). This range is much smaller for <inline-formula><mml:math id="M181" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>all</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>, where the 200 <inline-formula><mml:math id="M182" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">year</mml:mi></mml:mrow></mml:math></inline-formula> return level is between 87–130 <inline-formula><mml:math id="M183" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">m</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msup><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>. Across all catchments, the 95 <inline-formula><mml:math id="M184" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> confidence intervals shrink substantially. Within the 30 <inline-formula><mml:math id="M185" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> buffer, the median reduction in interval span is 78.75 <inline-formula><mml:math id="M186" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> for the 20 <inline-formula><mml:math id="M187" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">year</mml:mi></mml:mrow></mml:math></inline-formula> return level, 86.25 <inline-formula><mml:math id="M188" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> for the 50 <inline-formula><mml:math id="M189" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">year</mml:mi></mml:mrow></mml:math></inline-formula> level, 89.75 <inline-formula><mml:math id="M190" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> for the 100 <inline-formula><mml:math id="M191" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">year</mml:mi></mml:mrow></mml:math></inline-formula> level, and 92.25 <inline-formula><mml:math id="M192" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> for the 200 <inline-formula><mml:math id="M193" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">year</mml:mi></mml:mrow></mml:math></inline-formula> level.</p>
</sec>
</sec>
<sec id="Ch1.S5">
  <label>5</label><title>Limitations and need for future research</title>
      <p id="d2e2984">In our study, we presented a framework to increase the robustness of flood frequency analysis for small and medium sized basins by means of local counterfactuals. Still, the methodology and hence the results are subject to considerable uncertainties and limitations which we would like to discuss in the following, together with perspectives for future research in order to address these uncertainties.</p>
<sec id="Ch1.S5.SS1">
  <label>5.1</label><title>The concept of storm and catchment similarity</title>
      <p id="d2e2994">In the presented framework, we select and transpose HPEs that caused annual discharge maxima in similar catchments within a 30 <inline-formula><mml:math id="M194" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> radius around the CoI. That way, we aimed to find HPEs that are representative for the kind of HPEs that cause annual discharge maxima in the CoI. This procedure follows two main assumptions: (1) the 30 <inline-formula><mml:math id="M195" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> radius (neighborhood) makes sure that the transposed HPEs are governed by a climate that is similar to the CoI's conditions; (2) the catchment similarity makes sure that the transposed events have spatio-temporal characteristics representative for HPEs that cause flood events in the CoI  –  but without the need to predefine such characteristics explicitly: e.g. a similar catchment area allows to filter HPEs that act on a relevant spatial scale, a similar travel time distribution (i.e. similar GIUH properties such as time to peak, unit peak discharge, and standard deviation) allows to filter HPE that act on a relevant temporal scale, while a similar catchment elevation should favor HPEs which are governed by similar levels of orographic enhancement. We must admit, however, that the selection of the neighborhood radius as well as the similarity metrics and their integration by means of a KDTree-analysis are pragmatic choices  –  an expert guess, if you will. Other choices might lead to a superior filtering of HPEs. The question of whether a filter is superior can only be answered by means of a benchmark experiment in which we compare different designs by means of a performance metric, in our case the QSS. We applied such a benchmark experiment with regard to the neighborhood radius and found out that a 30 <inline-formula><mml:math id="M196" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> radius was preferable to a radius of 60–90 <inline-formula><mml:math id="M197" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>. Future research should aim at a more comprehensive evaluation of both the neighborhood radius and the catchment similarity metrics. This could also include different or additional metrics, e.g. the shape and the orientation of the catchment's major axis <xref ref-type="bibr" rid="bib1.bibx60" id="paren.42"><named-content content-type="pre">see</named-content></xref>. Such a benchmark experiment, however, would require a considerable computational effort which we did not invest in the present study as our focus rather was to establish a proof-of-concept.</p>
</sec>
<sec id="Ch1.S5.SS2">
  <label>5.2</label><title>Hydrological model uncertainty</title>
      <p id="d2e3042">Certainly, the hydrological model used on our analysis introduces considerable uncertainty – as would any hydrological model under extreme hydrological conditions. These uncertainties were already discussed in detail by <xref ref-type="bibr" rid="bib1.bibx51" id="text.43"/>: While the SCS-CN method is robust, it has been widely criticized for various reasons <xref ref-type="bibr" rid="bib1.bibx9" id="paren.44"><named-content content-type="pre">see, e.g.</named-content><named-content content-type="post">for an overview</named-content></xref>; among others, it does not explicitly account for the effect of precipitation intensity on surface runoff generation and is hence prone to underestimate quick runoff formation from short duration events – which might make the tail of the resulting GEV distribution too light. The assumption of linear and time-invariant response to effective rainfall might not hold under extreme runoff conditions, either, which could likewise affect the tail behavior. Furthermore, our focus is explicitly on summer events so that annual maxima caused by prolonged winter rainfall or spring snow melt are not represented on our analysis. However, this should only affect return levels for very low return periods which is not the focus of our study. Finally, our lumped model approach does not account for the spatial distribution of rainfall within the sub-catchments. However, our sub-catchments are very small (mean area of 15.7 <inline-formula><mml:math id="M198" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">km</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, so that the effect on simulated flood peaks should be acceptable.</p>
      <p id="d2e3066">Overall, it should be noted that, if the model should have any systematic error (bias) in a specific catchment, than this bias should affect the peak discharge of all events simulated for that catchment and hence reflect in all the different GEV distributions fitted for that specific catchment. That way, our comparisons of different GEV distributions should not suffer too much from any such systematic error.</p>
      <p id="d2e3069">Any finally, the presented <italic>concept</italic> of using local counterfactuals for GEV estimation is independent of the actually used hydrological model. For practical applications, e.g. by agencies in charge of risk management or design of hydraulic infrastructure, we recommend to repeat the analysis with a hydrological model that is calibrated and validated to the local conditions.</p>
</sec>
<sec id="Ch1.S5.SS3">
  <label>5.3</label><title>Length of the observational period</title>
      <p id="d2e3083">The radar-based precipitation dataset RADKLIM covers only 23 <inline-formula><mml:math id="M199" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">years</mml:mi></mml:mrow></mml:math></inline-formula>. For the computation of the quantile skill score (QSS), the 23 annual maximum flood peaks, which were modelled with RADKLIM as a forcing, served as the verification for <inline-formula><mml:math id="M200" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>CoI</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M201" display="inline"><mml:mrow><mml:msub><mml:mtext>GEV</mml:mtext><mml:mtext>NCs</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>. In that context, the QSS has to be interpreted with care, specifically for very high return periods such as 100 or 200 <inline-formula><mml:math id="M202" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">years</mml:mi></mml:mrow></mml:math></inline-formula>. In essence, the evaluation of the QSS for unseen quantiles is challenging because observations that exceed high quantiles are rare. Unfortunately, this limitation is difficult to overcome and applies to all scores known to us.</p>
      <p id="d2e3124">Furthermore, we had to discard 31 <inline-formula><mml:math id="M203" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> of GEV fits due to implausible values of the shape parameter – probably due to the small sample size. For fitting the GEV parameters with such short series, the L-Moments method might be a better choice than the maximum likelihood approach (as applied by us in the present study). Future studies should also consider the use of the peak-over-threshold method with the Generalized Pareto distribution as an option to address this issue <xref ref-type="bibr" rid="bib1.bibx2" id="paren.45"/>. And finally, with such a short time series, issues of non-stationarity <xref ref-type="bibr" rid="bib1.bibx40" id="paren.46"/>, as a consequence from e.g. climate and land use change, are difficult to account for. In the present context, the effect of climate change on the frequency and amplitude of convective heavy rainfall will probably constitute a relevant source of uncertainty <xref ref-type="bibr" rid="bib1.bibx10 bib1.bibx11" id="paren.47"><named-content content-type="pre">see, e.g.</named-content></xref>.</p>
      <p id="d2e3146">While our method improves the robustness of the estimation of higher return levels, relevant for flood risk management, the short observational period limits counterfactual analyses in the same way as it does for conventional flood frequency analysis that would only use data from the CoI, in other words: longer records will always be beneficial, even with the inclusion of local counterfactuals.</p>
</sec>
</sec>
<sec id="Ch1.S6" sec-type="conclusions">
  <label>6</label><title>Conclusions</title>
      <p id="d2e3158">In this study, we introduced a framework to increase the robustness of the GEV fits for flood frequency analysis by utilizing local counterfactuals. While being inspired by the concept of stochastic storm transposition, we follow a different approach in selecting candidate HPEs (based on the discharge response they caused in hydrologically similar neighbor catchments within a specific search radius around the the CoI), and in transposing these candidate events within the transposition domain (not stochastically, but systematically right over the CoI).</p>
      <p id="d2e3161">In a case study for Germany, we provided a proof-of-concept by applying this framework to a set of <inline-formula><mml:math id="M204" display="inline"><mml:mrow><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">13</mml:mn></mml:mrow></mml:math></inline-formula> 452 catchments smaller than 750 <inline-formula><mml:math id="M205" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">km</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>. For that purpose, we combined 23 <inline-formula><mml:math id="M206" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">years</mml:mi></mml:mrow></mml:math></inline-formula> of radar-based precipitation records with a Germany-wide flash flood model. By using the quantile skill score, we verified that the use of local counterfactuals improves the fit of GEV parameters for the vast majority of catchments. As expected, the value added by this approach increases with the return period of interest.</p>
      <p id="d2e3193">The main advantage of this approach the increased precision of the GEV return level estimates with much narrower confidence intervals. This is especially relevant for floods with return periods beyond the observational period. According to the Floods Directive of the European Union (2007/60/EC, <xref ref-type="bibr" rid="bib1.bibx18" id="altparen.48"/>), this is particularly relevant for floods of “medium probability” (<inline-formula><mml:math id="M207" display="inline"><mml:mrow><mml:mi>T</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M208" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">a</mml:mi></mml:mrow></mml:math></inline-formula>) and floods of low probability (which in Germany is defined as a flood with <inline-formula><mml:math id="M209" display="inline"><mml:mrow><mml:mi>T</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">200</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M210" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">a</mml:mi></mml:mrow></mml:math></inline-formula>). We could show that, across return periods, the the use of local counterfactuals improves GEV fitting, but does not lead to a systematic change of return levels across the entirety of investigated catchments.</p>
      <p id="d2e3239">The selection of the TD affects the quality GEV estimation when local counterfactuals are employed. We showed that the QSS decreased when HPEs were sampled from a distance of more than 30 <inline-formula><mml:math id="M211" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> away from the CoI. Still, the optimal definition of the TD will remain arbitrary and represents a subject for further research, as it represents an inherent trade-off: while an increasing distance allows us to sample from a larger variety of events and particularly from a larger choice of hydrologically similar catchments, an increasing distance will typically sample HPEs that are less representative for the meteorological processes that govern the CoI. As of now, the 30 <inline-formula><mml:math id="M212" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> radius remains a rather pragmatic choice and a compromise between these two requirements. In regions with high orographic gradients or highly heterogeneous rainfall patterns the proper size of the TD might have to be reduced or optimized in benchmark experiments similar to the one carried out in this study.</p>
      <p id="d2e3259">The practical application of our framework appears suitable for all contexts in which observational records are short in comparison to the return period required for a specific purpose, such as land use planning, design, or insurance. For such applications, we strongly recommend to use a hydrological model that is calibrated and validated for the local or regional conditions.</p>
</sec>

      
      </body>
    <back><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d2e3266">We published notebooks and code which demonstrate our hydrological model for a small, exemplary region (Altenahr basin): the derivation of GIUHs from a digital elevation model, the extraction of rainfall data from and effective rainfall for the subbasins from RADKLIM data and the modelling of quick runoff. The code is published at: <ext-link xlink:href="https://doi.org/10.5281/zenodo.10473424" ext-link-type="DOI">10.5281/zenodo.10473424</ext-link> <xref ref-type="bibr" rid="bib1.bibx49" id="paren.49"/>.</p>

      <p id="d2e3275">All data used in this study is accessible at the open data repository of the DWD: the RADKLIM_RW_2017.002 dataset is available at <uri>https://opendata.dwd.de/climate_environment/CDC/grids_germany/hourly/radolan/reproc/2017_002</uri> (last access: 4 May 2026, <xref ref-type="bibr" rid="bib1.bibx54" id="altparen.50"/>); the EU-DEM is available at <uri>https://ec.europa.eu/eurostat/web/gisco/geodata/digital-elevation-model/eu-dem#DD</uri> (last access: 4 May 2026, <xref ref-type="bibr" rid="bib1.bibx17" id="altparen.51"/>); the CLC5-2018 land cover data is available at <uri>https://gdz.bkg.bund.de/index.php/default/open-data/corine-land-cover-5-ha-stand-2018-clc5-2018.html</uri> (last access: 4 May 2026, <xref ref-type="bibr" rid="bib1.bibx7" id="altparen.52"/>). The soil data is available at <uri>https://www.bgr.bund.de/DE/Themen/Boden/Projekte/Flaechen_Rauminformationen_Boden/BUEK200/BUEK200.html?nn=869002</uri> (last access: 4 May 2026, <xref ref-type="bibr" rid="bib1.bibx6" id="altparen.53"/>). All data last accessed 27 June 2024.</p>
  </notes><app-group>
        <supplementary-material position="anchor"><p id="d2e3303">The supplement related to this article is available online at <inline-supplementary-material xlink:href="https://doi.org/10.5194/nhess-26-2189-2026-supplement" xlink:title="pdf">https://doi.org/10.5194/nhess-26-2189-2026-supplement</inline-supplementary-material>.</p></supplementary-material>
        </app-group><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e3312">PV, FF, and MH conceptualized this study. PV carried out the analysis, produced the figures and wrote the manuscript, with contributions from FF and MH.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e3318">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e3324">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><ack><title>Acknowledgements</title><p id="d2e3330">We would like to thank the open-source community; without its software and data this study would have not been possible. Some small parts of the text were improved in exchange with a language model (<uri>https://chat.openai.com/chat</uri>, last access: 7 October 2025).</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d2e3338">This research has been supported by the Bundesministerium für Forschung und Technologie (grant nos. 01LP2324B, 01LP2323H).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e3347">This paper was edited by Mihai Niculita and reviewed by three anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Abbasian et al.(2025)</label><mixed-citation>Abbasian, M., Wright, D. B., Notaro, M., Vavrus, S., and Vimont, D. J.: Flood frequency sampling error: insights from regional analysis, stochastic storm transposition, and physics-based modeling, J. Hydrol., 133802, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2025.133802" ext-link-type="DOI">10.1016/j.jhydrol.2025.133802</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Anusha and Maheswaran(2025)</label><mixed-citation>Anusha, G. S. and Maheswaran, R.: Quantitative assessment of automated threshold selection methods for Generalized Pareto Distribution for modelling precipitation extremes in the Indian subcontinent, J. Hydrol., 134166, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2025.134166" ext-link-type="DOI">10.1016/j.jhydrol.2025.134166</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Apel et al.(2016)</label><mixed-citation>Apel, H., Martínez Trepat, O., Hung, N. N., Chinh, D. T., Merz, B., and Dung, N. V.: Combined fluvial and pluvial urban flood hazard analysis: concept development and application to Can Tho city, Mekong Delta, Vietnam, Nat. Hazards Earth Syst. Sci., 16, 941–961, <ext-link xlink:href="https://doi.org/10.5194/nhess-16-941-2016" ext-link-type="DOI">10.5194/nhess-16-941-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Barredo(2007)</label><mixed-citation>Barredo, J. I.: Major flood disasters in Europe: 1950–2005, Nat. Hazards, 42, 125–148, <ext-link xlink:href="https://doi.org/10.1007/s11069-006-9065-2" ext-link-type="DOI">10.1007/s11069-006-9065-2</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Bentzien and Friederichs(2014)</label><mixed-citation>Bentzien, S. and Friederichs, P.: Decomposition and graphical portrayal of the quantile score, Q. J. Roy. Meteor. Soc.,  140, 1924–1934, <ext-link xlink:href="https://doi.org/10.1002/qj.2284" ext-link-type="DOI">10.1002/qj.2284</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>BGR(2018)</label><mixed-citation>BGR: BÜK200 V5.5, <uri>https://www.bgr.bund.de/DE/Themen/Boden/Projekte/Flaechen_Rauminformationen_Boden/BUEK200/BUEK200.html?nn=869002</uri> (last access: 4 May 2026), 2018.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>BKG(2018)</label><mixed-citation>BKG: CORINE CLC5-2018, <uri>https://gdz.bkg.bund.de/index.php/default/open-data/corine-land-cover-5-ha-stand-2018-clc5-2018.html</uri> (last access: 22 May 2023), 2018.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Borga et al.(2007)</label><mixed-citation>Borga, M., Boscolo, P., Zanon, F., and Sangati, M.: Hydrometeorological analysis of the 29 August 2003 flash flood in the Eastern Italian Alps, J. Hydrometeorol., 8, 1049–1067, <ext-link xlink:href="https://doi.org/10.1175/JHM593.1" ext-link-type="DOI">10.1175/JHM593.1</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Boughton(1989)</label><mixed-citation>Boughton, W.: A review of the USDA SCS curve number method, Aust. J. Soil Res., 27, 511–523, <ext-link xlink:href="https://doi.org/10.1071/SR9890511" ext-link-type="DOI">10.1071/SR9890511</ext-link>, 1989.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Bürger and Heistermann(2023)</label><mixed-citation>Bürger, G. and Heistermann, M.: Shallow and deep learning of extreme rainfall events from convective atmospheres, Nat. Hazards Earth Syst. Sci., 23, 3065–3077, <ext-link xlink:href="https://doi.org/10.5194/nhess-23-3065-2023" ext-link-type="DOI">10.5194/nhess-23-3065-2023</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Bürger and Heistermann(2025)</label><mixed-citation>Bürger, G. and Heistermann, M.: Present and future trends of extreme short-term rainfall events in Germany, by downscaling convective environments of ERA5 and a CMIP6 ensemble, EGUsphere [preprint],  <ext-link xlink:href="https://doi.org/10.5194/egusphere-2025-3584" ext-link-type="DOI">10.5194/egusphere-2025-3584</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Coles(2001)</label><mixed-citation> Coles, S.: An Introduction to Statistical Modeling of Extreme Values, Springer, London, ISBN  1852334592, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Cooley(2012)</label><mixed-citation>Cooley, D.: Return periods and return levels under climate change, in: Extremes in a Changing Climate: Detection, Analysis and Uncertainty, Springer, <ext-link xlink:href="https://doi.org/10.1007/978-94-007-4479-0_4" ext-link-type="DOI">10.1007/978-94-007-4479-0_4</ext-link>, 97–114, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>CRED/UCLouvain(2023)</label><mixed-citation>CRED/UCLouvain: EM-DAT International Disaster Database, <uri>http://www.emdat.be</uri> (last access: 25 January 2024), 2023.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>District and Morgan(1916)</label><mixed-citation> Miami Conservancy District and Morgan, A. E.: Exhibits to Accompany Report of the Chief Engineer, Arthur E. Morgan: Submitting a Plan for the Protection of the District from Flood Damage, Miami Conservancy District, ISBN-10 1018383476, 1916.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Emmanuel et al.(2017)</label><mixed-citation>Emmanuel, I., Payrastre, O., Andrieu, H., and Zuber, F.: A method for assessing the influence of rainfall spatial variability on hydrograph modeling. First case study in the Cevennes Region, southern France, J. Hydrol., 555, 314–322, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2017.10.011" ext-link-type="DOI">10.1016/j.jhydrol.2017.10.011</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>European Commission(2016)</label><mixed-citation>European Commission: Digital Elevation Model over Europe (EU-DEM), <uri>https://www.eea.europa.eu/en/datahub/datahubitem-view/d08852bc-7b5f-4835-a776-08362e2fbf4b?activeAccordion=735550#tab-metadata</uri> (last access: 2 October 2023), 2016.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>European Commission, Directorate-General for Environment(2013)</label><mixed-citation>European Commission, Directorate-General for Environment: A compilation of reporting sheets adopted by water directors common implementation strategy for the Water Framework Directive (2000/60/EC). Guidance document No 29, <uri>https://circabc.europa.eu/sd/a/acbcd98a-9540-480e-a876-420b7de64eba/Floods%2520Reporting%2520guidance%2520-%2520final_with%2520revised%2520paragraph%25204.2.3.pdf</uri>, (last access: 27 June 2024), 2013.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Falter et al.(2015)</label><mixed-citation>Falter, D., Schröter, K., Dung, N. V., Vorogushyn, S., Kreibich, H., Hundecha, Y., Apel, H., and Merz, B.: Spatially coherent flood risk assessment based on long-term continuous simulation with a coupled model chain, J. Hydrol., 524, 182–193, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2015.02.021" ext-link-type="DOI">10.1016/j.jhydrol.2015.02.021</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Fauer and Rust(2023)</label><mixed-citation>Fauer, F. S. and Rust, H. W.: Non-stationary large-scale statistics of precipitation extremes in central Europe, Stoch. Env. Res. Risk. A., 37, 4417–4429, <ext-link xlink:href="https://doi.org/10.1007/s00477-023-02515-z" ext-link-type="DOI">10.1007/s00477-023-02515-z</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Fisher and Tippett(1928)</label><mixed-citation>Fisher, R. A. and Tippett, L. H. C.: Limiting forms of the frequency distribution of the largest or smallest member of a sample, Math. Proc. Camb. Philos. Soc., 24, 180–190, <ext-link xlink:href="https://doi.org/10.1017/S0305004100015681" ext-link-type="DOI">10.1017/S0305004100015681</ext-link>, 1928.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Fontaine and Potter(1989)</label><mixed-citation>Fontaine, T. A. and Potter, K. W.: Estimating probabilities of extreme rainfalls, J. Hydraul. Eng., 115, 1562–1575, <ext-link xlink:href="https://doi.org/10.1061/(ASCE)0733-9429(1989)115:11(1562)" ext-link-type="DOI">10.1061/(ASCE)0733-9429(1989)115:11(1562)</ext-link>, 1989.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Fuller(1914)</label><mixed-citation> Fuller, W. E.: Flood flows, T. Am. Soc. Civ. Eng., 77, 564–617, 1914.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Gaume et al.(2004)</label><mixed-citation>Gaume, E., Livet, M., Desbordes, M., and Villeneuve, J.-P.: Hydrological analysis of the river Aude, France, flash flood on 12 and 13 November 1999, J. Hydrol., 286, 135–154, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2003.09.015" ext-link-type="DOI">10.1016/j.jhydrol.2003.09.015</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Gaume et al.(2010)</label><mixed-citation> Gaume, E., Gaál, L., Viglione, A., Szolgay, J., Kohnová, S., and Blöschl, G.: Bayesian MCMC approach to regional flood frequency analyses involving extraordinary flood events at ungauged sites, J. Hydrol., 394, 101–117, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Gnedenko(1943)</label><mixed-citation> Gnedenko, B. V.: Sur La Distribution Limite Du Terme Maximum D'Une Serie Aleatoire, Ann. Math., 44, 423–453, 1943.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Grimaldi et al.(2010)</label><mixed-citation>Grimaldi, S., Petroselli, A., Alonso, G., and Nardi, F.: Flow time estimation with spatially variable hillslope velocity in ungauged basins, Adv. Water Resour., 33, 1216–1223, <ext-link xlink:href="https://doi.org/10.1016/j.advwatres.2010.06.003" ext-link-type="DOI">10.1016/j.advwatres.2010.06.003</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Gumbel(1958)</label><mixed-citation>Gumbel, E. J.: Statistics of Extremes, Columbia University Press, <ext-link xlink:href="https://doi.org/10.7312/gumb92958" ext-link-type="DOI">10.7312/gumb92958</ext-link>, 1958.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Guse et al.(2010)</label><mixed-citation>Guse, B., Hofherr, Th., and Merz, B.: Introducing empirical and probabilistic regional envelope curves into a mixed bounded distribution function, Hydrol. Earth Syst. Sci., 14, 2465–2478, <ext-link xlink:href="https://doi.org/10.5194/hess-14-2465-2010" ext-link-type="DOI">10.5194/hess-14-2465-2010</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Halbert et al.(2016)</label><mixed-citation>Halbert, K., Nguyen, C. C., Payrastre, O., and Gaume, E.: Reducing uncertainty in flood frequency analyses: a comparison of local and regional approaches involving information on extreme historical floods, J. Hydrol., 541, 90–98, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2016.01.017" ext-link-type="DOI">10.1016/j.jhydrol.2016.01.017</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Hansen(1987)</label><mixed-citation>Hansen, E. M.: Probable maximum precipitation for design floods in the United States, J. Hydrol., 96, 267–278, <ext-link xlink:href="https://doi.org/10.1016/0022-1694(87)90158-2" ext-link-type="DOI">10.1016/0022-1694(87)90158-2</ext-link>, 1987.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Klemes(1993)</label><mixed-citation> Klemes, V.: Probability of extreme hydrometeorological events-a different approach, IAHS-AISH P., 213, pp. 167–176, 1993.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Lengfeld et al.(2019)</label><mixed-citation>Lengfeld, K., Winterrath, T., Junghänel, T., Hafer, M., and Becker, A.: Characteristic spatial extent of hourly and daily precipitation events in Germany derived from 16 years of radar data, Meteorol. Z., 28, 363–378, <ext-link xlink:href="https://doi.org//10.1127/metz/2019/0964" ext-link-type="DOI">/10.1127/metz/2019/0964</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Llasat et al.(2010)</label><mixed-citation>Llasat, M. C., Llasat-Botija, M., Prat, M. A., Porcú, F., Price, C., Mugnai, A., Lagouvardos, K., Kotroni, V., Katsanos, D., Michaelides, S., Yair, Y., Savvidou, K., and Nicolaides, K.: High-impact floods and flash floods in Mediterranean countries: the FLASH preliminary database, Adv. Geosci., 23, 47–55, <ext-link xlink:href="https://doi.org/10.5194/adgeo-23-47-2010" ext-link-type="DOI">10.5194/adgeo-23-47-2010</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Maidment et al.(1996)</label><mixed-citation>Maidment, D., Olivera, F., Calver, A., Eatherall, A., and Fraczek, W.: Unit hydrograph derived from a spatially distributed velocity field, Hydrol. Process., 10, 831–844, <ext-link xlink:href="https://doi.org/10.1002/(SICI)1099-1085(199606)10:6&lt;831::AID-HYP374&gt;3.0.CO;2-N" ext-link-type="DOI">10.1002/(SICI)1099-1085(199606)10:6&lt;831::AID-HYP374&gt;3.0.CO;2-N</ext-link>, 1996.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>Marchi et al.(2010)</label><mixed-citation>Marchi, L., Borga, M., Preciso, E., and Gaume, E.: Characterisation of selected extreme flash floods in Europe and implications for flood risk management, J. Hydrol., 394, 118–133, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2010.07.017" ext-link-type="DOI">10.1016/j.jhydrol.2010.07.017</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>Merz et al.(2022)</label><mixed-citation>Merz, B., Basso, S., Fischer, S., Lun, D., Blöschl, G., Merz, R., Guse, B., Viglione, A., Vorogushyn, S., Macdonald, E., Wietzke, L., and Schumann, A.: Understanding heavy tails of flood peak distributions, Water Resour. Res., 58, e2021WR030506, <ext-link xlink:href="https://doi.org/10.1029/2021WR030506" ext-link-type="DOI">10.1029/2021WR030506</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx38"><label>Merz et al.(2024)</label><mixed-citation>Merz, B., Nguyen, V. D., Guse, B., Han, L., Guan, X., Rakovec, O., Samaniego, L., Ahrens, B., and Vorogushyn, S.: Spatial counterfactuals to explore disastrous flooding, Environ. Res. Lett., <ext-link xlink:href="https://doi.org/10.1088/1748-9326/ad22b9" ext-link-type="DOI">10.1088/1748-9326/ad22b9</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Merz and Blöschl(2008)</label><mixed-citation>Merz, R. and Blöschl, G.: Flood frequency hydrology: 1. Temporal, spatial, and causal expansion of information, Water Resour. Res., 44, <ext-link xlink:href="https://doi.org/10.1029/2007WR006744" ext-link-type="DOI">10.1029/2007WR006744</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx40"><label>Milly et al.(2008)</label><mixed-citation>Milly, P. C. D., Betancourt, J., Falkenmark, M., Hirsch, R. M., Kundzewicz, Z. W., Lettenmaier, D. P., and Stouffer, R. J.: Stationarity is dead: whither water management?, Science, 319, 573–574, <ext-link xlink:href="https://doi.org/10.1126/science.1151915" ext-link-type="DOI">10.1126/science.1151915</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx41"><label>Montanari et al.(2024)</label><mixed-citation>Montanari, A., Merz, B., and Blöschl, G.: HESS Opinions: The sword of Damocles of the impossible flood, Hydrol. Earth Syst. Sci., 28, 2603–2615, <ext-link xlink:href="https://doi.org/10.5194/hess-28-2603-2024" ext-link-type="DOI">10.5194/hess-28-2603-2024</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx42"><label>Morrison and Smith(2002)</label><mixed-citation> Morrison, J. E. and Smith, J. A.: Stochastic modeling of flood peaks using the generalized extreme value distribution, Water Resour. Res., 38, 41–1, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx43"><label>Nguyen et al.(2014)</label><mixed-citation>Nguyen, C. C., Gaume, E., and Payrastre, O.: Regional flood frequency analyses involving extraordinary flood events at ungauged sites: further developments and validations, J. Hydrol., 508, 385–396, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2013.09.058" ext-link-type="DOI">10.1016/j.jhydrol.2013.09.058</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx44"><label>Petrucci et al.(2019)</label><mixed-citation>Petrucci, O., Aceto, L., Bianchi, C., Bigot, V., Brázdil, R., Pereira, S., Kahraman, A., Kılıç, Ö., Kotroni, V., Llasat, M. C., Llasat-Botija, M., Papagiannaki, K., Pasqua, A. A., Řehoř, J., Geli, J. R., Salvati, P., Vinet, F., and Zêzere, J. L.: Flood fatalities in Europe, 1980–2018: variability, features, and lessons to learn, Water-Sui., 11, 1682, <ext-link xlink:href="https://doi.org/10.3390/w11081682" ext-link-type="DOI">10.3390/w11081682</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx45"><label>Seibert et al.(2020)</label><mixed-citation>Seibert, S. P., Auerswald, K., Seibert, S. P., and Auerswald, K.: Abflussentstehung–wie aus Niederschlag Abfluss wird, Hochwasserminderung im ländlichen Raum: Ein Handbuch zur quantitativen Planung, <ext-link xlink:href="https://doi.org/10.1007/978-3-662-61033-6_4" ext-link-type="DOI">10.1007/978-3-662-61033-6_4</ext-link>, 61–93, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx46"><label>Thompson et al.(2025)</label><mixed-citation>Thompson, V., Coumou, D., Beyerle, U., Ommer, J., Cloke, H. L., and Fischer, E.: Alternative rainfall storylines for the Western European July 2021 floods from ensemble boosting, Communications Earth and Environment, 6, 427, <ext-link xlink:href="https://doi.org/10.1038/s43247-025-02386-y" ext-link-type="DOI">10.1038/s43247-025-02386-y</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx47"><label>U.S. Department of Agriculture-Soil Conservation Service(1972)</label><mixed-citation>U.S. Department of Agriculture-Soil Conservation Service: Estimation of Direct Runoff From Storm Rainfall, SCS National Engineering Handbook, Section 4, Hydrology. Chap. 10, <uri>https://lmpublicsearch.lm.doe.gov/SiteDocs/111673.pdf</uri> (last access: 4 May 2026), 1972.</mixed-citation></ref>
      <ref id="bib1.bibx48"><label>Virtanen et al.(2020)</label><mixed-citation>Virtanen, P., Gommers, R., Oliphant, T. E., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright, J., van der Walt, S. J., Brett, M., Wilson, J., Millman, K. J., Mayorov, N., Nelson, A. R. J., Jones, E., Kern, R., Larson, E., Carey, C. J., Polat, İ., Feng, Y., Moore, E. W., VanderPlas, J., Laxalde, D., Perktold, J., Cimrman, R., Henriksen, I., Quintero, E. A., Harris, C. R., Archibald, A. M., Ribeiro, A. H., Pedregosa, F., van Mulbregt, P., and SciPy 1.0 Contributors: SciPy 1.0: fundamental algorithms for scientific computing in Python, Nat. Methods, 17, 261–272, <ext-link xlink:href="https://doi.org/10.1038/s41592-019-0686-2" ext-link-type="DOI">10.1038/s41592-019-0686-2</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx49"><label>Voit(2024)</label><mixed-citation>Voit, P.: A downward counterfactual analysis of flash floods in Germany – Code repository (v0.1), Zenodo [code], <ext-link xlink:href="https://doi.org/10.5281/zenodo.10473424" ext-link-type="DOI">10.5281/zenodo.10473424</ext-link>, last access: 15 August 2024.</mixed-citation></ref>
      <ref id="bib1.bibx50"><label>Voit and Heistermann(2024a)</label><mixed-citation>Voit, P. and Heistermann, M.: Brief communication: Stay local or go global? On the construction of plausible counterfactual scenarios to assess flash flood hazards, Nat. Hazards Earth Syst. Sci., 24, 4609–4615, <ext-link xlink:href="https://doi.org/10.5194/nhess-24-4609-2024" ext-link-type="DOI">10.5194/nhess-24-4609-2024</ext-link>, 2024a.</mixed-citation></ref>
      <ref id="bib1.bibx51"><label>Voit and Heistermann(2024b)</label><mixed-citation>Voit, P. and Heistermann, M.: A downward-counterfactual analysis of flash floods in Germany, Nat. Hazards Earth Syst. Sci., 24, 2147–2164, <ext-link xlink:href="https://doi.org/10.5194/nhess-24-2147-2024" ext-link-type="DOI">10.5194/nhess-24-2147-2024</ext-link>, 2024b.</mixed-citation></ref>
      <ref id="bib1.bibx52"><label>Vorogushyn et al.(2024)</label><mixed-citation>Vorogushyn, S., Han, L., Apel, H., Nguyen, V. D., Guse, B., Guan, X., Rakovec, O., Najafi, H., Samaniego, L., and Merz, B.: It could have been much worse: spatial counterfactuals of the July 2021 flood in the Ahr Valley, Germany, Nat. Hazards Earth Syst. Sci., 25, 2007–2029, <ext-link xlink:href="https://doi.org/10.5194/nhess-25-2007-2025" ext-link-type="DOI">10.5194/nhess-25-2007-2025</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx53"><label>Winterrath et al.(2012)</label><mixed-citation> Winterrath, T., Rosenow, W., and Weigl, E.: On the DWD quantitative precipitation analysis and nowcasting system for real-time application in German flood risk management, weather radar and hydrology, IAHS-AISH P., 351, 323–329, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx54"><label>Winterrath et al.(2018)</label><mixed-citation>Winterrath, T., Brendel, C., Hafer, M., Junghänel, T., Klameth, A., Lengfeld, K., Walawender, E., Weigl, E., and Becker, A.: Gauge-adjusted one-hour precipitation sum (RW):, RADKLIM Version 2017.002: Reprocessed gauge-adjusted radar data, one-hour precipitation sums (RW), Deutscher Wetterdienst (DWD)/German Weather Service [data set], <ext-link xlink:href="https://doi.org//10.5676/DWD/RADKLIM_RW_V2017.002" ext-link-type="DOI">/10.5676/DWD/RADKLIM_RW_V2017.002</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx55"><label>WMO(2009)</label><mixed-citation>WMO: Manual on estimation of probable maximum precipitation (PMP), <uri>https://library.wmo.int/viewer/35708/?offset=#page=1&amp;viewer=picture&amp;o=bookmarks&amp;n=0&amp;q=</uri>, (last access: 18 September 2024), 2009.</mixed-citation></ref>
      <ref id="bib1.bibx56"><label>Wright et al.(2014)</label><mixed-citation> Wright, D. B., Smith, J. A., and Baeck, M. L.: Flood frequency analysis using radar rainfall fields and stochastic storm transposition, Water Resour. Res., 50, 1592–1615, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx57"><label>Wright et al.(2017)</label><mixed-citation> Wright, D. B., Mantilla, R., and Peters-Lidard, C. D.: A remote sensing-based tool for assessing rainfall-driven hazards, Environ. Modell. Softw., 90, 34–54, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx58"><label>Wright et al.(2020)</label><mixed-citation>Wright, D. B., Yu, G., and England, J. F.: Six decades of rainfall and flood frequency analysis using stochastic storm transposition: review, progress, and prospects, J. Hydrol., 585, <ext-link xlink:href="https://doi.org/10.1016/j.jhydrol.2020.124816" ext-link-type="DOI">10.1016/j.jhydrol.2020.124816</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx59"><label>Zhou et al.(2019)</label><mixed-citation>Zhou, Z., Smith, J. A., Wright, D. B., Baeck, M. L., Yang, L., and Liu, S.: Storm catalog-based analysis of rainfall heterogeneity and frequency in a complex terrain, Water Resour. Res., 55, 1871–1889, <ext-link xlink:href="https://doi.org/10.1029/2018WR023567" ext-link-type="DOI">10.1029/2018WR023567</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx60"><label>Zhou et al.(2021)</label><mixed-citation>Zhou, Z., Smith, J. A., Baeck, M. L., Wright, D. B., Smith, B. K., and Liu, S.: The impact of the spatiotemporal structure of rainfall on flood frequency over a small urban watershed: an approach coupling stochastic storm transposition and hydrologic modeling, Hydrol. Earth Syst. Sci., 25, 4701–4717, <ext-link xlink:href="https://doi.org/10.5194/hess-25-4701-2021" ext-link-type="DOI">10.5194/hess-25-4701-2021</ext-link>, 2021.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Considering rainfall events from a neighborhood improves local flood frequency analysis</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Abbasian et al.(2025)</label><mixed-citation>
       Abbasian, M., Wright, D. B., Notaro, M., Vavrus, S., and Vimont, D. J.: Flood frequency sampling error: insights from regional analysis, stochastic storm transposition, and physics-based modeling, J. Hydrol., 133802, <a href="https://doi.org/10.1016/j.jhydrol.2025.133802" target="_blank">https://doi.org/10.1016/j.jhydrol.2025.133802</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Anusha and Maheswaran(2025)</label><mixed-citation>
      
Anusha, G. S. and Maheswaran, R.: Quantitative assessment of automated
threshold selection methods for Generalized Pareto Distribution for modelling
precipitation extremes in the Indian subcontinent, J. Hydrol.,
134166, <a href="https://doi.org/10.1016/j.jhydrol.2025.134166" target="_blank">https://doi.org/10.1016/j.jhydrol.2025.134166</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Apel et al.(2016)</label><mixed-citation>
       Apel, H., Martínez Trepat, O., Hung, N. N., Chinh, D. T., Merz, B., and Dung, N. V.: Combined fluvial and pluvial urban flood hazard analysis: concept development and application to Can Tho city, Mekong Delta, Vietnam, Nat. Hazards Earth Syst. Sci., 16, 941–961, <a href="https://doi.org/10.5194/nhess-16-941-2016" target="_blank">https://doi.org/10.5194/nhess-16-941-2016</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Barredo(2007)</label><mixed-citation>
       Barredo, J. I.: Major flood disasters in Europe: 1950–2005, Nat. Hazards, 42, 125–148, <a href="https://doi.org/10.1007/s11069-006-9065-2" target="_blank">https://doi.org/10.1007/s11069-006-9065-2</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Bentzien and Friederichs(2014)</label><mixed-citation>
       Bentzien, S. and Friederichs, P.: Decomposition and graphical portrayal of the quantile score, Q. J. Roy. Meteor. Soc.,  140, 1924–1934, <a href="https://doi.org/10.1002/qj.2284" target="_blank">https://doi.org/10.1002/qj.2284</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>BGR(2018)</label><mixed-citation>
       BGR: BÜK200 V5.5, <a href="https://www.bgr.bund.de/DE/Themen/Boden/Projekte/Flaechen_Rauminformationen_Boden/BUEK200/BUEK200.html?nn=869002" target="_blank"/> (last access: 4 May 2026), 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>BKG(2018)</label><mixed-citation>
       BKG: CORINE CLC5-2018, <a href="https://gdz.bkg.bund.de/index.php/default/open-data/corine-land-cover-5-ha-stand-2018-clc5-2018.html" target="_blank"/> (last access: 22 May 2023), 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Borga et al.(2007)</label><mixed-citation>
       Borga, M., Boscolo, P., Zanon, F., and Sangati, M.: Hydrometeorological analysis of the 29 August 2003 flash flood in the Eastern Italian Alps, J. Hydrometeorol., 8, 1049–1067, <a href="https://doi.org/10.1175/JHM593.1" target="_blank">https://doi.org/10.1175/JHM593.1</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Boughton(1989)</label><mixed-citation>
       Boughton, W.: A review of the USDA SCS curve number method, Aust. J. Soil Res., 27, 511–523, <a href="https://doi.org/10.1071/SR9890511" target="_blank">https://doi.org/10.1071/SR9890511</a>, 1989.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Bürger and Heistermann(2023)</label><mixed-citation>
       Bürger, G. and Heistermann, M.: Shallow and deep learning of extreme rainfall events from convective atmospheres, Nat. Hazards Earth Syst. Sci., 23, 3065–3077, <a href="https://doi.org/10.5194/nhess-23-3065-2023" target="_blank">https://doi.org/10.5194/nhess-23-3065-2023</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Bürger and Heistermann(2025)</label><mixed-citation>
       Bürger, G. and Heistermann, M.: Present and future trends of extreme short-term rainfall events in Germany, by downscaling convective environments of ERA5 and a CMIP6 ensemble, EGUsphere [preprint],  <a href="https://doi.org/10.5194/egusphere-2025-3584" target="_blank">https://doi.org/10.5194/egusphere-2025-3584</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Coles(2001)</label><mixed-citation>
       Coles, S.: An Introduction to Statistical Modeling of Extreme Values, Springer, London, ISBN  1852334592, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Cooley(2012)</label><mixed-citation>
       Cooley, D.: Return periods and return levels under climate change, in: Extremes in a Changing Climate: Detection, Analysis and Uncertainty, Springer, <a href="https://doi.org/10.1007/978-94-007-4479-0_4" target="_blank">https://doi.org/10.1007/978-94-007-4479-0_4</a>, 97–114, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>CRED/UCLouvain(2023)</label><mixed-citation>
       CRED/UCLouvain: EM-DAT International Disaster Database, <a href="http://www.emdat.be" target="_blank"/> (last access: 25 January 2024), 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>District and Morgan(1916)</label><mixed-citation>
       Miami Conservancy District and Morgan, A. E.: Exhibits to Accompany Report of the Chief Engineer, Arthur E. Morgan: Submitting a Plan for the Protection of the District from Flood Damage, Miami Conservancy District, ISBN-10 1018383476, 1916.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Emmanuel et al.(2017)</label><mixed-citation>
       Emmanuel, I., Payrastre, O., Andrieu, H., and Zuber, F.: A method for assessing the influence of rainfall spatial variability on hydrograph modeling. First case study in the Cevennes Region, southern France, J. Hydrol., 555, 314–322, <a href="https://doi.org/10.1016/j.jhydrol.2017.10.011" target="_blank">https://doi.org/10.1016/j.jhydrol.2017.10.011</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>European Commission(2016)</label><mixed-citation>
       European Commission: Digital Elevation Model over Europe (EU-DEM), <a href="https://www.eea.europa.eu/en/datahub/datahubitem-view/d08852bc-7b5f-4835-a776-08362e2fbf4b?activeAccordion=735550#tab-metadata" target="_blank"/> (last access: 2 October 2023), 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>European Commission, Directorate-General for Environment(2013)</label><mixed-citation>
       European Commission, Directorate-General for Environment: A compilation of reporting sheets adopted by water directors common implementation strategy for the Water Framework Directive (2000/60/EC). Guidance document No 29, <a href="https://circabc.europa.eu/sd/a/acbcd98a-9540-480e-a876-420b7de64eba/Floods%2520Reporting%2520guidance%2520-%2520final_with%2520revised%2520paragraph%25204.2.3.pdf" target="_blank"/>, (last access: 27 June 2024), 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Falter et al.(2015)</label><mixed-citation>
       Falter, D., Schröter, K., Dung, N. V., Vorogushyn, S., Kreibich, H., Hundecha, Y., Apel, H., and Merz, B.: Spatially coherent flood risk assessment based on long-term continuous simulation with a coupled model chain, J. Hydrol., 524, 182–193, <a href="https://doi.org/10.1016/j.jhydrol.2015.02.021" target="_blank">https://doi.org/10.1016/j.jhydrol.2015.02.021</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Fauer and Rust(2023)</label><mixed-citation>
       Fauer, F. S. and Rust, H. W.: Non-stationary large-scale statistics of precipitation extremes in central Europe, Stoch. Env. Res. Risk. A., 37, 4417–4429, <a href="https://doi.org/10.1007/s00477-023-02515-z" target="_blank">https://doi.org/10.1007/s00477-023-02515-z</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Fisher and Tippett(1928)</label><mixed-citation>
       Fisher, R. A. and Tippett, L. H. C.: Limiting forms of the frequency distribution of the largest or smallest member of a sample, Math. Proc. Camb. Philos. Soc., 24, 180–190, <a href="https://doi.org/10.1017/S0305004100015681" target="_blank">https://doi.org/10.1017/S0305004100015681</a>, 1928.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Fontaine and Potter(1989)</label><mixed-citation>
       Fontaine, T. A. and Potter, K. W.: Estimating probabilities of extreme rainfalls, J. Hydraul. Eng., 115, 1562–1575, <a href="https://doi.org/10.1061/(ASCE)0733-9429(1989)115:11(1562)" target="_blank">https://doi.org/10.1061/(ASCE)0733-9429(1989)115:11(1562)</a>, 1989.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Fuller(1914)</label><mixed-citation>
       Fuller, W. E.: Flood flows, T. Am. Soc. Civ. Eng., 77, 564–617, 1914.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Gaume et al.(2004)</label><mixed-citation>
       Gaume, E., Livet, M., Desbordes, M., and Villeneuve, J.-P.: Hydrological analysis of the river Aude, France, flash flood on 12 and 13 November 1999, J. Hydrol., 286, 135–154, <a href="https://doi.org/10.1016/j.jhydrol.2003.09.015" target="_blank">https://doi.org/10.1016/j.jhydrol.2003.09.015</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Gaume et al.(2010)</label><mixed-citation>
       Gaume, E., Gaál, L., Viglione, A., Szolgay, J., Kohnová, S., and Blöschl, G.: Bayesian MCMC approach to regional flood frequency analyses involving extraordinary flood events at ungauged sites, J. Hydrol., 394, 101–117, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Gnedenko(1943)</label><mixed-citation>
       Gnedenko, B. V.: Sur La Distribution Limite Du Terme Maximum D'Une Serie Aleatoire, Ann. Math., 44, 423–453, 1943.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Grimaldi et al.(2010)</label><mixed-citation>
       Grimaldi, S., Petroselli, A., Alonso, G., and Nardi, F.: Flow time estimation with spatially variable hillslope velocity in ungauged basins, Adv. Water Resour., 33, 1216–1223, <a href="https://doi.org/10.1016/j.advwatres.2010.06.003" target="_blank">https://doi.org/10.1016/j.advwatres.2010.06.003</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Gumbel(1958)</label><mixed-citation>
       Gumbel, E. J.: Statistics of Extremes, Columbia University Press, <a href="https://doi.org/10.7312/gumb92958" target="_blank">https://doi.org/10.7312/gumb92958</a>, 1958.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Guse et al.(2010)</label><mixed-citation>
       Guse, B., Hofherr, Th., and Merz, B.: Introducing empirical and probabilistic regional envelope curves into a mixed bounded distribution function, Hydrol. Earth Syst. Sci., 14, 2465–2478, <a href="https://doi.org/10.5194/hess-14-2465-2010" target="_blank">https://doi.org/10.5194/hess-14-2465-2010</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Halbert et al.(2016)</label><mixed-citation>
       Halbert, K., Nguyen, C. C., Payrastre, O., and Gaume, E.: Reducing uncertainty in flood frequency analyses: a comparison of local and regional approaches involving information on extreme historical floods, J. Hydrol., 541, 90–98, <a href="https://doi.org/10.1016/j.jhydrol.2016.01.017" target="_blank">https://doi.org/10.1016/j.jhydrol.2016.01.017</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Hansen(1987)</label><mixed-citation>
       Hansen, E. M.: Probable maximum precipitation for design floods in the United States, J. Hydrol., 96, 267–278, <a href="https://doi.org/10.1016/0022-1694(87)90158-2" target="_blank">https://doi.org/10.1016/0022-1694(87)90158-2</a>, 1987.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Klemes(1993)</label><mixed-citation>
       Klemes, V.: Probability of extreme hydrometeorological events-a different approach, IAHS-AISH P., 213, pp. 167–176, 1993.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Lengfeld et al.(2019)</label><mixed-citation>
       Lengfeld, K., Winterrath, T., Junghänel, T., Hafer, M., and Becker, A.: Characteristic spatial extent of hourly and daily precipitation events in Germany derived from 16 years of radar data, Meteorol. Z., 28, 363–378, <a href="https://doi.org//10.1127/metz/2019/0964" target="_blank">https://doi.org//10.1127/metz/2019/0964</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Llasat et al.(2010)</label><mixed-citation>
       Llasat, M. C., Llasat-Botija, M., Prat, M. A., Porcú, F., Price, C., Mugnai, A., Lagouvardos, K., Kotroni, V., Katsanos, D., Michaelides, S., Yair, Y., Savvidou, K., and Nicolaides, K.: High-impact floods and flash floods in Mediterranean countries: the FLASH preliminary database, Adv. Geosci., 23, 47–55, <a href="https://doi.org/10.5194/adgeo-23-47-2010" target="_blank">https://doi.org/10.5194/adgeo-23-47-2010</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Maidment et al.(1996)</label><mixed-citation>
       Maidment, D., Olivera, F., Calver, A., Eatherall, A., and Fraczek, W.: Unit hydrograph derived from a spatially distributed velocity field, Hydrol. Process., 10, 831–844, <a href="https://doi.org/10.1002/(SICI)1099-1085(199606)10:6&lt;831::AID-HYP374&gt;3.0.CO;2-N" target="_blank">https://doi.org/10.1002/(SICI)1099-1085(199606)10:6&lt;831::AID-HYP374&gt;3.0.CO;2-N</a>, 1996.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Marchi et al.(2010)</label><mixed-citation>
       Marchi, L., Borga, M., Preciso, E., and Gaume, E.: Characterisation of selected extreme flash floods in Europe and implications for flood risk management, J. Hydrol., 394, 118–133, <a href="https://doi.org/10.1016/j.jhydrol.2010.07.017" target="_blank">https://doi.org/10.1016/j.jhydrol.2010.07.017</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Merz et al.(2022)</label><mixed-citation>
       Merz, B., Basso, S., Fischer, S., Lun, D., Blöschl, G., Merz, R., Guse, B., Viglione, A., Vorogushyn, S., Macdonald, E., Wietzke, L., and Schumann, A.: Understanding heavy tails of flood peak distributions, Water Resour. Res., 58, e2021WR030506, <a href="https://doi.org/10.1029/2021WR030506" target="_blank">https://doi.org/10.1029/2021WR030506</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Merz et al.(2024)</label><mixed-citation>
       Merz, B., Nguyen, V. D., Guse, B., Han, L., Guan, X., Rakovec, O., Samaniego, L., Ahrens, B., and Vorogushyn, S.: Spatial counterfactuals to explore disastrous flooding, Environ. Res. Lett., <a href="https://doi.org/10.1088/1748-9326/ad22b9" target="_blank">https://doi.org/10.1088/1748-9326/ad22b9</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Merz and Blöschl(2008)</label><mixed-citation>
       Merz, R. and Blöschl, G.: Flood frequency hydrology: 1. Temporal, spatial, and causal expansion of information, Water Resour. Res., 44, <a href="https://doi.org/10.1029/2007WR006744" target="_blank">https://doi.org/10.1029/2007WR006744</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Milly et al.(2008)</label><mixed-citation>
       Milly, P. C. D., Betancourt, J., Falkenmark, M., Hirsch, R. M., Kundzewicz, Z. W., Lettenmaier, D. P., and Stouffer, R. J.: Stationarity is dead: whither water management?, Science, 319, 573–574, <a href="https://doi.org/10.1126/science.1151915" target="_blank">https://doi.org/10.1126/science.1151915</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Montanari et al.(2024)</label><mixed-citation>
      
Montanari, A., Merz, B., and Blöschl, G.: HESS Opinions: The sword of Damocles of the impossible flood, Hydrol. Earth Syst. Sci., 28, 2603–2615, <a href="https://doi.org/10.5194/hess-28-2603-2024" target="_blank">https://doi.org/10.5194/hess-28-2603-2024</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Morrison and Smith(2002)</label><mixed-citation>
       Morrison, J. E. and Smith, J. A.: Stochastic modeling of flood peaks using the generalized extreme value distribution, Water Resour. Res., 38, 41–1, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Nguyen et al.(2014)</label><mixed-citation>
       Nguyen, C. C., Gaume, E., and Payrastre, O.: Regional flood frequency analyses involving extraordinary flood events at ungauged sites: further developments and validations, J. Hydrol., 508, 385–396, <a href="https://doi.org/10.1016/j.jhydrol.2013.09.058" target="_blank">https://doi.org/10.1016/j.jhydrol.2013.09.058</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Petrucci et al.(2019)</label><mixed-citation>
       Petrucci, O., Aceto, L., Bianchi, C., Bigot, V., Brázdil, R., Pereira, S., Kahraman, A., Kılıç, Ö., Kotroni, V., Llasat, M. C., Llasat-Botija, M., Papagiannaki, K.,
Pasqua, A. A., Řehoř, J., Geli, J. R., Salvati, P., Vinet, F., and
Zêzere, J. L.: Flood fatalities in Europe, 1980–2018: variability, features, and lessons to learn, Water-Sui., 11, 1682, <a href="https://doi.org/10.3390/w11081682" target="_blank">https://doi.org/10.3390/w11081682</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Seibert et al.(2020)</label><mixed-citation>
       Seibert, S. P., Auerswald, K., Seibert, S. P., and Auerswald, K.: Abflussentstehung–wie aus Niederschlag Abfluss wird, Hochwasserminderung im ländlichen Raum: Ein Handbuch zur quantitativen Planung, <a href="https://doi.org/10.1007/978-3-662-61033-6_4" target="_blank">https://doi.org/10.1007/978-3-662-61033-6_4</a>, 61–93, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Thompson et al.(2025)</label><mixed-citation>
       Thompson, V., Coumou, D., Beyerle, U., Ommer, J., Cloke, H. L., and Fischer, E.: Alternative rainfall storylines for the Western European July 2021 floods from ensemble boosting, Communications Earth and Environment, 6, 427, <a href="https://doi.org/10.1038/s43247-025-02386-y" target="_blank">https://doi.org/10.1038/s43247-025-02386-y</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>U.S. Department of Agriculture-Soil Conservation Service(1972)</label><mixed-citation>
       U.S. Department of Agriculture-Soil Conservation Service: Estimation of Direct Runoff From Storm Rainfall, SCS National Engineering Handbook, Section 4, Hydrology. Chap. 10, <a href="https://lmpublicsearch.lm.doe.gov/SiteDocs/111673.pdf" target="_blank"/> (last access: 4 May 2026), 1972.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Virtanen et al.(2020)</label><mixed-citation>
       Virtanen, P., Gommers, R., Oliphant, T. E., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright, J., van der Walt, S. J., Brett, M., Wilson, J., Millman, K. J., Mayorov, N., Nelson, A. R. J., Jones, E., Kern, R., Larson, E., Carey, C. J., Polat, İ., Feng, Y., Moore, E. W., VanderPlas, J., Laxalde, D., Perktold, J., Cimrman, R., Henriksen, I., Quintero, E. A., Harris, C. R., Archibald, A. M., Ribeiro, A. H., Pedregosa, F., van Mulbregt, P., and SciPy 1.0 Contributors: SciPy 1.0: fundamental algorithms for scientific computing in Python, Nat. Methods, 17, 261–272, <a href="https://doi.org/10.1038/s41592-019-0686-2" target="_blank">https://doi.org/10.1038/s41592-019-0686-2</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Voit(2024)</label><mixed-citation>
       Voit, P.: A downward counterfactual analysis of flash floods in Germany – Code repository (v0.1), Zenodo [code], <a href="https://doi.org/10.5281/zenodo.10473424" target="_blank">https://doi.org/10.5281/zenodo.10473424</a>, last access: 15 August 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>Voit and Heistermann(2024a)</label><mixed-citation>
       Voit,
P. and Heistermann, M.: Brief communication: Stay local or go global? On the
construction of plausible counterfactual scenarios to assess flash flood
hazards, Nat. Hazards Earth Syst. Sci., 24, 4609–4615,
<a href="https://doi.org/10.5194/nhess-24-4609-2024" target="_blank">https://doi.org/10.5194/nhess-24-4609-2024</a>, 2024a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>Voit and Heistermann(2024b)</label><mixed-citation>
       Voit, P. and Heistermann, M.: A downward-counterfactual analysis of flash floods in Germany, Nat. Hazards Earth Syst. Sci., 24, 2147–2164, <a href="https://doi.org/10.5194/nhess-24-2147-2024" target="_blank">https://doi.org/10.5194/nhess-24-2147-2024</a>, 2024b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>Vorogushyn et al.(2024)</label><mixed-citation>
       Vorogushyn, S., Han, L., Apel, H., Nguyen, V. D., Guse, B., Guan, X., Rakovec, O., Najafi, H., Samaniego, L., and Merz, B.: It could have been much worse: spatial counterfactuals of the July 2021 flood in the Ahr Valley, Germany, Nat. Hazards Earth Syst. Sci., 25, 2007–2029, <a href="https://doi.org/10.5194/nhess-25-2007-2025" target="_blank">https://doi.org/10.5194/nhess-25-2007-2025</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>Winterrath et al.(2012)</label><mixed-citation>
       Winterrath, T.,
Rosenow, W., and Weigl, E.: On the DWD quantitative precipitation analysis and
nowcasting system for real-time application in German flood risk management,
weather radar and hydrology, IAHS-AISH P., 351, 323–329,
2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>Winterrath et al.(2018)</label><mixed-citation>
       Winterrath, T., Brendel, C.,
Hafer, M., Junghänel, T., Klameth, A., Lengfeld, K., Walawender, E.,
Weigl, E., and Becker, A.: Gauge-adjusted one-hour precipitation sum (RW):,
RADKLIM Version 2017.002: Reprocessed gauge-adjusted radar data, one-hour
precipitation sums (RW), Deutscher Wetterdienst (DWD)/German Weather Service [data set], <a href="https://doi.org//10.5676/DWD/RADKLIM_RW_V2017.002" target="_blank">https://doi.org//10.5676/DWD/RADKLIM_RW_V2017.002</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>WMO(2009)</label><mixed-citation>
       WMO: Manual on estimation of probable maximum precipitation (PMP), <a href="https://library.wmo.int/viewer/35708/?offset=#page=1&amp;viewer=picture&amp;o=bookmarks&amp;n=0&amp;q=" target="_blank"/>, (last access: 18 September 2024), 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>Wright et al.(2014)</label><mixed-citation>
       Wright, D. B., Smith, J. A., and Baeck, M. L.: Flood frequency analysis using radar rainfall fields and stochastic storm transposition, Water Resour. Res., 50, 1592–1615, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>Wright et al.(2017)</label><mixed-citation>
       Wright, D. B., Mantilla, R., and Peters-Lidard, C. D.: A remote sensing-based tool for assessing rainfall-driven hazards, Environ. Modell. Softw., 90, 34–54, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>Wright et al.(2020)</label><mixed-citation>
       Wright, D. B., Yu, G., and England, J. F.: Six decades of rainfall and flood frequency analysis using stochastic storm transposition: review, progress, and prospects, J. Hydrol., 585, <a href="https://doi.org/10.1016/j.jhydrol.2020.124816" target="_blank">https://doi.org/10.1016/j.jhydrol.2020.124816</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>Zhou et al.(2019)</label><mixed-citation>
       Zhou, Z., Smith, J. A., Wright, D. B., Baeck, M. L., Yang, L., and Liu, S.: Storm catalog-based analysis of rainfall heterogeneity and frequency in a complex terrain, Water Resour. Res., 55, 1871–1889, <a href="https://doi.org/10.1029/2018WR023567" target="_blank">https://doi.org/10.1029/2018WR023567</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>Zhou et al.(2021)</label><mixed-citation>
       Zhou, Z., Smith, J. A., Baeck, M. L., Wright, D. B., Smith, B. K., and Liu, S.: The impact of the spatiotemporal structure of rainfall on flood frequency over a small urban watershed: an approach coupling stochastic storm transposition and hydrologic modeling, Hydrol. Earth Syst. Sci., 25, 4701–4717, <a href="https://doi.org/10.5194/hess-25-4701-2021" target="_blank">https://doi.org/10.5194/hess-25-4701-2021</a>, 2021.

    </mixed-citation></ref-html>--></article>
