Global riverine flood risk – how do hydrogeomorphic floodplain maps compare to flood hazard maps?

Riverine flood risk studies often require the identification of areas prone to potential flooding. This modelling process can be based on either (hydrologically-derived) flood hazard maps or (topography-based) hydrogeomorphic floodplain maps. In this paper, we derive and compare riverine flood exposure from three global products: a hydrogeomorphic floodplain map (GFPLAIN) and two flood hazard maps (JRC and GAR). We find an average spatial agreement between these maps of around 15 30 % at river basin level on a global scale. This agreement is highly variable across model combinations and geographic conditions, influenced by climatic humidity, river volume, topography, and coastal proximity. Contrary to expectations, the agreement between the two flood hazard maps is lower compared to their agreement with the hydrogeomorphic floodplain map. We also map riverine flood exposure for 26 countries across the Global South, by intersecting these maps with three human population maps (GHS, HRSL and WorldPop). The findings of this study indicate that hydrogeomorphic floodplain 20 maps can be a valuable way of producing high-resolution maps of flood-prone zones to support riverine flood risk studies, but caution should be taken in regions that are dry, steep, very flat or near the coast.

Abstract. Riverine flood risk studies often require the identification of areas prone to potential flooding. This modelling process can be based on either (hydrologically derived) flood hazard maps or (topography-based) hydrogeomorphic floodplain maps. In this paper, we derive and compare riverine flood exposure from three global products: a hydrogeomorphic floodplain map (GFPLAIN250m, hereinafter GF-PLAIN) and two flood hazard maps (Flood Hazard Map of the World by the European Commission's Joint Research Centre, hereinafter JRC, and the flood hazard maps produced for the Global Assessment Report on Disaster Risk Reduction 2015, hereinafter GAR). We find an average spatial agreement between these maps of around 30 % at the river basin level on a global scale. This agreement is highly variable across model combinations and geographic conditions, influenced by climatic humidity, river volume, topography, and coastal proximity. Contrary to expectations, the agreement between the two flood hazard maps is lower compared to their agreement with the hydrogeomorphic floodplain map. We also map riverine flood exposure for 26 countries across the global south by intersecting these maps with three human population maps (Global Human Settlement population grid, hereinafter GHS; High Resolution Settlement Layer, hereinafter HRSL; and WorldPop). The findings of this study indicate that hydrogeomorphic floodplain maps can be a valuable way of producing high-resolution maps of flood-prone zones to support riverine flood risk studies, but caution should be taken in regions that are dry, steep, very flat, or near the coast.

Introduction
Flood disasters are a major cause of loss throughout the globe, claiming thousands of lives and causing substantial economic damage every year (CRED and UNDRR, 2020). Our ability to map population growth within flood-prone zones is important since increased exposure is a key driver of flood risk (Ceola et al., 2014;CRED and UNDRR, 2020;Jongman et al., 2012;Winsemius et al., 2016). Global maps of flood-prone zones and human settlements are useful for detecting risk hotspots across the world and may also be used for local studies in data-poor regions (UN SDSN, 2020;Ward et al., 2015). Open-access global maps are providing a variety of alternative products (Lindersson et al., 2020); navigating among these offers might be challenging to the users.
Riverine flood risk studies traditionally outline floodprone zones with hazard maps from hydrodynamic or hydraulic models . These hazard maps typically show flood extent corresponding to a certain probability, for example, the 100-year flood (meaning it has a 1 % chance to occur any given year). This is, however, a computationally demanding method that requires a lot of data: estimating river flow corresponding to a certain probability requires long time series of meteorological or hydrological data, which is a rarity (Blöschl et al., 2013;Kidd et al., 2017).
Hydrogeomorphic methods for mapping floodplains, on the other hand, distinguish the characteristic shapes of floodplains based on topography with the advantage of not relying on the availability of hydrologic data measurements (Dodov and Foufoula-Georgiou, 2006;Nardi et al., 2006). Hydrogeomorphic terrain analysis is furthermore computationally ef-S. Lindersson et al.: How do hydrogeomorphic floodplain maps compare to flood hazard maps? ficient and scale-invariant; the floodplain maps can without difficulty be renewed and made more detailed whenever refined terrain models become available (Manfreda et al., 2014;Nardi et al., 2018;Tavares da Costa et al., 2019). Yet, a floodplain map does not provide the user with any information about flood extent probability, which is often needed in flood risk applications (Dottori et al., 2016a;Sampson et al., 2015). Annis et al. (2019) and Tavares da Costa et al. (2019) suggest that hydrogeomorphic models can be useful for largescale floodplain mapping in ungauged basins and in locations where reliable flood hazard maps are unavailable. Di  argue that flood hazard maps and hydrogeomorphic floodplain maps are complementary and should both be used for identifying flood-prone zones in data-scarce regions following the precautionary principle (Foster et al., 2000). The recently developed global floodplain map GF-PLAIN250m , hereafter GFPLAIN, built by hydrogeomorphic terrain analysis, has raised interest due to its potential for large-scale riverine flood risk applications (Akhter et al., 2021;Mazzoleni et al., 2020;Nardi et al., 2019).
What we know, however, is that the results of large-scale flood exposure analysis heavily depend upon the datasets used (Aerts et al., 2020;Bernhofen et al., 2018;Dottori et al., 2016a;Smith et al., 2019;Trigg et al., 2016;. This is exemplified in the work undertaken by Trigg et al. (2016) showing that outputs from six individual global flood hazard models yield considerably different exposure estimates for the African continent. While finding many areas of agreement, particularly in connection to large rivers with distinct floodplain boundaries, the overall model agreement was only 30 % to 40 % for the entire African continent (Trigg et al., 2016). The flood models particularly disagreed in arid regions, deltas, and large wetlands (Trigg et al., 2016). The floodplain map GFPLAIN has not been included in large-scale comparison studies, however, so the suitability of GFPLAIN compared to other global flood hazard maps for analysing riverine flood exposure remains unclear.
Previous work comparing hydrogeomorphic floodplain maps with flood hazard maps has primarily been limited to local and regional case studies. Nardi et al. (2019) quantitatively compared the consistency between GFPLAIN and a continental flood hazard map for several European rivers. Annis et al. (2019) found that the consistency between a hydrogeomorphic floodplain map and a flood hazard map was affected by the interplay of terrain model resolution, terrain analysis scaling parameter, and river stream order. Furthermore, the consistency increased with increasing return period of the flood hazard map, meaning that the hydrogeomorphic model delineates floodplains generated by high-magnitude, low-frequency events . Nardi et al. (2018) analysed how well hydrogeomorphic floodplain results mimicked flood hazard maps and predominantly found consistencies in floodplain areas unaltered by humans.
In this study, we build upon the existing literature comparing outputs of global flood models, by examining how the global floodplain map GFPLAIN  compares to two commonly used global flood hazard maps when estimating riverine flood exposure: the Flood Hazard Map of the World by the European Commission's Joint Research Centre (Dottori et al., 2016a) and the flood hazard maps produced for the Global Assessment Report on Disaster Risk Reduction 2015 (CIMA Foundation, 2015). The aim of our analysis is to (a) examine the spatial agreement between GF-PLAIN and global flood hazard maps across a range of geographic conditions and (b) demonstrate how model differences affect the estimation of riverine flood exposure. We performed this comparison in three steps. First, we quantified the model agreement for all the river basins of the world covered by all three models. Second, we controlled if the model agreement is associated with specific hydro-environmental attributes. Third, we mapped riverine flood exposure for 26 countries across the global south by intersecting the maps of flood-prone zones with three individual population datasets. The purpose of this paper is to shed light on the usability of hydrogeomorphic floodplain maps in flood risk studies and how the usability varies across geographic conditions.

Data
This section describes the datasets used for comparing the usability of a global hydrogeomorphic floodplain map to two flood hazard maps for riverine flood exposure analysis. A technical summary of the flood and population maps can also be found in Table A1.

Flood maps
We represent riverine flood hazard by comparing three models of flood-prone zones: the global floodplain map GF-PLAIN ; the Flood Hazard Map of the World by the European Commission's Joint Research Centre, hereafter JRC (Dottori et al., 2016a); and the flood hazard maps produced for the Global Assessment Report on Disaster Risk Reduction 2015, hereafter GAR (CIMA Foundation, 2015). The purpose of this comparison is to analyse how GF-PLAIN, derived with hydrogeomorphic terrain analysis, represents riverine flood-prone zones compared to the flood hazard models of the JRC and GAR.
There are currently several outputs from global flood models available, as exemplified by the hazard maps of CaMa-Flood (Yamazaki et al., 2011), the GAR (CIMA Foundation, 2015), Fathom Global (Sampson et al., 2015), the JRC (Dottori et al., 2016a), and GLOFRIS . We used the JRC and GAR since they are open-access datasets and are, like GFPLAIN, based on the global elevation model Shuttle Radar Topography Mission (SRTM) (Farr et al., 2007;Reuter et al., 2007). This allows for a compar-ison of methods delineating flood-prone zones rather than a comparison of underlying terrain model performance. Furthermore, the JRC and GAR have been in high demand in flood exposure studies Ehrlich et al., 2018;UNISDR, 2015;Zischg and Bermúdez, 2020) and included in previous inter-model comparison and validation studies (Aerts et al., 2020;Bernhofen et al., 2018;Trigg et al., 2016). The GAR model has also been referred to by previous literature as the CIMA-UNEP model (Bernhofen et al., 2018;Trigg et al., 2016).
The model structure of the JRC and GAR differs in the sense that the GAR builds upon one-dimensional hydraulic modelling, while the JRC builds upon two-dimensional hydrodynamic modelling (Dottori et al., 2016a;CIMA Foundation, 2015). The model chain of the JRC used 33 years of reanalysed climate data (ERA-Interim) to calculate the probability of each pixel being flooded (Dottori et al., 2016a). The model of the GAR used statistical regionalization of gauged streamflow observations to calculate extreme flow values, which were then used as input to a hydraulic model for flood inundation mapping (CIMA Foundation, 2015). The maps of the JRC differ from GFPLAIN and the GAR in river network coverage: GFPLAIN and the GAR include rivers with upstream drainage areas larger than 1000 km 2 , while the corresponding threshold for the JRC is 5000 km 2 due to the coarse spatial resolution of the climate data (Dottori et al., 2016a;Nardi et al., 2019;CIMA Foundation, 2015).
A validation study by Bernhofen et al. (2018) included both the JRC and GAR when comparing global flood models to satellite images of past floods for three regions in Nigeria and Mozambique. The two-dimensional hydrodynamic model structure of the JRC had the highest performance scores across the three sites, but the one-dimensional method of the GAR also showed benefits for modelling central parts of the floodplains missed by the other flood models (Bernhofen et al., 2018). The models generally performed better in confined floodplains compared to flat extensive ones (Bernhofen et al., 2018).
The flood model of the JRC does not take into account flood protection infrastructure; overbank flow tends to result in floodplain inundation due to unconstrained lateral extents (Dottori et al., 2016a). This should favour agreement with the floodplain delineation of GFPLAIN since the hydrogeomorphic terrain method does not capture disrupted connectivity between the river channel and floodplain either. The GAR hazard maps, on the other hand, include the effect of flood protection infrastructure by having been post-processed based on the assumption that the design level of flood defence is a function of the maximum gross domestic product (GDP) of the area (CIMA Foundation, 2015). The modelled defence levels in the GAR were assumed to partially fail if the design return period was exceeded (CIMA Foundation, 2015).
The morphology of rivers, which the hydrogeomorphic models aim to capture, is predominantly shaped by high-magnitude, low-recurrence flood events Bhowmik, 1984;Nardi et al., 2018). We therefore conducted all analyses in this study using the JRC and GAR hazard maps with return periods of 100 and 500 years (hereafter JRC-100, JRC-500, GAR-100, and GAR-500).
The spatial resolution of GFPLAIN is 8.33 arcsec (∼ 250 m near the line of the Equator), and the spatial resolution of the JRC and GAR hazard maps is 30 arcsec (∼ 1 km). The GFPLAIN dataset is provided as one raster file per continent, covering a near-global extent (60 • N, 56 • S). The JRC is offered as global (85 • N, 85 • S) seamless raster files; raster files of the GAR also cover the near-global extent (60 • N, 56 • S).

Population maps
Throughout this paper, we use the term flood exposure to refer to the number of people located within flood-prone zones. Exposure analysis was conducted by intersecting GFPLAIN and the JRC with three individual population maps: the High Resolution Settlement Layer, hereafter HRSL (Facebook Connectivity Lab and CIESIN, 2016); Global Human Settlement population grid, hereafter GHS (Schiavina et al., 2019); and the WorldPop population grid, hereafter World-Pop (Gaughan et al., 2013;Linard et al., 2012;Sorichetta et al., 2015;. We chose to conduct the exposure analysis with these population datasets since they represent diverse methodologies for mapping populations, each having individual advantages and limitations. By including all three datasets in the exposure analysis we are able to present a range of results, illustrating how the choice of flood map interacts with the choice of population map. We briefly describe the main differences between the population datasets here; other studies have more closely reviewed how the choice of population dataset affects exposure analysis (Leyk et al., 2019;Smith et al., 2019). First of all, the individual population maps offer a range of spatial resolutions: the HRSL is 1 arcsec (∼ 30 m), WorldPop is 3 arcsec (∼ 90 m), and the GHS is 9 arcsec (∼ 250 m).
Dasymetric mapping refers to, in this case, the spatial reallocation of census data to individual pixels using information from ancillary data (Leyk et al., 2019;Wright, 1936). The GHS used a binary dasymetric approach to map population count, allocating census data to built-up areas detected with Landsat satellite imagery (Schiavina et al., 2019). The HRSL used convolutional neural networks for allocating census data to buildings that have been detected with DigitalGlobe high-resolution satellite imagery (Smith et al., 2019). WorldPop used a multivariate dasymetric approach to allocate population data to settlements detected with Landsat satellite imagery, with multiple ancillary data layers, e.g. land cover, built structures, infrastructure, travel time to major cities, nighttime lights (Lloyd et al., 2017). The multi-variate modelling approach of WorldPop means that these covariate variables might prevent interaction studies.
The main asset of the GHS is the temporal depth of the dataset, enabling change detection analysis with its globally consistent population grids for 1975, 1990(Ehrlich et al., 2018. WorldPop currently offers population count maps for every year between 2000-2020. The rather coarse spatial resolution of Landsat means, however, that the GHS and WorldPop tend to leave out dispersed rural settlements (Smith et al., 2019;UN SDSN, 2020). The HRSL has been shown to better represent settlements in rural areas (Smith et al., 2019;Tiecke et al., 2017) but is, however, presently only available as a static dataset.
The population maps of the HRSL and WorldPop are provided as one raster file per country; the HRSL covers ∼ 140 countries, while WorldPop is global. The GHS is offered as a global (85 • N/85 • S) seamless raster file. Population maps used in this study represent the year 2015 (GHS) and 2018 (HRSL and WorldPop).

Data homogenization
The geospatial analysis has primarily been conducted using the cloud computation platform Google Earth Engine (Gorelick et al., 2017). GFPLAIN was tailored for analysis by merging the individual continental images into one single image. GFPLAIN, the JRC, and the GAR were then reclassified to binary wet-dry maps. Normally wet areas were masked from GFPLAIN, the JRC, and the GAR with the global water mask MOD44W, a product that combines waterbody data from the SRTM and the satellite imagery of MODIS (Carroll et al., 2009). MOD44W shares the spatial resolution with GFPLAIN,8.33 arcsec (∼ 250 m), to which all flood maps were homogenized.
The individual country images of the HRSL and WorldPop were also merged into seamless global images using the median pixel value where multiple images overlap. The metadata of the population maps were filtered to the year 2015 for the GHS and the year 2018 for the HRSL and WorldPop.

Model agreement
To quantify the model agreement between GFPLAIN, the JRC, and the GAR, we used the model agreement index (MAI; Eq. 1), as proposed by Trigg et al. (2016). This agreement index allows for an evaluation of multiple raster files. The maps from all three models were first aggregated (separately for each return period) into categories based on how many models agree that each pixel is flooded: 0 -all models are dry, 1 -one model is wet, 2 -two models are wet, 3 -all models are wet. This aggregation was conducted at the spatial resolution of 8.33 arcsec. MAI values (Eq. 1) were then calculated for all the basins in the world that are cov-ered by all three models, resulting in 2776 river sub-basins (hereafter basins). By only including the basins covered by all three flood models we ensured that differences in spatial coverage did not affect the results. For instance, the individual flood maps mask arid areas, to different degrees, since aridity poses a challenge for traditional flood model assumptions. The HydroBASINS Level 5 dataset, based on 15 arcsec resolution raster data, was used for outlining the river basin boundaries (Lehner and Grill, 2013). This dataset has subdivided watershed boundaries to sub-basins according to the Pfafstetter coding system (Lehner, 2014;Lehner and Grill, 2013). Information about the area distribution of the 2776 analysed basins is provided in Table A2; the mean area is 33 000 km 2 . The output value of MAI (Eq. 1) ranges between 0 (no agreement) and 1 (full agreement): where A is the total number of flooded pixels by all evaluated models, a i is the number of pixels flooded by that particular aggregated category, N is the number of models in comparison, and i is the number of models in agreement for that particular category. This index does not assume that one model is preferable to the other and also ignores the large areas that are marked as dry, which would otherwise bias the performance measure. MAI values (Eq. 1) evaluating the agreement of the models GFPLAIN, the JRC, and the GAR were calculated for all 2776 river basins for the return periods 100 and 500 years separately, hereafter MAI-500 and MAI-100. Following this, we also quantified pairwise model agreement for each combination of flood model (Eq. 2). The pairwise model agreement in Eq. (2) corresponds to Eq. (1) when letting N = 2 and has frequently been recommended for evaluating binary maps of inundation models (Aronica et al., 2002;Bates and De Roo, 2000;Pappenberger et al., 2007;Schumann et al., 2009;Trigg et al., 2016). The agreement index in Eq. (2), also referred to as an F 2 measure (Aronica et al., 2002;Pappenberger et al., 2007) or F index , ranges between 0 (no agreement) and 1 (full agreement): where a, b, and c denote the number of pixels within each river basin according to the contingency table in Table 1. Each combination of GFPLAIN, the JRC, and the GAR was evaluated with Eq.

Spatial agreement clusters
We identified spatial clusters of basins with high and low model agreement relative to the mean, i.e. hot and cold spots, using the MAI-500 value for each river basin. This local spatial autocorrelation analysis was carried out in the software GeoDa, version 1.18 (Anselin et al., 2006). Positive spatial autocorrelation was found in high-agreement basins also having high-agreement neighbouring basins or low-agreement basins with low-agreement neighbours. More specifically, we used a local Moran statistic (Anselin, 1995), with 9999 random permutations, to identify significant clusters with pseudo p values > 0.05. The significant clusters were mapped together with the original MAI-500 values to exhibit global patterns of the model agreement between the three flood models. The clusters are calculated irrespective of basin area since a majority of the basins are relatively consistent in size, and the agreement index in Eq. (1) ignores areas marked as dry.

Hydro-environmental attributes
The association between MAI-500 and several hydroenvironmental attributes was then quantified for all 2776 river basins using statistic techniques in R (R Core Team, 2014). Basin level data for all attributes were retrieved from the datasets BasinATLAS version 1.0  and HydroBASINS Level 5 (Lehner and Grill, 2013). Nonparametric measures were used for this evaluation since the response variable, MAI-500, did not fulfil the normality assumption of general linear models. An alpha level of 0.05 was used for all statistical tests. The Kruskal-Wallis test by ranks (Hollander and Wolfe, 1973) was performed to control differences in MAI-500 values for the following factors: geographic region, river stream order (Lehner and Grill, 2013), and freshwater major habitat type (Abell et al., 2008). The freshwater major habitat types entail a combination of topographic, climatic, and hydrological properties since this classification is based on similarities in biological, chemical, and physical characteristics (WWF and TNC, 2019). All Kruskal-Wallis tests were supplemented with post hoc Wilcoxon rank-sum tests, performing pairwise comparisons between the groups. The Kruskal-Wallis tests used the Benjamini and Hochberg (1995) correction method for controlling false discovery rate. We also performed Kruskal-Wallis and Wilcoxon rank-sum tests to evaluate differences between the pairwise model agreement distributions (MAI GFPLAIN GAR-100 , MAI GFPLAIN GAR-500 , MAI GFPLAIN JRC-100 , MAI GFPLAIN JRC-500 , MAI GAR-100 JRC-100 , and MAI GAR-500 JRC-500 ).
We then quantified the level of association between the model agreement and individual hydro-environmental attributes. Spearman's rank-order correlation coefficients (Harell and Dupont, 2021;Hollander and Wolfe, 1973;Spearman, 1904) were calculated for MAI-500, MAI GFPLAIN GAR-500 , MAI GFPLAIN JRC-500 , MAI GAR-500 JRC-500 , and 23 control variables listed in Table 2. The resulting correlation coefficients between the model agreement scores and the hydro-environmental attributes were plotted in a correlogram (Wei and Simko, 2017).

Country selection
The subsequent exposure and area analyses have been conducted for 26 countries in the global south: Bangladesh, Bolivia, Brazil, Cambodia, Central African Republic, Colombia, Republic of the Congo, Ecuador, Ghana, Guatemala, Honduras, India, Indonesia, Kenya, Lao People's Democratic Republic, Liberia, Malawi, Mozambique, Nicaragua, Nigeria, Peru, Sri Lanka, Thailand, Uganda, United Republic of Tanzania, and Viet Nam. These countries are situated in the eastern, middle, and western parts of Africa; South-East Asia; South Asia; Central America; and South America.
The countries were selected based on five criteria. First, we only included countries fully covered by all six datasets used for the comparative exposure analysis (GFPLAIN, JRC, GAR, GHS, HRSL, and WorldPop). Second, only countries belonging to the low-, lower-middle-, or upper middleincome groups were selected ( Fig. 1), here classified using gross national income (GNI) per capita for the year 2015 (The World Bank, 2021b). The assumption was that, in general, the high-income countries have less need for global flood products since they tend to be more well equipped with national flood maps. Third, the selected countries all have a relatively high degree of riverine flood risk ( Fig. 1), here represented by the combination of the population within the 100-year flood hazard map (intersecting the GHS and JRC) and actual disaster reports of people affected by riverine floods between 1900 and 2020 (Guha-Sapir et al., 2014). Fourth, we excluded countries with the main climate classification being arid or snowy according to the World Map of Köppen-Geiger (Beck et al., 2018). Finally, we have aimed to get a representative sample across the regions of the world.

Exposure and area analysis
The number of people located in flood-prone zones was calculated for each country by intersecting the binary and masked images of GFPLAIN, JRC-500, JRC-100, GAR-500, and GAR-100 with the population maps of the GHS, the HRSL, and WorldPop. Country boundaries were outlined with the Global Administrative Unit Layers (GAUL) dataset Table 2. Control variables for which the association with the model agreement between the flood maps was evaluated. All control variables were retrieved from the dataset BasinATLAS version 1.0 , except river stream order and basin area, imported from the HydroBASINS dataset (Lehner and Grill, 2013). The upstream watershed is the total upstream watershed directly connected to the basin.

Attribute
Description Unit Spatial dimension Source River stream order Classification according to a classical river ordering system. Order 1 indicates the mainstem segment of the river, Order 2 indicates tributaries that flow into Order 1 segments, etc. Order 0 indicates lumped coastal basins that drain into the ocean, situated between large river basins.

Classes (4) Basin classification
HydroBASINS (Lehner, 2014;Lehner and Grill, 2013) Freshwater major habitat types Classification based on freshwater ecoregions. Classes (11) Spatial majority Freshwater Ecoregions of the World (Abell et al., 2008) Basin area Area of the individual basin and the upstream watershed, respectively.

Cubic metres per second
Annual average WaterGAP v2.2 (Döll et al., 2003) Land surface run-off (annual) Long-term (1971Long-term ( -2000   for the year 2015 (FAO UN, 2015). The number of people living in flood-prone zones was then exported for all countries and dataset combinations. The population maps were also used for calculating country population totals, which have been compared to numbers provided by the World Bank in Table A7. The percentage of the country population living in flood-prone zones was then derived for each country and dataset combination. For all area calculations of raster images, we used a pixel area function in Google Earth Engine (Gorelick et al., 2017) that minimizes projection distortions, which otherwise may be an issue when calculating areas over large regions. The function first calculated the area of each pixel using individual Lambert azimuthal equal-area projections for each block of pixels. The resulting grid of pixel areas was subsequently multiplied with the masked and binary images, for which the sum could be exported for each region of interest.
The total land surface area was calculated for all 26 countries by subtracting the total country area, as outlined by the polygons in the GAUL dataset (FAO UN, 2015), with the area of permanent surface water as given by the raster water mask MOD44W (Carroll et al., 2009). The area estimations of flood-prone zones, given by the binary and masked images of GFPLAIN, JRC-500, JRC-100, GAR-500, and GAR-100, were finally exported for all countries. The percentage of country area that is flood-prone was then derived for all countries and flood maps.

Results and discussions
In this section, we first discuss the model agreement between the flood maps in Sect. 4.1 and then turn to the implications for riverine exposure analysis in Sect. 4.2.

Model agreement across model combinations
The overall spatial pattern of the floodplain map GFPLAIN resembles the flood hazard maps of the GAR more than the flood hazard maps of the JRC (Fig. 2). A possible explanation of this is that GFPLAIN and the GAR cover larger The box hinges correspond to the first and third quartile, and the yellow line corresponds to the second quartile (median value). The mean value is indicated with a cross. The whiskers extend from the box hinges with maximum lengths equal to the inter-quartile range multiplied by a factor of 1.5. Values outside of the whiskers are indicated with dots. A higher return period shows higher agreement with GFPLAIN, but the choice of flood model influences the level of agreement even more. The flood hazard maps the GAR and JRC show the lowest level of agreement. The index has been calculated here for the 2776 river basins across the world that are covered by all three models. parts of each river network compared to the JRC since they exhibit lower inclusive thresholds for the upstream drainage area. Figure 2 shows that the model agreement between the floodplain map GFPLAIN and the flood hazard maps of the GAR is, on average, around 35 %. The corresponding agreement between GFPLAIN and the flood hazard maps of the JRC is around 30 %. These agreement levels are in line with the results of Trigg et al. (2016) when comparing the outputs of six global flood models for the African continent, finding an average model agreement of around 30 % to 40 %. It is also clear from Fig. 2 that there is a large spread of agreement scores across all 2776 river basins: all pairwise comparisons range between the maximum model agreement value of 100 % and very close to the minimum value of 0 %.
The pairwise model agreement evaluation also confirms that the model agreement between GFPLAIN and the hazard maps is higher for the return period of 500 years compared to 100 years. This is consistent with previous findings in the literature about how the morphology of rivers, which the hydrogeomorphic floodplain maps delineate, is predominantly shaped by high-magnitude, low-recurrence flood events Bhowmik, 1984;Nardi et al., 2018). Figure 2 pinpoints, however, that choice of hazard model has a greater influence on the degree of model agreement with GFPLAIN compared to the choice of return period.
Contrary to expectations, the model agreement between the hazard maps of the JRC and GAR is lower compared to their agreement with GFPLAIN. The median agreement values across all river basins are found to be 0.34 for GFPLAIN and GAR-500, 0.27 for GFPLAIN and JRC-500, and 0.20 for GAR-500 and JRC-500. The reasons for this result are not evident, but it supports earlier findings of significant disagreement among individual global flood models (Trigg et al., 2016;Ward et al., 2015). All distributions from the pairwise model agreement evaluation show positive skewness, meaning that the bulk of river basins have low values (Fig. 2). The portion of river basins having low agreement scores is highest for the GAR and JRC comparison, illustrated by the relatively big differences between the average and median values. A Kruskal-Wallis test showed that there is a significant difference in model agreement between the distributions (H (5) = 1217, p < 0.001); the pairwise comparisons from the post hoc test can be found in Table A3.

Model agreement across geographic conditions
We also analysed how MAI-500, the model agreement between GFPLAIN, GAR-500, and JRC-500, varies across space and a set of hydro-environmental groups. We chose to represent the model agreement with MAI-500 since we generally found a higher model agreement using the 500-year return period compared to the 100-year return period. Figure 3 provides the spatial distribution of MAI-500 across all 2776 river basins and local clusters of high and low model agreement relative to the mean, as identified from the spatial autocorrelation analysis. These maps suggest that the model agreement is associated with factors related to climate, topography, and coastal proximity.
The river basins with the highest model agreement are generally located in humid regions (Fig. 3). For instance, it is the equatorial humid climate regions that exhibit the highest model agreement in the South American continent, including the north-western part of Brazil and the north-eastern part of Peru. The high-agreement cluster in North America, near the Mississippi River Delta, and the high-agreement cluster in south-eastern Europe are also situated in warm temperate and fully humid regions. In contrast, a poor model agreement is generally found in dry regions. The regions in North America with cold climate entail large clusters of basins with a low model agreement. Arid desert regions also exhibit low model agreement clusters, such as the Kalahari Desert in southern Africa.
Relating to topography, Fig. 3 also shows that alpine regions like the Andes in South America and high-mountain Asia exhibit poor model agreement. This can, at least partly, be explained by the same regions being dry. It also seems possible that very steep regions would be prone to error. However, very flat regions also tend to exhibit poor model agreement across climate types. This may be exemplified by the flat grassland of the humid Pampa region in Argentina and the flat inland areas on the tropical islands of Indonesia. Figure 3a furthermore illustrates that coastal river basins seem to exhibit low model agreement, visible as white lines along many coastlines. This tendency is not captured well by the clusters in Fig. 3b since one single low-agreement basin near the coast would not count as a cluster. A possible explanation for the low agreement in coastal river basins might be due to complex flood hydraulics yielding different results, in line with previous findings of low agreement in deltas (Dottori et al., 2016a;Trigg et al., 2016). For instance, GFPLAIN tends to miss floodplain areas near the coast (on the order of a few kilometres), possibly because these areas are challenging to detect with hydrogeomorphic analysis. The individual riverine flood maps might also differ in their respective demarcation to coastal flooding. Another possible explanation for the low model agreement in coastal river basins is that the JRC might miss small coastal watersheds compared to GAR and GFPLAIN due to having a higher threshold of upstream drainage area. It can also be seen in Fig. 3 that major river deltas often exhibit high model agreement on the basin level, as exemplified by the Mississippi River Delta in the US and the Mekong River Delta in Viet Nam and Cambodia. This is an indication that the volume of the river also affects the model agreement, which seems possible since large river discharge imposes large forces on the surrounding landforms, the very shapes that GFPLAIN aims to capture.
Thus far, we have elaborated on what might cause the spatial variation in MAI-500 based on a visual inspection of the global maps in Fig. 3. We also quantified these results by assessing how the MAI-500 value varies between a set of factoring groups: geographic region, river stream order, and freshwater major habitat type. Across the basins situated in each geographic region, Fig. 4a conveys that the highest model agreement can, on average, be found in South America (39 %) and Central and South-East Asia (39 %), while North America (29 %) and Africa (32 %) exhibit the lowest model agreement. This supports, again, that humid re-gions tend to perform better compared to dry regions. The Kruskal-Wallis test showed that there is a significant difference in model agreement between the regions (H (6) = 146, p < 0.001); the pairwise comparisons from the post hoc test can be found in Table A4. Figure 4b illustrates how MAI-500 varies with river stream order, supporting that model agreement tends to be negatively associated with coastal proximity. Furthermore, the Kruskal-Wallis test showed that the model agreement significantly varies between river stream orders (H (3) = 101, p < 0.001). More specifically, the model agreement in coastal basins, stream order 0, was identified as significantly lower compared to inland stream order groups, verified by the post hoc Wilcoxon rank test (p values < 0.001; see Table A5).
We now move on to the freshwater major habitat types, a classification based on the spatially dominating habitat type in each river basin (WWF and TNC, 2019). Before presenting the model agreement variations between the habitat types, however, we point out that a classification on the river basin level will inevitably contain multiple smaller habitat types. For instance, the habitat type "large lakes" includes river basins that are dominated by large lentic systems, in our case covering three large regions: Lake Baikal in Siberia, Lake Malawi in Africa, and Michigan-Huron in North America. But a river basin of this habitat type can, for example, also contain grassy savannas or swamps (WWF and TNC, 2019). Another point is that the habitat type "large river deltas" only includes regions that exhibit both deltaic physical features and deltaic fauna (WWF and TNC, 2019). This habitat type covers four large regions within our analysed river basins: the Niger Delta in Nigeria, the Mekong Delta in Viet Nam and Cambodia, the Orinoco Delta in Venezuela, and Brazilian delta regions between the rivers Amazon and Mearim. In other words, deltas without the characteristic deltaic fauna do not belong to this habitat type. For instance, the river basins containing the Mississippi River Delta belong to the habitat type "temperate floodplain rivers and wetland complexes". Another point is that the difference between rivers classified as floodplain rivers and upland rivers is the presence of cyclically flooded floodplains. Floodplain rivers are characterized by having cyclically flooded floodplains (today or historically), while the upland rivers are not (WWF and TNC, 2019). Upland rivers can for instance be tributaries of large river systems. Figure 4c shows how the model agreement varies between the freshwater major habitat types. Floodplain rivers, in temperate and tropical climate regions, exhibit higher model agreement compared to upland rivers, followed by coastal rivers. We can also see, as previously discussed, that freshwaters in arid, here expressed as xeric, and montane regions exhibit poor model agreement. The group "large river deltas" is ranked as having the highest model agreement, clearly influenced by the high agreement in large river deltas, like the Mekong. These results should be interpreted with caution, however, due to the unbalanced sample sizes between the groups (Fig. 4c). Nonetheless, the inherent order between the groups supports previous discussion about how factors related to climate, topography, coastal proximity, and river volume affect MAI-500 values. The Kruskal-Wallis test showed that the agreement difference is significant between the habitat types (H (10) = 478, p < 0.001). The pairwise comparisons from the post hoc test can be found in Table A6.

Model agreement association with hydro-environmental attributes
Section 4.1 has so far demonstrated that MAI-500, the level of model agreement between GFPLAIN, GAR-500, and JRC-500, is linked to factors related to climate, topography, coastal proximity, and river volume. We have done this by visually exploring the spatial pattern of the model agreement and by grouping the values according to geographic region, river stream order, and freshwater major habitat type. Each of these habitat groups contains a variety of geographic characteristics, so we also analysed the level of association between model agreement and individual hydro-environmental attributes and investigated whether the association differs between the model agreement pairs. Figure 5 presents the Spearman rank-order correlation coefficients in a correlogram, supporting the statement that wet regions tend to exhibit higher model agreement compared to dry regions. Precipitation, land surface run-off, the aridity index, and the climate moisture index are positively associated with the level of model agreement. Precipitation exhibits the strongest positive association among these variables, especially for MAI GFPLAIN JRC-500 . The association levels between MAI GAR-500 JRC-500 and these climatic variables are, however, relatively weak or even insignificant (the aridity index and the climate moisture index). In other words, the level of agreement between GFPLAIN and the hazard maps seems to be positively influenced by a wet climate, while the agreement between the JRC and GAR seems to be controlled by other factors. This result further supports the comment that the skill of hydrogeomorphic floodplain maps to represent flood hazard depends on the availability of water.
The variables that represent the magnitude of river discharge also exhibit a positive association with the model agreement in an even stronger sense compared to the previously mentioned climatic variables (Fig. 5). This is particularly evident for the variables natural discharge and upstream river volume, having moderate to strong positive association with all model agreement distributions. For the very same reason, we can see that the size of the upstream drainage area exhibits a positive association with the model agreement but to a moderate degree. As previously discussed, increased discharge means larger forces forming the landscape and hence more well-defined floodplains. The size of the river basin being evaluated, on the other hand, co-variates with the model agreement in a weak negative direction. One explanation for this is that very small river basins get maximum model agree- Figure 4. How the model agreement between GFPLAIN, GAR-500, and JRC-500 varies with (a) geographic region, (b) river stream order, and (c) freshwater major habitat type. The model agreement index ranges between 0 % (no agreement) and 100 % (full agreement) and was calculated here for all the river basins of the world that are covered by all three models. The box hinges correspond to the first and third quartile, and the yellow line corresponds to the second quartile (median value). The mean value is indicated with a cross. The whiskers extend from the box hinges with maximum lengths equal to the inter-quartile range multiplied by a factor of 1.5. Values outside of the whiskers are indicated with dots. The number of river basins corresponding to each group is indicated in numbers and box width. Coastal river basins tend to have lower model agreement scores compared to inland river basins. The freshwater major habitat type specifies the spatially dominating habitat type in each river basin (WWF and TNC, 2019). The habitat type "large river deltas" only includes regions that exhibit both deltaic physical features and deltaic fauna. The habitat type "large lakes" includes river basins that are dominated by large lentic systems. The grouping of basins into geographic regions follows the HydroBASINS dataset. ment scores when they are fully covered by all three flood models.
The association between model agreement and the variables elevation, slope, and stream gradient is significant in a negative direction (Fig. 5). This finding supports previous discussion about how high-altitude and steep regions tend to exhibit poor model agreement. The association between these topographic variables is strongest for the model agree-ment between GFPLAIN and the GAR, whereas the association is weakest for the agreement between GFPLAIN and the JRC. The reason for this result is not evident, but one explanation could be that the one-dimensional hydraulic modelling of the GAR tends to be sensitive to these topographic variations. These findings may also be somewhat limited since the relationship between model agreement and, for instance, the slope is not monotonic: we have already observed Figure 5. Spearman's rank-order correlation coefficients between the flood model agreement scores and 27 hydro-environmental attributes. Wetter conditions due to climate (precipitation, land surface run-off) and river size (e.g. river volume, natural discharge) are positively associated with the model agreement between the flood maps. Topographic variables like high altitude (elevation) and steep slope (e.g. slope and stream gradient) are negatively associated with the model agreement, especially between GFPLAIN and GAR-500. Please note that the aridity index increases with humidity. how many very flat areas also exhibit poor model agreement (Fig. 3).
The overall association between model agreement and the anthropogenic influences (population count, GDP) and the land cover characteristics (e.g. urban extent, cropland extent) is weak. These results are likely to be related to the scale of the analysis; any possible association would be averaged out on the aggregated basin level. Nardi et al. (2018) predominantly found consistencies between a hydrogeomorphic floodplain map and flood hazard maps in areas unaltered by humans. However, the population count and GDP in the upstream watershed regions show moderate positive association with the model agreement scores since these variables are also correlated with the variables that represent the magnitude of river discharge (upstream drainage area, upstream river volume, and natural discharge).

Implications for exposure analysis
Our analysis has shown that the level of agreement between GFPLAIN and the GAR and JRC flood hazard maps is, on average, higher compared to the agreement between the GAR and JRC. Furthermore, we have shown that the model agreement is linked to factors related to climate, topography, river size, and coastal proximity. We further analysed how these differences affect riverine flood exposure analysis, which we have conducted for 26 countries using the GHS, WorldPop, and HRSL population maps.
The flood hazard maps GAR-500 and GAR-100 cover an area of ∼ 3.3×10 6 and ∼ 3.18×10 6 km 2 across the 26 countries, while the floodplain map GFPLAIN covers ∼ 3.17 × 10 6 km 2 . The hazard maps JRC-500 and JRC-100 cover an area of ∼ 1.77×10 6 and ∼ 1.63×10 6 km 2 , respectively. One explanation for this large difference is that the GAR and GF-PLAIN include smaller rivers compared to the JRC. Figure 6 shows the flood exposure estimates of each map combina- Figure 7. Riverine flood exposure estimates, aggregated at country level, using the hydrogeomorphic floodplain map GFPLAIN (bars on the left) and the flood hazard maps of the JRC (bars on the right) and GAR (shaded bars on the right) with a 500-year return period. The green, orange, and pink horizontal bars indicate the percentage of the country's population in flood-prone zones using the population maps Global Human Settlement population grid, High Resolution Settlement Layer, and WorldPop, respectively. The vertical blue bars indicate the percentage of land area that is flood-prone. Country boundaries follow the GAUL 2015 dataset, and permanent water has been masked by the product MOD44W. tion, aggregated across all 26 countries. Interestingly, GF-PLAIN yields a larger exposure estimate compared to the GAR and JRC, even though GAR-500 covers a larger total area than GFPLAIN. One explanation of the higher population estimate given by GFPLAIN is that the floodplain map covers a larger area of densely populated regions in India, resulting in 44 million more exposed people with GFPLAIN compared to GAR-500 (using the HRSL population map). This difference is, however, counteracted by the instances where GAR-500 yields larger exposure estimates, for example, 21 million more exposed people in Bangladesh.
We would like to highlight that (1) the choice of flood model influences the exposure estimates to a higher degree than the choice of the hazard return period. We think that this finding provides an important perspective to end users of global flood maps; (2) we observe an inherent order across the population maps (GHS > HRSL > WorldPop), which can be explained by the varying population totals across these datasets (Table A7). WorldPop, for instance, exhibits considerably lower population values for all 26 countries. Scaling the country totals would enable a more consistent comparison of the exposure hit between the individual population datasets. We did not scale the population estimates, however, since this study aims to capture how dataset choice affects exposure estimates.
Let us now turn to discuss how the riverine exposure estimates vary across individual countries and map combinations. Figure 7 presents the flood-prone area and exposed population as proportions for each country. Among our analysed 26 countries, countries in South Asia and South-East Asia stand out as having the highest riverine flood exposure. For instance, the largest number of people living within riverine flood-prone zones can be found in India (346 million), Bangladesh (100 million), Viet Nam (53 million), and Indonesia (37 million). The proportion of exposed people is largest for Bangladesh (59 %), Viet Nam (54 %), Cambodia (53 %), and Laos (50 %). The proportion of land area that is flood-prone is largest for Bangladesh (57 %), Cambodia (28 %), Bolivia (26 %), and Thailand (21 %). The numbers listed here represent the intersection of GFPLAIN and the HRSL. All results of the area calculations and exposure estimations have been made available in Tables A8-A10. It can also be seen in Fig. 7 that GAR-500 covers the largest area in 17 countries, whereas GFPLAIN covers the largest area in the remaining 9 countries. Honduras is the only country where the JRC covers a larger area compared to GFPLAIN. This is explained by the fact that coastal areas make up a large part of this country, and these are more often masked from GFPLAIN compared to the JRC. This tendency is not consistent, however, and for instance does not apply to Sri Lanka. One of the issues emerging from this finding is that masked areas near the coast can affect the exposure analysis result. Even though one would not expect the usage of riverine flood maps for estimating coastal flood risk, it might still affect riverine exposure estimations on, say, the country level since the coastal areas also tend to be densely populated. This is most certainly the case in Liberia. GFPLAIN covers a larger area compared to the JRC but misses parts of the densely populated coastal city Monrovia, resulting in an exposure estimate that is smaller by 0.43 million compared to the JRC. At the same time, variations among the flood maps can also affect the coverage of inland cities, as in the case of Colombia. GAR-500 covers a larger area compared to GFPLAIN, but GFPLAIN yields an exposure estimate that is larger by 4.5 million from covering a larger part of the capital city Bogotá.
As far as the influence of the return period is concerned, the difference between the exposure estimates of JRC-500 and JRC-100 is largely proportional to the corresponding area difference. This is also the case for the hazard maps of the GAR, with some exceptions. The area ratio between GAR-500 and GAR-100 in Thailand, for example, is 1.09, while the corresponding exposure ratio using the HRSL is 1.83. Similar results can be found for Indonesia (1.04 compared to 1.22), Brazil (1.03 compared to 1.17), and Colombia (1.02 compared to 1.13). This finding might reveal how pop-ulations in these countries tend to live outside but close to the 100-year flood hazard zone. Figure 8 illustrates some of the implications that the dataset differences have for riverine exposure analysis. Figure 8a covers a part of the Amazon River in Brazil and, in Sect. 4.1.2, is identified as a high-model-agreement cluster. The spatial patterns of GFPLAIN, JRC-500, and GAR-500 are quite similar for this major river. GFPLAIN and the GAR cover smaller river tributaries compared to the JRC, but this difference does not affect the exposure estimates since the population is clustered in the city of Manaus near the main river. Figure 8b illustrates a major discrepancy in the handling of coastal regions between the individual models, exemplified by the Niger Delta in Nigeria. The JRC covers the entire river basin, while both GFPLAIN and the GAR have masked the coastal areas to different degrees. These discrepancies affect the exposure hits in cities like Port Harcourt and Warri. Figure 8c covers a part of the Krishna River in India. This map also illustrates how GFPLAIN and the GAR include smaller rivers than the JRC, as is the case for the river tributaries north of the city Khammam. We can also see how GFPLAIN covers a smaller area in the south-eastern map corner but reaches the city of Guntur, nonetheless, in contrast to the GAR and JRC. Figure 8d covers a part of the Mekong River in Cambodia, also identified as a high-modelagreement cluster in Sect. 4.1.2. The spatial pattern is indeed quite similar between the flood maps for this large river but with some discrepancy. For instance, GFPLAIN does not exhibit the same gaps in the flood extent, as can be seen in the northern part of the JRC and GAR flood maps. These gaps represent hilly areas that in reality would be unlikely to flood and could be an artefact of the hydrogeomorphic delineation method. GFPLAIN furthermore covers a larger portion of the capital Phnom Penh, affecting the exposure hits.
In closing this section, we want to pinpoint one last remark from the findings illustrated by Fig. 7 related to the inherent order between the population maps (GHS > HRSL > WorldPop). As discussed, a scaling of the country totals would enable a more consistent comparison of the exposure hit between the individual population datasets. Nonetheless, we have indeed conducted a form of scaling in Fig. 7 when normalizing the exposed population estimates with the population totals. The proportion of the exposed population is still generally highest for the GHS population map, whereas the results vary for the HRSL and WorldPop. This can be contrasted to the study of Smith et al. (2019), finding considerably lower exposure estimates for the HRSL compared to the GHS and WorldPop when scaling the demographic datasets to share the country totals. But an important aspect of their finding, besides the scaling of country totals, is that the hazard data need to be high-resolution to make use of the detailed settlement representation of the HRSL (Smith et al., 2019). We think that this also highlights an important aspect of the usability of hydrogeomorphic floodplain maps in riverine exposure analysis. The opportunity to delineate high-resolution floodplain maps from new terrain models can play an important role in conducting riverine flood exposure estimations in data-poor regions if one wishes to make the best use of the detailed representations of new population maps, like the HRSL.

Conclusions
In this paper, we evaluated model agreement between the hydrogeomorphic floodplain map GFPLAIN with the flood hazard maps of the JRC and GAR across geographic conditions and a set of hydro-environmental attributes. We demonstrated that the level of model agreement between GFPLAIN and the flood hazard maps is linked to climatic conditions, topography, coastal proximity, and river volume. We also conducted riverine exposure analysis for 26 countries by intersecting the flood maps with three demographic datasets to explore how model differences affect riverine flood exposure estimations. Our findings can be summarized by three main points.
First, our results confirm that the consistency between GF-PLAIN and the flood hazard maps increases with the return period. The choice of model for the flood hazard map is, nevertheless, more important for evaluating model agreement on the river basin level and, also, for affecting the results of exposure analysis.
Second, our study confirms that the results of riverine exposure analysis are highly dependent upon the choice of datasets. Contrary to expectations, the model agreement between the JRC and GAR hazard maps is lower compared to their agreement with GFPLAIN. The median agreement values across all river basins are found to be 0.34 for GFPLAIN and GAR-500, 0.27 for GFPLAIN and JRC-500, and 0.20 for GAR-500 and JRC-500. There is a large spread across all basins, however, ranging between the maximum model agreement value 1 and very close to the minimum value 0. This finding (yet again) stresses the uncertainties in global flood models, suggesting that users might benefit by using an ensemble of models as a way of precaution.
Third, the agreement level between GFPLAIN and the flood hazard maps suggests that hydrogeomorphic terrain analysis can indeed be viewed as a valuable way of estimating flood-prone zones, especially in data-poor regions. The highest model agreement was generally found for large rivers in temperate or tropical climate regions. However, we do not wish to reduce hydrogeomorphic methods as mere substitutes for global flood models: the individual methodologies ultimately serve different purposes. Furthermore, floodplain maps built by hydrogeomorphic terrain analysis should be used with caution in regions that are dry, steep, very flat, or near the coast. The tendency by GFPLAIN to not cover coastal areas may in many cases affect the riverine flood exposure estimates, even on the country level, since coastal areas also tend to be densely populated.
Inter-model comparisons like this study do not answer the question of how well the individual flood layers agree with actual flood events. Future research can build upon this comparative study by conducting a validation analysis using flood delineation from satellite imagery, similar to the work by Bernhofen et al. (2018) or Hawker et al. (2020) but for a larger set of regions across the world (e.g. using the recent Global Flood Database; Tellman et al., 2021). Future studies can also investigate how model agreement varies with the fluvial geomorphology of the rivers (e.g. as given by the global river classification dataset GloRiC; Ouellet Dallaire et al., 2019) or land cover (e.g. as given by the ESA CCI 2015 land cover map; ESA, 2017).
To conclude, this study provided initial insights on how and where hydrogeomorphic floodplain maps could serve as candidates for identifying flood-prone areas in riverine flood risk studies. One particular benefit of hydrogeomorphic terrain analysis is that it does not require hydrological information to produce high-resolution floodplain maps whenever refined terrain models become available, also enabling the mapping of finer parts of the river network. This could particularly play an important role in riverine flood exposure analysis as the detail of the hazard layer needs to meet the detail of the settlement layer to avoid overestimations of flood exposure, which is especially important in dispersed rural regions.
Appendix A Uses hydrological simulations of Glo-FAS to derive river discharge. Gumbel extreme value distribution is used for fitting the daily annual discharge maxima to peak discharge maps for the respective return periods for all river basins with an upstream drainage area > 5000 km 2 and river width > 100 m. Streamflow is downscaled on the river network HydroSHEDS and used as input for local flood inundation simulations with a 2-D hydraulic model CA2D.  (Gaughan et al., 2013;Linard et al., 2012;Sorichetta et al., 2015; Pixel values indicate the number of inhabitants for the year 2018. GeoTIFF, 3 arcsec (∼ 90 m), global Uses a range of input datasets, e.g. land cover, roads, slope, nighttime lights. Built-up land cover is detected with satellite imagery of Landsat (30 m).
Uses a random-forest-based dasymetric modelling approach for allocating census counts to built-up areas.   -7. North America and Caribbean 470 0.291 0.149 * * * * * * * * * * * * * * * * * * Note: * p < 0.05, * * p < 0.01, * * * p < 0.001. n.s.: p > 0.05. Table A5. Summary statistics and pairwise comparisons of how MAI-500 (the model agreement index of GFPLAIN, GAR-500, and JRC-500) varies with different river stream orders. A Kruskal-Wallis test showed that there is a significant difference in model agreement between the river stream orders (H (3) = 101, p < 0.001). The pairwise comparisons, for detecting significant differences between the stream orders, have been conducted using the Wilcoxon rank-sum test with continuity correction.    Table A8. Total land area and flood-prone area (square kilometres) using GFPLAIN, JRC-100, JRC-500, GAR-100, and GAR-500. Land area is the total country area minus surface water area, given by the water mask MOD44W. The percentage of country area that is normally occurring surface water is less than 5 % for all countries, except Uganda with Lake Victoria (15 %) and Malawi with Lake Malawi (20 %). Code availability. The geospatial analysis has primarily been conducted in Google Earth Engine, and the statistical analysis has been conducted in R. The corresponding codes are made available upon request.
Competing interests. The contact author has declared that neither they nor their co-authors have any competing interests.
Disclaimer. Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.