How uncertain are precipitation and peakflow estimates for the July 2021 flooding event?
- 1Institute for Bio- and Geosciences (IBG-3, Agrosphere), Forschungszentrum Jülich, Jülich 52425, Germany
- 2Centre for High-Performance Scientific Computing in Terrestrial Systems, Geoverbund ABC/J, Jülich 52428, Germany
- 3Institute for Geosciences, Department of Meteorology, Universität Bonn, Bonn 53121, Germany
- 4Laboratory for Clouds and Precipitation Exploration, Geoverbund ABC/J, Bonn 53121, Germany
- 1Institute for Bio- and Geosciences (IBG-3, Agrosphere), Forschungszentrum Jülich, Jülich 52425, Germany
- 2Centre for High-Performance Scientific Computing in Terrestrial Systems, Geoverbund ABC/J, Jülich 52428, Germany
- 3Institute for Geosciences, Department of Meteorology, Universität Bonn, Bonn 53121, Germany
- 4Laboratory for Clouds and Precipitation Exploration, Geoverbund ABC/J, Bonn 53121, Germany
Abstract. The disastrous July 2021 flooding events made us question the ability of current hydrometeorological tools in providing timely and reliable flood forecasts. This is an urgent concern since extreme events are increasing due to global warming. For the July 2021 events, we simulated the hourly streamflows of seven catchments in western Germany, by combining five, partly polarimetric, radar-based quantitative precipitation estimates (QPE) with two hydrological models: a conceptual lumped model (GR4H) and a physically-based, 3D distributed model (ParFlowCLM). GR4H parameters were calibrated with emphasis on high flows using historical discharge observations, whereas ParFlowCLM parameters were estimated based on landscape and soil properties. The key results are as follows: (1) All radar-based QPE products underestimated the total precipitation depth relatively to rain gauges due to intense collision-coalescence processes near the surface, i.e. below the height levels monitored by the radars. (2) The use of polarimetric radar variables led to clear improvements compared to reflectivity-based QPE products. (3) The probability of exceeding the highest measured peakflow before July 2021 was highly impacted by the QPE product, and depended on the catchment for both models. (4) The estimation of model parameters had a larger impact than the choice of QPE product, but simulated peakflows of ParFlowCLM agreed with those of GR4H for five of the seven catchments. This study highlights the need for the correction of vertical profiles of reflectivity and other polarimetric variables near the surface to improve radar-based QPE for extreme floods. It also underlines the large uncertainty in peakflow estimates due to model parameter estimation.
-
Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
-
Preprint
(1427 KB)
-
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
Journal article(s) based on this preprint
Mohamed Saadi et al.
Interactive discussion
Status: closed
-
RC1: 'referee comment on nhess-2022-111', Anonymous Referee #1, 03 May 2022
Review of manuscript “How uncertain are precipitation and peakflow estimates for the July
2021 flooding event?” by Mohamed Saadi et al., submitted to NHESS
The authors present a modelling study targeted at evaluating different rainfall products and two hydrological models to simulate the flood event of 2021 in West Germany, in order to sow the uncertainties in quantitative precipitation estimates (QPE) and the modelling. In general, the study provides insights in the usefulness and weaknesses of the different radar-based rainfall products and their use in simulating extreme flood events. The manuscripts is overall well written and structured.
However, I have some reservations to the conclusions drawn, mainly because of the study design, in particular the hydrological modelling part. My concerns are as follows:
Different model parameterizations (ParFlowCLM) and calibrations (GR4H) were derived and later used without any differentiation of their performance simulating the historic period. This is actually hindering a proper evaluation of the QPEs, because poor performing hydrological models might be (or are) used to simulate the flood in 2021 with the different QPEs. I strongly recommend to list the performance of the different model parameterizations/calibrations and sort out poor performing ones. In any case the model performances should be provided by the Nash-Sutcliffe and Kling-Gupta performance measures, because these were already calculated. The claim of the authors that the model parameterization/calibration has a larger impact than the QPEs is not that surprising, considering the sensitivity-analysis-like selection of the parameters and calibration routines. The conclusions towards selecting a particular QPE would be more meaningful, if only well performing models for flood events (high discharge) during the calibration period would be used.
The parameterization of ParFlowCLM with uniformly distributed roughness values is very unrealistic for these catchments with diverse land uses, i.e. land surface properties. I am surprised that such a simplistic approach is used for such a sophisticated, physically based and spatially distributed model. Thus I strongly recommend to re-run the simulation with distributed roughness values estimated based on land use and standard roughness values, as mentioned in the outlook. This would give the ParFlowCLM simulation much more credibility.
For the GR4H model I find using the calibration not focussing on extremes for the analysis of the QPEs not convincing, because a conceptual model calibrated on mean flow is unlikely to get the peak discharges of floods right, and should thus not be used for evaluating the QPEs. You might prove me wrong listing the performance values.
Furthermore, some of the comparisons/evaluations of the QPEs and simulations are based on comparison with uncertain or unknown quantities. The missing flood hydrographs are a major obstacle here. Meanwhile reconstructed flood hydrographs are available at least for the catchments in Rhineland-Palatine by the Landesamt fuer Umwelt (LfU). Similar data should be available from the authorities in Northrhine-Westphalia. These hydrographs can be seen as the best estimate of the actual flood event. I strongly recommend to obtain these data sets. This would increase the impact of the evaluation in terms of ability to simulate the flood 2021 significantly.
Another point: the comparison of the catchment average precipitation used the Thiessen polygons as reference, but these values are also very uncertain. Thus, the general statement that some of the QPEs outperform RADOLAN in catchment average is actually not supported. You only show that these products are closer to the uncertain catchment average based on rain gauges. Which of the QPEs is actually closer to reality cannot be derived form this comparison. This should be mentioned.
I am also missing the discussion of hydrologic processes that might become relevant or only occur during extreme floods. This is a generally ongoing discussion in hydrology, but for this particular event the increase interflow and thus runoff generation by field drainage pipes or the creation of additional drainage channels by erosion has been reported. Unfortunately, this is not published yet, thus you cannot cite it, but there should be reports in newspapers or by the authorities available.
The role of the antecedent soil moisture has been briefly discussed in the manuscript, but studies for its impact on flood generation has been given as an outlook only. I wonder about two aspects: First, the used initial soil moisture for the simulation of the flood 2021: what initial soil moisture was assumed? Was it assumed dry, a guess of some wetness, or maybe based on satellite observation? Or did you use the hydrological simulations until the event to prime the model for the flood simulation? In the latter case the antecedent soil moisture should be realistic to some extent. If assumed, some justification or at least explanation has to be given. Second, an interesting aspect would be if the flood would have been different if the soil was in different state (drier, wetter) than in reality. You mentioned this in the outlook, and this is surely worth investigating, as the role of antecedent soil moisture is likely to differ in different flood/rainfall situations. If you have any capacities, I recommend to include this aspect, and drop the discussion of the simulation results of poor performing models.
In addition to these general comments, I have some more specific comments in the annotated manuscript.
- AC1: 'Reply on RC1', Mohamed Saadi, 18 Jul 2022
-
RC2: 'Comment on nhess-2022-111', Anonymous Referee #2, 13 Jun 2022
This work aims to investigate the influence of using a set of different radar-based QPE and different hydrological models on the uncertainties in simulating the record-breaking July 2021 flood event in Germany. Given the lack of peak flow information (the flood partly destroyed the monitoring systems), the analysis is focused on the probability that the simulated peakflow exceeds the highest historically observed peakflow before the flood. This is a very interesting point of view, given the challenges offered by the prediction of a record breaking flood to both precipitation estimation and hydrological prediction. The work is appropriate for NHESS and its readership.
The manuscript is broadly well written and well structured. However, there are some specific issues listed below that should be considered before acceptance.
- Better identifying the main focus of the work. The July 2021 flood in Germany is not only a record-breaking flood. It is a flood that far exceeded previously observed records (the authors could report existing post flood estimates that shows how far the estimated July 2021 peak exceeded the previous records). Of course, existing methods and models for flood forecasting cannot predict these floods well because flood generation processes of large extremes differ from those of smaller, more frequently observed events. Therefore, research aiming precisely to this issue by considering these kind of megafloods is timely and helpful. However, this point is completely ignored in the abstract, and it is elaborated relatively late in the introduction.
- The point (L205-2010) made on the different results obtained based on considering raingauges and raingauge-based catchment-scale precipitation estimates is someway misleading. First, it totally ignores the uncertainty in the catchment-scale estimates based on raingauges (and here I urge the authors to consider techniques better than Thiessen for this). Second, this conclusion obviously depends on the set of raingauges considered. If the reference raingauges are those considered for estimating the catchment-scale precipitation, I doubt outcomes may be different. By the way, this conclusion is missed in the conclusion section.
- The point (L254-256) about the causes leading to the strong underestimation (For the 14 July 2021 event, this underestimation may be explained by intense collision-coalescence processes taking place close to the surface..) lacks any ground. I mean: it is likely that collision-coalescence processes may cause those underestimation, but this attribution needs a far better explanation.
- Information on how antecedent conditions were computed, and about the accuracy of these estimates, is missing, in spite of the critical role this information have on the sensitivity of the model to QPE error.
- The parameter uncertainty of ParFlowCLM is strongly underestimated when focusing only on Manning values, as the authors did. At least they should do a better job considering uncertainty in the information about soil properties (lets only think to soil depth).
- The use of English in the paper, while of a reasonably high standard, contains many idiosyncrasies, like the sentence: “The QPE impacted both GR4H and ParFlowCLM simulations”, where ‘Errors in the QPE impacted both…’ is more likely.
- References are missing lot of standard information.
- AC2: 'Reply on RC2', Mohamed Saadi, 18 Jul 2022
Peer review completion






Interactive discussion
Status: closed
-
RC1: 'referee comment on nhess-2022-111', Anonymous Referee #1, 03 May 2022
Review of manuscript “How uncertain are precipitation and peakflow estimates for the July
2021 flooding event?” by Mohamed Saadi et al., submitted to NHESS
The authors present a modelling study targeted at evaluating different rainfall products and two hydrological models to simulate the flood event of 2021 in West Germany, in order to sow the uncertainties in quantitative precipitation estimates (QPE) and the modelling. In general, the study provides insights in the usefulness and weaknesses of the different radar-based rainfall products and their use in simulating extreme flood events. The manuscripts is overall well written and structured.
However, I have some reservations to the conclusions drawn, mainly because of the study design, in particular the hydrological modelling part. My concerns are as follows:
Different model parameterizations (ParFlowCLM) and calibrations (GR4H) were derived and later used without any differentiation of their performance simulating the historic period. This is actually hindering a proper evaluation of the QPEs, because poor performing hydrological models might be (or are) used to simulate the flood in 2021 with the different QPEs. I strongly recommend to list the performance of the different model parameterizations/calibrations and sort out poor performing ones. In any case the model performances should be provided by the Nash-Sutcliffe and Kling-Gupta performance measures, because these were already calculated. The claim of the authors that the model parameterization/calibration has a larger impact than the QPEs is not that surprising, considering the sensitivity-analysis-like selection of the parameters and calibration routines. The conclusions towards selecting a particular QPE would be more meaningful, if only well performing models for flood events (high discharge) during the calibration period would be used.
The parameterization of ParFlowCLM with uniformly distributed roughness values is very unrealistic for these catchments with diverse land uses, i.e. land surface properties. I am surprised that such a simplistic approach is used for such a sophisticated, physically based and spatially distributed model. Thus I strongly recommend to re-run the simulation with distributed roughness values estimated based on land use and standard roughness values, as mentioned in the outlook. This would give the ParFlowCLM simulation much more credibility.
For the GR4H model I find using the calibration not focussing on extremes for the analysis of the QPEs not convincing, because a conceptual model calibrated on mean flow is unlikely to get the peak discharges of floods right, and should thus not be used for evaluating the QPEs. You might prove me wrong listing the performance values.
Furthermore, some of the comparisons/evaluations of the QPEs and simulations are based on comparison with uncertain or unknown quantities. The missing flood hydrographs are a major obstacle here. Meanwhile reconstructed flood hydrographs are available at least for the catchments in Rhineland-Palatine by the Landesamt fuer Umwelt (LfU). Similar data should be available from the authorities in Northrhine-Westphalia. These hydrographs can be seen as the best estimate of the actual flood event. I strongly recommend to obtain these data sets. This would increase the impact of the evaluation in terms of ability to simulate the flood 2021 significantly.
Another point: the comparison of the catchment average precipitation used the Thiessen polygons as reference, but these values are also very uncertain. Thus, the general statement that some of the QPEs outperform RADOLAN in catchment average is actually not supported. You only show that these products are closer to the uncertain catchment average based on rain gauges. Which of the QPEs is actually closer to reality cannot be derived form this comparison. This should be mentioned.
I am also missing the discussion of hydrologic processes that might become relevant or only occur during extreme floods. This is a generally ongoing discussion in hydrology, but for this particular event the increase interflow and thus runoff generation by field drainage pipes or the creation of additional drainage channels by erosion has been reported. Unfortunately, this is not published yet, thus you cannot cite it, but there should be reports in newspapers or by the authorities available.
The role of the antecedent soil moisture has been briefly discussed in the manuscript, but studies for its impact on flood generation has been given as an outlook only. I wonder about two aspects: First, the used initial soil moisture for the simulation of the flood 2021: what initial soil moisture was assumed? Was it assumed dry, a guess of some wetness, or maybe based on satellite observation? Or did you use the hydrological simulations until the event to prime the model for the flood simulation? In the latter case the antecedent soil moisture should be realistic to some extent. If assumed, some justification or at least explanation has to be given. Second, an interesting aspect would be if the flood would have been different if the soil was in different state (drier, wetter) than in reality. You mentioned this in the outlook, and this is surely worth investigating, as the role of antecedent soil moisture is likely to differ in different flood/rainfall situations. If you have any capacities, I recommend to include this aspect, and drop the discussion of the simulation results of poor performing models.
In addition to these general comments, I have some more specific comments in the annotated manuscript.
- AC1: 'Reply on RC1', Mohamed Saadi, 18 Jul 2022
-
RC2: 'Comment on nhess-2022-111', Anonymous Referee #2, 13 Jun 2022
This work aims to investigate the influence of using a set of different radar-based QPE and different hydrological models on the uncertainties in simulating the record-breaking July 2021 flood event in Germany. Given the lack of peak flow information (the flood partly destroyed the monitoring systems), the analysis is focused on the probability that the simulated peakflow exceeds the highest historically observed peakflow before the flood. This is a very interesting point of view, given the challenges offered by the prediction of a record breaking flood to both precipitation estimation and hydrological prediction. The work is appropriate for NHESS and its readership.
The manuscript is broadly well written and well structured. However, there are some specific issues listed below that should be considered before acceptance.
- Better identifying the main focus of the work. The July 2021 flood in Germany is not only a record-breaking flood. It is a flood that far exceeded previously observed records (the authors could report existing post flood estimates that shows how far the estimated July 2021 peak exceeded the previous records). Of course, existing methods and models for flood forecasting cannot predict these floods well because flood generation processes of large extremes differ from those of smaller, more frequently observed events. Therefore, research aiming precisely to this issue by considering these kind of megafloods is timely and helpful. However, this point is completely ignored in the abstract, and it is elaborated relatively late in the introduction.
- The point (L205-2010) made on the different results obtained based on considering raingauges and raingauge-based catchment-scale precipitation estimates is someway misleading. First, it totally ignores the uncertainty in the catchment-scale estimates based on raingauges (and here I urge the authors to consider techniques better than Thiessen for this). Second, this conclusion obviously depends on the set of raingauges considered. If the reference raingauges are those considered for estimating the catchment-scale precipitation, I doubt outcomes may be different. By the way, this conclusion is missed in the conclusion section.
- The point (L254-256) about the causes leading to the strong underestimation (For the 14 July 2021 event, this underestimation may be explained by intense collision-coalescence processes taking place close to the surface..) lacks any ground. I mean: it is likely that collision-coalescence processes may cause those underestimation, but this attribution needs a far better explanation.
- Information on how antecedent conditions were computed, and about the accuracy of these estimates, is missing, in spite of the critical role this information have on the sensitivity of the model to QPE error.
- The parameter uncertainty of ParFlowCLM is strongly underestimated when focusing only on Manning values, as the authors did. At least they should do a better job considering uncertainty in the information about soil properties (lets only think to soil depth).
- The use of English in the paper, while of a reasonably high standard, contains many idiosyncrasies, like the sentence: “The QPE impacted both GR4H and ParFlowCLM simulations”, where ‘Errors in the QPE impacted both…’ is more likely.
- References are missing lot of standard information.
- AC2: 'Reply on RC2', Mohamed Saadi, 18 Jul 2022
Peer review completion






Journal article(s) based on this preprint
Mohamed Saadi et al.
Mohamed Saadi et al.
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
432 | 185 | 17 | 634 | 9 | 7 |
- HTML: 432
- PDF: 185
- XML: 17
- Total: 634
- BibTeX: 9
- EndNote: 7
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(1427 KB) - Metadata XML