Articles | Volume 26, issue 5
https://doi.org/10.5194/nhess-26-2353-2026
© Author(s) 2026. This work is distributed under the Creative Commons Attribution 4.0 License.
Enhancing hydrological hazard early warning: a 60 d streamflow forecasting framework integrating deep learning and process-based modeling
Download
- Final revised paper (published on 26 May 2026)
- Supplement to the final revised paper
- Preprint (discussion started on 12 Feb 2026)
- Supplement to the preprint
Interactive discussion
Status: closed
Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
| : Report abuse
-
RC1: 'Comment on egusphere-2026-393', Ningpeng Dong, 08 Mar 2026
- AC1: 'Reply on RC1', Zhijie Liu, 31 Mar 2026
-
RC2: 'Comment on egusphere-2026-393', Samantha Hartke, 17 Mar 2026
- AC2: 'Reply on RC2', Zhijie Liu, 31 Mar 2026
Peer review completion
AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload
ED: Reconsider after major revisions (further review by editor and referees) (10 Apr 2026) by Zhe Li
AR by Zhijie Liu on behalf of the Authors (20 Apr 2026)
Author's response
Author's tracked changes
Manuscript
ED: Referee Nomination & Report Request started (07 May 2026) by Zhe Li
RR by Ningpeng Dong (07 May 2026)
ED: Publish as is (08 May 2026) by Zhe Li
AR by Zhijie Liu on behalf of the Authors (14 May 2026)
Greetings! This study proposes a 60-day streamflow forecasting framework for the Upper Yangtze River Basin that integrates a convolutional neural network for precipitation bias correction, a hydrological model for runoff simulation, and an autoregressive model for error post-processing. The paper is generally well-written, and below are some of my comments that I hope will help improve the paper.
My first concern is about the temporal partitioning of the CNN model training. The paper mentions the CNN is trained on a 20-year data using cross-validation, yet the specific years assigned to each fold should also be elaborated. Since the final streamflow evaluation covers the period from 2009 to 2012, there is a risk of data leakage if any of those years were included in the training set. It is also unclear why the calibration periods for the GBEHM and ARX components differ from one another and from the CNN training period.
Second, the CNN generates probabilistic precipitation forecasts via the CSG distribution, which is a well-motivated design choice. However, this statistical information seems to be discarded before feeding to the hydrological model. Is this an intended choice and why did authors choose that? For a system intended for hazard early warning, I think including uncertainty information through the modelling chain would greatly enhance its reliability.
Third, more evaluation metrics targeting hazards (i.e., the topic of this paper) could be introduced. NSE and MSE are often dominated by baseflow conditions and do not necessarily reflect a model performance during extreme events. I recommend the authors select representative flood events from the 2009-2012 evaluation period and present event-scale forecast performance, such as peak flow errors to manifest the model ability for hazard warning.
Finally, the paper would be much stronger if it provides more technical detail on the GBEHM calibration and its operational feasibility. There is currently little information on which specific parameters were tuned. The study states “In this application, the UYRB is discretized into an 8 km × 8 km grid system and further delineated into 479 sub-basins based on the DEM”, which makes me confused. Is the model grid-based or sub-basin based? More details could be provided. The manuscript would also benefit from a brief discussion of operational feasibility, for example, if the modelling system can be applied in operation, and to achieve that what are the challenges and possible solutions, etc.
Best,
Ningpeng Dong