A Robust Strategy to Account for Data Sampling Variability in the Development of Hydrological Models
Affiliation
Department of Hydrology and Atmospheric Sciences, The University of ArizonaIssue Date
2023-02-21Keywords
data sampling variabilityhydrological model
model calibration
stochastic gradient descent
uncertainty analysis
Metadata
Show full item recordPublisher
John Wiley and Sons IncCitation
Zheng, F., Chen, J., Ma, Y., Chen, Q., Maier, H. R., & Gupta, H. (2023). A robust strategy to account for data sampling variability in the development of hydrological models. Water Resources Research, 59, e2022WR033703. https://doi.org/10.1029/2022WR033703Journal
Water Resources ResearchRights
© 2023. American Geophysical Union. All Rights Reserved.Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.Abstract
It is typical to use a single portion of the available data to calibrate hydrological models, and the remainder for model evaluation. To minimize model-bias, this partitioning must be performed so as to ensure distributional representativeness and mutual consistency. However, failure to account for data sampling variability (DSV) in the underlying Data Generating Process can weaken the model's generalization performance. While “K-fold cross-validation” can mitigate this problem, it is computationally inefficient since the calibration/evaluation operations must be repeated numerous times. This paper develops a general strategy for stochastic evolutionary parameter optimization (SEPO) that explicitly accounts for DSV when calibrating a model using any population-based evolutionary optimization algorithm (EOA), such as Shuffled Complex Evolution (SCE). Inspired in part by the machine-learning strategy of stochastic gradient descent (SGD), we use various representative random sub-samples to drive the EOA toward the distribution of the model parameters. Unlike in SGD, derivative information is not required and hence SEPO can be applied to any hydrological model where such information is not readily available. To demonstrate the effectiveness of the proposed strategy, we implement it within the well-known SCE, to calibrate the GR4J conceptual rainfall-runoff model to 163 hydro-climatically diverse catchments. Using only a single optimization run, our Stochastic SCE method converges to population-based estimates of model parameter distributions (and corresponding simulation uncertainties), without compromising model performance during either calibration or evaluation. Further, it effectively reduces the need to perform independent evaluation tests of model performance under conditions that are represented by the available data. © 2023. American Geophysical Union. All Rights Reserved.Note
6 month embargo; 21 February 2023ISSN
0043-1397Version
Final Published Versionae974a485f413a2113503eed53cd6c53
10.1029/2022WR033703
