On the choice of calibration metrics for “high-flow” estimation using hydrologic models
Author
Mizukami, NaokiRakovec, Oldrich
Newman, Andrew J.
Clark, Martyn P.
Wood, Andrew W.
Gupta, Hoshin V.
Kumar, Rohini
Affiliation
Univ Arizona, Dept Hydrol & Atmospher SciIssue Date
2019-06-17
Metadata
Show full item recordPublisher
COPERNICUS GESELLSCHAFT MBHCitation
Mizukami, N., Rakovec, O., Newman, A. J., Clark, M. P., Wood, A. W., Gupta, H. V., & Kumar, R. (2019). On the choice of calibration metrics for “high-flow” estimation using hydrologic models. Hydrology and Earth System Sciences, 23(6), 2601-2614.Rights
© Author(s) 2019. This work is distributed under the Creative Commons Attribution 4.0 License.Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.Abstract
Calibration is an essential step for improving the accuracy of simulations generated using hydrologic models. A key modeling decision is selecting the performance metric to be optimized It has been common to use squared error performance metrics, or normalized variants such as Nash-Sutcliffe efficiency (NSE), based on the idea that their squared-error nature will emphasize the estimates of high flows. However, we conclude that NSE-based model calibrations actually result in poor reproduction of high-flow events, such as the annual peak flows that are used for flood frequency estimation. Using three different types of performance metrics, we calibrate two hydrological models at a daily step, the Variable Infiltration Capacity (VIC) model and the mesoscale Hydrologic Model (mHM), and evaluate their ability to simulate high-flow events for 492 basins throughout the contiguous United States. The metrics investigated are (1) NSE, (2) Kling-Gupta efficiency (KGE) and its variants, and (3) annual peak flow bias (APFB), where the latter is an application-specific metric that focuses on annual peak flows. As expected, the APFB metric produces the best annual peak flow estimates; however, performance on other high-flow-related metrics is poor. In contrast, the use of NSE results in annual peak flow estimates that are more than 20 % worse, primarily due to the tendency of NSE to underestimate observed flow variability. On the other hand, the use of KGE results in annual peak flow estimates that are better than from NSE, owing to improved flow time series metrics (mean and variance), with only a slight degradation in performance with respect to other related metrics, particularly when a non-standard weighting of the components of KGE is used. Stochastically generated ensemble simulations based on model residuals show the ability to improve the high-flow metrics, regardless of the deterministic performances. However, we emphasize that improving the fidelity of streamflow dynamics from deterministically calibrated models is still important, as it may improve high-flow metrics (for the right reasons). Overall, this work highlights the need for a deeper understanding of performance metric behavior and design in relation to the desired goals of model calibration.Note
Open access journalISSN
1027-5606EISSN
1607-7938Version
Final published versionae974a485f413a2113503eed53cd6c53
10.5194/hess-23-2601-2019
Scopus Count
Collections
Except where otherwise noted, this item's license is described as © Author(s) 2019. This work is distributed under the Creative Commons Attribution 4.0 License.

