Toward a Multi-Representational Approach to Prediction and Understanding, in Support of Discovery in Hydrology
Name:
Toward_Multi‐Representational.pdf
Size:
1.858Mb
Format:
PDF
Description:
Final Published Version
Affiliation
Department of Hydrology and Atmospheric Sciences, The University of ArizonaIssue Date
2022-12-27Keywords
catchmentsconceptual model
discovery
GR4J
hydro-geo-climatology
hydrological processes
LSTM
lumped water balance model
machine learning
Random Forest
representation
understanding
Metadata
Show full item recordPublisher
John Wiley and Sons IncCitation
De la Fuente, L. A., Gupta, H. V., & Condon, L. E. (2023). Toward a multi-representational approach to prediction and understanding, in support of discovery in hydrology. Water Resources Research, 59, e2021WR031548. https://doi.org/10.1029/2021WR031548Journal
Water Resources ResearchRights
© 2022. The Authors. This is an open access article under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License.Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.Abstract
Key to model development is the selection of an appropriate representational system, including both the representation of what is observed (the data), and the formal mathematical structure used to construct the input-state-output mapping. These choices are critical, because they completely determine the questions we can ask, the nature of the analyses and inferences we can perform, and the answers we can obtain. Accordingly, a representation that is suitable for one kind of investigation might be limited in its ability to support some other kind. Arguably, how different representational approaches affect what we can learn from data is poorly understood. This paper explores three representational strategies as vehicles for understanding how catchment scale hydrological processes vary across hydro-geo-climatologically diverse Chile. Specifically, we test a lumped water-balance model (GR4J), a data-based dynamical systems model (LSTM), and a data-based regression tree model (Random Forest). Insights were obtained regarding system memory encoded in data, spatial transferability by use of surrogate attributes, and informational deficiencies of the data set that limit our ability to learn an adequate input-output relationship. As expected, each approach exhibits specific strengths, with LSTM providing the best characterization of dynamics, GR4J being the most robust under informationally deficient conditions, and Random Forest regression-tree method being most supportive of interpretation. Overall, the contrasting nature of the three approaches suggests the value of adopting a multi-representational framework to more fully extract information from the data and, by doing so, find information that better facilities the goals of robust prediction and improved understanding, ultimately supporting enhanced scientific discovery. © 2022. The Authors.Note
Open access articleISSN
0043-1397Version
Final Published Versionae974a485f413a2113503eed53cd6c53
10.1029/2021WR031548
Scopus Count
Collections
Except where otherwise noted, this item's license is described as © 2022. The Authors. This is an open access article under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License.