Radix+: High-throughput georeferencing and data ingestion over voluminous and fast-evolving phenotyping sensor data
Name:
Concurrency_and_Computation_20 ...
Size:
2.901Mb
Format:
PDF
Description:
Final Published Version
Affiliation
Department of Biosystems Engineering, University of ArizonaIssue Date
2023-02-16Keywords
data ingestiondistributed analytics
georeferencing
high-throughput phenotyping
sensor
visualization
Metadata
Show full item recordPublisher
John Wiley and Sons LtdCitation
Mitra S, Roselius M, Andrade-Sanchez P, McKay JK, Pallickara SL. Radix+: High-throughput georeferencing and data ingestion over voluminous and fast-evolving phenotyping sensor data. Concurrency Computat Pract Exper. 2023; 35(8):e7484. doi: 10.1002/cpe.7484Rights
© 2023 The Authors. Concurrency and Computation: Practice and Experience published by John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution License.Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.Abstract
Remote sensing of plant traits and their environment facilitates non-invasive, high-throughput monitoring of the plant's physiological characteristics. However, voluminous observational data generated by such autonomous sensor networks overwhelms scientific users when they have to analyze the data. In order to provide a scalable and effective analysis environment, there is a need for storage and analytics that support high-throughput data ingestion while preserving spatiotemporal and sensor-specific characteristics. Also, the framework should enable modelers and scientists to run their analytics while coping with the fast and continuously evolving nature of the dataset. In this paper, we present Radix+, a high-throughput distributed data storage system for supporting scalable georeferencing, and interactive query-based spatiotemporal analytics with trackable data integrity. We include empirical evaluations performed on a commodity machine cluster with up to 1 TB of data. Our benchmarks demonstrate subsecond latency for majority of our evaluated queries and (Formula presented.) improvement in data ingestion rate over systems such as Geomesa. © 2023 The Authors. Concurrency and Computation: Practice and Experience published by John Wiley & Sons Ltd.Note
Open access articleISSN
1532-0626DOI
10.1002/cpe.7484Version
Final Published Versionae974a485f413a2113503eed53cd6c53
10.1002/cpe.7484
Scopus Count
Collections
Except where otherwise noted, this item's license is described as © 2023 The Authors. Concurrency and Computation: Practice and Experience published by John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution License.