NCBI's Virus Discovery Codeathon: Building "FIVE" - The Federated Index of Viral Experiments API Index
Name:
viruses-12-01424-v3.pdf
Size:
2.070Mb
Format:
PDF
Description:
Final Published Version
Author
Martí-Carreras, J.
Gener, A.R.
Miller, S.D.
Brito, A.F.
Camacho, C.E.
Connor, R.
Deboutte, W.
Glickman, C.
Kristensen, D.M.
Meyer, W.K.
Modha, S.
Norris, A.L.
Saha, S.
Belford, A.K.
Biederstedt, E.
Brister, J.R.
Buchmann, J.P.
Cooley, N.P.
Edwards, R.A.
Javkar, K.
Muchow, M.
Muralidharan, H.S.
Pepe-Ranney, C.
Shah, N.
Shakya, M.
Tisza, M.J.
Tully, B.J.
Vanmechelen, B.
Virta, V.C.
Weissman, J.L.
Zalunin, V.
Efremov, A.
Busby, B.
Affiliation
School of Animal and Comparative Biomedical Sciences, University of Arizona
Issue Date
2020
Publisher
MDPI
Citation
Martí-Carreras, J., Gener, A. R., Miller, S. D., Brito, A. F., Camacho, C. E., Connor, R., ... & Busby, B. (2020). NCBI’s Virus Discovery Codeathon: Building “FIVE”—The Federated Index of Viral Experiments API Index. Viruses, 12(12), 1424.
Journal
Viruses
Rights
Copyright © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.
Abstract
Viruses represent important test cases for data federation due to their genome size and the rapid increase in sequence data in publicly available databases. However, one consequence of previously decentralized (unfederated) data is a lack of consensus on, or comparison between, feature annotations. Unifying or displaying alternative annotations should be a priority both for communities with robust entry representation and for nascent communities with burgeoning data sources. To this end, during this three-day continuation of the Virus Hunting Toolkit codeathon series (VHT-2), a new integrated and federated viral index was developed. This Federated Index of Viral Experiments (FIVE) integrates pre-existing and novel functional and taxonomy annotations and virus-host pairings. Variability in the context of viral genomic diversity is often overlooked in virus databases. As a proof of concept, FIVE was the first attempt to include viral genome variation for HIV, the most well-studied human pathogen, through viral genome diversity graphs. As of the publication of this manuscript, FIVE is the first implementation of a virus-specific federated index of such scope. FIVE is coded in BigQuery for efficient access to large quantities of data and is publicly accessible. Many database or index federation projects fail to provide easier ways to access or query information. To address this, a Python API query system was developed to enhance the accessibility of FIVE.
Note
Open access journal
ISSN
1999-4915
PubMed ID
33322070
Version
Final published version
DOI
10.3390/v12121424

