Highly accurate long-read HiFi sequencing data for five complex genomes
Name:
s41597-020-00743-4.pdf
Size:
1.495Mb
Format:
PDF
Description:
Final Published Version
Author
Hon, Ting
Mars, Kristin
Young, Greg
Tsai, Yu-Chih
Karalius, Joseph W
Landolin, Jane M
Maurer, Nicholas
Kudrna, David
Hardigan, Michael A
Steiner, Cynthia C
Knapp, Steven J
Ware, Doreen
Shapiro, Beth
Peluso, Paul
Rank, David R
Affiliation
Univ Arizona, Arizona Genom Inst
Univ Arizona, Sch Plant Sci
Issue Date
2020-11-17
Publisher
NATURE RESEARCH
Citation
Hon, T., Mars, K., Young, G., Tsai, Y. C., Karalius, J. W., Landolin, J. M., ... & Rank, D. R. (2020). Highly accurate long-read HiFi sequencing data for five complex genomes. Scientific Data, 7(1), 1-11.
Journal
SCIENTIFIC DATA
Rights
© The Author(s) 2020. Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License. The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.
Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.
Abstract
The PacBio® HiFi sequencing method yields highly accurate long-read sequencing datasets with read lengths averaging 10-25 kb and accuracies greater than 99.5%. These accurate long reads can be used to improve results for complex applications such as single nucleotide and structural variant detection, genome assembly, assembly of difficult polyploid or highly repetitive genomes, and assembly of metagenomes. Currently, there is a need for sample data sets to both evaluate the benefits of these long accurate reads as well as for development of bioinformatic tools including genome assemblers, variant callers, and haplotyping algorithms. We present deep coverage HiFi datasets for five complex samples including the two inbred model genomes Mus musculus and Zea mays, as well as two complex genomes, octoploid Fragaria × ananassa and the diploid anuran Rana muscosa. Additionally, we release sequence data from a mock metagenome community. The datasets reported here can be used without restriction to develop new algorithms and explore complex genome structure and evolution. Data were generated on the PacBio Sequel II System.
Note
Open access journal
ISSN
2052-4463
EISSN
2052-4463
PubMed ID
33203859
Version
Final published version
DOI
10.1038/s41597-020-00743-4
Related articles
- Highly accurate long reads are crucial for realizing the potential of biodiversity genomics.
- Authors: Hotaling S, Wilcox ER, Heckenhauer J, Stewart RJ, Frandsen PB
- Issue date: 2023 Mar 16
- Comprehensive assessment of 11 de novo HiFi assemblers on complex eukaryotic genomes and metagenomes.
- Authors: Yu W, Luo H, Yang J, Zhang S, Jiang H, Zhao X, Hui X, Sun D, Li L, Wei XQ, Lonardi S, Pan W
- Issue date: 2024 Mar 20
- Improved Assembly of Metagenome-Assembled Genomes and Viruses in Tibetan Saline Lake Sediment by HiFi Metagenomic Sequencing.
- Authors: Tao Y, Xun F, Zhao C, Mao Z, Li B, Xing P, Wu QL
- Issue date: 2023 Feb 14
- Long-Read Sequencing Improves Recovery of Picoeukaryotic Genomes and Zooplankton Marker Genes from Marine Metagenomes.
- Authors: Patin NV, Goodwin KD
- Issue date: 2022 Dec 20
- Linked read technology for assembling large complex and polyploid genomes.
- Authors: Ott A, Schnable JC, Yeh CT, Wu L, Liu C, Hu HC, Dalgard CL, Sarkar S, Schnable PS
- Issue date: 2018 Sep 4