Genome puzzle master (GPM): an integrated pipeline for building and editing pseudomolecules from fragmented sequences
Name:
Bioinformatics-2016-Zhang-3058 ...
Size:
602.4Kb
Format:
PDF
Description:
Final Published Version
Name:
Suppl_Table_S1-05-30.docx
Size:
27.46Kb
Format:
Microsoft Word
Description:
Supplementary Data - docx file
Name:
Suppl_Table_S2-05-30.xlsx
Size:
12.44Kb
Format:
Microsoft Excel
Description:
Supplementary Data - xlsx file
Author
Zhang, JianweiKudrna, Dave
Mu, Ting
Li, Weiming
Copetti, Dario
Yu, Yeisoo
Goicoechea, Jose Luis
Lei, Yang
Wing, Rod A.
Affiliation
Arizona Genomics Institute and BIO5 Institute, School of Plant Sciences, University of ArizonaIssue Date
2016-10-15
Metadata
Show full item recordPublisher
OXFORD UNIV PRESSCitation
Genome puzzle master (GPM): an integrated pipeline for building and editing pseudomolecules from fragmented sequences 2016, 32 (20):3058 BioinformaticsJournal
BioinformaticsRights
© The Author 2016. Published by Oxford University Press. 3058 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/).Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.Abstract
Motivation: Next generation sequencing technologies have revolutionized our ability to rapidly and affordably generate vast quantities of sequence data. Once generated, raw sequences are assembled into contigs or scaffolds. However, these assemblies are mostly fragmented and inaccurate at the whole genome scale, largely due to the inability to integrate additional informative datasets (e.g. physical, optical and genetic maps). To address this problem, we developed a semi-automated software tool-Genome Puzzle Master (GPM)-that enables the integration of additional genomic signposts to edit and build 'new-gen-assemblies' that result in high-quality 'annotation-ready' pseudomolecules. Results: With GPM, loaded datasets can be connected to each other via their logical relationships which accomplishes tasks to 'group,' 'merge,' 'order and orient' sequences in a draft assembly. Manual editing can also be performed with a user-friendly graphical interface. Final pseudomolecules reflect a user's total data package and are available for long-term project management. GPM is a web-based pipeline and an important part of a Laboratory Information Management System (LIMS) which can be easily deployed on local servers for any genome research laboratory.Note
Open access article.ISSN
1367-48031460-2059
Version
Final published versionSponsors
Start-up Fund of National Key Laboratory of Crop Genetic Improvement; National Science Foundation Grant [1026200]Additional Links
https://academic.oup.com/bioinformatics/article-lookup/doi/10.1093/bioinformatics/btw370ae974a485f413a2113503eed53cd6c53
10.1093/bioinformatics/btw370
Scopus Count
Collections
Except where otherwise noted, this item's license is described as © The Author 2016. Published by Oxford University Press. 3058 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/).