Genome assembly of the JD17 soybean provides a new reference genome for comparative genomics
Author
Yi, X.Liu, J.
Chen, S.
Wu, H.
Liu, M.
Xu, Q.
Lei, L.
Lee, S.
Zhang, B.
Kudrna, D.
Fan, W.
Wing, R.A.
Wang, X.

Zhang, M.
Zhang, J.
Yang, C.
Chen, N.
Affiliation
Arizona Genomics Institute, University of ArizonaBIO5 Institute, School of Plant Sciences, University of Arizona
Issue Date
2022
Metadata
Show full item recordPublisher
Oxford AcademicCitation
Yi, X., Liu, J., Chen, S., Wu, H., Liu, M., Xu, Q., Lei, L., Lee, S., Zhang, B., Kudrna, D., Fan, W., Wing, R. A., Wang, X., Zhang, M., Zhang, J., Yang, C., & Chen, N. (2022). Genome assembly of the JD17 soybean provides a new reference genome for comparative genomics. G3 (Bethesda, Md.).Journal
G3 (Bethesda, Md.)Rights
Copyright © The Author(s) 2022. Published by Oxford University Press on behalf of Genetics Society of America. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/).Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.Abstract
Cultivated soybean (Glycine max) is an important source for protein and oil. Many elite cultivars with different traits have been developed for different conditions. Each soybean strain has its own genetic diversity, and the availability of more high-quality soybean genomes can enhance comparative genomic analysis for identifying genetic underpinnings for its unique traits. In this study, we constructed a high-quality de novo assembly of an elite soybean cultivar Jidou 17 (JD17) with chromosome contiguity and high accuracy. We annotated 52,840 gene models and reconstructed 74,054 high-quality full-length transcripts. We performed a genome-wide comparative analysis based on the reference genome of JD17 with 3 published soybeans (WM82, ZH13, and W05), which identified 5 large inversions and 2 large translocations specific to JD17, 20,984-46,912 presence-absence variations spanning 13.1-46.9 Mb in size. A total of 1,695,741-3,664,629 SNPs and 446,689-800,489 Indels were identified and annotated between JD17 and them. Symbiotic nitrogen fixation genes were identified and the effects from these variants were further evaluated. It was found that the coding sequences of 9 nitrogen fixation-related genes were greatly affected. The high-quality genome assembly of JD17 can serve as a valuable reference for soybean functional genomics research. © The Author(s) 2022. Published by Oxford University Press on behalf of Genetics Society of America.Note
Open access journalISSN
2160-1836PubMed ID
35188189Version
Final published versionae974a485f413a2113503eed53cd6c53
10.1093/g3journal/jkac017
Scopus Count
Collections
Except where otherwise noted, this item's license is described as Copyright © The Author(s) 2022. Published by Oxford University Press on behalf of Genetics Society of America. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/).
Related articles
- Construction and comparison of three reference-quality genome assemblies for soybean.
- Authors: Valliyodan B, Cannon SB, Bayer PE, Shu S, Brown AV, Ren L, Jenkins J, Chung CY, Chan TF, Daum CG, Plott C, Hastie A, Baruch K, Barry KW, Huang W, Patil G, Varshney RK, Hu H, Batley J, Yuan Y, Song Q, Stupar RM, Goodstein DM, Stacey G, Lam HM, Jackson SA, Schmutz J, Grimwood J, Edwards D, Nguyen HT
- Issue date: 2019 Dec
- Evaluation of genetic variation among Brazilian soybean cultivars through genome resequencing.
- Authors: Maldonado dos Santos JV, Valliyodan B, Joshi T, Khan SM, Liu Y, Wang J, Vuong TD, de Oliveira MF, Marcelino-Guimarães FC, Xu D, Nguyen HT, Abdelnoor RV
- Issue date: 2016 Feb 13
- Genome assembly of the popular Korean soybean cultivar Hwangkeum.
- Authors: Kim MS, Lee T, Baek J, Kim JH, Kim C, Jeong SC
- Issue date: 2021 Sep 27
- Genome-wide SNP identification and characterization in two soybean cultivars with contrasting Mungbean Yellow Mosaic India Virus disease resistance traits.
- Authors: Yadav CB, Bhareti P, Muthamilarasan M, Mukherjee M, Khan Y, Rathi P, Prasad M
- Issue date: 2015
- The pan-genome of the cultivated soybean (PanSoy) reveals an extraordinarily conserved gene content.
- Authors: Torkamaneh D, Lemay MA, Belzile F
- Issue date: 2021 Sep
Related items
Showing items related by title, author, creator and subject.
-
Changes in the Genome: Polyploidy, Hybridization, and Genome Size Evolution Explored Through Selaginella and Vascular PlantsBarker, Michael S.; Baniaga, Anthony Ernest-Fiorentino; Ferriere, Regis; Robichaux, Robert H.; Sanderson, Michael J.; Worobey, Michael (The University of Arizona., 2019)The evolutionary processes responsible for generating and maintaining the remarkable diversity of life on earth are mutation, selection, drift, recombination, and gene flow. The relative magnitude of these processes, and their tempo, can be inferred from studying genomes or samples of the genome from individuals or multiple individuals. My dissertation focuses on three types of change in vascular plant genomes, with a focus on lycophytes in the genus Selaginella (Selaginellaceae). In Appendix A I characterize the extremely small genome sizes of Selaginella, and compare their observed disparity in genome size to other vascular plant clades. In Appendix B I examine the temporal activity of long terminal repeat retrotransposons (LTR-RTs) in vascular plants. I illustrate that across vascular plants LTR-RT activity largely explains the observed diversity in genome size. In Appendix C and D I focus on abrupt changes in the genome via polyploidy and hybridization. In Appendix C I demonstrate the importance of climatic niche divergence in polyploid plant species. In Appendix D I investigate the evidence of hybrid speciation in hybrids formed between Selaginella arizonica and S. eremophila in the Sonoran Desert. Using transcriptome sequencing and complementary morphological and ploidal inference, I suggest that both homoploid hybrid and allopolyploid species were formed from the same parents S. arizonica and S. eremophila. This system is the first known example of two types of stabilized hybrid derivatives from the same parents in natural populations.
-
PhiX174 genome-capsid interactions: Evidence for a scaffolding-like function for the genome during morphogenesisFane, Bentley A.; Hafenstein, Susan (The University of Arizona., 2003)The assembly of viral proteins and nucleic acids into mature and biologically active virions involves a diverse spectrum of macromolecular interactions. After capsid formation, structural and packaging proteins must interact with viral nucleic acids. These interactions may confer packaging specificity, spatially organize the genome, enhance particle stability, or contribute directly to capsid quaternary structure. In the Microviridae, packaging and capsid proteins are tightly associated, tethering the genome to the inner surface and guiding it into the overall icosahedral symmetry of the particle. All of these factors may influence the final stages of maturation, which involves an inward collapse of coat proteins around the packaged genome. These packaging parameters were altered in three ways. (1) The DNA binding residues of the DNA binding protein were altered. Although the genome and protein are in the interior of the capsid, alterations were expressed on the capsid's outer surfaces. The results of second site genetic analyses illustrate how coat protein modifications can compensate for defective phenotypes. (2) Non-DNA binding amino acid residues believed to be of structural importance were mutated. The results of these analyses elucidate the function of these residues in optimizing DNA-protein interactions and organizing the DNA into the capsid's symmetry. Again, the results of second site genetic analyses demonstrate the inherent evolutionary plasticity of the system. (3) Packaged DNA was changed by altering base composition and folding parameters. The experimental results support a model in which the secondary structure of the packaged genome acts in a scaffold-like manner during the final stage of virion morphogenesis, affecting the biophysical and biological properties of the mature virion. Finally a chimeric particle was constructed by placing a wild type DNA binding protein in a wild type, but foreign environment. The biophysical characterization of these particles is consistent with the mentioned model. A structural analysis was performed, in collaboration with Dr. Rossmann's group (Purdue University), to provide a structural context in which to interpret the observed biophysical effects. While not all questions were answered or hypotheses verified, the results elucidate the limitations and interpretation of structural analyses, a current debate in the structural biology field.
-
Inference of Recent Demographic History of Population Isolates using Genome-wide High Density SNP Arrays and Whole Genome SequencesHammer, Michael; Gladstein, Ariella Leah; Barker, Michael; Gutenkunst, Ryan; Walsh, Bruce (The University of Arizona., 2018)In this dissertation I addressed the problem of SNP array bias when finding runs of homozygosity. I demonstrated the pitfalls of using uninformed methods for finding runs of homozygosity and provide better alternatives, including a more reliable algorithm for identifying runs of homozygosity than the most commonly used program. I then provide a review of Ashkenazi population genetics. Next, I developed software to efficiently run millions of whole chromosome simulations, which is publicly available through GitHub, DockerHub, and on the CyVerse Discovery Environment. I applied my computational method to use Approximate Bayesian Computation to test models of Ashkenazi Jewish demographic history. I found that the Ashkenazi Jews are comprised of genetically distinct subgroups from Eastern and Western Europe, as a result of massive population growth in the Eastern Ashkenazi Jews, but not in the Western Ashkenazi Jews. I further confirmed that the Ashkenazi Jews do not primarily originate from Khazaria. Finally, I created a correction for SNP array ascertainment bias in the median and total length of runs of homozygosity, and applied this correction to world-wide human populations. However, I found that ascertainment bias plays a minor role compared to SNP array bias in human populations.