A Genomic Analysis of Factors Driving lincRNA Diversification: Lessons from Plants
AuthorNelson, A. D. L.
Forsythe, E. S.
Devisetty, U. K.
Clausen, D. S.
Haug-Batzell, A. K.
Meldrum, A. M. R.
Frank, M. R.
Beilstein, M. A.
AffiliationUniv Arizona, Dept Chem & Biochem
Univ Arizona, Dept Phys
MetadataShow full item record
PublisherGENETICS SOCIETY AMERICA
CitationA Genomic Analysis of Factors Driving lincRNA Diversification: Lessons from Plants 2016, 6 (9):2881 G3: Genes|Genomes|Genetics
RightsCopyright © 2016 Nelson et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).
Collection InformationThis item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at firstname.lastname@example.org.
AbstractTranscriptomic analyses from across eukaryotes indicate that most of the genome is transcribed at some point in the developmental trajectory of an organism. One class of these transcripts is termed long intergenic noncoding RNAs (lincRNAs). Recently, attention has focused on understanding the evolutionary dynamics of lincRNAs, particularly their conservation within genomes. Here, we take a comparative genomic and phylogenetic approach to uncover factors influencing lincRNA emergence and persistence in the plant family Brassicaceae, to which Arabidopsis thaliana belongs. We searched 10 genomes across the family for evidence of >5000 lincRNA loci from A. thaliana. From loci conserved in the genomes of multiple species, we built alignments and inferred phylogeny. We then used gene tree/species tree reconciliation to examine the duplication history and timing of emergence of these loci. Emergence of lincRNA loci appears to be linked to local duplication events, but, surprisingly, not whole genome duplication events (WGD), or transposable elements. Interestingly, WGD events are associated with the loss of loci for species having undergone relatively recent polyploidy. Lastly, we identify 1180 loci of the 6480 previously annotated A. thaliana lincRNAs (18%) with elevated levels of conservation. These conserved lincRNAs show higher expression, and are enriched for stress-responsiveness and cis-regulatory motifs known as conserved noncoding sequences (CNSs). These data highlight potential functional pathways and suggest that CNSs may regulate neighboring genes at both the genomic and transcriptomic level. In sum, we provide insight into processes that may influence lincRNA diversification by providing an evolutionary context for previously annotated lincRNAs.
VersionFinal published version
SponsorsNational Science Foundation [1409251, 144490]