Evaluating single-subject study methods for personal transcriptomic interpretations to advance precision medicine
Author
Rachid Zaim, SamirKenost, Colleen
Berghout, Joanne
Vitali, Francesca
Zhang, Helen Hao
Lussier, Yves A
Affiliation
Univ Arizona Hlth Sci, Ctr Biomed Informat & BiostatUniv Arizona, Grad Interdisciplinary Program Stat
Univ Arizona, Dept Math, Coll Sci
Univ Arizona, Ctr Canc
Issue Date
2019-07-11Keywords
Genomic medicineMedical genomics
N-of-1
N-of-1 studies
Precision medicine
Single-subject studies
Transcriptome
Metadata
Show full item recordPublisher
BMCCitation
Zaim, S. R., Kenost, C., Berghout, J., Vitali, F., Zhang, H. H., & Lussier, Y. A. (2019). Evaluating single-subject study methods for personal transcriptomic interpretations to advance precision medicine. BMC medical genomics, 12(5), 96.Journal
BMC MEDICAL GENOMICSRights
Copyright © The Author(s). 2019 Open Access. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/). The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.Abstract
Background Gene expression profiling has benefited medicine by providing clinically relevant insights at the molecular candidate and systems levels. However, to adopt a more precision' approach that integrates individual variability including omics data into risk assessments, diagnoses, and therapeutic decision making, whole transcriptome expression needs to be interpreted meaningfully for single subjects. We propose an all-against-one framework that uses biological replicates in isogenic conditions for testing differentially expressed genes (DEGs) in a single subject (ss) in the absence of an appropriate external reference standard or replicates. To evaluate our proposed all-against-one framework, we construct reference standards (RSs) with five conventional replicate-anchored analyses (NOISeq, DEGseq, edgeR, DESeq, DESeq2) and the remainder were treated separately as single-subject sample pairs for ss analyses (without replicates).ResultsEight ss methods (NOISeq, DEGseq, edgeR, mixture model, DESeq, DESeq2, iDEG, and ensemble) for identifying genes with differential expression were compared in Yeast (parental line versus snf2 deletion mutant; n=42/condition) and a MCF7 breast-cancer cell line (baseline versus stimulated with estradiol; n=7/condition). Receiver-operator characteristic (ROC) and precision-recall plots were determined for eight ss methods against each of the five RSs in both datasets. Consistent with prior analyses of these data, similar to 50% and similar to 15% DEGs were obtained in Yeast and MCF7 datasets respectively, regardless of the RSs method. NOISeq, edgeR, and DESeq were the most concordant for creating a RS. Single-subject versions of NOISeq, DEGseq, and an ensemble learner achieved the best median ROC-area-under-the-curve to compare two transcriptomes without replicates regardless of the RS method and dataset (>90% in Yeast, >0.75 in MCF7). Further, distinct specific single-subject methods perform better according to different proportions of DEGs.ConclusionsThe all-against-one framework provides a honest evaluation framework for single-subject DEG studies since these methods are evaluated, by design, against reference standards produced by unrelated DEG methods. The ss-ensemble method was the only one to reliably produce higher accuracies in all conditions tested in this conservative evaluation framework. However, single-subject methods for identifying DEGs from paired samples need improvement, as no method performed with precision>90% and obtained moderate levels of recall.Note
Open access journalISSN
1755-8794PubMed ID
31296218Version
Final published versionSponsors
University of Arizona Health Sciences Center for Biomedical Informatics and Biostatistics; BIO5 Institute; NIH [U01AI122275, HL132532, NCI P30CA023074, 1UG3OD023171, 1S10RR029030]ae974a485f413a2113503eed53cd6c53
10.1186/s12920-019-0513-8
Scopus Count
Collections
Except where otherwise noted, this item's license is described as Copyright © The Author(s). 2019 Open Access. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/). The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
Related articles
- Interpretation of 'Omics dynamics in a single subject using local estimates of dispersion between two transcriptomes.
- Authors: Li Q, Zaim SR, Aberasturi D, Berghout J, Li H, Vitali F, Kenost C, Zhang HH, Lussier YA
- Issue date: 2019
- Personalized beyond Precision: Designing Unbiased Gold Standards to Improve Single-Subject Studies of Personal Genome Dynamics from Gene Products.
- Authors: Rachid Zaim S, Kenost C, Zhang HH, Lussier YA
- Issue date: 2020 Dec 31
- Single subject transcriptome analysis to identify functionally signed gene set or pathway activity.
- Authors: Berghout J, Li Q, Pouladi N, Li J, Lussier YA
- Issue date: 2018
- Emergence of pathway-level composite biomarkers from converging gene set signals of heterogeneous transcriptomic responses.
- Authors: Zaim SR, Li Q, Schissler AG, Lussier YA
- Issue date: 2018
- Usability of reference-free transcriptome assemblies for detection of differential expression: a case study on Aethionema arabicum dimorphic seeds.
- Authors: Wilhelmsson PKI, Chandler JO, Fernandez-Pozo N, Graeber K, Ullrich KK, Arshad W, Khan S, Hofberger JA, Buchta K, Edger PP, Pires JC, Schranz ME, Leubner-Metzger G, Rensing SA
- Issue date: 2019 Jan 30