Normalization Methods on Single-Cell RNA-seq Data: An Empirical Survey
Affiliation
Univ Arizona, Interdisciplinary Program Stat, Stat Bioinformat LabUniv Arizona, Dept Epidemiol & Biostat
Univ Arizona, Dept Biosyst Engn, Stat Bioinformat Lab
Issue Date
2020-02-07
Metadata
Show full item recordPublisher
FRONTIERS MEDIA SACitation
Lytal N, Ran D and An L (2020) Normalization Methods on Single-Cell RNA-seq Data: An Empirical Survey. Front. Genet. 11:41. doi: 10.3389/fgene.2020.00041Journal
FRONTIERS IN GENETICSRights
Copyright © 2020 Lytal, Ran and An. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY).Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.Abstract
Data normalization is vital to single-cell sequencing, addressing limitations presented by low input material and various forms of bias or noise present in the sequencing process. Several such normalization methods exist, some of which rely on spike-in genes, molecules added in known quantities to serve as a basis for a normalization model. Depending on available information and the type of data, some methods may express certain advantages over others. We compare the effectiveness of seven available normalization methods designed specifically for single-cell sequencing using two real data sets containing spike-in genes and one simulation study. Additionally, we test those methods not dependent on spike-in genes using a real data set with three distinct cell-cycle states and a real data set under the 10X Genomics GemCode platform with multiple cell types represented. We demonstrate the differences in effectiveness for the featured methods using visualization and classification assessment and conclude which methods are preferable for normalizing a certain type of data for further downstream analysis, such as classification or differential analysis. The comparison in computational time for all methods is addressed as well.Note
Open access journalISSN
1664-8021PubMed ID
32117453Version
Final published versionae974a485f413a2113503eed53cd6c53
10.3389/fgene.2020.00041
Scopus Count
Collections
Except where otherwise noted, this item's license is described as Copyright © 2020 Lytal, Ran and An. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY).
Related articles
- Improving small RNA-seq by using a synthetic spike-in set for size-range quality control together with a set for data normalization.
- Authors: Locati MD, Terpstra I, de Leeuw WC, Kuzak M, Rauwerda H, Ensink WA, van Leeuwen S, Nehrdich U, Spaink HP, Jonker MJ, Breit TM, Dekker RJ
- Issue date: 2015 Aug 18
- Comparison of normalization methods for differential gene expression analysis in RNA-Seq experiments: A matter of relative size of studied transcriptomes.
- Authors: Maza E, Frasse P, Senin P, Bouzayen M, Zouine M
- Issue date: 2013 Nov 1
- Comparison of normalization and differential expression analyses using RNA-Seq data from 726 individual Drosophila melanogaster.
- Authors: Lin Y, Golovnina K, Chen ZX, Lee HN, Negron YL, Sultana H, Oliver B, Harbison ST
- Issue date: 2016 Jan 5
- Assessment of Single Cell RNA-Seq Normalization Methods.
- Authors: Ding B, Zheng L, Wang W
- Issue date: 2017 Jul 5
- Normalization of low-density microarray using external spike-in controls: analysis of macrophage cell lines expression profile.
- Authors: Fardin P, Moretti S, Biasotti B, Ricciardi A, Bonassi S, Varesio L
- Issue date: 2007 Jan 17