Ensemble dimensionality reduction and feature gene extraction for single-cell RNA-seq data
Name:
s41467-020-19465-7.pdf
Size:
1.740Mb
Format:
PDF
Description:
Final Published Version
Affiliation
Univ Arizona, Dept Epidemiol & BiostatUniv Arizona, Dept Biosyst Engn
Issue Date
2020-11-17
Metadata
Show full item recordPublisher
NATURE RESEARCHCitation
Sun, X., Liu, Y., & An, L. (2020). Ensemble dimensionality reduction and feature gene extraction for single-cell RNA-seq data. Nature Communications, 11(1), 1-9.Journal
NATURE COMMUNICATIONSRights
© The Author(s) 2020. Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License.Collection Information
This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at repository@u.library.arizona.edu.Abstract
Single-cell RNA sequencing (scRNA-seq) technologies allow researchers to uncover the biological states of a single cell at high resolution. For computational efficiency and easy visualization, dimensionality reduction is necessary to capture gene expression patterns in low-dimensional space. Here we propose an ensemble method for simultaneous dimensionality reduction and feature gene extraction (EDGE) of scRNA-seq data. Different from existing dimensionality reduction techniques, the proposed method implements an ensemble learning scheme that utilizes massive weak learners for an accurate similarity search. Based on the similarity matrix constructed by those weak learners, the low-dimensional embedding of the data is estimated and optimized through spectral embedding and stochastic gradient descent. Comprehensive simulation and empirical studies show that EDGE is well suited for searching for meaningful organization of cells, detecting rare cell types, and identifying essential feature genes associated with certain cell types. Dimensionality reduction is used to make the analysis of single-cell RNA sequencing data more efficient. Here the authors propose a method, EDGE, which simultaneously carries out dimensionality reduction and feature gene extraction.Note
Open access journalISSN
2041-1723EISSN
2041-1723PubMed ID
33203837Version
Final published versionae974a485f413a2113503eed53cd6c53
10.1038/s41467-020-19465-7
Scopus Count
Collections
Except where otherwise noted, this item's license is described as © The Author(s) 2020. Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License.
Related articles
- Visualization of Single Cell RNA-Seq Data Using t-SNE in R.
- Authors: Zhou B, Jin W
- Issue date: 2020
- TSEE: an elastic embedding method to visualize the dynamic gene expression patterns of time series single-cell RNA sequencing data.
- Authors: An S, Ma L, Wan L
- Issue date: 2019 Apr 4
- Deep generative model embedding of single-cell RNA-Seq profiles on hyperspheres and hyperbolic spaces.
- Authors: Ding J, Regev A
- Issue date: 2021 May 5
- Spectral clustering of single cells using Siamese nerual network combined with improved affinity matrix.
- Authors: Jiang H, Huang Y, Li Q
- Issue date: 2022 May 13
- A Gene Rank Based Approach for Single Cell Similarity Assessment and Clustering.
- Authors: Xu Y, Li HD, Pan Y, Luo F, Wu FX, Wang J
- Issue date: 2021 Mar-Apr