College of Mathematics and Physics, Qingdao University of Science and Technolog, China.
College of Mathematics and Physics, Qingdao University of Science and Technology, China.
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa316.
The rapid development of single-cell RNA sequencing (scRNA-Seq) technology provides strong technical support for accurate and efficient analyzing single-cell gene expression data. However, the analysis of scRNA-Seq is accompanied by many obstacles, including dropout events and the curse of dimensionality. Here, we propose the scGMAI, which is a new single-cell Gaussian mixture clustering method based on autoencoder networks and the fast independent component analysis (FastICA). Specifically, scGMAI utilizes autoencoder networks to reconstruct gene expression values from scRNA-Seq data and FastICA is used to reduce the dimensions of reconstructed data. The integration of these computational techniques in scGMAI leads to outperforming results compared to existing tools, including Seurat, in clustering cells from 17 public scRNA-Seq datasets. In summary, scGMAI is an effective tool for accurately clustering and identifying cell types from scRNA-Seq data and shows the great potential of its applicative power in scRNA-Seq data analysis. The source code is available at https://github.com/QUST-AIBBDRC/scGMAI/.
单细胞 RNA 测序 (scRNA-Seq) 技术的快速发展为准确高效地分析单细胞基因表达数据提供了强大的技术支持。然而,scRNA-Seq 的分析伴随着许多障碍,包括数据缺失事件和维度灾难。在这里,我们提出了基于自动编码器网络和快速独立成分分析 (FastICA) 的新的单细胞高斯混合聚类方法 scGMAI。具体来说,scGMAI 利用自动编码器网络从 scRNA-Seq 数据中重建基因表达值,并使用 FastICA 来降低重建数据的维度。这些计算技术的集成在聚类 17 个公共 scRNA-Seq 数据集的细胞方面比现有的工具(包括 Seurat)表现出更好的结果。总之,scGMAI 是一种从 scRNA-Seq 数据中准确聚类和识别细胞类型的有效工具,并且在 scRNA-Seq 数据分析中显示出了其应用潜力。源代码可在 https://github.com/QUST-AIBBDRC/scGMAI/ 获得。