School of Life Sciences, Anhui University, Hefei, 230601, China.
College of Biology and Food Engineering, Chuzhou University, Chuzhou, 239000, China.
BMC Genomics. 2024 Oct 4;25(1):930. doi: 10.1186/s12864-024-10855-5.
Huntington's disease (HD) is a hereditary neurological disorder caused by mutations in HTT, leading to neuronal degeneration. Traditionally, HD is associated with the misfolding and aggregation of mutant huntingtin due to an extended polyglutamine domain encoded by an expanded CAG tract. However, recent research has also highlighted the role of global transcriptional dysregulation in HD pathology. However, understanding the intricate relationship between mRNA expression and HD at the cellular level remains challenging. Our study aimed to elucidate the underlying mechanisms of HD pathology using single-cell sequencing data.
We used single-cell RNA sequencing analysis to determine differential gene expression patterns between healthy and HD cells. HD cells were effectively modeled using a residual neural network (ResNet), which outperformed traditional and convolutional neural networks. Despite the efficacy of our approach, the F1 score for the test set was 96.53%. Using the SHapley Additive exPlanations (SHAP) algorithm, we identified genes influencing HD prediction and revealed their roles in HD pathobiology, such as in the regulation of cellular iron metabolism and mitochondrial function. SHAP analysis also revealed low-abundance genes that were overlooked by traditional differential expression analysis, emphasizing its effectiveness in identifying biologically relevant genes for distinguishing between healthy and HD cells. Overall, the integration of single-cell RNA sequencing data and deep learning models provides valuable insights into HD pathology.
We developed the model capable of analyzing HD at single-cell transcriptomic level.
亨廷顿病(HD)是一种遗传性神经疾病,由 HTT 突变引起,导致神经元变性。传统上,HD 与突变亨廷顿蛋白的错误折叠和聚集有关,这是由于扩展的 CAG 重复编码的长多聚谷氨酰胺结构域。然而,最近的研究也强调了全局转录失调在 HD 病理学中的作用。然而,理解 mRNA 表达与 HD 在细胞水平上的复杂关系仍然具有挑战性。我们的研究旨在使用单细胞测序数据阐明 HD 病理学的潜在机制。
我们使用单细胞 RNA 测序分析来确定健康细胞和 HD 细胞之间的差异基因表达模式。使用残差神经网络(ResNet)有效地对 HD 细胞进行建模,该网络的性能优于传统和卷积神经网络。尽管我们的方法有效,但测试集的 F1 得分为 96.53%。使用 SHapley Additive exPlanations(SHAP)算法,我们确定了影响 HD 预测的基因,并揭示了它们在 HD 病理生物学中的作用,例如调节细胞铁代谢和线粒体功能。SHAP 分析还揭示了传统差异表达分析忽略的低丰度基因,强调了其在识别区分健康和 HD 细胞的生物学相关基因方面的有效性。总体而言,单细胞 RNA 测序数据和深度学习模型的整合为 HD 病理学提供了有价值的见解。
我们开发了能够在单细胞转录组水平分析 HD 的模型。