Suppr超能文献

使用集成机器学习方法识别前列腺癌中可能的候选者。

Identification of Plausible Candidates in Prostate Cancer Using Integrated Machine Learning Approaches.

作者信息

Kour Bhumandeep, Shukla Nidhi, Bhargava Harshita, Sharma Devendra, Sharma Amita, Singh Anjuvan, Valadi Jayaraman, Sadasukhi Trilok Chand, Vuree Sugunakar, Suravajhala Prashanth

机构信息

Department of Biotechnology, Lovely Professional University, Jalandhar, Punjab, India.

Bioclues.org, India.

出版信息

Curr Genomics. 2023 Dec 20;24(5):287-306. doi: 10.2174/0113892029240239231109082805.

Abstract

BACKGROUND

Currently, prostate-specific antigen (PSA) is commonly used as a prostate cancer (PCa) biomarker. PSA is linked to some factors that frequently lead to erroneous positive results or even needless biopsies of elderly people.

OBJECTIVES

In this pilot study, we undermined the potential genes and mutations from several databases and checked whether or not any putative prognostic biomarkers are central to the annotation. The aim of the study was to develop a risk prediction model that could help in clinical decision-making.

METHODS

An extensive literature review was conducted, and clinical parameters for related comorbidities, such as diabetes, obesity, as well as PCa, were collected. Such parameters were chosen with the understanding that variations in their threshold values could hasten the complicated process of carcinogenesis, more particularly PCa. The gathered data was converted to semi-binary data (-1, -0.5, 0, 0.5, and 1), on which machine learning (ML) methods were applied. First, we cross-checked various publicly available datasets, some published RNA-seq datasets, and our whole-exome sequencing data to find common role players in PCa, diabetes, and obesity. To narrow down their common interacting partners, interactome networks were analysed using GeneMANIA and visualised using Cytoscape, and later cBioportal was used (to compare expression level based on Z scored values) wherein various types of mutation w.r.t their expression and mRNA expression (RNA seq FPKM) plots are available. The GEPIA 2 tool was used to compare the expression of resulting similarities between the normal tissue and TCGA databases of PCa. Later, top-ranking genes were chosen to demonstrate striking clustering coefficients using the Cytoscape-cytoHubba module, and GEPIA 2 was applied again to ascertain survival plots.

RESULTS

Comparing various publicly available datasets, it was found that BLM is a frequent player in all three diseases, whereas comparing publicly available datasets, GWAS datasets, and published sequencing findings, SPFTPC and PPIMB were found to be the most common. With the assistance of GeneMANIA, TMPO and FOXP1 were found as common interacting partners, and they were also seen participating with BLM.

CONCLUSION

A probabilistic machine learning model was achieved to identify key candidates between diabetes, obesity, and PCa. This, we believe, would herald precision scale modeling for easy prognosis.

摘要

背景

目前,前列腺特异性抗原(PSA)通常用作前列腺癌(PCa)的生物标志物。PSA与一些经常导致错误阳性结果甚至导致老年人不必要活检的因素有关。

目的

在这项初步研究中,我们从几个数据库中挖掘潜在基因和突变,并检查是否有任何假定的预后生物标志物是注释的核心。该研究的目的是开发一种有助于临床决策的风险预测模型。

方法

进行了广泛的文献综述,并收集了相关合并症(如糖尿病、肥胖症)以及PCa的临床参数。选择这些参数是因为了解到它们阈值的变化可能会加速致癌的复杂过程,尤其是PCa。将收集到的数据转换为半二进制数据(-1、-0.5、0、0.5和1),并应用机器学习(ML)方法。首先,我们交叉检查了各种公开可用的数据集、一些已发表的RNA测序数据集以及我们的全外显子测序数据,以找出PCa、糖尿病和肥胖症中常见的参与者。为了缩小它们的共同相互作用伙伴范围,使用GeneMANIA分析相互作用组网络并使用Cytoscape进行可视化,随后使用cBioportal(基于Z评分值比较表达水平),其中可获得各种类型突变的表达和mRNA表达(RNA序列FPKM)图。使用GEPIA 2工具比较正常组织与PCa的TCGA数据库之间所得相似性的表达。随后,选择排名靠前的基因以使用Cytoscape-cytoHubba模块展示显著的聚类系数,并再次应用GEPIA 2以确定生存图。

结果

比较各种公开可用的数据集,发现BLM在所有三种疾病中都很常见,而比较公开可用的数据集、GWAS数据集和已发表的测序结果时,发现SPFTPC和PPIMB是最常见的。在GeneMANIA的帮助下,发现TMPO和FOXP1是共同的相互作用伙伴,并且它们也与BLM一起参与。

结论

实现了一种概率机器学习模型,以识别糖尿病、肥胖症和PCa之间的关键候选物。我们相信,这将预示着易于预后的精确规模建模。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3f90/10790336/8bbd6e5a64f1/CG-24-287_F1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验