MTA TTK Lendület Cancer Biomarker Research Group, Institute of Enzymology, Budapest, H-1117, Hungary.
2nd Department of Pediatrics, Semmelweis University, Budapest, H-1094, Hungary.
Br J Cancer. 2018 Apr;118(8):1107-1114. doi: 10.1038/s41416-018-0030-0. Epub 2018 Mar 21.
Sequence variations in coding and non-coding regions of the genome can affect gene expression and signalling pathways, which in turn may influence disease outcome.
In this study, we integrated somatic mutations, gene expression and clinical data from 930 breast cancer patients included in the TCGA database. Genes associated with single mutations in molecular breast cancer subtypes were identified by the Mann-Whitney U-test and their prognostic value was evaluated by Kaplan-Meier and Cox regression analyses. Results were confirmed using gene expression profiles from the Metabric data set (n = 1988) and whole-genome sequencing data from the TCGA cohort (n = 117).
The overall mutation rate in coding and non-coding regions were significantly higher in ER-negative/HER2-negative tumours (P = 2.8E-03 and P = 2.4E-07, respectively). Recurrent sequence variations were identified in non-coding regulatory regions of several cancer-associated genes, including NBPF1, PIK3CA and TP53. After multivariate regression analysis, gene signatures associated with three coding mutations (CDH1, MAP3K1 and TP53) and two non-coding variants (CRTC3 and STAG2) in cancer-related genes predicted prognosis in ER-positive/HER2-negative tumours.
These findings demonstrate that sequence alterations influence gene expression and oncogenic pathways, possibly affecting the outcome of breast cancer patients. Our data provide potential opportunities to identify non-coding variations with functional and clinical relevance in breast cancer.
基因组编码和非编码区域的序列变异会影响基因表达和信号通路,进而可能影响疾病的结局。
本研究整合了 TCGA 数据库中 930 例乳腺癌患者的体细胞突变、基因表达和临床数据。采用 Mann-Whitney U 检验鉴定与分子乳腺癌亚型中单个突变相关的基因,通过 Kaplan-Meier 和 Cox 回归分析评估其预后价值。通过使用 Metabric 数据集(n=1988)的基因表达谱和 TCGA 队列的全基因组测序数据(n=117)对结果进行了验证。
ER-阴性/HER2-阴性肿瘤的编码和非编码区域的总突变率明显更高(P=2.8E-03 和 P=2.4E-07)。在几个癌症相关基因的非编码调控区域发现了反复出现的序列变异,包括 NBPF1、PIK3CA 和 TP53。多变量回归分析后,与三个编码突变(CDH1、MAP3K1 和 TP53)和两个非编码变异(CRTC3 和 STAG2)相关的基因特征可预测 ER-阳性/HER2-阴性肿瘤的预后。
这些发现表明,序列改变会影响基因表达和致癌途径,可能影响乳腺癌患者的结局。我们的数据提供了潜在的机会,可以识别乳腺癌中具有功能和临床相关性的非编码变异。