McKusick-Nathans Institute of the Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, Maryland.
Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins School of Medicine, Baltimore, Maryland.
Cancer Res. 2019 Oct 1;79(19):5102-5112. doi: 10.1158/0008-5472.CAN-18-3882. Epub 2019 Jul 23.
Tumor heterogeneity provides a complex challenge to cancer treatment and is a critical component of therapeutic response, disease recurrence, and patient survival. Single-cell RNA-sequencing (scRNA-seq) technologies have revealed the prevalence of intratumor and intertumor heterogeneity. Computational techniques are essential to quantify the differences in variation of these profiles between distinct cell types, tumor subtypes, and patients to fully characterize intratumor and intertumor molecular heterogeneity. In this study, we adapted our algorithm for pathway dysregulation, Expression Variation Analysis (EVA), to perform multivariate statistical analyses of differential variation of expression in gene sets for scRNA-seq. EVA has high sensitivity and specificity to detect pathways with true differential heterogeneity in simulated data. EVA was applied to several public domain scRNA-seq tumor datasets to quantify the landscape of tumor heterogeneity in several key applications in cancer genomics such as immunogenicity, metastasis, and cancer subtypes. Immune pathway heterogeneity of hematopoietic cell populations in breast tumors corresponded to the amount of diversity present in the T-cell repertoire of each individual. Cells from head and neck squamous cell carcinoma (HNSCC) primary tumors had significantly more heterogeneity across pathways than cells from metastases, consistent with a model of clonal outgrowth. Moreover, there were dramatic differences in pathway dysregulation across HNSCC basal primary tumors. Within the basal primary tumors, there was increased immune dysregulation in individuals with a high proportion of fibroblasts present in the tumor microenvironment. These results demonstrate the broad utility of EVA to quantify intertumor and intratumor heterogeneity from scRNA-seq data without reliance on low-dimensional visualization. SIGNIFICANCE: This study presents a robust statistical algorithm for evaluating gene expression heterogeneity within pathways or gene sets in single-cell RNA-seq data.
肿瘤异质性给癌症治疗带来了复杂的挑战,是治疗反应、疾病复发和患者生存的关键组成部分。单细胞 RNA 测序 (scRNA-seq) 技术揭示了肿瘤内和肿瘤间异质性的普遍性。计算技术对于量化不同细胞类型、肿瘤亚型和患者之间这些谱之间差异的变化至关重要,以充分描述肿瘤内和肿瘤间分子异质性。在这项研究中,我们改编了我们的通路失调算法,表达变异分析 (EVA),以对 scRNA-seq 中基因集表达的差异变异进行多变量统计分析。EVA 对模拟数据中具有真实差异异质性的通路具有高灵敏度和特异性。EVA 应用于几个公共 scRNA-seq 肿瘤数据集,以量化癌症基因组学中几个关键应用中的肿瘤异质性全景,如免疫原性、转移和癌症亚型。乳腺癌肿瘤中造血细胞群体的免疫通路异质性与每个个体 T 细胞库中存在的多样性量相对应。与转移细胞相比,头颈部鳞状细胞癌 (HNSCC) 原发肿瘤中的细胞在多条通路上的异质性明显更大,这与克隆生长的模型一致。此外,HNSCC 基底原发性肿瘤中的通路失调差异巨大。在基底原发性肿瘤中,存在大量成纤维细胞的肿瘤微环境中个体的免疫失调增加。这些结果表明,EVA 具有广泛的用途,可以在不依赖低维可视化的情况下从 scRNA-seq 数据中定量肿瘤间和肿瘤内异质性。意义:本研究提出了一种稳健的统计算法,用于评估单细胞 RNA-seq 数据中通路或基因集中的基因表达异质性。