Reinhold William C, Varma Sudhir, Rajapakse Vinodh N, Luna Augustin, Sousa Fabricio Garmus, Kohn Kurt W, Pommier Yves G
Developmental Therapeutic Branch, Center for Cancer Research, NCI, NIH, 9000 Rockville Pike, Building 37, room 5041, Bethesda, MD, 20892, USA,
Hum Genet. 2015 Jan;134(1):3-11. doi: 10.1007/s00439-014-1482-9. Epub 2014 Sep 12.
The current convergence of molecular and pharmacological data provides unprecedented opportunities to gain insights into the relationships between the two types of data. Multiple forms of large-scale molecular data, including but not limited to gene and microRNA transcript expression, DNA somatic and germline variations from next-generation DNA and RNA sequencing, and DNA copy number from array comparative genomic hybridization are all potentially informative when one attempts to recognize the panoply of potentially influential events both for cancer progression and therapeutic outcome. Concurrently, there has also been a substantial expansion of the pharmacological data being accrued in a systematic fashion. For cancer cell lines, the National Cancer Institute cell line panel (NCI-60), the Cancer Cell Line Encyclopedia (CCLE), and the collaborative Genomics of Drug Sensitivity in Cancer (GDSC) databases all provide subsets of these forms of data. For the patient-derived data, The Cancer Genome Atlas (TCGA) provides analogous forms of genomic information along with treatment histories. Integration of these data in turn relies on the fields of statistics and statistical learning. Multiple algorithmic approaches may be chosen, depending on the data being considered, and the nature of the question being asked. Combining these algorithms with prior biological knowledge, the results of molecular biological studies, and the consideration of genes as pathways or functional groups provides both the challenge and the potential of the field. The ultimate goal is to provide a paradigm shift in the way that drugs are selected to provide a more targeted and efficacious outcome for the patient.
当前分子数据与药理学数据的融合提供了前所未有的机会,以深入了解这两类数据之间的关系。多种形式的大规模分子数据,包括但不限于基因和微小RNA转录本表达、来自新一代DNA和RNA测序的DNA体细胞和种系变异,以及来自阵列比较基因组杂交的DNA拷贝数,当人们试图识别对癌症进展和治疗结果具有潜在影响的一系列事件时,都可能提供有用信息。与此同时,以系统方式积累的药理学数据也有了大幅扩展。对于癌细胞系,美国国立癌症研究所细胞系面板(NCI-60)、癌症细胞系百科全书(CCLE)以及癌症药物敏感性合作基因组学(GDSC)数据库都提供了这些数据形式的子集。对于患者来源的数据,癌症基因组图谱(TCGA)提供了类似形式的基因组信息以及治疗史。这些数据的整合反过来依赖于统计学和统计学习领域。根据所考虑的数据以及所提出问题的性质,可以选择多种算法方法。将这些算法与先前的生物学知识、分子生物学研究结果以及将基因视为通路或功能组的考虑相结合,既带来了该领域的挑战,也带来了潜力。最终目标是在药物选择方式上实现范式转变,为患者提供更具针对性和有效性的治疗结果。