Silva-Rodríguez Julio, Colomer Adrián, Sales María A, Molina Rafael, Naranjo Valery
Institute of Transport and Territory, Universitat Politècnica de València, Valencia, Spain.
Institute of Research and Innovation in Bioengineering, Universitat Politècnica de València, Valencia, Spain.
Comput Methods Programs Biomed. 2020 Oct;195:105637. doi: 10.1016/j.cmpb.2020.105637. Epub 2020 Jul 4.
Prostate cancer is one of the most common diseases affecting men worldwide. The Gleason scoring system is the primary diagnostic and prognostic tool for prostate cancer. Furthermore, recent reports indicate that the presence of patterns of the Gleason scale such as the cribriform pattern may also correlate with a worse prognosis compared to other patterns belonging to the Gleason grade 4. Current clinical guidelines have indicated the convenience of highlight its presence during the analysis of biopsies. All these requirements suppose a great workload for the pathologist during the analysis of each sample, which is based on the pathologist's visual analysis of the morphology and organisation of the glands in the tissue, a time-consuming and subjective task. In recent years, with the development of digitisation devices, the use of computer vision techniques for the analysis of biopsies has increased. However, to the best of the authors' knowledge, the development of algorithms to automatically detect individual cribriform patterns belonging to Gleason grade 4 has not yet been studied in the literature. The objective of the work presented in this paper is to develop a deep-learning-based system able to support pathologists in the daily analysis of prostate biopsies. This analysis must include the Gleason grading of local structures, the detection of cribriform patterns, and the Gleason scoring of the whole biopsy.
The methodological core of this work is a patch-wise predictive model based on convolutional neural networks able to determine the presence of cancerous patterns based on the Gleason grading system. In particular, we train from scratch a simple self-design architecture with three filters and a top model with global-max pooling. The cribriform pattern is detected by retraining the set of filters of the last convolutional layer in the network. Subsequently, a biopsy-level prediction map is reconstructed by bi-linear interpolation of the patch-level prediction of the Gleason grades. In addition, from the reconstructed prediction map, we compute the percentage of each Gleason grade in the tissue to feed a multi-layer perceptron which provides a biopsy-level score.
In our SICAPv2 database, composed of 182 annotated whole slide images, we obtained a Cohen's quadratic kappa of 0.77 in the test set for the patch-level Gleason grading with the proposed architecture trained from scratch. Our results outperform previous ones reported in the literature. Furthermore, this model reaches the level of fine-tuned state-of-the-art architectures in a patient-based four groups cross validation. In the cribriform pattern detection task, we obtained an area under ROC curve of 0.82. Regarding the biopsy Gleason scoring, we achieved a quadratic Cohen's Kappa of 0.81 in the test subset. Shallow CNN architectures trained from scratch outperform current state-of-the-art methods for Gleason grades classification. Our proposed model is capable of characterising the different Gleason grades in prostate tissue by extracting low-level features through three basic blocks (i.e. convolutional layer + max pooling). The use of global-max pooling to reduce each activation map has shown to be a key factor for reducing complexity in the model and avoiding overfitting. Regarding the Gleason scoring of biopsies, a multi-layer perceptron has shown to better model the decision-making of pathologists than previous simpler models used in the literature.
前列腺癌是全球影响男性的最常见疾病之一。格里森评分系统是前列腺癌的主要诊断和预后工具。此外,最近的报告表明,与格里森4级的其他模式相比,格里森分级中的某些模式(如筛状模式)的存在也可能与更差的预后相关。当前临床指南指出,在活检分析过程中突出其存在很方便。所有这些要求都意味着病理学家在分析每个样本时工作量巨大,这基于病理学家对组织中腺体形态和结构的视觉分析,是一项耗时且主观的任务。近年来,随着数字化设备的发展,用于活检分析的计算机视觉技术的使用有所增加。然而,据作者所知,尚未在文献中研究开发自动检测属于格里森4级的单个筛状模式的算法。本文所呈现工作的目的是开发一个基于深度学习的系统,以支持病理学家在前列腺活检的日常分析中。这种分析必须包括局部结构的格里森分级、筛状模式的检测以及整个活检的格里森评分。
这项工作的方法核心是基于卷积神经网络的逐块预测模型,能够根据格里森评分系统确定癌性模式的存在。具体而言,我们从零开始训练一个具有三个滤波器的简单自行设计架构和一个具有全局最大池化的顶级模型。通过重新训练网络中最后一个卷积层的滤波器集来检测筛状模式。随后,通过对格里森分级的块级预测进行双线性插值来重建活检级预测图。此外,从重建的预测图中,我们计算组织中每个格里森分级的百分比,以输入一个多层感知器,该感知器提供活检级评分。
在我们由182张带注释的全切片图像组成的SICAPv2数据库中,对于从零开始训练的所提出架构,在测试集中块级格里森分级的科恩二次kappa值为0.77。我们的结果优于文献中先前报道的结果。此外,在基于患者的四组交叉验证中,该模型达到了微调后的最先进架构水平。在筛状模式检测任务中,我们获得的ROC曲线下面积为0.82。关于活检格里森评分,我们在测试子集中实现的科恩二次kappa值为0.81。从零开始训练的浅层卷积神经网络架构在格里森分级分类方面优于当前的最先进方法。我们提出的模型能够通过三个基本模块(即卷积层+最大池化)提取低级特征来表征前列腺组织中的不同格里森分级。使用全局最大池化来减少每个激活图已被证明是降低模型复杂性和避免过拟合的关键因素。关于活检的格里森评分,多层感知器已被证明比文献中先前使用的更简单模型能更好地模拟病理学家的决策。