Kronberg Raphael M, Haeberle Lena, Pfaus Melanie, Xu Haifeng C, Krings Karina S, Schlensog Martin, Rau Tilman, Pandyra Aleksandra A, Lang Karl S, Esposito Irene, Lang Philipp A
Department of Molecular Medicine II, Medical Faculty, Heinrich-Heine-University, Universitätsstrasse 1, 40225 Düsseldorf, Germany.
Mathematical Modelling of Biological Systems, Heinrich-Heine University, Universitätsstrasse 1, 40225 Düsseldorf, Germany.
Cancers (Basel). 2022 Apr 13;14(8):1964. doi: 10.3390/cancers14081964.
Pancreatic cancer is a fatal malignancy with poor prognosis and limited treatment options. Early detection in primary and secondary locations is critical, but fraught with challenges. While digital pathology can assist with the classification of histopathological images, the training of such networks always relies on a ground truth, which is frequently compromised as tissue sections contain several types of tissue entities. Here we show that pancreatic cancer can be detected on hematoxylin and eosin (H&E) sections by convolutional neural networks using deep transfer learning. To improve the ground truth, we describe a preprocessing data clean-up process using two communicators that were generated through existing and new datasets. Specifically, the communicators moved image tiles containing adipose tissue and background to a new data class. Hence, the original dataset exhibited improved labeling and, consequently, a higher ground truth accuracy. Deep transfer learning of a ResNet18 network resulted in a five-class accuracy of about 94% on test data images. The network was validated with independent tissue sections composed of healthy pancreatic tissue, pancreatic ductal adenocarcinoma, and pancreatic cancer lymph node metastases. The screening of different models and hyperparameter fine tuning were performed to optimize the performance with the independent tissue sections. Taken together, we introduce a step of data preprocessing via communicators as a means of improving the ground truth during deep transfer learning and hyperparameter tuning to identify pancreatic ductal adenocarcinoma primary tumors and metastases in histological tissue sections.
胰腺癌是一种预后不良且治疗选择有限的致命恶性肿瘤。在原发和继发部位进行早期检测至关重要,但充满挑战。虽然数字病理学可以辅助组织病理学图像的分类,但此类网络的训练始终依赖于一个基本事实,而由于组织切片包含多种类型的组织实体,这一事实常常受到影响。在此,我们表明使用深度迁移学习的卷积神经网络可以在苏木精和伊红(H&E)切片上检测到胰腺癌。为了改进基本事实,我们描述了一种使用通过现有数据集和新数据集生成的两个通信器进行预处理数据清理的过程。具体而言,通信器将包含脂肪组织和背景的图像块移动到一个新的数据类别。因此,原始数据集的标签得到了改进,从而基本事实准确性更高。ResNet18网络的深度迁移学习在测试数据图像上实现了约94%的五类准确率。该网络用由健康胰腺组织、胰腺导管腺癌和胰腺癌淋巴结转移组成的独立组织切片进行了验证。对不同模型进行了筛选并对超参数进行了微调,以优化独立组织切片的性能。综上所述,我们引入了通过通信器进行数据预处理的步骤,作为在深度迁移学习和超参数调整过程中改进基本事实的一种手段,以识别组织学组织切片中的胰腺导管腺癌原发肿瘤和转移灶。