Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, 2800 Kongens Lyngby, Denmark.
Bioinformatics Centre, Department of Biology, University of Copenhagen, 2200 Copenhagen, Denmark.
Nucleic Acids Res. 2024 Jul 5;52(W1):W215-W220. doi: 10.1093/nar/gkae237.
DeepLoc 2.0 is a popular web server for the prediction of protein subcellular localization and sorting signals. Here, we introduce DeepLoc 2.1, which additionally classifies the input proteins into the membrane protein types Transmembrane, Peripheral, Lipid-anchored and Soluble. Leveraging pre-trained transformer-based protein language models, the server utilizes a three-stage architecture for sequence-based, multi-label predictions. Comparative evaluations with other established tools on a test set of 4933 eukaryotic protein sequences, constructed following stringent homology partitioning, demonstrate state-of-the-art performance. Notably, DeepLoc 2.1 outperforms existing models, with the larger ProtT5 model exhibiting a marginal advantage over the ESM-1B model. The web server is available at https://services.healthtech.dtu.dk/services/DeepLoc-2.1.
DeepLoc 2.0 是一个用于预测蛋白质亚细胞定位和分拣信号的流行网络服务器。在这里,我们介绍 DeepLoc 2.1,它还可以将输入的蛋白质分类为膜蛋白类型:跨膜、外周、脂锚定和可溶性。该服务器利用基于预训练的转换器的蛋白质语言模型,采用基于序列的三阶段架构进行多标签预测。在一个经过严格同源分区构建的 4933 个真核蛋白质序列测试集上,与其他已建立的工具进行的比较评估表明,该服务器具有最先进的性能。值得注意的是,DeepLoc 2.1 优于现有的模型,较大的 ProtT5 模型比 ESM-1B 模型表现出略微的优势。该网络服务器可在 https://services.healthtech.dtu.dk/services/DeepLoc-2.1 上获取。