Centre for Novel Agricultural Products, Department of Biology, University of York, York, UK.
Mol Microbiol. 2023 Nov;120(5):754-762. doi: 10.1111/mmi.15144. Epub 2023 Aug 30.
The increasing availability of microbial genome sequences provides a reservoir of information for the identification of new microbial enzymes. Genes encoding proteins engaged in extracellular processes are of particular interest as these mediate the interactions microbes have with their environments. However, proteomic analysis of secretomes is challenging and often captures intracellular proteins released through cell death and lysis. Secretome prediction workflows from sequence data are commonly used to filter proteins identified through proteomics but are often simplified to a single step and are not evaluated bioinformatically for their effectiveness. Here, a workflow to predict a fungal secretome was designed and applied to the coding regions of the Parascedosporium putredinis NO1 genome. This ascomycete fungus is an exceptional lignocellulose degrader from which a new lignin-degrading enzyme has previously been identified. The 'secretome isolation' workflow is based on two strategies of localisation prediction and secretion prediction each utilising multiple available tools. The workflow produced three final secretomes with increasing levels of stringency. All three secretomes showed increases in functional annotations for extracellular processes and reductions in annotations for intracellular processes. Multiple sequences isolated as part of the secretome lacked any functional annotation and made exciting candidates for novel enzyme discovery.
微生物基因组序列的日益丰富为鉴定新的微生物酶提供了丰富的信息资源。参与细胞外过程的蛋白质编码基因特别有趣,因为这些基因介导了微生物与其环境的相互作用。然而,细胞外分泌物的蛋白质组学分析具有挑战性,并且通常会捕获通过细胞死亡和裂解释放的细胞内蛋白质。从序列数据预测分泌蛋白的工作流程通常用于筛选通过蛋白质组学鉴定的蛋白质,但通常简化为单个步骤,并且没有针对其有效性进行生物信息学评估。在这里,设计了一种预测真菌分泌组的工作流程,并将其应用于 Parascedosporium putredinis NO1 基因组的编码区。这种子囊菌是一种特殊的木质纤维素降解菌,此前已从该菌中鉴定出一种新的木质素降解酶。“分泌组分离”工作流程基于两种定位预测和分泌预测策略,每种策略都利用了多个可用的工具。该工作流程产生了三个最终的分泌组,其严格程度逐渐提高。所有三个分泌组的细胞外过程的功能注释都增加了,而细胞内过程的注释则减少了。作为分泌组一部分分离出的多个序列缺乏任何功能注释,是发现新型酶的令人兴奋的候选者。