Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes, Institute of Life and Health Engineering, College of Life Science and Technology, Jinan University, Guangzhou 510632, China.
J Proteome Res. 2018 Nov 2;17(11):3719-3729. doi: 10.1021/acs.jproteome.8b00352. Epub 2018 Oct 8.
In most proteome mass spectrometry experiments, more than half of the mass spectra cannot be identified, mainly because of various modifications. The open search strategy allows a larger precursor mass tolerance so that more spectra can be used, especially those carrying post-translational modifications; however, thorough quality control based on independent information is lacking. Here, we used the "Suspicious Discovery Rate (SDR)", based on translatome sequencing (RNC-seq) as an independent source of evidence, to benchmark proteome open search results in steady-state cells. We found that the open search strategy increased spectra utilization at the cost of more suspicious identifications that lack translation evidence. We further found that restricting the peptide FDR to below 0.1% efficiently controlled the suspicious identifications of open search methods, raising the confidence of modified-peptide identification to a level comparable to that of the traditional narrow-window search. We then successfully identified and validated 27 single amino acid variations from the spectra of two cell lines using the open search strategy without a predefined database. These results validate the proper use of open search methods for higher-quality proteome identifications that include information on post-translational modifications and single amino acid polymorphisms.
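The quality-control idea described above can be illustrated with a minimal sketch. It assumes that the SDR is the fraction of identified genes that have no translation evidence in the RNC-seq data, and that each peptide hit carries a q-value; the abstract does not give the exact formula, so this definition is an assumption. The `PeptideHit` type, the function names, and the example data are all hypothetical.

```python
from typing import Iterable, List, NamedTuple, Set


class PeptideHit(NamedTuple):
    """Hypothetical record for one peptide identification."""
    sequence: str
    gene: str
    q_value: float  # estimated peptide-level FDR for this hit


def filter_peptides(hits: Iterable[PeptideHit], max_fdr: float = 0.001) -> List[PeptideHit]:
    """Keep only hits whose q-value is below the threshold (0.1% by default)."""
    return [h for h in hits if h.q_value < max_fdr]


def suspicious_discovery_rate(hits: Iterable[PeptideHit], translated_genes: Set[str]) -> float:
    """Assumed SDR: share of identified genes lacking translation evidence."""
    genes = {h.gene for h in hits}
    if not genes:
        return 0.0
    suspicious = genes - translated_genes
    return len(suspicious) / len(genes)


# Made-up example data, for illustration only.
hits = [
    PeptideHit("PEPTIDEA", "GENE1", 0.0005),
    PeptideHit("PEPTIDEB", "GENE2", 0.0100),
    PeptideHit("PEPTIDEC", "GENE3", 0.0002),
    PeptideHit("PEPTIDED", "GENE4", 0.0080),
]
translated = {"GENE1", "GENE3", "GENE4"}  # genes detected by RNC-seq

sdr_all = suspicious_discovery_rate(hits, translated)
strict_hits = filter_peptides(hits)
sdr_strict = suspicious_discovery_rate(strict_hits, translated)
```

In this toy data set, the one gene without translation evidence is supported only by a hit with a loose q-value, so the stricter threshold removes it. Real data would need the same comparison run on open search and narrow-window search results side by side.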