Phyloinformatics Unit, RIKEN Center for Life Science Technologies, Kobe, Hyogo 650-0047, Japan.
Bioinformatics. 2017 Nov 15;33(22):3635-3637. doi: 10.1093/bioinformatics/btx445.
Along with the increasing accessibility to comprehensive sequence information, such as whole genomes and transcriptomes, the demand for assessing their quality has been multiplied. To this end, metrics based on sequence lengths, such as N50, have become a standard, but they only evaluate one aspect of assembly quality. Conversely, analyzing the coverage of pre-selected reference protein-coding genes provides essential content-based quality assessment, but the currently available pipelines for this purpose, CEGMA and BUSCO, do not have a user-friendly interface to serve as a uniform environment for assembly completeness assessment.
Here, we introduce a brand-new web server, gVolante, which provides an online tool for (i) on-demand completeness assessment of sequence sets by means of the previously developed pipelines CEGMA and BUSCO and (ii) browsing pre-computed completeness scores for publicly available data in its database section. Completeness assessments performed on gVolante report scores based on not just the coverage of reference genes but also on sequence lengths (e.g. N50 scaffold length), allowing quality control in multiple aspects. Using gVolante, one can compare the quality of original assemblies between their multiple versions (obtained through program choice and parameter tweaking, for example) and evaluate them in comparison to the scores of public resources found in the database section.
gVoalte is freely available at https://gvolante.riken.jp/.
随着综合序列信息(如全基因组和转录组)可获取性的提高,对其质量评估的需求也呈指数级增长。为此,基于序列长度的指标(如 N50)已成为标准,但它们仅评估了组装质量的一个方面。相反,分析预先选择的参考蛋白编码基因的覆盖度提供了必要的基于内容的质量评估,但目前为此目的提供的流水线 CEGMA 和 BUSCO 没有用户友好的界面,无法作为组装完整性评估的统一环境。
在这里,我们介绍了一个全新的网络服务器 gVolante,它提供了一个在线工具,用于 (i) 通过先前开发的 CEGMA 和 BUSCO 流水线按需完成序列集的完整性评估,以及 (ii) 在其数据库部分浏览预先计算的公共数据的完整性得分。gVolante 上执行的完整性评估报告的分数不仅基于参考基因的覆盖度,还基于序列长度(例如 N50 支架长度),从而可以在多个方面进行质量控制。使用 gVolante,用户可以比较原始组装在其多个版本之间的质量(例如通过程序选择和参数调整获得),并将其与数据库部分中公共资源的得分进行评估。
gVoalte 可在 https://gvolante.riken.jp/ 免费获得。