Suppr超能文献

使用自动化工具 Rayyan 进行摘要筛选:三项诊断测试准确性系统评价的有效性结果。

Abstract screening using the automated tool Rayyan: results of effectiveness in three diagnostic test accuracy systematic reviews.

机构信息

Neuroscience Institute, Tehran University of Medical Sciences, Tehran, Iran.

Sports Medicine Research Center, Neuroscience Institute, Tehran University of Medical Sciences, Tehran, Iran.

出版信息

BMC Med Res Methodol. 2022 Jun 2;22(1):160. doi: 10.1186/s12874-022-01631-8.

Abstract

OBJECTIVE

To evaluate the performance of the automated abstract screening tool Rayyan.

METHODS

The records obtained from the search for three systematic reviews were manually screened in four stages. At the end of each stage, Rayyan was used to predict the eligibility score for the remaining records. At two different thresholds (≤2.5 and < 2.5 for exclusion of a record) Rayyan-generated ratings were compared with the decisions made by human reviewers in the manual screening process and the tool's accuracy metrics were calculated.

RESULTS

Two thousand fifty-four records were screened manually, of which 379 were judged to be eligible for full-text assessment, and 112 were eventually included in the final review. For finding records eligible for full-text assessment, at the threshold of < 2.5 for exclusion, Rayyan managed to achieve sensitivity values of 97-99% with specificity values of 19-58%, while at the threshold of ≤2.5 for exclusion it had a specificity of 100% with sensitivity values of 1-29%. For the task of finding eligible reports for inclusion in the final review, almost similar results were obtained.

DISCUSSION

At the threshold of < 2.5 for exclusion, Rayyan managed to be a reliable tool for excluding ineligible records, but it was not much reliable for finding eligible records. We emphasize that this study was conducted on diagnostic test accuracy reviews, which are more difficult to screen due to inconsistent terminology.

摘要

目的

评估自动化摘要筛选工具 Rayyan 的性能。

方法

通过手动筛查四个阶段对三个系统评价检索到的记录进行筛选。在每个阶段结束时,使用 Rayyan 预测剩余记录的合格分数。在两个不同的阈值(≤2.5 和 < 2.5 用于排除记录)下,比较 Rayyan 生成的评分与手动筛选过程中人类评审员的决策,并计算工具的准确性指标。

结果

共筛选了 2054 条记录,其中 379 条被判断为有资格进行全文评估,最终有 112 条被纳入最终综述。对于找到有资格进行全文评估的记录,在排除的阈值 < 2.5 的情况下,Rayyan 的敏感性值为 97-99%,特异性值为 19-58%,而在排除的阈值为 ≤2.5 的情况下,特异性值为 100%,敏感性值为 1-29%。对于纳入最终综述的合格报告的任务,几乎得到了类似的结果。

讨论

在排除的阈值 < 2.5 的情况下,Rayyan 成功地成为了排除不合格记录的可靠工具,但在找到合格记录方面并不是很可靠。我们强调,本研究是在诊断测试准确性综述上进行的,由于术语不一致,这些综述更难筛选。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e0c/9161508/b728e4e0b689/12874_2022_1631_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验