Alberta Research Centre for Health Evidence, Department of Pediatrics, University of Alberta, Edmonton, Alberta, Canada.
Alberta Strategy for Patient-Oriented Research (SPOR) SUPPORT Unit Knowledge Translation Platform, University of Alberta, Edmonton, Alberta, Canada.
BMC Med Res Methodol. 2020 Jun 3;20(1):139. doi: 10.1186/s12874-020-01031-w.
We investigated the feasibility of using a machine learning tool's relevance predictions to expedite title and abstract screening.
We subjected 11 systematic reviews and six rapid reviews to four retrospective screening simulations (automated and semi-automated approaches to single-reviewer and dual independent screening) in Abstrackr, a freely-available machine learning software. We calculated the proportion missed, workload savings, and time savings compared to single-reviewer and dual independent screening by human reviewers. We performed cited reference searches to determine if missed studies would be identified via reference list scanning.
For systematic reviews, the semi-automated, dual independent screening approach provided the best balance of time savings (median (range) 20 (3-82) hours) and reliability (median (range) proportion missed records, 1 (0-14)%). The cited references search identified 59% (n = 10/17) of the records missed. For the rapid reviews, the fully and semi-automated approaches saved time (median (range) 9 (2-18) hours and 3 (1-10) hours, respectively), but less so than for the systematic reviews. The median (range) proportion missed records for both approaches was 6 (0-22)%.
Using Abstrackr to assist one of two reviewers in systematic reviews saves time with little risk of missing relevant records. Many missed records would be identified via other means.
我们研究了使用机器学习工具的相关性预测来加快标题和摘要筛选的可行性。
我们对 11 项系统评价和 6 项快速评价进行了四项回顾性筛选模拟(在 Abstrackr 中使用自动化和半自动化方法进行单 reviewer 和双独立 screening),Abstrackr 是一个免费的机器学习软件。我们计算了与单 reviewer 和双独立 screening 相比,人工 reviewer 错过的比例、工作量节省和时间节省。我们进行了被引参考文献搜索,以确定通过参考文献扫描是否可以识别出遗漏的研究。
对于系统评价,半自动、双独立 screening 方法在节省时间(中位数(范围)20(3-82)小时)和可靠性(中位数(范围)错过记录的比例,1(0-14)%)方面提供了最佳平衡。被引参考文献搜索确定了 59%(n=10/17)遗漏的记录。对于快速评价,完全和半自动化方法节省了时间(中位数(范围)分别为 9(2-18)小时和 3(1-10)小时),但不如系统评价节省的多。两种方法的中位数(范围)错过记录的比例均为 6(0-22)%。
在系统评价中使用 Abstrackr 协助两名 reviewer 中的一名可以节省时间,而不会有错过相关记录的风险。许多遗漏的记录将通过其他方式识别。