Rutjes Anne W S, Reitsma Johannes B, Di Nisio Marcello, Smidt Nynke, van Rijn Jeroen C, Bossuyt Patrick M M
Deptartment of Clinical Epidemiology and Biostatistics, Academic Medical Center, University of Amsterdam, Amsterdam, The Netherlands.
CMAJ. 2006 Feb 14;174(4):469-76. doi: 10.1503/cmaj.050090.
Studies with methodologic shortcomings can overestimate the accuracy of a medical test. We sought to determine and compare the direction and magnitude of the effects of a number of potential sources of bias and variation in studies on estimates of diagnostic accuracy.
We identified meta-analyses of the diagnostic accuracy of tests through an electronic search of the databases MEDLINE, EMBASE, DARE and MEDION (1999-2002). We included meta-analyses with at least 10 primary studies without preselection based on design features. Pairs of reviewers independently extracted study characteristics and original data from the primary studies. We used a multivariable meta-epidemiologic regression model to investigate the direction and strength of the association between 15 study features on estimates of diagnostic accuracy.
We selected 31 meta-analyses with 487 primary studies of test evaluations. Only 1 study had no design deficiencies. The quality of reporting was poor in most of the studies. We found significantly higher estimates of diagnostic accuracy in studies with nonconsecutive inclusion of patients (relative diagnostic odds ratio [RDOR] 1.5, 95% confidence interval [CI] 1.0-2.1) and retrospective data collection (RDOR 1.6, 95% CI 1.1-2.2). The estimates were highest in studies that had severe cases and healthy controls (RDOR 4.9, 95% CI 0.6-37.3). Studies that selected patients based on whether they had been referred for the index test, rather than on clinical symptoms, produced significantly lower estimates of diagnostic accuracy (RDOR 0.5, 95% CI 0.3-0.9). The variance between meta-analyses of the effect of design features was large to moderate for type of design (cohort v. case-control), the use of composite reference standards and the use of differential verification; the variance was close to zero for the other design features.
Shortcomings in study design can affect estimates of diagnostic accuracy, but the magnitude of the effect may vary from one situation to another. Design features and clinical characteristics of patient groups should be carefully considered by researchers when designing new studies and by readers when appraising the results of such studies. Unfortunately, incomplete reporting hampers the evaluation of potential sources of bias in diagnostic accuracy studies.
存在方法学缺陷的研究可能高估医学检验的准确性。我们试图确定并比较研究中一些潜在偏倚来源和变异对诊断准确性估计的影响方向及程度。
通过对MEDLINE、EMBASE、DARE和MEDION数据库(1999 - 2002年)进行电子检索,确定检验诊断准确性的荟萃分析。我们纳入至少有10项原始研究且未基于设计特征进行预选的荟萃分析。成对的评审员独立从原始研究中提取研究特征和原始数据。我们使用多变量元流行病学回归模型来研究15项研究特征与诊断准确性估计之间关联的方向和强度。
我们选择了31项荟萃分析,包含487项检验评估的原始研究。只有1项研究没有设计缺陷。大多数研究的报告质量较差。我们发现,在非连续纳入患者的研究(相对诊断比值比[RDOR] 1.5,95%置信区间[CI] 1.0 - 2.1)和回顾性数据收集的研究(RDOR 1.6,95% CI 1.1 - 2.2)中,诊断准确性估计显著更高。在有重症病例和健康对照的研究中估计值最高(RDOR 4.9,95% CI 0.6 - 37.3)。基于是否被转诊进行索引检验而非临床症状来选择患者的研究,诊断准确性估计显著更低(RDOR 0.5,95% CI 0.3 - 0.9)。对于设计类型(队列研究与病例对照研究)、复合参考标准的使用以及差异验证的使用,设计特征效应的荟萃分析之间的方差为中到高;对于其他设计特征,方差接近零。
研究设计中的缺陷会影响诊断准确性的估计,但影响程度可能因情况而异。研究人员在设计新研究时以及读者在评估此类研究结果时,应仔细考虑设计特征和患者群体的临床特征。不幸的是,报告不完整妨碍了对诊断准确性研究中潜在偏倚来源的评估。