Lombardi Gemma, Crescioli Giada, Cavedo Enrica, Lucenteforte Ersilia, Casazza Giovanni, Bellatorre Alessandro-Giacco, Lista Chiara, Costantino Giorgio, Frisoni Giovanni, Virgili Gianni, Filippini Graziella
University of Florence, Department of Neurosciences, Psychology, Drug Research and Child Health (NEUROFARBA), Largo Brambilla, 3, Florence, Italy, 50134.
Pitie-Salpetriere Hospital, Sorbonne University, Alzheimer Precision Medicine (APM), AP-HP, 47 boulevard de l'Hopital, Paris, France, 75013.
Cochrane Database Syst Rev. 2020 Mar 2;3(3):CD009628. doi: 10.1002/14651858.CD009628.pub2.
Mild cognitive impairment (MCI) due to Alzheimer's disease is the symptomatic predementia phase of Alzheimer's disease dementia, characterised by cognitive and functional impairment not severe enough to fulfil the criteria for dementia. In clinical samples, people with amnestic MCI are at high risk of developing Alzheimer's disease dementia, with annual rates of progression from MCI to Alzheimer's disease estimated at approximately 10% to 15% compared with the base incidence rates of Alzheimer's disease dementia of 1% to 2% per year.
To assess the diagnostic accuracy of structural magnetic resonance imaging (MRI) for the early diagnosis of dementia due to Alzheimer's disease in people with MCI versus the clinical follow-up diagnosis of Alzheimer's disease dementia as a reference standard (delayed verification). To investigate sources of heterogeneity in accuracy, such as the use of qualitative visual assessment or quantitative volumetric measurements, including manual or automatic (MRI) techniques, or the length of follow-up, and age of participants. MRI was evaluated as an add-on test in addition to clinical diagnosis of MCI to improve early diagnosis of dementia due to Alzheimer's disease in people with MCI.
On 29 January 2019 we searched Cochrane Dementia and Cognitive Improvement's Specialised Register and the databases, MEDLINE, Embase, BIOSIS Previews, Science Citation Index, PsycINFO, and LILACS. We also searched the reference lists of all eligible studies identified by the electronic searches.
We considered cohort studies of any size that included prospectively recruited people of any age with a diagnosis of MCI. We included studies that compared the diagnostic test accuracy of baseline structural MRI versus the clinical follow-up diagnosis of Alzheimer's disease dementia (delayed verification). We did not exclude studies on the basis of length of follow-up. We included studies that used either qualitative visual assessment or quantitative volumetric measurements of MRI to detect atrophy in the whole brain or in specific brain regions, such as the hippocampus, medial temporal lobe, lateral ventricles, entorhinal cortex, medial temporal gyrus, lateral temporal lobe, amygdala, and cortical grey matter.
Four teams of two review authors each independently reviewed titles and abstracts of articles identified by the search strategy. Two teams of two review authors each independently assessed the selected full-text articles for eligibility, extracted data and solved disagreements by consensus. Two review authors independently assessed the quality of studies using the QUADAS-2 tool. We used the hierarchical summary receiver operating characteristic (HSROC) model to fit summary ROC curves and to obtain overall measures of relative accuracy in subgroup analyses. We also used these models to obtain pooled estimates of sensitivity and specificity when sufficient data sets were available.
We included 33 studies, published from 1999 to 2019, with 3935 participants of whom 1341 (34%) progressed to Alzheimer's disease dementia and 2594 (66%) did not. Of the participants who did not progress to Alzheimer's disease dementia, 2561 (99%) remained stable MCI and 33 (1%) progressed to other types of dementia. The median proportion of women was 53% and the mean age of participants ranged from 63 to 87 years (median 73 years). The mean length of clinical follow-up ranged from 1 to 7.6 years (median 2 years). Most studies were of poor methodological quality due to risk of bias for participant selection or the index test, or both. Most of the included studies reported data on the volume of the total hippocampus (pooled mean sensitivity 0.73 (95% confidence interval (CI) 0.64 to 0.80); pooled mean specificity 0.71 (95% CI 0.65 to 0.77); 22 studies, 2209 participants). This evidence was of low certainty due to risk of bias and inconsistency. Seven studies reported data on the atrophy of the medial temporal lobe (mean sensitivity 0.64 (95% CI 0.53 to 0.73); mean specificity 0.65 (95% CI 0.51 to 0.76); 1077 participants) and five studies on the volume of the lateral ventricles (mean sensitivity 0.57 (95% CI 0.49 to 0.65); mean specificity 0.64 (95% CI 0.59 to 0.70); 1077 participants). This evidence was of moderate certainty due to risk of bias. Four studies with 529 participants analysed the volume of the total entorhinal cortex and four studies with 424 participants analysed the volume of the whole brain. We did not estimate pooled sensitivity and specificity for the volume of these two regions because available data were sparse and heterogeneous. We could not statistically evaluate the volumes of the lateral temporal lobe, amygdala, medial temporal gyrus, or cortical grey matter assessed in small individual studies. We found no evidence of a difference between studies in the accuracy of the total hippocampal volume with regards to duration of follow-up or age of participants, but the manual MRI technique was superior to automatic techniques in mixed (mostly indirect) comparisons. We did not assess the relative accuracy of the volumes of different brain regions measured by MRI because only indirect comparisons were available, studies were heterogeneous, and the overall accuracy of all regions was moderate.
AUTHORS' CONCLUSIONS: The volume of hippocampus or medial temporal lobe, the most studied brain regions, showed low sensitivity and specificity and did not qualify structural MRI as a stand-alone add-on test for an early diagnosis of dementia due to Alzheimer's disease in people with MCI. This is consistent with international guidelines, which recommend imaging to exclude non-degenerative or surgical causes of cognitive impairment and not to diagnose dementia due to Alzheimer's disease. In view of the low quality of most of the included studies, the findings of this review should be interpreted with caution. Future research should not focus on a single biomarker, but rather on combinations of biomarkers to improve an early diagnosis of Alzheimer's disease dementia.
阿尔茨海默病所致轻度认知障碍(MCI)是阿尔茨海默病痴呆的症状性痴呆前期,其特征为认知和功能障碍程度不足以达到痴呆的诊断标准。在临床样本中,遗忘型MCI患者发展为阿尔茨海默病痴呆的风险较高,与阿尔茨海默病痴呆每年1%至2%的基础发病率相比,MCI每年进展为阿尔茨海默病的发生率估计约为10%至15%。
评估结构磁共振成像(MRI)对MCI患者早期诊断阿尔茨海默病痴呆的诊断准确性,并与阿尔茨海默病痴呆的临床随访诊断作为参考标准(延迟验证)进行比较。研究准确性的异质性来源,如使用定性视觉评估或定量体积测量,包括手动或自动(MRI)技术,或随访时间长度以及参与者年龄。MRI被评估为除MCI临床诊断之外的附加测试,以改善MCI患者早期诊断阿尔茨海默病痴呆的情况。
2019年1月29日,我们检索了Cochrane痴呆与认知改善专业注册库以及MEDLINE、Embase、BIOSIS Previews、科学引文索引(Science Citation Index)、PsycINFO和LILACS等数据库。我们还检索了电子检索所识别的所有合格研究的参考文献列表。
我们纳入了任何规模的队列研究,这些研究前瞻性招募了任何年龄且诊断为MCI的人群。我们纳入了比较基线结构MRI与阿尔茨海默病痴呆临床随访诊断(延迟验证)的诊断测试准确性的研究。我们没有根据随访时间长度排除研究。我们纳入了使用MRI的定性视觉评估或定量体积测量来检测全脑或特定脑区萎缩的研究,如海马体、内侧颞叶、侧脑室、内嗅皮质、内侧颞回、外侧颞叶、杏仁核和皮质灰质。
四组,每组两名综述作者,各自独立审查检索策略所识别文章的标题和摘要。两组,每组两名综述作者,各自独立评估所选全文文章的合格性,提取数据并通过共识解决分歧。两名综述作者使用QUADAS - 2工具独立评估研究质量。我们使用分层汇总接受者操作特征(HSROC)模型来拟合汇总ROC曲线,并在亚组分析中获得相对准确性的总体测量值。当有足够的数据集时,我们还使用这些模型来获得敏感性和特异性的合并估计值。
我们纳入了1999年至2019年发表的33项研究,共3935名参与者,其中1341名(34%)进展为阿尔茨海默病痴呆,2594名(66%)未进展。在未进展为阿尔茨海默病痴呆的参与者中,2561名(99%)仍为稳定的MCI,33名(1%)进展为其他类型的痴呆。女性的中位数比例为53%,参与者的平均年龄在63至87岁之间(中位数73岁)。临床随访的平均时间长度在1至7.6年之间(中位数2年)。由于参与者选择或指标测试存在偏倚风险,或两者皆有,大多数研究的方法学质量较差。大多数纳入研究报告了总海马体体积的数据(合并平均敏感性0.73(95%置信区间(CI)0.64至0.80);合并平均特异性0.71(95% CI 0.65至0.77);22项研究,2209名参与者)。由于存在偏倚风险和不一致性,该证据的确定性较低。七项研究报告了内侧颞叶萎缩的数据(平均敏感性0.64(95% CI 0.53至0.73);平均特异性0.65(95% CI 0.51至0.76);1077名参与者),五项研究报告了侧脑室体积的数据(平均敏感性0.57(95% CI 0.49至0.65);平均特异性0.64(95% CI 0.59至0.70);1077名参与者)。由于存在偏倚风险,该证据的确定性为中等。四项研究共529名参与者分析了内嗅皮质总体积,四项研究共424名参与者分析了全脑体积。由于可用数据稀少且异质性大,我们未估计这两个区域体积的合并敏感性和特异性。我们无法对小样本个体研究中评估的外侧颞叶、杏仁核、内侧颞回或皮质灰质体积进行统计学评估。我们未发现关于总海马体体积准确性在随访时间或参与者年龄方面研究之间存在差异的证据,但在混合(大多为间接)比较中,手动MRI技术优于自动技术。由于仅有间接比较、研究异质性大且所有区域的总体准确性中等,我们未评估MRI测量的不同脑区体积的相对准确性。
海马体或内侧颞叶体积,这两个研究最多的脑区,显示出低敏感性和特异性,不能将结构MRI作为MCI患者早期诊断阿尔茨海默病痴呆的独立附加测试。这与国际指南一致,国际指南建议进行影像学检查以排除认知障碍的非退行性或手术性原因,而不是诊断阿尔茨海默病痴呆。鉴于大多数纳入研究质量较低,本综述的结果应谨慎解释。未来研究不应专注于单一生物标志物,而应关注生物标志物组合以改善阿尔茨海默病痴呆的早期诊断。