Morey Rajendra A, Petty Christopher M, Xu Yuan, Hayes Jasmeet Pannu, Wagner H Ryan, Lewis Darrell V, LaBar Kevin S, Styner Martin, McCarthy Gregory
Duke-UNC Brain Imaging and Analysis Center, Duke University, Durham, NC 27710, USA.
Neuroimage. 2009 Apr 15;45(3):855-66. doi: 10.1016/j.neuroimage.2008.12.033. Epub 2008 Dec 30.
Large databases of high-resolution structural MR images are being assembled to quantitatively examine the relationships between brain anatomy, disease progression, treatment regimens, and genetic influences upon brain structure. Quantifying brain structures in such large databases cannot be practically accomplished by expert neuroanatomists using hand-tracing. Rather, this research will depend upon automated methods that reliably and accurately segment and quantify dozens of brain regions. At present, there is little guidance available to help clinical research groups in choosing such tools. Thus, our goal was to compare the performance of two popular and fully automated tools, FSL/FIRST and FreeSurfer, to expert hand tracing in the measurement of the hippocampus and amygdala. Volumes derived from each automated measurement were compared to hand tracing for percent volume overlap, percent volume difference, across-sample correlation, and 3-D group-level shape analysis. In addition, sample size estimates for conducting between-group studies were computed for a range of effect sizes. Compared to hand tracing, hippocampal measurements with FreeSurfer exhibited greater volume overlap, smaller volume difference, and higher correlation than FIRST, and sample size estimates with FreeSurfer were closer to hand tracing. Amygdala measurement with FreeSurfer was also more highly correlated to hand tracing than FIRST, but exhibited a greater volume difference than FIRST. Both techniques had comparable volume overlap and similar sample size estimates. Compared to hand tracing, a 3-D shape analysis of the hippocampus showed FreeSurfer was more accurate than FIRST, particularly in the head and tail. However, FIRST more accurately represented the amygdala shape than FreeSurfer, which inflated its anterior and posterior surfaces.
高分辨率结构磁共振成像的大型数据库正在组建中,用于定量研究脑解剖结构、疾病进展、治疗方案以及基因对脑结构的影响之间的关系。在如此庞大的数据库中,由专业神经解剖学家通过手工追踪来定量脑结构实际上是无法完成的。相反,这项研究将依赖于能够可靠且准确地分割和量化数十个脑区的自动化方法。目前,几乎没有可用的指导来帮助临床研究团队选择此类工具。因此,我们的目标是比较两种常用的全自动工具FSL/FIRST和FreeSurfer在测量海马体和杏仁核方面与专家手工追踪的性能。将每种自动化测量得出的体积与手工追踪结果进行比较,以计算体积重叠百分比、体积差异百分比、样本间相关性以及三维组水平形状分析。此外,针对一系列效应大小计算了进行组间研究所需的样本量估计值。与手工追踪相比,FreeSurfer对海马体的测量在体积重叠方面更大,体积差异更小,相关性更高,且FreeSurfer的样本量估计值更接近手工追踪。FreeSurfer对杏仁核的测量与手工追踪的相关性也高于FIRST,但体积差异比FIRST更大。两种技术的体积重叠相当,样本量估计值相似。与手工追踪相比,对海马体的三维形状分析表明FreeSurfer比FIRST更准确,尤其是在头部和尾部。然而,FIRST比FreeSurfer更准确地呈现了杏仁核的形状,FreeSurfer夸大了其前后表面。