Ye Jiayu, Pan Dan, Zeng An, Zhang Yiqun, Chen Qiuping, Liu Yang
School of Computer Science, Guangdong University of Technology, Guangzhou 510006, China.
School of Electronics and Information, Guangdong Polytechnic Normal University, Guangzhou 510665, China.
IEEE Trans Emerg Top Comput Intell. 2025 Apr;9(2):1454-1468. doi: 10.1109/tetci.2025.3537942. Epub 2025 Feb 19.
Deep learning models are widely used in medical image-guided disease recognition and have achieved outstanding performance. Voxel-based models are typically the default choice for deep learning-based MRI analysis, which require high computational resources and large data volumes, making them inefficient for rapid disease screening. Simultaneously, the existing Alzheimer's disease (AD) recognition model is primarily comprised of Convolutional Neural Network (CNN) structures. With the increasing of the network depth, the fine-grained details of global features tend to be partially lost. Therefore, we propose a Multi-scale spatial self-attention Network (MssNet) that effectively captures both coarse-grained and fine-grained features. We design to select the target slice based on image entropy to achieve efficient slice-based AD recognition. To capture multi-level spatial information, a novel spatial attention mechanism and spatial self-attention attention are designed. The former is utilized to collect critical spatial information and identify areas that are likely to be lesions, the latter investigates the relationship between features in different image regions through spatial interaction by pure convolutional blocks. MssNet fully utilizes multi-scale information at different granularities for spatial feature interaction, providing it with strong modeling and information understanding capabilities. It has achieved excellent performance in the recognition tasks of Alzheimer's Disease Neuroimaging Initiative (ADNI) and Open Access Series of Imaging Studies (OASIS) datasets. Moreover, MssNet is a lightweight model involving lower scale parameters against the Voxel-based ones, while demonstrating strong generalization capability.
深度学习模型在医学图像引导的疾病识别中被广泛应用,并取得了出色的性能。基于体素的模型通常是基于深度学习的MRI分析的默认选择,这种模型需要高计算资源和大量数据,导致其在快速疾病筛查方面效率低下。同时,现有的阿尔茨海默病(AD)识别模型主要由卷积神经网络(CNN)结构组成。随着网络深度的增加,全局特征的细粒度细节往往会部分丢失。因此,我们提出了一种多尺度空间自注意力网络(MssNet),它能有效捕捉粗粒度和细粒度特征。我们设计基于图像熵选择目标切片,以实现基于切片的高效AD识别。为了捕捉多级空间信息,设计了一种新颖的空间注意力机制和空间自注意力机制。前者用于收集关键空间信息并识别可能是病变的区域,后者通过纯卷积块的空间交互研究不同图像区域中特征之间的关系。MssNet充分利用不同粒度的多尺度信息进行空间特征交互,赋予其强大的建模和信息理解能力。它在阿尔茨海默病神经影像倡议(ADNI)和开放获取影像研究系列(OASIS)数据集的识别任务中取得了优异的性能。此外,与基于体素的模型相比,MssNet是一个轻量级模型,涉及的参数规模较小,同时展现出强大的泛化能力。