Suppr超能文献

推进呼吸系统疾病诊断:一种基于深度学习和视觉Transformer的方法及新型X射线数据集

Advancing respiratory disease diagnosis: A deep learning and vision transformer-based approach with a novel X-ray dataset.

作者信息

Alghadhban Amer, Ramadan Rabie A, Alazmi Meshari

机构信息

Electrical Engineering Department, College of Engineering, University of Ha'il, Ha'il, 55476, Saudi Arabia.

College of Economics, Management & Information Systems, Department of Information Systems, Nizwa University, Nizwa, Sultanate of Oman; Department of Computer Engineering, Faculty of Engineering, Cairo University, Giza, Egypt.

出版信息

Comput Biol Med. 2025 Aug;194:110501. doi: 10.1016/j.compbiomed.2025.110501. Epub 2025 Jun 9.

Abstract

With the increasing prevalence of respiratory diseases such as pneumonia and COVID-19, timely and accurate diagnosis is critical. This paper makes significant contributions to the field of respiratory disease classification by utilizing X-ray images and advanced machine learning techniques such as deep learning (DL) and Vision Transformers (ViT). First, the paper systematically reviews the current diagnostic methodologies, analyzing the recent advancement in DL and ViT techniques through a comprehensive analysis of the review articles published between 2017 and 2024, excluding short reviews and overviews. The review not only analyses the existing knowledge but also identifies the critical gaps in the field as well as the lack of diversity of the comprehensive and diverse datasets for training the machine learning models. To address such limitations, the paper extensively evaluates DL-based models on publicly available datasets, analyzing key performance metrics such as accuracy, precision, recall, and F1-score. Our evaluations reveal that the current datasets are mostly limited to the narrow subsets of pulmonary diseases, which might lead to some challenges, including overfitting, poor generalization, and reduced possibility of using advanced machine learning techniques in real-world applications. For instance, DL and ViT models require extensive data for effective learning. The primary contribution of this paper is not only the review of the most recent articles and surveys of respiratory diseases and DL models, including ViT, but also introduces a novel, diverse dataset comprising 7867 X-ray images from 5263 patients across three local hospitals, covering 49 distinct pulmonary diseases. The dataset is expected to enhance DL and ViT model training and improve the generalization of those models in various real-world medical image scenarios. By addressing the data scarcity issue, this paper paves the for more reliable and robust disease classification, improving clinical decision-making. Additionally, the article highlights the critical challenges that still need to be addressed, such as dataset bias and variations of X-ray image quality, as well as the need for further clinical validation. Furthermore, the study underscores the critical role of DL in medical diagnosis and highlights the necessity of comprehensive, well-annotated datasets to improve model robustness and clinical reliability. Through these contributions, the paper provides the basis and foundation of future research on respiratory disease diagnosis using AI-driven methodologies. Although the paper tries to cover all the work done between 2017 and 2024, this research might have some limitations of this research, including the review period before 2017 might have foundational work. At the same time, the rapid development of AI might make the earlier methods less relevant.

摘要

随着肺炎和新冠肺炎等呼吸道疾病的患病率不断上升,及时准确的诊断至关重要。本文通过利用X射线图像以及深度学习(DL)和视觉Transformer(ViT)等先进的机器学习技术,为呼吸道疾病分类领域做出了重大贡献。首先,本文系统地回顾了当前的诊断方法,通过对2017年至2024年发表的综述文章进行全面分析(不包括简短综述和概述),分析了DL和ViT技术的最新进展。该综述不仅分析了现有知识,还确定了该领域的关键差距以及用于训练机器学习模型的综合多样数据集缺乏多样性的问题。为了解决这些局限性,本文在公开可用的数据集上广泛评估了基于DL的模型,分析了诸如准确率、精确率、召回率和F1分数等关键性能指标。我们的评估表明,当前的数据集大多局限于肺部疾病的狭窄子集,这可能会导致一些挑战,包括过拟合、泛化能力差以及在实际应用中使用先进机器学习技术的可能性降低。例如,DL和ViT模型需要大量数据才能有效学习。本文的主要贡献不仅在于对最新文章以及呼吸道疾病和DL模型(包括ViT)的综述,还引入了一个新颖、多样的数据集,该数据集包含来自三家当地医院的5263名患者的7867张X射线图像,涵盖49种不同的肺部疾病。该数据集有望加强DL和ViT模型的训练,并提高这些模型在各种实际医学图像场景中的泛化能力。通过解决数据稀缺问题,本文为更可靠、更稳健的疾病分类铺平了道路,改善了临床决策。此外,本文还强调了仍需解决的关键挑战,如数据集偏差和X射线图像质量的变化,以及进一步进行临床验证的必要性。此外,该研究强调了DL在医学诊断中的关键作用,并突出了全面、标注良好的数据集对于提高模型稳健性和临床可靠性的必要性。通过这些贡献,本文为未来使用人工智能驱动方法进行呼吸道疾病诊断的研究提供了基础。尽管本文试图涵盖2017年至2024年期间完成的所有工作,但本研究可能存在一些局限性,包括2017年之前的综述期可能有基础工作。同时,人工智能的快速发展可能使早期方法的相关性降低。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验