Del Giudice Libera Lucia, Piersanti Agnese, Göbl Christian, Burattini Laura, Tura Andrea, Morettini Micaela
Department of Information Engineering, Università Politecnica delle Marche, Ancona, Italy.
CNR Institute of Neuroscience, Padua, Italy.
J Diabetes Sci Technol. 2025 Feb 14:19322968251316896. doi: 10.1177/19322968251316896.
Poor data availability and accessibility characterizing some research areas in biomedicine are still limiting potentialities for increasing knowledge and boosting technological advancement. This phenomenon also characterizes the field of diabetes research, in which glycemic data may serve as a basis for different applications. To overcome this limitation, this review aims to provide a comprehensive analysis of the publicly available data sets related to dynamic glycemic data.
Search was performed in four different sources, namely scientific journals, Google, a comprehensive registry of clinical trials and two electronic databases. Retrieved data sets were analyzed in terms of their main characteristics and on the typology of data provided.
Twenty-five data sets were identified including data from challenge tests (5 of 25) or data from Continuous Glucose Monitoring (CGM, 20 of 25). As for the data sets including challenge tests, all of them were freely downloadable; most of them (80%) related only to oral glucose tolerance test (OGTT) with standard duration (2 h), but varying for timing and number of collected blood samples, and variables collected in addition to glucose levels (with insulin levels being the most common); the remaining 20% of them also included intravenous glucose tolerance test (IVGTT) data. As for the data sets related to CGM, 7 of 20 were freely downloadable, whereas the remaining 13 were downloadable upon completion of a request form.
This review provided an overview of the readily usable data sets, thus representing a step forward in fostering data access in diabetes field.
生物医学某些研究领域存在数据可用性和可获取性差的问题,这仍然限制了知识增长和技术进步的潜力。这种现象在糖尿病研究领域也很突出,其中血糖数据可作为不同应用的基础。为克服这一限制,本综述旨在对与动态血糖数据相关的公开可用数据集进行全面分析。
在四个不同来源进行搜索,即科学期刊、谷歌、临床试验综合注册库和两个电子数据库。对检索到的数据集的主要特征和所提供数据的类型进行了分析。
共识别出25个数据集,包括挑战试验数据(25个中的5个)或连续血糖监测(CGM)数据(25个中的20个)。对于包含挑战试验的数据集,所有数据集均可免费下载;其中大多数(80%)仅与标准时长(2小时)的口服葡萄糖耐量试验(OGTT)相关,但采血时间和数量以及除血糖水平外收集的变量有所不同(胰岛素水平最为常见);其余20%还包括静脉葡萄糖耐量试验(IVGTT)数据。对于与CGM相关的数据集,20个中有7个可免费下载,其余13个需填写申请表后下载。
本综述概述了易于使用的数据集,从而在促进糖尿病领域的数据获取方面向前迈进了一步。