Intel Corporation, Santa Clara, CA 95052, United States of America.
University of Pennsylvania, 3700 Hamilton Walk, Richards Medical Research Laboratories (7th Fl), Philadelphia, PA 19104, United States of America.
Phys Med Biol. 2022 Oct 19;67(21):214001. doi: 10.1088/1361-6560/ac97d9.
Federated learning (FL) is a computational paradigm that enables organizations to collaborate on machine learning (ML) and deep learning (DL) projects without sharing sensitive data, such as patient records, financial data, or classified secrets.Open federated learning (OpenFL) framework is an open-source python-based tool for training ML/DL algorithms using the data-private collaborative learning paradigm of FL, irrespective of the use case. OpenFL works with training pipelines built with both TensorFlow and PyTorch, and can be easily extended to other ML and DL frameworks.In this manuscript, we present OpenFL and summarize its motivation and development characteristics, with the intention of facilitating its application to existing ML/DL model training in a production environment. We further provide recommendations to secure a federation using trusted execution environments to ensure explicit model security and integrity, as well as maintain data confidentiality. Finally, we describe the first real-world healthcare federations that use the OpenFL library, and highlight how it can be applied to other non-healthcare use cases.The OpenFL library is designed for real world scalability, trusted execution, and also prioritizes easy migration of centralized ML models into a federated training pipeline. Although OpenFL's initial use case was in healthcare, it is applicable beyond this domain and is now reaching wider adoption both in research and production settings. The tool is open-sourced atgithub.com/intel/openfl.
联邦学习(FL)是一种计算范例,它使组织能够在不共享敏感数据(如患者记录、财务数据或机密信息)的情况下合作进行机器学习(ML)和深度学习(DL)项目。OpenFL 是一个基于 Python 的开源框架,用于使用 FL 的数据隐私协作学习范例训练 ML/DL 算法,无论使用案例如何。OpenFL 与使用 TensorFlow 和 PyTorch 构建的培训管道兼容,并且可以轻松扩展到其他 ML 和 DL 框架。在本文中,我们介绍了 OpenFL 并总结了其动机和开发特点,旨在促进其在生产环境中应用于现有 ML/DL 模型训练。我们进一步提供了使用受信任执行环境保护联邦的建议,以确保明确的模型安全性和完整性,并保持数据机密性。最后,我们描述了第一个使用 OpenFL 库的真实医疗保健联邦,并强调了它如何应用于其他非医疗保健用例。OpenFL 库旨在实现真正的可扩展性、受信任的执行,并且还优先考虑将集中式 ML 模型轻松迁移到联邦训练管道中。尽管 OpenFL 的初始用例是在医疗保健领域,但它不仅限于此领域,现在在研究和生产环境中得到了更广泛的采用。该工具在 github.com/intel/openfl 上开源。