Alpert Jordan, Kim Hyehyun Julia, McDonnell Cara, Guo Yi, George Thomas J, Bian Jiang, Wu Yonghui
Cleveland Clinic, Center for Value-Based Care Research, Cleveland, OH, United States.
College of Journalism and Communications, University of Florida, Gainesville, FL, United States.
JMIR Form Res. 2022 Dec 27;6(12):e43059. doi: 10.2196/43059.
Social determinants of health (SDoH), such as geographic neighborhoods, access to health care, education, and social structure, are important factors affecting people's health and health outcomes. The SDoH of patients are scarcely documented in a discrete format in electronic health records (EHRs) but are often available in free-text clinical narratives such as physician notes. Innovative methods like natural language processing (NLP) are being developed to identify and extract SDoH from EHRs, but it is imperative that the input of key stakeholders is included as NLP systems are designed.
This study aims to understand the feasibility, challenges, and benefits of developing an NLP system to uncover SDoH from clinical narratives by conducting interviews with key stakeholders: (1) oncologists, (2) data analysts, (3) citizen scientists, and (4) patient navigators.
Individuals who frequently work with SDoH data were invited to participate in semistructured interviews. All interviews were recorded and subsequently transcribed. After coding transcripts and developing a codebook, the constant comparative method was used to generate themes.
A total of 16 participants were interviewed (5 data analysts, 4 patient navigators, 4 physicians, and 3 citizen scientists). Three main themes emerged, accompanied by subthemes. The first theme, importance and approaches to obtaining SDoH, describes how every participant (n=16, 100%) regarded SDoH as important. In particular, proximity to the hospital and income levels were frequently relied upon. Communication about SDoH typically occurs during the initial conversation with the oncologist, but more personal information is often acquired by patient navigators. The second theme, SDoH exists in numerous forms, exemplified how SDoH arises during informal communication and can be difficult to enter into the EHR. The final theme, incorporating SDoH into health services research, addresses how more informed SDoH can be collected. One strategy is to empower patients so they are aware about the importance of SDoH, as well as employing NLP techniques to make narrative data available in a discrete format, which can provide oncologists with actionable data summaries.
Extracting SDoH from EHRs was considered valuable and necessary, but obstacles such as narrative data format can make the process difficult. NLP can be a potential solution, but as the technology is developed, it is important to consider how key stakeholders document SDoH, apply the NLP systems, and use the extracted SDoH in health outcome studies.
健康的社会决定因素(SDoH),如地理位置、医疗保健可及性、教育和社会结构,是影响人们健康及健康结果的重要因素。患者的SDoH在电子健康记录(EHR)中很少以离散格式记录,但在诸如医生笔记等自由文本临床叙述中通常可以获取。正在开发自然语言处理(NLP)等创新方法,以从EHR中识别和提取SDoH,但在设计NLP系统时,必须纳入关键利益相关者的意见。
本研究旨在通过对关键利益相关者进行访谈,了解开发一个从临床叙述中揭示SDoH的NLP系统的可行性、挑战和益处,这些关键利益相关者包括:(1)肿瘤学家,(2)数据分析师,(3)公民科学家,以及(4)患者导航员。
邀请经常处理SDoH数据的个人参与半结构化访谈。所有访谈均进行录音,随后转录。在对转录文本进行编码并制定编码手册后,采用持续比较法生成主题。
共访谈了16名参与者(5名数据分析师、4名患者导航员、4名医生和3名公民科学家)。出现了三个主要主题及子主题。第一个主题“获取SDoH的重要性和方法”描述了每位参与者(n = 16,100%)如何将SDoH视为重要因素。特别是,经常会参考与医院的距离和收入水平。关于SDoH的沟通通常在与肿瘤学家的初次交谈中进行,但患者导航员通常会获取更多个人信息。第二个主题“SDoH以多种形式存在”举例说明了SDoH如何在非正式沟通中出现,并且难以录入EHR。最后一个主题“将SDoH纳入卫生服务研究”讨论了如何收集更全面的SDoH。一种策略是增强患者的意识,使其了解SDoH的重要性,以及采用NLP技术将叙述性数据以离散格式呈现,这可以为肿瘤学家提供可操作的数据摘要。
从EHR中提取SDoH被认为是有价值且必要的,但诸如叙述性数据格式等障碍可能使这一过程变得困难。NLP可能是一个潜在的解决方案,但随着技术的发展,重要的是要考虑关键利益相关者如何记录SDoH、应用NLP系统以及在健康结果研究中使用提取的SDoH。