Analytics, Intelligence, and Technology, Los Alamos National Laboratory, Los Alamos, NM, United States.
Computer Science, University of New Mexico, Albuquerque, NM, United States.
J Med Internet Res. 2021 May 25;23(5):e27059. doi: 10.2196/27059.
Health authorities can minimize the impact of an emergent infectious disease outbreak through effective and timely risk communication, which can build trust and adherence to subsequent behavioral messaging. Monitoring the psychological impacts of an outbreak, as well as public adherence to such messaging, is also important for minimizing long-term effects of an outbreak.
We used social media data from Twitter to identify human behaviors relevant to COVID-19 transmission, as well as the perceived impacts of COVID-19 on individuals, as a first step toward real-time monitoring of public perceptions to inform public health communications.
We developed a coding schema for 6 categories and 11 subcategories, which included both a wide number of behaviors as well codes focused on the impacts of the pandemic (eg, economic and mental health impacts). We used this to develop training data and develop supervised learning classifiers for classes with sufficient labels. Classifiers that performed adequately were applied to our remaining corpus, and temporal and geospatial trends were assessed. We compared the classified patterns to ground truth mobility data and actual COVID-19 confirmed cases to assess the signal achieved here.
We applied our labeling schema to approximately 7200 tweets. The worst-performing classifiers had F1 scores of only 0.18 to 0.28 when trying to identify tweets about monitoring symptoms and testing. Classifiers about social distancing, however, were much stronger, with F1 scores of 0.64 to 0.66. We applied the social distancing classifiers to over 228 million tweets. We showed temporal patterns consistent with real-world events, and we showed correlations of up to -0.5 between social distancing signals on Twitter and ground truth mobility throughout the United States.
Behaviors discussed on Twitter are exceptionally varied. Twitter can provide useful information for parameterizing models that incorporate human behavior, as well as for informing public health communication strategies by describing awareness of and compliance with suggested behaviors.
卫生当局可以通过有效且及时的风险沟通,将突发传染病的影响降到最低,这可以建立信任并促进后续行为信息的传播。监测疫情对公众心理的影响,以及公众对这些信息的遵守情况,对于最大限度地减少疫情的长期影响也很重要。
我们使用来自 Twitter 的社交媒体数据来识别与 COVID-19 传播相关的人类行为,以及 COVID-19 对个人的感知影响,作为实时监测公众认知以告知公共卫生通信的第一步。
我们为 6 个类别和 11 个子类别制定了编码方案,其中包括广泛的行为以及针对大流行影响的代码(例如,经济和心理健康影响)。我们使用此方案来开发培训数据,并为具有足够标签的类别开发监督学习分类器。表现良好的分类器应用于我们剩余的语料库,并评估时间和地理空间趋势。我们将分类模式与地面真实移动数据和实际 COVID-19 确诊病例进行比较,以评估这里获得的信号。
我们将标签方案应用于大约 7200 条推文。在尝试识别有关监测症状和测试的推文时,性能最差的分类器的 F1 分数仅为 0.18 至 0.28。但是,关于社交距离的分类器要强得多,F1 分数为 0.64 至 0.66。我们将社交距离分类器应用于超过 2.28 亿条推文。我们展示了与现实世界事件一致的时间模式,并且我们展示了在美国各地,Twitter 上的社交距离信号与地面真实移动性之间高达-0.5 的相关性。
Twitter 上讨论的行为非常多样化。Twitter 可以提供有用的信息,用于参数化包含人类行为的模型,以及通过描述对建议行为的认识和遵守情况,为公共卫生通信策略提供信息。