比较用于从叙述文本中提取医学问题的自然语言处理工具。

Comparing natural language processing tools to extract medical problems from narrative text.

作者信息

Meystre Stéphane M, Haug Peter J

机构信息

Department of Medical Informatics, University of Utah, Salt Lake City, USA.

出版信息

AMIA Annu Symp Proc. 2005;2005:525-9.

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1560561/

Abstract

To help maintain a complete, accurate and timely Problem List, we are developing a system to automatically retrieve medical problems from free-text documents. This system uses Natural Language Processing to analyze all electronic narrative text documents in a patient's record. Here we evaluate and compare 3 different applications of NLP technology in our system: the first using MMTx (MetaMap Transfer) with a negation detection algorithm (NegEx), the second using an alpha version of a locally developed NLP application called MPLUS2, and the third using keyword searching. They were adapted and trained to extract medical problems from a set of 80 problems of diagnosis type. The version using MMTx and NegEx was improved by adding some disambiguation and modifying the negation detection algorithm, and these modifications significantly improved recall and precision. The different versions of the NLP module were compared, and showed the following recall / precision results: standard MMTx with NegEx version 0.775 / 0.398; improved MMTx with NegEx version 0.892 / 0.753; MPLUS2 version 0.693 / 0.402; and keyword searching version 0.575 / 0.807. Average results for the reviewers were a recall of 0.788 and a precision of 0.912.

摘要

为了帮助维护一个完整、准确且及时的问题列表，我们正在开发一个系统，用于从自由文本文件中自动检索医疗问题。该系统使用自然语言处理技术来分析患者记录中的所有电子叙述文本文件。在此，我们评估并比较了自然语言处理技术在我们系统中的3种不同应用：第一种使用带有否定检测算法（NegEx）的MMTx（MetaMap Transfer），第二种使用名为MPLUS2的本地开发的自然语言处理应用程序的alpha版本，第三种使用关键词搜索。它们经过调整和训练，以从一组80个诊断类型的问题中提取医疗问题。使用MMTx和NegEx的版本通过添加一些消歧和修改否定检测算法得到了改进，这些修改显著提高了召回率和精确率。对自然语言处理模块的不同版本进行了比较，结果显示了以下召回率/精确率：标准MMTx与NegEx版本为0.775 / 0.398；改进后的MMTx与NegEx版本为0.892 / 0.753；MPLUS2版本为0.693 / 0.402；关键词搜索版本为0.575 / 0.807。评审人员的平均结果是召回率为0.788，精确率为0.912。

相似文献

1

Comparing natural language processing tools to extract medical problems from narrative text.

AMIA Annu Symp Proc. 2005;2005:525-9.

2

Natural language processing to extract medical problems from electronic clinical documents: performance evaluation.

J Biomed Inform. 2006 Dec;39(6):589-99. doi: 10.1016/j.jbi.2005.11.004. Epub 2005 Dec 5.

3

Evaluation of Medical Problem Extraction from Electronic Clinical Documents Using MetaMap Transfer (MMTx).

Stud Health Technol Inform. 2005;116:823-8.

4

Medical problem and document model for natural language understanding.

AMIA Annu Symp Proc. 2003;2003:455-9.

5

A normalized lexical lookup approach to identifying UMLS concepts in free text.

Stud Health Technol Inform. 2007;129(Pt 1):545-9.

6

Automation of a problem list using natural language processing.

BMC Med Inform Decis Mak. 2005 Aug 31;5:30. doi: 10.1186/1472-6947-5-30.

7

Implementation and evaluation of a negation tagger in a pipeline-based system for information extract from pathology reports.

Stud Health Technol Inform. 2004;107(Pt 1):663-7.

8

Natural language processing and inference rules as strategies for updating problem list in an electronic health record.

Stud Health Technol Inform. 2013;192:1163.

9

Using NLP to extract concepts from chief complaints.

AMIA Annu Symp Proc. 2005;2005:1029.

10

Negation recognition in clinical natural language processing using a combination of the NegEx algorithm and a convolutional neural network.

BMC Med Inform Decis Mak. 2023 Oct 13;23(1):216. doi: 10.1186/s12911-023-02301-5.

引用本文的文献

1

Building a Shared, Scalable, and Sustainable Source for the Problem-Oriented Medical Record: Developmental Study.

JMIR Med Inform. 2021 Oct 13;9(10):e29174. doi: 10.2196/29174.

2

Natural Language Processing for EHR-Based Computational Phenotyping.

IEEE/ACM Trans Comput Biol Bioinform. 2019 Jan-Feb;16(1):139-153. doi: 10.1109/TCBB.2018.2849968. Epub 2018 Jun 25.

3

Text Mining and Automation for Processing of Patient Referrals.

Appl Clin Inform. 2018 Jan;9(1):232-237. doi: 10.1055/s-0038-1639482. Epub 2018 Mar 28.

4

Electronic problem lists: a thematic analysis of a systematic literature review to identify aspects critical to success.

J Am Med Inform Assoc. 2018 May 1;25(5):603-613. doi: 10.1093/jamia/ocy011.

5

Leveraging Electronic Health Care Record Information to Measure Pressure Ulcer Risk in Veterans With Spinal Cord Injury: A Longitudinal Study Protocol.

JMIR Res Protoc. 2017 Jan 19;6(1):e3. doi: 10.2196/resprot.5948.

6

The Use of Evidence-Based, Problem-Oriented Templates as a Clinical Decision Support in an Inpatient Electronic Health Record System.

Appl Clin Inform. 2016 Aug 17;7(3):790-802. doi: 10.4338/ACI-2015-11-RA-0164.

7

Enabling claims-based decision support through non-interruptive capture of admission diagnoses and provider billing codes.

AMIA Annu Symp Proc. 2014 Nov 14;2014:1950-9. eCollection 2014.

8

An evaluation of a natural language processing tool for identifying and encoding allergy information in emergency department clinical notes.

AMIA Annu Symp Proc. 2014 Nov 14;2014:580-8. eCollection 2014.

9

Feasibility and implementation of a literature information management system for human papillomavirus in head and neck cancers with imaging.

Cancer Inform. 2014 Oct 13;13(Suppl 1):49-57. doi: 10.4137/CIN.S13884. eCollection 2014.

10

Anatomical entity recognition with a hierarchical framework augmented by external resources.

PLoS One. 2014 Oct 24;9(10):e108396. doi: 10.1371/journal.pone.0108396. eCollection 2014.

本文引用的文献

1

Failure analysis of MetaMap Transfer (MMTx).

Stud Health Technol Inform. 2004;107(Pt 2):763-7.

2

Automated encoding of clinical documents based on natural language processing.

J Am Med Inform Assoc. 2004 Sep-Oct;11(5):392-402. doi: 10.1197/jamia.M1552. Epub 2004 Jun 7.

3

Extracting structured information from free text pathology reports.

AMIA Annu Symp Proc. 2003;2003:584-8.

4

A study of biomedical concept identification: MetaMap vs. people.

AMIA Annu Symp Proc. 2003;2003:529-33.

5

Medical problem and document model for natural language understanding.

AMIA Annu Symp Proc. 2003;2003:455-9.

6

Towards linking patients and clinical information: detecting UMLS concepts in e-mail.

J Biomed Inform. 2003 Aug-Oct;36(4-5):334-41. doi: 10.1016/j.jbi.2003.09.017.

7

Using LOINC to link an EMR to the pertinent paragraph in a structured reference knowledge base.

Proc AMIA Symp. 2002:652-6.

8

A simple algorithm for identifying negated findings and diseases in discharge summaries.

J Biomed Inform. 2001 Oct;34(5):301-10. doi: 10.1006/jbin.2001.1029.

9

Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.

Proc AMIA Symp. 2001:17-21.

10

Text-based discovery in biomedicine: the architecture of the DAD-system.

Proc AMIA Symp. 2000:903-7.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

文档翻译

学术文献翻译模型，支持多种主流文档格式。