Meystre Stéphane M, Haug Peter J
Department of Medical Informatics, University of Utah, Salt Lake City, USA.
AMIA Annu Symp Proc. 2005;2005:525-9.
To help maintain a complete, accurate and timely Problem List, we are developing a system to automatically retrieve medical problems from free-text documents. This system uses Natural Language Processing to analyze all electronic narrative text documents in a patient's record. Here we evaluate and compare 3 different applications of NLP technology in our system: the first using MMTx (MetaMap Transfer) with a negation detection algorithm (NegEx), the second using an alpha version of a locally developed NLP application called MPLUS2, and the third using keyword searching. They were adapted and trained to extract medical problems from a set of 80 problems of diagnosis type. The version using MMTx and NegEx was improved by adding some disambiguation and modifying the negation detection algorithm, and these modifications significantly improved recall and precision. The different versions of the NLP module were compared, and showed the following recall / precision results: standard MMTx with NegEx version 0.775 / 0.398; improved MMTx with NegEx version 0.892 / 0.753; MPLUS2 version 0.693 / 0.402; and keyword searching version 0.575 / 0.807. Average results for the reviewers were a recall of 0.788 and a precision of 0.912.
为了帮助维护一个完整、准确且及时的问题列表,我们正在开发一个系统,用于从自由文本文件中自动检索医疗问题。该系统使用自然语言处理技术来分析患者记录中的所有电子叙述文本文件。在此,我们评估并比较了自然语言处理技术在我们系统中的3种不同应用:第一种使用带有否定检测算法(NegEx)的MMTx(MetaMap Transfer),第二种使用名为MPLUS2的本地开发的自然语言处理应用程序的alpha版本,第三种使用关键词搜索。它们经过调整和训练,以从一组80个诊断类型的问题中提取医疗问题。使用MMTx和NegEx的版本通过添加一些消歧和修改否定检测算法得到了改进,这些修改显著提高了召回率和精确率。对自然语言处理模块的不同版本进行了比较,结果显示了以下召回率/精确率:标准MMTx与NegEx版本为0.775 / 0.398;改进后的MMTx与NegEx版本为0.892 / 0.753;MPLUS2版本为0.693 / 0.402;关键词搜索版本为0.575 / 0.807。评审人员的平均结果是召回率为0.788,精确率为0.912。