Hostname: page-component-5db58dd55d-ggg9q Total loading time: 0 Render date: 2026-06-28T02:00:31.328Z Has data issue: false hasContentIssue false

Natural language processing in an operational clinical information system

Published online by Cambridge University Press:  12 September 2008

C. Friedman
Affiliation:
Queens College of the City University of New YorkComputer Science Department Flushing, New York 11367, USADepartment of Medical Informatics, Columbia-Presbyterian Medical Center
G. Hripcsak
Affiliation:
Department of Medical Informatics, Columbia-Presbyterian Medical Center Department of Medicine, Columbia-Presbyterian Medical Center
W. DuMouchel
Affiliation:
Department of Medical Informatics, Columbia-Presbyterian Medical Center Division of Biostatistics, Columbia-Presbyterian Medical Center
S. B. Johnson
Affiliation:
Department of Medical Informatics, Columbia-Presbyterian Medical CenterDepartment of Medicine, Columbia-Presbyterian Medical Center
P. D. Clayton
Affiliation:
Department of Medical Informatics, Columbia-Presbyterian Medical CenterDepartment of Medicine, Columbia-Presbyterian Medical Center

Abstract

This paper describes a natural language text extraction system, called MEDLEE, that has been applied to the medical domain. The system extracts, structures, and encodes clinical information from textual patient reports. It was integrated with the Clinical Information System (CIS), which was developed at Columbia-Presbyterian Medical Center (CPMC) to help improve patient care. MEDLEE is currently used on a daily basis to routinely process radiological reports of patients at CPMC.

In order to describe how the natural language system was made compatible with the existing CIS, this paper will also discuss engineering issues which involve performance, robustness, and accessibility of the data from the end users' viewpoint.

Also described are the three evaluations that have been performed on the system. The first evaluation was useful primarily for further refinement of the system. The two other evaluations involved an actual clinical application which consisted of retrieving reports that were associated with specified diseases. Automated queries were written by a medical expert based on the structured output forms generated as a result of text processing. The retrievals obtained by the automated system were compared to the retrievals obtained by independent medical experts who read the reports manually to determine whether they were associated with the specified diseases. MEDLEE was shown to perform comparably to the experts. The technique used to perform the last two evaluations was found to be a realistic evaluation technique for a natural language processor.

Information

Type
Articles
Copyright
Copyright © Cambridge University Press 1995

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable