|
|
||||||||
infoRAD |
1 From the Department of Radiology, Children's Hospital and Regional Medical Center, 4800 Sandpoint Way NE, Mailstop CH-69, Seattle, WA 98105 (R.K.T.); and the Department of Radiological Sciences, University of California, Los Angeles (S.G.S., R.M.J.). Presented as an infoRAD exhibit at the 1999 RSNA scientific assembly. Received March 24, 2000; revision requested May 1 and received June 1; accepted June 13. Supported in part by National Institutes of Health Grant No. P01-CA51198. Address correspondence to R.K.T. (e-mail: rtaira@u.washington.edu).
| Abstract |
|---|
|
|
|---|
Index Terms: Computers Computers, diagnostic aid Radiology and radiologists, design of radiological facilities Radiology reporting systems
| Introduction |
|---|
|
|
|---|
Previous studies have demonstrated the potential benefits of structured medical data for medical practice, research, and teaching. In the clinical setting, structured reports can be used to help organize and improve the presentation of the medical record (1,2). For example, if a radiologist is interested only in a given clinical episode, he or she may elect to retrieve only those reports in which the relevant anatomy or findings are described. Accurate extraction of lesion size information from radiology reports can allow a system to automatically construct a growth timeline for an indicator lesion. Accurate extraction of imaging section information (eg, "mass seen on cut #12") can allow default presentation of key images. A clinical expert system can use structured report information for decision support (3,4) and to automatically flag alarm conditions such as pneumothorax (5,6). For research and teaching, structured reports can greatly improve the recall and precision of information retrieval tasks. Only structured data are amenable to advanced causal, spatial, temporal, and evolutionary database modeling techniques that are now being developed in the fields of medical informatics and computer science (79). Complex queries such as "Find 50 cases of spiculated masses greater than 5 cm in diameter in the left upper lobe" can be expected to yield the desired results. The implications for teaching files and data collection for retrospective research studies are obvious. Finally, the most practical means of indexing radiologic image data is by extracting the important findings from the associated dictated report. Thus, structured reports serve as a key for creating medical multimedia digital libraries.
However, automatic structuring of radiology reports is a difficult task for the following reasons:
The task of automatically structuring radiology reports can be divided into three subtasks. The first is creating expert-defined data models of the targeted information contained within the report (1015). The second subtask is creating a medical natural language processor that can map the free-text report dictated by the radiologist into the knowledge structures defined in the first subtask. Several medical natural language processing systems are currently being researched and are in various stages of clinical testing (3,6,1619). Spyns (20) has provided an excellent review article on medical natural language processing systems. The third subtask is mapping the slot values within the output knowledge frames into an understandable controlled vocabulary (eg, Systematized Nomenclature of Medicine [SNOMED], Unified Medical Language System [UMLS]) (16,17,21,22).
We have developed a system that automatically structures the important medical information contained in a radiology free-text document as a formal information model that can be interpreted by a computer program. The input to the system is an American Standard Code for Information Interchange (ASCII) free-text report from a radiologic study. The system imposes no constraints on the dictation style of the radiologist. The system outputs a formalized representation of the important information contained within the free-text document in the form of a structured report (23,24). This includes information on the existence, properties, location, and diagnostic interpretation of findings. For example, consider the following sentence: "A mass is seen in the right lower lobe measuring 5 cm in maximum diameter and is unchanged from the previous examination." The desired output frame for this sample sentence is shown in Figure 1. Note that the frame consists of a head (or topic) and a set of property subframes. Each property subframe contains a value and any expert-defined context modifiers for that property value (eg, time of measurement, precision, dimension). Frame slots can contain either values from a controlled vocabulary or values as expressed in the input text.
|
| System Architecture |
|---|
|
|
|---|
|
Structural Analyzer
The input to the structural analysis module is a free-text report received from a radiology information system. The structural analyzer performs a preprocessing step that standardizes the character representation of the text and removes custom text formatting. We convert all text to International Organization for Standardization (ISO) ASCII standard representation. Reports at our institution, for example, are stored using a DOS-based character set. The effect is seen mainly in the representation of control characters such as line feeds. In older radiology information systems, it is common for the text to be custom formatted for VT-100type terminals and for the text to contain long sequences of blanks between words. We use simple heuristic rules to remove these unnecessary white spaces.
After this preprocessing step, the structural analyzer identifies the various sections of the report. A section is informally defined as follows: If the report is viewed as a document form, the sections correspond to the fields of the form. Often, there are certain major sections that are consistently seen (eg, "Study Description," "Clinical History," "Findings," "Conclusions") as well as context-dependent or minor sections (eg, "Age," "Sedation," "Social History," "Cardiac Status"). Major sections usually begin on a new line. Minor sections may or may not do so, depending on the length of the section body. The structural analyzer uses a simple construction model of a radiology report and a rule-based classifier to detect section boundaries. The model includes a list of common section labels (eg, "Clinical History," "Findings," "Impressions") and common formatting layouts (eg, section order, section length, use of colons).
Once the text within each individual section has been determined, the next step is to identify the individual sentences within each section. The algorithm for determining sentence boundaries uses a maximum entropy classifier (25). The classifier in turn uses some 40 overlapping features to determine whether a period corresponds to an end-of-sentence marker. The system was trained on 6,500 sentences from a test corpus of radiology reports.
Lexical Analyzer
The input to the lexical analyzer is a single sentence, and the lexical analyzer outputs the semantic and syntactic features of each word or phrase in the sentence. In this labeling process, punctuation is identified; dates, numeric measurements, special symbols (eg, TNM cancer staging codes), and proper nouns are recognized with use of special-purpose, finite-state machines; words and phrases are looked up in medical lexicons; and prefixes and suffixes are analyzed. Any words that remain unknown after this process are inserted into a database of unrecognized words together with their frequency of occurrence. A medical language expert is responsible for later assigning semantic and syntactic features to these unknown words.
Our lexicons were developed manually. We maintain both single words and multiple-word phrases within our lexicon. The latter are useful when they represent a concept that cannot be precisely described using only the component words. We do not use existing lexical sources such as the UMLS (24,26) because we believe that they do not currently support the level of knowledge or detail required for a natural language understanding system. (For example, they do not contain a sufficient number of semantic classes to support our statistical parsing and semantic interpretation algorithms. In addition, the coverage of important descriptive adjectives common in radiology is as yet insufficient.) Words and word phrases to be included in the lexicon were gathered from two distinct types of sources: published sources and actual radiology reports. The published sources included indexes from thoracic radiology textbooks (27), radiology review manuals (28), radiology word compilations (29), and published thoracic radiology glossaries (30,31). The terms from these published sources ensure the generality of the concepts covered by the lexicon but do not include all the string representations for these concepts. The collection of words and phrases from actual radiology reports ensures that the system works at a practical level and that most of the string representations for the basic concepts related to the target findings are included. Our current radiology lexicon has been developed for the fields of thoracic radiology and neuroradiology and includes over 5,000 entries with over 200 semantic classes. Each entry in the lexicon is characterized by a concept identifier and a set of features organized into multiple channels. Within each channel, features are organized hierarchically, from general to specific.
Parser
The input to the parser is the semantic and syntactic features of each word or phrase in the sentence, and the parser outputs a dependency diagram showing relations between the words of the sentence (Fig 3). An arc from word A to word B indicates that word A modifies word B. The arcs are left unlabeled, with the modifier semantics left for the following step (semantic interpretation). The parser algorithm makes use of statistical methods and relies on two sets of probabilities: (a) the "affinity" between words A and B, and (b) the "valence" preferences of each word in the sentence. The dependency diagram that globally maximizes these probabilities is the one that is selected.
|
|
In addition to the affinity between pairs of words, our linguistic model considers the valence of individual words. This refers to the preference of words for certain types of complements and provides additional contexts to help determine whether a word A modifiers another word B. For example, a verb "prefers" to have one direct object rather than two. A noun that is the object of a preposition "prefers" not to have a verbal complement. We determine the valence probability of a word in a given parse structure from the number and type of arcs going to and from the word. Valence statistics are tabulated from the same set of annotated training sentences that are used for affinity statistics. In an earlier study, we provided additional details of the parser algorithm (32).
Semantic Interpreter
The input of the semantic interpreter is the dependency diagram of the parser. The semantic interpreter verifies that the arcs of the dependency diagram make medical sense and creates logical relationships that indicate the type of conceptual dependency between pairs of words. Each logical relation has a name (eg, "has-size") and three arguments. The first argument is the head, which is the word being modified (eg, "mass"). The second argument indicates the relationship between the head and the modifying value (eg, "in"), and the third argument is the modifying value itself (eg, "apex"). The logical relations serve to normalize alternate ways of expressing the same concept in free text (eg, "a right apical mass" and "a mass in the right apex" have identical logical relationships). Consider the dependency diagram for the sentence, "The well-demarcated 8-cm mass in the right apex is again noted" (Fig 4). The Table shows the logical relationships that can be inferred from this sentence and that together form a semantic network representing the sentence conceptually. In some cases, an argument value is selected from a predefined set of labels. The second argument of the "Evidence" predicate is chosen from the set ("Observed," "Inferred") and is inferred from the link between "noted" and "mass." The predicate "has-newness" was created with the arc from "again" to "noted." Currently, there are 51 logical relations defined for the Findings section of radiology reports.
|
Frame Constructor
The final processing step is to bundle all the logical relations that were found into appropriate frames (23). Each frame represents knowledge about a specific topic (eg, "mass"), together with descriptions of select properties. There are three classes of topics: (a) abnormal findings, (b) anatomy, and (c) medical procedures. For abnormal findings, there exists 11 types of properties: existence, location, quantity, size, severity, trend, normalcy, external architecture, interpretation, association, and "other." Each property subframe has relevant context modifiers such as time, evidence, certainty, degree, and dimension. The anatomy topic class includes subframes for normalcy, subparts, direction, and distribution modifiers. The medical procedure topic class includes subframes for reason for procedure, technical description, and anatomic site. Figure 5 shows a sample frame output along with intermediate parser results.
|
| Current Status |
|---|
|
|
|---|
We reported on a pilot technical evaluation at the 85th Scientific Assembly and Annual Meeting of the Radiological Society of North America in 1999. We conducted a tenfold cross-validation study and used 470 randomly selected sentences from the domain of thoracic radiology (from computed tomographic [CT] and projectional x-ray reports only). Our evaluation measures included recall (ie, the number of correct answers divided by the number of possible answers) and precision (ie, the number of correct answers divided by the number of reported answers). The results of the evaluation were as follows: For the parser, recall was 87% and precision was 88%. This evaluation included 4,314 possible parser links. For the semantic interpreter, recall was 79% and precision was 87%. The semantic interpreter evaluation included 4,300 possible semantic relations.
Our current clinical area of focus is serial CT reports obtained in pediatric patients with metastatic lung disease. The technical difficulty in this setting is tracking objects across sentence and report boundaries (ie, the linguistics-related problem of coreference resolution).
| Discussion |
|---|
|
|
|---|
The accuracy and hence the utility of a medical natural language processor relies heavily on the number and diversity of training examples, and unfortunately, no large-scale training corpus for radiology reports is currently available. Furthermore, the accuracy of a language system is not uniform in terms of the specific information it extracts. For example, a system trained heavily on reports that contain many measurements may extract size information well but perform poorly in terms of location information. Fortunately, in radiology, we may be able to decide a priori what are the important types of information for a given type of study. Thus, information modeling of radiology reports is an important codevelopment. System development could then proceed by emphasizing training on high-priority information items (eg, size, location, trend).
| Conclusions |
|---|
|
|
|---|
| Footnotes |
|---|
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
C. W. Arnold, A. A. T. Bui, C. Morioka, S. El-Saden, and H. Kangarloo Informatics in Radiology: A Prototype Web-based Reporting System for Onsite-Offsite Clinician Communication RadioGraphics, July 1, 2007; 27(4): 1201 - 1211. [Abstract] [Full Text] [PDF] |
||||
![]() |
A S Lowe and C L Kay Recent developments in CT: a review of the clinical applications and advantages of multidetector computed tomography Imaging, June 1, 2006; 18(2): 62 - 67. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Huang, H. J. Lowe, D. Klein, and R. J. Cucina Improved Identification of Noun Phrases in Clinical Radiology Reports Using a High-Performance Statistical Natural Language Parser Augmented with the UMLS Specialist Lexicon J. Am. Med. Inform. Assoc., May 1, 2005; 12(3): 275 - 285. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. J. Thomas, H. Ouellette, E. F. Halpern, and D. I. Rosenthal Automated Computer-Assisted Categorization of Radiology Reports Am. J. Roentgenol., February 1, 2005; 184(2): 687 - 690. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Roy Sr., S. El Ghali, X. Buy, V. Lindner, H. Lang, C. Saussine, and D. Jacqmin Significance of the Pseudocapsule on MRI of Renal Neoplasms and Its Potential Application for Local Staging: A Retrospective Study Am. J. Roentgenol., January 1, 2005; 184(1): 113 - 120. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. B. Ergen, H. K. Hussain, E. M. Caoili, M. Korobkin, R. C. Carlos, W. J. Weadock, T. D. Johnson, R. Shah, S. Hayasaka, and I. R. Francis MRI for Preoperative Staging of Renal Cell Carcinoma Using the 1997 TNM Classification: Comparison with Surgical and Pathologic Staging Am. J. Roentgenol., January 1, 2004; 182(1): 217 - 225. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. Sinha and H. Kangarloo Principal Component Analysis for Content-based Image Retrieval RadioGraphics, September 1, 2002; 22(5): 1271 - 1289. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| RADIOGRAPHICS | RADIOLOGY | RSNA JOURNALS ONLINE |