Section Heading Recognition in Electronic Health Records Using Conditional Random Fields - Technologies and Applications of Artificial Intelligence

Information Technology Reference

In-Depth Information

Table 2. Performance comparison among different methods

DataSet

Config.

P (%)

R (%)

F (%)

Dictionary-based (SecTag) 38.09 67.44 48.69

Dictionary-based(Training) 47.24 98.88 63.93

Dictionary-based(SecTag+Training) 46.05 93.65 61.74

CRF-based

Set2

(Develop. Set)

94.47

91.87

93.15

CRF-based + Layout Features

94.67 94.15 94.41

Dictionary-based (SecTag)

38.43

69.1

49.39

Dictionary-based(Training)

45.9

95

61.89

Test

Dictionary-based(SecTag+Training)

44.56

89.28

59.45

CRF-based

94.33

90.74

92.5

CRF-based + Layout Features

95.45

92.48

93.94

4

Discussion

As shown in the previous section, CRF-based methods evidently outperformed the

dictionary-based approach. The diversity of section heading names of EHRs is a key

issue that resulted in the huge performance gap between CRF- and dictionary-based

methods. A physician can combine any section heading names in an EHR to form a

new section, or insert any supplemental information in a section heading. For exam-

ple, the bold texts in the two sections “Meds ( confirmed with patient )” and “DATA

( 08/25/61 ):”. These cases cannot be dealt by the dictionary-based method, since there

are no dictionaries that can cover all of these variations. By contrast, the CRF model

is capable of identifying these names, because the sequential labelling formulation

can model the dependency between tokens. In addition, the results showed that the

inclusion of the terminology from SecTag resulted in a decreased precision. This is

caused by the various section headings of different granular levels within the SecTag

content. For example, it includes terms like “toenail exam” and “muscle tone exam”,

which usually does not belong to the topmost section headings.

On the other hand, the results of CRF-based section recognition are not completely

flawless. A comparison of the “CRF-based” configuration with the “CRF-

based+Layout feature” indicates that adding the layout feature enables the CRF model

to recognize section headings that did not appear in the training set. For example,

section headings such as “HCP/FAMILY CONTACT”, “INDICATIONS FOR TPN”,

“Allergies or adverse reactions” “Course on floor” and “Oncology CONSULTATION

NOTE” were not present in the training set, but with their clear layout, the model with

our layout features is able to recognize them.

Finally, this work conducted error analysis on the predicted results of our best con-

figuration “CRF-based+Layout features”, and categorized the errors into two catego-

ries: false negative (FN) and FP cases. The following subsections discuss them

respectively.

Technologies and Applications of Artificial Intelligence

Search WWH ::

Custom Search

Home