Information Technology Reference
In-Depth Information
process, an element < frames > is added to the XML representation of the sentence,
containing information about the frame. Excerpts of the XML code are shown in
Figure 4.13.
Risk
Evidence
Manner
Cause
Find
Symptom
Loc
NP
S
PP
AP
PP
,
Unregelmässigkeiten
eine
nicht
mehr
kontinuierliche
im
die
auf
Spannugssteuerung
Wickelkopfbereich
hindeuten
irregularities
which
to
one
not
anymore
continuous
steering of voltage
in
winding's head area
point
Fig. 4.12. Annotation with roles with the Salsa tool.
<frames>
<frame name="Evidence" id="sentences._108__f1">
<target><fenode idref="sentences._108_31"/></target>
<fe name="Symptom" id="sentences._108_f1_e1">
<fenode idref="sentences._108_22"/>
</fe>
<fe name="Cause" id="sentences._108__f1_e2">
<fenode idref="sentences._108_509"/>
</fe>
<fe name="Loc" id="sentences._108__f1_e5">
<fenode idref="sentences._108_510"/>
</fe>
...
Fig. 4.13. XML Representation of an annotated frame.
4.4.7 Active Learning
Research in IE has indicated that using an active learning approach for acquiring
labels from a human annotator has advantages over other approaches of selecting
instances for labeling [16]. In our learning framework, we have also implemented an
active learning approach. The possibilities for designing an active learning strategy
are manifold; the one we have implemented uses a committee-based classification
scheme that is steered by corpus statistics. The strategy consists of the following
steps:
Search WWH ::




Custom Search