In the main process window, ensure that both the exa and wor ports on the Process Documents operator are connected to res ports, as shown in Figure 12-16.
Figure 12-16. The Federalist Papers text mining model.
The exa port will generate a tab in results perspective showing the words (tokens) from our documents as attributes, with each attribute's relative strength in each of the four documents indicated by a decimal coefficient. The wor port will create a tab in results perspective that shows the words as tokens, with the total number of occurrences and the number of documents each token appeared in. Although we will do a bit more modeling in this chapter's example, at this point we will go ahead and proceed to…
EVALUATION
Let's run our model again. We can see the WordList tab in results perspective, showing our tokens
and their frequencies in the input documents.
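
To make the two outputs more concrete, the following Python sketch mimics what the word list and the example set contain. It is only an illustration outside of RapidMiner: the toy documents, the `tokenize` helper, and the variable names are hypothetical, and the weighting used for the example set (plain term frequency) is an assumption, since the text only says that each attribute carries a decimal coefficient.

```python
from collections import Counter
import re

# Hypothetical toy documents standing in for the input texts.
docs = {
    "doc1": "the people of the union",
    "doc2": "the union and the states",
}

def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())

# Per-document token counts.
counts = {name: Counter(tokenize(text)) for name, text in docs.items()}

# Word list (like the wor output): for each token, the total number of
# occurrences and the number of documents it appeared in.
word_list = {}
for doc_counts in counts.values():
    for token, n in doc_counts.items():
        total, in_docs = word_list.get(token, (0, 0))
        word_list[token] = (total + n, in_docs + 1)

# Example set (like the exa output): one row per document, one attribute
# per token. ASSUMPTION: weight = count / document length (term frequency);
# the actual weighting scheme is not stated in the text.
vocabulary = sorted(word_list)
example_set = {
    name: {t: doc_counts[t] / sum(doc_counts.values()) for t in vocabulary}
    for name, doc_counts in counts.items()
}

for token in vocabulary:
    total, in_docs = word_list[token]
    print(f"{token}: total={total}, documents={in_docs}")
```

Tokens that are absent from a document simply receive a weight of zero in that document's row, which is why every document shares the same set of attributes.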