, abbreviations such as found in Mr. ]
233-319 [Smith, and possibly ellipses either within a
sentence … , or at the end of a sentence…]
319-
This approach works for simple sentences but is not successful with more complex
sentences: in the output above, for example, the abbreviation Mr. is treated as the
end of a sentence.
Both regular expressions and the BreakIterator class have limitations. They are useful
for text consisting of relatively simple sentences. However, when the text becomes more
complex, it is better to use the NLP APIs instead, as discussed in the next section.
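To make the limitation concrete, the following is a minimal sketch of sentence splitting with the JDK's BreakIterator. The class name SentenceSplitSketch, the helper splitSentences, and the sample strings are assumptions made for illustration and are not taken from the original listing. It is meant to show the general pattern rather than reproduce the exact code that produced the output above.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

public class SentenceSplitSketch {

    // Hypothetical helper: returns the text between successive sentence
    // boundaries reported by BreakIterator, with surrounding whitespace trimmed.
    static List<String> splitSentences(String text) {
        BreakIterator iterator = BreakIterator.getSentenceInstance(Locale.US);
        iterator.setText(text);

        List<String> sentences = new ArrayList<>();
        int start = iterator.first();
        int end = iterator.next();
        while (end != BreakIterator.DONE) {
            sentences.add(text.substring(start, end).trim());
            start = end;
            end = iterator.next();
        }
        return sentences;
    }

    public static void main(String[] args) {
        // A simple case: two short sentences separated by a period and a space.
        String simple = "It rained. We stayed home.";
        // A harder case: the abbreviation "Mr." is followed by a capitalized word,
        // which a rule-based splitter may mistake for a sentence boundary.
        String harder = "We spoke with Mr. Smith. He agreed.";

        for (String sample : new String[] {simple, harder}) {
            System.out.println("Input: " + sample);
            for (String sentence : splitSentences(sample)) {
                System.out.println("  [" + sentence + "]");
            }
        }
    }
}
```

Running the sketch on your own text is a quick way to see where the rule-based boundaries stop matching what a reader would consider a sentence, which is the point at which a trained sentence detector from an NLP API becomes the better choice.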