Information Technology Reference
In-Depth Information
The online and offline databases used are the IAM-OnDB [31] and the IAM-
DB [32] respectively. Note that these do not correspond to the same handwriting
samples: the IAM-OnDB was acquired from a whiteboard, while the IAM-DB
consists of scanned images of handwritten forms. 1
2.1 General Processing Steps
A recognition system for unconstrained Roman script is usually divided into con-
secutive units which iteratively process the handwritten input data to finally obtain
the transcription. The main units are illustrated in Fig. 1 and summarized in this
section. Certainly, there are differences between offline and online processing, but
the principles are the same. Only the methodology for performing the individual
steps differs.
First, preprocessing steps are applied to reduce noise in the raw data. The input
is raw handwritten data and the output usually consists of extracted text lines. The
amount of effort that need to be invested into the preprocessing depends on the
given data. If the data have been acquired from a system that does not produce any
noise and only single words have been recorded, there is nothing to do in this step.
But usually the data contains noise which need to be removed to improve the qual-
ity of the handwriting, e.g., by means of image enhancement. The offline images
are furthermore binarized and the online data, which usually contain noisy points
and gaps within strokes, is processed with some heuristics to recover from these
artifacts. These operations are described in Ref. [33]. The cleaned text data is then
automatically divided into lines using some simple heuristics.
Next, the data is normalized, i.e., it is attempted to remove writer-specific cha-
racteristics of the handwriting to make writings from different authors looking
more similar to each other. This is a very important step in any handwriting rec-
ognition system, because the writing styles of the writers differ with respect to
skew, slant, height, and width of the characters. In the literature there is no stan-
dard way of normalizing the data, but many systems use similar techniques. First,
the text line is corrected in regard to its skew, i.e., it is rotated, so that the baseline
is parallel to the x-axis. Then, slant correction is performed so that the slant be-
comes upright. The next important step is the computation of the baseline and the
corpus line. These two lines divide the text into three areas: the upper area, which
mainly contains the ascenders of the letters; the middle area, where the corpus of
the letters is present; and the lower area with the descenders of some letters. These
three areas are normalized to predefined heights. Often, some additional normali-
zation steps are performed, depending on the domain. In offline recognition, thin-
ning and binarization may be applied. In online recognition the delayed strokes,
e.g., the crossing of a “t” or the dot of an “i”, are usually removed, and equidistant
resampling is applied.
1 The databases and benchmark tasks are available on
http://www.iam.unibe.ch/fki/databases
Search WWH ::




Custom Search