1. First, it fixes a regular expression pattern that represents the general
distribution of raw data in the web page (the data is mainly stored within font
tags whose face attribute specifies the proprietary font name).
2. It then searches the file iteratively for the regular expression pattern.
3. Each time a match is found, it retrieves the matched text and appends it to a
string variable.
4. After match retrieval, the string contains HTML tag information along with
the raw text, so it removes all tags and keeps only the raw text.
5. It stores the tag-free raw text in a string and returns it to its invoker
module.
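
The extraction steps above can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the function name extract_raw_text, the font name "ExampleRegionalFont", and the exact regular expression are assumptions, since the text does not give them. It also assumes that font tags are not nested.

```python
import re

# Hypothetical font name; the source does not name the proprietary font.
FONT_FACE = "ExampleRegionalFont"


def extract_raw_text(html: str, font_face: str = FONT_FACE) -> str:
    """Collect the text held in <font face="..."> spans and strip inner tags."""
    # Step 1: fix a pattern for font tags whose face attribute names the font.
    # Assumption: the attribute value is quoted and font tags are not nested.
    pattern = re.compile(
        r'<font\b[^>]*\bface\s*=\s*["\']'
        + re.escape(font_face)
        + r'["\'][^>]*>(?P<body>.*?)</font>',
        re.IGNORECASE | re.DOTALL,
    )

    # Steps 2-3: search the page iteratively and append every match.
    collected = []
    for match in pattern.finditer(html):
        collected.append(match.group("body"))
    joined = "".join(collected)

    # Step 4: the matched bodies may still contain HTML tags; remove them.
    raw_text = re.sub(r"<[^>]+>", "", joined)

    # Step 5: return the tag-free raw text to the caller.
    return raw_text
```

A tag-stripping regular expression of this kind is adequate for simple, well-formed pages; a real HTML parser would be a more robust choice for irregular markup.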
3.6 Fontmapper (String)
This function runs on the extracted raw text, reading it character by character
and writing the corresponding UTF-8 glyphs.
Arguments: This function takes the extracted regional-language raw text as a
string and returns the same text after mapping it from the default proprietary
character set into the Unicode character set.
1. It reads the string character by character and performs the following
operations until the end of the string is reached.
2. For each character read, it calls the Search module with the decimal value of
the proprietary font character as its argument.
3. It stores the decimal values of the UTF-8 glyphs returned by the Search
module in a string variable.
4. After the iteration completes, it returns the font-mapped raw text string to
its invoker module.
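
A minimal sketch of this loop is given below, under several assumptions that the text does not confirm: the returned decimal values are treated as Unicode code points, the Search module is passed in as a callable, and a character with no mapping is copied through unchanged. The names font_mapper and demo_search, and the sample table entries, are hypothetical.

```python
from typing import Callable, List, Optional


def font_mapper(raw_text: str,
                search: Callable[[int], Optional[List[int]]]) -> str:
    """Map proprietary-font text to Unicode, one character at a time."""
    mapped = []
    # Step 1: read character by character until the end of the string.
    for ch in raw_text:
        # Step 2: call the search routine with the character's decimal value.
        code_points = search(ord(ch))
        if code_points is None:
            # Assumption: characters without a mapping are kept as they are.
            mapped.append(ch)
        else:
            # Step 3: store the returned values (one source glyph may map
            # to several code points).
            mapped.append("".join(chr(cp) for cp in code_points))
    # Step 4: return the font-mapped text to the caller.
    return "".join(mapped)


def demo_search(value: int) -> Optional[List[int]]:
    """Hypothetical stand-in for the Search module, with made-up entries."""
    table = {65: [0x0915], 66: [0x0915, 0x094D]}
    return table.get(value)
```

Passing the search routine as an argument keeps the mapping loop independent of how the lookup tables are stored.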
3.7 Search (Proprietaryfontvalue)
This function searches the respective array structure for the character value of
a proprietary font glyph and returns the corresponding UTF-8 font value.
Arguments: This function takes the decimal value of a glyph in the proprietary
font of the regional language and returns the corresponding UTF-8 font values.
1. The module runs a sequential (linear) search on the array structure that
holds the decimal values of the proprietary font glyphs of the regional
language.
2. When the search succeeds, it stores the position at which the hit is found.
3. It returns the decimal entries (possibly more than one) of the UTF-8 font
glyphs found at that position in the corresponding array structure.
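
The lookup can be sketched with two parallel arrays, as below. The array names and their contents are invented for illustration and are not the actual mapping tables. Returning None when no hit is found is also an assumption, since the text does not describe the unsuccessful case.

```python
from typing import List, Optional, Sequence

# Hypothetical sample data: position i in the first array corresponds to
# position i in the second. Real tables would cover the whole font.
PROPRIETARY_VALUES = [65, 66, 67]
UNICODE_VALUES = [[0x0915], [0x0916], [0x0915, 0x094D, 0x0937]]


def search(
    proprietary_value: int,
    proprietary_values: Sequence[int] = PROPRIETARY_VALUES,
    unicode_values: Sequence[List[int]] = UNICODE_VALUES,
) -> Optional[List[int]]:
    """Linear search for a proprietary glyph value in parallel arrays."""
    position = -1
    # Step 1: sequential search over the proprietary glyph values.
    for index, value in enumerate(proprietary_values):
        if value == proprietary_value:
            # Step 2: remember the position where the hit was found.
            position = index
            break
    if position == -1:
        # Assumption: signal "no mapping" with None.
        return None
    # Step 3: return the entries stored at the same position.
    return unicode_values[position]
```

A linear search costs time proportional to the table size on every call; a dictionary keyed by the proprietary value would give the same results with constant-time lookups, though that is a different technique from the one described here.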