is to combine the selection schemes with elementary statistical operations. We have
shown that the latter can easily be implemented with awk.
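As an illustration of combining a selection step with an elementary statistic, the
following is a minimal sketch, not code from the preceding sections. The function name
word_freq and its two arguments (a regular expression and a text file) are assumptions
made for this example. It selects the lines that match the pattern and then counts how
often each whitespace-separated word occurs in them.

```shell
#!/bin/sh
# word_freq PATTERN FILE  (hypothetical helper, for illustration only)
# Selection: keep only lines matching PATTERN.
# Statistic: count occurrences of each lower-cased word in the selected lines.
word_freq() {
    awk -v pat="$1" '
        $0 ~ pat {                          # selection step
            for (i = 1; i <= NF; i++)       # every whitespace-separated field
                count[tolower($i)]++
        }
        END {
            for (w in count)                # awk iterates in unspecified order
                print count[w], w
        }
    ' "$2" | sort -k1,1nr -k2,2             # most frequent first, ties by word
}
```

A call such as word_freq '[Tt]he' corpus.txt (corpus.txt being a placeholder file name)
would list the word counts for all lines containing "the" or "The". Punctuation is not
stripped here; a preceding sed step would be one way to do that.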
A third class of applications of sed and awk is lexical-etymological analysis. Using
sed and awk, dictionaries of related languages can be compared and roots of words
determined through rule-based and statistical analysis.
Various selection schemes can easily be formulated and implemented using set
and vector operations on files. We have shown the implementation of set union, set
complement, vector addition, and other such operations.
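The operations named above can be sketched in a few lines of awk. This is a minimal
sketch under stated assumptions, not necessarily the implementation given earlier: a
set is assumed to be a file with one element per line, a vector a file with one number
per line, and "complement" is taken to mean the elements of the second file that do not
occur in the first. The function names set_union, set_complement and vector_add are
hypothetical.

```shell
#!/bin/sh
# Hypothetical helpers; each takes two file names as arguments.

# Set union: print each distinct line the first time it appears in either file.
set_union() {
    awk '!seen[$0]++' "$1" "$2"
}

# Set complement: lines of the second file that do not occur in the first.
# NR == FNR holds only while the first file is being read
# (this idiom assumes the first file is not empty).
set_complement() {
    awk 'NR == FNR { in_a[$0] = 1; next } !($0 in in_a)' "$1" "$2"
}

# Vector addition: add the numbers of both files line by line.
vector_add() {
    awk 'NR == FNR { v[FNR] = $1; next } { print v[FNR] + $1 }' "$1" "$2"
}
```

For example, set_complement stopwords.txt words.txt (both file names are placeholders)
would print the entries of words.txt that are not listed in stopwords.txt, which is one
simple form of a selection scheme.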
Finally, all of the above shows that sed and awk are ideally suited for the development
of prototype programs in certain areas of language analysis. One saves the time needed
to format the text source into a suitable database for certain types of programming
languages such as Prolog. One also saves the time needed to compile and otherwise
handle C, which is required if one does the analysis with lex and yacc. This is
particularly efficient if the developed program runs only a few times.
Disclaimer
The authors do not accept responsibility for any line of code or any programming
method presented in this work. There is absolutely no guarantee that these methods
are reliable or even function in any sense. Responsibility for the use of the code and
methods presented in this work lies solely in the domain of the applier/user.