Languages and Grammars - Infobiotics: Information in Biotic Systems

Information Technology Reference

In-Depth Information

REC

⊂

RE

.

The inclusion of the class CS

= L 1 in the class RE is a consequence of the monotony

condition for the grammars of type 1. Moreover, the existence of a language of REC

which does not belong to CS can be obtained by a kind of reasoning going back to

Cantor and Russell ( the diagonal argument ).

It can be shown that any rule of type 1 is equivalent, from the generative point

of view, to a set of rules having the following context-sensitive form, where X is a

non-terminal symbol and

α , β , γ

are strings of the alphabet A with

γ = λ

:

α

X

β → αγβ .

Languages generated by grammars with context-sensitive rules are also called

context-sensitive and their class is denoted by CS . Therefore, CS

= L 1 . Rules of

type 2 are called context-free and languages generated by grammars of type 2 are

also denoted by CF . Therefore, CF

= L 2 .

In the next section we show that

L 3 coincides with the class REG of regular

languages obtainable from finite languages by applying the language operations of

concatenation, sum, and Kleene star.

If X

,

Y are non-terminal symbols and a is terminal symbol, then X

→

aY is a

right-linear rule , while X

→

Ya is a left-linear rule ,and X

→

a is at same time

right-linear and left-linear.

Grammars with both types of rules right-linear and left-linear generate the class

LIN of linear languages which is strictly included in CF and which strictly includes

REG .

In conclusion, Chomsky hierarchy can be reformulated and extended in the fol-

lowing way:

FIN

⊂

REG

⊂

LIN

⊂

CF

⊂

CS

⊂

REC

⊂

RE

.

Two important aspects of grammatical generation which are interrelated are cru-

cial: i) rules are applied without no strategy, ii) terminal symbols are the only mech-

anism for discriminating, among the strings generated from S , those which belong

to the language

L(

)

. It can be shown that if we do not use terminal symbols,

then the generation power of grammars is reduced. In this way, the terminal/non-

terminal distinction plays the role of integrating the rules toward the realization of

forms which are specific of a given language. In other words, only string genera-

tions where rules are applied in a “correct” way are able to “terminalize”. The other

generations can be viewed as unsuccessful trials in a specific process of string gen-

eration.

Natural languages, programming languages, and strings representing linear forms

that occur in natural processes require grammars having a complexity between

context-free and context-sensitive grammars.

G

Infobiotics: Information in Biotic Systems

Search WWH ::

Custom Search

Home