Enzyme Genetic Programming - Cellular Computing

Information Technology Reference

In-Depth Information

This new way of representing the parse tree has a number of advantages from

an evolutionary perspective. First, it introduces positional independence. The

position of a component in the array need not have any direct bearing on the

position of the component in the parse tree. This is of benefit to recombination

because position is not conserved where different programs are of different

shapes and sizes—which, most of the time, is the case for GP. Consequently,

the position of a component in a child program will most likely not be its position

in the parent program. In tree-based GP, where context is given by position,

this is a problem. In enzyme GP, where context is independent of position, it is

not a problem.

Second, the linear encoding has a homogenous structure. In a typical parse

tree about half of the nodes are leaf nodes. Consequently, during subtree

crossover, a large proportion of randomly chosen crossover points will occur at

or near the leaf nodes. This, in turn, means that a disproportionately high number

of small subtrees will be targeted during crossover. In practice the selection of

crossover points is biased so that more are chosen nearer the root of the tree.

However, this bias has to be carefully chosen to avoid biasing the shapes and

sizes of solutions explored during evolution. In enzyme GP, by comparison,

there is no need to bias crossover because it will automatically sample subtrees

of all sizes. Furthermore, enzyme GP does not have to target complete subtrees

during recombination. It can, in principle, target any collection of components.

In some cases, components will be connected, such as subtrees, in other cases

they will not. Consequently, crossover can carry out a wide range of behaviors.

However, it is possible that during evolution linkage will develop between

neighboring genes, meaning that crossover is more likely to target connected

groups of components. The more evolved the solutions, therefore, the more

focused the behaviors of crossover could become.

Third, not all the components described within a genome need be expressed

in the developed program. One example of the utility of this is TR crossover,

which allows components to be added to a genome without replacing exist-

ing components. If the new components do not naturally interact with the

existing components, then they will not be used in the program. This is less

disruptive than forcing components to be replaced, which is the only option

with subtree crossover. Redundant components, more generally, can form a

source of backtracking and neutral evolution. Backtracking can be enabled

by saved copies of components that are no longer used in developed pro-

grams. These, and unused copies of current components, are subject to mu-

tation. Because they do not contribute to the phenotype, these mutations are

neutral and have no fitness penalty. Nevertheless, these mutations may be ben-

eficial and result in improvements to existing components. This might lead

to the development of gene families comparable to those found in biological

genomes.

Search WWH ::

Custom Search

Home