METHODS AND TECHNIQUES OF COMPLEX SYSTEMS SCIENCE: AN OVERVIEW - Complex Systems Science in Biomedicine

Biomedical Engineering Reference

In-Depth Information

number of computational steps the minimal program for x must execute. For

incompressible data, the minimal program is print ( x ), so L ( x ) | x |. For peri-

odic data, the minimal program cycles over printing out one period over and

over, so L ( x ) | x | again. For some compressible data, however, the minimal

program must do nontrivial computations, which are time-consuming. Thus, to

produce the second quadrillion digits of Q, the minimal program is one that cal-

culates the digits, and this takes considerably more time than reading them out

of a list. Thus, Q is deep, while random or periodic data are shallow.

While logical depth is a clever and appealing idea, it suffers from a number

of drawbacks. First, real data are not, so far as we know, actually produced by

running their minimal programs, 24 and the run-time of that program has no

known physical significance, and that's not for lack of attempts to find one

(182). Second, and perhaps more decisively, there is still no procedure for find-

ing the minimal program.

8.2.2.

Algorithmic Statistics

Perhaps the most important modification of the Kolmogorov complexity is

that proposed by Gács, Tromp and Vitanyi (183), under the label of "algorithmic

statistics." Observe that, when speaking of the minimal program for x , I said

nothing about the inputs to the program; these are to be built in to the code. It is

this which accounts for the length of the programs needed to generate random

sequences: almost all of the length of print ( x ) comes from x , not print ().

This suggests splitting the minimal program into two components, a "model"

part, the program properly speaking, and a "data" part, the inputs to the program.

We want to put all the regularities in x into the model, and all the arbitrary, noisy

parts of x into the inputs. Just as in probability theory a "statistic" is a function of

the data that summarizes the information they convey, Gács et al. regard the

model part of the program as an algorithmic statistic , summarizing its regulari-

ties. To avoid the trivial regularity of print () when possible, they define a no-

tion of a sufficient algorithmic statistic, based on the idea that x should be in

some sense a typical output of the model (see their paper for details). They then

define the complexity of x , or, as they prefer to call it, the sophistication , as the

length of the shortest sufficient algorithmic statistic.

Like logical depth, sophistication is supposed to discount the purely random

part of algorithmic complexity. Unlike logical depth, it stays within the confines

of description in doing so; programs, here, are just a particular, mathematically

tractable, kind of description. Unfortunately, the sophistication is still uncom-

putable, so there is no real way of applying algorithmic statistics.

Search WWH ::

Custom Search

Home