Information Technology Reference
In-Depth Information
Fig. 2.52
The repeat-sharing whole network of Escherichia coli (repeat length 150)
Fig. 2.53
A portion of the E. coli network of Fig. 2.52
There are several possible lines of development:
1. A systematic analysis of other genome sequences (for example of all human
chromosomes) should give more arguments and hints for biological evaluations
of the genomic indexes;
2. The systematic analysis of repeat sharing networks, based on specific dictionar-
ies, could reveal/confirm important functional gene organization explaining the
semantic roles of repeats;
3. Approaches based on genomic dictionaries could be exported to other kinds of
genomic data, for example those obtained by means of RNA-Seq methodology;
4. Genomic dictionaries of
k
-mers for
k
12 are very often computationally very
hard to be computed. Therefore, specific algorithms and data representations for
“long”
k
-mers could be developed;
>