Biomedical Engineering Reference
In-Depth Information
Hplot
Mplot
1.0
1.0
0.8
0.8
0.6
0.6
0.4
0.4
0.2
0.2
0.0
0.0
0
10
20
30
40
50
60
70
0
10
20
30
40
50
60
70
Nucleotide Position
Nucleotide Position
Ala corresponding to those presented in Fig. 1, where
max ErrVal =10 9 (thick gray lines) and fix ErrVal =10 9 (thick dotted darker gray lines) have
been chosen for generating the disturbances
Fig. 2. Sampling results for E.coli tRNA
3
Reducing the Preprocessing Time
According to the previous discussion, the proclaimed aim of this section is to lower
the
( n 2 ) ,such
that the preprocessing has the same worst-case time requirements as the subsequent
sampling process (for constructing a constant number of random secondary structure of
size n ).
( n 3 ) time complexity for preliminary inside-outside calculations to
O
O
3.1
Basic Idea
The main idea for reaching this time complexity reduction by a factor n in the worst-
case is actually quite simple: Instead of deriving the inside values α X ( i,j ) (and the
corresponding outside probabilities β X ( i,j ) ), X
∈I G s , for any combination of start
position i and end position j , 1
n , we abstract from the actual position of sub-
word R i,j = r i ...r j in the input sequence and consider only its length d =
i,j
|
r i ...r j |
.
( n 2 ) values α X ( i,j ) (and
Thus, for any X
∈I G s , we do not need to calculate
O
β X ( i,j ) )for 1
n .
However, the problem with this approach is that distance d alone may be associated
with any of the strings in
i,j
n , but only
O
( n ) values α X ( d ) (and β X ( d ) )for 0
d
, i.e. without using positions i and
j we are inevitably forced to additionally abstract from the actual input sequence r .
Note that it is also possible to combine both alternatives, that is we can first use
the traditional algorithms to calculate exact values α X ( i,j ) (and β X ( i,j ) ) within a
window of fixed size W exact ,i.e.for j
{
r i ...r j |
j
i +1= d
}
i +1
W exact (and j
i +1
n
W exact ), and afterwards derive the remaining values for W exact <d
n (and 0
d<n
W exact ) in an approximate fashion by employing the time-reduced variant
for obtaining α X ( d ) (and β X ( d ) ) for each X
∈I G s .Since W exact is constant, this
effectively yields an improvement in the time complexity of the corresponding complete
inside computation, which is then given by
( n 2 ·
W exact ) .However,evenforfix
W exact the time requirements for such a mixed outside computation are
O
( n 3 ) .
O
3.2
Approximation of Emission Probabilities
Due to the unavoidable abstraction from sequence, we have to determine some approx-
imated terms for the emissions of unpaired bases and base pairs, respectively, that
Search WWH ::




Custom Search