Phylogenetic Tree Reconstruction: Geometric Approaches - Mathematical Concepts and Methods in Modern Biology

Biology Reference

In-Depth Information

their representative sequences). Suppose D

D T ,ω

for an

edge-weighted phy-

logenetic X -tree T (with

n ). Then one can phrase the problem of finding a

( n edge-weighted) phylogenetic tree T that “best fits” the observed data D on X as one

of finding a pair

is as small as possible. Equivalently,

one may take a least-squares approach , since minimizing

(

,ω) ∈ T n so that

δ(

D T ,ω )

δ(

T T ,ω )

is the same as

2 . When there is an exact fit,

minimizing

1 i < j n (

(

) −

d T ,ω (

))

δ =

To summarize:

For a set X

, |

n , each positive edge-weighted phylogenetic X -tree

(

,ω)

For a fixed ordering of reading off upper diagonal elements in the matrix D T ,ω

one obtains a vector

givesrisetoatreemetric D T ,ω

m m

v D , T ,ω ∈ R

There are one-to-one correspondences between

- pairs

∈ T n and positive edge-weightings,

- the n by n dissimilarity maps D that are tree metrics D T ,ω , and

- the vectors

(

,ω) ∈ T n of trees T

m .

v D , T ,ω ∈ R

m , and match-

ing such an edge-weighted tree to a dissimilarity map D arising from evolutionary

distances on sequences corresponding to a set X of species (or genes, etc.) becomes

a problem of minimizing distances of the vectors

In this way, edge-weighted phylogenetic X -trees become points in

m .

v D , T ,ω

and

v D in

Regarding positively (or nonnegatively) edge-weighted trees

(

,ω)

with T

∈ T n

m , the goal is, when presented with a point D

m ,

as a subset of points D T ,ω ∈ R

∈ R

m as is possible.

to seek one of these points D T ,ω

as close to D in

In this way, a fundamental problem of biology is turned into a geometric problem.

In particular, one wants to understand better the set

={ (

,ω) |

= (

) ∈

T n

→ R + }

m .Theset

T n ,ω :

regarded as a set of points

⊂ R

T n is often called

T n

“tree space,” though there is one for each n .

Example 10.6.

3, then there is only one tree topology, and there is only

one (unrooted binary) phylogenetic tree T

If n

∈ T 3 , but infinitely many edge-weightings

for any choice of T . More precisely, for each such T , there are three edges, and

for each edge e

R + , independent of the other

,ω(

)

can take all positive real values

edges. The edge-weighting

on T is completely described by the ordered triple

(ω(

), ω(

))

. In this way, geometrically, the significance of a choice of edge-

3 of

R +

weighting

for T is that

describes a point in the positive orthant

associated to T . Taking all possible positive weightings on

on T corresponds pre-

R + 3

∈ T 4 , there are

cisely to the full positive orthant

for T .For n

4, for any T

(

2 n

−

) = (

−

) =

5 edges and

(

2 n

−

) !! = (

−

) !! =

3 phylogenetic X -trees

R + 5 . (For those not

so familiar with high-dimensional geometry, this is analogous to the way in which,

for every point in time, there is a three-dimensional copy of space at that point in time,

only here the finitely many trees T

∈ T 4 . The pairs

(

,ω)

correspond to points in three copies of

∈ T n play the role of selecting out just finitely

Mathematical Concepts and Methods in Modern Biology

Search WWH ::

Custom Search

Home