Phylogenetic Tree Reconstruction: Geometric Approaches - Mathematical Concepts and Methods in Modern Biology


... the NJ Algorithm geometrically: the output of the NJ Algorithm on a point $D \in \mathbb{R}^{\binom{n}{2}}$ is a weighted tree with underlying topology $T$ if and only if $D$ is in the union of NJ cones $C_T$.

Example 10.7. For an unrooted binary phylogenetic $X$-tree $T = (V, E)$ on a set $X$ of five leaves, one has $|V| = 8$, and the orders of picking cherries in the NJ Algorithm applied to any dissimilarity map $D \in \mathbb{R}^{\binom{5}{2}}$ will consist of sequences of pairs from $V$. Necessarily, $T$ has two pairs of cherries, say $\{i, j\}$ and $\{k, \ell\}$, one other leaf $b$, and three interior nodes $u, v, w \in V$. In the NJ Algorithm, for any order of picking cherries that yields $T$ (with some weighting) the first cherry picked will be either $\{i, j\}$ or $\{k, \ell\}$. After this, if, say, $u$ is the created ancestor node for $\{i, j\}$, then either $\{k, \ell\}$ is a cherry to be picked, or $\{u, b\}$ is a cherry to be picked. Once either cherry is chosen, the other is completely determined for the next step, since there is only one (binary, unrooted) tree topology for $n = 5$. Thus, for $T \in \mathcal{T}_5$ and cherries $\{i, j\}$ and $\{k, \ell\}$, the NJ cones $C_{T,\sigma}$ are the same for all orders $\sigma$ that start with $\{i, j\}$, whether the second pair picked is $\{k, \ell\}$ or $\{u, b\}$. Switching the order of $\{i, j\}$ and $\{k, \ell\}$ does yield different orders and cones. Thus, for any binary phylogenetic $X$-tree $T$ on 5 leaves, there are two essentially distinct NJ cones which encompass all dissimilarity maps $D$ that output $T$; in [26] these are labelled $C_{ij,b}$ and $C_{k\ell,b}$. Consequently, there are $30 = 15 \cdot 2$ total NJ cones for $n = 5$. For more details, see [26].

Neighbor-joining cones are defined and studied in detail for small trees ($n \le 6$) in [26]. By studying and comparing the volumes of the NJ cones, [26] geometrically formulates results about the behavior of the NJ Algorithm. These include how likely the NJ Algorithm is to return a particular tree topology given a random input vector (dissimilarity map), and the robustness of the NJ Algorithm. More complete information about these NJ cones, including information about “dimensions” of the cones, rays of the cones, faces, and other results, also appears in [27]. In both [27, 23], geometric information is used to gain further insight into a comparison between the NJ Algorithm as a phylogenetic tree reconstruction method and another distance-based method, the balanced minimum evolution (BME) method, a topic we now explore.
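The cherry-picking orders discussed in Example 10.7 can be made concrete with a small computation. The following is a minimal Python sketch, not the implementation used in [26]: it assumes the standard Q-criterion form of neighbor-joining, stops once three nodes remain, and breaks ties by list position. The function name `nj_cherry_order`, the generated ancestor names `u0`, `u1`, and the unit edge weights of the example tree are all assumptions made for illustration.

```python
from itertools import combinations


def nj_cherry_order(D, labels):
    """Return the pairs joined by a basic neighbor-joining run, in order.

    D is a symmetric dict-of-dicts dissimilarity map on `labels`.
    Joining stops once three nodes remain, where the topology is forced.
    """
    # Copy the input so the caller's map is not modified.
    d = {a: {b: D[a][b] for b in labels if b != a} for a in labels}
    nodes = list(labels)
    order = []
    while len(nodes) > 3:
        n = len(nodes)
        # Row sums over the nodes that are still active.
        r = {a: sum(d[a][b] for b in nodes if b != a) for a in nodes}

        def q(pair):
            # Q-criterion value of a candidate pair (smaller is better).
            a, b = pair
            return (n - 2) * d[a][b] - r[a] - r[b]

        # Ties are broken by position in `nodes` (first minimal pair wins).
        a, b = min(combinations(nodes, 2), key=q)
        new = f"u{len(order)}"  # hypothetical name for the created ancestor
        rest = [c for c in nodes if c not in (a, b)]
        d[new] = {}
        for c in rest:
            # Standard NJ distance update from the new node to node c.
            d[new][c] = d[c][new] = 0.5 * (d[a][c] + d[b][c] - d[a][b])
        nodes = rest + [new]
        order.append((a, b))
    return order


# Tree metric of the five-leaf tree with cherries {i, j} and {k, l},
# middle leaf b, and all seven edge weights equal to 1 (an assumed example).
leaves = ["i", "j", "b", "k", "l"]
dist = {("i", "j"): 2, ("k", "l"): 2,
        ("i", "b"): 3, ("j", "b"): 3, ("k", "b"): 3, ("l", "b"): 3,
        ("i", "k"): 4, ("i", "l"): 4, ("j", "k"): 4, ("j", "l"): 4}
D = {a: {} for a in leaves}
for (a, b), value in dist.items():
    D[a][b] = D[b][a] = value

order = nj_cherry_order(D, leaves)
print(order)
```

For this input the sketch joins $\{i, j\}$ first and then $\{b, u_0\}$, where `u0` plays the role of the ancestor $u$ in the example. With four active nodes the Q-criterion assigns equal values to the two complementary pairs, here $\{b, u_0\}$ and $\{k, \ell\}$, so which of them is recorded second depends only on the tie-breaking rule, in line with the observation that both orders lead to the same topology.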

10.4.2 Balanced Minimum Evolution

Recall that for any set $X$ with $|X| = n$, and any edge-weighted phylogenetic $X$-tree $(T, \omega) \in \mathcal{T}_n$, with $T = (V, E)$, there is a naturally associated tree metric $D_{T,\omega}$. One can set the total length of the tree to be the total sum of all the edge weights: $\omega(T) = \sum_{e \in E} \omega(e)$. The minimum evolution principle for tree reconstruction uses the idea of minimizing tree length in order to find the best-fitting tree $(T, \omega) \in \mathcal{T}_n$ to a given dissimilarity map $D \in \mathbb{R}^{\binom{n}{2}}$. Specifically, given a dissimilarity map $D \in \mathbb{R}^{\binom{n}{2}}$ as input, a minimum evolution method seeks to locate as output an edge-weighted tree $(T, \omega)$ so that $D_{T,\omega}(i, j) = D(i, j)$ for all $i, j \in X$, and $\omega(T)$ is as small as possible. Biologically, the minimum evolution principle is driven by the idea that
