PRALINE: A Versatile Multiple Sequence Alignment Toolkit - Multiple Sequence Alignment Methods

Biology Reference

In-Depth Information

Chapter 16

PRALINE: A Versatile Multiple Sequence Alignment Toolkit

Punto Bawono and Jaap Heringa

Abstract

Profile ALIgNmEnt (PRALINE) is a versatile multiple sequence alignment toolkit. In its main alignment

protocol, PRALINE follows the global progressive alignment algorithm. It provides various alignment

optimization strategies to address the different situations that call for protein multiple sequence alignment:

global profile preprocessing, homology-extended alignment, secondary structure-guided alignment, and

transmembrane aware alignment. A number of combinations of these strategies are enabled as well.

PRALINE is accessible via the online server http://www.ibi.vu.nl/programs/PRALINEwww/ . The

server facilitates extensive visualization possibilities aiding the interpretation of alignments generated,

which can be written out in pdf format for publication purposes. PRALINE also allows the sequences in

the alignment to be represented in a dendrogram to show their mutual relationships according to the

alignment. The chapter ends with a discussion of various issues occurring in multiple sequence alignment.

Key words Multiple sequence alignment, Progressive alignment, Sequence preprocessing, Homology-

extended MSA, Secondary structure-guided MSA, Transmembrane-aware protein alignment

1

Introduction

Multiple sequence alignments (MSAs) are pervasive in biology.

They are often used to elucidate conserved and variable regions in

protein or DNA sequences, which can reveal crucial information

regarding the functional and evolutionary relationships between

the aligned sequences. One of the initial breakthroughs in the

field of MSA, which addressed the computational burden asso-

ciated with MSA, was the invention of the progressive alignment

strategy [ 1 ].This strategy builds up an MSA by first constructing an

approximate phylogenetic tree ( guide tree ) for the query sequences

[ 1 , 2 ]. In many methods the guide tree is constructed from the

scores of all-against-all pairwise alignments of the query proteins.

The sequences are then progressively aligned according to the order

specified by the tree. However, an MSA produced using this

method might contain errors due to the so-called greediness of

this algorithm; i.e., alignments affected are not reconsidered any-

more and any match error occurring in the process will be

Search WWH ::

Custom Search

Home