Biomedical Engineering Reference
In-Depth Information
are available (MAXHOM [
44
], CLUSTALW [
47
], T-Cofee [
29
]andMUSCLE[
10
]
are currently the most widely used) and none of them is globally accepted as a
standard.
Few years ago, the procedure described above for building MSA was almost
standard and time-consuming; thus, during the construction and tuning of new
prediction methods most of the researchers used the homology-derived secondary
structure of proteins (HSSP) precompiled multiple sequence alignments generated
with the MAXHOM software. Currently, a faster and more accurate method for
the construction of reliable sequence profiles is the adoption of the position spe-
cific iterative (PSI) feature in BLAST [
2
]. In PSI-BLAST a sequence profile and
a position-specific scoring matrix (PSSM) are automatically constructed from a
pseudo-multiple alignment of the highest scoring hits in an initial BLAST search.
The PSSM is generated by calculating position-specific scores for each position
in the alignment. Highly conserved positions receive high scores and weakly con-
served positions receive scores near zero. The profile is used to perform a further
BLAST search and the current profile is refined according to the outcomes of the
new search. This iterative procedure is performed until the retrieved sequences re-
main constant or a fixed number of iterations are achieved. In [
21
], the prediction
accuracy of secondary structure was improved by using directly the PSI-BLAST
PSSM to feed a neural network system.
2.4
Secondary Structure Prediction
In biochemistry and structural biology, the protein secondary structure refers to
the three-dimensional shape of consecutive residue segments. The most common
secondary structure elements are alpha-helices and beta-sheets. The formation of
secondary structure elements is mostly guided by local inter-residue interactions
mediated by hydrogen bonds. For example, an alpha-helix is formed when hydro-
gen bonds occur regularly between positions i and i
C
4 in a protein segment. When
hydrogen bonds occur between positions i and i
C
3,thena3
10
helix is formed. A
beta-sheet is formed when two strands are joined by hydrogen bonds involving alter-
nating residues on each participating strand. In the 1950s, Pauling correctly guessed
the formation of helices and strands [
31
,
32
], before any protein structure had been
determined experimentally.
There are several methods for defining protein secondary structure elements. The
dictionary of protein secondary structure (DSSP) method [
23
] is actually considered
the de facto standard for secondary structure definition. The DSSP defines eight
types of secondary structure elements, based on hydrogen-bonding patterns as those
initially proposed by Pauling (Fig.
2.2
):
G
D
3-turn helix (3
10
helix). Min length three residues.
H
D
4-turn helix (alpha helix). Min length four residues.
I
D
5-turn helix (pi helix). Min length five residues.
T
D
hydrogen bonded turn (3, 4 or 5 turn).
Search WWH ::
Custom Search