Hostname: page-component-78c5997874-4rdpn Total loading time: 0 Render date: 2024-11-15T21:14:59.374Z Has data issue: false hasContentIssue false

Improving the quality of twilight-zone alignments

Published online by Cambridge University Press:  01 August 2000

LUKASZ JAROSZEWSKI
Affiliation:
The Burnham Institute, 10901 North Torrey Pines Road, La Jolla, California 92037
LESZEK RYCHLEWSKI
Affiliation:
International Institute of Molecular and Cell Biology, Trojdena 4, Warszawa 02-109, Poland
ADAM GODZIK
Affiliation:
The Burnham Institute, 10901 North Torrey Pines Road, La Jolla, California 92037
Get access

Abstract

Several recent publications illustrated advantages of using sequence profiles in recognizing distant homologies between proteins. At the same time, the practical usefulness of distant homology recognition depends not only on the sensitivity of the algorithm, but also on the quality of the alignment between a prediction target and the template from the database of known proteins. Here, we study this question for several supersensitive protein algorithms that were previously compared in their recognition sensitivity (Rychlewski et al., 2000). A database of protein pairs with similar structures, but low sequence similarity is used to rate the alignments obtained with several different methods, which included sequence–sequence, sequence–profile, and profile–profile alignment methods. We show that incorporation of evolutionary information encoded in sequence profiles into alignment calculation methods significantly increases the alignment accuracy, bringing them closer to the alignments obtained from structure comparison.

In general, alignment quality is correlated with recognition and alignment score significance. For every alignment method, alignments with statistically significant scores correlate with both correct structural templates and good quality alignments. At the same time, average alignment lengths differ in various methods, making the comparison between them difficult. For instance, the alignments obtained by FFAS, the profile–profile alignment algorithm developed in our group are always longer that the alignments obtained with the PSI-BLAST algorithms. To address this problem, we develop methods to truncate or extend alignments to cover a specified percentage of protein lengths. In most cases, the elongation of the alignment by profile–profile methods is reasonable, adding fragments of similar structure. The examples of erroneous alignment are examined and it is shown that they can be identified based on the model quality.

Type
Research Article
Copyright
© 2000 The Protein Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)