Hostname: page-component-586b7cd67f-dsjbd Total loading time: 0 Render date: 2024-11-24T09:04:35.383Z Has data issue: false hasContentIssue false

Cascaded multiple classifiers for secondary structure prediction

Published online by Cambridge University Press:  01 June 2000

MOHAMMED OUALI
Affiliation:
Department of Computer Science, University of Wales, Aberystwyth Penglais, Aberystwyth, Ceredigion SY23 3DB, Wales, United Kingdom
ROSS D. KING
Affiliation:
Department of Computer Science, University of Wales, Aberystwyth Penglais, Aberystwyth, Ceredigion SY23 3DB, Wales, United Kingdom
Get access

Abstract

We describe a new classifier for protein secondary structure prediction that is formed by cascading together different types of classifiers using neural networks and linear discrimination. The new classifier achieves an accuracy of 76.7% (assessed by a rigorous full Jack-knife procedure) on a new nonredundant dataset of 496 nonhomologous sequences (obtained from G.J. Barton and J.A. Cuff). This database was especially designed to train and test protein secondary structure prediction methods, and it uses a more stringent definition of homologous sequence than in previous studies. We show that it is possible to design classifiers that can highly discriminate the three classes (H, E, C) with an accuracy of up to 78% for β-strands, using only a local window and resampling techniques. This indicates that the importance of long-range interactions for the prediction of β-strands has been probably previously overestimated.

Type
Research Article
Copyright
2000 The Protein Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)