Introduction

Richard Durbin; Sean R. Eddy; Anders Krogh; Graeme Mitchison

doi:10.1017/CBO9780511790492.002

1 - Introduction

Published online by Cambridge University Press: 05 September 2012

Anders Krogh and

Richard Durbin: Affiliation:
Sanger Centre, Cambridge
Sean R. Eddy: Affiliation:
Washington University, Missouri
Anders Krogh: Affiliation:
Technical University of Denmark, Lyngby

Book contents

Get access

Summary

Astronomy began when the Babylonians mapped the heavens. Our descendants will certainly not say that biology began with today's genome projects, but they may well recognise that a great acceleration in the accumulation of biological knowledge began in our era. To make sense of this knowledge is a challenge, and will require increased understanding of the biology of cells and organisms. But part of the challenge is simply to organise, classify and parse the immense richness of sequence data. This is more than an abstract task of string parsing, for behind the string of bases or amino acids is the whole complexity of molecular biology. This book is about methods which are in principle capable of capturing some of this complexity, by integrating diverse sources of biological information into clean, general, and tractable probabilistic models for sequence analysis.

Though this book is about computational biology, let us be clear about one thing from the start: the most reliable way to determine a biological molecule's structure or function is by direct experimentation. However, it is far easier to obtain the DNA sequence of the gene corresponding to an RNA or protein than it is to experimentally determine its function or its structure. This provides strong motivation for developing computational methods that can infer biological information from sequence alone. Computational methods have become especially important since the advent of genome projects. The Human Genome Project alone will give us the raw sequences of an estimated 70,000 to 100,000 human genes, only a small fraction of which have been studied experimentally.

Type: Chapter
Information: Biological Sequence Analysis
Probabilistic Models of Proteins and Nucleic Acids
, pp. 1 - 11

DOI: https://doi.org/10.1017/CBO9780511790492.002 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 1998

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Book contents

1 - Introduction

Summary

Access options

Save book to Kindle

Save book to Dropbox

Save book to Google Drive