corner
corner

Phys. Rev. E 70, 021906 (2004) [9 pages]

Finite width model sequence comparison

Download: PDF (474 kB) Buy this article Export: BibTeX or EndNote (RIS)

Nicholas Chia and Ralf Bundschuh*
Department of Physics, Ohio State University, 174 West 18th Street, Columbus, Ohio 43210, USA

Received 17 March 2004; published 17 August 2004

Sequence comparison is a widely used computational technique in modern molecular biology. In spite of the frequent use of sequence comparisons, the important problem of assigning statistical significance to a given degree of similarity is still outstanding. Analytical approaches to filling this gap usually make use of an approximation that neglects certain correlations in the disorder underlying the sequence comparison algorithm. Here, we use the longest common subsequence problem, a prototype sequence comparison problem, to analytically establish that this approximation does make a difference to certain sequence comparison statistics. In the course of establishing this difference we develop a method that can systematically deal with these disorder correlations.

© 2004 The American Physical Society

URL:
http://link.aps.org/doi/10.1103/PhysRevE.70.021906
DOI:
10.1103/PhysRevE.70.021906
PACS:
87.15.Cc, 87.10.+e, 02.50.−r, 05.40.Fb

*Electronic address: bundschuh@mps.ohio-state.edu