Phys. Rev. E 66, 031910 (2002) [9 pages]Recognition of an organism from fragments of its complete genomeReceived 13 May 2002; revised 13 June 2002; published 24 September 2002 This paper considers the problem of matching a fragment to an organism using its complete genome. Our method is based on the probability measure representation of a genome. We first demonstrate that these probability measures can be modeled as recurrent iterated function systems (RIFS) consisting of four contractive similarities. Our hypothesis is that the multifractal characteristics of the probability measure of a complete genome, as captured by the RIFS, is preserved in its reasonably long fragments. We compute the RIFS of fragments of various lengths and random starting points, and compare with that of the original sequence for recognition using the Euclidean distance. A demonstration on five randomly selected organisms supports the above hypothesis. © 2002 The American Physical Society URL:
http://link.aps.org/doi/10.1103/PhysRevE.66.031910
DOI:
10.1103/PhysRevE.66.031910
PACS:
87.14.Gg, 87.10.+e, 47.53.+n
|
