corner
corner

Phys. Rev. E 74, 021913 (2006) [9 pages]

Informational structure of two closely related eukaryotic genomes

Download: PDF (896 kB) Buy this article Export: BibTeX or EndNote (RIS)

Manuel Dehnert1, Werner E. Helm2, and Marc-Thorsten Hütt1
1Computational Systems Biology, School of Engineering and Science, International University Bremen, Campus Ring 1, D-28759 Bremen, Germany
2Mathematics and Science Faculty, University of Applied Sciences, D-64295 Darmstadt, Germany

Received 23 January 2006; published 15 August 2006

Attempts to identify a species on the basis of its DNA sequence on purely statistical grounds have been formulated for more than a decade. The most prominent of such genome signatures relies on neighborhood correlations (i.e., dinucleotide frequencies) and, consequently, attributes species identification to mechanisms operating on the dinucleotide level (e.g., neighbor-dependent mutations). For the examples of Mus musculus and Rattus norvegicus we analyze short- and intermediate-range statistical correlations in DNA sequences. These correlation profiles are computed for all chromosomes of the two species. We find that with increasing range of correlations the capacity to distinguish between the species on the basis of this correlation profile is getting better and requires ever shorter sequence segments for obtaining a full species separation. This finding suggests that distinctive traits within the sequence are situated beyond the level of few nucleotides. The large-scale statistical patterning of DNA sequences on which such genome signatures are based is thus substantially determined by mobile elements (e.g., transposons and retrotransposons). The study and interspecies comparison of such correlation profiles can, therefore, reveal features of retrotransposition, segmental duplications, and other processes of genome evolution.

© 2006 The American Physical Society

URL:
http://link.aps.org/doi/10.1103/PhysRevE.74.021913
DOI:
10.1103/PhysRevE.74.021913
PACS:
87.10.+e, 87.14.Gg, 02.50.−r, 02.50.Ga