corner
corner

Phys. Rev. E 75, 056102 (2007) [6 pages]

Power law exponents characterizing human DNA

Download: PDF (241 kB) Buy this article Export: BibTeX or EndNote (RIS)

A. Provata1,* and Th. Oikonomou1,2
1Institute of Physical Chemistry, National Center for Scientific Research “Demokritos,” 15310 Athens, Greece
2Department of Biological Chemistry, School of Medicine, University of Athens, 11527 Athens, Greece

Received 7 February 2006; revised 9 February 2007; published 3 May 2007

The size distributions of all known coding and noncoding DNA sequences are studied in all human chromosomes. In a unified approach, both introns and intergenic regions are treated as noncoding regions. The distributions of noncoding segments Pnc(S) of size S present long tails Pnc(S)∼S−1−μnc, with exponents μnc ranging between 0.71 (for chromosome 13) and 1.2 (for chromosome 19). On the contrary, the exponential, short-range decay terms dominate in the distributions of coding (exon) segments Pc(S) in all chromosomes. Aiming to address the emergence of these statistical features, minimal, stochastic, mean-field models are proposed, based on randomly aggregating DNA strings with duplication, influx and outflux of genomic segments. These minimal models produce both the short-range statistics in the coding and the observed power law and fractal statistics in the noncoding DNA. The minimal models also demonstrate that although the two systems (coding and noncoding) coexist, alternating on the same linear chain, they act independently: the coding as a closed, equilibrium system and the noncoding as an open, out-of-equilibrium one.

© 2007 The American Physical Society

URL:
http://link.aps.org/doi/10.1103/PhysRevE.75.056102
DOI:
10.1103/PhysRevE.75.056102
PACS:
89.75.Da, 87.14.Gg, 05.40.−a, 87.15.Aa

*Author to whom correspondence should be addressed. Electronic address: aprovata@limnos.chem.demokritos.gr