corner
corner

Phys. Rev. E 70, 056135 (2004) [5 pages]

Euclidean distance between syntactically linked words

Download: PDF (61 kB) Buy this article Export: BibTeX or EndNote (RIS)

Ramon Ferrer i Cancho*
ICREA-Complex Systems Laboratory, Universitat Pompeu Fabra, Dr. Aiguader 80, 08003 Barcelona, Spain and INFM udR Roma 1, Dipartimento di Fisica, Università La Sapienza, Piazzale A. Moro 5, 00185 Roma, Italy

Received 26 April 2004; published 30 November 2004

We study the Euclidean distance between syntactically linked words in sentences. The average distance is significantly small and is a very slowly growing function of sentence length. We consider two nonexcluding hypotheses: (a) the average distance is minimized and (b) the average distance is constrained. Support for (a) comes from the significantly small average distance real sentences achieve. The strength of the minimization hypothesis decreases with the length of the sentence. Support for (b) comes from the very slow growth of the average distance versus sentence length. Furthermore, (b) predicts, under ideal conditions, an exponential distribution of the distance between linked words, a trend that can be identified in real sentences.

© 2004 The American Physical Society

URL:
http://link.aps.org/doi/10.1103/PhysRevE.70.056135
DOI:
10.1103/PhysRevE.70.056135
PACS:
89.75.−k, 89.20.−a

*Electronic address: ramon@pil.phys.uniroma1.it