Do Grammars Minimize Dependency Length?
Article first published online: 27 OCT 2009
DOI: 10.1111/j.1551-6709.2009.01073.x
Copyright © 2009 Cognitive Science Society, Inc.
Additional Information
How to Cite
Gildea, D. and Temperley, D. (2010), Do Grammars Minimize Dependency Length?. Cognitive Science, 34: 286–310. doi: 10.1111/j.1551-6709.2009.01073.x
Publication History
- Issue published online: 9 FEB 2010
- Article first published online: 27 OCT 2009
- Received 4 March 2008; received in revised form 22 July 2009; accepted 24 July 2009
- Abstract
- Article
- References
- Cited By
Keywords:
- Syntax;
- Natural language processing;
- Word order
Abstract
A well-established principle of language is that there is a preference for closely related words to be close together in the sentence. This can be expressed as a preference for dependency length minimization (DLM). In this study, we explore quantitatively the degree to which natural languages reflect DLM. We extract the dependencies from natural language text and reorder the words in such a way as to minimize dependency length. Comparing the original text with these optimal linearizations (and also with random linearizations) reveals the degree to which natural language minimizes dependency length. Tests on English data show that English shows a strong effect of DLM, with dependency length much closer to optimal than to random; the optimal English grammar also has many specific features in common with English. In German, too, dependency length is significantly less than random, but the effect is much weaker than in English. We conclude by speculating about some possible reasons for this difference between English and German.

1551-6709/asset/COGS_left.gif?v=1&s=e6356e7948de0d0d2160ef560660ca21adae9c8c)
1551-6709/asset/COGS_right.gif?v=1&s=057fe71f80a24ff4016ade3c8aace8dd77cc980f)
