Large-Scale Modeling of Wordform Learning and Representation
Article first published online: 10 FEB 2010
DOI: 10.1080/03640210802066964
2008 Cognitive Science Society, Inc.
Additional Information
How to Cite
Sibley, D. E., Kello, C. T., Plaut, D. C. and Elman, J. L. (2008), Large-Scale Modeling of Wordform Learning and Representation. Cognitive Science, 32: 741–754. doi: 10.1080/03640210802066964
Publication History
- Issue published online: 10 FEB 2010
- Article first published online: 10 FEB 2010
- Received 16 November 2005; received in revised form 24 August 2007; accepted 24 August 2007
- Abstract
- Article
- References
- Cited By
Keywords:
- Large-scale connectionist modeling;
- Sequence encoder;
- Simple recurrent network;
- Lexical processing;
- Orthography;
- Phonology;
- Wordforms
Abstract
The forms of words as they appear in text and speech are central to theories and models of lexical processing. Nonetheless, current methods for simulating their learning and representation fail to approach the scale and heterogeneity of real wordform lexicons. A connectionist architecture termed the sequence encoder is used to learn nearly 75,000 wordform representations through exposure to strings of stress-marked phonemes or letters. First, the mechanisms and efficacy of the sequence encoder are demonstrated and shown to overcome problems with traditional slot-based codes. Then, two large-scale simulations are reported that learned to represent lexicons of either phonological or orthographic wordforms. In doing so, the models learned the statistics of their lexicons as shown by better processing of well-formed pseudowords as opposed to ill-formed (scrambled) pseudowords, and by accounting for variance in well-formedness ratings. It is discussed how the sequence encoder may be integrated into broader models of lexical processing.

1551-6709/asset/COGS_left.gif?v=1&s=e6356e7948de0d0d2160ef560660ca21adae9c8c)
1551-6709/asset/COGS_right.gif?v=1&s=057fe71f80a24ff4016ade3c8aace8dd77cc980f)
1551-6709/asset/cover.gif?v=1&s=2e2195cda8fb83e5e9a53fa32d90c7d7f9cec837)