Differences between skilled and less‐skilled young readers in the retrieval of semantic, phonological, and shape information
Abstract
This study aimed to determine whether the reading skills of third‐grade schoolchildren are associated with their preferences for semantic, phonological, and shape competitors (images or printed words) after being exposed to a spoken critical word. Two groups of children participated: skilled readers and less‐skilled readers. Through a language‐mediated visual search, children's fixations on the three competitors and a distractor were measured. When looking at images, both groups of readers preferred to look at the semantic competitor. When reading words, both groups showed a preference for the phonological competitor, but only skilled readers were sensitive to semantic information. These results suggest that early reading skills influence access to different types of representations in response to hearing a word, and they confirm the existence of a cascaded activation of information retrieval in childhood.
What is already known about this topic
- Reading skills influence linguistic processing in children and adults.
- Retrieval of information is cascaded from childhood.
- Learning to read modifies previous word representations.
What this paper adds
- Children's reading skills influence attention to specific information in both image and printed word modalities.
- Skilled and less‐skilled young readers access phonological information when reading words and semantic information when looking at pictures.
- Only skilled readers are sensitive to semantic relationships between spoken and printed words.
Implications for practice and/or policy
- The speed and type of information children can access in a visual task depend on their reading skills.
- Understanding that children with similar age and IQ and in the same grade at school have different reading skills can help us to design reading programs.
- Longitudinal evaluations will help us determine the effects of reading skills on children's ability to access perceptual, phonological, and semantic information.
Studies have found differences in the processing of linguistic stimuli attributable to reading skills and years of literacy. Literacy modulates the prediction of linguistic stimuli: adults with a mean of 15 years of formal education, but not those with 2, predict a visual target before it is named, based on semantic, associative, and syntactic cues (Mishra, Singh, Pandey, & Huettig, 2012). Mani and Huettig (2014) also reported a correlation between reading skills and the ability to predict linguistic input in 8‐year‐old children exposed to a spoken semantically related verb (‘eat’), a visual target (‘cake’), and a visual distractor: good readers were better at predicting a target than poor readers. The authors suggest that the acquisition of orthographic representations of words in learning to read shapes children's preexisting lexical representations and affects their subsequent processing of linguistic stimuli.
Unlike oral language, reading acquisition requires formal instruction in decoding written alphabetic language. Learning to read modifies previous word representations and is related to phonological as well as orthographic ability. Phonological representations are likely to be adjusted when the orthographic code is acquired. Children not only establish relationships among the letters that constitute written words, but also among oral language phonemes (Mani & Huettig, 2014; Schild, Röder, & Friedrich, 2011).
Phonological awareness – the discovery that discourse can be fragmented into phonemes, which can be reordered to form different words – precedes the acquisition of reading skills (Bowyer‐Crane et al., 2008). It has also been suggested that reading acquisition changes the approach to solving phonological awareness tasks, given that orthographic and phonological information are simultaneously available (Castles & Coltheart, 2004). Although both high and low literates show similar performance in phonemic discrimination, syllable rhyme detection, and phonological sensitivity involving the manipulation of sounds, illiterates have difficulty in identifying and interchanging phonemes and syllable order, which are also phonological awareness abilities (Petersson, Reis, Askelöf, Castro‐Caldas, & Ingvar, 2000). Thus, although phonological abilities exist before reading, specific phonological abilities may be more accurate after the acquisition of a written code. The connectionist model of Plaut, McClelland, Seidenberg, and Patterson (1996), which allows word‐reading simulations of readers with and without reading difficulties, explains the processing of orthographical representations during reading. The model suggests an interaction between the phonological and semantic routes for reading: it shows how phonological and semantic aspects interact and how the nature of such interaction can be modified as readers learn and gain experience.
Beginning in childhood, hearing a word (‘cat’) activates its phonemes (/kat/) (Huang & Snedeker, 2011), visual features (Arias‐Trejo & Plunkett, 2010), and semantically related words (e.g., ‘dog’) (Mani, Durrant, & Floccia, 2012); literate adults also activate orthographic representations (Huettig & McQueen, 2007), which are linked to phoneme detection (Perre, Pattamadilok, Montant, & Ziegler, 2009; Ziegler, Ferrand, & Montant, 2004). Adults find it easier to detect orthographically consistent words, where the orthographic representation of the phoneme is the same (e.g., /b/, ‘boat’, ‘bowl’), than those that vary (e.g., /f/, ‘farm’, ‘pharmacy’) (Cutler & Davis, 2012). Adults are slower at deciding whether two words rhyme when orthography varies (e.g., ‘boat‐note’) than when it is consistent (e.g., ‘bank‐tank’) (Ziegler et al., 2004); and they are efficient at detecting phonological similarities (e.g., in the onset and coda). This work aims to shed light on whether children with different reading skills but the same years of literacy also respond to phonological similarities within written and auditory modalities.
It is unclear whether children's degree of literacy has equal effects on semantic and shape‐similarity processing. Reis and Castro‐Caldas (1997) found more difficulties in semantic and phonological memory tasks in illiterate adults than in literate ones (though the differences were significant only in phonological memory tasks). At the electrophysiological level, there are differences between adults with high and low comprehension abilities in the amplitude of the P200 and the N400 wave forms (associated with phonological and semantic processing, respectively), yet there are no differences in phonological processing (Landi & Perfetti, 2007). Low comprehension skills could therefore be explained in part by semantic processing difficulties (Landi & Perfetti, 2007).
A model proposed by Harm and Seidenberg (2004) sheds light on the processes involved in determining meaning from printed words. The mechanism can be direct lexical access (orthography to meaning), phonologically mediated lexical access (orthography to phonology to meaning), or both routes simultaneously. The activation of meaning is built in time based on a continuous input of available information, mainly from these two sources. In this model, there is a distributed semantic pattern rather than the instant access to meaning.
Reading abilities also correlate with response time in lexical decision tasks: fourth‐ and sixth‐grade children with poor reading comprehension have slower reaction times than their average counterparts (Choi & Hwang, 2010). In verbal fluency tasks, there are significant differences between literates and illiterates in producing words with a specific phoneme, but not in producing words from a category (e.g., animals) (Kosmidis, Tsapkini, Folia, Vlahou, & Kiosseoglou, 2004). Literacy also has an effect on the perceptual abilities needed to rapidly identify and name black‐and‐white as opposed to color images (Petersson, Reis, & Ingvar, 2001), mirror images as opposed to originals (Kolinsky & Verhaeghe, 2012), and two‐dimensional as opposed to three‐dimensional images (Reis, Petersson, Castro‐Caldas, & Ingvar, 2001).
Words produce a cascaded activation of related phonological, orthographic, and perceptual information in children and adults (Huang & Snedeker, 2011; Huettig & McQueen, 2007; Mani et al., 2012). Huettig and McQueen (2007) investigated adults' access to long‐term memory after exposure to an auditory word and images or words. Participants looking at images tended to retrieve phonological, shape, and semantic information, while those looking at words retrieved primarily phonological information. The competition among these types of information suggests the existence of cascaded processing, the incremental propagation of information across multiple levels of analysis (Huang & Snedeker, 2011), at least for images.
Linguistic input can activate phonological, orthographical, perceptual, and semantic representations of the processed word. The identification of auditory input favours the activation of phonological information first, followed by perceptual and semantic information (McClelland, 1979). The identification of written words facilitates access to orthographic and phonological information, followed by semantic and perceptual information (Coltheart, Rastle, Perry et al., 2001). Finally, the identification of images allows rapid access to perceptual and semantic characteristics, and then to phonological activation (Levelt, Schriefers, Vorberg, Meyer, Pechman, & Havinga, 1991). The activation can be propagated to other words through characteristics shared within the same or different sensory modality. Huettig and McQueen (2007), for example, observed that participants presented with an auditory stimulus (‘balloon’) looked first at phonologically related images (‘bath’), and then at those with a perceptual relation (of shape: ‘sun’) and a semantic relation (‘doll’). However, when the images were replaced with written words, participants looked only at the phonologically related competitor (‘bath’). This difference suggests that auditory words enhance coactivation of different representations than written ones, and that there are discrepancies in the timing of the coactivation.
The ability to retrieve information from visual stimuli is related to reading skills. Huettig, Singh, and Mishra (2011) found that highly literate adults, but not those with lower literacy, retrieve phonological information from exposure to images in an auditory–visual cross‐modal eye‐tracking task. In Experiment 1, participants listened to spoken sentences containing a critical word and looked at a visual display of four objects: a phonological competitor, a semantic competitor, and two unrelated objects. In Experiment 2, the semantic competitor was replaced by an unrelated object. In both experiments, high literates shifted their attention to the phonological competitor as soon as phonological information was available. However, low literates preferred to look at the phonological competitor only in Experiment 2. Low‐literacy participants did not engage in a struggle between different types of cognitive representations.
In image processing, the spontaneous fixation on a visual referent upon hearing a word can be the product of the partial semantic overlap between a word and an object. Ocular movements respond to the degree of coincidence between the word and the mental representation of the objects present in the visual field (Huettig & Altmann, 2005). However, in word identification, the activation of information depends on the precision and flexibility of the mental representations of words in relation to their orthography, phonology, and meaning (Perfetti, 2007). The lexical high‐quality hypothesis suggests that the quality of word representations affects reading skills such as comprehension (Perfetti, 2007). High‐quality representations consist of phonological, orthographic, and meaning characteristics that are fully specified and strongly linked, activating different types of information in relation to a particular word (Richter, Isberner, Naumann, & Neeb, 2013). Such representations are required for reading comprehension (Perfetti & Hart, 2001; Perfetti, 2007). Richter et al. (2013) reported that the efficiency and precision of orthographic, phonological, and semantic representations in a group of children (mean age 8.27 years) explained individual variability in a computerised test of text comprehension. There is a link between lexical quality and reading skills: readers with poor lexical representations are at risk of retrieving imprecise or incomplete lexical information during reading (Richter et al., 2013).
The present study uses an eye‐tracking experiment to identify the recovery of phonological, semantic, and perceptual knowledge of third‐grade children (average age 8 years) upon hearing a critical word while exposed either to images or printed words. It aims to establish whether the retrieval of information by these skilled and less‐skilled children is partially modulated by their reading skills, depending on the presentation modality (images or words). Differences between these groups would suggest an early impact of literacy on the manner in which information is retrieved, although its influence may be stronger with words than with images. Skilled readers (SRs) might possess high‐quality representations that allow them to retrieve precise information from auditory linguistic stimuli.
Method
Participants
The participants were 194 Mexican third‐grade students, recruited from local public schools, who were tested for intelligence quotient (IQ) and reading skills. All children were monolingual native Spanish speakers, with normal or corrected‐to‐normal vision. The final sample, after IQ and reading skills tests, consisted of 62 children (M = 8 years 7 months, SD = .28, range: 8;1–9;1), 25 boys and 37 girls. Two groups were formed: SRs and less‐skilled readers (LSRs). One participant was not included in the statistical analysis because of data loss. Participant characteristics and test results are given in Tables 1 and 2. Half of the participants saw the image condition (n = 30; 17 SRs and 13 LSRs) and the other half the word condition (n = 31; 16 SRs and 15 LSRs).
| Group | N | Age | Boys/Girls | IQ | Reading speed | Reading precision | Reading comprehension |
|---|---|---|---|---|---|---|---|
| M (SD) | M (SD) | M (SD) | M (SD) | M (SD) | |||
| Skilled readers | 17 | 8.7 (0.27) | 4/13 | 98.05 (5.5) | 110.88 (4.04) | 102.88 (12.25) | 111.76 (13.8) |
| Less‐skilled readers | 13 | 8.5 (0.30) | 9/4 | 96.00 (5.5) | 92.84 (3.6) | 93.15 (16.3) | 98.53 (13.67) |
| t tests | — | t(28) = 1.7, p = ns | — | t(28) = 1.0, p = ns | t(28) = 12.6, p < .001 | t(28) = 1.8, p = .07 | t(28) = 2.61, p < .05 |
| Group | N | Age | Boys/Girls | IQ | Reading speed | Reading precision | Reading comprehension |
|---|---|---|---|---|---|---|---|
| M (SD) | M (SD) | M (SD) | M (SD) | M (SD) | |||
| Skilled readers | 16 | 8.56 (0.24) | 6/10 | 95.50 (6.0) | 110.31 (4.2) | 102.06 (9.9) | 102.06 (11.3) |
| Less‐skilled readers | 15 | 8.60 (0.28) | 6/9 | 95.06 (4.9) | 92.6 (4.01) | 74.2 (39.6) | 95.33 (17.39) |
| t tests | — | t(29) = 1.2, p = ns | — | t(29) = 0.21, p = ns | t(29) = 9.93, p < .001 | t(29) = 3.27, p < .05 | t(29) = 2.76, p = ns |
With the aim of verifying that both groups of readers differed in their reading skills independent of the modality (images or written words), we performed a series of mixed independent analyses of variance for the reading measures of speed, comprehension, and precision, with the within‐subjects factors Reading Level (SR or LSR) and Modality (images or words). A main effect of Reading Level was found for speed (F(1,57) = 300.02, p < .001), with a higher score in the SR group than in the LSR group; however, neither Modality (F(1,57) = .15, p = .69) nor the interaction between Reading Level and Modality (F(1,57) = .02, p = .87) were significant. A main effect of Reading Level was found for comprehension (F(1,57) = 7.48, p = .008); however, neither Modality (F(1,57) = 3.13, p = .08) nor the interaction between Reading Level and Modality (F(1,57) = .79, p = .37) were significant. A significant effect of Reading Level was obtained for precision (F(1,57) = 10.42), p = .002); SRs had higher scores than LSRs, but neither the main effect of Modality (F(1,57) = 2.88, p = .09) nor the interaction between Reading Level and Modality (F(1,57) = 2.42), p = .12) were significant. In sum, significant differences exist between groups of readers, independent of the modality. See Tables 1 and 2 for detailed statistics.
Instruments
IQ scoring
Children's IQ scores were calculated using the abbreviated Spanish version of the Wechsler Scale of Intelligence for Children WISC‐IV (Wechsler, 2007), including block design, digit span, similarities, and coding tests (Sattler, 2010). To guarantee that participants had similar IQ scores independent of their reading skills, both SR and LSR children were selected with average scores (90–110).
Reading skills
Reading skills were tested using the reading subtest of the Evaluación Neuropsicológica Infantil ENI1 (Matute, Rosselli, Ardila, & Ostrosky‐Solís, 2007), standardised for Mexican school‐age children. This subtest evaluates reading speed, precision, and comprehension. Reading speed scores were correlated with comprehension (r = .539, p < .001) and precision (r = .605, p < .001). SRs scored in the 63rd–84th percentiles on the reading speed task, and LSRs in the 16th–37th percentiles. Reading speed scores tend to be a predictor of overall reading skills (Rosselli, Matute, & Ardila, 2006). In Romance languages, the number of letters in a word determines the time a child takes to read it (Zoccolotti et al., 2005). As a good reader is thus capable of identifying letters, syllables, and words faster than a poor reader (Rosselli et al., 2006), we decided to classify participants' reading skills according to speed scores.
Stimuli
One‐hundred Spanish words familiar to 8‐year‐old children were selected from the corpus Cómo usan los niños las palabras2 (Alva et al., 2001). Twenty experimental trials were arranged, each with five different words: a spoken critical word, three related competitors, and a distractor. Each critical word was embedded in a neutral carrier phrase (e.g., Mira, pájaro. ‘Look, bird’). Word length was controlled by the number of syllables in the four options (phonological, semantic, shape competitor, distractor). Of the 20 experimental trials, eight presented disyllabic and five trisyllabic words, and seven combined two disyllabic and two trisyllabic words. Speech stimuli were produced by a female speaker of Mexican Spanish in a neutral intonation (48,000 Hz, 16 bits). The other four stimuli in each set were either images (black and white, 300 × 300 pixels) or printed words (Arial Black font, 100 points). Images were selected from the pictorial objects set of Snodgrass and Vanderwart (1980) and from public internet databases.
The three competitors and the distractor consisted of (1) a phonological and orthographic competitor for which the first consonant and vowel were the same as for the spoken critical word (e.g., pájaro‐pala; ‘bird‐shovel’), unrelated to the critical word both semantically and in the shape denoted; (2) a semantic competitor (e.g., leche, ‘milk’ for the critical word galleta, ‘biscuit’), selected from Word Association Norms for Mexican Children (Arias‐Trejo & Barrón‐Martínez, 2014) and in two cases from the adult Word Association Norms in Mexican Spanish (Barrón‐Martínez & Arias‐Trejo, 2014); it was unrelated to the critical word both phonologically and in shape; (3) a shape competitor, perceptually similar to the referent of the critical word, not an orthographic competitor, and unrelated phonologically and semantically to the critical word (e. g., galleta, ‘biscuit’ for balón, ‘ball’); and (4) a distractor unrelated in phonology, semantics, or shape to the critical word.
In Spanish, an onset phonological competitor is also an onset orthographic competitor. This was the case with the first two phonemes where we matched the critical word and the phonological competitor. Although there are phonological competitors that are not full orthographic competitors in the whole syllable (e.g., vaca‐bata), there are not enough such familiar words to be able to manipulate this competition. Moreover, there are few phonemes in Spanish that are phonological but not orthographic competitors (e.g., g–j).
Experimental design
Trials lasted 5,000 ms each. A central fixation point appeared on screen for 600 ms, followed by a blank screen for another 600 ms, and then the four pictures or printed words appeared simultaneously and the carrier phrase began. The onset of the spoken critical word occurred 800 ms after the onset of pictures or printed words. The analysis window started 240 ms after the onset of the critical word and ended at 2,000 ms (see Figure 1). Preliminary tests were conducted to ensure that children were able to track the four words.

No trial was presented to a child more than once. The order of presentation of the 20 trials and the position of the four images on screen were counterbalanced across children in four sequences. Each competitor type was presented five times per sequence in the same location of the quadrant.
Procedure
The trials were displayed on a 23‐inch screen located above a Tobii X2‐30 eye‐tracker, using the Tobii Studio 3.2 package. Participants were seated approximately 60 cm from the screen. Auditory stimuli were presented through a single loudspeaker located behind the screen. Children were told that they would watch a video. They were asked to listen to the sentences and to look at the screen throughout the presentation of the video. The eye‐tracker records gaze data at 30 Hz, with an average accuracy of 0.5° visual angle. The gaze of each child was calibrated prior to testing using a five‐point calibration procedure. If four or more points were successfully calibrated for both eyes, the experiment proceeded.
Scoring
Areas of interest were defined according to the size of the individual images (300 × 300 pixels) and printed words (444 × 138 pixels). Total looking time and the proportional time of fixation were calculated for each trial. Trials in which participants looked at stimuli less than 40% of the time during the 240–2,000 ms analysis window were discarded (image modality: 2.7%; printed word modality: 4.5%). Similar criteria have been used in research with children to eliminate trials where participants are not paying attention (e.g., Arias‐Trejo & Plunkett, 2009; Graham, Baker, & Poulin‐Dubois, 1998).
Results
Analysis of proportion of looking time
The proportion of looking time was entered into a 4 × 2 × 2 analysis of variance with the within‐subjects factor Competitor (phonological–semantic–shape–distractor) and the inter‐subjects factors Reading Level (SR‐LSR) and Modality (images‐printed words). Greenhouse–Geisser correction was used for significance values to avoid Type I Error. The analysis revealed a main effect of Competitor (F(3, 55) = 18.59, p < .001), a two‐way interaction between Competitor and Modality (F(3, 55) = 44.16, p < .001), and a three‐way interaction between Competitor, Reading Level, and Modality (F(3, 57) = 3.39, p < .05). We discuss the results of this three‐way interaction below (see Figure 2).

Image modality
Post hoc t tests were carried out, and p‐values were adjusted with Bonferroni correction. The t tests indicated that SRs had more fixations on the semantic than on the shape competitor (t(15) = 4.77, p < .001), the phonological competitor (t(15) = 6.85, p < .001), or the distractor (t(15) = 8.61, p < .001). They also had more fixations on the shape than on the phonological competitor (t(15) = 3.37, p = .004) or the distractor (t(15) = 6.56, p < .001), and marginally more fixations on the phonological competitor than on the distractor (t(15) = 2.03, p = .06). SRs looked significantly above chance level at the semantic competitor (t(15) = 7.26, p < .001), and below at the phonological competitor (t(15) = 4.35, p = .001) and the distractor (t(15) = 8.85, p < .001).
LSRs had more fixations on the semantic than on the shape competitor (t(13) = 4.97, p < .001), the phonological competitor (t(13) = 7.79, p < .001), or the distractor (t(13) = 8.15, p < .001). They had more fixations on the shape than on the phonological competitor (t(13) = 5.73, p < .001), and their fixations on the phonological competitor and the distractor were similar (t(13) = .84, p = .41). LSRs looked above chance level at the semantic competitor (t(13) = 5.27, p < .001) and below at the phonological competitor (t(13) = 9.48, p < .001).
As a group, SRs had more fixations on the phonological competitor than LSRs (t(13) = 2.57, p = .02) and marginally fewer on the semantic competitor (t(13) = 1.81, p = .09). Fixation differences were not significant for the shape competitor (t(13) = .86, p = .40) or distractor (t(13) = .84, p = .41).
Printed word modality
SRs had more fixations on the phonological than on the semantic competitor (t(16) = 2.21, p = .04), the shape competitor (t(16) = 4.77, p < .001), or the distractor (t(16) = 2.47, p = .02). They had more fixations on the semantic than on the shape competitor (t(16) = 2.70, p = .01). They looked above chance level at the phonological competitor (t(16) = 3.35, p = .004) and below at the shape competitor (t(16) = 4.71, p < .001).
LSRs had more fixations on the phonological than on the semantic competitor (t(13) = 3.69, p = .003), the shape competitor (t(13) = 3.98, p = .002), or the distractor (t(13) = 2.38, p = .03). LSRs looked above chance level at the phonological competitor (t(13) = 3.34, p = .004) and below at the shape (t(13) = 3.36, p = .005) and semantic competitors (t(13) = 2.85, p = .01).
As a group, SRs had more fixations on the semantic competitor than LSRs (t(14) = 3.89, p = .02), but not for the phonological (t(14) = 1.58, p = .13) or shape competitor (t(14) = .33, p = .74) or the distractor (t(14) = .11, p = .91).
Time‐course analysis
A time‐course analysis revealed that the looking trajectories were dependent on the modality of presentation and the participant's reading skills (Figure 3). The proportion of fixations on the three related competitors was compared with that on the distractor to verify the sensitivity of the established relationships between the spoken critical word and the competitors. We performed a baseline correction
, where x is each fixation in the period from 200 to 2,000 ms after the spoken critical word, and BL is the baseline average beginning at 200 ms before the onset of the spoken critical word. The data are therefore symmetrical around zero: positive values mean a higher proportion of fixation on a competitor and negative values a lower proportion of fixation on a competitor. The mean fixation proportion for each type of competitor was compared with that for the distractor, with t tests performed every 200 ms in the time interval from 200 to 2,000 ms after onset of the spoken critical word (as an estimate of the earliest point at which a fixation could reflect a response based on information).

Image modality
SRs had more fixations on the semantic competitor than on the distractor from 1,000 to 2,000 ms (p < .001), while LSRs had more from 200 to 2,000 ms (p < .05). SRs had more fixations on the shape competitor than on the distractor from 600 to 2,000 ms (p < .05), whereas LSRs had more from 1,600 to 2,000 ms (p < 05). SRs had more fixations on the phonological competitor than on the distractor from 1,100 to 1,200 ms (p < .01). LSRs had similar fixations on the phonological competitor and the distractor in all time intervals (p > .99).
LSRs had more fixations than SRs on the semantic competitor from 200 to 600 ms and 1,000 to 1,400 ms (p < .05). SRs looked more at the shape competitor than LSRs from 400 to 600 ms (p = .04). SRs looked marginally more at the phonological competitor than LSRs from 1,200 to 1,500 ms (p = .06). Both groups had similar fixations on the distractor in all time intervals (p > .05) (See Table 3 for detailed statistics).
| Image modality | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Phonological | Semantic | Shape | ||||||||||
| SRs | LSRs | SRs | LSRs | SRs | LSRs | |||||||
| Time (ms) | Mean (SD) | p‐Value | Mean (SD) | p‐Value | Mean (SD) | p‐Value | Mean (SD) | p‐Value | Mean (SD) | p‐Value | Mean (SD) | p‐Value |
| 200–399 | 0.05 (0.32) | > .99 | 0.01 (0.31) | > .99 | 0.05 (0.15) | > .99 | 0.16 (0.19) | .008 | 0.07 (0.22) | > .99 | 0.01 (0.21) | > .99 |
| 400–599 | 0.05 (0.31) | > .99 | 0.05 (0.43) | > .99 | 0.09 (0.25) | > .99 | 0.20 (0.31) | .008 | 0.11 (0.30) | .5 | ‐0.03 (0.24) | > .99 |
| 600–799 | 0.11 (0.31) | .88 | 0.08 (0.29) | > .99 | 0.19 (0.37) | .18 | 0.26 (0.27) | .04 | 0.20 (0.33) | .04 | 0.03 (0.18) | > .99 |
| 800–999 | 0.13 (0.30) | .44 | 0.09 (0.28) | > .99 | 0.18 (0.29) | .08 | 0.33 (0.24) | .0006 | 0.16 (0.26) | .04 | 0.07 (0.19) | > .99 |
| 1,000–1,199 | 0.20 (0.24) | .006 | 0.04 (0.19) | > .99 | 0.28 (0.23) | < .0001 | 0.39 (0.21) | < .0001 | 0.25 (0.26) | .002 | 0.17 (0.28) | .4 |
| 1,200–1,399 | 0.17 (0.35) | .15 | 0.05 (0.20) | > .99 | 0.35 (0.35) | .0004 | 0.44 (0.21) | < .0001 | 0.25 (0.26) | .002 | 0.19 (0.22) | .06 |
| 1,400–1,599 | 0.16 (0.28) | .15 | 0.04 (0.24) | > .99 | 0.43 (0.35) | < .0001 | 0.36 (0.22) | .0002 | 0.29 (0.27) | .0007 | 0.15 (0.25) | .1 |
| 1,600–1,799 | 0.13 (0.30) | .44 | 0.05 (0.26) | > .99 | 0.49 (0.29) | < .0001 | 0.45 (0.22) | < .0001 | 0.30 (0.25) | .0002 | 0.17 (0.28) | .01 |
| 1,800–1,999 | 0.11 (0.28) | .69 | 0.08 (0.25) | > .99 | 0.44 (0.23) | < .0001 | 0.48 (0.30) | < .0001 | 0.28 (0.24) | .0001 | 0.22 (0.18) | .004 |
- Note: p‐Values were adjusted with Bonferroni correction.
Printed word modality
SRs had more fixations on the phonological competitor than on the distractor in the interval from 1,200 to 2,000 ms (p < .05), while LSRs had more from 1,200 to 2,000 ms (p < .01). SRs had more fixations on the semantic competitor than on the distractor from 1,400 to 2,000 ms (p < .01), while LSRs did not show significant differences between the semantic competitor and the distractor in any of the time segments. SRs did not show significant differences between fixations on the shape competitor and those on the distractor, whereas LSRs had more on the shape competitor between 1,600 and 1,800 ms (p = .04).
SRs had more fixations than LSRs on the semantic competitor from 1,200 to 2,000 ms (p < .05), while LSRs had more fixations than SRs on the shape competitor from 400 to 1,000 ms (p < .05). LSRs had marginally more fixations than SRs on the phonological competitor from 1,200 to 1,800 ms (p < .06). Both groups had similar fixations on the distractor in all time intervals (p > .05) (See Table 4 for detailed statistics).
| Printed word modality | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Phonological | Semantic | Shape | ||||||||||
| SRs | LSRs | SRs | LSRs | SRs | LSRs | |||||||
| Time (ms) | Mean (SD) | p‐Value | Mean (SD) | p‐Value | Mean (SD) | p‐Value | Mean (SD) | p‐Value | Mean (SD) | p‐Value | Mean (SD) | p‐Value |
| 200–399 | 0.00 (0.22) | > .99 | 0.00 (0.28) | > .99 | ‐0.02 (0.16) | > .99 | ‐0.01 (0.24) | > .99 | ‐0.02 (0.16) | > .99 | ‐0.02 (0.20) | > .99 |
| 400–599 | 0.13 (0.24) | .46 | 0.04 (0.35) | > .99 | 0.02 (0.19) | > .99 | 0.05 (0.35) | > .99 | ‐0.07 (0.18) | > .99 | 0.02 (0.28) | > .99 |
| 600–799 | 0.21 (0.27) | .06 | 0.13 (0.36) | .93 | 0.07 (0.24) | > .99 | 0.03 (0.37) | > .99 | ‐0.01 (0.25) | > .99 | 0.07 (0.27) | > .99 |
| 800–999 | 0.23 (0.34) | .09 | 0.17 (0.38) | .59 | 0.09 (0.24) | > .99 | ‐0.01 (0.46) | > .99 | ‐0.01 (0.24) | > .99 | 0.13 (0.30) | .48 |
| 1,000–1,199 | 0.26 (0.40) | .09 | 0.31 (0.45) | .08 | 0.12 (0.24) | .97 | 0.03 (0.45) | > .99 | 0.05 (0.20) | > .99 | 0.12 (0.35) | > .99 |
| 1,200–1,399 | 0.32 (0.38) | .008 | 0.47 (0.37) | .0006 | 0.19 (0.22) | .06 | 0.09 (0.36) | > .99 | 0.14 (0.21) | .2 | 0.10 (0.30) | > .99 |
| 1,400–1,599 | 0.30 (0.33) | .01 | 0.47 (0.39) | .0003 | 0.24 (0.19) | .003 | 0.12 (0.31) | .92 | 0.08 (0.20) | > .99 | 0.09 (0.27) | > .99 |
| 1,600–1,799 | 0.28 (0.35) | .03 | 0.51 (0.41) | .0002 | 0.27 (0.23) | .002 | 0.14 (0.30) | .49 | 0.06 (0.17) | > .99 | 0.15 (0.24) | .04 |
| 1,800–1,999 | 0.21 (0.39) | .24 | 0.41 (0.41) | .006 | 0.25 (0.29) | .006 | 0.03 (0.26) | > .99 | 0.01 (0.15) | > .99 | 0.12 (0.27) | .15 |
- Note: p‐Values were adjusted with Bonferroni correction.
Discussion
The aim of this study was to identify the recovery of phonological, semantic, and perceptual knowledge of 8‐year‐old SRs and LSRs upon hearing a critical word while exposed to either images or printed words in an eye‐tracking experiment.
In the image modality, both SRs and LSRs preferred to look at the semantic competitor over both the phonological and shape competitors and the distractor. The proportion of looking time and the tracking trajectory showed that their next preference was the shape competitor. SRs, but not LSRs, exhibited a marginal late preference for the phonological competitor. Time‐course analysis showed that attentional shifts in SRs were initially to the shape competitor, then to the semantic competitor, and finally to the phonological competitor. However, these preferences showed a delay that may reflect a strategic effect. In LSRs, attentional shifts were initially to the semantic competitor and then to the shape competitor. This group did not show an attentional shift to the phonological competitor.
The time‐course analysis for the image modality is partially consistent with the findings in adults of Huettig and McQueen (2007), but only for SRs. Although in the present study the previsualisation stimulus time (800 ms) before the spoken critical word was similar to that of Huettig and McQueen's Experiment 1, our results were similar to those of their Experiment 2, in which the previsualisation time was shorter (200 ms). However, we did not find early attentional shifts to the phonological competitor in the image modality, as in their Experiment 2. It is possible that because children's processing is slower than that of adults (Jescheniak, Hahne, Hoffmann, & Wagner, 2006), the previsualisation time was insufficient to retrieve image names. The late retrieval of phonological similarity by SRs was possibly accomplished through a postlexical coactivation.
An alternative or complementary explanation for our results is related to children's short memory span (Vogan, Morgan, Powell, Smith, & Taylor, 2016), which causes difficulties in recalling and maintaining labels. However, children's retrieval of image names is an automatic process, and the free propagation of information causes phonological coactivation. In addition, it has been shown that 8‐year‐olds can recall four verbal items (Vogan et al., 2016). Thus, a postlexical phonological coactivation due to slow retrieval of picture names is the likely explanation for our results. Although this explanation might be valid for SRs, LSRs presented a temporal retrieval of information morphologically different from that of SRs or the adults in Huettig and McQueen (2007). The LSRs showed a faster attentional shift to semantic information, at 200 ms, and a delayed attentional shift to shape information, at 1,600 ms; they did not show attentional shifts to the phonological competitor.
The influence seen here of reading skills on retrieving the names of images coincides with reports employing other techniques, such as rapid automatised naming (RAN) tasks. In RAN tasks, participants must name images of familiar objects, digits, or letters as quickly as possible. Poor performance in RAN is related to dyslexia, due to participants' difficulties in retrieving phonological information (Di Filippo, Zoccolotti, & Ziegler, 2008). Georgiou, Parrila, Cui, and Papadopoulos (2013) examined why RAN is related to reading in children, through the manipulation of processes at different production stages: input, processing, and output. Their explanation for the relationship is that both require serial processing and the pronunciation of the stimulus names.
The ability to identify an image depends both on detection of perceptual, physical properties, and semantic processing of meaning (Farah, 1997). Exposure to words also involves activation of different components, including those that are phonological, perceptual, and semantic. The divergence between our results and those of Huettig and McQueen (2007) could be explained by difficulties for LSRs in the activation of a semantic competitor, which requires a link between a word (spoken critical word), a second word (name of semantic competitor), and an image (semantic competitor). Activation of a shape competitor requires a link between a word (spoken critical word), an image (spoken critical word), and another image (shape match). In the latter case, a comparison between the physical properties of two images is required: one not available (spoken critical word) and another available (target); this comparison involves online processing of the physical characteristics of the stimuli at a lexical representation level.
With the aim of assessing the similarity in shape of our images, an image validation study with 20 adults was conducted, after completing the experiment, to assess the perceptual similarity between a referent for the critical word and the shape competitor. This study used a Likert scale from 0 (less similar) to 7 (more similar). Although for these adults the shape competitor was more similar to the referent of the spoken critical word than the other competitors (semantic, phonological, and distractor), the scores (M = 2.88, range: 2.20–3.57) were not different from chance (3.5). These ratings suggest that the shape competitor was not highly similar to the referent of the critical word and that children thus found it difficult to detect the perceptual similarity. However, Huettig and McQueen (2007) also reported similarity ratings around chance level (M = 5.6 on a scale from 0 to 11). LSRs, unlike SRs, might have experienced difficulty in creating an online mental image of the referent of the spoken critical word to compare with the shape competitor. However, this explanation is not parsimonious, given evidence about the early categorisation of visual stimuli based on perceptual features (Colombo, McCollam, Coldren, Mitchell, & Rash, 1990) and the extension of words to perceptually similar referents (Graham & Poulin‐Dubois, 1999; Landau, Smith, & Jones, 1998). In these studies, children are generally presented with two images (target‐distractor), not four; shape similarity may be more evident in a simplified scenario in which the shape target is paired with a distractor, but not in a challenging scenario in which the shape competitor is in conflict with semantic and phonological competitors.
It has been suggested that learning to read is related to phonological processing (Blaiklock, 2004; Bowyer‐Crane et al., 2008; González, Romero, & Blanca, 1995); people with high‐level reading skills and literacy also have good phonological skills (Kosmidis et al., 2004; Reis et al., 2001). Thus, it is possible that phonological decoding abilities improve as reading and writing abilities develop. The differences between SRs and LSRs in the present study point to the existence of divergences in phonological processing in the early stages of reading acquisition, suggesting an incipient ability to transform the visual input into a phonological representation of the stimulus to compare it with the spoken critical word. The phonological processing of SRs is more similar to that previously described in low literacy adults than that of high literacy adults (Mishra et al., 2012).
In the written word modality, both SRs and LSRs preferred to look at the phonological competitor over the other options during the first 1,000 ms of the trials. SRs looked at the phonological competitor earlier (600 ms) than LSRs (1,200 ms). These results coincide with those of Huettig and McQueen (2007), who found that in reference to a previously spoken critical word, adults prefer to look at a phonological competitor early in the trial (300–600 ms in Experiments 3 and 4). In the present study, LSRs were slower at processing phonological information (1,200 ms). As previously noted, in Spanish, the onset of a phonological competitor is generally an orthographic competitor; both SRs and LSRs could benefit from this natural coincidence in the language.
In a later time window (1,100–2,000 ms) of the written word modality, SRs showed visual preference for the semantic competitor, but the visual attention of LSRs remained with the phonological competitor. Thus, although both groups detected phonological similarity between the onset of the spoken critical word and that of the phonological competitor, better reading skills facilitated identification of alternative semantic relationships. High‐quality representations might have allowed SRs in the written modality to activate different types of information, such as phonology, orthography, and semantics, upon hearing the critical word (Perfetti, 2007). Learning and experience with reading can also influence the interaction of phonology and semantics (Plaut et al., 1996). The greater fixation of SRs on the semantic competitor in the written modality indicates that they are in the process of developing high lexical representations. Incomplete retrieval of lexical information by LSRs – attention to phonological but not semantic information – may be the consequence of poor lexical representation. Although both groups of readers develop their reading abilities and refine the necessary skills to access meaning, in the written modality only SRs employ equally the direct route (orthography–meaning) and the mediated route (orthography–phonology–meaning) (Harm & Seidenberg, 2004) to determine the meaning of a word.
The sustained attention of LSRs to the phonological competitor in the written word modality reduced their probability of fixating on another competitor. Although Huettig and McQueen (2007) did not find a preference for the semantic competitor in adults, they analysed only an interval of 1,000 ms, as compared with the 2,000 ms in the present study; it thus remains unknown whether those adults had a preference for the semantic competitor after 1,000 ms. SRs in the present study might have been using reading strategies similar to those of more experienced readers, who use simultaneous grapheme decoding (lexical–semantic route), while LSRs were still decoding graphemes consecutively (sublexical route) (Dehaene, 2009). The reading scores of SRs were above the average reading percentile for their age, while the scores of LSRs were below average.
Existing evidence suggests that semantic processing abilities are not affected by reading acquisition because innate mechanisms allow for the establishment of semantic similarities, making it possible to form word associations without an orthographic code (Kosmidis et al., 2004). This evidence comes from verbal fluency tasks that show differences between literate and nonliterate adults in phonological but not semantic abilities: nonliterate adults evoke a greater number of semantically associated words than phonologically related ones (Reis & Castro‐Caldas, 1997). In the printed word modality of the present study, associating the spoken critical word with a semantically related written word required decoding and identification of the written word in order to access the mental lexicon and retrieve the associations stored in semantic memory. LSRs experienced difficulty in identifying the semantic relationship between the critical word and the semantic competitor. Choi and Hwang (2010) reported longer reaction times in a lexical decision task for low‐comprehension children than for their high‐comprehension counterparts. The authors argued that low‐comprehension participants are slower at, but not incapable of, accessing semantic information.
In the printed word modality, SRs but not LSRs had a significant preference for the semantic competitor, followed by the phonological competitor. SRs might have benefited from their reading speed, as this was the criterion by which they were categorised; however, given the correlations among reading speed, comprehension, and precision, it is likely that all three factors contributed. If the performance of SRs was due mainly to their speed, then a random pattern in their visual attention to words after the preference to the phonological competitor would be expected. However, SRs showed a systematic subsequent preference for the semantic competitor, while LSRs showed only a preference for the phonological competitor. In addition, participants had a display preview of 800 ms before the onset of the stimuli, so both groups should have had time to track the words on screen.
According to lexical processing models, phonological, orthographic, and semantic representations are important to the visual identification of words (Coltheart, Rastle, Perry et al., 2001). It is possible that school‐age children are capable of retrieving phonological representations in a rapid and precise manner, but that their retrieval of semantic knowledge is not fully developed (Nation & Snowling, 1998). Reading comprehension involves extracting meaning from written words, which relies strongly on the quality of the semantic representation (Richter et al., 2013). In reading words, LSRs did not find a relation between the spoken critical word and the displayed written competitors.
It is possible that SRs in this study had closely unified phonological, orthographic, and semantic representations – that is, high‐quality lexical representations – as is suggested by their preference for the semantic competitor. SRs may be successful at reading because they have accurate lexical representations that can be retrieved, while LSRs may have poor lexical representations, causing the retrieval of inaccurate or incomplete lexical information.
Although the preference of LSRs for the shape competitor, in the printed words condition, between 800 and 1,100 ms was not statistically significant, this result could be related to variability in fixations to distractors among individual LSRs. The preference for the shape competitor meant that children with reading difficulties needed to retrieve physical features or mental images of the words in order to retrieve semantic information and facilitate the reading comprehension process. In another cross‐modal lexical decision task (auditory‐reading), Moss, McCormick, and Tyler (1997) found semantic coactivation preceding shape activation; however, LSRs in the present study only coactivated shape information.
This study provides evidence of the differences among SRs and LSRs, at approximately 8 years of age, in their access to different representations of words in two modalities: images and printed words. Both groups showed a looking preference for the semantic competitor in the image modality and for the phonological competitor in the printed word modality. However, in the printed word modality, only SRs found a semantic relationship between the spoken critical word and the semantically related printed word, while in the image condition, only SRs were able to marginally detect a phonological relationship between the spoken critical word and the image of a phonologically related word.
This study cannot offer conclusions related to reading development in children, but research including prereaders and expert readers, as well as longitudinal evaluations, may allow analysis of the development of the ability to access perceptual, phonological, and semantic representations. Although it is beyond the scope of this study, it is worth noting that an equal number of years of formal education in children with similar age and IQs does not necessarily imply acquiring the same reading abilities. Future research should help us to understand the factors implicated in these differences and the possible consequences of classroom disparities.
Acknowledgments
We would like to thank parents, teachers, and students from the participating schools: Professor Jorge Casahonda Castillo, Alberto Lenz, Guadalupe Victoria, and Gral. Juan N. Álvarez. We are also grateful to the members of the Laboratorio de Psicolingüística for their help in testing participants.
This research was supported by a grant awarded to the third author: Investigación Científica Básica CONACyT (Grant No. 167900).
Notes
- 1 Child Neuropsychological Assessment.
- 2 How Children Use Words.
Biographies
Diana Rosalba Cortés‐Monter is pursuing an Applied Linguistics Master at the Faculty of Philosophy and Literature, the Philological Research Institute, and the Foreign Language Teaching Centre (CELE) at UNAM. Also, she is a part‐time researcher at the Psycholinguistics Laboratory at the Department of Psychology at UNAM. She obtained her bachelor's degree in Hispano‐American Linguistics and Literature at UNAM. She is interested in language processing, eye‐tracking, literacy, and reading acquisition.
Armando Quetzalcóatl Angulo-Chavira is pursuing a Behavioural Science Master at the Neuroscience Institute, Universidad de Guadalajara. He collaborates at the Psycholinguistics Laboratory at UNAM. He obtained his bachelor's degree in Psychology from Universidad de Guadalajara. His research is centered in language acquisition, language development, inhibition, and attention in typical and atypical populations.
Dr. Natalia Arias‐Trejo heads the Psycholinguistics Laboratory at the Department of Psychology at UNAM, which focuses on language acquisition and development within the context of typical and atypical development.




