Individual differences in bilingual word recognition: The role of experiential factors and word frequency in cross-language lexical priming

Adel Chaouch-Orozco; Jorge González Alonso; Jason Rothman

doi:10.1017/S014271642000082X

Individual differences in bilingual word recognition: The role of experiential factors and word frequency in cross-language lexical priming

Published online by Cambridge University Press: 22 December 2020

Adel Chaouch-Orozco

Jorge González Alonso

and

Jason Rothman

Show author details

Adel Chaouch-Orozco*: Affiliation:
University of Reading
Jorge González Alonso: Affiliation:
UiT The Arctic University of Norway
Jason Rothman: Affiliation:
UiT The Arctic University of Norway; and Universidad Nebrija
*: *Corresponding author. Email: [email protected]

Article contents

Abstract
The asymmetry in models of bilingual lexical processing
Factors investigated in the literature as potential modulators of L2–L1 priming
Language exposure/use and immersion
The present study
Method
Results
Discussion
Footnotes
References

Rights & Permissions

Abstract

In studies of bilingual word recognition with masked priming, first language (L1) primes activate their second language (L2) translation equivalents in lexical decision tasks, but effects in the opposite direction are weaker (Wen & van Heuven, 2017). This study seeks to clarify the relative weight of stimulus-level (frequency) and individual-level (L2 proficiency, L2 exposure/use) factors in the emergence of asymmetrical priming effects. We offer the first data set where L2 proficiency and L1/L2 exposure/use are simultaneously investigated as continuous variables, along with word frequency. While we replicate the asymmetry in priming effects, our data provide useful insights into the factors driving L2–L1 priming. These fall almost exclusively under the category of stimulus-level factors, with L2 exposure/use being the only experiential variable to show considerable influence, although complex interactions involving L2 proficiency and word frequency are also present. We discuss the implications of these results for models of bilingual lexical processing and for the appropriate measurement of experiential factors in this type of research.

Keywords

bilingual lexical access bilingual word recognition lexical priming second language proficiency word frequency

Type: Original Article
Information: Applied Psycholinguistics , Volume 42 , Special Issue 2: Special Issue: Individual Differences in Bilingualism , March 2021 , pp. 447 - 474

DOI: https://doi.org/10.1017/S014271642000082X [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright: © The Author(s), 2020. Published by Cambridge University Press

The literature on multilingual lexical organization has been dominated by two different but interconnected debates. The first focused on whether the languages of a bi-/multilingual are subserved by the same or different neural networks in the brain (i.e., physically stored together). The second debate asked whether the lexical items of different languages are functionally independent of each other, or rather lexical selection is open to competition among potential candidates from several languages, irrespective of what the response-relevant language is in a given context. Researchers speak of language selective versus language nonselective lexical access, in reference to the disputed claim that, in a language selective model, only words from the response-relevant language may be considered for selection (e.g., Costa, Miozzo, & Caramazza, Reference Costa, Miozzo and Caramazza1999; Gerard & Scarborough, Reference Gerard and Scarborough1989; Macnamara & Kushnir, Reference Macnamara and Kushnir1971; see Costa, Reference Costa, Kroll and de Groot2005, for an extensive account).

In recent years, growing evidence has led to a moderate consensus around a view of the multilingual lexicon organized as a unitary system, where access occurs in a nonselective manner. That is, words from all languages are simultaneously active, to some degree, in comprehension and production (e.g., de Groot, Delmaar, & Lupker, Reference de Groot, Delmaar and Lupker2000; Dijkstra, Grainger, & van Heuven, Reference Dijkstra, Grainger and van Heuven1999; Kroll & Stewart, Reference Kroll and Stewart1994; van Heuven, Schriefers, Dijkstra, & Hagoort, Reference van Heuven, Schriefers, Dijkstra and Hagoort2008). Assuming this type of system, the focus must be placed on how exactly words from different languages are connected and interact, and what the nature of that relationship is—that is, at which level of representation it is established (e.g., Brysbaert & Duyck, Reference Brysbaert and Duyck2010; Brysbaert, Verreyt, & Duyck, Reference Brysbaert, Verreyt and Duyck2010; Dijkstra & Rekké, Reference Dijkstra and Rekké2010; Kroll, van Hell, Tokowicz, & Green, Reference Kroll, van Hell, Tokowicz and Green2010).

Most of the evidence supporting the nonselective view of the bi-/multilingual lexicon comes from studies where the degree of form and meaning similarity within the stimuli has been manipulated. The speed of access to cognate words, translation equivalents with a form and meaning overlap, has been shown to be faster than that to noncognate words, even in monolingual tasks (see Caramazza & Brones, Reference Caramazza and Brones1979, for the first report on the effect; van Hell & Dijkstra, Reference van Hell and Dijkstra2002, for cognate effects on first language [L1] lexical access; and Dijkstra et al., Reference Dijkstra, Grainger and van Heuven1999, for a second language [L2]; see also Lemhöfer, Dijkstra, & Michel, Reference Lemhöfer, Dijkstra and Michel2004, for cumulative effects in multilingualism). Words with similar orthography and/or phonology but with different meanings across languages, interlingual homographs, have also been exhaustively explored during the last decades. However, whether they yield facilitatory or inhibitory effects seems to be less clear, as this is dependent on factors such as the task employed or the stimulus list composition (e.g., Brenders, van Hell, & Dijkstra, Reference Brenders, van Hell and Dijkstra2011; Dijkstra et al., Reference Dijkstra, Grainger and van Heuven1999). The interaction of mental representations in the multilingual lexicon is not restricted to meaning and orthographic/phonological form. Cross-language activation has also been shown in priming studies exploring bilingual processing of compounds (Ko, Wang, & Kim, Reference Ko, Wang and Kim2011; Wang, Reference Wang2010) and derivation (e.g., Duñabeitia, Dimitropoulou, Morris, & Diependaele, Reference Duñabeitia, Dimitropoulou, Morris and Diependaele2013). What this body of research suggests is that words from different languages are activated and available for selection during production and comprehension, even in situations where only one of the languages is required.

While cognates and interlingual homographs are obvious candidates for shared or intimately related lexical representations across languages, it is likely that these are not the only points of contact within the multilingual lexicon. In that sense, noncognate translation equivalents (e.g., English arrow and Spanish flecha) have been a major focus in research on bi-/multilingual lexical access for the past two decades. Because sublexical features (e.g., orthography and phonology) are not shared in these pairs, we may reasonably assume them to be connected, at least, through their largely overlapping conceptual semantics. The existence of priming effects between them suggests that translation equivalents, cognate or not, activate shared semantic representations (Xia & Andrews, Reference Xia and Andrews2015, p. 295), and, therefore, have the potential to activate each other.

The masked translation priming paradigm, which employs the same mechanisms of subliminal priming originally devised by Forster and Davis (Reference Forster and Davis1984), has become one of the most common experimental setups in bi-/multilingual lexical processing research. In the typical procedure, a forward mask (e.g., #####) is displayed for a short period of time (typically 500 ms) and replaced by a word in one of the multilingual’s languages: the prime (e.g., flecha, Spanish for “arrow”). This is usually followed by the target word (or a backward mask), which in critical trials is the prime’s translation equivalent in another of the participant’s languages (e.g., arrow). Response times in these trials are compared to those in control ones, where the prime and the target also belong to different languages but bear no resemblance in meaning or form. As in standard masked priming, two measures are taken to ensure that the prime is processed only subconsciously. The first is to reduce the perceptual saliency of its onset and outset by means of forward and backward masking (note that in standard procedures the target itself acts as a backward mask); the second is to reduce display time to only a few milliseconds, typically between 40 and 70 and never above 85 (Clahsen, Balkhair, Schutter, & Cunnings, Reference Clahsen, Balkhair, Schutter and Cunnings2013), to avoid the risk of entering into the conscious processing time window, at about 100 ms prime duration, most subjects can report the primes.

The masked translation priming paradigm has most often been used in combination with lexical decision tasks (LDTs). In a (visual) LDT, participants are asked to indicate whether the letter string presented on screen is a word in the target language. For this reason, half of the target items in a standard LDT are nonwords. Studies employing masked translation priming in LDTs have consistently reported an asymmetry in the direction of the priming effects obtained with noncognate translation equivalents. Priming effects are robust and widely attested with L1 primes and L2 targets (e.g., de Groot & Nas, Reference de Groot and Nas1991; Jiang, Reference Jiang1999; Xia & Andrews, Reference Xia and Andrews2015). However, in the opposite translation direction (L2 primes and L1 targets) priming effects are either absent (e.g., Gollan, Forster, & Frost, Reference Gollan, Forster and Frost1997; Grainger & Frenck-Mestre, Reference Grainger and Frenck-Mestre1998) or significantly smaller than those produced by L1 primes on L2 targets (e.g., Basnight-Brown & Altarriba, Reference Basnight-Brown and Altarriba2007; Schoonbaert, Duyck, Brysbaert, & Hartsuiker, Reference Schoonbaert, Duyck, Brysbaert and Hartsuiker2009; see Wen & van Heuven, Reference Wen and van Heuven2017, for a comprehensive review).

The asymmetry in models of bilingual lexical processing

We briefly introduce here two models of bilingual lexical processing: the revised hierarchical model (RHM; Kroll & Stewart, Reference Kroll and Stewart1994; Kroll et al., Reference Kroll, van Hell, Tokowicz and Green2010), and Multilink (Dijkstra et al., Reference Dijkstra, Wahl, Buytenhuijs, van Halem, Al-Jibouri, De Korte and Rekké2019). The RHM and the bilingual interactive activation + model (BIA+; Dijkstra & van Heuven, Reference Dijkstra, van Heuven, Grainger and Jacobs1998, Reference Dijkstra and van Heuven2002), the model from which Multilink has evolved, have been by far the most influential proposals to date. They focus predominantly on word production and translation (RHM) and word recognition (BIA+). Multilink essentially continues in the computational tradition of the BIA+, while incorporating insights from the RHM. Regardless of the type of data that initially motivated these models, the architectures they propose for the mental lexicon should hold both in production and comprehension (Brysbaert et al., Reference Brysbaert, Verreyt and Duyck2010).

The RHM

Like most current models of the multilingual lexicon, the RHM is a three-store proposal (see Paradis, Reference Paradis2004): words from different languages are represented separately but share access to conceptual representations. These relationships between words and conceptual features are established through links that vary in intensity. L1 words are strongly connected to the conceptual system, reflecting the fact that an L1 lexicon is completely developed by the time late L2 learners start acquiring the new language (Kroll & Tokowicz, Reference Kroll, Tokowicz, Kroll and de Groot2005). Conversely, the lexicosemantic mapping is typically weak(er) for L2 words, especially in low-proficiency bilinguals, who rely on L1 words to access semantic information as L2 words are generally learned through their L1 translation equivalents. In other words, a strong lexical connection in the L2–L1 direction allows L2 words to access L1 lexical representations, which, in turn, activate the shared conceptual nodes, indirectly connecting the L2 words with the relevant semantic information.

Xia and Andrews (Reference Xia and Andrews2015) discuss a way in which the RHM could account for the priming asymmetry. If we assume that priming between (noncognate) translation equivalents obtains exclusively through semantic mediation (and, crucially, not via lexical links), the model would predict that an L1 word can prime the lexical representation of its L2 translation equivalent because it can easily activate the shared conceptual nodes; however, as L2 primes cannot reliably stimulate these shared conceptual representations (or, at least, not fast enough), they fail to produce priming in the L2–L1 direction. The RHM states that the connections between L2 lexical items and conceptual representations become stronger as a direct function of L2 proficiency, which would eventually allow L2 primes to activate shared concepts in a similar way to that of L1 primes. Studies showing cross-language priming effects with simultaneous or balanced bilinguals (e.g., Basnight-Brown & Altarriba, Reference Basnight-Brown and Altarriba2007; Duñabeitia, Dimitropoulou, Uribe-Etxebarria, Laka, & Carreiras, Reference Duñabeitia, Dimitropoulou, Uribe-Etxebarria, Laka and Carreiras2010; Duñabeitia, Perea, & Carreiras, Reference Duñabeitia, Perea and Carreiras2010) could, in principle, support this prediction. However, recall that research with unbalanced bilinguals specifically testing the role of proficiency has reported mixed results (e.g., Dimitropoulou, Duñabeitia, & Carreiras, Reference Dimitropoulou, Duñabeitia and Carreiras2011; cf. Nakayama, Ida, & Lupker, Reference Nakayama, Ida and Lupker2016). While the model’s proponents have gradually abandoned the idea of L1-mediated access to conceptual representations for L2 words, they maintain that conceptual links are weaker in the L2, even at higher levels of proficiency (Kroll et al., Reference Kroll, van Hell, Tokowicz and Green2010), and that this has proven to be more noticeable in the concept-to-word direction than the other way around, which would predict differences between comprehension and production data.

Multilink

In spreading/interactive-activation accounts of language processing (see Collins & Loftus, Reference Collins and Loftus1975; McClelland & Rumelhart, Reference McClelland and Rumelhart1981), the activation level of a node in the network, in this case, a lexical entry, has to rise from its resting-level activation (RLA) to a certain threshold for it to become active (Jiang, Reference Jiang and Schwieter2015), and thus be, for example, identified in visual word recognition. Multilink claims that the elusiveness of L2–L1 priming effects might lie on the RLAs of L2 words, which are lower than those of L1 words. Given short prime presentations under masked priming conditions, L2 primes may not receive sufficient stimulation or have enough time to process that stimulation and pass that activation on to their L1 translation equivalents.

Multilink, like the RHM, proposes that higher L2 proficiency may change this situation, as this tends to correlate with higher frequency and recency of use, which should, in turn, raise the RLA of L2 lexical representations. As the distance between the RLA and the threshold is shortened, the amount of stimulation, and, therefore, the processing time, needed to activate these words is reduced, increasing the chance of observing priming effects on their translation equivalents. However, proficiency may not be the only factor at play in determining the RLA of these words. Word frequency and (recent) high exposure to the L2 are likely to modulate the RLA, potentially making L2 lexical processing faster for (a) high-frequency words and/or (b) speakers that are immersed in or otherwise more frequently exposed to the L2.

While the asymmetry seems to be observed when unbalanced bilinguals are tested, no attempt has been made so far to understand the granularity of this factor and its relationship with L2 proficiency. This study attempts to fill that gap by examining a group of L1 Spanish–L2 English late bilinguals living in an L2-dominant environment, differing in degree of active exposure/use and L2 proficiency. Anticipating the results, the data show significant priming effects for L1 primes. The effect for L2 primes is modulated by L2 active exposure/use, measured as a continuous variable at the individual level. Differently, the effect of L2 proficiency was only found to be significant in an interaction with the frequency of the L2 targets.

Our results raise several questions regarding the nature of cross-language masked priming patterns and the role of methodological factors. In this sense, they highlight the need for more fine-grained measures to tap into individual differences that can serve as proxies of bilingual language use and representation.

Factors investigated in the literature as potential modulators of L2–L1 priming

While there is consensus that L2–L1 priming effects are notably less robust than their L1–L2 counterparts, effect sizes have varied considerably across studies, which range from null effects (e.g., Gollan et al., Reference Gollan, Forster and Frost1997; Grainger & Frenck-Mestre, Reference Grainger and Frenck-Mestre1998; Xia & Andrews, Reference Xia and Andrews2015) to significant L2–L1 priming (e.g., Duyck & Warlop, Reference Duyck and Warlop2009; Lee, Jang, & Choi, Reference Lee, Jang and Choi2018; Lijewska, Ziegler, & Olko, Reference Lijewska, Ziegler and Olko2018; Nakayama et al., Reference Nakayama, Ida and Lupker2016). Although these studies have investigated a substantial number of factors potentially involved in L2–L1 priming effects (e.g., L2 proficiency, prime duration, word frequency, or the dominant language in the participants’ environment, among others) results are mixed for all of these variables.

Word frequency

There is substantial evidence indicating that word frequency is a major predictor of the speed of lexical access in both the L1 and the L2 (e.g., Brysbaert, Lagrou, & Stevens, Reference Brysbaert, Lagrou and Stevens2017; Brysbaert, Mandera, & Keuleers, Reference Brysbaert, Mandera and Keuleers2018; Brysbaert, Stevens, Mandera, & Keuleers, Reference Brysbaert, Stevens, Mandera and Keuleers2016; Diependaele, Lemhöfer, & Brysbaert, Reference Diependaele, Lemhöfer and Brysbaert2013). Despite this well-known effect, whereby more frequent words are accessed faster than less frequent ones, the factor has rarely been studied in the translation priming literature, to the extent that the word “frequency” does not even appear in the only currently available meta-analysis on the priming asymmetry (Wen & van Heuven, Reference Wen and van Heuven2017). What is more, the great majority of studies have used stimuli with word frequencies within the range of, approximately, 3 to 4.3 in the Zipf scale (i.e., between 1 and 24 occurrences per million; see the Methodology section below for an explanation of the Zipf scale), where frequency effects in the access to L2 words are reported to appear (Brysbaert et al., Reference Brysbaert, Lagrou and Stevens2017). Thus, frequency could certainly be expected to play a role in masked translation priming effects, and yet it is almost never examined as a factor.

A recent study by Nakayama, Lupker, and Itaguchi (Reference Nakayama, Lupker and Itaguchi2018) offers relevant insights on what the role of word frequency might be. These authors carried out distributional and frequency-based analyses of response times obtained with very highly proficient bilinguals and high-frequency words in an LDT using L2 primes. The observed 20 ms priming effect was reflected in a shift and a differential positive skewing on the response latency distributions. Furthermore, they observed that the distributional pattern was caused by an interaction of target frequency and the experimental condition (i.e., related vs. control L2 primes). Nakayama et al. argue that these results suggest that high-frequency translation primes (but, crucially, not control primes) are able to mitigate the target frequency effect, whereby less frequent targets are responded to more slowly.

L2 proficiency

Regarding the influence of L2 proficiency, Dimitropoulou, Duñabeitia, and Carreiras (Reference Dimitropoulou, Duñabeitia and Carreiras2011) found that it did not play a key role in their data, given the similar L2–L1 priming effects (between 11 and 14 ms, differences not significant) displayed by three different L2 proficiency groups. More recently, however, Nakayama et al. (Reference Nakayama, Ida and Lupker2016) report significant L2–L1 (English to Japanese) priming in two experiments with highly proficient bilinguals (mean Test of English for International Communication [TOEIC] scores: 872 and 917, respectively, out of 990). Of importance, the materials for Experiment 2 were the same as the ones used in a previous study by members of the same cohort, Nakayama, Sears, Hino, and Lupker (Reference Nakayama, Sears, Hino and Lupker2013), where no significant L2–L1 priming had been observed with less proficient L2 speakers (mean TOEIC score: 740). To confirm their results in Experiments 1 and 2, Nakayama et al. (Reference Nakayama, Ida and Lupker2016) conducted a third experiment in which less proficient bilinguals (mean TOEIC score: 710) were tested with the materials of Experiment 1. No significant L2–L1 priming was found this time. Together with the insight provided by regression analyses in the first two experiments, which showed that L2 proficiency modulated the effect size of L2–L1 priming, these results indicate that (very) high proficiency is a crucial factor behind the disappearance of the priming asymmetry. To the extent that high proficiency is a necessary condition, this could explain the discrepancy in results from other studies where lower proficiency groups do not yield the effect.

Language exposure/use and immersion

Although the language environment of participants has been discussed and tangentially addressed in the literature, few studies have examined it directly. Zhao, Li, Liu, Fang, and Shu (Reference Zhao, Li, Liu, Fang, Shu, Carlson, Hoelscher and Shipley2011) investigated translation priming in three groups of Chinese–English bilinguals: two groups of low- and high-proficiency participants living in China (i.e., nonimmersed) and one high-proficiency group living in an L2-dominant environment. Replicating the priming asymmetry, L1–L2 priming effects obtained across the board, but L2–L1 priming was observed only for the immersed group. These results, while illuminating, effectively confound two individual-level variables, (high) L2 proficiency and immersion, because the factorial design is incomplete: there is no low-proficiency group in an immersed context.

Sabourin, Brien, and Burkholder (Reference Sabourin, Brien and Burkholder2014) tested four groups of English–French bilinguals who had acquired the L2 at different ages (i.e., from birth, 3–5 years, 3–10 years, and 2–29 years). The participants’ self-reported L2 proficiency (approximately intermediate) was matched across the early and late bilinguals groups to test how age of acquisition (AoA) could account for the translation priming effects in the L2–L1 direction. Their results showed significant priming only for the simultaneous and early bilinguals, but not for the late bilinguals, providing evidence for the role of AoA on the emergence of the priming asymmetry. Nevertheless, in this study, AoA was determined by the age of immersion in the L2 environment, thus confounding the potential influence of these two factors.

Finally, at least two studies have shown the importance of balanced bilingualism when considering cross-language masked priming effects. In Duñabeitia, Perea, et al. (Reference Duñabeitia, Perea and Carreiras2010), a symmetric priming pattern was reported when testing a group of highly proficient bilinguals (i.e., native speakers of Basque and Spanish). Although they differed in their frequency of use of the languages in academic contexts, using much more Basque than Spanish, the use in nonacademic contexts was almost identical. Likewise, Wang (Reference Wang2013) reported a beneficial effect of balanced bilingualism on the emergence of L2–L1 priming effects when investigating highly proficient Chinese–English bilinguals living in a bilingual society like Hong Kong. Group 1 consisted of English-dominant bilinguals, whereas Group 2 was formed by bilinguals whose use of (and proficiency in) Chinese and English were equal. Although the L1–L2 translation direction was not tested, preventing us from drawing conclusions on the priming asymmetry itself, only Group 2 showed significant L2–L1 priming effects.

Language exposure/use and L2 proficiency are both ultimately proxies of L2 subjective word frequency, that is, how often a given speaker has encountered a given word. Although the studies reviewed in this section highlight the relevance of these two factors on the processing of L2 words and the ability of L2 primes to efficiently activate their translation equivalents in priming experiments, the field still needs more fine-grained measures that allow for a better estimation of their role in bi-/multilingual lexical processing. After all, discrete-variable approaches, while providing a good approximation to the presence or absence of certain effects, are likely to miss subtle transitions and nonlinear trajectories along the continuum of influence of these factors. Van Hell and Tanner (2012, p. 165), in discussing individual differences in L2 proficiency and their relationship to cross-language lexical activation, argue that

[…] providing a clearer picture of the relationship between cross-language activation effects and individual differences in L2 proficiency requires a move away from group designs and toward designs that allow for more robust statistical modelling of the interaction between individual-level characteristics (e.g., language proficiency) and stimulus-level characteristics (e.g., word cognate status). As previously mentioned, regression-based approaches can model the continuous nature of individual-level variables, like language proficiency.

Furthermore, the fact that all these variables have typically been investigated separately (or, at best, in pairs) potentially obscures important interactions between them (Diependaele et al., Reference Diependaele, Lemhöfer and Brysbaert2013). For these reasons, the present study aims to examine the role of L2 exposure/use and L2 proficiency in cross-language masked priming effects by treating these variables as continuously distributed, in an attempt to reflect their nature more efficiently and weigh their role on the priming asymmetry. If models such as Multilink are right in their assumption that asymmetrical priming patterns have their origin in RLA differences between L1 and L2 words, these differences must be a direct consequence of the individual experience of a given speaker with a given word. This experience, in turn, can only be approached through factors that quantify the relative exposure of the speaker to linguistic contexts potentially containing the word, as well as the relative availability of the word itself. While we examine here two individual-level variables (i.e., active exposure/use and L2 proficiency), we will only deal with one stimulus-level factor: word frequency, albeit effectively represented twice in the design, through the independent contribution of prime and target frequencies. This is not to deny that other properties of the stimulus (e.g., length, orthographic and/or phonological neighborhood size, concreteness, and morphological family size) have the potential to affect responses. However, their effects have been consistently proven to be smaller in size and more reduced in scope than those of word frequency (Diependaele et al., Reference Diependaele, Lemhöfer and Brysbaert2013).

The present study

Sixty L1 Spanish–L2 English sequential bilinguals living in the United Kingdom took part in an LDT experiment with masked translation priming. Participants were tested in both translation directions to investigate the priming asymmetry directly. In light of the available literature, we expected to replicate the priming asymmetry, as L1–L2 effects are relatively robust, and our choice of participant profiles and word frequencies did not favor the appearance of L2–L1 priming effects. However, our study was also designed to shed light on the role of three variables, which we quantified and included as continuous predictors: L2 proficiency, amount of L2 exposure/use, and word frequency. If, as expected, these factors impact the processing of L1/L2 words and consequently the priming effects, we should observe three-way interactions (potentially four-way interactions too) between translation direction, type of prime, and individual- and stimulus-level predictors in the statistical models’ outcomes.

Method

Participants

Sixty Spanish–English sequential bilinguals were recruited from the Spanish-speaking communities in three large cities in northern and southwestern England (see Table 1 for participant characteristics). The data was collected in sound-insulated rooms at a university or teaching institution in each location. To evaluate English proficiency, all participants took the Oxford Quick Placement Test (OQPT; Oxford University Press, University of Cambridge, & Association of Language Testers in Europe, 2001). The test examines English grammar and vocabulary knowledge and consists of 60 multiple-choice questions.^{Footnote 1} The participants’ mean score was 50 (SD = 4.84, range: 40–60), corresponding to a lower advanced proficiency according to the OQPT’s manual. The scores of English proficiency were normally distributed throughout our sample, as indicated by the exploration of a Q–Q plot and a Shapiro–Wilk test for normality (p = .38). The participants had started learning English, on average, at the age of 9 (SD = 2.9, range: 4–16). A version of the Dominance Scale questionnaire by Dunn and Tree (Reference Dunn and Tree2009) was employed to collect information regarding the participants’ use of English. The questionnaire provides a scale based on the relative use of one language over the other (Dunn & Tree, Reference Dunn and Tree2009, p. 1). The scale ranges from –25 to 25. Following the authors, a score above 5 was considered to reflect greater use of the L1 (Spanish) over the L2 (English), whereas the range between –5 and 5 was considered to reflect an equal use of both languages. Although the scale originally makes reference to “dominance,” as it was designed to test simultaneous bilinguals of balanced proficiency, we speak here of “active language exposure/use” instead, which we consider a better reflection of what the scale actually measures, as well as being the variable of interest in our study. Consider, for instance, a 20-year-old late sequential bilingual who has lived in an L2 environment for a year, speaks the L2 at her new home as well as at her new job, and received more than 10 years of education in the L2 at a bilingual school in Spain. Such a participant would most probably have a score below –5 in the scale; nevertheless, should we conclude that her L2 is now the dominant language over the L1, despite her having been overwhelmingly more exposed to the L1 for 19 years (95%) of her life? With this example, we hope to highlight the potential misinterpretation that the use of “dominance” can lead to. However, we acknowledge that the term is still operationalized as a function of language use in much work on bilingualism. As Treffers-Daller (Reference Treffers-Daller2019, p. 1) explains,

[…] language dominance is often seen as relative proficiency in two languages, but it can also be analyzed in terms of language use—that is, how frequently bilinguals use their languages and how these are divided across domains.

Table 1. Participant characteristic

Note: Mean values (standard deviations and ranges).

Materials

Fifty pairs of Spanish–English noncognate translation equivalents were used in the experiment (see Table 2 for sample stimuli). To avoid the concreteness effect found in different studies (e.g., Finkbeiner, Forster, Nicol, & Nakamura, Reference Finkbeiner, Forster, Nicol and Nakamura2004; Schoonbaert et al., Reference Schoonbaert, Duyck, Brysbaert and Hartsuiker2009), whereby abstract words are responded to more slowly than concrete words, only concrete nouns were used. As shown in Table 3, the Spanish words had a standardized mean frequency of 4.01 (SD = 0.43, range: 2.72–4.9) on the 1 to 7 Zipf scale (van Heuven, Mandera, Keuleers, & Brysbaert, Reference van Heuven, Mandera, Keuleers and Brysbaert2014). The standardized mean frequency of English words was 3.97 (SD = 0.34, range: 2.94–4.92; Table 3). In the Zipf scale, word frequencies between 1 and 3 are considered low, whereas those between 4 and 7 are considered high frequencies (see van Heuven et al., Reference van Heuven, Mandera, Keuleers and Brysbaert2014, for details). Word frequencies for the English items were extracted from the SUBTLEX-UK database (van Heuven et al., Reference van Heuven, Mandera, Keuleers and Brysbaert2014), and the ones for the Spanish words from SUBTLEX-ESP (Cuetos, Glez-Nosti, Barbón, & Brysbaert, 2011). Word frequencies were normally distributed in the English stimuli, as indicated by the exploration of a Q–Q plot and a Shapiro–Wilk test for normality (p = 0.36). This was not true, however, of the Spanish stimuli. Although this is not ideal, we were limited by the small amount of translation pairs at our disposal (recall that these had a relatively low frequency) and the need for our participants to know the L2 words. For this (and other) reason(s), we chose a statistical method (linear mixed modeling; see below) that can accommodate deviations from normality in both independent and dependent variables.

Table 2. Sample stimuli

Table 3. Stimuli characteristics

Note: Average frequency (Zipf scale, range: 1–7), and length (in characters), with standard deviations and ranges in parentheses.

In addition, 50 nonwords were created in both languages to make the lexical decision possible. Spanish nonwords were created by substituting one letter from real words while respecting the phonotactics of the language. The English nonwords were created using the ARC Nonword Database (Rastle, Harrington, & Coltheart, Reference Rastle, Harrington and Coltheart2002).^{Footnote 2} All nonwords were phonologically and orthographically plausible in Spanish and English, respectively. The complete list of stimuli is provided in Appendix A.

Four stimulus lists (two in each language) of 50 word and 50 nonword targets were created. In one of the lists, half of the target words were preceded by their translation equivalents and the other half by control primes. The translation equivalents from those pairs in the baseline condition of each list were scrambled to serve as control primes, paying attention to keep the pair semantically unrelated. In the other list, the order was inverted, so that across both lists all the words were preceded by their translation equivalents and control primes. Each list began with 16 practice items. All words were matched in frequency and word length.

English–Spanish translation task

To ensure that responses to the L2 words were not arbitrary, participants completed an English to Spanish translation task with the English items. Only answers identical to the translation pairs used in the experiment were counted as correct. All the items had a minimum 65% rate of correct answers, and the correct answer was given on average 88% of the time. Although a 65% rate of correct responses might seem a low cutoff, some of the answers provided were synonyms of the expected translations, even though they did not count as correct answers. More important, in the posttask debriefing, many of the subjects reported knowing the translation of certain English words but having been unable to recall them during the translation task. Their incapacity to remember the translation at that point, or the fact that they chose to provide a synonym to the target translation, would not necessarily entail an insensitivity to those English primes during the experimental task.

Test of familiarity with Spanish words

The degree of familiarity with the Spanish materials was normed across the first 29 participants, roughly half, to control for the fact that the materials used in the lexical decision experiments were created using European Spanish, spoken by 20 of those first 29 participants. The other 9 subjects spoke other varieties of Spanish. The test used a 7-point Likert scale, where 1 represented no knowledge of the word, and 7 described a word that was known and frequently employed in the participant’s variety of Spanish. No items were removed from the experiments due to a lack of familiarity, as all the words’ mean scores were higher than the cutoff value of 4, well above what could be considered unfamiliarity with a given lexical item.^{Footnote 3}

Procedure

The experiment was programmed using the PsychoPy v1.8 software (Peirce, Reference Peirce2007). Each trial began with a 500 ms forward mask (########), followed by a 60 ms prime (in lowercase letters). Immediately after, the target (in uppercase letters) appeared and remained on screen until the participant’s response. Stimuli were presented on a white screen in a 44-point black Arial font. Participants were asked to judge whether the targets were real words or not by pressing on a keyboard, “0” for NO or “1” for YES, as quickly and accurately as possible. They were not informed about the presence of the primes. During a postexperiment debriefing, participants were asked about their awareness of any wordlike material other than the target words in the course of the experimental trials.

The tasks were presented in the following order: the OQPT was the first test to be administered, as a score below 40 (i.e., equivalent to intermediate proficiency in the Common European Framework of Reference for Languages) was used as exclusion criteria to participate in the study. Then, the experimental tasks were conducted. After completing them, the participants took the English word translation task and the familiarity task.

Results

Following Baayen and Milin (Reference Baayen and Milin2010), responses to experimental trials with latencies below 200 ms and above 5000 ms were removed from the data set (1 observation), on the assumption that those latencies would be too short to reflect a conscious judgment of the targets or too long to ensure that conscious strategies are not involved in the decision. Eighteen data points were removed due to glitches in the presentation, and 100 data points were excluded because of a problem during the counterbalancing of the critical condition for one of the subjects in the L1–L2 direction. After removing incorrect responses and responses to nonwords, the data set contained a total of 5,881 observations.

An exploratory analysis of the response time (RT) distribution was performed by transforming the latencies to obtain inverse Gaussian, log-normal, and Box–Cox distributions. The exploration of Q–Q plots and the results of Shapiro–Wilk tests for both translation directions showed that the inverse Gaussian transformation provided a slightly better correction of the distribution’s skewness than did the other two (inverse Gaussian: p = 1; Box–Cox: p = .99; log-normal: p = .73).

Analyses of the error rates for word targets and the transformed RTs for correct responses to word targets were conducted using (generalized) linear mixed-effects models (Baayen, Reference Baayen2008; Baayen, Davidson, & Bates, Reference Baayen, Davidson and Bates2008) in R (version 3.3.1; R Core Team, 2016) with the lme4 package (Bates, Maechler, Bolker, & Walker, Reference Bates, Maechler, Bolker and Walker2015). A theory-driven model was used for both the accuracy and response latency analyses. The model included the following factors: target language (Spanish or English), prime type (related or control), proficiency (modelled as a continuous variable quantified by English placement test scores), language exposure/use (modelled as a continuous variable quantified by the Dominance Scale questionnaire) and prime and target frequency (Zipf values).^{Footnote 4} Sum contrasts were used for categorical variables. Proficiency, language exposure/use, and prime and target frequency were scaled and centered, and converted to z units. This model thus contained the main effect of target Language, the interaction between target Language and prime type, and three- and four-way interactions between target language, prime type, and the stimulus- and individual-level factors. (See Appendix B for the complete models and rationale.)

The random structure of this initial model included random intercepts for subjects, primes, and targets (Feldman, Milin, Cho, Moscoso del Prado Martín, & O’Connor, Reference Feldman, Milin, Cho, Moscoso del Prado Martín and O’Connor2015), and random slopes for subjects within target language, prime type, target frequency, prime frequency, and the interaction between target language and prime type; as well as random slopes for primes and targets within target language, prime type, and the interaction between the two factors.

Response time analysis

Table 4 provides a summary of error rates, mean RTs, and priming effects (calculated as the difference between mean RTs to control and critical trials) for correct responses to word targets.

Table 4. Mean response times (RTs; in milliseconds; standard errors), error rates (%), and priming effects (in milliseconds) in the lexical decision tasks

*p < .01.

Following Matuschek, Kliegl, Vasishth, Baayen, and Bates (Reference Matuschek, Kliegl, Vasishth, Baayen and Bates2017), we carried backward-selection and employed the likelihood ratio test criterion to obtain a more parsimonious model. The reason for this is that, given our relatively small sample size, models with complex random structures might not be supported by the data (Matuschek et al., Reference Matuschek, Kliegl, Vasishth, Baayen and Bates2017, p. 307). Thus, during backward-selection, we iteratively removed the random slopes that accounted for the least amount of variance from the model, until convergence was achieved. Full code for the analyses can be found in the first author’s GitHub repository (https://github.com/chaouch-orozco/Individual-factors-in-bilingual-word-recognition). The final model had the fixed effects specified above as well as random intercepts for subject, prime, and target (see Table 5 for the full model summary). Exploration of this model’s residuals through Q–Q plots showed that the residuals did not follow a normal distribution in the longer latencies. Therefore, as suggested in Baayen and Milin (Reference Baayen and Milin2010), we applied further model criticism by excluding those observations with absolute standardized residuals above 2.5 SD (116 observations were removed, 2% of the total).

Table 5. Intercept and significant or marginally significant factors included in the final model for the analysis of response times and their coefficients, standard errors, t values, and p values

The interaction between target language and prime type indicated that translation primes elicited faster responses to targets as compared to control ones (i.e., a priming effect) in the L1–L2 direction. A significant interaction between target language (Spanish), prime type, and language exposure/use indicated that those participants with a higher degree of active L2 use benefited more from the L2-related primes during the processing of the L1 targets. The interaction of target language (English), prime type, and target frequency showed that RTs were significantly faster for more frequent L2 targets in both the related and the control conditions. A significant four-way interaction between target language (English), prime type (related), proficiency, and target frequency was observed, indicating that RTs for low-frequency L2 English words preceded by L1 Spanish related primes were significantly slower for less proficient bilinguals. In addition, three marginally significant interactions were observed. First, that of target language (English) and proficiency, showing faster RTs for more proficient participants. Second, a target language (Spanish), prime type (control), and prime frequency interaction, indicating that, in the control condition of the L2–L1 direction (Spanish targets), responses were slower with more frequent L2 English primes. Third, a target language (Spanish), prime type (related), and prime frequency interaction, suggesting the opposite effect: faster RTs for more frequent L2-related primes.

Awareness of the prime was included as a post hoc factor. Although unexpected at 60 ms prime duration, 24 participants reported having seen some characters on the screen during the prime presentation time window, that is, between the forward mask and the target word, on at least one trial. Of importance, this only happened during the L1–L2 task (Spanish primes), and most of the subjects reported only one occurrence. The reason to include prime awareness in the analysis, then, instead of excluding these participants from the study altogether, was that linear mixed-effects models allow us to control for and estimate the influence of similar factors without discarding the data. To investigate the influence of prime awareness on participant responses, we carried out an analysis including awareness in an interaction with target language and prime type. The results showed that this factor did not significantly modulate priming effects. Given this outcome, we consider it reasonably safe to keep all the participants in the analysis.

Accuracy analysis

Accuracy was dummy-coded as 1 (correct) or 0 (incorrect) and generalized linear mixed-effects models with a binomial family were fit to the error data. In this case, the initial model, which was the same as in the response time analysis, did not converge. We thus proceeded to simplify its random structure applying the same backward-selection method. In the final model, the fixed effects were the same as in the model for the RT analysis. The random structure contained intercepts for subject, prime, and target, and slopes for subject within target language, prime type, target frequency, prime frequency, and the interaction between target language and prime type. It also contained slopes for prime within target language and the interaction between target language and prime type, as well as for target within target language, prime type, and the interaction between target language and prime type.

Table 6 provides the summary of the model, which shows a significant effect of the three-way interaction between target language (Spanish), prime type (related and control), and target frequency. This effect shows significantly lower accuracy rates for low-frequency L1 targets in both prime type conditions. That is, overall, participants were less accurate with less frequent L1 targets. Furthermore, a significant four-way interaction between target language (Spanish), prime type (Control), proficiency, and prime frequency was observed. In the L2–L1 translation direction, the frequency of the L2 primes affects less and more proficient bilinguals differently. Whereas for the less proficient participants a potential inhibitory effect of the control primes is larger when these are less frequent, the effect is the opposite for the more proficient bilinguals. This finding is intriguing, but it is hard to attribute it confidently to a single (group of) factor(s) or combination thereof, for example, cognitive or methodological. Given the inherent difficulty to interpret higher order interactions, the overall small differences in error rates across the four data subsets (lower proficiency/low-frequency: 1.07; lower proficiency/high-frequency: 2.78; higher proficiency/low-frequency: 2.59; higher proficiency/high-frequency: 0.69), and the fact that error rate analyses have typically received far less attention in this type of studies, we are cautious in interpreting this result and will not comment on it further.

Table 6. Intercept and significant or marginally significant factors included in the final model for the analysis of accuracy and their coefficients, standard errors, z values, and p values

Discussion

In this study, we conducted a masked translation priming lexical decision task, testing late sequential Spanish–English bilinguals immersed in an L2-dominant environment. Overall, our data do replicate the priming asymmetry in general terms, but provide a fairly more nuanced picture, as (a) the priming effects were numerically similar in both translation directions, and (b) the main effect of prime type was significant only in the L1–L2 direction, albeit modulated by language exposure/use in the L2–L1 direction (i.e., participants with increased active exposure and use of the L2 showed larger priming effects). Furthermore, we observed a complex interaction between target language, prime type, proficiency, and target frequency: the less proficient bilinguals responded more slowly to low-frequency L2 English words in the related condition (i.e., when preceded by their Spanish translations).

Recall that one of the goals of the present study was to shed light on the role that L2 proficiency and, somewhat novelly, active language exposure/use at the individual level play in translation priming effects, by treating them both as continuous predictors. Doing so allows for a more fine-grained understanding of each factor’s weight. With respect to L2 proficiency, a central factor in Multilink and especially the RHM, we do not observe an effect directly modulating priming in either translation direction. However, our data do show that, when less proficient bilinguals had to respond to less frequent L2 targets, their responses were slower only in the related condition. Therefore, the L2 proficiency measure was able to account for some differences in the processing of the low-frequency L2-related targets, potentially closing or widening the gap in priming effects by modulating the speed of related trials with respect to a (presumably constant) unrelated baseline. More deterministic in our data, however, is the role of language exposure/use. This factor directly interacted with prime type (and target language), conditioning priming effects in the L2–L1 direction. Recall that this is the direction of interest in most previous studies, as translation priming effects have been less reliably found across the board. In our study, those participants showing a higher active exposure/use to the L2 showed larger priming effects.

Despite the less salient role of L2 proficiency in our data as compared to that of language exposure/use, we cannot conclude that this predictor plays no significant part in shaping masked translation priming effects. Although there were methodological reasons for doing so, the range of L2 proficiencies covered in this study (i.e., upper intermediate to advanced) prevents us from making conclusive claims in this regard. Alternatively, and especially considering that any potential factors involved in such complex phenomena may have nonlinear trajectories, we would have needed to test a broader range of L2 proficiencies (e.g., low to high), although the feasibility of such manipulations is directly conditioned (and directly conditions) the frequency range of the stimuli.

At the time of the experiment, all participants had been living in the United Kingdom for 5 years on average (SD = 3.62, range: 0.75–13). Observing the broadness of this range, and given that lexical attrition is a well-documented phenomenon, one might argue that the Spanish of some of these participants might have attrited to some extent. To test this hypothesis, we conducted a post hoc analysis of the L2–L1 task (where the targets were Spanish words) including the interaction between length of immersion and target language as a fixed factor, as well as the three-way interaction between length of immersion, target language, and prime type. The outcome of this model contained nonsignificant effects for all of these interactions, suggesting that participants’ responses in Spanish were not dependent on their time living in an L2-dominant environment.

With respect to the effects of word frequency, we observe that target frequency significantly interacted with target language (English), prime type (related), and proficiency. As reported above, RTs to L2 targets were significantly slower in two contexts: with control primes overall and, when L2 target frequency was low, for less proficient as compared to more proficient participants. This result suggests that, when responding to less frequent L2 targets (i.e., in longer/more difficult trials), only the high-proficiency participants benefitted from the presence of the L1-related primes. It would be problematic to argue that this outcome is due to the inability of the primes to be processed. Given the linguistic profile of our participants, it should not be difficult to process the L1 primes (even the less frequent ones). Such difficulties should have had a larger impact on those bilinguals who had the largest potential for attrition, that is, those on the upper ends of the proficiency and active L2 exposure/use scales. However, this is not the case in our data. Alternatively, one could also argue that the less proficient bilinguals did not know the low-frequency L2 targets. This is unlikely, as the accuracy rates for low- and high-proficiency bilinguals were numerically similar and high (96% vs. 97%). Therefore, lack of knowledge of the lower frequency L2 words does not seem to explain the slower latencies in the related condition for less proficient bilinguals. This significant interaction thus remains an open question and should be further investigated if it were found to replicate in future data sets.

Returning to our most novel result, the modulation of L2–L1 priming effects by language exposure/use, it should be noted that this finding does not provide a reliable way to adjudicate between the RHM and Multilink, as their predictions largely overlap here. For the RHM, a larger amount of active L2 exposure/use should bring about stronger L2 lexicosemantic connections, which would in turn enhance L2–L1 priming effects. Alternatively, Multilink would predict the L2 lexical representations of bilinguals with more L2 exposure/use to have higher RLAs, facilitating their processing and increasing the likelihood of observing L2–L1 priming effects, because, in short, they should be more effective primes.

The RHM and Multilink explain the differences in L1/L2 lexical processing by resorting to different conceptualizations of the operations underlying cross-language effects in the bilingual lexicon. Those models of the lexicon lead to different predictions about how words are (differently) processed depending on an array of experience-level factors (e.g., frequency, language membership, and learning context), which, in many cases, predict the most common pattern of L1/L2 differences: L1 words tend to be processed faster than L2 words. However, the factor that ultimately shapes lexical processing, word form- and semantic-level variables such as word length, concreteness being equal, might be the same: subjective frequency. In that sense, the present results suggest that, in our data, language exposure/use was a better proxy for subjective frequency than L2 proficiency. It might fare even better than a stimulus-level variable such as (corpus) word frequency, although we also find effects for these two predictors, showing that their validity as proxies cannot be disregarded.

Here we should note that, for Multilink, all these factors might potentially affect cross-language priming effects, as all of them approach subjective frequency to some degree. Similarly, although not originally specified by the RHM, a deterministic role of exposure/use is not necessarily incompatible with its tenets. For instance, bilinguals who are exposed and use their L2 more (on a scale) might have available more entrenched L2 word–meaning connections, whose strength would be independent of how proficient they are in the L2 overall. This point is of significant consequence not least because proficiency is often measured as a categorical variable, predicated on a relative standard model (i.e., how one fares juxtaposed against an idealized standard, often a monolingual one). By its very nature, proficiency measures are only able to test a subset of knowledge a truly competent speaker would have, which is more or less attainable and/or is a greater or lesser proxy for what it seeks to uncover depending very much on context (e.g., Norris & Ortega, Reference Norris, Ortega, Gass and Mackey2012; Rothman & Iverson, Reference Rothman, Iverson and Cruz-Ferreira2010). At the end of the day, especially in light of these models, opportunity for links within an individual’s mental lexicon is of primary importance. Therefore, it is not clear how or if L2 proficiency measured as typically done can faithfully proxy for actual competencies (grammatical and/or performative), even if, in many cases, they will ultimately overlap. Consequently, it is worth looking into and taking more seriously measures that are more fine-grained proxies for actual opportunities that should, reasonably, correlate with greater linking. This discussion is accentuated under two conditions, both of which apply in our study: (a) at so-called higher levels of proficiency, where a threshold of specific knowledge has been attained to test relatively high on measures we currently have but which do not necessarily say anything about real-world abilities in the language per se, and (b) under conditions of increased potential exposure such as immersion, where individual differences in how immersion is capitalized on might nevertheless have some determinism.

In contrast, stimulus-level factors such as word frequency have been shown to function as reliable proxies when investigating lexical processing, to the point that frequency has been highlighted as the single most critical variable influencing lexical decision time (Brysbaert et al., Reference Brysbaert, Buchmeier, Conrad, Jacobs, Bölte and Böhl2011, p. 1). However accurate this measure has proven to be, aside from debates on which types of corpora better capture its effects, one should not overlook the fact that (a) L1 corpora are far from ideal sources of language use when one is interested in studying lexical retrieval in the L2; and (b) by their very nature, frequency counts assume equal word frequencies across speakers of a given language and are, thus, inherently imperfect approximations to the concept of subjective frequency. Thus, to understand how the speed of L2 lexical access is determined and to account for its variability, it is crucial to first identify which other factors, especially those bearing upon each individual’s language experience, might be at play.

Conclusions and future directions

The typically reported translation priming asymmetry presumably reflects a relative inability of the L2 primes to stimulate (noncognate) translation equivalent targets under masked priming conditions. Several factors have been suggested to underlie the asymmetry. The data in the present study is compatible with a deterministic role of subjective word frequency in bilingual lexical processing. The present findings might help explain divergent results found in the literature with respect to the role that different individual- and stimulus-level factors have on the priming asymmetry. In particular, conflicting results might reflect how accurately the predictors under examination proxied subjective frequency in those studies. For instance, we have argued that L2 proficiency might not be the most appropriate candidate to gauge the relative frequency with which an L2 word is encountered and used by each individual. Instead, the present data point toward active language exposure/use as a more efficient approximation to individual encounters with each word.

Moving forward with the present program, we are currently working in what we believe are the necessary next steps in characterising and tapping L2 subjective frequency. First, we are preparing a follow-up translation priming study, which will employ a more nuanced operationalization of active language exposure/use. Detailed language history questionnaires and the comparison of immersed and nonimmersed L2 speakers will allow us to better estimate how the amount (and context) of L2 use affect bilingual lexical processing. Second, by examining populations with differential exposure to the L2, we will test the predictability of traditional frequency measures (extracted from L1 corpora) when bilingual populations of different types and in different contexts are investigated. Our goal is to contribute to building better approximations to what is ultimately a major factor in the online recruitment of lexical representations: subjective word frequency. Third and finally, we will test a larger population and employ a larger set of words. Having a larger sample size along with a simpler design will contribute to overcome the shortcoming of potentially low statistical power in the present study.

We consider that addressing the above issues is a necessary step in the integration of current theories of mental linguistic representation and processing, particularly at the lexical level. By doing so, we hope to contribute to a better understanding of bi-/multilingual lexical processing, inclusive of related questions pertaining to native versus nonnative differences and the role that input quantity and quality play in shaping the observable spectrum of linguistic competencies.

Acknowledgments

We wish to acknowledge the following funding: Adel Chaouch-Orozco is supported by a PhD studentship from the University of Reading (GS18-009), and the Language Learning Dissertation Grant. Jorge González Alonso and Jason Rothman are supported by a grant for the HeLPiNG project via Tromsø Forskningsstiftelse (TFS), 2019–2023. We wish to thank Jon Andoni Duñabeitia for helpful discussion.

Appendix A Complete list of stimuli

Table A.1. Prime and target words and nonwords for the L1–L2 and L2–L1 tasks

Table A.2. Spanish and English primes for nonword targets in the L1–L2 and L2–L1 tasks

Appendix B Theory-driven model and description of the rationale behind each fixed-effect’s inclusion

RT ~ Target Language + Target Language : Prime Type + Target Language : Proficiency + Target Language : Language Exposure/use + Target Language : Prime Type : Prime Frequency + Target Language : Prime Type : Target Frequency + Target Language : Prime Type : Proficiency + Target Language : Prime Type : Language Exposure/use + Target Language : Prime Type : Prime Frequency : Target Frequency + Target Language : Prime Type : Prime Frequency : Proficiency + Target Language : Prime Type : Prime Frequency : Language Exposure/use + Target Language : Prime Type : Target Frequency : Proficiency + Target Language : Prime Type : Target Frequency : Language Exposure/use

Target Language: to test whether responses were overall faster in one language or the other.
Target Language × Prime Type interaction: to test priming effects in the two translation directions.
Target Language × Proficiency: to test the role of Proficiency in the overall responses.
Target Language × Language Exposure/Use: to test the role of Language Exposure/Use in the overall responses.
Target Language × Prime Type × Proficiency: to test the (potentially different) role of Proficiency as a modulator of priming effects across translation directions.
Target Language × Prime Type × Language Exposure/Use: to test the (potentially different) role of L2 exposure/use as a modulator of priming effects across translation directions.
Target Language × Prime Type × Prime Frequency: to test the role of Prime Frequency on priming effects across both translation directions.
Target Language × Prime Type × Target Frequency: to test the role of Target Frequency on priming effects across both translation directions.
Target Language × Prime Type × Prime Frequency × Target Frequency: to test the potentially different effects of the interaction between Prime and Target Frequency as modulators of priming effects across both translation directions.
Target Language × Prime Type × Prime Frequency × Proficiency: to test the role of the interaction between Prime Frequency and Proficiency as a potential modulator of priming effects across both translation directions.
Target Language × Prime Type × Prime Frequency × Language Exposure/Use: to test the role of the interaction between Prime Frequency and L2 exposure/use as a potential modulator of priming effects across both translation directions.
Target Language × Prime Type × Target Frequency × Proficiency: to test the role of the interaction between Target Frequency and Proficiency as a potential modulator of priming effects across both translation directions.
Target Language × Prime Type × Target Frequency × Language Exposure/Use: to test the role of the interaction between Target Frequency and L2 exposure/use as a potential modulator of priming effects across both translation directions.

Footnotes

1. Geranpayeh (Reference Geranpayeh2003) reports a standard error of mesurement of around 4 for the 60-item OQPT (the one used in the current study), and test–retest reliabilities of around 0.9 during the task’s validation procedure.

2. Applying different methodologies to create our nonword stimuli might have caused divergences in the baseline difficulty to reach a lexical decision in each translation direction. In our data, then, an effect of nonword construction should have translated into not only (equal or) shorter RTs for L2 targets as compared to L1 targets but also lower accuracy rates in the L1–L2 experiment as compared to the L2–L1one. However, the latter was not found: as we will see, our participants were faster and more accurate when responding to L1 targets than to L2 targets.

3. To ensure that dialectal differences had no effect on the results, we conducted post hoc analyses running the models with the interaction of dialect (coded binarily as Castilian vs. non-Castilian Spanish), prime type, and target language as a fixed factor. The interaction was nonsignificant.

4. Due to concerns about the potential collinearity between some independent variables, tests were conducted to examine the correlation between prime and target frequency, and L2 proficiency and language exposure/use. While the second pair of variables did show some correlation (Prime/Target freq.: r = .02, p = .25; L2 Prof./Exposure: r = .21, p < .001), this was not a strong one.

References

Baayen, R. H. (2008). Analyzing linguistic data: A practical introduction to statistics using R. Cambridge: Cambridge University Press.CrossRef Google Scholar

Baayen, R. H., Davidson, D. J., & Bates, D. M. (2008). Mixed-effects modeling with crossed random effects for subjects and items. Journal of Memory and Language, 59, 390–412.CrossRef Google Scholar

Baayen, R. H., & Milin, P. (2010). Analyzing reaction times. International Journal of Psychological Research, 3, 12–28.CrossRef Google Scholar

Basnight-Brown, D. M., & Altarriba, J. (2007). Differences in semantic and translation priming across languages: The role of language direction and language dominance. Memory & Cognition, 35, 953–965.CrossRef Google Scholar PubMed

Bates, D., Maechler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects using lme4. Journal of Statistical Software, 67, 1–48.CrossRef Google Scholar

Brenders, P., van Hell, J., & Dijkstra, T. (2011). Word recognition in child second language learners: Evidence from cognates and false friends. Journal of Experimental Child Psychology, 109, 383–396.CrossRef Google Scholar PubMed

Brysbaert, M., Buchmeier, M., Conrad, M., Jacobs, A. M., Bölte, J., & Böhl, A. (2011). The word frequency effect. Experimental Psychology, 58, 412–424.CrossRef Google Scholar PubMed

Brysbaert, M., & Duyck, W. (2010). Is it time to leave behind the Revised Hierarchical Model of bilingual language processing after fifteen years of service? Bilingualism: Language and Cognition, 13, 359–371.CrossRef Google Scholar

Brysbaert, M., Lagrou, E., & Stevens, M. (2017). Visual word recognition in a second language: A test of the lexical entrenchment hypothesis with lexical decision times. Bilingualism: Language and Cognition, 20, 530–548.CrossRef Google Scholar

Brysbaert, M., Mandera, P., & Keuleers, E. (2018). The word frequency effect in word processing: An updated review. Current Directions in Psychological Science, 27, 45–50.CrossRef Google Scholar

Brysbaert, M., Stevens, M., Mandera, P., & Keuleers, E. (2016). How many words do we know? Practical estimates of vocabulary size dependent on word definition, the degree of language input and the participant’s age. Frontiers in Psychology, 7, 1116.CrossRef Google Scholar PubMed

Brysbaert, M., Verreyt, N., & Duyck, W. (2010). Models as hypothesis generators and models as roadmaps. Bilingualism: Language and Cognition, 13, 383–384.CrossRef Google Scholar

Caramazza, A., & Brones, I. (1979). Lexical access in bilinguals. Bulletin of the Psychonomic Society, 13, 212–214.CrossRef Google Scholar

Clahsen, H., Balkhair, L., Schutter, J. S., & Cunnings, I. (2013). The time course of morphological processing in a second language. Second Language Research, 29, 7–31.CrossRef Google Scholar

Collins, A. M., & Loftus, E. F. (1975). A spreading-activation theory of semantic processing. Psychological Review, 82, 407–428.CrossRef Google Scholar

Costa, A. (2005). Lexical access in bilingual production. In Kroll, J. F. & de Groot, A. M. B. (Eds.), Handbook of bilingualism: Psycholinguistic approaches (pp. 308–325). New York: Oxford University Press.Google Scholar

Costa, A., Miozzo, M., & Caramazza, A. (1999). Lexical selection in bilinguals: Do words in the bilingual’s two lexicons compete for selection? Journal of Memory and Language, 41, 365–397.CrossRef Google Scholar

Cuetos, F., Glez-Nosti, M., Barbon, A., & Brysbaert, M. (2011). Subtlex-ESP: Spanish word frequencies based on film subtitles. Psicológica, 32, 133–143.Google Scholar

de Groot, A. M. B., Delmaar, P., & Lupker, S. J. (2000). The processing of interlexical homographs in translation recognition and lexical decision: Support for non-selective access to bilingual memory. Quarterly Journal of Experimental Psychology, 53A, 397–428.CrossRef Google Scholar

de Groot, A. M. B., & Nas, G. L. (1991). Lexical representation of cognates and noncognates in compound bilinguals. Journal of Memory and Language, 30, 90–123.CrossRef Google Scholar

Diependaele, K., Lemhöfer, K, & Brysbaert, M. (2013). The word frequency effect in first- and second-language word recognition: A lexical entrenchment account. Quarterly Journal of Experimental Psychology, 66, 843–863.CrossRef Google Scholar PubMed

Dijkstra, T., Grainger, J., & van Heuven, W. J. B. (1999). Recognition of cognates and interlingual homographs: The neglected role of phonology. Journal of Memory and Language, 41, 496–518.CrossRef Google Scholar

Dijkstra, T., & Rekké, S. (2010). Towards a localist-connectionist model for word translation. Mental Lexicon, 5, 403–422.CrossRef Google Scholar

Dijkstra, T., & van Heuven, W. J. B. (1998). The BIA model and bilingual word recognition. In Grainger, J. & Jacobs, A. M. (Eds), Localist connectionist approaches to human cognition (pp. 189–225). Mahwah, NJ: Erlbaum.Google Scholar

Dijkstra, T., & van Heuven, W. J. B. (2002). The architecture of the bilingual word recognition system: From identification to system. Bilingualism: Language and Cognition, 5, 175–197.CrossRef Google Scholar

Dijkstra, T., Wahl, A., Buytenhuijs, F., van Halem, N., Al-Jibouri, Z., De Korte, M., & Rekké, S. (2019). Multilink: A computational model for bilingual word recognition and word translation. Bilingualism: Language and Cognition, 22, 657–679.CrossRef Google Scholar

Dimitropoulou, M., Duñabeitia, J. A., & Carreiras, M. (2011). Masked translation priming effects with low proficiency bilinguals. Memory & Cognition, 39, 260–275.CrossRef Google Scholar

Dunn, A. L., & Tree, J. F. (2009). A quick, gradient bilingual dominance scale. Bilingualism: Language and Cognition, 12, 273–289.CrossRef Google Scholar

Duñabeitia, J. A., Dimitropoulou, M., Uribe-Etxebarria, O., Laka, I., & Carreiras, M. (2010). Electrophysiological correlates of the masked translation priming effect with highly proficient simultaneous bilinguals. Brain Research, 1359, 142–154.CrossRef Google Scholar PubMed

Duñabeitia, J. A., Perea, M., & Carreiras, M. (2010). Masked translation priming effects with highly proficient simultaneous bilinguals. Experimental Psychology, 57, 98–107.CrossRef Google Scholar PubMed

Duñabeitia, J. A., Dimitropoulou, M., Morris, J., & Diependaele, K. (2013). The role of form in morphological priming: Evidence from bilinguals. Language and Cognitive Processes, 28, 967–987.CrossRef Google Scholar

Duyck, W., & Warlop, N. (2009). Translation priming between the native language and a second language. New evidence from Dutch-French bilinguals. Experimental Psychology, 56, 173–189.CrossRef Google Scholar

Feldman, L. B., Milin, P., Cho, K.W., Moscoso del Prado Martín, F., & O’Connor, P. A. (2015). Must analysis of meaning follow analysis of form? A time course analysis. Frontiers in Human Neuroscience, 9, 111.CrossRef Google Scholar

Finkbeiner, M., Forster, K., Nicol, J., & Nakamura, K. (2004). The role of polysemy in masked semantic and translation priming. Journal of Memory and Language, 51, 1–22.CrossRef Google Scholar

Forster, K. I., & Davis, C. (1984). Repetition priming and frequency attenuation in lexical access. Journal of Experimental Psychology: Learning, Memory, and Cognition, 10, 680–698.Google Scholar

Geranpayeh, A. (2003). A quick review of the English Quick Placement Test. UCLES Research Notes, 12, 8–10.Google Scholar

Gerard, L. D., & Scarborough, D. L. (1989). Language-specific lexical access of homographs by bilinguals. Journal of Experimental Psychology: Learning, Memory, and Cognition, 15, 303–315.Google Scholar

Gollan, T. H., Forster, K. I., & Frost, R. (1997). Translation priming with different scripts: Masked priming with cognates and noncognates in Hebrew-English bilinguals. Journal of Experimental Psychology: Learning, Memory, and Cognition, 23, 1122–1139.Google Scholar PubMed

Grainger, J., & Frenck-Mestre, C. (1998). Masked priming by translation equivalents in proficient bilinguals. Language and Cognitive Processes, 13, 601–623.CrossRef Google Scholar

Jiang, N. (1999). Testing processing explanations for the asymmetry in masked cross-language priming. Bilingualism: Language and Cognition, 2, 59–75.CrossRef Google Scholar

Jiang, N. (2015). Six decades of research on lexical representation and processing in bilinguals. In Schwieter, J. (Ed.), The Cambridge handbook of bilingual processing (pp. 29–85). Cambridge: Cambridge University Press.CrossRef Google Scholar

Ko, Y., Wang, M., & Kim, S. Y. (2011). Bilingual reading of compound words. Journal of Psycholinguistic Research, 40, 49–73.CrossRef Google Scholar PubMed

Kroll, J. F., & Stewart, E. (1994). Category interference in translation and picture naming: Evidence for asymmetric connection between bilingual memory representations. Journal of Memory and Language, 33, 149–174.CrossRef Google Scholar

Kroll, J. F., & Tokowicz, N. (2005). Models of bilingual representation and processing: Looking back and to the future. In Kroll, J. F. & de Groot, A. M. B. (Eds.), Handbook of bilingualism: Psycholinguistic approaches (pp. 531–553). New York: Oxford University Press.Google Scholar

Kroll, J. F., van Hell, J. G., Tokowicz, N., & Green, D. W. (2010). The revised hierarchical model: A critical review and assessment. Bilingualism: Language and Cognition, 13, 373–381.CrossRef Google Scholar PubMed

Lee, Y., Jang, E., & Choi, W. (2018). L2-L1 translation priming effects in a lexical decision task: Evidence From low proficient Korean-English bilinguals. Frontiers in Psychology, 9, 1–10.CrossRef Google Scholar

Lemhöfer, K., Dijkstra, T., & Michel, M. (2004). Three languages, one ECHO: Cognate effects in trilingual word recognition. Language and Cognitive Processes, 19, 585–611.CrossRef Google Scholar

Lijewska, A., Ziegler, M., & Olko, S. (2018). L2 primes L1–translation priming in LDT and semantic categorisation with unbalanced bilinguals. International Journal of Bilingual Education and Bilingualism, 21, 744–759.CrossRef Google Scholar

Macnamara, J., & Kushnir, S. L. (1971). Linguistic independence of bilinguals: The input switch. Journal of Verbal Learning and Verbal Behavior, 10, 480–487.CrossRef Google Scholar

Matuschek, H., Kliegl, R., Vasishth, S., Baayen, H., & Bates, D. (2017). Balancing Type I error and power in linear mixed models. Journal of Memory and Language, 94, 305–315.CrossRef Google Scholar

McClelland, J. L., & Rumelhart, D. E. (1981). An interactive activation model of context effects in letter perception: Part 1. An account of basic findings. Psychological Review, 88, 375–407.CrossRef Google Scholar

Nakayama, M., Ida, K., & Lupker, S. J. (2016). Cross-script L2-L1 noncognate translation priming in lexical decision depends on L2 proficiency: Evidence from Japanese-English bilinguals. Bilingualism: Language and Cognition, 19, 1001–1022.CrossRef Google Scholar

Nakayama, M., Lupker, S. J., & Itaguchi, Y. (2018). An examination of L2-L1 noncognate translation priming in the lexical decision task: Insights from distributional and frequency-based analyses. Bilingualism: Language and Cognition, 21, 265–277.CrossRef Google Scholar

Nakayama, M., Sears, C. R., Hino, Y., & Lupker, S. J. (2013). Masked translation priming with Japanese–English bilinguals: Interactions between cognate status, target frequency, and L2 proficiency. Journal of Cognitive Psychology, 25, 949–981.CrossRef Google Scholar

Norris, J. M., & Ortega, L. (2012). Assessing learner knowledge. In Gass, S. M. & Mackey, A. (Eds.), The Routledge handbook of second language acquisition (pp. 591–607). New York: Routledge.Google Scholar

Oxford University, Press, University of Cambridge, & Association of Language Testers in Europe. (2001). Quick placement test: Paper and pen test. Oxford: Oxford University Press.Google Scholar

Paradis, M. (2004). A neurolinguistic theory of bilingualism. Amsterdam: Benjamins.CrossRef Google Scholar

Peirce, J. W. (2007). PsychoPy—Psychophysics software in Python. Journal of Neuroscience Methods, 162, 8–13.CrossRef Google Scholar PubMed

Rastle, K., Harrington, J., & Coltheart, M. (2002). 358,534 nonwords: The ARC Nonword Database. Quarterly Journal of Experimental Psychology, 55A, 1339–1362.CrossRef Google Scholar

R Core Team. (2016). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. Available from http://www.R-project.org Google Scholar

Rothman, J., & Iverson, M. (2010). Independent multilingualism normative assessments, where art thou? In Cruz-Ferreira, M. (Ed.), Multilingual norms (pp. 33–51). Berlin: Lang.Google Scholar

Sabourin, L., Brien, C., & Burkholder, M. (2014). The effect of age of L2 acquisition on the organization of the bilingual lexicon: Evidence from masked priming. Bilingualism: Language and Cognition, 17, 542–555.CrossRef Google Scholar

Schoonbaert, S., Duyck, W., Brysbaert, M., & Hartsuiker, R. J. (2009). Semantic and translation priming from a first language to a second and back: Making sense of the findings. Memory & Cognition, 37, 569–586.CrossRef Google Scholar PubMed

Treffers-Daller, J. (2019). What defines language dominance in bilinguals? Annual Review of Linguistics, 5, 375–393.CrossRef Google Scholar

van Hell, J. G., & Dijkstra, T. (2002). Foreign language knowledge can influence native language performance in exclusively native contexts. Psychonomic Bulletin & Review, 9, 780–789.CrossRef Google Scholar PubMed

van Hell, J. G., & Tanner, D. (2012). Second language proficiency and cross-language lexical activation. Language Learning, 62, 148–171.CrossRef Google Scholar

van Heuven, W. J. B., Mandera, P., Keuleers, E., & Brysbaert, M. (2014). Subtlex-UK: A new and improved word frequency database for British English. Quarterly Journal of Experimental Psychology, 67, 1176–1190.CrossRef Google Scholar PubMed

van Heuven, W. J. B., Schriefers, H., Dijkstra, T., & Hagoort, P. (2008). Language conflict in the bilingual brain. Cerebral Cortex, 18, 2706–2716.CrossRef Google Scholar PubMed

Wang, M. (2010). Bilingual compound processing: The effects of constituent frequency and semantic transparency. Writing Systems Research, 2, 117–137.CrossRef Google Scholar

Wang, X. (2013). Language dominance in translation priming: Evidence from balanced and unbalanced Chinese–English bilinguals. Quarterly Journal of Experimental Psychology, 66, 727–743.CrossRef Google Scholar PubMed

Wen, Y., & van Heuven, W. J. B. (2017). Noncognate translation priming in masked priming lexical decision experiments: A meta-analysis. Pyschonomic Bulletin & Review, 24, 879–886.CrossRef Google Scholar

Xia, V., & Andrews, S. (2015). Masked translation priming asymmetry in Chinese–English bilinguals: Making sense of the Sense Model. Quarterly Journal of Experimental Psychology, 68, 294–325.CrossRef Google Scholar PubMed

Zhao, X., Li, P., Liu, Y., Fang, X., & Shu, H. (2011). Cross-language priming in Chinese-English bilinguals with different second language proficiency levels. In Carlson, L., Hoelscher, C., & Shipley, T. F. (Eds.), Proceedings of the 33rd annual meeting of the Cognitive Science Society (pp. 801–806). Austin, TX: Cognitive Science Society.Google Scholar

Table 1. Participant characteristic

Table 2. Sample stimuli

Table 3. Stimuli characteristics

Table 4. Mean response times (RTs; in milliseconds; standard errors), error rates (%), and priming effects (in milliseconds) in the lexical decision tasks

Table 5. Intercept and significant or marginally significant factors included in the final model for the analysis of response times and their coefficients, standard errors, t values, and p values

Table 6. Intercept and significant or marginally significant factors included in the final model for the analysis of accuracy and their coefficients, standard errors, z values, and p values

Table A.1. Prime and target words and nonwords for the L1–L2 and L2–L1 tasks

Table A.2. Spanish and English primes for nonword targets in the L1–L2 and L2–L1 tasks

Article contents

Individual differences in bilingual word recognition: The role of experiential factors and word frequency in cross-language lexical priming

Abstract

Keywords

The asymmetry in models of bilingual lexical processing

The RHM

Multilink

Factors investigated in the literature as potential modulators of L2–L1 priming

Word frequency

L2 proficiency

Language exposure/use and immersion

The present study

Method

Participants

Materials

English–Spanish translation task

Test of familiarity with Spanish words

Procedure

Results

Response time analysis

Accuracy analysis

Discussion

Conclusions and future directions

Acknowledgments

Appendix A Complete list of stimuli

Appendix B Theory-driven model and description of the rationale behind each fixed-effect’s inclusion

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests