Do readers anticipate wh-in-situ questions? Cross-linguistic reading time evidence from Mandarin Chinese and French

Leticia Pablos Robles; Yang Yang; Jenny Doetjes; Lisa Lai-Shen Cheng

doi:10.1017/S0142716425000074

Published online by Cambridge University Press: 11 April 2025

Leticia Pablos Robles*: Leiden University, Leiden, Netherlands
Yang Yang: Center for Linguistics and Applied Linguistics, Guangdong University of Foreign Studies, Guangzhou, China
Jenny Doetjes: Leiden University, Leiden, Netherlands
Lisa Lai-Shen Cheng: Leiden University, Leiden, Netherlands
*Corresponding author: Leticia Pablos Robles; Email: [email protected]

Abstract

The understanding of wh-in-situ questions relies naturally on contextual and prosodic information for their early discrimination from declarative sentences. However, there is scarce evidence on the parsing processes involved during the online incremental processing of these questions. In this study, we investigate the incremental reading of wh-in-situ sentences with no prosodic or contextual information available to aid the parser by comparing them to their declarative counterparts. We investigated two wh-in-situ languages: Mandarin Chinese (in-situ only) and French (optionally in situ). This comparison allows us to determine whether wh-in-situ questions are processed similarly across languages and whether the parsing process is related to language-specific question formation strategies. Results of four word-by-word self-paced reading experiments on two types of wh-in-situ phrases (simplex or complex) in Mandarin Chinese and French show an interpretation strategy in which the most frequent structure, declarative, is considered in both languages, independently of the available question formation strategy. Nevertheless, the timing of the online interpretation and the observed effects are affected by the nature of the wh-phrases (simplex or complex) and the definiteness of the noun phrases contained in the declaratives, which confirms that several processes occur concurrently introducing a limit on the capability to extract conclusions on the processes based solely on behavioral measures.

Keywords

Complex wh-questions; definiteness; French; in-situ wh-questions; Mandarin Chinese; question formation strategies

Type: Original Article
Information: Applied Psycholinguistics, Volume 46, 2025, e9

DOI: https://doi.org/10.1017/S0142716425000074
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (https://creativecommons.org/licenses/by-nc-sa/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is used to distribute the re-used or adapted article and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use.
Copyright: © The Author(s), 2025. Published by Cambridge University Press

Introduction

How information-seeking questions (also known as wh-questions) are interpreted is an issue that has received considerable attention in the sentence-processing literature. Most studies focus on long-distance dependencies, examining how fronted wh-phrases are interpreted at their canonical position and the effects that result from keeping the fronted wh-phrase in the parser’s working memory until the open dependency is resolved. Standard “filled-gap effects” (Crain & Fodor, 1985; Stowe, 1986; see Pablos, 2008 for an overview) are associated with reading time evidence showing that readers expect the wh-phrase (the filler) to be discharged and interpreted at the first available grammatical position. Failure to discharge the wh-phrase results in longer reading times than in the declarative counterparts. In addition, there is growing evidence that the interpretation of sentence meaning is achieved incrementally, with comprehenders predicting upcoming information (including lexical and syntactic structure) based on the available input (e.g., Altmann & Kamide, 1999; Levy, 2008). Under prediction accounts, projecting a wh-gap in fronted wh-questions occurs at the moment the wh-phrase is encountered.

The above scenario cannot be directly extended to wh-questions where the wh-phrase stays in its canonical position, known as wh-in-situ questions. The interpretation of these questions may result in a temporary syntactic ambiguity in comparison with their declarative counterparts, which have a non-wh-word at the same site. Research shows that available contextual and prosodic information is used to predict the upcoming structure (Fodor, 2002; Déprez et al., 2013; Gryllia et al., 2016; Gryllia et al., 2020; Yang et al., 2019; Kawahara et al., 2022), practically resolving the ambiguity before the wh-phrase is encountered. Nonetheless, this raises the question of how such wh-questions are parsed and to what extent readers anticipate an upcoming in-situ wh-question in the absence of other information (e.g., prosody, context, information structure) during the reading process. These questions form the focus of this study. More specifically, we examine how speakers of two wh-in-situ languages, Mandarin Chinese and French, proceed in the real-time reading of sentences presented without any preceding contextual or prosodic cues that could bias interpretations as questions or declarative statements. These two languages differ in the types of strategies they permit for wh-questions. Whereas Mandarin wh-questions are always in situ, French permits both in-situ and fronted wh-questions. We investigate whether this variation in wh-in-situ strategies influences processing difficulty and predictability. In addition, we examine a further factor that might influence the parsing of wh-in-situ questions, namely, the complexity of the wh-phrase (i.e., a simplex wh-phrase such as who or a complex wh-phrase such as which person). Our second research question thus addresses whether there are processing differences between complex and simplex wh-phrases in in-situ wh-questions.

Mandarin Chinese and French

Question formation strategies across languages

Languages differ in the number and type of strategies for forming wh-questions (see for instance Cheng, 1991) and can be categorized into three primary groups. The first type obligatorily fronts wh-words in wh-questions, as in English (1).¹,²

(1) Who _i did you meet t _i at the art museum yesterday?

The second type consists of languages that always retain wh-words in their canonical position (i.e., in-situ) when formulating a wh-question, as in Mandarin Chinese (2).

The third language type permits both fronting and in-situ wh-question formation, as in French, illustrated in (3a) and (3b).³,⁴

Adli (2015) examined the prevalence and distribution of wh-in-situ questions in relation to other variants of wh-question formation in French. He presented an assessment of spontaneous speech in French obtained from the Sgs database (with 10943 sentences) and showed that 56.2% of the total number of 1721 interrogative utterances (excluding echo-questions) are in-situ wh-questions. The study further found that the relative frequency of these in-situ questions is 0.62 for wh-adjuncts and 0.43 for wh-objects. A more recent study found an increase in the use of wh-in-situ in the last decade (Baunaz & Bonan, 2023).

These different strategies pose interesting questions regarding the processes used in the online comprehension of wh-in-situ constructions where the clause type of the sentence (question or declarative) is only obvious when the wh-word is encountered. If English were to permit in-situ wh-questions like Mandarin and French, a comparison of (4) and (5) illustrates that the difference between the wh-in-situ question (4) and the declarative sentence (5) is only revealed at the postverbal object position (see Note 3).

(4) (You said) Peter would like to meet whom tomorrow?
(5) (You said) Peter would like to meet a friend tomorrow.

Crucially, unless prosodic or contextual information is available, no distinction can be made between these two sentences by readers (up to the object position) as they proceed incrementally in the interpretation.

The syntax and processing of in-situ wh-questions: previous studies

Syntactic studies of in-situ wh-questions analyze these questions as involving a covert dependency, such that the in-situ wh-phrase is either related to an interrogative operator or raised to the structurally higher operator position (SpecCP position) at Logical Form (LF; for further discussion see Aoun and Li, 1993; Cheng, 1991, 2003, 2009; Huang, 1982; Tsai, 1994; and Bayer & Cheng, 2017 for an overview). The covert dependency is thus on a par with overt dependencies in questions with overt wh-fronting in that it involves a syntactic representation where a (covert) operator is in the structurally higher position (i.e., SpecCP), which determines the clause type of the sentence.⁵ This in turn raises an interesting question concerning the representation of in-situ wh-questions in the processing system. If the same processing mechanisms are used in processing dependencies, the abstract link between the wh-phrase and the SpecCP position in the case of wh-in-situ questions should manifest as a nonlocal dependency formation.

To date, there has been limited research on the processing of in-situ wh-questions. In French, for example, most of the research has focused on the production of the prosodic features or the acceptability of in-situ wh-questions, but not on how these questions are interpreted incrementally (see Adli, 2004, 2006; Beyssade et al., 2007; Delattre, 1966; Déprez et al., 2013; Wunderli, 1983, 1984; Oiry, 2011; Tual, 2017a, 2017b from discussion in Glasbergen-Plas, 2021). Ueno and Kluender’s (2009) study of Japanese wh-in-situ constructions showed an effect, manifested as a right-lateralized anterior negativity (RLAN), on longer distance covert dependency formation. In Japanese, however, the question marker (also a scope marker) is morphologically overt. In Mandarin Chinese, studies by Xiang et al. (2013, 2015) examined the processing of in-situ complex wh-questions (i.e., which x questions) of different lengths (mono-clause vs. embedded clause) in comparison with declaratives (mono-clause vs. embedded clause) using the Speed-Accuracy Tradeoff (SAT) methodology. They looked at differences in wh-dependency length across their stimuli and found that length had an impact on processing accuracy but not on processing speed. Their results showed that questions such as (7b) had lower processing accuracy than those in (7a) but were equally slow in comparison to declaratives such as (6a) and (6b).

The increased processing time of wh-in-situ questions in (7a, b) was attributed to the effects of establishing a long-distance covert dependency. The effect of length on the accuracy, but not on the speed of processing of wh-in-situ, supports the notion of a covert dependency retrieved by a content-addressable memory process (McElree, 2000; McElree et al., 2003).

Predictions for parsing in-situ wh-questions

The evolution of sentence comprehension models over the years has led to a consensus on an incremental interpretation process in which the human parser interprets the available information incrementally, building up a representation of the sentence meaning as the input unfolds, without delay. Still, there are a few aspects in which available models differ and that are relevant for the interpretation of observed processing difficulty. A growing amount of evidence points to the predictive nature of the comprehension processes (e.g., Levy, 2008; Altmann & Mirković, 2009), although there remains a debate on the interpretation of the concept of prediction and how it differs from integration processes (for a summary discussion see Pickering & Gambi, 2018 and Kuperberg & Jaeger, 2016, and counter-arguments in Huettig & Mani, 2016). In simple terms, prediction implies the activation of linguistic information before input is available. Predictive models can thus be understood in a probabilistic framework in which the parser continuously updates the projected structure and expected lexical content based on the information as it becomes available (Levy, 2008).

Declarative sentences are more frequent than questions in the world’s languages and tend to be the most unmarked of the clause types (Ma et al., 2011). When parsing a wh-in-situ question up to the wh-phrase (i.e., parsing the part of the sentence which is the same in both wh-questions and declaratives), the initial prediction made by the parser would therefore be based on the most frequent structure. It should also be noted that Adli (2015), in his study of spontaneous speech in French, reported that questions (including wh-in-situ questions) constitute only 15.72% of utterances (1721 out of 10943 sentences), highlighting the dominance of declaratives in the dataset and thus reinforcing the expectation of declaratives over interrogatives. We therefore predict a processing slowdown at the wh-phrase when processing wh-in-situ questions as compared to the declarative counterpart.

The nature of the observed processing difficulty, however, can have a different interpretation depending on the theoretical processing model considered. In “classical terms,” it can be considered an indication of re-analysis to reconstruct the projected structure (Fodor & Ferreira, 1998), or the activation of the alternative structure. Further, the level of difficulty has been postulated to be quantified by measures such as surprisal (Levy, 2008) and entropy (Linzen & Jaeger, 2016), which represent formalizations of the predictability of a word or structure in a certain context. These measures can be estimated from corpora or, traditionally, from Cloze probability procedures. These models are considered serial, as only one interpretation is active at a given time, in contrast to models where multiple interpretations are concurrently active in parallel with different levels of activation. Under parallel activation, we can consider activation-based retrieval models (Van Dyke & Lewis, 2003; Lewis & Vasishth, 2005) or, more recently, the proposed parallel architecture model (Huettig et al., 2022). In the activation-based retrieval models, processing difficulties at the wh-in-situ site reflect reactivation of the alternative structure combined with the integration of the covert dependency. In the parallel architecture model (Huettig et al., 2022), the potential structures, encoded as a lexicon (Jackendoff & Audring, 2020), are all active simultaneously as the first words are encountered (within-item activation) with different “resting activations”, linked to their frequency.
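As an illustration of the surprisal and entropy measures mentioned above, the following minimal sketch computes both for a toy distribution over clause types. The probabilities are invented for illustration only and are not estimates from this study or from any corpus; the function names are likewise hypothetical.

```python
import math

def surprisal(p):
    """Surprisal (in bits) of an outcome that has probability p."""
    return -math.log2(p)

def entropy(distribution):
    """Shannon entropy (in bits) of a dict mapping outcomes to probabilities."""
    return sum(p * surprisal(p) for p in distribution.values() if p > 0)

# ASSUMPTION: illustrative clause-type probabilities, not values from the study.
clause_type = {"declarative": 0.85, "question": 0.15}

for label, p in clause_type.items():
    print(label, round(surprisal(p), 2), "bits")
print("entropy:", round(entropy(clause_type), 2), "bits")
```

Under these assumed values the less probable clause type carries the higher surprisal, which is the sense in which a wh-phrase that reveals a question is expected to be costlier than a noun phrase that confirms a declarative.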

All the models described above would predict readers of Mandarin Chinese and French to have additional processing costs (observed as longer reading times) when encountering the wh-in-situ phrase, as compared to the non-wh noun phrase in the declarative counterpart. This processing cost could be due to reanalysis, reactivation, or covert dependency integration. The extent to which the parser anticipates upcoming structure when there is no other cue available might be modulated by the likelihood of encountering in-situ wh-phrases in each of the languages under study. In Mandarin, an in-situ question is the only option for wh-questions. In contrast, in French, as mentioned above, Adli (2015) showed that 56.2% of the produced interrogative utterances in the Sgs database were wh-in-situ.

The complexity and definiteness of (wh)-noun phrases

The processing studies of Mandarin Chinese mentioned above (Xiang et al., 2013; Xiang et al., 2015) used only complex wh-phrases (i.e., which x phrases). Nonetheless, there is experimental evidence showing differences in the processing of complex and simplex wh-questions for languages such as English, Dutch and Italian, in that complex wh-questions take longer to read than simplex wh-questions (De Vincenzi, 1996; Kaan et al., 2000; Donkers et al., 2011). Other studies, however, make the opposite claim about the processing cost of complex wh-phrases, namely that their processing is facilitated (see Frazier & Clifton, 2002; Clifton et al., 2006; Hofmeister et al., 2007; Hofmeister & Sag, 2010).

In addition, the syntactic and semantic literature has made different claims as to which type of noninterrogative noun phrase is more comparable to which type of wh-phrase, even though previous processing research on in-situ wh-questions primarily focused on comparisons with declaratives with definite noun phrases. Evidence from the theoretical syntax and semantics literature (Cheng, 1991, 1994) shows that in Mandarin Chinese, simplex wh-words are closer to indefinite noun phrases, whereas complex wh-phrases are more akin to definite noun phrases (Giannakidou & Cheng, 2006). Previous sentence processing studies showed differences in reading time depending on the referential nature of the noun phrase being tested (e.g., Warren & Gibson, 2002, 2005; Gordon et al., 2004; Kaan & Vasić, 2004). These studies based their predictions on the Accessibility or Givenness Hierarchy (Gundel et al., 1993), which determines the accessibility of referents in the discourse and the relation between the type of noun phrase and the degree to which its antecedent is accessible, and they found that, in the absence of prior discourse, definite noun phrases take longer to read than their indefinite counterparts. This is because definite noun phrases require the reader to reconstruct their referent from scratch, whereas indefinite noun phrases usually introduce new referents and do not require the reader to search for one.

Given the potential influence of noun phrase definiteness on parsing differences between wh-questions and declaratives, our experimental manipulation introduced two declarative types: one with definite and one with indefinite noun phrases in the wh-phrase position. To investigate the predictions outlined above and extend research on in-situ wh-question processing in Mandarin Chinese and the processing of complex and simplex wh-phrases, as well as studies on definite and indefinite noun phrases, we conducted four self-paced reading (SPR) experiments (see Jegerski, 2014 for a summary description). SPR’s incremental processing methodology is well-suited for this investigation. The first two experiments in French compared the processing of in-situ questions with simplex object wh-phrases (qui “who”) and complex wh-phrases (quel N “which N”) with their declarative counterparts containing both definite and indefinite noun phrases. The second two experiments carried out the same comparisons in Mandarin Chinese. The next sections describe the experimental paradigm, design, and results.

Processing in-situ questions in French

Experiment 1: processing in-situ questions with simplex wh-phrases in French

As described above, research on the processing of in-situ wh-questions in French is scarce, and researchers have mainly concentrated on the prosodic characteristics of these questions or on their acceptability rather than on their reading comprehension. The goal of Experiment 1 is to determine whether French in-situ questions with simplex wh-phrases (qui “who”) incur the predicted processing costs at the disambiguation point in the absence of prosodic and contextual cues, compared to their declarative counterparts.

Method

Participants

Participants (n = 36, mean age = 22 years, 18 females) were all native speakers of French. They were recruited in two groups: one from the University of Nantes (France) (n = 30, mean age = 20 years, 16 females) and one from the expat French community in the Leiden area⁶ (n = 6, mean age = 35 years, 2 females). Testing participants at different locations was done for practical reasons and to achieve the required statistical power. None of the participants suffered from dyslexia and all of them had normal or corrected vision. All participants provided informed consent and were monetarily compensated for their participation.

Materials

We compared object in-situ wh-questions with qui “who” in (8a), with declaratives containing indefinite noun phrases such as quelqu’un “someone” in (8b), and with declaratives containing proper names such as Marie in (8c). The proper names were monosyllabic (n = 9) or bisyllabic (n = 15), and half were masculine (n = 12) and half feminine (n = 12).

An example of a stimulus set is given in (8).⁷ The sentences were presented word-by-word incrementally from left to right.

The experiment consisted of 24 sets of three sentences distributed across three lists in a Latin Square design, which were combined with 72 filler sentences of similar length. Half of the fillers were questions and the other half declaratives.
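The Latin Square distribution described above can be sketched as follows. This is only an illustration of the general rotation scheme, assuming the three conditions are rotated across items; the condition labels and the function name are hypothetical, and the actual list assignment used in the experiment is not specified here.

```python
from collections import Counter

# ASSUMPTION: hypothetical labels for the three conditions in (8a)-(8c).
CONDITIONS = ["wh", "indefinite", "proper_name"]

def latin_square_lists(n_items=24, conditions=CONDITIONS):
    """Rotate conditions over items so each list sees every item exactly once."""
    n_lists = len(conditions)
    lists = {k: [] for k in range(n_lists)}
    for item in range(n_items):
        for k in range(n_lists):
            # Shifting the condition index by the list number gives the rotation.
            condition = conditions[(item + k) % n_lists]
            lists[k].append((item + 1, condition))
    return lists

lists = latin_square_lists()
# Each list contains 24 items, 8 per condition.
print(Counter(condition for _, condition in lists[0]))
```

With 24 item sets and three conditions, each participant list contains eight items per condition, and no participant sees the same item in more than one condition.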

The modifier of the subject noun phrase le braqueur “the robber” varied minimally in length between two and three words. Most of the items (i.e., 20/24) contained two-word modifiers for the subject, as de banque “of bank” in le braqueur de banque “the bank robber” (8). The region dans sa fuite “on his escape” following the critical position given in bold (i.e., wh-word qui, indefinite quelqu’un, or proper name Marie) also differed minimally in length across items, ranging from three words (in 15/24 sentences) to four words (in 9/24 sentences). This variation was kept so that the stimuli would sound as natural as possible to French speakers. All materials were checked for grammaticality and naturalness by a French native speaker.

Procedure

Participants signed an informed consent form before the experiment in compliance with the Ethics Code for linguistic research in the Faculty of Humanities at Leiden University. A self-paced reading, word-by-word moving window task (Just et al., 1982; Aaronson & Scarborough, 1976) was conducted on a MacBook Pro laptop running the software Linger (Rhode, 2003) in a quiet room at the University of Nantes and at Leiden University. Each trial began with groups of dashes, one group for each word in the sentence. Participants could therefore see the length of the sentence but not the words behind the dashes. Participants were asked to press the space bar to read the sentence word-by-word and to reply to the comprehension question that appeared immediately afterwards on a different screen by pressing the “F” (YES) or “J” (NO) buttons. These responses were indicated with a sticker above the corresponding keys. As participants pressed the space bar to read the sentences, each word was revealed individually and the previously read word disappeared. The punctuation mark at the end of the sentence, which unambiguously determined the interrogative or declarative nature of the sentence, appeared together with the last word of the sentence. This meant that in principle readers of French could not determine whether they were reading a question or a declarative until they reached this sentence-final position. The word-by-word moving window was therefore chosen to assess, word by word, what readers predicted about upcoming material. To keep participants attentive, each sentence was followed by a yes/no comprehension question. The experiment lasted approximately 30 minutes. An example question for item (8a) (repeated here) is shown below.⁸

Comprehension Question:

Est-ce un braqueur de bijouterie qui a blessé quelqu’un dans sa fuite ?
Was it a jewelry store robber who injured someone on his escape? (Answer : No)
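The word-by-word moving window display described in the Procedure can be sketched as a sequence of text frames. This is a hypothetical illustration of the display logic only, not the Linger implementation; the function name is an assumption, and the demonstration sentence is a shortened version of example (4).

```python
def moving_window_frames(sentence):
    """Return the frames of a word-by-word moving window display.

    The first frame shows only dashes (one dash per character of each word).
    Each later frame reveals exactly one word and masks all the others, so the
    previously read word disappears when the next one is shown.
    """
    words = sentence.split()
    masked = ["-" * len(word) for word in words]
    frames = [" ".join(masked)]
    for index, word in enumerate(words):
        frame = masked.copy()
        frame[index] = word  # sentence-final punctuation stays attached to the last word
        frames.append(" ".join(frame))
    return frames

for frame in moving_window_frames("Peter met whom?"):
    print(frame)
```

Because the punctuation mark is part of the final token, the question mark only becomes visible in the last frame, as in the procedure described above.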

Reading time data analysis

All trials (independently of whether the corresponding comprehension question was answered correctly or not) were included in the analysis. The average comprehension accuracy for the 36 participants was 96% (SD = 1.95%); thus, no participant was rejected on this basis. There was no significant difference in accuracy between declaratives (97.7%) and questions (96.9%) (χ²(1, N = 859) = 0.277, Fisher’s p = 0.49).

The regions used for the analysis corresponded to single words, except for those cases where French clusters the determiner or preposition with the noun by means of an apostrophe (e.g., l’infirmière “the nurse,” d’une “of one”). The collected reading time data was inspected and outlier data points with reading times smaller than 150 ms or larger than 2000 ms were removed. The total number of discarded data points represented ∼1% of the complete data including both fillers and experimental sentences.
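The trimming step described above can be sketched as follows. The thresholds are those stated in the text; the function name and the sample reading times are hypothetical.

```python
def trim_outliers(reading_times_ms, low=150, high=2000):
    """Drop reading times smaller than `low` or larger than `high` (in ms).

    Returns the retained values and the number of discarded data points.
    """
    kept = [rt for rt in reading_times_ms if low <= rt <= high]
    return kept, len(reading_times_ms) - len(kept)

# ASSUMPTION: made-up reading times, for illustration only.
kept, n_removed = trim_outliers([100, 150, 400, 2000, 2500])
print(kept, n_removed)
```

Values exactly at 150 ms or 2000 ms are retained here, since the text describes removing only points smaller than or larger than those bounds.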

There is experimental evidence that word length and frequency impact reading time both in eye tracking (e.g., Kliegl et al., 2004; Hyönä & Olson, 1995) and in self-paced reading studies (e.g., Bultena et al., 2014), with low-frequency and longer words both shown to display increased fixation or reading durations, which is associated with a higher processing cost. To avoid possible confounding effects unrelated to our experimental manipulation and research questions, we addressed the impact of word length and word frequency (when relevant) of the critical regions on the obtained reading times by conducting an ad hoc analysis. First, we tackled the relation of reading time with word length in two ways: by calculating length-corrected residual reading times (RSRT) (Ferreira & Clifton, 1986) and by considering individual experimental items as a random factor in the mixed effects model analysis (Barr et al., 2013). The reading time was residualized by computing a linear regression between the word length and reading time for each subject and then subtracting the predicted reading time from the observed reading time for each word. The resulting RSRT were used for all subsequent analyses. Second, to account for the possible effects of word frequency, the experimental dataset was expanded with the information contained in the Lexique database (New et al., 2004) for French. This was done by extracting a frequency of use for each critical word and matching it to the relevant syntactic category. For inflected words, we used the frequency of the lemma to account for possible effects of word familiarity, whereas in the clusters containing an apostrophe (e.g., l’infirmière “the nurse,” d’une “of one”) we used the frequency of the noun (e.g., infirmière “nurse”).
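The residualization procedure described above (a per-subject linear regression of reading time on word length, followed by subtracting the predicted value) can be sketched in plain Python. This is a minimal illustration under stated assumptions: word length is taken as the number of characters, the row format and function name are hypothetical, and the data are invented.

```python
from collections import defaultdict

def residual_reading_times(rows):
    """Add a length-corrected residual reading time ('rsrt') to each row.

    rows: list of dicts with keys 'subject', 'word', and 'rt' (ms).
    For each subject, fit rt = intercept + slope * word_length by ordinary
    least squares, then subtract the predicted rt from the observed rt.
    """
    by_subject = defaultdict(list)
    for row in rows:
        by_subject[row["subject"]].append(row)

    result = []
    for subject, observations in by_subject.items():
        lengths = [len(obs["word"]) for obs in observations]  # ASSUMPTION: length in characters
        times = [obs["rt"] for obs in observations]
        n = len(observations)
        mean_len = sum(lengths) / n
        mean_rt = sum(times) / n
        sxx = sum((x - mean_len) ** 2 for x in lengths)
        sxy = sum((x - mean_len) * (y - mean_rt) for x, y in zip(lengths, times))
        slope = sxy / sxx if sxx else 0.0
        intercept = mean_rt - slope * mean_len
        for obs, x, y in zip(observations, lengths, times):
            result.append({**obs, "rsrt": y - (intercept + slope * x)})
    return result

# ASSUMPTION: invented data points for one subject.
demo = [
    {"subject": "s1", "word": "qui", "rt": 420},
    {"subject": "s1", "word": "dans", "rt": 350},
    {"subject": "s1", "word": "Marie", "rt": 400},
]
for row in residual_reading_times(demo):
    print(row["word"], round(row["rsrt"], 1))
```

A positive residual means the word was read more slowly than that subject's word length alone would predict, and the residuals for each subject sum to zero by construction.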

We analyzed differences in the RSRT at two regions: the site of the disambiguation (wh-question or NP: qui/quelqu’un/Marie) and the following word, to account for possible spillover effects⁹ (Vasishth, 2006). The analysis used Linear Mixed Effects Regression (LMER; Baayen et al., 2008) by means of the statistical computing language R (R Core Team, 2016) and the lme4 package (Bates et al., 2015). The model included one fixed-effect factor, Condition, with three levels (wh-word, indefinite and Proper Name). In the region of the wh-question/NP (region 8 in Figure 1), as shown in (8), all the experimental items consist of the same pronouns qui or quelqu’un or a Proper Name, so we did not include word frequency in the model for that region.¹⁰ In the region immediately after the wh-site, in addition to Condition, a fixed effect for Word Frequency was considered based on the log-transformed, centered word frequency as extracted from Lexique. The maximal random effects structure justified by the model was considered (Barr et al., 2013): variance introduced by subjects and items was modeled as random intercepts. In addition, we considered random slopes by subject for the factor Condition.

Figure 1. Mean RSRT per word for the comparison between in-situ questions with simplex wh-phrases (Qui), declaratives with indefinites (Quelqu’un) and proper names (Marie) in Experiment 1. Bars indicate the standard error per region.

The best model fitting the data was obtained by likelihood ratio tests comparing models including and excluding the relevant effect, and against a “null” model containing only an intercept parameter and the random effects structure. When a significant effect of Condition was found, a follow-up analysis was performed to assess whether the wh-in-situ question behaved differently from the two types of declaratives.¹¹
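The logic of a likelihood ratio test between nested models can be sketched as follows. This is a generic illustration, not the R/lme4 code used in the study: the log-likelihood values are invented, the function names are hypothetical, and the chi-square tail probability is implemented in closed form only for 1 and 2 degrees of freedom.

```python
import math

def chi2_sf(statistic, df):
    """Upper-tail probability of a chi-square variable (closed forms for df 1 and 2)."""
    if statistic <= 0:
        return 1.0
    if df == 1:
        return math.erfc(math.sqrt(statistic / 2))
    if df == 2:
        return math.exp(-statistic / 2)
    raise ValueError("this sketch only supports df = 1 or df = 2")

def likelihood_ratio_test(loglik_reduced, loglik_full, df):
    """Compare nested models; df is the number of extra parameters in the full model."""
    statistic = 2 * (loglik_full - loglik_reduced)
    return statistic, chi2_sf(statistic, df)

# ASSUMPTION: invented log-likelihoods for a model without and with Condition.
statistic, p_value = likelihood_ratio_test(-1005.0, -1000.0, df=2)
print(round(statistic, 2), round(p_value, 4))
```

A three-level factor such as Condition adds two parameters to the null model, hence df = 2 in the usage above, whereas a pairwise comparison between two conditions corresponds to df = 1.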

Results

Figure 1 shows the average RSRT at the different regions of the experimental items against a sample sentence for reference. As shown in this figure, there are two regions that show significant effects. One is the critical region (i.e., wh-word “Qui”/ indefinite “Quelqu’un”/ Proper Name “Marie”) and the other is the immediately following region (i.e., the preposition “dans”). In both regions, the in-situ wh-question condition in (8a) is read significantly slower than its indefinite declarative counterpart in (8b). The definite declarative with a proper name in (8c) is read significantly slower than the indefinite declarative in (8b) only at the critical word region. There is a difference between the definite declarative in (8c) and the wh-question condition in (8a) at the region following the critical region (i.e., “dans”), where the question appears to be read slower, but this difference did not reach statistical significance.

Post hoc analysis at the critical region (region 8: Qui/Quelqu’un/Marie), presented in Table 1, confirmed that both in-situ questions with simplex wh-phrases (qui) (D = 63.70 ms, χ²(1) = 12.37, p = 0.001) and declaratives with proper names (Marie) (D = 46.99 ms, χ²(1) = 8.43, p = 0.007) were read significantly slower than declaratives that contain indefinites (quelqu’un). No significant difference was found between the reading time of in-situ questions (qui) and declaratives that contain proper names (Marie). At the region immediately after the critical region (region 9: dans in the example in (8)), we observe a significant increase in reading time in the interrogative condition when compared to the declarative with an indefinite pronoun (D = 31.72 ms, χ²(1) = 6.11, p = 0.04), but no significant difference in reading time when comparing the interrogative condition with the declarative with a proper name.

Table 1. Pairwise comparison for RSRT at the critical region “Qui/Quelqu’un/Marie” (region 8) and following word “dans” (region 9) in Experiment 1. P-values adjusted with the Holm method for multiple comparisons.

* p < 0.05, ** p < 0.01, *** p < 0.001.
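The Holm adjustment named in the caption of Table 1 can be sketched in a few lines. The function name and the raw p-values below are hypothetical and serve only to show the step-down procedure; they are not the values underlying Table 1.

```python
def holm_adjust(p_values):
    """Return Holm-adjusted p-values in the same order as the input."""
    m = len(p_values)
    # Indices of the p-values, from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, index in enumerate(order):
        # The smallest p-value is multiplied by m, the next by m - 1, ...
        candidate = min(1.0, (m - rank) * p_values[index])
        # Enforce monotonicity: adjusted values never decrease with rank.
        running_max = max(running_max, candidate)
        adjusted[index] = running_max
    return adjusted


# Hypothetical raw p-values for three pairwise comparisons.
raw_p = [0.01, 0.04, 0.03]
adjusted_p = holm_adjust(raw_p)
```

With three pairwise comparisons per region, the smallest raw p-value is multiplied by 3, the next by 2, and the largest is left unchanged unless the monotonicity step raises it.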

Table 2 provides a summary of the maximal fitted model for the wh/NP disambiguation site (region 8: Qui/Quelqu’un/Marie) and the region after (region 9), respectively.

Table 2. Model summary for RSRT at the critical region “Qui/Quelqu’un/Marie” (region 8) and following word “dans” (region 9) in Experiment 1

* p < 0.05, ** p < 0.01, *** p < 0.001.

p-values calculated based on conditional F-tests with Kenward-Roger approximation.

Marginal R² based on Nakagawa et al. (2017).

The results above show the expected increased effort in processing the in-situ questions with a simplex wh-phrase, such as qui “who,” in French, when compared to declaratives with an indefinite NP. However, this effect is absent when contrasted with a declarative with a definite (proper name) NP. Furthermore, the same processing difficulty is observed between the two declaratives: ProperName conditions such as (8c) are also read slower than indefinite declaratives such as (8b). This difference between indefinite and proper names relative to in-situ questions with a simplex wh-phrase can be attributed to the greater integration difficulty of proper names compared to other definite or indefinite noun phrases (Ledoux et al., 2007; Camblin et al., 2007). This finding aligns with the Accessibility Hierarchy (Gundel et al., 1993), which posits that indefinites require minimal contextual information for interpretation, while definites and proper names necessitate prior knowledge of the referent.

Experiment 2: processing in-situ questions with complex wh-phrases in French

The sentence processing literature has shown that complex wh-phrases presented in isolation produce longer reading times than their simplex wh-phrase counterparts (see De Vincenzi, 1996; Donkers et al., 2011). In Experiment 2, we compared, again in the absence of prosodic and contextual information, the processing of in-situ questions with complex wh-phrases (e.g., quelle caissière “which cashier,” quel garçon “which boy”) with declaratives with a definite or indefinite noun phrase at the position of the wh-phrase. Our prediction, again following the hypothesis described earlier on a bias for a declarative interpretation, is that in-situ questions with complex wh-phrases will be read slower than both declarative definite and declarative indefinite sentences in French, as shown by Adli (2015).

The aim of this experiment was to examine, first, whether in-situ questions with complex wh-phrases incur a processing cost comparable to that of in-situ questions with simplex wh-phrases, as examined in Experiment 1, and second, whether the contrast between complex wh-phrases and their declarative definite/indefinite counterparts shows effects similar in timing and size to those observed in Experiment 1. The second research question is motivated by the syntactic and semantic debate regarding the comparability of different noun phrase and wh-phrase types (Giannakidou & Cheng, 2006), as discussed above.