Is morphosyntactic agreement reflected in acoustic detail? The s duration of English regular plural nouns

MARCEL SCHLECHTWEG; GREVILLE G. CORBETT

doi:10.1017/S1360674322000223

Is morphosyntactic agreement reflected in acoustic detail? The s duration of English regular plural nouns

Published online by Cambridge University Press: 12 September 2022

MARCEL SCHLECHTWEG

and

GREVILLE G. CORBETT

Show author details

MARCEL SCHLECHTWEG: Affiliation:
Department of English and American Studies Carl von Ossietzky University Oldenburg Ammerländer Heerstraße 114-118 26129 Oldenburg Germany [email protected]
GREVILLE G. CORBETT: Affiliation:
Surrey Morphology Group University of Surrey Guildford Surrey GU2 7XH United Kingdom [email protected]

Article contents

Abstract
Introduction
Theoretical background
Methodology
Summary and discussion
Conclusion
Footnotes
References

Rights & Permissions

Abstract

Studies have challenged the assumption that different types of word-final s in English are homophonous. On the one hand, affixal (e.g. laps) and non-affixal s (e.g. lapse) differ in their duration; on the other hand, variation exists across several types of affixal s (e.g. between the plural (cars) and genitive plural (cars’)). This line of research was recently expanded in a study in which an interesting side effect appeared: the s was longer if followed by a past tense verb (e.g. The pods/odds eventually dropped), in comparison to a following present tense verb (e.g. The old screens/jeans obviously need replacing.). Put differently, the s became longer in the absence of overt morphosyntactic agreement, where it was mostly the sole plurality marker in the sentence. The objective of the present article is to examine whether this effect can be replicated in a more controlled setting. Having considered a large number of potential confounding variables in a reading experiment, we found an effect in the expected direction, one that is compatible with the literature on the impact that predictability has on duration. We interpret this finding against the background of the role of fine acoustic detail in language.

Keywords

English plural s agreement duration acoustics

Type: Research Article
Information: English Language & Linguistics , Volume 27 , Issue 1 , March 2023 , pp. 67 - 92

DOI: https://doi.org/10.1017/S1360674322000223 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: Copyright © The Authors, 2022. Published by Cambridge University Press

1 Introduction

The role of acoustic detail in phonologically identical forms is rather limited according to some well-established psycholinguistic and linguistic models, and semantic, syntactic, or morphological information is not expected to be reflected in the acoustics. For instance, the difference in morphological complexity between the English word laps, which is complex, and lapse, which is simplex, should not be expressed in the acoustic output, since the level between morphology and the acoustic output, namely phonology, produces the same form for both laps and lapse. As one case in point, psycholinguistic feed-forward models of speech production (see, e.g., Fromkin Reference Fromkin1971/1973; Harley Reference Harley1984; Levelt Reference Levelt1989, Reference Levelt1995; Roelofs Reference Roelofs1997; Levelt, Roelofs & Meyer Reference Levelt, Roelofs and Meyer1999) do not leave room for acoustic variation in the presence of phonological identity. Once the discrete symbolic representations are specified at the phonological level, and are alike for two words like laps and lapse, acoustic differences are excluded as long as everything else, such as the context, is held constant. We find a similar prediction in linguistic models describing the interaction of morphology and phonology (e.g. Chomsky & Halle Reference Chomsky and Halle1968; Kiparsky Reference Kiparsky, van der Hulst and Smith1982; Bermúdez-Otero Reference Bermúdez-Otero, Hannahs and Bosch2018). Here again, if laps and lapse are not distinct on the abstract and underlying level of lexical phonology, post-lexical phonology and phonetics should not cause acoustic variation. Although these psycholinguistic and linguistic models represent (or represented) the standard view, they have been challenged by many empirical studies showing that the role of acoustic detail in the language system is greater than previously assumed. These findings are more compatible with exemplar-based accounts, which offer more flexibility in the speech production process and in which the acoustic realization of items can be directly affected by activated information in, say, the semantic, syntactic or morphological domain (see, e.g., Dell Reference Dell1986; Pierrehumbert Reference Pierrehumbert, Bybee and Hopper2001, Reference Pierrehumbert, Gussenhoven and Warner2002).Footnote ²

The present article connects to all the previous research which asks whether acoustic detail plays a more significant role in language than well-known psycholinguistic and linguistic models presume. Specifically, we investigate whether morphosyntactic agreement in English is reflected in the acoustics, namely in the duration of the word-final s of regular plural nouns. Both noun–determiner and noun–verb agreement are in focus: while the determiner these agrees overtly with the subsequent plural noun with respect to the number value (e.g. these cabs), the does not do so (e.g. the cabs); similarly, while a present tense verb agrees overtly with the noun (e.g. cabs break down), a past tense form does not (e.g. cabs broke down). Such an effect would be remarkable, and so we proceed cautiously. However, a previous experiment (Schlechtweg & Corbett Reference Schlechtweg and Corbett2021) gave a tantalizing hint that there might be such an effect, and we therefore decided to investigate further. For this purpose, we conducted a well-controlled reading experiment in which native speakers of English participated.

Before presenting the details of this study in section 3, we provide the theoretical foundation of our analysis in section 2. This includes, first of all, a general overview of variables that seem to affect the acoustic realization of items. In a second step, we concentrate on one particular case, namely the duration of the word-final s in English, which has been measured in several contributions already and which is also the response variable in our own study. In the third component of section 2, we reflect upon why morphosyntactic agreement might be potentially mirrored in acoustic detail by considering previous research on how the concepts of informativeness and, crucially, predictability can influence the duration of linguistic material. Having presented our study in section 3, we discuss our findings in connection to previous research in section 4 and conclude in section 5.

2 Theoretical background

2.1 Phonological identity but acoustic variation: overview

In the last decades, a great number of studies have revealed that phonologically identical forms can differ acoustically, in their duration for instance. The decisive question in this research area is which particular variables are the origin of the acoustic variation. Four examples of such variables are frequency, syntactic category, morphosyntactic number and morphological status. Forms of higher frequency, such as the English noun time, are typically produced with a shorter duration than forms of lower frequency, like the phonologically identical word thyme (see, e.g., Whalen Reference Whalen1991; Gahl Reference Gahl2008; Drager Reference Drager2011; Conwell Reference Conwell2018; Lohmann Reference Lohmann2018a, Reference Lohmann2018b; but see also, for conflicting results, Jurafsky, Bell & Girand Reference Jurafsky, Bell, Girand, Gussenhoven and Warner2002; Cohn et al. Reference Cohn, Brugman, Crawford and Joseph2005). Moreover, Sereno & Jongman's (Reference Sereno and Jongman1995) data suggest that the syntactic category of an item affects the acoustics of this item; they detected variation between words like answer (verb) and the respective nominal equivalent (answer). Crucially, however, Lohmann (Reference Lohmann2020) did not replicate the effect. A further variable that seems to be reflected in acoustic detail is morphosyntactic number, since Schlechtweg & Heinrichs (Reference Schlechtweg and Heinrichs2022) and Schlechtweg, Heinrichs & Linnenkohl (Reference Schlechtweg, Heinrichs, Linnenkohl and Schlechtweg2020) found that German plural nouns (e.g. Schatten ‘shadows’) are longer than the phonologically identical singular forms (e.g. Schatten ‘shadow’). A fourth example of a variable is the morphological status. Elements of morphologically complex words, like the dis prefix of the English verb discolor, differ in their acoustic properties from structures that are phonologically alike but lack a morphological function, such as dis in discover (see, e.g., Kemps et al. Reference Kemps, Ernestus, Schreuder and Baayen2005a; Kemps et al. Reference Kemps, Wurm, Ernestus, Schreuder and Harald Baayen2005b; Sugahara & Turk Reference Sugahara and Turk2009; Smith, Baker & Hawkins Reference Smith, Baker and Hawkins2012). The variable morphological status connects to several studies examining the duration of the word-final s in English. Since we also measured the s duration in our own study, we consider this aspect in more detail in the next section.

2.2 Word-final s in English

After the general overview of variables potentially affecting the acoustics of phonologically identical forms, we focus on research on the duration of the English word-final s here. A central comparison in former investigations was the duration of affixal and non-affixal s. On the one hand, there is evidence that affixal s, as in laps, is longer than non-affixal s, as in lapse (Walsh & Parker Reference Walsh and Parker1983; Schwarzlose & Bradlow Reference Schwarzlose and Bradlow2001; Song et al. Reference Song, Demuth, Evans and Shattuck-Hufnagel2013; Seyfarth et al. Reference Seyfarth, Garellek, Gillingham, Ackerman and Malouf2018). Interestingly, the opposite effect, longer non-affixal s, was found in quite a few other studies (Zimmermann Reference Zimmermann, Carignan and Tyler2016; Plag et al. Reference Plag, Homann and Kunter2017; Schmitz, Baer-Henney & Plag Reference Schmitz, Baer-Henney and Plag2021; Tomaschek et al. Reference Tomaschek, Plag, Ernestus and Baayen2021). These conflicting findings are surprising in the first instance, but there are several aspects that must be taken into account. First of all, some of the studies are limited and caution is needed when interpreting the respective data. As argued in Plag et al. (Reference Plag, Homann and Kunter2017: 185), it is difficult to evaluate Schwarzlose & Bradlow (Reference Schwarzlose and Bradlow2001) and Walsh & Parker (Reference Walsh and Parker1983), owing to a small sample size and since many decisive details, including statistical details, are not presented. Second, at closer inspection, the results are not necessarily incompatible. Tomaschek et al. (Reference Tomaschek, Plag, Ernestus and Baayen2021: 128) point to the fact that Seyfarth et al. (Reference Seyfarth, Garellek, Gillingham, Ackerman and Malouf2018) predominantly looked at the voiced s; this specific group was not only longer for affixal than for non-affixal s in Seyfarth et al. (Reference Seyfarth, Garellek, Gillingham, Ackerman and Malouf2018) but also in Plag et al. (Reference Plag, Homann and Kunter2017).

Apart from the comparison of affixal and non-affixal s, different types of affixal s have also been examined. Hsieh, Leonard & Swanson (Reference Hsieh, Leonard and Swanson1999), but not Song et al. (Reference Song, Demuth, Evans and Shattuck-Hufnagel2013), found longer plural (e.g. laps) than third-person singular s (e.g. plays), but they admit that sentence position is a potential confound: the fact that plural forms occur more often than third-person singular forms at the end of a sentence might also be responsible for the effect. Plag et al.'s (Reference Plag, Hedia, Lohmann, Zimmermann, Körtvélyessy and Stekauer2020) experiment revealed that plural-genitive s (e.g. colleagues’) is longer than plural s (e.g. colleagues). The authors consider the lower frequency of the plural-genitive to be a possible reason for this result. In a recent study, Schlechtweg & Corbett (Reference Schlechtweg and Corbett2021) concentrated on two other types of affixal s, namely the word-final s in regular plural (e.g. toggles) and pluralia tantum nouns (e.g. goggles). In a reading study, they tested 40 native speakers of English and nine pairs like toggles/goggles. The s was manually segmented and no difference in duration was detected between the groups of interest. The null effect was attributed to the fact that both regular plural and pluralia tantum nouns control morphosyntactic agreement regularly (since both take a plural verb form). However, the statistical analysis, including linear mixed-effects models, showed an interesting side effect. Before discussing the effect, let us look at the test sentences used in the experiment (see table 1).

Table 1. Test sentences used in Schlechtweg & Corbett (Reference Schlechtweg and Corbett2021)

In the study presented in Schlechtweg & Corbett (Reference Schlechtweg and Corbett2021), it was essential to control for potentially confounding variables across the two conditions regular plural and pluralia tantum nouns. One way to achieve this was by relying on the same sentences in the two conditions so that, say, toggles and goggles were read out in exactly the same environment. For the present purpose, however, we need to consider a type of variation between the different test sentences: while four were in the present tense, a past tense verb occurred in five others. VerbTense was included in the mixed effects model as a fixed effect and three criteria, outlined in Plag et al. (Reference Plag, Homann and Kunter2017: 194), showed that VerbTense played a crucial role in the study. First, after the elimination of non-significant fixed effects, VerbTense remained in the final model as a significant fixed effect with t statistics smaller than -2. Second, it turned out that VerbTense improved the fit of the model, since the model with this fixed effect was significantly different from the model without VerbTense. Third, the Akaike Information Criterion (AIC) was smaller if VerbTense was in the model, in comparison to the model without it. The robustness of the effect was indicated by the fact that different models confirmed the finding. Table 2 presents the details of one model, in which the effect of VerbTense on the s duration becomes clear.Footnote ³ The descriptive statistics showed mean values of 0.062 seconds for the sentences with present tense verbs (standard deviation (SD) = 0.016) and 0.070 seconds for those in the past (SD = 0.017).

Table 2. VerbTense in the mixed-effects model of Schlechtweg & Corbett (Reference Schlechtweg and Corbett2021)

In sum, we observe two aspects in Schlechtweg & Corbett (Reference Schlechtweg and Corbett2021), which are relevant to the current article. First, the s of the respective nouns was shorter if the sentence contained a present tense verb, in comparison to sentences with a past tense verb. That is, the s duration was reduced in the presence of overt morphosyntactic agreement, with the verb form functioning as another plurality marker. It could be that the longer s duration in sentences with a past tense verb compensates for the lack of another plurality marker. Second, as can be seen in table 1, we have to take into consideration that the groups, present tense (overt agreement) and past tense (no overt agreement), included totally different test sentences. The effect must therefore be treated with caution, and a controlled experiment needs to be designed and conducted to evaluate whether the effect is indeed real. This is the objective of the current work. Apart from the cases of noun–verb agreement just referred to, we intend to examine a second type of agreement in English, namely noun–determiner agreement, by contrasting the sentences with these, which reflects overt plural agreement between noun and determiner, to those with the, which might precede both a singular and a plural noun and hence does not signal overt number agreement. In table 1, we see that there was overt noun–determiner agreement in some (the sentences with these) but not in other sentences (the sentences with the, his, our). Although no effect of DeterminerAgreement was detected in the above-named study, we investigate this in a controlled experiment, too. Hence, in the controlled experiment, both noun–verb (present versus past tense verb) and noun–determiner agreement (these versus the) are examined.

2.3 No overt versus overt agreement: why the s might differ in duration

Before presenting and discussing the controlled experiment, this section reflects upon why distinct s durations might be theoretically plausible. On the basis of the data presented in the experiment described above and on the basis of two further reasons – the informative value and the syntagmatic probability of the s in the respective sentences – we hypothesize that overt morphosyntactic agreement leads to a reduced s. The first reason is that reduction in speech production is common for less informative, or relevant, material (see, e.g., Krasheninnikova Reference Krasheninnikova, Hollien and Hollien1979: 75; Demuth Reference Demuth, Goldsmith, Riggle and Yu2011). Engelhardt & Ferreira (Reference Engelhardt and Ferreira2014) present evidence for this idea. They contrasted the acoustic realization of necessary and unnecessary modifiers. That is, while blue in the phrase the blue triangle is necessary if triangles of different colors exist in the same context, it is unnecessary if only a single triangle is present in a given situation. It was shown that unnecessary modifiers, which do not provide an essential piece of information for the unique identification of the object (e.g. a triangle), were shorter in duration than necessary ones, which are, in turn, informative and decisive for the specification of the target object (e.g. the blue but not the purple triangle). Transferring these findings to the present project, we suggest that the s is most informative in sentences without overt plural agreement and hypothesize that its duration is longer here.

The second reason why agreement might affect the duration of the s is the concept of syntagmatic probability or predictability. A well-known idea in psycholinguistics, which has good empirical support, is that speakers tend to reduce elements in speech if they are predictable, since less articulatory effort is needed for reduced speech and since successful communication is still likely in reduced structures due to the high predictability of these structures (see, e.g., Jurafsky et al. Reference Jurafsky, Bell, Gregory, Raymond, Bybee and Hopper2001; Bell et al. Reference Bell, Jurafsky, Fosler-Lussier, Girand, Gregory and Gildea2003; Gahl & Garnsey Reference Gahl and Garnsey2004; Frank & Jaeger Reference Frank and Jaeger2008; Bell et al. Reference Bell, Brenier, Gregory, Girand and Jurafsky2009; Moore-Cantwell Reference Moore-Cantwell2013; Kurumada & Jaeger Reference Kurumada and Jaeger2015; Norcliffe & Jaeger Reference Norcliffe and Jaeger2016; Kurumada & Grimm Reference Kurumada and Grimm2017; for on overview, see also Rose Reference Rose2017: 3–4). For morphology, paradigmatic and syntagmatic predictability are kept apart (see, e.g., Cohen Reference Cohen2014; Rose Reference Rose2017). Paradigmatic predictability specifies the probability of occurrence of one particular form of a word paradigm, in contrast to the probability of occurrence of other forms of the same paradigm. Beyond this point, we do not consider paradigmatic predictability in the current paper. Instead, we focus on the concept of syntagmatic predictability, which describes how likely it is that a form occurs in a specific context or environment.

Some studies have analyzed the characteristics of s against the background of syntagmatic predictability. For Spanish, there is some, but overall inconclusive, evidence that the probability of reduction or deletion of s increases if the grammatical information expressed by the s is redundant and, hence, highly predictable (see, e.g., Poplack Reference Poplack1980; Hundley Reference Hundley1987; Erker Reference Erker2010; Torreira & Ernestus Reference Torreira and Ernestus2012). For instance, in un par de cervezas ‘a couple of beers’, the s attached to the noun is less important for the detection of the plural number value since un par de also signals plurality (see Hundley Reference Hundley1987: 893). For English, two studies are relevant in our context. Cohen (Reference Cohen2014) found, among other aspects, that the duration of the English word-final verbal s suffix, indicating singular agreement (e.g. reads), becomes shorter when the probability of singular agreement rises.

The key reference in connection to our present investigation is, however, another one. Rose (Reference Rose2017: 12–13, chapter 3) investigated the effect of syntagmatic predictability on the duration of word-final s in New Zealand English. On the basis of corpus data, she found that the s is reduced if it and the plurality of the noun are more predictable in the environment. For instance, the probability of a plural noun containing the s suffix is higher if a word like various precedes the plural noun than if a word like pretty appears. Rose's (Reference Rose2017) analysis revealed that only the preceding context (e.g. various) but not the following one has an impact on the duration of the plural s. On the one hand, her work supports our hypothesis that the s becomes longer if syntagmatic predictability is lower. On the other hand, since our own study to be presented in section 3 differs from Rose (Reference Rose2017) in several respects, it will contribute further insights into the effects of the environment on the acoustic realization of a suffix. A first, but minor, difference between Rose (Reference Rose2017) and our own analysis is the variety of English examined: while she concentrated on New Zealand English, our participants are speakers of North American English. Having access to data from more than one variety provides us with a broader picture of the subject. Second, while Rose (Reference Rose2017) restricts her analysis to the word immediately preceding or following the target plural noun, our test sentences contain only cases in which the second word before or the second word after the target plural noun represents, or does not represent, an additional plurality marker (e.g. The/These blue cabs always break/broke down). The advantage of our design is that we can exclude the potential influence of the phonetic environment on the target noun. That is, since blue is placed between the determiner and cabs, the distinct phonetic structure of the and these does not affect the acoustic realization of cabs. Third, while Rose (Reference Rose2017) relies on the automatically extracted s durations of the corpora, our data is segmented manually using a clearly defined protocol. Although her dataset is quite large, manual segmentation is overall more reliable, in particular if one considers conversational speech (see, e.g., Schiel, Draxler & Harrington Reference Schiel, Draxler and Harrington2011; Schuppler et al. Reference Schuppler, Grill, Menrath, Morales-Cordovilla, Besacier, Dediu and Martín-Vide2014). Most parts of the corpora used in Rose (Reference Rose2017) were based on interviews, which contain conversational speech. Fourth, Rose (Reference Rose2017) is not interested in the morphosyntactic phenomenon of number agreement, as we are, but collapses a quite diverse set of items that signal plurality to a greater (e.g. various, six) or smaller extent (e.g. pretty, of). We are, in contrast, specifically concerned with two types of plurality markers, namely these and present tense verbs. Fifth, Rose (Reference Rose2017) includes both the voiceless and voiced variant of the plural suffix; we, in contrast, concentrate on the voiced one only, since findings regarding the voiced /z/ are generally more homogenous than those for the voiceless /s/ (see section 2.2). Sixth, and crucially, the results from Schlechtweg & Corbett (Reference Schlechtweg and Corbett2021), which form the origin of the present study, are not compatible with those from Rose (Reference Rose2017): while she concludes that the plural s is longer if the plurality can be less predicted on the basis of the preceding word, Schlechtweg & Corbett (Reference Schlechtweg and Corbett2021) did not find an effect for the determiner, that is, the word appearing earlier than the target plural noun. Moreover, while Rose (Reference Rose2017) did not detect an effect for the word following the plural noun, Schlechtweg & Corbett (Reference Schlechtweg and Corbett2021) found evidence to suggest that the verb tense, with the verb following the noun, plays a role in that the s duration increases for past tense verb forms. These conflicting results, together with the more reliable segmentation strategy and the benefits of our less diverse and neatly controlled experiment, explains the need for the novel study presented in the next section.

3 Methodology

We conducted a study in which subjects read sentences containing English plural nouns in four different agreement conditions, created on the basis of the two factors Determiner (the versus these) and Tense (present versus past). We investigated whether the duration of the word-final s depends on overt morphosyntactic agreement.

3.1 Subjects

Thirty-eight native speakers of North American English with a mean age of 29.3 years (SD: 6.6 years) participated in the study (24 female, 14 male). They had an academic background, corrected or corrected-to-normal vision, and declared no speech disorder.

3.2 Materials

Sixteen English nouns formed the center of the materials. They were monosyllabic, regular plurals, singular-dominant (had a higher frequency in the singular than in the plural), inanimate, and contained the voiced /z/ word-finally in the plural. The nouns were embedded in 16 different test sentences, which, in turn, had the four variants given in (1).

(1)
1. (a) The blue cabs always break down.
2. (b) The blue cabs always broke down.
3. (c) These blue cabs always break down.
4. (d) These blue cabs always broke down.

The four versions of each sentence differed with respect to (i) the determiner at the beginning of the sentence (the versus these) and (ii) the verb tense (present versus past). All of the 16 test sentences and the respective variants are presented in appendix A. In (1a), the determiner the does not specify the number value of the following noun, it could be both a singular and a plural noun form. As opposed to this, the verb form in (1a), a present tense form, clearly signals plurality, since the singular noun would take the verb form breaks. In (1b), neither the determiner nor the verb form indicates plurality, and could occur not only with a plural but also with a singular noun. In (1c), both the determiner and the verb tense signal plurality. Finally, in (1d), only the determiner does so. In sum, apart from the s suffix on the target noun (e.g. cabs), there are two additional plurality markers in (1c), one in (1a) and (1d), and none in (1b).

Each of the 16 sentences contained an irregular verb with the same number of syllables in the present and past tense, resulting in four different test versions with the same length (see (1)). As illustrated in (1), the four sentence variants were only minimally different from each other. With the exception of the determiner and the verb tense, the four variants of each sentence were exactly identical and we therefore controlled our test materials for syntactic, phonological and phonetic aspects. The s suffix and the target noun, whose durations were measured in the analysis, were placed in the same sentence type and position, and between the same words. Doing so, we further controlled for bigram frequencies of the sequences ‘preceding word + target noun’ and ‘target noun + following word’.

3.3 Procedure

The experiment was conducted in a silent room. Subjects were seated about 30 centimeters (12 inches) from a large-diaphragm condenser microphoneFootnote ⁴ and 60 centimeters (24 inches) from a computer screen.Footnote ⁵ The sentences were read silently first and then aloud while the subjects were recorded with Praat (Boersma & Weenink Reference Boersma and Weenink2020). All sentences were left-aligned, appeared in a single line in the middle of the screen, and were written in the same font type and size.

Participants produced each of the 16 test sentences in the four conditions introduced in (1), reading out a total of 64 test sentences. Subjects therefore served as their own control, and we balanced the study for the issue of inter-subject variation. Moreover, we included 64 filler sentences in order to minimize the influence of one version of a sentence on the same sentence in another condition. A further 31 sentences were placed between one version of a sentence (e.g. (1a)) and the next variant of the same sentence (e.g. (1b)). The order of the four experimental conditions described in (1) was counterbalanced both within and across subjects. Also, the order of the items varied across participants.

3.4 Data analysis

3.4.1 Data preparation and segmentation

A total of 2,432 test cases (38 subjects x 64 test cases per subject) were part of the experiment. The dataset was reduced by 98 files (4%) due to slips of the tongue and technical problems. The remaining 2,334 sound files were phonetically segmented in Praat. All productions of a particular noun (e.g. cabs) from the same speaker were analyzed together in order to increase the segmentation consistency. Both the spectrogram and the waveform were used to detect the beginning and end of the word-final [z]. Spectrum settings of 5,000 to 11,000 Hertz (Hz) facilitated the recognition of the fricatives. We relied on the acoustic characteristics of the fricative and segmentation steps from the literature to develop an appropriate segmentation strategy (see, e.g., Ladefoged & Maddieson Reference Ladefoged and Maddieson1996; Ladefoged Reference Ladefoged2003; Turk, Nakai & Sugahara Reference Turk, Satsuki Nakai, Sugahara, Sudhoff, Lenertová, Meyer, Pappert, Augurzky, Mleinek, Richter and Schließer2006; Machač & Skarnitzl Reference Machač and Skarnitzl2009; Schlechtweg & Härtl Reference Schlechtweg and Härtl2020), which was the same as the one used in Schlechtweg & Corbett (Reference Schlechtweg and Corbett2021) (see also figure 1). That is, increased energy in the higher frequencies, visible in the spectrogram, functioned as the primary criterion to find the beginning and end of the target fricative. Visible fricative noise in the waveform represented the second criterion. If the two criteria did not coincide, priority was given to the primary one.

Figure 1. Segmentation of [z] using waveform (top), spectrogram (middle) and Praat TextGrid (bottom). Taken from Schlechtweg & Corbett (Reference Schlechtweg and Corbett2021) (with permission)

3.4.2 Statistical analysis and modeling

Having segmented the sound files, we first considered the simple descriptive statistics of the data. In a neatly controlled study like ours, these values give us a first idea of how the different conditions behave. Further, we relied on the program R (R Core Team 2021), the lme4 package (Bates et al. Reference Bates, Maechler, Bolker and Walker2015), and the lmerTest package (Kuznetsova et al. Reference Kuznetsova, Brockhoff, Bojesen Christensen and Jensen2020) to statistically analyze the data using linear mixed effects models (see, e.g., Winter Reference Winter2020).Footnote ⁶ Models were fitted for the two response variables DurationSuffix (= absolute s duration) and RelativeDurationSuffix, the latter being defined as the quotient of the absolute s duration and the absolute word duration.Footnote ⁷ For each of the two response variables, the following steps were implemented.

Statistical outliers in the absolute or relative s durations, defined as values plus and minus 2.5 standard deviations from the mean (see, e.g., Loewen & Plonsky Reference Loewen and Plonsky2016: 134), were discarded from the dataset. The s durations were then log transformed (to the base 10). Determiner (the versus these), Tense (present versus past) and their interaction were entered as the central fixed effects in the models. Log10SpeechRate_z, the log-transformed (to the base 10), centered and standardized speech rate, represented a control fixed effect. Speech rate refers to the quotient of the number of syllables of the whole sentence and the duration of the sentence measured in seconds. We further included Log10Frequency_z, the log-transformed (to the base 10), centered and standardized frequency of the target nouns as specified in the Google Books Ngram Viewer (https://books.google.com/ngrams) for American English, and Bigram_z, the centered and standardized counts of the sequence ‘target noun + following word’ (e.g. cabs always) in the Google Books Ngram Viewer, in the initial model. Due to zeroes in the dataset, the bigrams were not log transformed. Note that our experiment had actually been controlled for many aspects prior to the study. Since we used the same nouns and sentences in all conditions (with the exception of the/these and the verb tense), the frequencies and bigrams were balanced across the conditions. Nevertheless, we examined whether the two play a role overall in that, for instance, higher frequency triggers shorter s durations. Since we are interested in the duration of the suffix / the end of the word, we consider the bigram ‘target noun + following word’ only (and not the bigram ‘preceding word + target noun’).

For each response variable, we started model fitting with a model with the maximal random effects structure, consisting of the intercepts for Subject and Item and the four random slopes for Determiner by Subject, Determiner by Item, Tense by Subject and Tense by Item. Three of these four random slopes did not remain in the model since the maximal and the other random effects structures (i.e., those with the two intercepts and three, two or one random slope(s)) were not appropriate (‘Singular fit’ issue) and therefore manually and in a step-by-step manner simplified. It is well known from the literature that complex random effects structures can cause problems (see, e.g., Barr et al. Reference Barr, Levy, Scheepers and Tily2013; Matuschek et al. Reference Matuschek, Kliegl, Vasishth, Baayen and Bates2017; Cohen & Kang Reference Cohen and Kang2018; Martin Schweinberger p.c.), hence we opted for the reduced model. In the analysis of the absolute s durations, the only model containing (a) random slope(s) that was appropriate was the one with the slope for Determiner by Item; in the analysis of the relative s durations, it was the model with the slope for Determiner by Subject. The two random intercepts were part of these models, too.

The models containing the fixed and random effects structure as specified above were then reduced step by step by removing non-significant fixed effects from the model. Non-significant factors were excluded on the basis of the R column ‘Pr(>|t|)’, removing the factor with the highest value and a value greater than 0.05 at each step.Footnote ⁸ Once we had a model with significant fixed effects only, we additionally verified whether the criteria mentioned in Plag et al. (Reference Plag, Homann and Kunter2017: 194) went in the same direction. Plag et al. (Reference Plag, Homann and Kunter2017: 194) relied on three criteria, or tests, to decide whether a specific factor remained in the model. The first criterion refers to the t-statistics, which had to be greater than 2 or smaller than -2 for a factor to remain in the model. Moreover, a significant improvement of the fit of the model should occur if the factor is part of the model, in comparison to the model without the factor, and this would be indicated by a p value smaller than .05 when contrasting the model with and the model without the respective factor in an ANOVA. Finally, the Akaike Information Criterion (AIC) needed to be smaller if the factor was in the model, in comparison to the model without the factor (see also, e.g., Pinheiro & Bates Reference Pinheiro and Bates2000: 10; Wu Reference Wu2010: 90).

After completion of the manual reduction of the model, we additionally performed an automatic elimination of the non-significant fixed effects using the step function of the lmerTest package (Kuznetsova et al. Reference Kuznetsova, Brockhoff, Bojesen Christensen and Jensen2020; see also, e.g., Lohmann Reference Lohmann2020: 436) to see whether the result is the same.

3.5 Results

Figures 2 to 7 summarize the descriptive statistics of the datasets without statistical outliers, for the absolute and relative s durations, respectively.

Figure 2. Error bars of absolute s durations, 95 percent confidence intervals, the diamond symbols represent the means, reduced dataset without statistical outliers (2,307 values)Footnote ⁹

Figure 3. Error bars of absolute s durations, 95 percent confidence intervals, the diamond symbols represent the means, reduced dataset without statistical outliers (2,307 values)

Figure 4. Error bars of absolute s durations, 95 percent confidence intervals, the diamond symbols represent the means, reduced dataset without statistical outliers (2,307 values)

Figure 5. Error bars of relative s durations, 95 percent confidence intervals, the diamond symbols represent the means, reduced dataset without statistical outliers (2,314 values)

Figure 6. Error bars of relative s durations, 95 percent confidence intervals, the diamond symbols represent the means, reduced dataset without statistical outliers (2,314 values)

Figure 7. Error bars of relative s durations, 95 percent confidence intervals, the diamond symbols represent the means, reduced dataset without statistical outliers (2,314 values)

Overall, the differences between the mean values of the individual groups are subtle, and in some cases, there is no difference at all. In an additional step of the descriptive analysis, we examine how consistent and stable the results detected so far are by using a method applied in Durvasula & Liter (Reference Durvasula and Liter2020: 197–8) (see also Schlechtweg & Corbett Reference Schlechtweg and Corbett2021). For this purpose, consider figure 8. We see the cumulative absolute suffix durations of the four conditions for the 38 subjects. That is, ‘1’ on the x-axis refers to the average absolute suffix durations of the four conditions of the first subject only. ‘6’, however, does not simply refer to the sixth subject, but to the cumulative average absolute suffix durations of the four conditions of the first six subjects. Looking at this graph, we get an idea of the development of our results with more and more subjects. We see that the development of the four conditions is comparable and homogeneous starting approximately at ‘21’ on the x-axis. Put differently, once 21 subjects had been tested, the curves of the conditions developed in more or less the same way. On the basis of this figure, we have no reason to assume that drastic changes between the conditions would arise if more subjects participated in the experiment. Hence, we can say that the picture drawn above is robust, and the differences across the conditions are consistently small.

Figure 8. Cumulative mean s durations by subjects in seconds

To sum up our findings so far, we can say that the differences between groups are either small or absent, and this trend is stable and robust. Nevertheless, an inferential statistical analysis is still needed to verify whether the differences are significant, even if they are small. Further, taking a potential influence of speech rate into account is essential, even in a thoroughly controlled experiment. The results for the fixed effects of our final mixed-effects models, after the exclusion of non-significant fixed effects, are given in tables 3 and 4; the results for the random effects are given in appendices B and C. Note that the same fixed-effects structures were found in the automatic analysis with the step function.

Table 3. Fixed-effects statistics of the mixed-effects model of absolute s durations in seconds

Table 4. Fixed-effects statistics of the mixed-effects model of relative s durations

First of all, and unsurprisingly, the s duration decreases with increasing speech rate, which is expressed in the negative estimate for Log10SpeechRate_z. This holds for both the analysis of absolute and the analysis of relative durations. Second, and interestingly, we detect an effect of Determiner in the analysis of the absolute s durations, that is, s durations are longer when the appears as the determiner in contrast to when these occurs. The difference is expressed by the fact that the estimate of the log transformed s duration of Determinerthese is negative and thus smaller than the intercept, which represents the baseline Determinerthe. The difference, if back-transformed from the logarithm, is about 0.0036 seconds. The criteria mentioned in Plag et al. (Reference Plag, Homann and Kunter2017: 194) support the findings, both for Determiner and Log10SpeechRate_z. That is, the t statistics of the significant fixed effects are smaller than -2, each factor significantly improves the fit of the model, and the AIC is smaller if the factor is part of the model. Hence, we can state that (i) the s is longer in absolute terms in combination with the in comparison to these and (ii) the s duration increases with decreasing speech rate.

4 Summary and discussion

Previous research has shown that the duration of the English word-final s depends on both its function and its context. There are two competing factors here. On the one hand, variation exists between different types of s, such as affixal and non-affixal s, or different types of affixal s (see, e.g., Plag et al. Reference Plag, Homann and Kunter2017). On the other hand, reduction and lengthening of the s is connected to its predictability in a given context (see, e.g., Rose Reference Rose2017). The current article expanded research of the second type and examined whether overt morphosyntactic agreement affects the duration of affixal s. Two major results emerged in the analyses. First, noun–verb agreement did not affect the s duration. That is, the suffix duration on the noun did not differ when there was a past tense verb form following (hence no overt agreement) as compared to when there was a present tense verb (hence overt agreement). The noun–verb agreement effect was found in Schlechtweg & Corbett (Reference Schlechtweg and Corbett2021) but could not be replicated here. It is possible that the effect detected in this earlier study derived from other differences in the test sentences. Crucially, the current study was carefully controlled for such possible factors and included a much larger dataset, and the effect did not arise. Second, noun–determiner agreement did affect the duration of the s in the expected direction. This effect, a subtle one, occurred in the analysis of the absolute duration. If the preceded a plural noun (no overt noun–determiner agreement present), the s was longer than if these was used (overtly agreeing with the plural noun). In sum, our experiment gives slight evidence in favor of the idea that the s is reduced if the plural noun has an agreeing determiner (these). Noun–verb agreement, in contrast, has no impact on the duration of s.

There are two aspects which force us to interpret the results with caution: first, the differences between the the and these conditions are small and, second, significance between the two was only reached in the analysis of the absolute durations. Without an effect of relative durations, we do not have evidence that the percentage the suffix takes within the word increases if the precedes the target noun. Nevertheless, we must keep in mind that our results are based on a large dataset (2,307 test cases in the absolute duration analysis and 2,314 test cases in the relative duration analysis), which increases the reliability of the findings. Looking at other comparable studies on the duration of the English s, we see that our experiment is far more comprehensive than the investigations conducted by, for instance, Walsh & Parker (Reference Walsh and Parker1983), Schwarzlose & Bradlow (Reference Schwarzlose and Bradlow2001), Plag et al. (Reference Plag, Homann and Kunter2017), Seyfarth et al. (Reference Seyfarth, Garellek, Gillingham, Ackerman and Malouf2018), Schmitz et al. (Reference Schmitz, Baer-Henney and Plag2021) and Schlechtweg & Corbett (Reference Schlechtweg and Corbett2021). Therefore, we believe that the effect we detected for the absolute suffix durations is not irrelevant and is discussed in more detail below.

Several established psycholinguistic and linguistic models with a feed forward spirit (e.g. Levelt Reference Levelt1989) have been criticized on the basis of empirical data over the last two decades. Their theoretical conceptions seem to be too rigid and inflexible when it comes to the interplay of different types of linguistic information and cannot explain, for instance, why the acoustic output is affected by morphological complexity, since no connection between the two domains is assumed in such theories. The effect detected in the present experiment is equally incompatible with models of the above-named character. In a strict feed forward world, the word form of the English plural noun would be created, its discrete phonological units would be specified, and the acoustic sequence would be realized. Since the phonological structure is identical independently of whether the or these precedes the noun in the sentence, no acoustic distinctions are expected. There is some evidence for a contrast in our study, however, and this calls for a more flexible approach, as described in, for instance, Dell (Reference Dell1986) and Pierrehumbert (Reference Pierrehumbert, Bybee and Hopper2001, Reference Pierrehumbert, Gussenhoven and Warner2002), allowing the possibility that higher-order domains such as morphosyntax can have a direct connection to phonetics and the concrete realization of a word or word part.

The direction of the determiner effect, with the leading to a longer s, finds support in the literature. In a phrase containing the, the s is more informative in that it signals the number value alone, or, more precisely, without an additional plurality indicator on the determiner. In contrast, if these precedes a plural noun, it already specifies that the following noun is a plural one and the s does not contribute a new piece of information. Previous literature has shown that more informative elements are lengthened (e.g. Engelhardt & Ferreira Reference Engelhardt and Ferreira2014), and this is what happens in our data, too: if the s is preceded by the and plays the crucial role in the expression of plurality, it is longer, in comparison to cases with these in the determiner position. Considering syntagmatic predictability, there has been evidence that the s is enhanced if it is less predictable (e.g. Rose Reference Rose2017). So, if words like various precede a regular plural noun, they tell us that the noun must contain the s and the s can be reduced. Other words, like pretty, are neutral in turn and do not predict the occurrence of s, which is therefore likely to be lengthened. Again, our effect fits in nicely here, since the s turned out to be longer if the determiner (the) did not predict its occurrence.

Thus while the effect that we report is surprising, it has a reassuring regularity. In the current experiment the effect is found with a plural determiner but not with a past tense verb. The reverse would have been truly remarkable: it would imply that the length of affixal s is affected by the presence or absence of overt agreement on the verb, which is still to be pronounced. What is the possible basis for the difference between our result for attributive agreement and predicate agreement? There are two candidates: syntactic structure and linear precedence. In our sentence these blue cabs always break down, the determiner these is within the same nominal phrase as cabs, while break is more distant syntactically. Equally, these precedes cabs while break follows. Both syntactic structure and linear precedence are well established as affecting agreement (Corbett Reference Corbett2006: 180, 206–30), and could explain why we detected an effect for Determiner but not for Tense.

5 Conclusion

It is by now well known that fine acoustic detail can mirror different types of linguistic information. A case in point in this research area is the duration of the English word-final s, which has been shown to be modulated by speakers on the basis of both its function and context. On the functional side, affixal and non-affixal s differ, and even distinct types of affixal s are heterogenous. The current experiment adds a further piece of evidence supporting the idea that the s duration is also adjusted in specific contexts: overt noun–determiner agreement leads to a reduction of the s. The effect is subtle, of course, and needs to be replicated, but is compatible with research arguing for a more significant and flexible role of the acoustic output in language.

Appendix A: Test sentences

The blue cabs always break down.

The blue cabs always broke down.

These blue cabs always break down.

These blue cabs always broke down.

The large tags really make the price clear.

The large tags really made the price clear.

These large tags really make the price clear.

These large tags really made the price clear.

The ripe pears usually fall from the tree.

The ripe pears usually fell from the tree.

These ripe pears usually fall from the tree.

These ripe pears usually fell from the tree.

The wide screens eventually become useless.

The wide screens eventually became useless.

These wide screens eventually become useless.

These wide screens eventually became useless.

The old cars unfortunately have mechanical problems.

The old cars unfortunately had mechanical problems.

These old cars unfortunately have mechanical problems.

These old cars unfortunately had mechanical problems.

The thick nails easily hold up the picture on the wall.

The thick nails easily held up the picture on the wall.

These thick nails easily hold up the picture on the wall.

These thick nails easily held up the picture on the wall.

The short rides regularly take an hour.

The short rides regularly took an hour.

These short rides regularly take an hour.

These short rides regularly took an hour.

The cheap creams often sting her skin.

The cheap creams often stung her skin.

These cheap creams often sting her skin.

These cheap creams often stung her skin.

The rough waves regularly shake the beach house.

The rough waves regularly shook the beach house.

These rough waves regularly shake the beach house.

These rough waves regularly shook the beach house.

The soft plums already stink.

The soft plums already stank.

These soft plums already stink.

These soft plums already stank.

The big stones immediately sink in the lake.

The big stones immediately sank in the lake.

These big stones immediately sink in the lake.

These big stones immediately sank in the lake.

The new trains clearly speak for themselves.

The new trains clearly spoke for themselves.

These new trains clearly speak for themselves.

These new trains clearly spoke for themselves.

The deep ponds always freeze during the winter.

The deep ponds always froze during the winter.

These deep ponds always freeze during the winter.

These deep ponds always froze during the winter.

The dried figs amazingly grow sweeter and sweeter.

The dried figs amazingly grew sweeter and sweeter.

These dried figs amazingly grow sweeter and sweeter.

These dried figs amazingly grew sweeter and sweeter.

The small phones unexpectedly ring very loudly.

The small phones unexpectedly rang very loudly.

These small phones unexpectedly ring very loudly.

These small phones unexpectedly rang very loudly.

The thin pads often fall out.

The thin pads often fell out.

These thin pads often fall out.

These thin pads often fell out.

Appendix B: Random effects statistics of the mixed-effects model of absolute s durations in seconds

Appendix C: Random effects statistics of the mixed-effects model of relative s durationsFootnote ¹⁰

Footnotes

Schlechtweg is the principal author.

² See also, for instance, Plag, Homann & Kunter (Reference Plag, Homann and Kunter2017); Schlechtweg & Corbett (Reference Schlechtweg and Corbett2021).

³ Statistical outliers (s durations) were excluded and the s durations were then log transformed (to the base 10).

⁴ Røde NT USB (transmission range: 20 Hz to 20 kHz; limit sound pressure level: 110 dB SPL).

⁵ Acer Aspire (15.6 inch / 39.6 cm).

⁶ The tidyverse package (Wickham et al. Reference Wickham2019) was also involved in the data analysis.

⁷ We relied on fit by maximum likelihood during the model fitting process (see, e.g., Field et al. Reference Field, Miles and Field2012: 879).

⁸ If Determiner or Tense showed the highest value at a step, the non-significant interaction of the two was removed first.

⁹ This and all of the following figures were created in Minitab (Minitab 2019).

¹⁰ Since the fixed effect Determiner was not significant and therefore not part of the final model, the random slope for Determiner by Subject did not remain in the model either.

References

Barr, Dale J., Levy, Roger, Scheepers, Christoph & Tily, Harry J.. 2013. Random effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and Language 68(3), 255–78.CrossRef Google Scholar PubMed

Bates, Douglas, Maechler, Martin, Bolker, Ben & Walker, Steve. 2015. Fitting linear mixed-effects models using lme4. Version 1.1.27.1. Journal of Statistical Software 67(1), 1–48.Google Scholar

Bell, Alan, Brenier, Jason M., Gregory, Michelle, Girand, Cynthia & Jurafsky, Daniel. 2009. Predictability effects on durations of content and function words in conversational English. Journal of Memory and Language 60, 92–111.CrossRef Google Scholar

Bell, Alan, Jurafsky, Daniel, Fosler-Lussier, Eric, Girand, Cynthia, Gregory, Michelle & Gildea, Daniel. 2003. Effects of disfluencies, predictability, and utterance position on word form variation in English conversation. Journal of the Acoustical Society of America 113(2), 1001–24.CrossRef Google Scholar PubMed

Bermúdez-Otero, Ricardo. 2018. Stratal phonology. In Hannahs, S. J. & Bosch, Anna R. K. (eds.), The Routledge handbook of phonological theory, 100–34. New York: Routledge.Google Scholar

Boersma, Paul & Weenink, David. 2020. Praat: Doing phonetics by computer (version 6.1.16). [Computer program]. Retrieved from www.praat.org Google Scholar

Chomsky, Noam & Halle, Morris. 1968. The sound pattern of English. New York: Harper & Row.Google Scholar

Cohen, Clara. 2014. Probabilistic reduction and probabilistic enhancement: Contextual and paradigmatic effects on morpheme pronunciation. Morphology 24, 291–323.CrossRef Google Scholar

Cohen, Clara & Kang, Shinae. 2018. Flexible perceptual sensitivity to acoustic and distributional cues. The Mental Lexicon 13(1), 38–73.CrossRef Google Scholar

Cohn, Abigail C., Brugman, Johanna, Crawford, Clifford & Joseph, Andrew. 2005. Lexical frequency effects and phonetic duration of English homophones: An acoustic study. The Journal of the Acoustical Society of America 118, 2036.CrossRef Google Scholar

Conwell, Erin. 2018. Token frequency effects in homophone production: An elicitation study. Language and Speech 61(3), 466–79.CrossRef Google Scholar PubMed

Corbett, Greville G. 2006. Agreement. Cambridge: Cambridge University Press.Google Scholar

Dell, Gary S. 1986. A spreading-activation theory of retrieval in sentence production. Psychological Review 93(3), 283–321.Google Scholar PubMed

Demuth, Katherine. 2011. The acquisition of phonology. In Goldsmith, John, Riggle, Jason & Yu, Alan C. L. (eds.), The handbook of phonological theory, 2nd edn, 571–95. Malden, MA: Wiley Blackwell.CrossRef Google Scholar

Drager, Katie K. 2011. Sociophonetic variation and the lemma. Journal of Phonetics 39, 694–707.CrossRef Google Scholar

Durvasula, Karthik & Liter, Adam. 2020. There is a simplicity bias when generalising from ambiguous data. Phonology 37, 177–213.CrossRef Google Scholar

Engelhardt, Paul E. & Ferreira, Fernanda. 2014. Do speakers articulate over-described modifiers differently from modifiers that are required by context? Implications for models of reference production. Language, Cognition and Neuroscience 29(8), 975–85.CrossRef Google Scholar

Erker, Daniel. G. 2010. A subsegmental approach to coda /s/ weakening in Dominican Spanish. International Journal of the Sociology of Language 203, 9–26.Google Scholar

Field, Andy, Miles, Jeremy & Field, Zoë. 2012. Discovering statistics using R. Los Angeles, CA: Sage.Google Scholar

Frank, Austin F. & Jaeger, T. Florian. 2008. Speaking rationally: Uniform information density as an optimal strategy for language production. Proceedings of the Annual Meeting of the Cognitive Science Society 30, 939–44.Google Scholar

Fromkin, Victoria A. 1971/1973. The non-anomalous nature of anomalous utterances. Language 47(1), 27–52. Reprinted in Victoria A. Fromkin (ed.), Speech errors as linguistic evidence (1973; Janua Linguarum 77), 215–42. The Hague: Mouton.CrossRef Google Scholar

Gahl, Susanne. 2008. Time and thyme are not homophones: The effect of lemma frequency on word durations in spontaneous speech. Language 84(3), 474–96.CrossRef Google Scholar

Gahl, Susanne & Garnsey, Susan M.. 2004. Knowledge of grammar, knowledge of usage: Syntactic probabilities affect pronunciation variation. Language 80(4), 748–75.CrossRef Google Scholar

Harley, Trevor A. 1984. A critique of top-down independent levels models of speech production: Evidence from non-plan-internal speech errors. Cognitive Science 8, 191–219.CrossRef Google Scholar

Hsieh, Li, Leonard, Laurence B. & Swanson, Lori. 1999. Some differences between English plural noun inflections and third singular verb inflections in the input: The contributions of frequency, sentence position, and duration. Journal of Child Language 26(3), 531–43.CrossRef Google Scholar PubMed

Hundley, James E. 1987. Functional constraints on plural marker deletion in Peruvian Spanish. Hispania 70(4), 891–4.CrossRef Google Scholar

Jurafsky, Daniel, Bell, Alan & Girand, Cynthia. 2002. The role of the lemma in form variation. In Gussenhoven, Carlos & Warner, Natasha (eds.), Laboratory phonology 7 (Phonology and Phonetics 4–1), 3–34. Berlin: Mouton de Gruyter.Google Scholar

Jurafsky, Daniel, Bell, Alan, Gregory, Michelle & Raymond, William D.. 2001. Probabilistic relations between words: Evidence from reduction in lexical production. In Bybee, Joan L. & Hopper, Paul J. (eds.), Frequency and the emergence of linguistic structure (Typological Studies in Language 45), 229–54. Amsterdam: John Benjamins.CrossRef Google Scholar

Kemps, Rachèl J. J. K., Ernestus, Mirjam, Schreuder, Robert & Baayen, R. Harald. 2005a. Prosodic cues for morphological complexity: The case of Dutch plural nouns. Memory & Cognition 33(3), 430–46.CrossRef Google Scholar PubMed

Kemps, Rachèl J. J. K., Wurm, Lee H., Ernestus, Mirjam, Schreuder, Robert & Harald Baayen, R.. 2005b. Prosodic cues for morphological complexity in Dutch and English. Language and Cognitive Processes 20(1–2), 43–73.CrossRef Google Scholar

Kiparsky, Paul. 1982. From cyclic phonology to lexical phonology (Part 1). In van der Hulst, Harry & Smith, Norval (eds.), The structure of phonological representations, 131–76. Dordrecht: Foris.Google Scholar

Krasheninnikova, E. A. 1979. Phonetic aspects of lingua-informatics. In Hollien, Harry & Hollien, Patricia (eds.), Current issues in the phonetic sciences: Proceedings of the IPS-77 congress, Miami Beach, Florida, 17–19 December 1977 (Current Issues in Linguistic Theory 9), 71–6. Amsterdam: John Benjamins.CrossRef Google Scholar

Kurumada, Chigusa & Grimm, Scott. 2017. Communicative efficiency in language production and learning: Optional plural marking. Proceedings of the 39th Annual Meeting of the Cognitive Science Society (CogSci 2017, London).Google Scholar

Kurumada, Chigusa & Jaeger, T. Florian. 2015. Communicative efficiency in language production: Optional case-marking in Japanese. Journal of Memory and Language 83, 152–78.CrossRef Google Scholar

Kuznetsova, Alexandra, Brockhoff, Per Bruun, Bojesen Christensen, Rune Haubo & Jensen, Sofie Pødenphant. 2020. Package lmerTest. Tests in linear mixed effects models, version 3.1.3. https://github.com/runehaubol/lmerTestR/issues Google Scholar

Ladefoged, Peter. 2003. Phonetic data analysis: An introduction to fieldwork and instrumental techniques. Malden, MA: Blackwell.Google Scholar

Ladefoged, Peter & Maddieson, Ian. 1996. The sounds of the world's languages. Malden, MA: Blackwell.Google Scholar

Levelt, Willem J. M. 1989. Speaking: From intention to articulation. Cambridge, MA: MIT Press.Google Scholar

Levelt, Willem J. M. 1995. The ability to speak: From intensions to spoken words. European Review 3(1), 13–23.CrossRef Google Scholar

Levelt, Willem J. M., Roelofs, Ardi & Meyer, Antje S.. 1999. A theory of lexical access in speech production. Behavioral and Brain Sciences 22, 1–75.CrossRef Google Scholar PubMed

Loewen, Shawn & Plonsky, Luke. 2016. An A–Z of applied linguistics research methods. London: Palgrave.CrossRef Google Scholar

Lohmann, Arne. 2018a. Cut (N) and cut (V) are not homophones: Lemma frequency affects the duration of noun–verb conversion pairs. Journal of Linguistics 54, 753–77.CrossRef Google Scholar

Lohmann, Arne. 2018b. Time and thyme are NOT homophones: A closer look at Gahl's work on the lemma frequency effect, including a reanalysis. Language 94(2), e180–e190.CrossRef Google Scholar

Lohmann, Arne. 2020. No acoustic correlates of grammatical class: A critical re-examination of Sereno and Jongman (1995). Phonetica 77, 429–40.CrossRef Google Scholar

Machač, Pavel & Skarnitzl, Radek. 2009. Principles of phonetic segmentation. Prague: Epocha Publishing House.Google Scholar

Matuschek, Hannes, Kliegl, Reinhold, Vasishth, Shravan, Baayen, Harald & Bates, Douglas. 2017. Balancing type I error and power in linear mixed models. Journal of Memory and Language 94, 305–15.CrossRef Google Scholar

Minitab. 2019. Minitab 19 [Computer program]. www.minitab.com Google Scholar

Moore-Cantwell, Claire. 2013. Syntactic predictability influences duration. Proceedings of Meetings on Acoustics 19, 060206.CrossRef Google Scholar

Norcliffe, Elisabeth & Jaeger, T. Florian. 2016. Predicted head-marking variability in Yucatan Maya relative clause production. Language and Cognition 8, 167–205.CrossRef Google Scholar

Pierrehumbert, Janet B. 2001. Exemplar dynamics: Word frequency, lenition and contrast. In Bybee, Joan L. & Hopper, Paul J. (eds.), Frequency and the emergence of linguistic structure (Typological Studies in Language 45), 137–57. Amsterdam: John Benjamins.CrossRef Google Scholar

Pierrehumbert, Janet B. 2002. Word-specific phonetics. In Gussenhoven, Carlos & Warner, Natasha (eds.), Laboratory phonology 7 (Phonology and Phonetics 4–1), 101–40. Berlin: Mouton de Gruyter.Google Scholar

Pinheiro, José C. & Bates, Douglas M.. 2000. Mixed-effects models in S and S-PLUS. New York: Springer.CrossRef Google Scholar

Plag, Ingo, Hedia, Sonia Ben, Lohmann, Arne & Zimmermann, Julia. 2020. An <s> is an <s’>, or is it? Plural and genitive-plural are not homophonous. In Körtvélyessy, Livia & Stekauer, Pavel (eds.), Complex words: Advances in morphology, 260–92. Cambridge: Cambridge University Press.CrossRef Google Scholar

Plag, Ingo, Homann, Julia & Kunter, Gero. 2017. Homophony and morphology: The acoustics of word-final S in English. Journal of Linguistics 53, 181–216.CrossRef Google Scholar

Poplack, Shana. 1980. Deletion and disambiguation in Puerto Rican Spanish. Language 56(2), 371–85.CrossRef Google Scholar

R Core Team. 2021. R: A language and environment for statistical computing. R version 4.0.5. Vienna: R Foundation for Statistical Computing. www.R-project.org Google Scholar

Roelofs, Ardi. 1997. The WEAVER model of word-form encoding in speech production. Cognition 64, 249–84.CrossRef Google Scholar PubMed

Rose, Darcy Elizabeth. 2017. Predicting plurality: An examination of the effects of morphological predictability on the learning and realization of bound morphemes. PhD dissertation, University of Canterbury, Christchurch, New Zealand.Google Scholar

Schiel, Florian, Draxler, Christoph & Harrington, Jonathan. 2011. Phonemic segmentation and labelling using the MAUS technique. Workshop New Tools and Methods for Very-Large-Scale Phonetics Research, University of Pennsylvania, 28–31 January 2011.Google Scholar

Schlechtweg, Marcel & Corbett, Greville G.. 2021. The duration of word-final s in English: A comparison of regular-plural and pluralia-tantum nouns. Morphology 31(4), 383–407.CrossRef Google Scholar

Schlechtweg, Marcel & Härtl, Holden. 2020. Do we pronounce quotation? An analysis of name-informing and non-name-informing contexts. Language and Speech 63(4), 769–98.CrossRef Google Scholar PubMed

Schlechtweg, Marcel & Heinrichs, Melina. 2022. The acoustics of number: Duration differences in singular-plural syncretism. Sprachwissenschaft 47(1), 77–102.Google Scholar

Schlechtweg, Marcel, Heinrichs, Melina & Linnenkohl, Marcel. 2020. Differences in acoustic detail: The realization of syncretic nouns in German. In Schlechtweg, Marcel (ed.), The learnability of complex constructions: A cross-linguistic perspective (Trends in Linguistics. Studies and Monographs 345), 39–62. Berlin: De Gruyter Mouton.CrossRef Google Scholar

Schmitz, Dominic, Baer-Henney, Dinah & Plag, Ingo. 2021. The duration of word-final /s/ differs across morphological categories in English: Evidence from pseudowords. Phonetica 78(5–6), 571–616.CrossRef Google Scholar PubMed

Schuppler, Barbara, Grill, Sebastian, Menrath, André & Morales-Cordovilla, Juan A.. 2014. Automatic phonetic transcription in two steps: Forced alignment and burst deletion. In Besacier, Laurent, Dediu, Adrian-Horia & Martín-Vide, Carlos (eds.), Statistical language and speech processing: Second International Conference, SLP 2014 Grenoble, France, October 14-16, 2014 Proceedings, 132–46. Cham: Springer.CrossRef Google Scholar

Schwarzlose, Rebecca & Bradlow, Ann R.. 2001. What happens to segment durations at the end of a word? The Journal of the Acoustical Society of America 109, 2292.CrossRef Google Scholar

Sereno, Joan A. & Jongman, Allard. 1995. Acoustic correlates of grammatical class. Language and Speech 38(1), 57–76.CrossRef Google Scholar

Seyfarth, Scott, Garellek, Marc, Gillingham, Gwendolyn, Ackerman, Farrell & Malouf, Robert. 2018. Acoustic differences in morphologically-distinct homophones. Language, Cognition and Neuroscience 33(1), 32–49.CrossRef Google Scholar

Smith, Rachel, Baker, Rachel & Hawkins, Sarah. 2012. Phonetic detail that distinguishes prefixed from pseudo-prefixed words. Journal of Phonetics 40, 689–705.CrossRef Google Scholar

Song, Jae Yung, Demuth, Katherine, Evans, Karen & Shattuck-Hufnagel, Stefanie. 2013. Durational cues to fricative codas in 2-year-olds’ American English: Voicing and morphemic factors. The Journal of the Acoustical Society of America 133, 2931–46.CrossRef Google Scholar PubMed

Sugahara, Mariko & Turk, Alice. 2009. Durational correlates of English sublexical constituent structure. Phonology 26(3), 477–524.CrossRef Google Scholar

Tomaschek, Fabian, Plag, Ingo, Ernestus, Mirjam & Baayen, R. Harald. 2021. Phonetic effects of morphology and context: Modeling the duration of word-final S in English with naïve discriminative learning. Journal of Linguistics 57, 123–61.CrossRef Google Scholar

Torreira, Francisco & Ernestus, Mirjam. 2012. Weakening of intervocalic /s/ in the Nijmegen Corpus of Casual Spanish. Phonetica 69, 124–48.CrossRef Google Scholar PubMed

Turk, Alice, Satsuki Nakai, & Sugahara, Mariko. 2006. Acoustic segment durations in prosodic research: A practical guide. In Sudhoff, Stefan, Lenertová, Denisa, Meyer, Roland, Pappert, Sandra, Augurzky, Petra, Mleinek, Ina, Richter, Nicole & Schließer, Johannes (eds.), Methods in empirical prosody research (Language, Context, and Cognition 3), 1–27. Berlin: Walter de Gruyter.Google Scholar

Walsh, Thomas & Parker, Frank. 1983. The duration of morphemic and non-morphemic /s/ in English. Journal of Phonetics 11(2), 201–6.CrossRef Google Scholar

Whalen, Douglas H. 1991. Infrequent words are longer in duration than frequent words. The Journal of the Acoustical Society of America 90, 2311.CrossRef Google Scholar

Wickham, Hadley et al. 2019. Welcome to the tidyverse. Version 1.3.1. Journal of Open Source Software 4(43), 1686.CrossRef Google Scholar

Winter, Bodo. 2020. Statistics for linguists: An introduction using R. New York: Routledge.Google Scholar

Wu, Lang. 2010. Mixed effects models for complex data (Monographs on Statistics and Applied Probability 113). Boca Raton, FL: CRC Press.Google Scholar

Zimmermann, Julia. 2016. Morphological status and acoustic realization: Findings from New Zealand English. In Carignan, Christopher & Tyler, Michael D. (eds.), Proceedings of the Sixteenth Australasian International Conference on Speech Science and Technology (SST-2016), Parramatta, Australia, 6–9 December 2016, 201–4. Canberra: ASSTA.Google Scholar

Table 1. Test sentences used in Schlechtweg & Corbett (2021)

Table 2. VerbTense in the mixed-effects model of Schlechtweg & Corbett (2021)

Figure 1. Segmentation of [z] using waveform (top), spectrogram (middle) and Praat TextGrid (bottom). Taken from Schlechtweg & Corbett (2021) (with permission)

Figure 2. Error bars of absolute s durations, 95 percent confidence intervals, the diamond symbols represent the means, reduced dataset without statistical outliers (2,307 values)9

Figure 3. Error bars of absolute s durations, 95 percent confidence intervals, the diamond symbols represent the means, reduced dataset without statistical outliers (2,307 values)

Figure 4. Error bars of absolute s durations, 95 percent confidence intervals, the diamond symbols represent the means, reduced dataset without statistical outliers (2,307 values)

Figure 5. Error bars of relative s durations, 95 percent confidence intervals, the diamond symbols represent the means, reduced dataset without statistical outliers (2,314 values)

Figure 6. Error bars of relative s durations, 95 percent confidence intervals, the diamond symbols represent the means, reduced dataset without statistical outliers (2,314 values)

Figure 7. Error bars of relative s durations, 95 percent confidence intervals, the diamond symbols represent the means, reduced dataset without statistical outliers (2,314 values)

Figure 8. Cumulative mean s durations by subjects in seconds

Table 3. Fixed-effects statistics of the mixed-effects model of absolute s durations in seconds

Table 4. Fixed-effects statistics of the mixed-effects model of relative s durations

Article contents

Is morphosyntactic agreement reflected in acoustic detail? The s duration of English regular plural nouns

Abstract

Keywords

1 Introduction

2 Theoretical background

2.1 Phonological identity but acoustic variation: overview

2.2 Word-final s in English

2.3 No overt versus overt agreement: why the s might differ in duration

3 Methodology

3.1 Subjects

3.2 Materials

3.3 Procedure

3.4 Data analysis

3.4.1 Data preparation and segmentation

3.4.2 Statistical analysis and modeling

3.5 Results

4 Summary and discussion

5 Conclusion

Appendix A: Test sentences

Appendix B: Random effects statistics of the mixed-effects model of absolute s durations in seconds

Appendix C: Random effects statistics of the mixed-effects model of relative s durationsFootnote 10

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests

Appendix C: Random effects statistics of the mixed-effects model of relative s durationsFootnote ¹⁰