1. Introduction
Tonemes and tonological processes are known to encode the full range of grammatical meaning found in language (Rolle 2018: 33), just like segmental morphemes. They are, however, inherently different from segmental morphemes in that they require a host to be realised. The dependence of tonemes on segmental hosts raises a number of questions that have only just begun to be explored in the literature. How much information does the tonal as opposed to the segmental morpheme contribute to the meaning? Tonemes can also impinge on lexical tone in competition with grammatical tunes. How does grammatical tone (GT) interact with lexical meanings of tone hosts? Investigating the place of GT within a phonological and grammatical system, interacting with segmental morphology and lexical properties of tone hosts, contributes to our understanding of the interface between (tonal) phonology, grammar and lexical meaning. I address these questions for the northwestern Bantu (A.801) language Gyeli from a perspective of property-driven typology (Plank 2001; Hyman 2009). This approach seeks to classify the distribution of individual properties, such as units, categories, construction types and rules, instead of classifying languages.
Gyeli [gyi] is an endangered Bantu language spoken by 4,000–5,000 ‘Pygmy’ hunter-gatherers in southern Cameroon, who call themselves Bagyeli. The data for this article stem from fieldwork that I conducted in Cameroon over a total of 19 months between 2010 and 2017. The most extensive description of the language is provided in Grimm’s (2021) reference grammar, which is accompanied by a digital collection of natural text and elicitation recordings in Grimm et al. (2020).Footnote 1 The transcription system I use in this article was developed with the speech community, and relies substantially on notational conventions typically used for Bantu languages.Footnote 2
In many ways, Gyeli is a typical Bantu language. Bantu languages exhibit some of the most complex GT systems in Africa (Rolle 2018: 37). Although they usually ‘only’ distinguish two to three tonal levels, they are highly diverse with respect to tonological operations and their functions, especially in the verbal domain, for example, with a recognised role of tonal inflection or ‘melodic Hs’ (Odden & Bickmore 2014; Odden & Marlo 2018). Tone plays an important role in the encoding of tense/aspect/mood (tam) and polarity categories in various positions of the verb.
While tonal phenomena have been studied by Africanists for a long time, a focus on the typology of GT – a type of non-concatenative morphology in which a morpheme is expressed in part by tonal changes and operations (e.g. tone addition, deletion, replacement, spreading, shifting, assimilation and dissimilation; Rolle 2018) – is a more recent development. Whereas GT is only licensed in specific grammatical constructions, tonal languages also have tonal operations that are predictable from the phonological rules of a language. I refer to this as phonological tone (PT), which in Gyeli is restricted to high-tone spreading (HTS). Both operate against the backdrop of lexical tone, the underlying tones that are lexically specified for words and morphemes.
When comparing tone across languages, typologies of tone systems can be established from different angles: by phonological contrast, the functions of tone, or the underlying rules (Hyman 2001). There has been little discussion, however, of how tonal and segmental morphemes interact and distribute as exponents of grammatical features within a single language and across languages. In the absence of any studies on this question, Rolle (2018: 267) speculates that ‘[i]t is a straightforward and intuitive prediction that those languages with a high lexical role for tone would have less GT, and vice versa, but this hypothesis is yet to be tested’. Contrary to Rolle’s (2018) prediction, Gyeli does have a high lexical role for tone in nouns, verbs and the majority of functional word classes while also having an important role for GT, which encodes diverse categories such as tam and polarity.
In this article, I propose a different explanation and relate the amount of information GT contributes to grammatical meaning in Gyeli to the complexity of its exponence type (GT sole exponent, GT co-exponent with suprasegmental, segmental affix or auxiliary co-exponents). Following Miestamo (2006), Dahl (2004) and McWhorter (2001), I define complexity in information-theoretic terms, as a function of the length of the form encoding some category of information. In Gyeli, length can be seen on a scale from minimal, suprasegmental length such as tone (pitch) and lengthened vowels (duration) to segmental bound morphemes and finally free morphemes resulting in complex constructions such as complex predicates.Footnote 3 Segmental morphemes are often accompanied by tonal co-exponents, which add to the form length.
One factor that has been shown to increase complexity is violating transparency, that is, the clarity of the relation between meaning and form (Kusters 2003). A one-meaning-one-form relation is ‘clear’ on the transparency scale, whereas one-meaning-several-forms mappings are considered ‘opaque’. ‘Several forms’ in this context refers to co-exponents, where information contributing to the meaning is distributed across co-exponents. The term functional load describes how much information a (co-)exponent contributes to the meaning. While providing precise measurements of how to calculate the functional load is beyond the scope of this article, it is intuitively clear that there are differences. Thus, in sole exponents, the form carries 100% of the information and is said to carry a high functional load. In co-exponents, information that contributes to the meaning is distributed over various signals, for example, a segmental affix and a tonal component. The important insight is that information is often not equally distributed over co-exponents, but that more complex, segmental exponents usually carry the higher functional load. In turn, the relation between a tonal co-exponent and its meaning is more opaque and its functional load weak.
The decrease in functional load of GT correlates with an increase in length of a segmental co-exponent. GT sole exponents constitute the basic system used in Gyeli simple predicates to distinguish tam categories and object-marking. In these cases, the one-to-one mapping of form to meaning is transparent, and GT has a high functional load, since tone alone carries the information to distinguish between categories. With an increase in complexity through addition of segmental material, the functional load of GT co-exponents becomes weaker and its meaning contribution more opaque. The reason for this is that the cue for the encoded grammatical category comes primarily from the segmental morpheme, whereas the GT co-exponent often takes an arbitrary pattern that deviates from the pattern of GT as sole exponent.
Tone impacts not only meaning contrasts across grammatical categories, but also lexical meaning. I offer an explanation for the interaction between GT and lexical tones couched in a dominance framework (Kiparsky & Halle 1977; Inkelas 1998; Rolle 2018), showing that GT is dominant and overwrites lexical tone in contexts of competition. The effect of sacrificing lexical tone is that the templatic structure of GT as sole exponent is maintained. With GT as co-exponent accompanying a segmental morpheme, however, no templatic structure is maintained. Instead, the surface tune is merely the result of idiosyncratic tonal cophonologies (Inkelas & Zoll 2007; Sande et al. 2020).
This article is structured as follows: In §2, I present the basic tonal patterns of nouns and verbs across inflectional paradigms. §3 shows the distribution of Gyeli GTs as sole and co-exponents of functional categories. In §4, I investigate the interaction between GT and lexical tones, including dominance effects and cophonological properties. §5 concludes the article with an outlook on Gyeli’s place within the broader Bantu context.
2. Tonal surface patterns in Gyeli
In investigating GT systems, it is crucial to account for the overall grammatical system within which GT operates and by which it is constrained. Structurally, Gyeli is a head-initial language with an SVO(X) basic word order. The gender system features nine agreement classes that form six genders. Whereas Bantu languages are known for their overt marking of agreement class affiliation through noun prefixes, about 40% of Gyeli nouns do not take such prefixes (Grimm 2021: 297). Both noun and verb stems are restricted to a three-syllable limit. This constrains the possibilities for multiple verb extensions, such as causative, applicative or reciprocal, that are typical of eastern and southern Bantu languages.
Phonologically, Gyeli distinguishes level tones (H and L), contour tones (HL and LH) and toneless ($\emptyset$) tone-bearing units (TBUs). Valued TBUs are lexically specified for H, L, HL or LH, whereas unvalued TBUs are underlyingly toneless (§3.1.1). I will first show that the syllable is the TBU in §2.1. I then outline the patterns of surface tones in nouns (§2.2) and verbs (§2.3).
2.1 The syllable is the TBU
Rising and falling tones are often analysed as sequences of level tones in Bantu, but they are true contour tones in Gyeli, as I analyse the syllable as the TBU (instead of the segment or mora). The language has light and heavy syllables, contrasted in the pairs in (1).Footnote 4 Light syllables have one mora; heavy syllables have two moras and contain either a long vowel, as in (1), or a diphthong. Both light and heavy syllables can host contour tones.
The examples in (1) provide arguments in favour of an analysis with the syllable as TBU and true contour tones. First, if the TBU were the mora, short vowels would not be expected to allow contour tones. On the other hand, if the presence of contours on short vowels were accommodated by allowing individual moras to bear two tones, one would then expect bimoraic syllables to allow two contours, as in *[fûǔ]. These complex tone sequences, however, do not occur.Footnote 5 Evidence for the syllable as TBU comes from how grammatical H tones attach to verb forms with lexical long vowels. H attachment to a long L-toned verb such as lɛ̀ɛ̀ ‘uproot’ yields [lɛ́ɛ́], targeting the entire syllable, and not the mora, which would result in *[lɛ̀ɛ́]. In contrast, in longer verbs, the H does not spread to the initial syllable, as gyàga ‘buy’ yields [gyàgá]. Thus, the all-H form in the long monosyllabic L verb lɛ̀ɛ̀ ‘uproot’ does not result from unbounded H spreading (nor from systematic replacive H).
2.2 Tonal patterns in nouns
In this section, I will present the tonal patterns for nouns, noun prefixes and attributive constructions, which illustrate underlying tones and tonal operations in nouns. Gyeli nouns consist of a nominal stem and a noun class prefix, which in some agreement classes can be a $\emptyset$-prefix. Noun stems are fully specified for lexical tones, including both level and contour tones, as shown with the minimal pairs in (2) for prefixless nouns.
Noun stems are mono-, di- or trisyllabic, with a preference for disyllabic stems (Grimm 2021: 92). Most of the 875 nominal lexemes in my database are specified exclusively for level tones, as shown in Table 1: 93.3% of disyllabic and 95.6% of trisyllabic noun stems have only level tones. All possible combinations of level tones are attested for every syllable count.
Contour tones are most frequent in monosyllabic stems, but are also found in di- and trisyllabic noun stems, where they can occur in any position except for the medial syllable of a trisyllabic stem, as shown in Table 2. LH is more restricted than HL. Comparing monosyllabic noun stems, the ratio of LH to HL is 20% to 80%.Footnote 6
Noun prefixes consist of either a consonant (N-, d-, j-, b-, bw-) or a CV sequence (ba-, mi-, le-, ma-, be-; Grimm 2021: 296). Only the CV prefixes constitute TBUs. They surface as phonetically L when the noun occurs in isolation and there is no grammatical H tone, but I consider them to be phonologically toneless, as argued below. CV noun prefixes take a H tone in N + N attributive constructions, if the preceding attributive marker has a H tone, as shown in (3a).Footnote 7 Attributive markers are lexically specified for a H in agreement classes 2–8. In agreement classes 1 and 9, however, they are specified L, as in (3b). In these cases, the following noun prefix is L, too. If the noun prefix of the second constituent is not a TBU, no tonal changes occur on the possessor noun (3c).Footnote 8
The autosegmental representation of (3b) is given in (4).
Under the alternative where the noun prefix were specified L, one would either need to assume more complicated rules of featural change or L deletion, or expect to see downstep effects on a H stem. Unlike other languages of the area, however, Gyeli does not have downstep. The phrase in (5) will surface as all H except for the initial noun class prefix, as shown in Figure 1.Footnote 9
Tone attachment and spreading apply in different directions in nouns (as illustrated in (4)) and verbs (as shown in (11)). CV noun prefixes and the plural marker nga (§3.1) receive their tone specification from the left. In contrast, verb stems, vocative markers, demonstratives and adverbs receive GTs from the right (§3).
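The way CV noun prefixes take their tone from the left in attributive constructions (§2.2) can be sketched informally in code. This is only an illustration of the generalisations stated above, not part of the analysis: the function names `attributive_marker_tone` and `noun_prefix_surface_tone` are hypothetical, and agreement classes are simply represented as the integers 1–9.

```python
from typing import Optional


def attributive_marker_tone(agreement_class: int) -> str:
    """Lexical tone of the attributive marker: L in classes 1 and 9, H in classes 2-8."""
    if agreement_class not in range(1, 10):
        raise ValueError("Gyeli has nine agreement classes (1-9)")
    return "L" if agreement_class in (1, 9) else "H"


def noun_prefix_surface_tone(marker_tone: Optional[str], prefix_is_cv: bool) -> Optional[str]:
    """Surface tone of a noun prefix (hypothetical helper).

    marker_tone is the tone of a preceding attributive marker, or None when
    the noun occurs in isolation. The result is None when the prefix is not
    a TBU (consonantal or zero prefix), so no tonal change can be hosted.
    """
    if not prefix_is_cv:
        return None  # only CV prefixes constitute TBUs
    if marker_tone == "H":
        return "H"   # the toneless prefix takes H from the marker on its left
    return "L"       # otherwise the toneless prefix surfaces as phonetic L


# A CV prefix after a class 2 marker surfaces H; after a class 1 marker it surfaces L.
print(noun_prefix_surface_tone(attributive_marker_tone(2), prefix_is_cv=True))
print(noun_prefix_surface_tone(attributive_marker_tone(1), prefix_is_cv=True))
```

The sketch deliberately treats the prefix as toneless rather than L-specified, in line with the argument from the absence of downstep.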
2.3 Tonal patterns in the VP
Hosts of GT in the VP in Gyeli include the verb stem and its preceding ‘stamp’ clitic, which encodes combinations of subject agreement, tense, aspect, mood and polarity. The surface tunes are determined by different tam and polarity categories and the verb’s position as phrase-final or phrase-medial. In the following, I will outline the tonal patterns in simple and complex predicates.
Like nouns, verb stems are no longer than three syllables. Tonal lexical contrasts are only found in the stem-initial position, that is, the verb root, whereas the other syllables are the locus of GT distinctions.Footnote 10 In (6), the trisyllabic verbs are lexically specified for either L or H on the root (i.e. the stem-initial syllable), whereas tones on non-initial verb syllables are conditioned by specific tense-aspect-mood categories, instead of being predictable from the lexical tone of the first syllable.Footnote 11
The combinations of tonal patterns on the stamp clitic and the verb instantiate seven tam categories in simple predicates: present, inchoative, future, past 1 (recent), past 2 (remote), imperative and subjunctive. Tones on the verb are further subject to change in certain tam categories, depending on whether the finite verb occurs phrase-finally or phrase-medially. Table 3 shows that, in the present and inchoative, the non-initial syllables of the verb are L in phrase-final position, indicated by the subscript F appended to the tam category.Footnote 12 In contrast, they surface as H in phrase-medial position (marked by a subscript M). All other tam categories only have one line, since their phrase-medial and phrase-final tone patterns are identical. I relate the different phrase-final and phrase-medial patterns to a realis/irrealis distinction, as discussed in §3.1.2.
Tone is the only inflectional marking on finite verbs and the only morphological difference between finite and non-finite verb forms. Finite verbs are tonally marked for the various tam categories, for example, HL for imperative, or with the phrase-medial H (i.e. realis/irrealis). In contrast, non-finite forms are unmarked, and only carry the underlying tone of the verb on the initial syllable, with a default L surfacing on the underlyingly toneless non-initial syllables. Non-finite verb forms occur in complex predicate constructions with auxiliaries and modal verbs. They surface with final L tones, even in tense-mood categories that require the medial H tone (present, inchoative). As illustrated in (7), the H tone is realised on the finite modal verb in complex predicates, and the lexical verb is non-finite.Footnote 13
3. Exponence types of GT in Gyeli
I use Rolle’s (2018) framework and terminology in analysing Gyeli GTs. The general idea for GT is that a specific grammatical context licenses a specific tonal pattern on certain morphemes. The terms used are defined in (8).
In this section, I show that GT sole exponents in Gyeli have a high functional load, as defined in §1, and constitute the basic system for distinguishing tam categories in simple predicates. The functional load of GT gradually weakens, however, with increasing complexity of a co-exponent, as shown in (9), where the top part represents the complexity of (co-)exponents on a scale from minimally to maximally segmental. The scale correlates with the tonal exponence type shown in the bottom part, following Hyman’s (2012) ‘three ways in which tone can be an exponent of a morpheme or morphological process’ – or grammatical feature: tone as sole exponent, systematic co-exponent or arbitrary co-exponent.
For example, present negation (§3.2.2) is expressed by a H that attaches to the verb root and a verbal suffix -lɛ. If the tonal co-exponent were excluded, the negated clause would sound wrong to native speakers, but they would still understand the meaning. If, however, the negation suffix were excluded, the resulting form would not be distinct enough from other existing forms with other GTs, and the meaning of negation would be lost. Thus, the tonal co-exponent plays only a small – and by itself insufficient – role in distinguishing between grammatical categories, making this distinction opaque. In the following, I discuss each GT exponent type, elaborating on the data introduction given in §2.
3.1 GT as sole exponent
Gyeli has eight GT patterns where tone is the sole exponent of a grammatical category. They all occur in the VP, as illustrated in the second line in (10) with three GT patterns (GT1, GT7 and GT8), which will be described below.
Six GTs serve as tense encoding; they consist of tonal combinations on the stamp clitic and the verb (§3.1.1). A floating H that attaches in phrase-medial position to the right of the verb in certain tam categories correlates with realis marking (§3.1.2). Finally, another floating H surfaces on a toneless element immediately following the verb and marks the verb–object construction. I call this GT the ‘object-linking H tone’ (§3.1.3). In the following, I present more details on each GT, including information about tonal operations and possible alternative analyses.
3.1.1 Tense marking GTs
In Gyeli, tone is the (near) sole exponent to distinguish seven tense-mood categories, as shown in Table 4.Footnote 14 GT is assigned to the preverbal stamp clitic and the non-initial verb stem syllables through attachment of floating tones to the right of both hosts (or, in the case of surface L, lack thereof). In verb stems that have three syllables, this also includes the phonological operation of HTS to the left onto the second stem syllable. A crucial point of this analysis is that I view both hosts – the stamp clitic and non-initial verb syllables – as underlyingly toneless, as I will explain below. The combinations of tone patterns on the stamp clitic and the verb stem for different syllable lengths are given in Table 3, distinguishing verb tones in phrase-final and phrase-medial position.
I argue that Gyeli has unvalued TBUs, which surface as L phonetically or receive their tonal specification from their grammatical environment. These include i) noun prefixes of a CV shape (e.g. ba-kùsì ‘parrots’, le-nángá ‘star’), as described in §2.2; ii) the preverbal stamp clitic, which encodes subject agreement, tense, aspect, mood and polarity; iii) the present negation suffix -lɛ; iv) the postverbal plural marker nga; and most importantly for GT in the VP, v) non-initial syllables of the verb stem (e.g. bìyɔ ‘hit’, lúmɛlɛ ‘send’). With only a few functional morphemes that are tonally unvalued, Gyeli has a relatively high tonal density (Gussenhoven 2004; Hyman 2009): the proportion of valued TBUs, that is, those that are underlyingly H, L, HL or LH, is high compared to unvalued ($\emptyset$) TBUs. This is true for all lexical and functional parts of speech, such as nouns, adjectives, ideophones, adverbs, pronouns, demonstratives and adpositions, with the notable exception of verbs. As shown in Table 5, only stem-initial syllables are valued with lexical tones (as also shown in §2.3), amounting to 377 valued TBUs, whereas 365 TBUs of non-initial syllables are unvalued. Thus, verb stems exhibit a medium level of tonal density, while most verbal clitics (the stamp marker) and verbal affixes (plural suffix, present negation suffix and all verb extension suffixes such as causative, applicative, passive or reciprocal) are unvalued.Footnote 15
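To make the ‘medium level of tonal density’ in verb stems concrete, the two counts just cited can be turned into a proportion. The short calculation below assumes that tonal density is read as the share of valued TBUs among all TBUs, which is my paraphrase of the definition above; the variable names are illustrative only.

```python
# Counts of TBUs in verb stems as reported in the text (Table 5).
valued_tbus = 377    # stem-initial syllables, lexically specified
unvalued_tbus = 365  # non-initial syllables, underlyingly toneless

# Tonal density read as the share of valued TBUs among all TBUs (assumption).
tonal_density = valued_tbus / (valued_tbus + unvalued_tbus)

print(f"{tonal_density:.3f}")  # 0.508
```

On this reading, roughly half of the TBUs in verb stems are lexically valued.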
Non-initial verb syllables are best viewed as underlyingly toneless instead of L-toned. While there is ultimately no knockdown evidence for this analysis, I use the criterion of simplicity to justify my choice of treating non-initial verb syllables as toneless. Distributional asymmetry is one line of evidence for underspecification, although it is not sufficient by itself (Marlo & Odden 2019: 152). There is a clear asymmetry in Gyeli between noun stems, which allow nearly all possible tonal combinations (Table 1) as well as contour tones (Table 2), and verb stems, which allow lexical tonal contrasts only on their initial syllables (Table 5). Phonologically toneless TBUs, such as noun prefixes and non-initial verb syllables, predictably surface as L phonetically unless they receive another tone from some other source.
Another argument for proposing a ternary distinction between H, L, and $\emptyset$ TBUs is that it is consistent with what is found in other languages in the area. For instance, Marlo & Odden (2019: 152) distinguish verb-stem–initial L from subsequent $\emptyset$ in the closely related language Mokpwe (A22). The evidence they put forth is behaviour under HTS. In Mokpwe, word-final ‘melodic’ H tones spread to the left, up to but not including the root (stem-initial) syllable, just like in Gyeli. The best explanation for this limit on the spread is in both cases to posit roots that are specified L or H, while intervening TBUs are toneless. This is exemplified with H attachment in (11) for the trisyllabic Gyeli verb vìdega ‘turn’ (Grimm 2021: 110); in disyllabic stems, the H only attaches to the second TBU.
Verbs with a H root maintain this H tone under H attachment so that the entire stem surfaces H, assuming that Obligatory Contour Principle (OCP) violations are silently resolved by tone fusion.
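The attachment and spreading behaviour described so far can be summarised in a small procedural sketch. It is a simplification under stated assumptions: each syllable is one TBU, toneless TBUs are written `None`, the function names `attach_floating_h` and `surface` are hypothetical, and the special lowering of monosyllabic H verbs mentioned in §3.1.2 is not modelled.

```python
from typing import List, Optional

Tone = Optional[str]  # "H", "L", "HL", "LH", or None for a toneless TBU


def attach_floating_h(stem: List[Tone]) -> List[Tone]:
    """Dock a floating H at the right edge of a verb stem (one tone per syllable)."""
    if len(stem) == 1:
        # Monosyllabic stems: the H targets the whole (only) syllable,
        # as in the L-toned verb meaning 'uproot' surfacing all H.
        return ["H"]
    out = list(stem)
    i = len(out) - 1
    # Polysyllabic stems: H docks on the final syllable and spreads leftwards
    # over toneless syllables, stopping before the lexically valued root.
    while i > 0 and out[i] is None:
        out[i] = "H"
        i -= 1
    return out


def surface(stem: List[Tone]) -> List[Tone]:
    """Spell out remaining toneless TBUs with the default phonetic L."""
    return [tone if tone is not None else "L" for tone in stem]


# Trisyllabic L-root stem (the shape of the verb 'turn'): L H H with the floating H,
# L L L without it.
print(surface(attach_floating_h(["L", None, None])))
print(surface(["L", None, None]))
```

Because the root syllable is the only valued TBU, the stopping point of leftward spreading falls out without a separate restriction on initial syllables.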
There are two alternative ways to analyse underlying non-initial verb tones in Gyeli, assuming in both that all syllables in verbs are valued, which I argue against. In the first alternative, all non-initial verb TBUs are specified with L tones. These are delinked and replaced under H attachment. Attached H tones spread to the left, with a phonological restriction against spreading to initial syllables. This, however, can be ruled out, since monosyllabic verbs show that root tones can be targeted: kɛ̀ ‘go’ becomes kɛ́. Another possibility in favour of underlying L tones could be that grammatical H tones do not target the right edge of the verb but second syllables, spreading rightwards instead. In this scenario, the (possible) non-initial L tones are delinked and replaced. Indeed, it is common in Bantu languages for GTs to target specific positions in the verb stem, such as the second mora or the penultimate syllable.Footnote 16 Positing second-syllable targets with rightwards spread, however, makes it harder to explain tonal changes on monosyllabic verbs.
A second hypothesis is that non-initial syllables receive their specification through tone spreading from the root syllable, that is, verbs with an initial H are specified all H and initial L verbs are specified all L. In this scenario, in L-toned verbs, H spreads leftwards and delinks all but the initial association of the L tone, maintaining the lexical contrast of the root.Footnote 17 This view, however, does not easily account for the tonal patterns found in simple predicates, failing to explain how all verbs, irrespective of their underlying specification with H or L, end up with the same tone patterns in different tam categories. To make this alternative work, one would need to assume more rules than are necessary under my analysis. First, there is a need to explain how L-toned verbs surface as H in phrase-medial position and only in certain tam categories, requiring the attachment of a H. In turn, one also needs to explain how H-toned verbs surface with L on non-initial syllables in phrase-final position, but again not in all tam categories. This would require the attachment of a L, possibly a boundary L% tone, which I argue against in §3.1.2.
As for the stamp clitic, it is difficult, if not impossible, to prove the underlying tone of this marker since it cannot appear outside of a grammatical context which serves as trigger for a specific GT. In parallel to non-initial verb syllables, I view stamp clitics as underlyingly toneless, with the segmental part contributing person agreement and the tonal exponent contributing tamp marking.
Given that the stamp marker and the verb are two separate words,Footnote 18 I analyse the floating tones as separate as well (rather than constituting a circumfix). Assuming two distinct floating tones is in line with the relative independence of the verb tone, showing functional divisions into non-past ($\emptyset$), past (H) and tenseless (HL). Although verb and stamp tone patterns work in parallel to arrive at the seven categories in the paradigm, the tonal form of one tonal host does not condition the tonal pattern of the other.
Whereas floating tones on verbs clearly attach to the right of the verb, as illustrated in (11), it seems difficult to linearise the segmental part of the stamp morpheme and its floating tone. Since the stamp morpheme is underlyingly toneless, no difference would be observable whether one postulated the tone to dock on the left or the right of the stamp marker. Although there is no formal evidence for linearisation, I assume that the floating tone attaches to the right for historical reasons, and by analogy with other Bantu languages. In Bantu languages, tam information usually follows subject agreement morphemes. Historically, Gyeli likely had segmental co-exponents for tam marking between the subject agreement marker and the verb but lost the tam segments. Closely related languages such as Kwasio, for instance, exhibit a mixture of segmental and tonal morphology between the subject marker and the verb (§5).
Some stamp clitic forms are also lengthened. There is, however, a predictable relationship between lengthening and tone pattern, where a LH or HL pattern co-occurs with a long vowel. This could be analysed not as co-exponence per se, but as a phonotactic restriction, requiring that stamp markers not have monomoraic contour tones. To accommodate the LH or HL tone pattern, the vowel is lengthened. In contrast, with the subjunctive, the lengthened vowel is an instance of co-exponence, a specific requirement for final vowel lengthening. As the imperative allows monomoraic contour tones, a phonotactic restriction for the verb can be ruled out. Thus, the subjunctive marking GT6b has a suprasegmental co-exponent (§3.2.1).
3.1.2 Realis mood marking GT
Every tense category also belongs inherently to a mood category, distinguishing realis from irrealis. This mood distinction is expressed through the presence (realis) or absence (irrealis) of GT7 that, unlike GT1–GT6b, only attaches to the right of the finite verb stem if the verb is in phrase-medial position. The present tense, for instance, belongs to the realis mood. As shown in (12) (Grimm 2021: 389), the verb surfaces as L in phrase-final position, as expected from the GT1 pattern. Phrase-medially, however, the verb stem takes the realis-marking GT7 and surfaces as H.
The presence of GT7 is solely conditioned by the tense category of the verb and not by the morphosyntactic material that follows the verb, since any phrase-internal material that follows the verb in realis tenses triggers the GT7 H to surface. Tense and mood categories are thus intrinsically connected, and so I generally refer to them as tense-mood categories. The distribution of tense-mood categories across realis and irrealis is shown in Table 6 (Grimm 2021: 387).
One may question the evidence for the realis-marking GT7, given that it is directly observable only in the present and the inchoative. The other two categories that surface as H phrase-medially, namely the recent and remote past, are also H phrase-finally through their tense-marking tones (GT4 and GT5). Alternatively, one could assume that there is no realis-marking tone, but that the phrase-medial H tones in the present and inchoative are part of the tense-marking patterns GT1 and GT2, which are lowered in phrase-final position. The surface contrast between H and non-H, however, clearly partitions the tam categories into realis and irrealis. In my opinion, these semantic patterns should not be lightly dismissed as arbitrary phonological patterns. Since tam categories are typically syncretic, it would not be unexpected if tense in the past forms was marked by the same H tone as mood.Footnote 19
Additionally, I argue against L% boundary tones in Gyeli since they would not apply across tam categories. On the one hand, the past tenses in phrase-final position are marked with a final H on the verbs, unlike all other tam categories. On the other hand, in phrase-medial position, the future, imperative and subjunctive all end in L, whereas the other categories have a H tone.Footnote 20 This seems unexpected for a tone that marks prosodic domain edges. One could make this work by assuming that lowering only applies to the floating H tone that attaches to verbs in the present and inchoative, taking medial forms as the default and final forms as special cases. It is simpler, however, to posit that final forms are the default and medial patterns those with added tonal morphology, since only one operation (floating H phrase-medially) is needed instead of two (floating H phrase-medially and lowering phrase-finally): a floating H attaches phrase-medially in certain tam categories, whereas all instances of final L are the phonetic surface forms of underlyingly toneless TBUs. Gyeli has one exception where final lowering occurs, namely in monosyllabic H verbs, which are lowered to HL in non-finite and phrase-final forms. I view this as a case of alignment to imitate the surface pattern of non-initial syllables in the same grammatical contexts.Footnote 21
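The one-operation analysis argued for here can be stated compactly as a decision procedure. The sketch below is only an illustration: it reduces the verb's tone pattern to the single question of whether the verb ends in a H tone, the category labels and the function name `verb_ends_in_h` are hypothetical, and the assignment of categories to realis follows the description in this section rather than a full reading of Table 6.

```python
# Category labels are illustrative; membership follows the prose of this section.
PAST = {"past1", "past2"}                    # final H comes from the tense-marking GT
REALIS = {"present", "inchoative"} | PAST    # categories that take the realis H (GT7)
IRREALIS = {"future", "imperative", "subjunctive"}


def verb_ends_in_h(category: str, phrase_medial: bool) -> bool:
    """True if the finite verb ends in a H tone under the single-operation analysis."""
    if category not in REALIS | IRREALIS:
        raise ValueError(f"unknown tense-mood category: {category}")
    tense_h = category in PAST                          # present in both positions
    realis_h = phrase_medial and category in REALIS     # floating H, medial only
    # No lowering rule is needed: any final L is the default
    # spell-out of an underlyingly toneless TBU.
    return tense_h or realis_h


for cat in ("present", "past1", "future"):
    print(cat, verb_ends_in_h(cat, phrase_medial=False), verb_ends_in_h(cat, phrase_medial=True))
```

A boundary-tone account would need an additional lowering step restricted to the present and inchoative, which is the extra operation the text argues against.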
3.1.3 Object-linking H tone
I analyse GT8 as a floating H, which immediately follows the lexical (finite or non-finite) verb and gets realised on the immediately following TBU whenever there is an in-situ object immediately after the verb. The host of GT8 is either a CV noun prefix on the object or an intervening verbal plural marker nga, both of which are underlyingly toneless. As I view the function of this GT to be to flag the presence of a syntactic object in situ, I gloss it as obj.link. GT8 can only surface when the object has an unvalued TBU, that is, a CV noun prefix, as in (13). Nouns with a C prefix or a $\emptyset$ prefix do not undergo any tonal change, nor do pronominal objects.
The examples in (13) also show that the H on a postverbal object prefix does not stem from HTS, which it could be mistaken for in the many cases in which tense- and mood-marking GTs attach to the preceding verb stem. GT8 occurs in all tense, aspect, mood and polarity categories, including those that do not take a H tone on the preceding verb stem.
While Gyeli allows free ordering of the two objects in ditransitive constructions, only the object that is closest to the verb is marked by the object-linking GT, as shown in (14). This can be naturally explained by the GT’s location immediately after the lexical verb.
The functional distinction that GT8 makes can be neatly observed when comparing immediate-after-verb arguments to oblique NPs. Oblique NPs, as in (15), do not take the object-linking H tone.Footnote 22
The object-linking H tone occurs after the non-finite lexical verb in complex predicates, as in (16), since it is the non-finite verb that is transitive and carries this morphological marker, not the auxiliary. In contrast, tense- and mood-marking GTs only attach to finite verb forms.
When the verbal plural marker nga used in imperative and hortative constructions intervenes between the verb and the object, GT8 is realised on the plural marker instead of the nominal object prefix. The clitic nga is underlyingly toneless and surfaces as L when the verb is phrase-final, as in (17a). If there is an object, however, as in (17b), the plural clitic ‘steals’ GT8 from its target, hosting the H tone, while the object prefix surfaces as L. These examples also show that the presence of an immediate-after-verb object is required for GT8 to surface: if the object is elided, as in (17a), GT8 does not attach.Footnote 23
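The host selection for GT8 in simple predicates can be summarised as a small decision procedure. The following Python sketch is only an illustration of the description above, not part of the analysis itself: the function name `gt8_host` and the labels "CV", "C", "zero" and "pronoun" are hypothetical. The text does not state what happens when nga intervenes before an object without a CV prefix, so the sketch assumes that nga still hosts the H in that case. Complex predicates are left out.

```python
from typing import Optional


def gt8_host(object_kind: Optional[str], nga_intervenes: bool = False) -> Optional[str]:
    """Return the element that hosts the object-linking H (GT8), or None.

    object_kind is a hypothetical label for the in-situ object right after
    the verb: "CV" (noun with CV prefix), "C", "zero", "pronoun", or None
    when no object follows the verb. Simple predicates only.
    """
    if object_kind is None:
        # No in-situ object: GT8 does not attach, nga (if any) surfaces as L.
        return None
    if nga_intervenes:
        # The plural clitic is the first toneless TBU after the verb.
        # Assumption: this holds regardless of the object's prefix shape.
        return "nga"
    if object_kind == "CV":
        # Only a CV noun prefix offers an unvalued TBU on the object.
        return "object prefix"
    # C prefix, zero prefix and pronominal objects show no tonal change.
    return None


print(gt8_host("CV"))                       # object prefix
print(gt8_host("CV", nga_intervenes=True))  # nga
print(gt8_host(None, nga_intervenes=True))  # None
```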
The verbal plural clitic nga receives its tone purely phonologically, either by insertion of a default L or by rightward HTS. Thus, in complex predicates such as in (18), nga receives its H tone from the preceding auxiliary, while GT8 surfaces on the object noun prefix.
3.2 GT as co-exponent
I distinguish three types of GTs as co-exponent of grammatical features in Gyeli, in an increasing order of complexity: i) the GT co-exponent co-occurs with the suprasegmental addition of vowel lengthening; ii) the GT co-exponent co-occurs with segmental morphemes; and iii) the GT co-exponent co-occurs with complex predicates that require auxiliaries. The first two types tend to constitute systematic co-exponents, while the tonal co-exponents of auxiliaries seem arbitrary.
3.2.1 GT co-exponent with vowel lengthening
Suprasegmental co-exponence is minimally complex and involves vowel lengthening, that is, the addition of a mora. In simple verbal predicates, this type of co-exponence is restricted to the subjunctive (GT6a in Table 4). Verb stems in other tense-mood categories always end in a short vowel. In contrast, I do not consider the lengthened stamp clitics in the inchoative, the future and the remote past as instances of co-exponence, since their occurrence can be explained as a phonotactic restriction in stamp morphs to accommodate HL and LH tones (§3.1.1).
Another case of GT co-exponence with vowel lengthening concerns demonstratives and certain adverbs, which use final vowel length in conjunction with a H tone to express deictic distance. As shown in Table 7 for all nine agreement classes, proximal demonstratives have a short vowel and a HL lexical tone.Footnote 24 The distal demonstrative is derived from the proximal base form by adding a morpheme that consists of a lengthened vowel accompanied by a H GT co-exponent.
Similarly, the adverbs wû ‘there’ and pɛ̀ ‘there’ can take a final H tone, together with final vowel lengthening, to mark distance. This is shown for the adverb pɛ̀ in (19), with the unmarked form in (19a) and the distal form in (19b).
3.2.2 GT co-exponent with segmental morphemes
The next level of complexity involves bound segmental morphemes, which in the case of Gyeli are suffixes. Of the two suffixes that co-occur with GT, the vocative fits in with the tonal pattern and functionality of the deictic distance system, with a predictable H signalling distance. The difference, however, is that the vocative has a dedicated segmental suffix -o rather than vowel lengthening alone, as used by demonstratives and some adverbs (§3.2.1). Vocative suffixes attach to proper names, as in (20), and to certain adverbs, as in (21). A L vocative suffix encodes proximity to the addressee, whereas a H vocative suffix encodes distance.
There is another segmental co-exponent suffix, -lɛ, which encodes present tense negation. As shown in Table 8, H-toned verbs surface as all H, including the negation suffix, regardless of the length of the stem. In contrast, L roots are realised with H on the initial syllable. In monosyllabic stems, the negation suffix is then also H, whereas di- and trisyllabic verbs are all L for non-initial syllables, including the negation suffix.
The accompanying GT pattern deviates from the basic affirmative present tense-mood pattern, as shown in (22). While the verb stem in the affirmative present surfaces as L phrase-finally but with a realis-marking H tone phrase-medially, as in (22a), present negation does not take the medial H tone, as in (22b). For this reason, I classify it as an irrealis category. This is consistent from a semantic perspective, given that it is typologically frequent for negation to correlate fully or partially with irrealis marking, although this does not apply systematically (cf. Elliott Reference Elliott2000).
The negation pattern is an instance of systematic co-exponence, differing from its affirmative counterpart in two predictable ways. First, the stamp clitic receives a LH floating tone in the first and second person singular and agreement class 1, whereas the other subject agreement forms are identical to the H stamp clitic of the affirmative present, as shown in (22).
Second, the present negation form on the verb consists of a H tone which attaches at the left edge of the verb stem and an underlyingly toneless negation suffix -lɛ, which receives its tone from the preceding syllable. Table 9 illustrates the analysis of the data presented in Table 8. The H tone displaces the lexical tone rightwards onto the second syllable of the stem. In H-toned roots, the lexical H shifts rightwards and then spreads rightwards, resulting in an all-H form. In L-toned roots, the lexical L shifts rightwards and spreads or is realised as L by default. If there is no second syllable of the verb to shift to, the lexical L is not realised, and the H of the tonal co-exponent spreads onto the negation suffix.
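The derivation just described can be stated as a short procedure. The Python sketch below is illustrative only: the function name `negated_present_tones` is hypothetical, and verbs are reduced to their lexical tone and syllable count. It returns the surface tones of the stem syllables followed by the negation suffix -lɛ.

```python
from typing import List


def negated_present_tones(lexical_tone: str, n_syllables: int) -> List[str]:
    """Surface tones of a negated present verb: stem syllables, then -lɛ.

    lexical_tone is "H" or "L"; n_syllables is the stem length (1 to 3).
    """
    if lexical_tone not in ("H", "L"):
        raise ValueError("lexical_tone must be 'H' or 'L'")
    if not 1 <= n_syllables <= 3:
        raise ValueError("verb stems have one to three syllables")

    # The grammatical H attaches at the left edge of the stem.
    tones = ["H"]
    if n_syllables == 1:
        # No second syllable to shift to: the lexical tone is not realised
        # and the grammatical H spreads onto the toneless suffix.
        tones.append("H")
    else:
        # The lexical tone shifts to the second syllable and covers the
        # remaining stem syllables (H by spreading, L by spreading/default).
        tones.extend([lexical_tone] * (n_syllables - 1))
        # The toneless suffix takes its tone from the preceding syllable.
        tones.append(lexical_tone)
    return tones


for tone in ("H", "L"):
    for length in (1, 2, 3):
        print(tone, length, negated_present_tones(tone, length))
```

Under these assumptions, monosyllabic H and L stems both come out as H H, which is the neutralisation of lexical contrast discussed for monosyllabic verbs.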
With the addition of the segmental negation suffix -lɛ and its tonal co-exponent, the patterns of tense-marking GT in the affirmative do not apply. The cue for the lexical contrast in monosyllabic verb stems is lost as all negated monosyllabic stems surface as all H, as shown in (23).
3.2.3 GT co-exponent with auxiliaries in complex predicates
The co-exponent type that is structurally most complex involves true auxiliaries in complex predicates.Footnote 25 In contrast to the other types of co-exponents, their tonal co-exponents are arbitrary, as each auxiliary has its own unpredictable tonal cophonology. The opposition of (24a) and (24b) shows that this is not a property of complex predicates per se, since modal semi-auxiliaries such as kwálɛ ‘like’ take the same tonal inflection patterns as simple predicates, for example, in the present tense with a H on the stamp and a realis-marking H phrase-medially.
True auxiliaries encode both aspect and negation. Just like modal (semi-auxiliary) verbs, they act as the inflected verb form, whereas the lexical verb is non-finite. They cannot, however, host tense-marking GT1–GT6a (§3.1.1), but instead display their own tone patterns, as in (24c), where the stamp clitic surfaces with L instead of the expected H of the present tense.
While presenting the entire auxiliary system of Gyeli exceeds the scope of this article,Footnote 26 I choose several examples which illustrate that tonal co-exponence with auxiliaries is arbitrary and that the functional load of GT in these constructions is too weak to contribute enough information to distinguish between grammatical categories.
The arbitrary tonal patterns of auxiliaries heavily restrict most forms to a specific tense-mood category. For instance, the progressive has three segmental forms: nzíí for present progressive, nzí for recent and remote past, and nzɛ́ɛ́ for progressive in subordinate clauses (and none for the future). Each segmental marker determines the tonal pattern of the stamp clitic, as I discuss below. In fact, the information that GT as sole exponent carries in simple predicates is entirely lost in auxiliary constructions, both with the stamp clitic and verb stems.
Floating tones on stamp clitics in auxiliary constructions do not contribute information to tense distinctions because they lack systematic oppositions. In addition to differing from the patterns of simple predicates, some stamp clitics show paradigm-internal variation, which seems to parallel simple predicate patterns. An example is the stamp clitic pattern found with the future negation auxiliary kálɛ̀, with a split between a long stamp with a HL pattern for most subject agreement forms and a long L surface pattern for the first and second person singular and agreement class 1. The same pattern holds in the future marking of simple predicates (Table 4). This pattern identity across simple and complex predicates is, however, unpredictable. There are counterexamples, such as the prospective marker múà, which has a H stamp clitic in most subject agreement classes, but L in first and second person singular and agreement class 1, as shown in (25). This paradigm-internal split does not cluster with any TAM category in simple predicates.
Tonal patterns on the segmental part of auxiliaries also seem unpredictable. Most of them end in a H tone, including all three progressive forms nzíí, nzí, nzɛ́ɛ́, the retrospective marker lɔ́, the perfect marker bwàá, the past negation sàlɛ́/pálɛ́ and the imperative and infinitival negator tí. Only the prospective marker múà, the future negation marker kálɛ̀ and the subjunctive negation marker dúù surface with a final non-H tone. Since auxiliaries can never occur phrase-finally, it is impossible to test whether final H tones are lexically specified or a result of H attachment as proposed for the realis-marking GT7. For this reason, the contribution of GT to mood marking in auxiliary constructions is unclear. In fact, there is a fundamental question as to how much GT there is in these auxiliary constructions. One may argue that the tone on the auxiliary itself is lexical, and only the tone on the stamp morpheme is grammatical. An argument for this is that the true auxiliaries are highly lexicalised forms likely derived from archaic verb forms, so their tones may have become lexicalised as well. An argument for viewing auxiliary tones as GTs, on the other hand, is their parallel structure with other (simple and complex) predicates, which all involve GTs on the stamp morpheme and the finite verb form. I do not see any conclusive evidence for either option. I have shown in this section that, with increasing segmental complexity, the information that GT contributes to the meaning becomes less paradigmatic and its functional load weaker.
4. Interaction with lexical tone
GT operates alongside PT and lexical tone. But what happens when GT and lexical tone are in competition? In this section, I explore the interaction between GT and lexical tone within a dominance framework (Kiparsky & Halle Reference Kiparsky, Halle and Hyman M.1977; Inkelas Reference Inkelas, Booij and Marle1998; Rolle Reference Rolle2018). I show that GT in Gyeli is dominant, but only ‘under duress’, that is, when there are not enough toneless syllables to host both lexical tone and GT. In this case, GT wins over lexical tone. Following Rolle (Reference Rolle2018), dominance effects are understood as the interactions between the trigger, with its morphosyntactic properties, and the host, with its tonal value. Rolle (Reference Rolle2018: 10) distinguishes dominant and non-dominant GT, and describes the tension between these two types as follows:
[W]ithin dominant GT all outputs have a uniform tone shape which has the advantage of providing a more consistent cue for the grammatical category of the trigger, but sacrifices the lexical contrast of the target. In contrast with non-dominant GT, outputs do not have a uniform form and thus maintain lexical contrast unambiguously, but at the cost of having a less delimited cue for the trigger.
The dominance type found in Gyeli is replacive-dominant (RD), as defined in (26).Footnote 27
Monosyllabic verbs provide the evidence needed to show that GT in Gyeli is, in fact, dominant ‘under duress’. In (27), the realis-marking GT7 is in conflict with the lexical L on the verb dè ‘eat’. The lexical tone is delinked and replaced by GT: GT wins out. This explains why underlying (lexical) tones of monosyllabic verbs go unrealised. If GT were non-dominant, then either GT would not be realised in the monosyllabic forms, or it would combine with the lexical tone to create a contour. Gyeli thus appears to have a system that strives to keep both lexical tone and GT, and sacrifices lexical tone only when one of the two must go.
The longer verb forms provide no evidence for dominance or non-dominance, as there is no case of duress. All GTs in (28) are realised on underlyingly toneless syllables, maintaining the lexical tone on the verb. GT1 encodes present and is realised on the stamp clitic for agreement class 1; GT7 expresses the realis category; and GT8 links to the object in the VP.
The phonological system – with its restrictions on syllable length, the status of the syllable as TBU and the distribution of valued and unvalued TBUs – constitutes the framework in which lexical tone and GT operate. Sole-exponent GTs (§3.1) generally target unvalued TBUs: stamp clitics, toneless TBUs in non-initial verb syllables, and CV-noun prefixes. Thus, in (28), the lexical L tones of the verb stem gyàga ‘buy’ and the noun stem -njù ‘banana’ remain unaffected, while the GTs specify the unvalued TBUs. Longer verb forms can be seen as typical, since they constitute around 77% of all verbs (see Table 5). GT can easily exploit the unvalued TBUs in these longer forms to provide templatic cues for tense-mood category distinctions. Conflicts with lexical tone only arise in a minority of verbs, since monosyllabic verbs do not have unvalued TBUs. In these cases, the lexical contrast is sacrificed to rescue the templatic cues of GT patterns, as illustrated in (29), where GT results in a H surface form for both H(L) and L verb roots.
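The logic of dominance ‘under duress’ can be sketched as a docking procedure over TBUs. The following Python illustration is a simplification under stated assumptions, not the analysis itself: the function name `dock_grammatical_h` is hypothetical, a verb is reduced to a list of TBUs with `None` marking an unvalued TBU, and docking on the rightmost unvalued TBU is an assumption made for the verb-final realis H. Tones of TBUs that remain unvalued are left unspecified.

```python
from typing import List, Optional


def dock_grammatical_h(tbus: List[Optional[str]]) -> List[Optional[str]]:
    """Dock a grammatical H on a verb, replacing lexical tone only under duress.

    tbus holds one entry per syllable: "H" or "L" for a lexically valued
    TBU, None for an unvalued (toneless) TBU.
    """
    if not tbus:
        raise ValueError("a verb needs at least one TBU")
    result = list(tbus)

    # No duress: an unvalued TBU is available, so the grammatical H docks
    # there and the lexical tone stays intact.
    # Assumption: the rightmost unvalued TBU is the docking site.
    for i in range(len(result) - 1, -1, -1):
        if result[i] is None:
            result[i] = "H"
            return result

    # Duress: no unvalued TBU (monosyllabic verbs). The grammatical H
    # replaces the lexical tone, neutralising the lexical contrast.
    result[-1] = "H"
    return result


print(dock_grammatical_h(["L", None]))  # ['L', 'H']
print(dock_grammatical_h(["L"]))        # ['H']
print(dock_grammatical_h(["H"]))        # ['H']
```

In this sketch, lexical contrast survives wherever an unvalued TBU exists and is lost only in monosyllabic verbs.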
More segmentally complex triggers of GT are not merely segmental additions to the basic system of sole-exponent GT, but come with their own tonal cophonologies (Inkelas & Zoll Reference Inkelas and Zoll2007; Sande et al. Reference Sande, Jenks and Inkelas2020). The negation suffix -lɛ, for instance, is accompanied by a H GT co-exponent that has a different host position in the verb stem than the tense- and mood-marking sole-exponent GTs (§3.2.2). It is dominant, since it targets the first syllable, which is the location of lexical tone, again neutralising lexical contrasts in monosyllabic verbs (30).Footnote 28 It further triggers a different tone pattern of stamp clitics in the first and second person singular and agreement class 1, whereas the other agreement classes exhibit a plain vowel with a H tone.
While there is likely no functional motivation for the difference in tunes from sole-exponent GTs, the pattern can be explained by the observations that i) GT co-exponents and their properties (e.g. as a floating prefix) are lexically conditioned and encoded within the trigger (e.g. the present negation form) itself (Inkelas Reference Inkelas, Booij and Marle1998; Rolle Reference Rolle2018) and ii) replaciveness is the general strategy in Gyeli for resolving competition between lexical tone and GT, with GT winning out ‘under duress’. At the same time, phonological properties of the GT host pertaining to the availability of unvalued TBUs also determine whether the GT will overwrite the underlying lexical tones or not.
True auxiliaries come with idiosyncratic tonal patterns that target the stamp clitic. Competition between GT and lexical tones in auxiliaries, however, cannot be observed, at least not synchronically. This is because these aspect and negation auxiliaries are each associated with a specific tense-mood category (§3.2.3). They encode a grammatical function in a specific grammatical position, which does not allow for testing oppositions of underlying (lexical) tones. Lexical tones of the lexical verb in complex predicates are maintained, since lexical verbs occur in their non-finite forms, in which the lexical tone remains intact (§2.3).
5. Outlook: Gyeli’s place in the broader Bantu context
Eastern and southern Bantu languages are agglutinative and known for their rich verbal morphology with one-to-one mappings of form and meaning. In contrast, Gyeli as a typical northwestern Bantu language is heavily restricted in the addition of verbal morphemes due to its phonotactic limit of three syllables. Gyeli compensates for the restriction of segmental additions to the verb by a complex GT system, which fulfils many functions that other Bantu languages express via segmental morphemes. In turn, Gyeli needs the high functional load and transparency of GT in the absence of rich segmental morphology in verb inflection. It is likely that the loss of segmental morphemes and the formation of a GT system that relies heavily on GT as the sole exponent of a grammatical function are historically interrelated. Comparing the Gyeli system with closely related languages of the area, similar tonal ‘ingredients’ in the VP can be observed. These other languages, however, have more segmental morphology in their inflection paradigms, while tonal patterns seem idiosyncratically distributed over certain TAM categories, with a weak functional load in contrast to Gyeli sole-exponent GTs.
The closest relative, Kwasio (Bantu A81, [nmg]), for instance, has segmental morphemes between the stamp marker and the verb for the recent and remote past tenses as well as for two future tense forms (Woungly Reference Woungly1971). While Kwasio exhibits similar patterns of tense-marking GTs on the stamp marker and verb stem, these patterns are less systematic, as the segmental morpheme seems to carry the bulk of the category encoding. GTs between the verb and a following object are particularly interesting in the languages of the area. Where Gyeli has slots for GT7 (realis marking) and GT8 (object linking), other languages seem to only have one GT slot. Hyman & Lionnet (Reference Hyman, Lionnet, Marlo, Adams, Green, Morrison and Purvis2012) describe metatony in Abo (Bantu A42, [abb]), which is characterised by tonal alternations in certain conjugated verb forms. These tonal alternations in Abo constitute different phonological patterns that come with specific TAM categories, but do not map onto clear functions, unlike Gyeli. The same has been observed for Eton (Bantu A71, [eto]; Van de Velde Reference Van de Velde2008). Tonal alternations on the object, a potential equivalent to Gyeli’s object-linking GT8, are described by Yukawa (Reference Yukawa1992) for Bulu (Bantu A74, [bum]). In Bulu, object tones must match the tone of the final TBU of the verb. Unlike Gyeli, however, this is restricted to certain TAM categories, and the meaning contribution of the GT alternation is rather opaque.
From a historical view, sole-exponent GTs with their high functional load may only have developed through the loss of segmental material in Gyeli, giving rise to a system that maximally exploits all tonal patterns for grammatical distinctions. The heavy reliance on GT as sole exponent of distinctions in the tense-paradigm, for instance, has been made possible and constrained by the overall phonological and grammatical system of the language. One key factor for this system to work is the distinction between valued root (i.e. stem-initial) syllables with lexical tones and non-initial toneless syllables. With a large majority of verbs containing unvalued TBUs, competition between lexical contrasts and faithfulness to GT templates does not typically occur. Monosyllabic roots, which lack unvalued TBUs, are an exception. In these cases, GT wins by overwriting lexical tone, maintaining the grammatical tune. In contrast, with segmental co-exponents, GT has a weak functional load and does not, by itself, seem to contribute to the meaning. It is rather an idiosyncratic cophonology of the segmental exponent, which sometimes idiosyncratically overwrites lexical tone, as in present negation.
Acknowledgements
I am very grateful to two anonymous reviewers and Florian Lionnet for a wealth of constructive comments; I appreciate the time and effort they have put towards improving this article. I would also like to thank Joyce McDonough and Peter Guekguezian for their helpful comments and the editors for organising this special issue.
Competing interests
The author declares no competing interests.