1. Introduction
Metathesis, the local transposition of two segments, has long been an area of debate in phonological theory: whether it exists at all, given its typological rarity (Webb Reference Webb1974; Montreuil Reference Montreuil1985; Powell Reference Powell1985), or whether it simply does not exist as a single transposition mechanism, and instead is the serial application of smaller copy-and-delete or coalescence operations (Besnier Reference Besnier1987; Hume Reference Hume1991; Blevins & Garrett Reference Blevins and Garrett1998; Takahashi Reference Takahashi, Gallagher, Gouskova and Yin2018, Reference Takahashi2019). Although recent work has confirmed the existence of metathesis as a surface phonological alternation (Hume Reference Hume1998, Reference Hume, Hume, Smith and Weijer2001; Canfield Reference Canfield2016, among others), there is still considerable debate over how to analyse such alternations – are they best analysed as transposition, coalescence, successive copy-and-delete mechanisms or feature spreading?
In generative grammar, the choice of how to analyse metathesis has robust implications for phonological typology. If transposition is an operation in the phonology proper, then we may expect for it to arise in similar frequency and distribution to better-known patterns like epenthesis or deletion. Yet the typology of metathesis is far more restricted, as metathesis is often limited to only certain segments or only a few morphemes in a given language (Hume Reference Hume, Hume, Smith and Weijer2001; Horwood Reference Horwood2004, among others). Furthermore, generative grammars will often predict multiple transpositions to be possible, creating long-distance metathesis patterns that have been argued to be synchronically unattested (Poser Reference Poser, Hulst and Smith1982; McCarthy Reference McCarthy2000; Horwood Reference Horwood2004; see potential counterexamples in Blevins & Garrett Reference Blevins and Garrett1998; Mielke & Hume Reference Mielke, Hume, Hume, Smith and Weijer2001; Chandlee et al. Reference Chandlee, Athanasopoulou and Heinz2012). In models like Optimality Theory (OT), the observed typology is unexpected, and yet few proposals have addressed how to eliminate these broad predictions.
In this article, I introduce novel data from original fieldwork on Uab Meto (West Timor, Indonesia), an Austronesian language with robust CV metathesis. While detailed descriptive work on the language exists (Edwards Reference Edwards2016, Reference Edwards2018, Reference Edwards2020; Culhane Reference Culhane2018), Meto metathesis has not yet figured into these theoretical discussions. Meto metathesis is both common and productive, occurring with almost all segments in the language, and shows robust interactions with many aspects of the language’s phonology. Through an in-depth case study, I investigate how metathesis interleaves with other phonological processes in the language like epenthesis and deletion, and conclude that metathesis only surfaces where multiple phonological patterns interact.
Based on these data, I argue that there is no transposition in phonology. I analyse Meto metathesis as covert spreading along a CV skeleton, where a timing slot deletes and then a vowel feature spreads leftwards. This view explains why Uab Meto metathesis has such rich interactions with epenthesis, deletion and spreading in the language – these processes are the precursors to phonological metathesis, and so metathesis can only surface where these processes interact. True transposition, if it exists, must be analysed as a morphophonological operation that is driven by morpheme-specific requirements rather than the global phonology. I cast the analysis in Harmonic Serialism (McCarthy & Pater Reference McCarthy and Pater2016), a relative of OT.
The article is organised as follows: §1 introduces the analysis and discusses some initial alternatives. §2 then turns to the full set of CV $\rightarrow$ VC metathesis alternations and shows how Meto metathesis occurs in complementary distribution with other coalescence operations like diphthongisation and deletion. §3 discusses epenthesis in the language and how this relates to locality restrictions on spreading. §4 discusses alternatives and predictions for the typology of metathesis, and §5 concludes.
1.1 Introducing the pattern: metathesis under suffixation
Uab Meto is a dialect chain spoken in West Timor, Indonesia. The Molo dialect of Meto has 7 vowels /a, i, ɪ, e, o, ɔ, u/ and 12 consonants /p, f, b, t, s, k, h, ʔ, m, n, l, /.Footnote 1 Unless otherwise indicated, all generalisations and data apply to the Molo dialect, collected in Bijaepunu, North Molo in 2018 and 2019. Data on Kotos Amarasi (from Oekabiti) and Amanuban (from Noenoni) were also collected in Kupang, West Timor at that time. Previous work on the language has focused on Amarasi (Edwards Reference Edwards2016, Reference Edwards2018, Reference Edwards2020) and Amfo’an dialects (Culhane Reference Culhane2018). Although they are not mutually intelligible with Molo, I offer comparisons with these dialects as the opportunity arises.
Uab Meto has apparent metathesis in CVCV(C) roots when they combine with a vowel-bearing suffix. The end effect is that stress, which is fixed on the penult of a root (e.g. CV́CVC [ˈkokɪs] ‘bread’), then aligns with the penult of the word (e.g. /ˈkokɪs-e/ $\rightarrow$ [ˈkks-e] ‘the bread’). Apparent metathesis occurs only when it improves the prosodic output.Footnote 2 Examples of this pattern are shown in (1).
I propose that apparent metathesis in Meto is not transposition, but instead the serial application of deletion and spreading mechanisms, as in (2). In Step 0, suffixation makes stress antepenultimate. This creates a marked right-edge stress lapse in the phonological word. To correct this, in Step 1, the root-final V-slot deletes through prosodic truncation. Prosodic truncation reduces the post-tonic syllable count at the cost of delinking vowel melody features. In Step 2, the floating vowel melody spreads to the preceding V-slot, giving the surface appearance of transposition even though the features remain in their original order.
The core intuition behind this approach is that Meto metathesis is a way of compressing two syllables into one, whereby the final syllable of a root coalesces with the preceding stressed syllable.
I cast this type of coalescence as autosegmental line-crossing. Although the No Crossing Constraint (NCC; Goldsmith Reference Goldsmith1976) has been previously thought of as a universal, here I allow line-crossing between consonants and vowels. Despite appearances, this approach is not deeply at odds with many spreading-based accounts to vowel harmony (cf. Kimper Reference Kimper2011, Reference Kimper2017). Avoiding violations of the NCC is a major issue for almost all spreading-based accounts of vowel harmony, requiring elaborate representational moves such as assuming planar segregation of consonants and vowels (McCarthy Reference McCarthy1979, Reference McCarthy1981; Steriade Reference Steriade1986, among others), extensive feature geometries (Clements Reference Clements1980, Reference Clements1991; Sagey Reference Sagey1988) or other ways of limiting the NCC to only apply between legitimate targets (see review in Odden Reference Odden1994; Ní Chiosáin & Padgett Reference Ní Chiosáin, Padgett and Lombardi2001). By casting this as line-crossing, I table the issue of representational choice, and contend that the universal prohibition on line-crossing applies only for like spreading over like (cf. Archangeli & Pulleyblank Reference Archangeli and Pulleyblank1994).
That said, Uab Meto still bears restrictions on non-local spreading. For one, only vowels may spread, and spreading is limited to post-tonic environments within morphemes. Parallels to this exist in vowel harmony as well, where some languages only allow harmony to apply in post-tonic environments (e.g. Grabo metaphony; Walker Reference Walker2005, Reference Walker2010). In the analysis, I prevent spreading from creating a full-scale vowel harmony system within morphemes by only relaxing the restriction against line-crossing for delinked features (see §3.1). If features have no associated timing slot, they may spread non-locally, but if associated, spreading must be strictly local. This means that non-local spreading will only occur when a V-slot deletes. I introduce further locality restrictions on spreading as needed.
Line-crossing has been argued to pose conceptual issues for phonetic implementation, and I attempt to resolve these here before moving on. In early work in Autosegmental Phonology, line-crossing was argued to be illicit because it would create segments that must simultaneously precede and follow an intervening segment (see Sagey Reference Sagey1986, Reference Sagey1988). To resolve this, I reinterpret association lines as indicators of gestural overlap rather than simultaneity (Bird & Klein Reference Bird and Klein1990, Gafos Reference Gafos1999; contra Goldsmith Reference Goldsmith1976). If there is an association line between a feature and slot , then some phonetic portion of must overlap a phonetic portion of the slot . When a slot precedes a slot , the midpoint of must precede the midpoint of . The order of features also encodes weak precedence:Footnote 3 if directly precedes , then there must be some phonetic portion of that precedes all given portions of or some portion of that follows all portions of . The result is that when association lines cross, the segment with the crossing association line must fully overlap the crossed segment.
To illustrate, take the gestural score for metathesis of /kokɪs-e/ $\rightarrow$ [kks-e] ‘the bread’ in Figure 1. Under metathesis, the [ɪ] vowel spreads across the intervening [k], overlapping it entirely. The core precedence relations among features are unchanged, because the offset of [ɪ] still follows all portions of [k]. If this were a VC sequence with no line-crossing, we would expect for the [ɪ] offset to precede the [k] offset.
This type of overlap is distinct from strictly local spreading, where the vowel would spread first to the intervening consonant and then to the preceding vowel. A strictly local spreading model may predict that conflicting gestural values will be overwritten, but in metathesis they are not. In §3.2, I provide a phonological argument in favour of treating this overlap as line-crossing rather than strictly local spreading based on a diphthongisation pattern in the language.
In the next section, I introduce the formal implementation of the analysis in Harmonic Serialism. Stress alignment constraints trigger prosodic truncation, and so the resulting floating vowel spreads leftwards to preserve itself.
1.2 Analysis
I cast the analysis in Harmonic Serialism (McCarthy & Pater Reference McCarthy and Pater2016). Harmonic Serialism is a relative of OT that combines aspects of rule-based and constraint-based frameworks. Derivations are serial, with the optimal output for one cycle becoming the input to the next. Derivations converge when the faithful candidate wins; this winning candidate then becomes the output for the entire derivation. Harmonic Serialism also imposes a gradualness restriction on Gen, the phonological component that applies changes to forms. The consequence of this gradualness restriction is that each candidate may only differ from the input by at most one change. Exactly what constitutes one change is an open area of research for Harmonic Serialism. I follow McCarthy (Reference McCarthy2008) in assuming that deletion involves two steps: deletion of a timing slot and deletion of features. To simplify derivations, I assume that syllabification and delinking come for free, even though spreading does not.
The derivation in (2) showing /kokɪs-e/ $\rightarrow$ [ˈkks-e] ‘the bread’ involves three main steps: stress assignment, prosodic truncation and spreading. For stress assignment, Uab Meto stress invariably falls on the penultimate or only syllable of a root. The addition of suffixes does not cause stress to shift, which I assume is because metrical structure cannot be modified after assignment (the assumption that Pruitt Reference Pruitt2010 calls Strict Inheritance). Without fully formalising the analysis, I encode the stress system with the cover constraint RootStress. I also assume that, because stress is penultimate, NonFin $\gg $ Align(X,R):Footnote 4
After stress assignment to the root, suffixes and additional phrasal material are added.Footnote 5 Suffixation creates additional violations of Align(X,R) in (4). To reduce these Align(X,R) violations, the V-slot associated with the [ɪ] vowel deletes and leaves the vowel features floating. Full deletion of features and V-slot would violate the gradualness restriction on Gen, and so candidates like it are not considered. From this point on, floating vowel features are written with the non-syllabic subscript (e.g. ), and featureless slots are written as C or V in tableaux.
In principle, the suffix vowel [-e] could also be a candidate for deletion, but Meto has positional restrictions on truncation that prevent this. A V-slot can only delete when it is (a) the last V-slot of a root and (b) unstressed. In §4.1, I discuss a similar restriction on deletion for C-slots: only word-final C-slots delete. Intuitively, these restrictions follow from the fact that initial segments of words tend to be protected, whereas word-final segments are more prone to undergo alternations (Steriade Reference Steriade1994; Beckman Reference Beckman1998). From here on, I omit candidates that violate these restrictions and assume that they are ruled out by a positional cover constraint on morpheme-initial deletion, Max-Initial.
After prosodic truncation, the derivation has a marked floating vowel melody that is unassociated with any slot. I introduce the constraint *XSpread, which militates against consonant–vowel line-crossing, and *Float, which militates against unassociated features and slots:
In this scenario, Meto prefers to spread rather than delete (MaxF $\gg $ *XSpread), even though that involves crossing a consonantal association line. The vowel coalesces onto the preceding syllable, overlapping the intervening consonant.Footnote 6
After this, the faithful candidate (7b) [ˈkks-e] wins and the derivation converges. There are no floating features, and no further prosodic truncation is possible.
To sum up, I claim that Meto metathesis is prosodically triggered, and acts as a way of preserving vowel features during prosodic reduction. Meto metathesis is not transposition, but instead prosodic truncation of a V-slot followed by spreading.
Before continuing on to more Meto metathesis data, I discuss some salient alternatives: metathesis using transposition (§1.3), coalescence without spreading (§1.4) and allomorphy-based approaches (§1.5). As I will show, the core problem with each of these approaches is that they treat Meto metathesis as the complete transposition of two segments, rather than as gestural overlap.
1.3 Alternative 1: Transposition
In this section, I discuss alternatives that derive metathesis using transposition, a single operation that changes the precedence relations of two segments. While transposition is easy enough to formulate, I argue that analysing metathesis in this way comes at the expense of gross overgeneration and lack of explanatory adequacy for the known typology. I review some broad typological problems with transposition in SPE-style rules and OT, and then introduce specific data from Uab Meto that also suggests metathesised segments do not transpose.
In early work in generative phonology, the transposition operation required a new form for SPE-style phonological rules: 1 2 3 $\rightarrow$ 1 3 2 (see Chomsky & Halle Reference Chomsky and Halle1968 on English and Kenstowicz Reference Kenstowicz1971 on Lithuanian). These rules were not only exceptionally powerful, but also gave the impression that transposition should be like any other operation in phonology, a primitive that should be equally available from language to language. While descriptively adequate, these rules do not successfully predict the restricted typology of metathesis, nor do they easily predict where metathesis occurs in complementary distribution with other processes like deletion or epenthesis (e.g. Rotuman; McCarthy Reference McCarthy1995, Reference McCarthy2000).
Contemporary OT (Prince & Smolensky Reference Prince and Smolensky1993, Reference Prince and Smolensky2004) also usually treats metathesis as transposition, most commonly with the constraint Linearity (McCarthy & Prince Reference McCarthy and Prince1995). However, just like rewrite rules, transposition-based accounts of metathesis in OT tend to overgenerate. For one thing, Linearity must be ranked low in order for metathesis to occur, and so we expect transposition to be a preferred operation throughout the entire phonology of a language. Yet many languages restrict metathesis to only occur between particular morphemes (e.g. Georgian; Butskhrikidze & van de Weijer Reference Butskhrikidze and van de Weijer2003), particular segments (e.g. Faroese and Lithuanian; Hume & Seo Reference Hume and Seo2004) or at the ends of roots (e.g. Kwara’ae; Sohn Reference Sohn1980). These restrictions have led to new families of Linearity-based constraints, which imply a richer typology of metathesis than is actually attested (Horwood Reference Horwood2004).
A greater problem for Parallel OT accounts using Linearity is that the degree of violation should not matter for a dominated constraint – if one transposition is not sufficient, the derivation should still prefer a candidate with multiple transpositions over other operations (cf. McCarthy Reference McCarthy2000). However, this often over-predicts metathesis: metathesis occurring in words of the wrong templatic shape, or long-distance metathesis moving a segment too far. This led to numerous proposals for how to fix this overgeneration issue, ranging from adjacency-preservation constraints (e.g. IO-Adjacency, Carpenter Reference Carpenter2002; Contiguity, Heinz Reference Heinz2005b) to constraint conjunction of Linearity (Horwood Reference Horwood2004) or positional faithfulness constraints (Canfield Reference Canfield2016). None of these proposals adequately explains why the typology is the way it is: Why should transposition be rare? Why should multiple transpositions be unattested, when multiple applications of other phonological processes (like deletion or epenthesis) are fine? The core problem seems to be with transposition itself: as long as transposition is in Gen, OT models will predict a broader typology for metathesis than what actually exists.
On a narrower level, analysing metathesis as transposition also tends to make a number of incorrect predictions for individual metathesis patterns. In Meto, for instance, phonetic and phonological data support the conclusion that metathesised vowel features do not perfectly transpose. In my analysis, I capture this imperfect transposition by proposing that metathesised vowel features remain in situ.
For example, transposition-based models treat metathesis as a complete reordering of two segments. Metathesis under transposition is therefore expected to be phonetically perfect: a metathesised /CVCV/ $\rightarrow$ [CVVC] form should have identical surface phonetics to an underlying [CVVC] form. However, Meto metathesis is not phonetically perfect in this sense, and instead generates phonetically exceptional forms: metathesised VC sequences have greater consonant–vowel overlap than underlying VC sequences. To illustrate, take the Meto metathesised word [ts] ‘sea’ and the non-metathesised word [taɪ-s] ‘sarong’, schematised in (8):
Although these words have the same set of gestures, the precedence relations are not exactly the same. In the metathesised word [ts] ‘sea’, the [ɪ] vowel fully overlaps the [s], palatalising it to [sj]. In contrast, the underlying CVVC word ‘sarong’ does not palatalise [s], showing that the offset of [ɪ] precedes the offset of [s].
Similar types of increased overlap are also seen in fast, casual speech, where metathesised CC forms can sometimes be pronounced as CC, with an excrescent vowel remaining on the right-hand side. For instance, Figure 2 shows a waveform and spectrogram of /manus/ $\rightarrow$ [mns-es] ‘a betel vine’, where an excrescent vowel surfaces after the [n]. In my account of Meto metathesis, this behaviour is expected. During spreading, the core precedence relations among features are unchanged, and so even when a vowel spreads across a consonant, the vowel offset will remain after the consonant offset. In fast speech, sloppy gestural coordination in metathesised forms will yield excrescent vowels as a purely phonetic effect (cf. Hall Reference Hall2003), since the offsets were temporally closer to begin with.
From a phonological perspective, treating metathesis as transposition also fails to predict how templatic word shape determines the surface output. In Meto, CVCV and CVVC words have different phonological behaviour. In small phonological phrases, CVCV roots metathesise to CC to reduce stress lapses at the left edge of the phonological phrase (see §2). However, CVVC roots do not simply diphthongise to CC, but also delete their word-final consonant to become C. This is shown in (9):
In an OT analysis with transposition, it is unexpected that the root CV shape should determine whether we get metathesis or some other phonological alternation. Intuitively, this is because dominated Linearity implies that precedence relations are not important in determining the phonological output. A transposition-based analysis would therefore predict that CVCV and CVVC words should either both surface as CC or both surface as C. This is not the case in Meto, and this behaviour of CVVC words is challenging for any OT account that fully transposes the output. In §4.1, I show how this pattern leads to ranking paradoxes in Parallel OT and Harmonic Serialism, and sketch an analysis that allows us to circumvent these issues by treating metathesis as non-local spreading.
1.4 Alternative 2: Coalescence without spreading
In response to the overgeneration issues with transposition, Takahashi (Reference Takahashi, Gallagher, Gouskova and Yin2018, Reference Takahashi2019) also argues against transposition in Gen. Takahashi dispenses with Linearity entirely, and argues that all metathesis stems from successive fission and coalescence, cast in a serial OT framework. In this way, Takahashi is able to (a) remove several long-distance predictions and (b) derive complementary deletion and metathesis patterns in Rotuman, where templatic word shape determines the alternations present. In contrast, these alternations posed persistent challenges for transposition-based analyses for reasons already discussed – dominated Linearity overgenerates, both by distance and by templatic word shape.
While Takahashi’s approach is conceptually similar to the delete-and-spread model I propose here, there are some formal differences. Takahashi (Reference Takahashi2019) casts Rotuman metathesis as a copy-and-delete pattern that uses indices. Under this account, Rotuman has highly ranked stress-to-weight principle (SWP), and so phrase-medial words will coalesce into heavy, diphthongised syllables. First, the vowel copies leftwards to form a diphthong, and then the original copy of the vowel deletes to satisfy Integrity, e.g. /ˈpu1re2/ → ˈpu1e2re2 → [ˈpu1e2r]. In Takahashi’s account, the two instances of [e2] are separate segments, not a single vowel [e] overlapping the [r] as in my account.
The overall prediction Takahashi’s account makes is that the metathesised CVVC output, [ˈpu1e2r], should be identical to a faithful CVVC sequence. In Meto, this does not appear to be the case: metathesised CVVC sequences are both phonetically and phonologically exceptional. Phonetically, metathesised CVVC sequences have different gestural alignment, resulting in greater consonant–vowel overlap (§1.3) and shorter VV duration (§1.5). From a phonological standpoint, metathesised and underlying CVVC sequences are also distinct. Later on, I present data on diphthongisation (§2.4) and consonant deletion (§4.1) that support giving different representations to each of these CVVC sequences.
Another point of difference between Takahashi’s account and mine is the prosodic constraints driving metathesis. In Takahashi (Reference Takahashi, Gallagher, Gouskova and Yin2018, Reference Takahashi2019), metathesis is driven by the SWP, whereas here they are driven by Align(X,R). Gradient alignment constraints of this type have been challenged on the grounds that they overgenerate midpoint-seeking stress patterns (the ‘Midpoint Pathology’; Eisner Reference Eisner1997; Hyde Reference Hyde2008; Kager Reference Kager2012). However, there may be multiple reasons for this gap. For one, midpoint stress systems are expected to be difficult to learn. Stanton (Reference Stanton2016) argues that to distinguish a midpoint system from an edge-oriented system, learners will need to see many long polysyllabic words (upwards of five syllables). Long words of this type are rare in the world’s languages, and so learners are unlikely to select a midpoint stress system over other alternatives.
The second concern is that theoretical work on the Midpoint Pathology has focused almost exclusively on midpoint-assigning stress systems. However, the Midpoint Pathology could also be understood more broadly as ruling out any phonological pattern that involves high-ranked Align(X,R) and Align(X,L) constraints (Brett Hyde, p.c.). If we take this broader view, it is less clear that the Midpoint Pathology is truly a typological gap. For instance, coalescence patterns like we see in Meto could be an example of the Midpoint Pathology, because Align(X,R) and Align(X,L) conspire together to minimise the length of compounds and phonological phrases (see §2.1).
By contrast, a stress-to-weight analysis of Molo is unsuccessful because it will predict diphthongisation or lengthening even when there is no suffixation. For example, the stress-to-weight analysis might predict lengthening in isolation, e.g. /baˈkaseʔ/ $\rightarrow$ *[ba.ˈka:.seʔ] ‘horse’, instead of [ba.ˈka.seʔ].Footnote 7 In Takahashi (Reference Takahashi2019), these candidates are eliminated by FinalStress, but this is not a valid option in Meto since stress is penultimate. In my analysis, these candidates are ruled out because they do not improve violations of Align(X,R).
To summarise, Takahashi’s analysis does not involve transposition in Gen, but still encounters many of the same pitfalls as a transposition-based account. In particular, it predicts that metathesised CVVC sequences should have the same phonetic and phonological behaviour as faithful CVVC sequences. By contrast, I argue that spreading across a CV skeleton better represents the exceptional temporal relations found in metathesised sequences.
1.5 Alternative 3: Allomorphy-based accounts (Edwards Reference Edwards2016, Reference Edwards2020)
Recent accounts of Meto metathesis have argued against prosodic analyses, instead contending that metathesis is a form of allomorphy (Steinhauer Reference Steinhauer and Steinhauer1996; Edwards Reference Edwards2016, Reference Edwards2018, Reference Edwards2020). Under this analysis, metathesised allomorphs are formed by fully transposing the CV segments. Edwards (Reference Edwards2016, Reference Edwards2018, Reference Edwards2020) claims that this allomorphy is variably conditioned by phonology, syntax, or discourse conditions. Edwards (Reference Edwards2020: 209, 257, 331) lists eight types of constructions, each conditioned by one of these three factors. No general theory is offered for associating construction types with conditioning factors.
The main difference between prosodic and allomorphy-based accounts is the status of vowel length in metathesised CVVC sequences. In the prosodic analysis proposed here, metathesised CVVC sequences are monosyllabic diphthongs (CC) that improve the prosodic output. By contrast, in Edwards (Reference Edwards2016 et seq.), they are disyllabic vowel hiatus (CV.VC) that do not improve the prosodic output. If metathesis is not prosodically improving (following Edwards), it must be allomorphy with non-phonological conditions. If metathesis is prosodically improving (as I propose), then it can be derived by the phonological grammar.
In this section, I lay out my assumptions for vowel length and present a supporting phonetic study in §1.5.1. In §1.5.2, I then contrast these results with Edwards (Reference Edwards2016, Reference Edwards2020) claims about vowel length in the language, and discuss several key issues with Edwards’s phonetic study. Lastly, §1.5.3 reviews the implications of vowel length for Edwards’s analysis. Readers who wish to proceed to the analysis may skip this section, moving directly to §2.
1.5.1 Vowel length in Molo
In this article, I assume Uab Meto has three main categories of vowels: monophthongs, diphthongs and vowel hiatus. Of these, monophthongs and diphthongs are monosyllabic, whereas vowel hiatus is disyllabic. Metathesis will coalesce a disyllabic CVCV word into a monosyllabic CC word. Additionally, I argue there is a diphthongisation pattern in the language (see §2), in which disyllabic CVV(C) words coalesce to monosyllabic C.
In this section, I present a phonetic study that offers supporting evidence in favour of these three categories. The main finding is that vowel hiatus is durationally distinct from diphthongs. I elicited 36 roots in prosodically matched contexts (isolation, short nominal phrases and sentential), for a total of 248 tokens from a single speaker. Data were segmented in Praat (Boersma & Weenink Reference Boersma and Weenink2018), and duration measurements were extracted from text grids with a script. The data were analysed in R (R Core Team 2021). I report only on the isolation forms here, summarising the results in Table 1. The duration column provides the mean duration and its standard deviation, with the range column showing the raw duration range.
The data from Table 1 were then compared using a Welch’s unequal variances t-test, summarised in Table 2. The first factor in each comparison is the baseline.
If duration is apportioned per syllable (Broselow et al. Reference Broselow, Chen and Huffman1997), these results are compatible with treating monophthongs and diphthongs as monosyllabic, and vowel hiatus as disyllabic. Vowel hiatus is substantially longer than any other category. In particular, the fact that vowel hiatus is different from both metathesis-derived and hiatus-derived diphthongs supports separating these V1V2 sequences into different categories.Footnote 8 By contrast, Edwards assumes that Meto has no distinction between diphthongs and hiatus.
That said, metathesised diphthongs are still significantly longer than monophthongs, despite both being monosyllabic. From a phonetic standpoint, this is expected: diphthongs have multiple gestural targets, and so they need more time to reach those targets (e.g. diphthongs in American English; Lehiste & Peterson Reference Lehiste and Peterson1961). We therefore expect metathesised sequences to be long only when they contain a diphthong.
Using the same recordings, I tested this prediction by comparing the penultimate vowels in underlying CVCV words to the penults in words that metathesise into monophthongs (e.g. CV1CV2 and CV1Ca roots, which metathesise to CV1C(C2)). An example of this is the word [ʔbibi] ‘goat’, which metathesises to a monophthong in [ʔbib-e] ‘the goat’. Applying a Welch’s unequal variances t-test, I found no significant differences in length between penults of these types, as shown in Tables 3 and 4.Footnote 9 This again supports treating metathesised sequences as monosyllabic. If metathesis were transposition with no coalescence (e.g. /CVCV/ $\rightarrow$ [CV.VC]), we would expect the vowel to be phonetically long under metathesis.
To sum up, I treat Meto as having monosyllabic monophthongs and diphthongs, and disyllabic vowel hiatus. I claim that metathesis always coalesces a disyllabic CVCV sequence into a monosyllabic CC sequence. Similarly, diphthongisation coalesces a disyllabic CVV(C) sequence into a monosyllabic C sequence. In the next section, I contrast these results with the data reported in Edwards (Reference Edwards2016, Reference Edwards2020).
1.5.2 Vowel length in Edwards (Reference Edwards2016, Reference Edwards2020)
In contrast to my account, Edwards (Reference Edwards2016, Reference Edwards2020) treats metathesised CVVC sequences as disyllabic vowel hiatus. To support this, Edwards (Reference Edwards2020) presents a phonetic study tracking vowel length in metathesised CVVC words and ‘u-form’ CVVC words (e.g. hiut ‘seven’ (from hitu) vs. kuan ‘village’). In the study, 628 tokens were extracted from four naturalistic texts by a single speaker. Edwards compared the duration of the vowels in metathesised CVVC forms (e.g. hiut) to the duration of the vowels in the u-form CVVC words (e.g. kuan), and found no significant differences in length according to a two-tailed t-test. Edwards (Reference Edwards2020: 189) thus concluded that metathesised CVVC and hiatus CVVC forms are both disyllabic.
There are two core problems with the phonetic study in Edwards (Reference Edwards2020). The first is that the u-form CVVC category used in the study is not expected to contain only vowel hiatus, but also some hiatus-derived diphthongs. Edwards (Reference Edwards2020) analyses all lexical roots as having two allomorphs, an m-form and a u-form. In /CVVC/ roots, the u-forms and m-forms are identified by their alternation between CVVC and CVV (e.g. [kuan] vs. [kua] ‘village’, Edwards Reference Edwards2020: 171). Both are claimed to have vowel hiatus. When measuring the vowel hiatus category, the phonetic study used u-forms like [kuan], which can be identified by the presence of a word-final consonant. However, this m-form/u-form distinction does not perfectly line up with where we would expect vowel hiatus versus diphthongisation. For instance, under suffixation I would expect a diphthong for [kn-e] ‘the village’, but Edwards would treat this as a u-form with hiatus because the final consonant is present. These assumptions are expected to artificially lower the mean duration of vowel hiatus in Edwards’s phonetic study, as some diphthongs may be included in the hiatus category.
The second issue is how the data were analysed. The data from Edwards (Reference Edwards2020) phonetic study come from texts, and so none of the tokens are controlled for speech rate, phrasal position or prosody. These factors are expected to dramatically affect vowel length (cf. Edwards Reference Edwards2020: 189), and so a more robust model is needed to evaluate these data. However, Edwards’s phonetic study used a t-test, which cannot account for these factors. As a result, Edwards’s phonetic study is inconclusive: the data are expected to contain meaningful variation that is simply being averaged over. By contrast, in my study, all tokens were elicited in a frame, and so these factors were controlled.
Edwards (Reference Edwards2016, Reference Edwards2020) also claims that Meto has phonetically long vowels in various metathesis environments. For example, Edwards treats metathesis as transposition, and so /CV1CV1/ words are expected to metathesise to [CV1V1C] with a phonetically long vowel. In a phonetic study, Edwards (Reference Edwards2020: 98) claims that this is precisely what happens in Amarasi: /ʔbibi/ ‘goat’ metathesises to a lengthened [ʔbi:b-es] ‘a goat’ under suffixation. However, this phonetic study bears similar problems to the one previously discussed. The data were not elicited in prosodically controlled environments, and then it was analysed using a t-test. Ideally, the analysis would have used a statistical method capable of incorporating phrasal position and stress as independent variables. As is, neither of the phonetic studies in Edwards (Reference Edwards2020) can be considered conclusive.
Outside of these metathesis contexts, Edwards also claims that CV(C) roots have a phonetically long vowel. To account for this, Edwards (Reference Edwards2020: 135) claims that the minimal word in Meto is CVV(C), so all apparent CV(C) words are underlyingly CVV(C). However, the phonetic studies presented do not substantiate this. For instance, Edwards (Reference Edwards2020: 98) presents a study comparing the duration of single vowels, V1V1 vowels and V1V2 vowels extracted from polysyllabic words in texts. Edwards reports that V1V1 sequences are 30 ms longer than single vowels, and again uses a t-test to assess significance.
However, this study does not tell us much about the proposed word minimality effect. For one, this V1V1 category is not well defined. It is unclear if these V1V1 tokens all come from putative CV or CVC roots, metathesised /CV1CV1/ $\rightarrow$ [CV1V1C] words or some mixture of the two.Footnote 10 To convincingly assert that no CV(C) words exist, these cases should have been separately reported on. We also have no indication that durations in this V1V1 category were evaluated to see if their distribution was bimodal, which would indicate that CV(C) and CVV(C) roots were being averaged together. Since there is no convincing evidence to the contrary, I assume henceforth that Molo has monosyllabic CV(C) words.
1.5.3 Implications of vowel length for Edwards (Reference Edwards2016, Reference Edwards2020)
In Edwards’s analysis, metathesis is transposition without coalescence, where /CVCV/ $\rightarrow$ [CV.VC]. Edwards (Reference Edwards2020: 188) argues that because metathesised CVVC forms are disyllabic, there is no clear way metathesis improves the prosodic output.
In §1.5, I examine these claims in the Molo dialect through a small phonetic study, and found different results. Unlike Edwards, I found no evidence that /CV1CV1/ words metathesise into a disyllabic [CV1.V1C] sequence. I therefore treat all metathesised VV sequences as monosyllabic diphthongs, and predict that metathesised VV sequences should only be long when V1V2 qualities are different.
Upon examining the phonetic studies in Edwards (Reference Edwards2020) more closely (§1.5), it appears there are significant methodological errors in the design and analysis. Therefore, Edwards’s claim that there is no coalescence in the language cannot be considered conclusive. Further work is needed on Amarasi to see if there is truly no coalescence in the language. On the other hand, the preliminary data from Molo are compatible with a prosodic account, and so I proceed here assuming that Meto metathesis and diphthongisation coalesce disyllables into monosyllables. If these durational data hold up in future studies, this would provide significant support for a prosodic analysis, because only a prosodic analysis can explain why metathesis and diphthongisation occur in the same environments. On an allomorphy-based account, this connection must be either denied or stipulated.
In the next section, I introduce further data on Meto metathesis and coalescence. I contend that the spreading-based account offers a more robust treatment of Meto phonology as a whole, since it is able to derive a variety of alternations (metathesis, diphthongisation and deletion) under a unified analysis.
2. Coalescence beyond suffixation
In this section, I present an analysis of Meto coalescence alternations. As we saw in §1.1, apparent metathesis reduces right-edge lapses created by suffixation (11a). In this section, I show how metathesis also reduces lapses at the left edge in compounds (11b) and phonological phrases (11c).
In addition to these metathesis patterns, roots of other templatic shapes undergo other coalescence alternations, namely diphthongisation and deletion. These are shown in (12) and (13). These alternations occur in identical prosodic environments to metathesis and also reduce stress lapses.
I now go through each of these cases in turn, starting with metathesis in compounds and phrases (§§2.1 and 2.2), then going on to diphthongisation and deletion subpatterns (§§2.3 and 2.4). Each of these alternations is parasitic on prosodic truncation: a V-slot deletes, and then features spread or remain unassociated to create metathesis, diphthongisation and deletion alternations.
2.1 Coalescing metathesis in compounds
In this section, I focus on morphologically complex words that contain multiple roots. Similar to how suffixation creates right-edge lapses, compounding creates lapses at the left edge of a word. Left edge lapses are dispreferred, but due to positional restrictions on truncation, they can only be improved by deleting a root-final vowel.
In (14), I show examples of compounds. The first root undergoes apparent metathesis, reducing the left-edge lapse by one. Faithful candidates (shown at right) contain more violations of Align(X,L).Footnote 11
I derive this pattern by ranking Align(X,L) below Align(X,R). This left alignment does not affect stress assignment, but can still feed prosodic truncation.
-
(15) Align(X,L): Assign one violation for each syllable that separates the primary stress from the left edge of a prosodic word/phrase (cf. McCarthy & Prince Reference McCarthy, Prince, Booij and Marle1993; Gordon Reference Gordon2002, among others)
In the derivation of /fafi-ʔanaʔ/ $\rightarrow$ [ˌff-ˈʔanaʔ] ‘piglet’ (14c), the first stage of the derivation is cyclic stress assignment. In the first cycle, roots receive penultimate stress, and then in the second cycle, the word promotes the stress of the rightmost root (cf. EndRule-L; Prince Reference Prince1983, McCarthy Reference McCarthy2003).Footnote 12 These cycles of stress assignment produce the output shown in Step 0 of (16), /ˌfafi-ˈʔanaʔ/.
At the input for Step 1, there are two violations of Align(X,L) for the word-level stress. Since Align(X,L) $\gg $ MaxV , the derivation truncates the final V-slot in /ˌfafi/ ‘pig’ to [ˌfaf]. This is shown in (17):
In Step 2, the floating vowel spreads leftwards, giving the appearance of metathesis even though the features remain in situ.
After this, the faithful candidate (18c) [ˌff-ˈʔanaʔ] wins, and the derivation converges. No further truncation is possible, because only unstressed, root-final V-slots may delete (see discussion in §1.2).
In the next section, I turn to metathesis in phonological phrases. Like compounds, metathesis in phrases reduces left-edge lapses. I use the phrasal metathesis data to argue against syntactic accounts of Meto metathesis (e.g. Edwards Reference Edwards2018, Reference Edwards2020).
2.2 Coalescing metathesis in phonological phrases
In phonological phrases ( s), we see an identical pattern to compounds: all roots to the left of primary stress metathesise. From an alignment perspective, the pattern here is the same as in compounds. The rightmost root receives primary stress, and any roots to the left truncate to reduce Align(X,L) violations.
In (19), I show some examples of metathesis in phonological phrases. When there are two roots in one , non-final roots metathesise. In contrast, when the root is final in a phonological phrase, it surfaces in its faithful form.Footnote 13
In previous work, some of these cases have been analysed as ‘syntactic’ metathesis, conditioned directly by phrasal constituency (Steinhauer Reference Steinhauer1993; Edwards Reference Edwards2016, Reference Edwards2018, Reference Edwards2020; see §2.2). In contrast, I view this metathesis as an indirect consequence of the syntax–prosody mapping: small syntactic phrases (NPs and VPs) must align with a edge, and so metathesis will correlate with some syntactic phrase edges, but not with syntactic constituency. Under this analysis, metathesis occurs in every medial root of a , since only the final root bears primary stress.
The prosodic analysis offers clear coverage of how metathesis interacts with focus intonation in the language. As in many languages (Büring Reference Büring, Zimmermann and Féry2009; Féry Reference Féry2013), Meto focus intonation inserts a prosodic boundary to the right of a focused constituent. This has the effect of overriding normal syntax–prosody mappings so that focus intonation bleeds metathesis.
To illustrate, take the focus-sensitive operator ha ‘only’ in (20), which inserts a prosodic boundary after the focused prosodic word /kiso/ ‘see’. This prevents wrapping of the verb and direct object into a single phonological phrase, and so metathesis is blocked in (20b) by NonFin.Footnote 14
This effect is not morphological, as similar results can be found with contrastive focus intonation. If we drop ha but contrastively focus [ˈkiso] ‘see’ with a focus high tone, we obtain the same result.Footnote 15
Focus intonation is valuable in a prosodic account because it also acts as a diagnostic between compound metathesis and phrasal metathesis. Unlike phrases, compounds cannot alternate depending on focus intonation. Only the primary stress of the compound is visible to focus, and earlier stresses may not be promoted. In (21), we see that the first root in the compound [ˌff-ˈʔanaʔ] ‘piglet’ may not receive any focus intonation, either from contrastive focus or a focus-sensitive operator like ha ‘only’:
Under this analysis, focus intonation can only target word- or phrase-level stresses. In compounds, the first root is invisible to focus intonation because it only has root-level stress.
To sum up, here I have argued in favour of a prosodic account to Meto phrasal metathesis. Phonological phrases undergo stress promotion much like compounds, and so pre-tonic roots metathesise to reduce left-edge lapses. Before continuing on, I briefly discuss an alternative account of these alternations, where metathesis is directly conditioned by the syntax. I ultimately dismiss this alternative, because it does not predict syntax–phonology mismatches.
2.2.1 Alternative: syntactic metathesis
The most salient alternative to the prosody-based analysis is a syntactic account, proposed in detail in Edwards (Reference Edwards2016). In Edwards’s account, metathesis can be syntactically conditioned by a head–specifier relation. Nouns metathesise when they have an adjectival specifier, and verbs metathesise when they have serial verb in their specifier. There are two faulty predictions this analysis makes: (i) that NPs can induce only one instance of metathesis and (ii) that metathesis should be able to diagnose syntactic constituency.
In response to (i), we see in (22) that multiple adjectives can be wrapped into a single , where each root undergoes metathesis:
In a prosodic account, this behaviour is predicted: no matter how many phrase-medial roots you add, only the final root bears stress. In a syntactic account, we would need to stipulate that all but the final root in any NP or VP metathesises, since they cannot all be the specifier of N. This stipulation is remarkably similar to my prosodic analysis – only the final roots of NPs and VPs are special – but in the prosodic account this follows from how phrasal stress is assigned.
The core problem with a syntactic analysis is that it predicts that metathesis should be able to diagnose adjunct height. For instance, metathesis should occur on a verb followed by a PP adjunct only when the PP is interpreted in the same domain as that verb. Yet adjunct attachment height is ambiguous in both (19d-i) and (19d-ii). The high-attachment reading persists regardless of metathesis, and the only difference between these two sentences is their intonational contour.Footnote 16 This is not easily compatible with a syntax/allomorphy-based account, and is better analysed as a type of prosodic wrapping (cf. Wrap; Truckenbrodt Reference Truckenbrodt1999, Reference Truckenbrodt and Lacy2006).
An anonymous reviewer suggests that an Edwards-style account would treat the adjunct metathesis in (19d-i) as ‘discourse metathesis’, not syntactic metathesis. Despite listing several examples of where discourse metathesis is expected to occur, Edwards (Reference Edwards2016, Reference Edwards2020) does not provide independent diagnostics for discourse metathesis versus syntactic metathesis. In the absence of diagnostics of this type, I treat syntactic and discourse metathesis as a single phenomenon that is the result of syntax–prosody mappings.
Before continuing on, I discuss a remaining issue for the prosodic account: metathesis in ellipsis environments (cf. Edwards Reference Edwards2016: 287). When answering a yes–no question, it is possible to answer with just the subject and verb, eliding the remainder of the sentence. In these cases, the verb maintains its metathesised form, even though it is phrase-final:
There are several options on how to capture this pattern within a prosodic analysis. For one, the intonation found in these ellipsis environments is not identical to the intonation of most phrase-final words. Phrase-final words (especially those in nominal phrases) generally bear H* or L*+H tones, but verbs preceding ellipsis sites tend to bear L* tones. It is possible that L* tones cannot induce violations of NonFin, and so metathesis will not be blocked in these contexts. A second option is that the ellipsis site is not empty at the time of metathesis – either prosodification occurs before ellipsis takes place, or the ellipsis site contains null prosodic elements. An adequate answer to this question requires more detailed work into intonation and ellipsis in Meto, and so I leave these possibilities for future work.
In the next section, I turn to diphthongisation, another coalescence alternation found in the language. The same contexts that condition metathesis force CVV(C) words to diphthongise. This evidence strengthens the case that Meto metathesis is prosodically driven.
2.3 Diphthongisation: coalescence without metathesis
Outside of metathesis, diphthongisation provides further support for alignment-driven coalescence in Meto. Underlying vowel hiatus shortens into a diphthong to align the primary stress closer to an edge.
In compounds and phonological phrases, diphthongisation reduces a left-edge lapse, as in (24). The coalescence of vowel hiatus into a diphthong reduces violations of Align(X,L).
In contexts with suffixes, diphthongisation reduces a right-edge lapse, as in (25). Diphthongisation is blocked by NonFin in isolation, since then stress would be phrase-final.
In this analysis, the treatment of diphthongisation is almost identical to metathesis: the V-slot deletes, and so the floating vowel features spread leftwards to form a diphthong. Diphthongisation does not apply rightwards, as this often would constitute spreading past a morpheme boundary (see §3.2).
I introduce the constraint *Multiple, which militates against multiple linkage of features and slots:
I show the derivation of /meo-nu/ $\rightarrow$ [ˈm-nu] ‘cats’ in (28). In Step 1, the final V-slot of the root truncates due to Align(X,R), leaving a vowel feature floating. In Step 2, the floating vowel spreads leftwards to the preceding V-slot, violating *Multiple. After Step 2, /m.-nu/ becomes the new input, but no further changes harmonically improve the output and the faithful candidate wins. The derivation converges, yielding [m.-nu] as the output.
The status of diphthongisation in Meto is contested, and previous research claimed there to be no diphthongisation in cases like (24) and (25) (e.g. Edwards Reference Edwards2018: 26). However, there are serious methodological issues with Edwards’s phonetic study (see §1.5.2). When the prosodic context is more controlled (e.g. [ˈku.an] ‘village’ vs. ˈ[kn-e] ‘the village’), there is a difference between vowel hiatus and diphthongs (see §1.5.1).
In the next section, I turn to cases where metathesis is blocked. In these cases, the vowel remains floating instead of spreading leftwards, yielding surface vowel deletion. This pattern provides further evidence in favour of line-crossing, because it shows that metathesis involves spreading that is less local than spreading in diphthongisation.
2.4 Deletion occurs when metathesis is blocked
Meto has a preference against rising-sonority diphthongs, and so non-local spreading is blocked when it would create one. In these cases, the vowel features remain floating instead of reassociating leftwards, giving the appearance of deletion. This holds for words expected to metathesise with suffixes (29a) or in compounds and complex phonological phrases (29b).Footnote 17
However, rising-sonority diphthongs are possible when they do not cross consonantal association lines. They are rare, but some examples derived from vowel hiatus can be found, as in (30):
These data suggest that rising-sonority diphthongs are only illicit when created by metathesis.
I capture this in my analysis using constraint conjunction (Smolensky Reference Smolensky1995). I introduce a HeavyDiph constraint in (31), which penalises rising-sonority diphthongs. I conjoin HeavyDiph with *XSpread to create Heavy $\land $ *XSpr in (32), which is violated when V-slot bearing a rising-sonority diphthong has a crossed association line. I assume a standard sonority hierarchy for vowels (a $\gg $ ɛ, ɔ $\gg $ e, o $\gg$ i, u; de Lacy Reference de Lacy2006: 286).
The constraint Heavy $\land $ *XSpr is undominated, and so it will rule out any rising-sonority diphthong that crosses an association line. Meanwhile, HeavyDiph is dominated, and so rising-sonority diphthongs are licit as long as they are local.
In my analysis, the deletion patterns from (29) are composed of a subset of the operations used in metathesis: assign stress, delete a V-slot and then converge. Since the vowel features are not linked to a timing slot, they are not pronounced (cf. Hyman Reference Hyman, Bogers, Hulst and Mous1986; Kenstowicz & Rubach Reference Kenstowicz and Rubach1987; Rubach Reference Rubach1993).Footnote 19
The crucial step from (33) is Step 2, where we would ordinarily see spreading. In this case, spreading is blocked by Heavy $\land $ *XSpr, since this would create a rising-sonority diphthong [] that is non-local. The features are forced to remain floating, yielding [kibʔ-e] ‘the ant’, as in (34). I assume that [kibʔ-e] is acoustically identical to [kibʔ-e].
In contrast, hiatus-derived diphthongs will not violate Heavy $\land $ *XSpr, and so spreading is preferred over leaving vowel features floating. This is seen in (35) for the derivation of /bian-e/ $\rightarrow$ [bn-e] ‘the other’:
In this pattern, it is crucial that metathesis is non-local spreading rather than spreading that is relatively local along a tier. In a tier-based model, the diphthong generated in (35) [bne] would ostensibly have an identical representation to the illicit diphthong in (34) *[kbʔe]. Both are V-slots associated with vowel features that rise in sonority. For a tier-based model, it is puzzling why diphthongisation should be ruled out in one case but not another, since spreading is still perfectly local along the tier.
This is a well-known problem in related Austronesian languages such as Rotuman (McCarthy Reference McCarthy2000). In Rotuman, falling-sonority diphthongs cannot be generated by metathesis. Besnier (Reference Besnier1987) analyses this pattern using tiers: any spreading that generates a falling-sonority diphthong is blocked, and the vowel must delete instead (e.g. /rako/ $\rightarrow$ [rak] ‘to imitate’ (phrase-medial)). However, this makes the faulty prediction that falling-sonority diphthongs are uniformly illicit. This is not the case – like Meto, Rotuman does allow falling-sonority diphthongs when they are generated locally (e.g. /vao/ $\rightarrow$ [v] ‘net’, McCarthy Reference McCarthy2000: 6).
McCarthy (Reference McCarthy2000) analyses this alternation as resulting from a maximal weight restriction LightDiph, which permits falling-sonority diphthongs only in open syllables. This happens to work in Rotuman because CVCV roots metathesise into closed syllables, whereas CVV roots diphthongise but remain as open syllables. In Meto, a weight-based analysis will not work, because rising-sonority diphthongs can occur in closed syllables, e.g. /buabaʔ-e/ $\rightarrow$ [bb.ʔ-e] ‘gather it’. This leaves only locality as a possible explanation for the [bne] vs. *[kbʔe] distinction. Local spreading can create rising-sonority diphthongs, but non-local spreading cannot. This pattern therefore provides evidence against a tier-based model by showing that spreading in metathesis is truly less local than spreading in diphthongisation.
In models that use coindexation rather than spreading, such as Takahashi (Reference Takahashi2019), we encounter similar problems. The output representations of [bne] ‘the other’ and *[kbʔe] ‘the ant’ have identical surface representations, but are not equally well-formed. In Harmonic Serialism, the way around this problem is to claim that Integrity cannot be violated for high-sonority segments, and so /ki1ba2ʔ-e/ cannot split into [ki1a2ba2ʔ-e] to begin with. The vowel would therefore delete fully, yielding [kibʔ-e] ‘the ant’. A crucial difference between an account using coindexation and one using spreading is that there is no floating feature bundle in the coindexation model. The final vowel is fully deleted, leaving a consonant-final word. In §4.1, I show how this is problematic in Meto, since true word-final consonants undergo deletion in phrases, whereas consonants followed by floating vowel features do not.
As an alternative to the present account, Edwards (Reference Edwards2016) analyses these cases as metathesis, wherein the [a] vowel assimilates to the preceding vowel and lengthens it (e.g. /penaʔ/ $\rightarrow$ [peen] ‘corn’). However, in contrast to Amarasi, the Molo dialect does not have evidence of vowel lengthening in these contexts (see §1.5.1). This raises an interesting set of questions on what the differences really are between these dialects: vowel length could be parametrically set by the phonetics, or Amarasi metathesis could have a weight-sensitive component, with the SWP inducing lengthening if spreading is ruled out (see §4.1). These issues merit independent phonetic study, and so I set them aside for future work.
To sum up, the Molo dialect of Meto does not allow rising-sonority diphthongs to be derived through metathesis, even though rising-sonority diphthongs may occur elsewhere. I analyse this as a restriction on line-crossing for rising-sonority diphthongs. This offers an improvement over tier-based accounts, which cannot distinguish between diphthongs derived from VV(C)# versus VCV# sequences.
2.5 Interim summary
In this section, I provided an analysis of coalescence alternations in Meto, where prosodic factors condition diphthongisation, coalescing metathesis or deletion. Under this analysis, each of these alternations is parasitic on prosodic truncation – a root-final V-slot deletes to improve prosodic well-formedness, leaving floating vowel features that must either spread or remain unassociated. In diphthongisation and coalescing metathesis, the floating features spread leftwards to reassociate with another V-slot. In the deletion cases, non-local spreading is blocked due to the high sonority of the delinked vowel, and so the delinked features remain floating.
In the next section, I turn to epenthetic metathesis, another type of metathesis in the language. Unlike the CV $\rightarrow$ VC coalescing metathesis, epenthetic metathesis is VC $\rightarrow$ CV and does not form a diphthong. However, like coalescing metathesis, epenthetic metathesis is parasitic on prosodic truncation, and so it can only surface in roots that are able to truncate.
3. Interactions with epenthesis
In this section, I explore connections between metathesis and epenthesis in Meto, and present some additional data providing evidence for several locality restrictions on Meto spreading. In particular, I predict that Meto metathesis arises through mechanisms similar to copy-epenthesis, and so in §3.1 I rule out synchronic copy-epenthesis in the language. In §3.2, I also present diphthongisation data that support treating metathesis as line-crossing instead of strictly local spreading.
I first introduce data on epenthetic metathesis, a VC $\rightarrow$ CV alternation that eliminates word-final consonant clusters. I argue that epenthetic metathesis is composed of deletion and spreading mechanisms, just as with coalescing metathesis (§2). The difference is that in epenthetic metathesis, the floating features spread rightwards to an epenthetic V-slot. The main contribution of this section is to establish the locality requirements on spreading active in Meto grammar.
In (36), I show some initial examples of epenthetic metathesis. Epenthetic metathesis eliminates *CC# sequences in non-monosyllabic roots.
It should be noted that (36a) is the only example I have with a non-CVCaC root from my fieldwork. If (36a) is later found to be spurious, we can eliminate predictions of epenthetic metathesis by imposing a ban on rightwards spreading. Under this alternative, we would expect roots to undergo leftwards coalescing metathesis or deletion, while the epenthetic vowel remains featureless (e.g. *[manikna-t] ‘the cold’).
In this analysis, epenthetic metathesis has four steps: stress assignment, epenthesis, truncation and spreading. The derivation of /manikin-t/ $\rightarrow$ [manikni-t] ‘(the) cold’ is shown in (37):
I introduce two constraints: *CC# and DepV . These militate against word-final consonant clusters and V-slot epenthesis. While slot epenthesis is dominated, I treat the constraint against featural epenthesis (DepF ) as undominated in the language. I discuss this in further depth in §3.1.
In tableau form, the derivation of /manikin-t/ $\rightarrow$ [maˈnikni-t] begins by assigning stress, and then epenthesising a V-slot. This is shown in (41). Vowel epenthesis prefers to occur word-internally in Uab Meto (cf. R/L-Anchor; McCarthy Reference McCarthy1995: 123), and so I only consider candidates with epenthesis in those positions.
In Step 2 (42), the post-tonic V-slot truncates to reduce Align(X,R) violations. All other candidates are less well-formed with respect to Align(X,R) or *CC#.
In Step 3 (43), the floating vowel spreads to the epenthetic V-slot. This eliminates both *Float violations in one step. Spreading leftwards (candidate (43c)) is dispreferred because the epenthetic V-slot remains floating and featureless.
In comparison to coalescing metathesis (§2), epenthetic metathesis is rare in Meto. This is largely because epenthetic metathesis only occurs when a CVCVC root combines with a consonantal suffix. Meto has a bias in favour of CVCV roots (Edwards Reference Edwards2020: 135), and so the lexicon is skewed in a way that restricts the environments for epenthetic metathesis. Of the remaining roots that are CV1CV2C, most have [a] as V2 and so epenthetic metathesis could also be analysed as epenthesis (e.g. /CVCC/ $\rightarrow$ [CVCaC]; see §3.1). Under this view, the only unambiguous case of epenthetic metathesis is (36a), /maˈnikin-t/ $\rightarrow$ [manikni-t] ‘the cold’.
Despite appearances, this ambiguity between epenthetic metathesis and true epenthesis is desirable from a learning perspective. A learner’s choice between metathesis and epenthesis will not yield diverging results for most CVCVC roots due to biases in the lexicon, since most CVCVC roots are CVCaC. This strengthens the stability of the Meto metathesis system, since learners can take either analytic route and still produce the correct output for almost all roots.
As an aside, there is also some evidence that Meto spreading cannot cross morpheme boundaries. In words with multiple suffixes, default epenthesis breaks up illicit consonant clusters (e.g. /ʔolɪ-f-m/ $\rightarrow$ [ˈʔl-fa=m]). In these cases, we might have expected epenthetic metathesis, where the truncated vowel spreads across a morpheme boundary (e.g. /ʔolɪ-f=m/ $\rightarrow$ *[ˈʔol-fɪ=m] ‘and the younger sibling’). However, spreading here seems to be blocked by the morpheme boundary, and so the delinked vowel can only spread leftwards, leaving the epenthetic vowel default.Footnote 20 To rule this out, I assume that spreading across morpheme boundaries is prohibited in Meto by an undominated Morph*XSpr constraint.
To sum up, word-final consonant clusters can induce epenthetic metathesis (VC $\rightarrow$ CV). I analyse epenthetic metathesis as the combination of epenthesis, deletion and spreading. In the next section, I show how Meto epenthetic metathesis is dependent on prosodic truncation – where truncation cannot occur, epenthetic metathesis cannot occur. This reveals an important locality restriction on Meto spreading: non-local spreading is only possible for floating features.
3.1 Monosyllabic roots do not metathesise
Monosyllabic roots cannot undergo epenthetic metathesis, and instead have default vowel epenthesis in these contexts. The main reason for this is that monosyllabic roots cannot truncate. Uab Meto has a positional restriction on truncation: only unstressed, post-tonic vowels in roots may delete (see §1.2). Since monosyllabic V-slots are stressed, truncation in these contexts is not possible.
In (44), we see that words with monosyllabic roots have default vowel epenthesis to prevent word-final consonant clusters. The epenthetic vowel [a] is underlined in the examples below.
In this analysis, I treat default epenthesis as a floating, featureless V-slot (cf. Archangeli Reference Archangeli1984, Reference Archangeli1988; Pulleyblank Reference Pulleyblank1988). The phonetics interprets featureless slots as a language-specific default epenthetic segment, in this case [a]. These default epenthetic segments violate *Float, but not DepF . This gives us the constraint ranking DepF $\gg $ *Float $\gg $ *XSpread, which means that epenthetic slots will be default unless they inherit features via spreading.
Historically, this type of constraint ranking has been associated with copy-epenthesis patterns (Kawahara Reference Kawahara2007). If a language allows spreading and disprefers feature epenthesis, then epenthetic consonants should ‘copy’ the features of a nearby segment through spreading. The fact that this cannot happen in Meto monosyllabic roots reveals another restriction on spreading in the language: vowel features cannot spread non-locally if they are already associated. Intuitively, this means that Meto spreading has a contiguity restriction, which permits multiple association only when slots are adjacent. This is conceptually similar to constraints on multiple linkage across syllable boundaries, as proposed for Esimbi ‘flop’ (see Walker Reference Walker1997).
I formalise this spreading restriction as constraint conjunction of *Multiple from (27) and *XSpread. Vowel features can only spread across association lines when they are floating.
-
(45) *Mult $\land $ *XSpr (undominated): ‘Only floating features may cross association lines.’
Assign one violation when a multiply associated vowel feature has an association line that crosses some other association line.
In copy-epenthesis languages, *Mult $\land $ *XSpr is dominated because features spread across an intervening consonant while maintaining their original associations. In contrast, the Molo dialect of Uab Meto has undominated *Mult $\land $ *XSpr. This is schematised in (46):
To illustrate, take the derivation of /ˈplen-t/ $\rightarrow$ [ˈplena-t] ‘the command’. In Step 1 (47), a V-slot is epenthesised to eliminate the *CC# violation.
In Step 2 (48), no further changes harmonically improve the output, and so the faithful candidate (48a) wins and the derivation converges. Copy-epenthesis spreading (candidate (48a)) is ruled out by *Mult $\land $ *XSpr. Deletion of the root’s V-slot is also ruled out (not shown in (48)), because stressed V-slots cannot delete.
At this point, my analysis has independently presented epenthesis and vowel deletion patterns for Meto (see §2.4). It is therefore reasonable to ask if these $\varnothing \sim \textrm{[a]}$ alternations could be analysed as a single phenomenon, instead of positing separate deletion and epenthesis mechanisms. I claim we do need both vowel epenthesis and vowel deletion for Meto, and review some arguments in favour of this here.
I begin with the vowel epenthesis pattern from (44). This pattern must be analysed as epenthesis (and not deletion), due to pairs like [bsoʔ] ‘dance’ and [ʔa-bsoʔ-at] ‘dancer’ in (44c). If the [a] vowel were underlying (i.e. if the UR of ‘dance’ were */bsoʔa/), we would expect for the verb to surface as *[bsoʔa] in phrase-final positions to avoid a NonFin violation. However, the verb surfaces as [ˈbsoʔ], and so we are forced to treat the vowel as epenthetic.
Similarly, the vowel deletion cases from §2.4 cannot be reanalysed as epenthesis. For instance, take an alternation like [nine] ‘edge/wing’ and [nin moloʔ] ‘yellow wing’. This must be analysed as deletion, because the missing vowel in [nin moloʔ] ‘yellow wing’ does not have a predictable quality. Furthermore, if this were epenthesis we would expect that NonFin $\gg $ Dep, so that /nin/ $\rightarrow$ [nine] ‘wing’ in isolation. This would imply that no stress-final words exist in the language, but again this is not the case (e.g. [ˌmn-ˈfu] ‘wild chicken’, *[mn-fu a]; see §2.1).
That said, the alternations in many Meto words can be analysed as either deletion or epenthesis. For instance, in [ʔutan] ‘vegetable’, the UR could be either /ʔutn/ or /ʔutan/: the derivation will predict identical alternations regardless of UR. By Richness of the Base, any [CVCaC] word can have either /CVCaC/ or /CVCC/ as its UR.Footnote 21 I take this as an advantage of the present analysis: where there is unclear evidence in favour of deletion or epenthesis, the grammar will tolerate either option.
I now return to discuss the locality constraint proposed in this section, *Mult $\land $ *XSpr. This constraint prohibits crossing association lines connected to multiply linked features, and is the only thing that prevents Molo from having copy-epenthesis. I therefore predict that languages with synchronic metathesis and copy-epenthesis should be quite similar, since they only differ in their ranking of *Mult $\land $ *XSpr. This prediction seems to be borne out. In Ro’is Amarasi, another dialect of Uab Meto, there is preliminary evidence of a copy-epenthesis system. This is shown in (49):Footnote 22
This pattern suggests that *Mult $\land $ *XSpr is dominated in Ro’is Amarasi.
In the present analysis, *Mult $\land $ *XSpr also rules out metathesis for linked features, and so we might predict that metathesis will also behave differently in Ro’is Amarasi. Specifically, if *Mult $\land $ *XSpr is dominated, we predict that line-crossing should be possible even when vowels do not delete. This prediction is correct: Ro’is Amarasi diphthongises even in isolation (e.g. /manus/ $\rightarrow$ [ma͡unus] ‘betel vine’; Edwards Reference Edwards2020: 195). We can capture this pattern by saying that Ro’is Amarasi differs from Molo in two respects: (i) *Mult $\land $ *XSpr is dominated, and (ii) metathesis is driven by a need to make stressed syllables heavy. By contrast, Molo metathesis is driven by gradient alignment constraints and has stricter locality requirements on spreading, which rule out both copy-epenthesis and diphthongisation in isolation. The fact that Ro’is Amarasi has both copy-epenthesis and diphthongisation in isolation is encouraging, since the present analysis uses *Mult $\land $ *XSpr to militate against both.
In the next section, I turn to consonant epenthesis in Meto. While not strictly related to metathesis, consonant epenthesis provides evidence in favour of treating metathesis as line-crossing rather than coindexation or strictly local spreading.
3.2 Consonant epenthesis and diphthongisation
In this section, I focus on the relationship between consonant epenthesis, metathesis and diphthongisation. I argue that epenthetic consonants receive their features from adjacent vowels by spreading (Staroverov Reference Staroverov2014), building on existing accounts of Meto consonant epenthesis (Edwards Reference Edwards2016, Reference Edwards2020; Culhane Reference Culhane2018). The contiguity restriction on Meto spreading, enforced by *Mult $\land $ *XSpr, means that consonant epenthesis bleeds metathesis. This pattern provides indirect evidence in favour of viewing metathesis as spreading, rather than some other type of coindexation.
In (50), I show examples of consonant epenthesis in the Molo dialect. Consonant epenthesis prevents vowel hiatus across a morpheme boundary, but bleeds metathesis of the truncated vowel:Footnote 23
The quality of the epenthetic consonants in (50) is predictable from the underlying final vowel of the root. Round vowels condition [b], front mid vowels condition [l] and high front vowels condition []. These relationships are unusual, but not unheard-of in consonant–vowel spreading paradigms. In Samoan, for instance, vowel epenthesis in loanwords shows similar tendencies: labial consonants condition epenthetic /u/ and coronal consonants condition epenthetic /i/ (Uffmann Reference Uffmann2006).
There are several reasons why these consonants must be epenthetic, rather than underlying, and I briefly summarise them here. First, if the consonants in (50) were underlying, then most of these words would have a /CVCVC/ templatic shape (e.g. /fatub/ for (50a)). Words of this templatic shape are expected to metathesise (e.g. /kokɪs-e/ $\rightarrow$ [kks-e] ‘the bread’), but the words in (50) cannot (e.g. *[ftb-e], cf. (50a)). Second, plural allomorphy suggests that these words are vowel-final. The plural morpheme has three allomorphs: /-nu/ after VV sequences, /-n/ after CV and /-in/ after consonants (see data in the Supplemental Material). Words that are clearly CVCVC take /-in/ (e.g. /kokɪs-in/ $\rightarrow$ [kks-in] ‘breads’), but the words in (50) all take /-n/ (e.g. [fatu-n] ‘stones’, *[fatub-in], *[ftb-in]). This, again, is evidence that these words are vowel-final, since there is no clear phonotactic reason why one CVCVC word should take /-in/ and the other /-n/. I therefore analyse these consonants as epenthetic, following Edwards (Reference Edwards2016: 165) and Culhane (Reference Culhane2018).
In this analysis, the consonant epenthesis pattern has four main steps: stress assignment, C-slot epenthesis, vowel truncation and spreading. C-slot epenthesis is driven by *V-V, which penalises vowel–vowel transitions at morpheme boundaries.Footnote 24 After spreading to C, metathesis is blocked by *Mult $\land $ *XSpr, even though this leaves the vowel features associated only with a C-slot.
I assume that vowel features prefer to be associated with at least one V-slot (LetVbeV, cf. *Link(C,V); Uffmann Reference Uffmann2006: 1096). This constraint is dominated by *Float in the Molo dialect, and so spreading will target the C-slot instead of the preceding V-slot. In tableau form, the crucial step of the derivation for /fatu-e/ $\rightarrow$ [ˈfatb-e] is shown in (52):Footnote 25
After this step, spreading of the vowel to the preceding V-slot is ruled out by *Mult $\land $ *XSpr, since features can associate with multiple slots only if the slots are adjacent.
Under this account, we expect there to be no restrictions on multiple association for adjacent segments. This means that in CVV words, consonant epenthesis does not interfere with diphthongisation:
Molo has one exception to this pattern: high front vowels cannot multiply associate. In these cases, consonant epenthesis bleeds diphthongisation:
These patterns are simple to derive: LetVbeV outranks *Multiple, and so we get one more instance of spreading after Step 3 in (55). For high front vowels, Meto has an undominated *Multiple[+hi,+fr] constraint that prevents [i, ɪ] from associating with more than one slot.
In an alternative to the present account, Edwards (Reference Edwards2016: 198) analyses consonant epenthesis as being driven by Onset rather than *V-V. If we stipulate that metathesis cannot form valid onsets, this is a viable alternative within the present account.Footnote 26
Returning to the data from (54), I argue that this pattern with high front vowels provides indirect evidence in favour of treating metathesis as line-crossing rather than strictly local spreading. In a strictly local spreading model, metathesised vowels would spread first to the intervening C-slot and then to the preceding V-slot. Every instance of metathesis would have a vowel that is linked to two slots. The problem with this account is that we need to rule out multiple linkage for high front vowels; otherwise, we would expect diphthongisation under consonant epenthesis (e.g. /fai-e/ $\rightarrow$ [fa-e], *[f -e] ‘the fire’). However, this incorrectly predicts that metathesis should not be possible for high vowels in Molo. There is no such restriction – high vowels can metathesise (e.g. /fani/ $\rightarrow$ [fn] ‘return’ (phrase-medial)). This supports the conclusion that metathesis is different from the multiple linkage seen with diphthongisation and epenthetic consonants.
To summarise, Uab Meto consonant epenthesis involves spreading of a truncated vowel to an epenthetic C-slot. This pattern reveals an unusual restriction on spreading: non-local spreading is only possible for floating features. Given this restriction, it follows that Meto metathesis is parasitic on prosodic truncation because only prosodic truncation will generate floating features. I summarise the final constraint ranking in Figure 3.
4. Discussion
In this section, I review alternatives to the analysis proposed here, and then turn to implications this proposal has for the typology of metathesis. Among the alternatives, I consider transposition-based accounts, SPE-style rewrite rules using spreading, indexation-based copying (Takahashi Reference Takahashi2019) and allomorphy-based approaches (Edwards Reference Edwards2018, Reference Edwards2020). Of these, Takahashi (Reference Takahashi2019) comes closest to deriving the typology, but still falls short on deriving the correct phonetic and phonological behaviour for metathesised consonant–vowel sequences. I then discuss what the present proposal means for the typology of metathesis, and lay out some discrete predictions for the distribution of spreading-based versus infixation-based metathesis.
4.1 Alternatives
Previous work in OT has struggled with two incorrect predictions about the typology of metathesis: (i) long-distance metathesis patterns (e.g. abcd $\rightarrow$ dabc) and (ii) multiple metatheses (e.g. abcd $\rightarrow$ badc). Both of these patterns have been argued to be unattested (see McCarthy Reference McCarthy, Baković, Ito and McCarthy2006), and yet Parallel OT generates each one without problems. In the analysis presented here, both of these predictions are eliminated. The long-distance metathesis pattern is eliminated by assuming the NCC is universal for like over like – consonants cannot spread over like consonants, nor vowels over like vowels (cf. Archangeli & Pulleyblank Reference Archangeli and Pulleyblank1994). When combined with the restriction on spreading across morpheme boundaries (§3), this effectively limits Meto metathesis to root-final syllables without further stipulations. In the typology at large, like-over-like spreading restrictions will also limit metathesis to adjacent syllables in most cases.
On the other hand, the multiple-metathesis pattern is largely eliminated by gradualness requirements in Harmonic Serialism. For instance, multiple metathesis in /apetka/ $\rightarrow$ [pateka] is ruled out via the assumption of harmonic improvement, since each intermediate stage between /apetka/ and /pateka/ must be more well-formed than the last (see discussion in McCarthy Reference McCarthy, Baković, Ito and McCarthy2006). In contrast, Parallel OT will predict these patterns to be possible, since all that matters are the net final violations incurred by epenthesis, deletion and spreading. The only time we see something that appears like a multiple-metathesis pattern in Meto is when multiple roots metathesise in compounds and phrases, in which case each root only undergoes a single instance of local CV metathesis. Under this approach, this restriction is expected: metathesis can only occur in syllables that truncate.
Harmonic Serialism has been criticised in recent years on the grounds that it exceeds computational limits expected of phonology. For example, Lamont (Reference Lamont2018) observes that Harmonic Serialism with local transposition in Gen requires use of a Turing machine, since it can model alphabetical sorting. Phonology has been hypothesised to require only finite-state transducers, and so the fact that Harmonic Serialism exceeds this level of expressive power is seen as a serious formal overgeneration issue. This issue is significant, but perhaps not fatal to Harmonic Serialism. Instead, I treat it as strong evidence that we should build new restrictions into the formalism. Eliminating transposition from Gen, as argued for in this article, may be one such example of how Harmonic Serialism could be restricted to help alleviate these formal overgeneration issues.
In SPE-style rewrite rules, it is possible to implement a near-identical analysis to the one proposed here, but with each step implemented via rule rather than tableau. The problem with this is that it decouples the properties of the stress system from the phonological alternations. In principle, a rule-based account should be able to derive Meto metathesis for languages with any type of stress system, since rules of stress assignment and prosodic truncation may be independently manipulated. In contrast, the spreading-based account predicts that Meto metathesis is tightly linked to its stress system: truncation is driven by Align(X,R), which also contributes to penultimate stress assignment. If the Harmonic Serialism account is right, we should only see metathesis systems like this in languages that favour gradient alignment of stress towards edges.
In addition to arguing in favour of Harmonic Serialism, I also employ an enriched CV structure, which allows us to distinguish phonological feature order from surface-level gestural timing relationships. The core argument in favour of this bidimensional CV representation is that metathesised segments often do not have phonetic or phonological behaviour consistent with their surface form (see §1.3). This is predicted under the present analysis because feature order does not change.
For concreteness, I introduce one more argument along these lines, this time using a consonant deletion pattern in the language. While consonant deletion does not directly figure into metathesis, its positional restrictions reinforce the claim that metathesis does not change feature order. In (56), underlying word-final consonants delete when a word does not bear primary stress.
By contrast, the metathesis-derived word-final consonants in (57) are immune to this restriction and do not delete.
I analyse this as a restriction on consonant-final words, *Unstr-FinalC: a word can have a final C-slot only if it bears primary phrasal stress (cf. Final-C, McCarthy & Prince Reference McCarthy and Prince1994: 22).Footnote 27 In (56), this forces word-final C-slots to delete when they are phrase-medial. On the other hand, metathesised words from (57) do not have a word-final consonant – there is a floating vowel feature at the end of the word – and so they do not incur a violation of this constraint. In this way, metathesised words behave as though no transposition has occurred: their surface phonological behaviour is consistent with their underlying precedence structure.
For other models of metathesis, whether they use transposition, index-based coalescence or rules, this pattern is troubling. If metathesis fully transposes a CV sequence to VC, why does the consonant not delete in (57)? A tempting possibility is to appeal to some type of output–output faith here, where consonants occupying medial positions in one output form must be preserved in other outputs as well. However, this leads to a ranking paradox. First, we know that *Unstr-FinalC must be outranked by Align(X,L), because otherwise metathesis would be blocked in (57) to avoid the word-final consonant.
In addition to this, vowel epenthesis shows us that MaxC $\gg $ Dep; otherwise, we would see consonant deletion instead of epenthesis in (59).
Lastly, we know that Dep $\gg $ NonFin $\gg $ Align(X,L); otherwise, we would see epenthesis instead of stress-final words like [kol-ˈkaʔ] ‘crow’ in (60).
This creates a paradox, because (58) and (59) imply that Align(X,L) $\gg $ Dep, but (60) implies that Dep $\gg $ Align(X,L). This paradox suggests that metathesised consonants are not truly word-final, because they must be entirely exempt from the *Unstr-FinalC restriction.
This problem is a deep one, as it applies to any Parallel OT or Harmonic Serialism analysis where the output is fully transposed. For example, in indexation-based models of metathesis such as Takahashi (Reference Takahashi2019), word-final consonants derived by metathesis are predicted to be indistinguishable from underlying ones, both phonetically and phonologically. In §1.3, I introduced data from Meto showing that phonetically, metathesised VC sequences have greater-than-normal overlap (e.g. [taɪs] ‘sarong’ vs. [tsj] ‘sea’). The consonant deletion pattern further reinforces this distinction, since the phonology does not seem to recognise metathesised consonants as true codas.
As a final alternative, I now turn to morphological approaches to Meto metathesis. Edwards (Reference Edwards2018, Reference Edwards2020) proposes that metathesis is a type of allomorphy in which a morphological rule induces transposition in a CV skeleton. Under this approach, the rules for deletion, epenthesis, vowel lengthening and transposition must be independently asserted, instead of being derived directly from the language’s stress system. This is necessary because Edwards treats Amarasi metathesised CVVC sequences as disyllables (Edwards Reference Edwards2018: 44). In the Molo dialect, experimental data suggest that these metathesised VV sequences may be monosyllabic (see §1.3). Provided that this is the case, it is preferable to treat Meto metathesis as prosodically driven coalescence, because it allows unified treatment of a variety of phenomena in the language.
That said, the syllabic status of metathesised CVVC sequences needs further verification for both the Amarasi and Molo dialects, since there are discrepancies between the Amarasi data reported in Edwards (Reference Edwards2018, Reference Edwards2020) and the Molo data reported here. Both phonetic studies are small, and are based on field recordings from just one speaker. At this point, Edwards’s reported facts for Amarasi are consistent with metathesis being partially driven by the SWP, following the lines of Takahashi (Reference Takahashi2019). Under a stress-to-weight analysis, vowels would lengthen, and then deletion and spreading would occur. Vowel lengthening would therefore be predicted in isolation (e.g. /manu/ $\rightarrow$ [ˈma:nu] ‘chicken’) or when metathesis fails (e.g. /penaʔ/ $\rightarrow$ [ˈpe:nʔ-e] ‘corn’). Vowel lengthening in isolated forms would distinguish a stress-to-weight account from the morphological account proposed in Edwards (Reference Edwards2018, Reference Edwards2020) and the alignment-based account proposed here. These predictions are left for future work.
4.2 Predictions for the typology of metathesis
In my account, metathesis is a type of covert non-local spreading, resulting from the serial application of deletion and spreading operations. While this type of approach is not new (e.g. Arabic, McCarthy Reference McCarthy1979; Maltese, Hume Reference Hume1991; Rotuman, Besnier Reference Besnier1987), the Meto case provides unique evidence showing that deletion, epenthesis and spreading are all active in the synchronic grammar. For this reason, I predict that synchronic, productive metathesis should be common in language families with active spreading and deletion patterns, since these are the precursors to apparent metathesis. This prediction shares its core reasoning with earlier diachronic work on metathesis – Blevins and Garrett (Reference Blevins and Garrett1998), for instance, also argue in favour of ‘pseudo-metathesis’ arising diachronically from spreading and deletion precursors. In my account, however, the precursors must also be active in the synchronic grammar.
In Austronesian, I predict that metathesis is common precisely because its precursors – deletion and spreading – are widespread in the family. For instance, prosodic truncation is known to be prevalent throughout the Pacific (Zuraw Reference Zuraw2018). Similarly, vowel spreading has been observed in Samoan loanword epenthesis (Uffmann Reference Uffmann2006), where an epenthetic vowel inherits its place features from a preceding consonant. This pattern is an inverted version of the spreading seen in consonant epenthesis in Molo, where underlying vowels spread to epenthetic C-slots. Copy-epenthesis in Austronesian languages is also fairly common (Blust Reference Blust and Baldi1990; Kitto & de Lacy Reference Kitto and de Lacy1999; Lin Reference Lin2014), and can also be analysed as autosegmental spreading. It is therefore no accident that metathesis is well represented in Austronesian languages: where the precursors of metathesis are common, it is possible for non-transpositional metathesis to arise. I predict that further work in specific languages with metathesis will show phonological evidence of active spreading and deletion sub-patterns, which will be phonetically implemented as gestural overlap. I tentatively put forward the following languages as potentially having metathesis patterns of this type: Sevillian Spanish (Gilbert Reference Gilbert2022), Nivaĉle (Gutiérrez Reference Gutiérrez2015, Reference Gutiérrez2020), Balantak (Pater Reference Pater2003), Zoque (Hall Reference Hall2000), Maltese (Hume Reference Hume1991), Kwara’ae (Heinz Reference Heinz2005a), Leti (internal metathesis; Mills & Grima Reference Mills, Grima and Naylor1980; Hume Reference Hume1997; van Engelenhoven Reference Engelenhoven2004) and Cherokee (Flemming Reference Flemming1996).
While this hypothesis accounts for many cases in the typology of metathesis, it does not capture all of them. Many metathesis patterns show restricted productivity, occurring only with specific morphemes or in certain derived environments. As an example, take Leti ‘external metathesis’, in which the nominaliser n metathesises into a root, as in /n-kili/ $\rightarrow$ [k-n-ili] ‘act of looking’ (Blevins Reference Blevins1999; van Engelenhoven Reference Engelenhoven2004). This type of metathesis creates marked consonant clusters, does not bear any signs of overlap and is morphologically specific. This alternation does not appear to be phonologically optimising, since initial [nk] clusters are licit (Blevins Reference Blevins1999). For Leti, this pattern has been analysed as infixation (Blevins Reference Blevins1999; Kalin Reference Kalin2020), since it seems to be morpheme-driven.
I therefore hypothesise that there are at least two types of metatheses: phonological metathesis and infixational metathesis. Transposition is not possible in Gen, but can be generated by morphophonological processes like infixation. Phonological metathesis is non-transpositional and productive and involves some combination of deletion, spreading and epenthesis. Infixational metathesis is true transposition, bears morphological restrictions and is implemented through morpheme-specific rules such as those used for true infixation.
Exactly how to implement infixational metathesis is beyond the scope of this article, but some possibilities include co-phonologies (Orgun Reference Orgun1996; Anttila Reference Anttila, Hinskens, Hout and Leo Wetzels1997; Inkelas Reference Inkelas, Booij and Marle1998; Inkelas & Zoll Reference Inkelas and Zoll2005, among others), generalised reduplication (Harris & Halle Reference Harris and Halle2005; Arregi & Nevins Reference Arregi and Nevins2012) or prosodic alignment (McCarthy & Prince Reference McCarthy, Prince, Booij and Marle1993; Yu Reference Yu2002). In any of these approaches, it should be possible for infixational metathesis to be non-optimising for the global phonology. If the mechanism for infixational metathesis turns out to be the same as for ordinary infixes, then I predict that infixes and infixational metathesis should have similar distributions. For example, infixational metathesis would be expected have a strong left-edge bias, and so it should occur more frequently at the left edges of morphemes rather than right edges.Footnote 28
In Table 5, I offer some potential cases of each type of metathesis, along with their predicted characteristics. These predictions are left to be tested in future phonetic and phonological studies.
To sum up, I hypothesise that phonology cannot transpose. Phonological metathesis can be decomposed into serial spreading, deletion and epenthesis operations, which when combined give the surface appearance of transposition. On the other hand, I hypothesise that morphophonological operations are responsible for metathesis patterns that do seem to involve true transposition, since these cases are less productive and have morphological restrictions. This would support a model of grammar where transposition is only a syntactic or morphophonological operation, never a purely phonological one.
5. Conclusion
In Uab Meto, metathesis occurs in complementary distribution with a variety of other phonological processes, including epenthesis, deletion and coalescence. Instead of analysing the intricate phonology of the language as happenstance, I derive metathesis from the combination of these synchronic sub-patterns, so that metathesis is essentially a serial delete-and-copy mechanism in the phonology. While this approach is not new (see Mills & Grima Reference Mills, Grima and Naylor1980; Besnier Reference Besnier1987; Hume Reference Hume1991), this places Uab Meto in a previously undescribed position in the typology of spreading phenomena, where non-local spreading is possible only as long as features are not yet associated with a timing slot.
The typological rarity of metathesis thus follows from the complexity of metathesis as a phonological pattern. Phonological metathesis is always based on spreading and deletion operations, and may only arise in languages where the precursors are present and occur in overlapping environments. In the Austronesian family, it so happens that prosodic truncation, spreading and epenthesis are all robust (cf. Blust Reference Blust and Baldi1990; Kitto & de Lacy Reference Kitto and de Lacy1999; Uffmann Reference Uffmann2006; Zuraw Reference Zuraw2018), and so it is unsurprising that metathesis is relatively widespread in the family. Outside of this pathway, I predict that metathesis should be subject to morphological restrictions, and therefore should be derived using morpheme-specific operations such as those used for infixation.
Supplementary material
The supplementary material provides data for the phonetic study on vowel length (§1.5) and an accompanying R script. These materials also contain additional transcribed data for metathesis in nouns and verbs, along with their elicitation contexts. The supplementary material for this article can be found at https://doi.org/10.1017/S0952675723000088.
Acknowledgements
Thanks to Maria Gouskova, Gillian Gallagher, Juliet Stanton and Lisa Davidson for comments on the article, as well as audiences at LSA 2019, Rutgers PhonX, Yale Ling Lunch and PhoNE 2019. Special thanks to the people of Bijaepunu, West Timor, for participating in this research. All data were collected in collaboration with Nona Seko (Sekolah Tinggi Bahasa Asing Cakrawala Nusantara Kupang) in August 2019, under the research title ‘Documenting Traditional Uab Meto’. Special thanks also to Yanti, Peter Cole and Gabriella Hermon for the NSF-sponsored language documentation training program in 2018 (BCS – 1747801), which set the groundwork for this fieldwork. Thanks also to the three anonymous reviewers and the associate editor for their detailed feedback on the manuscript.
Competing interests
The author declares no competing interests.
Funding statement
This research was partially supported by an NSF Graduate Research Fellowship (#DGE2234660).