Methodology in Linguistic Subgrouping

doi:10.1017/9781108758666.002

2 - Methodology in Linguistic Subgrouping

Published online by Cambridge University Press: 15 September 2022

James Clackson

Edited by

Thomas Olander

Show author details

Thomas Olander: Affiliation:
University of Copenhagen

Book contents

Summary

This chapter shows how scholars have both justified, and argued against, the subgrouping of Indo-European in the history of the discipline and sets out the justifications given for methodological choices made by researchers. Since the late nineteenth century, it has been generally agreed that the best supporting evidence for reconstructing a subgroup comprising two or more languages is the presence of non-trivial linguistic innovations which have taken place in common during their prehistory, innovations which were not shared by other languages in the same family. This chapter addresses questions which arise out of this methodology, including whether all shared linguistics innovations should be given equal weight in the assessment of possible subgroups, and whether it is possible to reconstruct dialectal variation in a proto-language.

Keywords

Indo-European linguistics linguistic phylogenetics methodology

Type: Chapter
Information: The Indo-European Language Family
A Phylogenetic Perspective
, pp. 18 - 32

DOI: https://doi.org/10.1017/9781108758666.002 [Opens in a new window]

Publisher: Cambridge University Press

Print publication year: 2022
Creative Commons: This content is Open Access and distributed under the terms of the Creative Commons Attribution licence CC-BY-NC-ND 4.0 https://creativecommons.org/cclicenses/

2.1 Introduction

If two or more languages form a subgroup of a language family, what does it mean? To answer this, it will be helpful to consider the case of three related languages, A, B and C. I shall assume that these three languages are all spoken at the same point in time and are all derived from an unattested proto-language, which I shall call Proto-ABC (I shall also refer to the language family as ABC). If the languages A and B form a subgroup within ABC, this means that it is possible to reconstruct a stage intermediate between Proto-ABC and languages A and B, which I shall call Proto-AB. To put this in other words, there existed a community of Proto-AB speakers at the time when a separate speech community spoke Proto-C, the language ancestral to C. The situation can thus be represented as in Figure 2.1, where languages are placed in a relationship to one another, much as with a family tree of genealogical descent.Footnote ¹ Diagrams such as Figure 2.1 are accordingly called “tree diagrams”.

Figure 2.1 Family tree of the ABC language family

2.2 The History of Subgrouping

The recognition of subgroups of the Indo-European language family precedes the recognition of the language family itself. Reference ScaligerScaliger (1610) was already able to recognise the Romance, Germanic and Slavic families of languages, matrices linguarum in his terms, from shared vocabulary (notoriously using the word for ‘god’ as a diagnostic), and earlier scholars had grouped several languages as one in order to preserve the Biblical notion of seventy-two languages of the world.Footnote ² From the beginning of the nineteenth century, the first scholars of Indo-European operated with subgroups such as Germanic and Slavonic. Thomas Young, in the same article which saw the first use of the term “Indo-European” arranged the languages of the world into a three-step hierarchy: classes (of which Indo-European was one), orders and families (Reference Young1813: 256). Young’s Indo-European class comprised no subordinate orders, but sixteen “families”, some of which are familiar, German, Celtic, Latin and Sclavic, but others less so (Arabian, Etruscan and Cantabrian).Footnote ³ The first representation of the relationship between languages of the Indo-European family by something like a tree diagram is generally attributed to Schleicher, who included a schematic Stammbaum at the beginning of his Compendium (Reference Müller1861: 7) although there was no explanation of how the groupings had been arrived at.Footnote ⁴ The figure from Schleicher’s compendium is reproduced as Figure 2.2. Unlike the diagram given in Figure 2.1, Schleicher’s tree is presented with the parent language on the left, and the daughter languages on the right. The “branches” of the tree are labelled, rather than the nodes as in Figure 2.1.

Figure 2.2 Schleicher’s tree diagram (Reference SchleicherSchleicher 1861: 7).

The first Indo-Europeanists to give serious consideration to the methodology of language subgrouping were the “neogrammarians” (or “Junggrammatiker”), a group of scholars originally based around the University of Leipzig in the 1870s.Footnote ⁵ The neogrammarians are associated today principally with the idea that sound change is regular and exceptionless, but their work on sound change was part of a larger programme which established a firmer basis for comparative linguistics. The neogrammarians were more explicit about how and why they did what they did than their predecessors, with publications on the techniques and practices of linguistic comparison.Footnote ⁶ In the case of subgrouping, the first tangible advance made by the neogrammarians was Hübschmann’s demonstration that that Armenian was not an Iranian language, but a separate branch on its own.Footnote ⁷ The theoretical principles and methods set out for identifying subgroups were put forward by Reference LeskienLeskien (1876), partly as a critical response to the Berlin professor Johannes Schmidt’s work on the “wave model” (Reference SchmidtSchmidt 1872). Leskien was teacher and mentor to many of the neogrammarians, and his work on subgrouping was then refined by Reference DelbrückDelbrück (1880: 135) and Reference BrugmannBrugmann (1884).Footnote ⁸ The methodological advances made by these scholars are enormous. It is to them that we owe the principles that linguistic subgrouping proceeds through the identification of shared innovations, rather than shared archaisms, and the recognition that phenomena which could arise from language contact, such as shared lexical items, should be treated with caution for subgrouping purposes.Footnote ⁹

Indeed, Brugmann’s statement of what constitutes a subgroup (Reference Brugmann1884: 253) has often been cited, and is worth repeating once again:

Es ist hier nicht eine einzelne und sind nicht einige wenige auf zweien oder mehreren Gebieten zugleich auftretende Spracherscheinungen, die den Beweis der näheren Gemeinschaft erbringen, sondern nur die große Masse von Übereinstimmungen in lautlichen, flexivischen, syntaktischen und lexicalischen Neuerungen, die große Masse, die den Gedanken an Zufall ausschließt.

[The proof of a close commonality comes not from a single isolated or a small number of linguistic developments occurring simultaneously in two or more areas, but only through a large number of innovations in phonology, morphology, syntax and vocabulary – a number so great as to exclude chance from consideration.]

Reference BrugmannBrugmann’s 1884 article has set the scene for the subgrouping of the Indo-European family and other language groupings ever since.Footnote ¹⁰ Brugmann discussed the possible subgroups of Indo-European, seeing only two cases where the recognised branches of Indo-European might be grouped together: Indo-Iranian and Balto-Slavic. It is significant that since 1884 there have been no serious suggestions for some of the higher order groupings proposed seen in Schleicher’s family tree, and the Indo-European family continues to be thought of in terms of the branches Brugmann identified.Footnote ¹¹ After the neogrammarians Schleicher’s “Graecoitalokeltisch” and “Slawodeutsch” all but disappear from the academic debate for the next hundred years.Footnote ¹² Representations of the Indo-European family in tree diagrams in the century after Brugmann’s article tended to show the branches of Indo-European radiating out as spokes from a centre.Footnote ¹³ Indeed, the discovery of two new branches of Indo-European in the early twentieth century, Anatolian (of which Hittite was the earliest identified) and Tocharian, had little initial impact on the presentation of the Indo-European languages. Bloomfield’s tree diagram (Reference BloomfieldBloomfield 1933: 315) does not include branches for Anatolian or Tocharian, and Meillet was able to issue a second edition of a book written originally in 1908 (and discussed further below) in 1922 only noting the recent addition of the two branches (Reference MeilletMeillet 1922: 3).

2.3 Criteria for Subgrouping

The reliance on common innovations rather than common retentions, and the need to avoid linguistic agreements that could have arisen independently, or by chance, have been accepted by nearly all those working on subgrouping methodology since Brugmann.Footnote ¹⁴ It has been suggested (Reference DyenDyen 1953: 581–2) that, despite linguists’ theoretical adherence to the methodology of Brugmann, most subgrouping has actually been carried out “by inspection”, that is to say, through the recognition of a large amount of similarities between closely related languages (much as Scaliger was able to recognise that the Romance languages or the Germanic languages belonged together). This may be true at a very basic level, but any serious considerations of subgrouping for individual languages since the 1870s have proceeded through careful application of something like the Brugmannian criteria. This is especially the case for the less well-attested Indo-European varieties, such as Phrygian, Venetic or Lusitanian. If scholars have not used Brugmann’s criteria to test the validity of the Germanic branch, or Slavic, it is because the innovations are numerous and self-evident.

In the rest of this chapter, I shall look first at further clarifications of the criteria for subgrouping given above, before considering alternative models to the family tree. Advances in the neogrammarian methodology outlined above have been made in three principal areas: assessment of what counts as an innovation; ways to avoid “false positives”, that is, apparent shared innovations which actually arise by chance or through language contact; and in the use of computational methods in order to survey large amounts of data (see Chapters 3 and 4).Footnote ¹⁵ I shall discuss the first two of these developments in this chapter, leaving the third to other contributors to this volume.

How do linguists recognise an innovation against a shared retention? In some areas, innovations are easier to detect than others. Once speakers of a language have merged or partially merged two phonemes, this change cannot be undone. Consequently, phonological changes offer the clearest examples of innovations which can be recovered by the historical linguist. As Hoenigswald put it (Reference Hoenigswald, Birnbaum and Puhvel1966: 7), the phonological merger is the “prototype” of the shared innovation. Vocabulary replacement and syntactic change provide examples where it is often more difficult to isolate which development is an innovation and which is not. If languages A and B share a vocabulary item, for example the word for ‘man’ or a verb used to mean ‘stand’, and this vocabulary item is not found in language C, how is it possible to ascertain whether that is a shared retention or a common development of the two languages? Dyen is one of only a few scholars to address this question directly:

If any two or more related languages share a feature, the question arises whether this is a retention or an innovation. If we apply a general rule that such features are taken to be retentions unless there is evidence to the contrary, then a corresponding proto-feature is reconstructed. It follows that (borrowings being excluded) an innovation occurring in two or more languages can be detected only if, as a proto-feature, it contradicts a proto-feature which for some reason appears to be more ancient.

(Dyen 1953: 581)

In the Indo-European domain, researchers have the advantage that the Anatolian languages, now generally agreed to have split first from the Proto-Indo-European parent, can sometimes provide a guide to what forms are more “ancient”. To take one example, Greek, Armenian, Albanian and Tocharian share reflexes of a root ‘hand’, reconstructed as *g̑ʰes‑r- (Greek χείρ, Armenian jeṙn, Albanian dorë, Tocharian A tsar). For Reference Pedersen and EbertPedersen (1924: 225), followed by Reference SoltaSolta (1960: 316–17), this was a significant lexical agreement between these branches. However, it is now clear that the word is also present in the Anatolian branch; the presence of the word in the other languages is much more likely to be a shared retention rather than an innovation.Footnote ¹⁶ Vocabulary items may also be judged to be archaic rather than innovatory through their inflectional class or derivational patterns, or because it is possible to reconstruct a semantic shift in one direction rather than another. Even so, such decisions are often reliant on the judgement of the linguist, and in many cases it is impossible to say whether a lexical agreement reflects an innovation or a retention (Reference Hoenigswald, Birnbaum and PuhvelHoenigswald 1966: 8–9; Reference Klingenschmitt and RasmussenKlingenschmitt 1994: 236).

Innovations in inflectional morphology are also to some extent reliant on the picture the researcher has of the morphology of the parent language, and hence susceptible to the same criticism as the use of shared items of vocabulary for subgrouping purposes. Morphological innovations are, however, generally easier for linguists to spot, since they may be linked to phonological changes and thus more easily linked to a relative chronology.Footnote ¹⁷ Moreover, in inflectional morphology at least, the set of options which can be reconstructed for the parent language is in general much smaller than for lexical innovations. Morphological innovations may also be associated with a larger change in the morphosyntax of the language, such as the creation of a new category or the merger of earlier categories.

In the Indo-European language family, little use has been made of syntactic changes for subgrouping, partly reflecting uncertainties about the reconstruction of Indo-European syntax, with consequent uncertainty about what counts as an innovation.Footnote ¹⁸ In this regard it is important to note recent attempts to find Indo-European subgroups relying on syntactic information put forward by Reference Longobardi and GuardinoLongobardi & Guardino (2009) and Reference LongobardiLongobardi et al. (2013). These researchers, working in a Chomskyan syntactic framework, make use of a set of syntactic parameters, which have been carefully chosen to ensure that the selected parameters show no overlap between them. It is perhaps revealing that the researchers do not attempt to isolate which of the parametric changes count as innovations, relying on the computational approach to identify innovations (Reference LongobardiLongobardi et al. 2013: 148). Given the absence of suitable information about the parametric constraints of most of the older Indo-European languages, this approach has not proved to be especially helpful for refining current thinking on subgrouping in the language family.

The next problem is how to avoid “false positives”, that is, shared innovations between two languages which were not made during a period of genealogical unity but which come about at a later stage in the language histories. In terms of the hypothetical language family discussed at the beginning of this chapter, examples would be developments shared by languages B and C that took place after the break-up of the Proto-ABC community, or shared by A and B but made after the period of Proto-AB. Such shared developments among separate speech communities may reflect a situation of language contact, for example a period when many speakers of A also spoke B, or when speakers of A and B both spoke a third language, or another more complicated contact situation. Alternatively, a shared development made independently is sometimes attributed to “chance”. In effect, what this usually means is that the innovation may reflect a universal tendency of language development, such as the palatalisation of dorsal consonants before front vowels or the “drift” from perfect formations of the verb to perfectives.Footnote ¹⁹ Languages of the same family have inherited similar structures, and it is consequently not unexpected that the same innovations may occur independently.

As has been recognised since Reference MeilletMeillet (1908: 10), understanding the relative chronology of changes is essential in order to determine which shared developments are common shared innovations and which are not. In the terms of the ABC language family, an innovation which is apparently shared by A and B is not diagnostic for subgrouping if it can be shown to have taken place after a development that took place after A had split from B. To take an example from the Italic language family, Oscan and Umbrian have both undergone a process of syncope of short medial vowels so that, for example, an earlier stem *opesā- develops to an Oscan stem úpsa- and Umbrian osa-. But this change is fed by consonant changes in Umbrian, such as the development of intervocalic *d > rs and the palatalisation of velars before front vowels. Syncope, which is not an uncommon change cross-linguistically, is thus an innovation shared by Oscan and Umbrian but is not diagnostic for their subgrouping, since it must have taken place after Umbrian changes not shared by Oscan (see Reference ClacksonClackson 2015: 10).

In the search for diagnostic innovations for subgrouping, it may not always be possible to construct a relative chronology for the feature in question in relation to what else is known about the prehistory of the language family. Accordingly, linguists have looked to assess the likelihood that a particular innovation might be the result of contact or universal processes, rather than a shared innovation. The result is that some shared innovations carry more “weight” than others, which may be dismissed as “trivial” or “insignificant”.Footnote ²⁰ Phonological processes which are frequent across the languages of the world, such as palatalisation, lenition, apocope, are accordingly usually dismissed as easily replicable and non-diagnostic. Many scholars have given greater weight to less common or more “unusual” sound changes, although in the absence of a general cross-linguistic repertoire of all known sound changes, this may rely more on the researcher’s own knowledge than an objective assessment. Note also that in the Indo-European language family, the judgement of whether a change affecting reconstructed consonants such as “laryngeals” or “voiced aspirates” is unusual or not also reflects the reconstructed model of PIE which is used. Individual shared vocabulary which might arise from borrowing from languages now lost is similarly easily discounted for subgrouping purposes. Once again it is innovations in the field of morphology, particularly inflectional morphology, which has been seen as especially significant. Incorporation of the inflectional morphology of one language into another is not unknown in situations of prolonged or close contact, or in particular social situations, but it is generally accounted the most resistant area of language to borrowing.Footnote ²¹ The creation of a new morpheme often reflects the grammaticalisation of a new category or the merger of earlier categories, and accordingly morphological innovations generally have significant structural importance in the languages in question.

The question remains of how many innovations is enough to reconstruct a subgroup? Brugmann rejected the reliance on a “few” innovations, calling rather for a “large number”, but then he had not made the various further refinements of sorting through what were certain, appropriate or significant innovations, using the methodology developed by later scholars and outlined above. Once all potential shared innovations between two branches have been carefully sifted to determine whether they meet all suitable criteria, and those for which there remains any room for doubt have been set to one side, then is it still justified to say that the number remaining is too small to be significant?Footnote ²² Recent family trees of Indo-European arrived at using computational cladistics, which may have examined a large set of data across vocabulary, morphology and phonological changes (as Reference Ringe, Warnow and TaylorRinge, Warnow & Taylor 2002), show a much greater number of branches and subgroups than most of those constructed following Reference BrugmannBrugmann’s 1884 article.Footnote ²³ In the Ringe, Warnow and Taylor tree, binary splits are the norm, as opposed to the fan-like array of earlier trees. The difference partly reflects the ways in which the computational analysis is constructed, but it also reflects the fact that some of the subgroups are constructed on what is in effect quite a small number of shared features. The Greco-Armenian clade, for example, is supported by only six shared lexical features, four of which need not be significant.Footnote ²⁴

2.4 Subgroups and Prehistoric Dialect Continua

So far in this chapter, I have largely followed the assumption that language change operates over uniform speech communities and that language diversification happens when a single speech community splits into two or more separate groups. However, linguistic history is rarely so straightforward. Clean breaks in the tree-diagram, such as that envisaged in our opening example between Proto-AB and Proto-C, may occur as the result of large-scale dispersals of a population after cataclysmic natural disasters, through massive migrations or other situations, but in the majority of documented situations, the diversification of a language into separate, mutually unintelligible, descendants takes place through periods of dialect continua, which might sometimes last for millennia. Indeed, the spoken varieties of Romance, Germanic, Slavic and several other branches of the Indo-European family still can be described, in whole or in part, as dialect continua. Since Reference SchmidtSchmidt (1872), linguists have recognised that the spread of phenomena over dialect continua are not best captured by a tree-diagram model. Schmidt himself famously proposed an alternative to the tree diagram, the “wave theory” (“Wellentheorie”), to explain the rippling effect of linguistic changes over a range of mutually comprehensible varieties.Footnote ²⁵

Leskien and the neogrammarians made a significant advance on Schmidt’s observations by pushing the period of dialectal variation back to the proto-language, rather than, as Schmidt had suggested, a period when it was possible to recognise the first branches of Indo-European. Their methodological justification for this move was that all spoken languages contain some variation, and it is consequently likely that the proto-languages also exhibited variation.Footnote ²⁶ This line of reasoning was followed up by scholars in the early twentieth century, such as Meillet, whose Reference Meillet1908 book Les dialectes indo-européens explored at greater length various shared developments of vocabulary, phonology and morphology that might reflect dialectal divisions within the parent language (Reference MeilletMeillet 1908, second edition Reference Meillet1922). For example, the noteworthy shared agreement of Germanic, Baltic and Slavic in showing *m rather than *bʰ in oblique case markers of the noun could only be explained, according to Meillet, through the supposition of different dialects of Proto-Indo-European (Reference Meillet1908: 119).Footnote ²⁷ The reconstruction of dialects of the proto-language thus allowed historical linguists a way to account for a small number of similarities between languages which were not sufficient on their own to support the reconstruction of a subgroup but were too significant to be ignored. As we have seen, the net effect of this move was that, in contrast to the recognised subgroups lower down the family tree, such as Germanic, Celtic etc., after Reference BrugmannBrugmann (1884), there were only two generally agreed “higher-order” subgroups, Indo-Iranian and Balto-Slavic. The supposition of a “dialectal” Proto-Indo-European could help explain the existence of a small number of exclusive and significant innovations shared between two or more branches, and also the overlapping nature of these agreements, so that some features might be shared between Germanic and Balto-Slavic, and others between Balto-Slavic and Indo-Iranian.

The supposition of a dialectal array of Indo-European has consequently proved popular and in several handbooks of historical linguistics or Indo-European languages it is possible to find “dialect maps” of Proto-Indo-European, with the putative varieties ancestral to the different branches of the family laid out in something approximating to their geographical attestation, with Germanic at the top left and Indo-Iranian in the bottom right, and then isogloss lines linking or separating groups corresponding to shared “dialectal” features, such as the use of *m or *bʰ in oblique case-markers or the operation of a phonological process known as the ruki rule.Footnote ²⁸ Such maps meet with the immediate criticism that they have the potential to include items of different time depths on a single plane. Thus, if the Anatolian branch separated out from the other IE languages first, it is an anachronism to include it in a dialectal area which could not yet have existed.Footnote ²⁹ Reference Meid and RixMeid (1975) has accordingly attempted to reconstruct a “space-time” model for PIE, which is in effect a “three-dimensional” dialect map, incorporating both temporal and dialectal variety.

The reconstruction of a Proto-Indo-European parent language which has variation over time and space meets with a significant methodological objection: it is difficult to falsify. As Reference RingeRinge (2017: 65) elegantly expresses it: “new evidence that is at variance with evidence already in hand can often be accommodated on an abstract dialect ‘map’ without major revisions.” Moreover, it significantly overplays the importance of the evidence which happens to survive. Since 1950 the number of early Indo-European texts available to scholars has been significantly increased through greater knowledge of the Anatolian languages; the decipherment of Linear B and consequent accessibility of the earliest stage of Greek; and discoveries and improvements in the understanding of a number of smaller, fragmentary languages, such as Gaulish, Celtiberian and South Picene. This huge increase in our knowledge of the languages used in the first and second millennia BCE has, paradoxically, made scholars more aware of what has been lost. It is clear that, since the Iron Age, speakers of a relatively small number of language families and subfamilies have been hugely successful in Eurasia, and their dominance has been responsible for the demise of countless other languages, many of which were Indo-European. As Reference Ringe and EskaRinge & Eska (2013: 262–3) note, the branches of Indo-European that we know about are “probably the surviving remnants of what was once a dialect network”, and the apparent sharp distinctions between them are just the reflection of the “pruning” of closer neighbours. Garrett has suggested in a number of articles that this loss of the intermediate languages in a larger dialectal continuum means the dialectal array of Proto-Indo-European after the separation of Anatolian, and probably Tocharian, may not be retrievable (see Reference Garrett, Chang, Liaw and RuppenhoferGarrett 1999; Reference Garrett, Forster and Renfrew2006; Reference Babel, Andrew Garrett and ToosarvandaniBabel et al. 2013). This is not just because of the pruning problem but also because of the fact that the subgroups as we have them may reflect linguistic changes across a dialect continuum, which took place across already divergent dialects. A case in point is Greek, which shows dialectal divisions already in the Mycenaean period, but for which all dialects were to undergo significant shared innovations in the next 500 years. These subsequent wave-like innovations take on a special significance when we have lost so many of the intervening dialects. The combination of shared innovation across an already differentiated dialectal continuum and subsequent “pruning” of intermediate dialects means that the shape of the original dialectal array is forever unobtainable.

Few Indo-Europeanists have been willing to accept Garrett’s arguments for scepticism about subgrouping, however, and most have continued to operate with a branching tree model, with shared innovations as the diagnostic for the construction of a subgroup.Footnote ³⁰ The objections to Garrett’s proposals are founded on a reluctance to give weight to the “unknown unknowns”, that is the unrecorded Indo-European varieties which gave way to the languages which we know about, and which may have formed a bridge between what we now think of as different Indo-European subgroups (see the comment recorded by Garrett of an anonymous referee at Reference Garrett, Forster and Renfrew2006: 48 n. 5; Reference de Vaande Vaan 2008: 1229–30). These varieties doubtless existed, but we don’t know how they would have changed our whole picture of Proto-Indo-European as a whole or, indeed, what they would have been like. To abandon the whole enterprise of subgrouping because we don’t know what we are missing seems a step too far. Moreover, there has been no conclusive demonstration of an Indo-European subgroup that has actually arisen through later convergence.Footnote ³¹ It is likely that subgrouping as currently carried out will continue, even though Garrett’s arguments are a healthy reminder of the importance of considering the relative chronology of linguistic developments, and of guarding against the false reconstruction of a subgroup on the basis of changes which must have actually been convergences.

Footnotes

¹ Reference Hoenigswald, Birnbaum and PuhvelHoenigswald (1966: 3–5) discusses more complicated arrangements between three putative languages A, B and C.

² Reference BorstBorst (1957–63) shows in detail the changing conceptions of languages and language families in the pre-Modern era. For the background to Scaliger’s work, see Reference Simone and LepschySimone (1998: 163–5).

³ Compare Max Müller’s later division of Indo-European languages into divisions, classes and branches (Reference MüllerMüller 1861: 380, discussed by Reference PetitPetit 2012: 25–7). The term Cantabrian is an alternative name for Basque, as used by Adelung.

⁴ See Reference PetitPetit (2012: 22–5) for discussion of an earlier tree-diagram than Schleicher’s, by František Ladislav Čelakovský representing the relations between Slavic languages; Reference BlažekBlažek 2007 gives a survey of the development of tree diagrams after Schleicher.

⁵ See Reference Anna.Morpurgo Davies (1992: 226–78) for the neogrammarian school and its impact on linguistics.

⁶ See for example the two books on theoretical linguistics published in 1880, Reference DelbrückDelbrück 1880 and Reference PaulPaul 1880, discussed by Reference Anna.Morpurgo Davies (1992: 245–51).

⁷ Reference HübschmannHübschmann 1875; see the discussion of Hübschmann’s achievement in Reference Clackson and LéonardClackson 2016.

⁸ In Morpurgo Davies’s words “the neogrammarians, as often, took their cue [sic] from Leskien” (Reference Anna. and Sebeok1975: 650).

⁹ Reference Anna. and SebeokMorpurgo Davies (1975: 650) and Reference PetitPetit (2012: 29–30) associate these ideas directly with Leskien, but as I showed in a recent paper (Reference Clackson and LéonardClackson 2016), they are already implicit in Reference HübschmannHübschmann’s (1875) work on Armenian.

¹⁰ See the discussion of Reference DyenDyen (1953: 580–2), who is the first to use the term “subgrouping” in English.

¹¹ See for example the presentation of the Indo-European languages in Reference FortsonFortson 2010 and Reference Klein, Joseph and FritzKlein, Joseph & Fritz 2017–18.

¹² A Balto-Slavic-Germanic subgroup reappears in the tree-diagram of Reference Gamkrelidze and IvanovGamkrelidze & Ivanov (1984: 415).

¹³ To give just two examples, Reference BloomfieldBloomfield 1933: 312 and the representation of the Indo-European language family in editions of the American heritage dictionary of the English language (first published in 1969).

¹⁴ See Reference PorzigPorzig 1954: 17–52 and Reference ClacksonClackson 1994: 4–11 for surveys of work on Indo-European subgrouping in the twentieth century; Reference Ringe and EskaRinge & Eska (2013: 256–7) and Reference RingeRinge (2017: 63) have recently reiterated the need to base subgroups on significant shared innovations.

¹⁵ See Reference Ringe, Warnow and TaylorRinge, Warnow & Taylor 2002: 66 for the statement that the computational approach to subgrouping “is not intended to replace already existing methods, but to supplement them” (emphasis in the original).

¹⁶ Hittite kessar, Hieroglyphic Luwian istra/i-, Lycian izre/i- (see Reference KloekhorstKloekhorst 2008: 471–2); for the use of the word for ‘hand’ in a recent subgrouping enterprise, see Reference Ringe, Warnow and TaylorRinge, Warnow & Taylor 2002: 82–3.

¹⁷ See further below for the importance of relative chronology.

¹⁸ Reference Ringe, Warnow and TaylorRinge, Warnow and Taylor (2002) include no syntactic features in their data set.

¹⁹ For a survey of the sound changes affecting consonants which occur across Indo-European languages, see Reference KümmelKümmel 2007; on universal paths in the grammaticalisation of tense and aspect, see Reference Bybee, Perkins and PagliucaBybee, Perkins & Pagliuca 1994, which has spawned a large body of work on processes such as the drift from perfects to perfectives (sometimes called “aoristic drift”).

²⁰ The “weighting” of isoglosses is implicit already in Reference HübschmannHübschmann 1875, and highlighted by Reference MeilletMeillet (1908) and Reference PorzigPorzig (1954).

²¹ See Reference Thomason and KaufmanThomason & Kaufman 1988: 18–20 for the dismissal of earlier claims that morphology is impervious to borrowing. Morphological borrowing is not just limited to “exotic” languages: the Latin first declension genitive -aes, found predominantly in texts written by writers with little education, shows the partial transfer of a Greek morpheme (see Reference AdamsAdams 2003: 473–86 for discussion).

²² This is a criticism that has been levelled at me (see, for example, Reference HolstHolst 2009: 53–5) for my “hyper-critical” analysis of the evidence for a Greek-Armenian subgroup (Reference ClacksonClackson 1994).

²³ See the survey in Reference BlažekBlažek 2007.

²⁴ As noted by Reference Ringe, Warnow and TaylorRinge, Warnow & Taylor (2002: 102–3), Reference RingeRinge (2017: 69).

²⁵ See the discussion by Reference PetitPetit (2012: 27–9).

²⁶ As noted by Reference PetitPetit (2012: 31) who cites Reference LeskienLeskien (1876: xv): “auf dem Boden der Urheimat [bestanden] bereits dialektische Unterschiede” [“there were already dialectal differences in the territory of the (Indo-European) homeland”].

²⁷ Reference BloomfieldBloomfield (1933: 314–5) also uses the example of the *m and *bʰ case markers as indications of dialectal differences in PIE.

²⁸ Reference MeilletMeillet’s schematic map (1908: 134) has been followed by many others. Reference AnttilaAnttila 1989: 305 is the most sophisticated with twenty-four isoglosses included; Reference HockHock 1991: 445 has seven isoglosses, Reference Mallory and AdamsMallory & Adams 2006: 73 just six.

²⁹ In the map of Reference AnttilaAnttila (1989: 305), Anatolian sits in the middle separated from all other languages by two isoglosses, one of which is drawn with a thicker line. The maps of Reference HockHock (1991: 445) and Reference Mallory and AdamsMallory & Adams (2006: 73) do not include Anatolian.

³⁰ Reference ClacksonClackson 2007: 14–15 is an exception, but reviewers have taken issue with this point, for example Reference de Vaande Vaan (2008). See also Reference GoldsteinGoldstein 2020: 113, who reiterates the point that the true tree for the PIE phylogeny is unknowable.

³¹ My attempt to explain similarities between languages in the Sabellian subgroup, including Oscan and Umbrian, through convergence (Reference ClacksonClackson 2015) has been countered by Reference FortsonFortson (2017).

References

Adams, J. N. 2003. Bilingualism and the Latin language, Cambridge: Cambridge University Press.Google Scholar

Anttila, Raimo. 1989. Historical and comparative linguistics. 2nd rev. ed. Amsterdam: Benjamins.CrossRef Google Scholar

Babel, Molly, Andrew Garrett, Michael J. Houser & Toosarvandani, Maziar. 2013. Descent and diffusion in language diversification: A study in Western Numic dialectology. International Journal of American Linguistics 79. 445–89.CrossRef Google Scholar

Blažek, Václav. 2007. From August Schleicher to Sergei Starostin: On the development of the tree-diagram models of the Indo-European languages. Journal of Indo-European Studies 35. 82–110.Google Scholar

Bloomfield, Leonard. 1933. Language. New York: Holt, Rinehart & Winston.Google Scholar

Borst, Arno. 1957–63. Der Turmbau von Babel: Geschichte der Meinungen über Ursprung und Vielfalt der Sprachen und Völker. 5 vols. Stuttgart: Hiersemann.Google Scholar

Brugmann, Karl. 1884. Zur Frage nach den Verwandtschaftsverhältnissen der indo-germanischen Sprachen. Internationale Zeitschrift für allgemeine Sprachwissenschaft 1. 226–56.Google Scholar

Bybee, Joan, Perkins, Revere & Pagliuca, William. 1994. The evolution of grammar: Tense, aspect, and modality in the languages of the world. Chicago, IL: University of Chicago Press.Google Scholar

Clackson, James. 1994. The linguistic relationship between Armenian and Greek. Oxford: Blackwell.Google Scholar

Clackson, James. 2007. Indo-European linguistics. Cambridge: Cambridge University Press.Google Scholar

Clackson, James. 2015. Subgrouping in the Sabellian branch of Indo-European. Transactions of the Philological Society 113. 4–37.CrossRef Google Scholar

Clackson, James. 2016. Heinrich Hübschmann and linguistic classification. In Léonard, Jean-Léo (ed.), Actualité des néogrammairiens, 183–201. Paris: Peeters.Google Scholar

Delbrück, Berthold. 1880. Einleitung in das Sprachstudium: Ein Beitrag zur Geschichte und Methodik der vergleichenden Sprachforschung. Leipzig: Breitkopf und Härtel.Google Scholar

Dyen, Isidore. 1953. Review of Otto Chr. Dahl, Malgache et Maanjan: Une comparaison linguistique, Oslo: Gimnes, 1951. Language 29. 577–90.Google Scholar

Fortson, Benjamin W., IV. 2010. Indo-European language and culture: An introduction. 2nd ed. Malden, MA: Wiley-Blackwell.Google Scholar

Fortson, Benjamin W., IV. 2017. The dialectology of Italic. In Klein, Joseph & Fritz 2017–18, vol. 2, 835–57.Google Scholar

Gamkrelidze, T. V. & Ivanov, V. V. 1984. Indoevropejskij jazyk i indoevropejcy, Tbilisi: Tbilisi University Press.Google Scholar

Garrett, Andrew. 1999. A new model of Indo-European subgrouping and dispersal. In Chang, Steve S., Liaw, Lily & Ruppenhofer, Josef (eds.), Proceedings of the twenty-fifth annual meeting of the Berkeley Linguistics Society, February 12–15, 146–56. Berkeley, CA: Berkeley Linguistics Society.Google Scholar

Garrett, Andrew. 2006. Convergence in the formation of Indo-European sub-groups: Phylogeny and chronology. In Forster, Peter & Renfrew, Colin (eds.), Phylogenetic methods and the prehistory of languages, 139–51, Cambridge: McDonald Institute for Archaeological Research.Google Scholar

Goldstein, David. 2020. Indo-European phylogenetics with R: A tutorial introduction. Indo-European Linguistics 8. 110–80.Google Scholar

Hock, Hans Henrich. 1991. Principles of historical linguistics. 2nd rev. and updated ed. Berlin: Mouton de Gruyter.Google Scholar

Hoenigswald, Henry M. 1966. Criteria for the sub-grouping of languages. In Birnbaum, Henrik & Puhvel, Jaan (eds.), Ancient Indo-European dialects, 1–12. Berkeley: University of California Press.Google Scholar

Holst, Jan Henrik. 2009. Armenische Studien. Wiesbaden: Harrassowitz.Google Scholar

Hübschmann, H. 1875. Ueber die stellung des armenischen im kreise der indogermanischen sprachen. Zeitschrift für vergleichende Sprachforschung 23. 5–49.Google Scholar

Klein, Jared S., Joseph, Brian D. & Fritz, Matthias (eds.). 2017–18. Handbook of comparative and historical Indo-European linguistics. 3 vols. Berlin: De Gruyter Mouton.Google Scholar

Klingenschmitt, Gert. 1994. Die Verwandtschaftsverhältnisse der indogermanischen Sprachen. In Rasmussen, Jens Elmegård (ed.), In honorem Holger Pedersen, 235–51. Wiesbaden: Reichert.Google Scholar

Kloekhorst, Alwin. 2008. Etymological dictionary of the Hittite inherited lexicon. Leiden: Brill.Google Scholar

Kümmel, Martin Joachim. 2007. Konsonantenwandel: Bausteine zu einer Typologie des Lautwandels und ihre Konsequenzen für die vergleichende Rekonstruktion. Wiesbaden: Reichert.Google Scholar

Leskien, August. 1876. Die Declination im Slavisch-Litauischen und Germanischen, Leipzig: Fürstlich Jablonowskische Gesellschaft der Wissenschaften.Google Scholar

Longobardi, Giuseppe & Guardino, Cristina. 2009. Evidence for syntax as a signal of historical relatedness. Lingua 119. 1679–1706.Google Scholar

Longobardi, Giuseppe, et al. 2013. Toward a syntactic phylogeny of modern Indo-European languages. Journal of Historical Linguistics 3. 122–52.Google Scholar

Mallory, James P. & Adams, Douglas Q.. 2006. The Oxford introduction to Proto-Indo-European and the Proto-Indo-European world. Oxford: Oxford University Press.Google Scholar

Meid, Wolfgang. 1975. Probleme der räumlichen und zeitlichen Gliederung des Indogermanischen. In Rix, Helmut (ed.), Flexion und Wortbildung, 204–19. Wiesbaden: Reichert.Google Scholar

Meillet, Antoine. 1908. Les dialectes indo-européens. Paris: Champion.Google Scholar

Meillet, Antoine. 1922. Les dialectes indo-européens. 2nd ed. Paris: Champion.Google Scholar

Anna., Morpurgo Davies 1975. Language classification in the nineteenth century. In Sebeok, Thomas (ed.), Current trends in linguistics 13, 607–716. The Hague: de Gruyter.Google Scholar

Anna., Morpurgo Davies 1992. Nineteenth-century linguistics. London: Longman.Google Scholar

Müller, Max. 1861. Lectures on the science of language delivered at the Royal Institution of Great Britain in April, May, and June, 1861. London: Longman, Green, Longman, and Roberts.Google Scholar

Paul, Hermann. 1880. Principien der Sprachgeschichte. Halle: Niemeyer.Google Scholar

Pedersen, Holger. 1924. Armenier. In Ebert, Max (ed.), Reallexikon der Vorgeschichte. Vol. 1, 219–26. Berlin: de Gruyter.Google Scholar

Petit, Daniel. 2012. Débats sur la classification des langues indo-européennes au xix^e siècle. Lalies 32. 9–32.Google Scholar

Porzig, Walter. 1954. Die Gliederung des indogermanischen Sprachgebiets. Heidelberg: Winter.Google Scholar

Ringe, Don. 2017. Indo-European dialectology. In Klein, Joseph & Fritz 2017–18, vol. 1, 62–75.Google Scholar

Ringe, Don Eska, Joseph F.. 2013. Historical linguistics: Toward a twenty-first century reintegration. Cambridge: Cambridge University Press.CrossRef Google Scholar

Ringe, Don, Warnow, Tandy & Taylor, Ann. 2002. Indo-European and computational cladistics. Transactions of the Philological Society 100. 59–129.Google Scholar

Scaliger, Joseph Justus. 1610. Diatriba de europaeorum linguis. In Opuscula varia antehac non edita, 115–18. Paris: Drovart.Google Scholar

Schleicher, August. 1861. Compendium der vergleichenden Grammatik der indogermanischen Sprachen. Weimar: Böhlau.Google Scholar

Schmidt, Johannes. 1872. Die Verwandtschaftsverhältnisse der indogermanischen Sprachen. Weimar: Böhlau.Google Scholar

Solta, Georg Renatus. 1960. Die Stellung des Armenischen im Kreise der indogermanischen Sprachen. Vienna: Mekhitaristen.Google Scholar

Simone, Raffaele. 1998. The Early Modern period. In Lepschy, Giulio (ed.), History of linguistics. Vol. 3. Renaissance and Early Modern linguistics, 149–236. London: Longman.Google Scholar

Thomason, Sarah G. & Kaufman, Terrence. 1988. Language contact, creolization and genetic linguistics. Berkeley: University of California Press.CrossRef Google Scholar

de Vaan, Michiel. 2008. Review of Clackson 2007. Lingua 118. 1228–32.Google Scholar

Young, Thomas. 1813. Review of Johann Christoph Adelung, Mithridates, oder Allgemeine Sprachenkunde, vol. 1–3, Berlin: Voss, 1806. The Quarterly Review 10. 250–92.Google Scholar