SINE QUA NON: INFERRING KODJADERMEN-GUMELNIȚA-KARANOVO VI POPULATION DYNAMICS FROM AGGREGATED PROBABILITY DISTRIBUTIONS OF RADIOCARBON DATES

Gabriel M Popescu; Cristina Covătaru; Ionela Opriș; Adrian Bălășescu; Laurent Carozza; Valentin Radu; Constantin Haită; Tiberiu Sava; C Michael Barton; Cătălin Lazăr

doi:10.1017/RDC.2023.6

SINE QUA NON: INFERRING KODJADERMEN-GUMELNIȚA-KARANOVO VI POPULATION DYNAMICS FROM AGGREGATED PROBABILITY DISTRIBUTIONS OF RADIOCARBON DATES

Published online by Cambridge University Press: 03 March 2023

C Michael Barton and

Gabriel M Popescu: Affiliation:
Research Institute of the University of Bucharest, Division of ArchaeoSciences, University of Bucharest, No. 90, Panduri Street, Sector 5, 050663, Bucharest, Romania School of Complex Adaptive Systems, Arizona State University, 1031 S. Palm Walk, 85281-2701, Tempe, AZ, USA
Cristina Covătaru: Affiliation:
Research Institute of the University of Bucharest, Division of ArchaeoSciences, University of Bucharest, No. 90, Panduri Street, Sector 5, 050663, Bucharest, Romania
Ionela Opriș: Affiliation:
Research Institute of the University of Bucharest, Division of ArchaeoSciences, University of Bucharest, No. 90, Panduri Street, Sector 5, 050663, Bucharest, Romania
Adrian Bălășescu: Affiliation:
Research Institute of the University of Bucharest, Division of ArchaeoSciences, University of Bucharest, No. 90, Panduri Street, Sector 5, 050663, Bucharest, Romania Institute of Archaeology “Vasile Pârvan”, No. 11, Henri Coandă Street, Sector 1, 010667, Bucharest, Romania
Laurent Carozza: Affiliation:
Research Institute of the University of Bucharest, Division of ArchaeoSciences, University of Bucharest, No. 90, Panduri Street, Sector 5, 050663, Bucharest, Romania CNRS, UMR 5602 Géode, Université Toulouse 2 Jean Jaurès, Toulouse, France
Valentin Radu: Affiliation:
Research Institute of the University of Bucharest, Division of ArchaeoSciences, University of Bucharest, No. 90, Panduri Street, Sector 5, 050663, Bucharest, Romania National Museum of History of Romania, No. 12, Calea Victoriei, Sector 3, 030026, Bucharest, Romania
Constantin Haită: Affiliation:
National Museum of History of Romania, No. 12, Calea Victoriei, Sector 3, 030026, Bucharest, Romania
Tiberiu Sava: Affiliation:
IFIN-HH, 30 Reactorului St., Măgurele, Ilfov, 077125, Romania
C Michael Barton: Affiliation:
School of Human Evolution and Social Change, Arizona State University, 900 S. Cady Mall, 85287-2402, Tempe, AZ, USA School of Complex Adaptive Systems, Arizona State University, 1031 S. Palm Walk, 85281-2701, Tempe, AZ, USA
Cătălin Lazăr*: Affiliation:
Research Institute of the University of Bucharest, Division of ArchaeoSciences, University of Bucharest, No. 90, Panduri Street, Sector 5, 050663, Bucharest, Romania
*: *Corresponding author. Email: [email protected]

Article contents

Abstract
INTRODUCTION
DATA
METHODS
RESULTS
DISCUSSION
CONCLUSION
SUPPLEMENTARY MATERIAL
DATA AND MATERIALS AVAILABILITY
CONFLICT OF INTEREST
References

Rights & Permissions

Abstract

Past human population dynamics play a key role in integrated models of understanding socio-ecological change over time. However, little analysis on this issue has been carried out for the prehistoric societies in the Lower Danube and Eastern Balkans area. Here, we use summed probability distributions of radiocarbon dates to investigate potential regional and local variation population dynamics. Our study adopts a formal model-testing approach to the fifth millennium BC archaeological radiocarbon record, performing a region-wide, comparative analysis of the demographic trajectories of the area along lower Danube River. We follow the current framework of theoretical models of population growth and perform global and regional significance and spatial permutation tests on the data. Specifically, we investigate whether populations on both sides of the Danube follow a logistic pattern of steady growth, followed by a major decline over time. Finally, our analysis of local-scale growth investigates whether considerable heterogeneity or homogeneity within the region may be observed over the time span considered here. The results show both similarities and differences in the population trends across the area. Our findings are showcased in relation to the cultural characteristics of the region’s 5th millennium BC societies, and future research directions are also suggested.

Keywords

Balkans fifth millennium BC KGK-VI population dynamics Lower Danube Middle Holocene

Type: Research Article
Information: Radiocarbon , Volume 65 , Issue 2 , April 2023 , pp. 463 - 484

DOI: https://doi.org/10.1017/RDC.2023.6 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2023. Published by Cambridge University Press for the Arizona Board of Regents on behalf of the University of Arizona

INTRODUCTION

The so-called “Kodjadermen-Gumelniţa-Karanovo VI cultural complex” (KGK-VI) is a component of the Southeastern Eneolithic Block (SEB), defined based on material culture by the cultural-historical archaeologists from Bulgaria and Romania. It was identified within a broad region of the Eastern Balkans and Lower Danube Valley, delimited to the north by the Carpathians mountains, but also the steps area from north of the Danube Delta to the east by the Black Sea and reaching as far as the Rhodope mountains and Olt River to the west and the Aegean in the south. The general chronological position of KGK-VI evolution is placed during most of the 5th millennium BC. It is characterized by the widespread development of tell settlements, the emergence of extramural cemeteries (in some cases including wealthy graves), the advent and development of copper and gold metallurgy, along with consistent changes in lithic and ceramic technologies (e.g., graphite and gold pottery painting, exploitation, and processing of various pigments, etc.). Local variants emerged from a common, prior cultural background (e.g., “Varna Culture” on the western coast of the Black Sea) (Todorova and Zhelyaskova Reference Todorova and Zhelyaskova1978; Todorova Reference Todorova1986; Marinescu-Bîlcu Reference Marinescu-Bîlcu2001; Popovici Reference Popovici2010; Petrova Reference Petrova2016; Lazăr et al. Reference Lazăr, Mărgărit and Radu2018; Chapman Reference Chapman2020) and also in areas of interaction with other cultures (e.g., known as Stoicani-Aldeni/Aldeni II cultural group in north-eastern Wallachia) at the intersection between KGK-VI and Cucuteni-Tryplie communities (Frînculeasa Reference Frînculeasa2016), or in previously unoccupied territories (e.g., known as Bolgrad/Bolgrad-Aldeni group) in the region north of the Danube Delta (Todorova and Zhelyaskova Reference Todorova and Zhelyaskova1978; Todorova Reference Todorova1986; Frînculeasa Reference Frînculeasa2016), which marks the most northern and eastern extension of the KGK-VI.

Moreover, the increased mobility of the KGK-VI population is highlighted by a vast exchange network of raw materials and goods. Thus, pottery painted with black, white, and red pigments, graphite or even with gold, along with metal or flint items (e.g., super-blades), adornments made of Mediterranean shells (e.g., Spondylus sp., Dentalium sp., etc.), along with non-local minerals and pigments (e.g., malachite, marble, carnelian, agate, hematite, kaolin, etc.) are common discoveries in the KGK-VI sites (e.g., tells, flat settlements, off-tells, and cemeteries) located in areas that usually lack all of these raw materials (Bailey Reference Bailey2000; Todorova Reference Todorova and Grammenos2003; Anthony et al. Reference Anthony and Chi2010; Popovici Reference Popovici2010). All of this reflects the adoption of new lifeways, economic development and a new way of environmental exploitation and control, in parallel with the development of complex and stratified society and the emergence of elites, as demonstrated by the wealthy graves from Varna I cemetery or other domestic/funerary discoveries (Bailey Reference Bailey2000; Chapman et al. Reference Chapman, Higham, Slavchev, Gaydarska and Honch2007; Anthony et al. Reference Anthony and Chi2010; Slavchev Reference Slavchev2010; Chapman Reference Chapman2020). Consequently, starting from that flourishing development of the SEB human groups, without any parallel in the past, some scholars described the period as the “Ex Balcanae Lux” phenomenon (Todorova and Zhelyaskova Reference Todorova and Zhelyaskova1978; Sterud et al. Reference Sterud, Evans and Rasson1984), the “Climax Copper Age” (Chapman et al. Reference Chapman, Higham, Slavchev, Gaydarska and Honch2007) or the “Golden 5th Millennium BC” (Boyadzhiev and Terzijska-Ignatova Reference Boyadzhiev and St2011), in order to highlight the extraordinary progress of the Neolithic communities.

Within this large region, different human groups have developed specific material culture signatures, or archaeological “cultures,” during this period. Recent aDNA investigations in southeastern Europe have demonstrated that populations responsible for the above mentioned “cultures” of the second half of the 6th millennium and 5th millennium BC, KGK-VI included, have similar genetic features and common ancestry due to their common origin in southwestern Anatolia (Hervella et al. Reference Hervella, Rotea, Izagirre, Constantinescu, Alonso, Ioana, Lazăr, Ridiche, Soficaru and Netea2015). Along with the Anatolian Neolithic ancestry, those populations showcase sporadic evidence of steppe-related ancestry (specimens in Varna I and Smyadovo cemeteries in Bulgaria), but also a consistent hunter-gatherers related ancestry (some resilient native Mesolithic groups from the target area), which indicates a complex population structure and admixture, with several genetic components (Mathieson et al. Reference Mathieson, Alpaslan-Roodenberg, Posth, Szécsényi-Nagy, Rohland, Mallick, Olalde, Broomandkhoshbacht, Candilio and Cheronet2018).

The last decade has seen major advances in developing theoretical, analytical, and methodological instruments, concerning the understanding of demographic change out of large datasets of archaeological radiocarbon (¹⁴C) dates, in different parts of the world and encompassing large temporal spans of the Late Pleistocene and Holocene (Shennan et al. Reference Shennan, Downey, Timpson, Edinborough, Colledge, Kerig, Manning and Thomas2013; Downey et al. Reference Downey, Bocaege, Kerig, Edinborough and Shennan2014; Timpson et al. Reference Timpson, Colledge, Crema, Edinborough, Kerig, Manning, Thomas and Shennan2014; Crema et al. Reference Crema, Habu, Kobayashi and Madella2016; Downey et al. Reference Downey, Haas and Shennan2016; Bevan et al. Reference Bevan, Colledge, Fuller, Fyfe, Shennan and Stevens2017; Barton et al. Reference Barton, Aura Tortosa, Garcia-Puchol, Riel-Salvatore, Gauthier, Vadillo Conesa and Pothier Bouchard2018; Riris Reference Riris2018; Roberts et al. Reference Roberts, Woodbridge, Bevan, Palmisano, Shennan and Asouti2018; Timpson et al. Reference Timpson, Barberena, Thomas, Méndez and Manning2020; Crema and Shoda Reference Crema and Shoda2021). Some of these studies have also been focused on understanding the population dispersals and change during the Neolithic of Southeastern Europe, although mostly on the early Neolithic of the region (Porčić and Nikolić Reference Porčićić and Nikolić2016; Blagojević et al. Reference Blagojević, Porčić, Penezić and Stefanović2017; Harper Reference Harper2019; Vrhovnik Reference Vrhovnik2019; Porčić et al. Reference Porčić, Blagojević, Pendić and Stefanović2021; Vander Linden and Silva Reference Vander Linden and Silva2021).

The current study aims at contributing to the understanding of the population geo-temporal dynamics of one of the archaeological signals of the Chalcolithic in Eastern Europe, namely KGK-VI, which reflects the maximum point of development of human communities that lived a Neolithic way of life in this part of Europe. More specifically, our research explores (1) the nature of population trajectories within the chosen region, on both sides of the Danube and (2) the extent to which KGK-VI differs north and south of Danube. The analyses used in the study allow for local and global tests of significance to be performed and regional population histories to be compared through the comparison of empirical and simulated summed probability distributions (SPDs) of radiocarbon dates (see Bevan et al. Reference Bevan, Colledge, Fuller, Fyfe, Shennan and Stevens2017; Crema and Bevan Reference Crema and Bevan2021 for details).

DATA

In the current study 440 radiocarbon dates from both Romania and Bulgaria ascribed to the KGK-VI were used (Figure 1). The data were acquired from publications, gray literature, unpublished sources, and from on-line databases (Reingruber and Thissen Reference Reingruber and Thissen2017; Weninger et al. Reference Weninger, Joris and Danzeglocke2020) (e.g., http://www.14sea.org/index.html and https://www.academia.edu/40774947/CalPal_Holocene_Palaeolithic_14C_Database). The available radiocarbon dates from the above-mentioned sources were compiled in a database (n = 257 from Romania and n = 183 from Bulgaria), grouped into 135 bins for analysis (see explanation below of binning), recovered from 59 sites (n = 32 in Romania and n = 27 in Bulgaria). The database includes contextual information such as site name, site id, site recovery context, “culture” and phase (where available), region, country, laboratory number, the uncalibrated date and uncertainty, and geographical coordinates (longitude and latitude) (Supplementary Table 1).

Figure 1 Map of radiocarbon distribution data set. Legend: 1-Akladi Cheiri; 2-Bikovo; 3-Čardako-Slatino; 4-Djakovo; 5-Dolnoslav; 6-Drama-Merdžumekja; 7-Durankulak; 8-Ezero; 9-Goljamo Delčevo; 10-Hotnica; 11-Junacite; 12-Karnobat; 13-Košarna; 14-Omurtag; 15-Orlitsa; 16-Ovčarovo; 17-Povelyanovo; 18-Smjadovo; 19-Sušina; 20-Tatul; 21-Tell Azmak; 22-Tell Karanovo; 23-Tell Russe; 24-Varhari; 25-Varna1; 26-Varna2; 27-Varna3; 28-Baia Boruz Tell; 29-Popina Blagodeasca; 30-Bordușani; 31-Carcaliu; 32-CăscioareleOstrovel; 33-Cunești; 34-Dambul lui Haralambie; 35-Gumelnița-terrasse; 36-Gumelnița-tell; 37-Hârșova; 38-Lișcoteanca-Movila Olarului; 39-Lunca; 40-Luncavița; 41-Mălăieștii de Jos; 42-Măriuța-C; 43-Măriuța-T; 44-Navodari; 45-Niculițel; 46-Orbeasca Sus; 47-Panduru; 48-Pietrele; 49-Seciu; 50-Șeinoiu; 51-SultanaGhețărie; 52-Sultana-Malu Roșu-terrasse; 53-Sultana-Malu Roșu-tell; 54-Taraschina; 55-Taraschina_2; 56-Urlați; 57-Vărăști; 58-Vitănești; 59-Vlădiceasca.

The dates selected to use in the current investigation were obtained from samples from various materials including, wood, charcoal, seeds, and bones (herbivores and humans). The dates based on shell, fish and carnivore samples have not been included to avoid potential reservoir effects issues. We also applied a cleaning protocol and excluded all dates with large uncertainties in order to remove any potential spurious dates. However, most of our dates have very low uncertainties, with a mean of 43 and a median of 37 years. Recent studies, dedicated to similar research agenda, have also shown that an overly strict data cleaning and the exclusive use of dates with very low uncertainties might potentially be just as damaging for this kind of analysis just as an uncritical data collection (Timpson et al. Reference Timpson, Colledge, Crema, Edinborough, Kerig, Manning, Thomas and Shennan2014, Reference Timpson, Manning and Shennan2015; Vander Linden and Silva Reference Vander Linden and Silva2021).

METHODS

Our analysis was carried out with the R environment for statistical analysis (R Core Team 2020), and the rcarbon R-package for date calibration and SPD modeling (Bevan et al. Reference Bevan, Crema, Bocinsky, Hinz, Riris and Silva2020; Crema and Bevan Reference Crema and Bevan2021), using the Northern Hemisphere calibration curve (IntCal20) (Reimer et al. Reference Reimer, Austin, Bard, Bayliss, Blackwell, Ramsey, Butzin, Cheng, Edwards and Friedrich2020), along with gstat (Pebesma and Graeler Reference Pebesma and Graeler2021), sp (Pebesma et al. Reference Pebesma, Bivand, Rowlingson, Gomez-Rubio, Hijmans, Sumner, MacQueen, Lemon, Lindgren and O’Brien2022), and other R packages mentioned in the rmarkdown script that accompanies the study. Dataset, supplemental figures and R Markdown scripts needed for reproducing the results of our analysis are available at: https://zenodo.org/record/7587242#.Y9hI9S8Ro4c. The methods that we used in our analysis are based on previously developed quantitative analysis of SPDs by Shennan et al. (Reference Shennan, Downey, Timpson, Edinborough, Colledge, Kerig, Manning and Thomas2013), further refined in several other recent studies (Downey et al. Reference Downey, Haas and Shennan2016; Brown and Crema Reference Brown and Crema2019; Crema et al. Reference Crema, Habu, Kobayashi and Madella2016; Riris Reference Riris2018; Timpson et al. Reference Timpson, Colledge, Crema, Edinborough, Kerig, Manning, Thomas and Shennan2014; Timpson et al. Reference Timpson, Manning and Shennan2015), the non-parametric extension devised by Crema et al. (Reference Crema, Habu, Kobayashi and Madella2016; see also Crema and Kobayashi Reference Crema and Kobayashi2020), and the spatial permutation test (Crema et al. Reference Crema, Bevan and Shennan2017; Crema and Bevan Reference Crema and Bevan2021). The reader should refer to these bibliographical resources (and references therein) for more details on the methods and the concepts behind them.

Spatial Dispersal Analysis

We first analyzed the spatial structure of the dispersal of our ¹⁴C distribution (Figure 2a,b). We considered this as being the first step in our analysis for a better understanding of the spatial dynamics of the dispersal of the KGK-VI to assess its correlation (potentially) with demography. Based on Hengl (Reference Hengl2006) equation for establishing the right pixel size for a grid, and our regional context we superimposed a 23 × 23 km grid on the area, with each grid cell covering approximately 460 km². The analysis was done in R statistical environment with the gstat (Pebesma and Graeler Reference Pebesma and Graeler2021) and sp (Pebesma et al. Reference Pebesma, Bivand, Rowlingson, Gomez-Rubio, Hijmans, Sumner, MacQueen, Lemon, Lindgren and O’Brien2022) packages. We divided the study area into grid cells and hexagon cell shapes were chosen, given their shape being the closest to a circle and the easy to use in a tessellation. Their minimal edge effects as well as the identical neighboring cells and having the same distance between centers for all the neighbors, make them particularly suitable for our analysis (see also Vrhovnik Reference Vrhovnik2019).

Figure 2 (a) Number of radiocarbon dates per grid cell. Values are log10 scaled. (b) The earliest appearance of the KGK-VI settlements in the grid cells, consisting of grid cell centroids with the date for the beginning of the KGK-VI occupation. Gridded area (white hexagons) represents the KGK-VI area with dated sites.

Thus, the study area is covered with 477 grid cells, of which only 47 grid cells are occupied with sites, forming several clusters, and a patchy distribution of samples. The number of radiocarbon dates per grid cell varies from one (seen in 11 grid cells) to a maximum of 67 (seen in one cell) with a median value of 3 dates per grid cell and third quartile at 10.5 dates per grid cell. The distribution of dates per cell is shown in Figure 2a.

For each grid cell, we then calculated a normalized summed calibrated radiocarbon probability distribution. To calculate the calendar age ranges, highest probability density was used, and these are the shortest ranges that include 95% of the probability in the summed probability density function. As such, the starting date of the KGK-VI in a particular grid cell was taken to be the lower 95% range endpoint date. These estimated starting dates are shown in Figure 2b. Consequently, these dates were to estimate the spread of the KGK-VI across the area. Grid cells with only one radiocarbon date were excluded from the interpolation.

Dates Binning, Calibration, and SPDs Production

To mitigate potential issues due to the differences in intensity of sampling, radiocarbon dates are combined in 100-year bins within each archaeological context (e.g., horizontal and vertical provenience units) so that the intensively sampled sites/areas are not overrepresented and cause artificial spikes in the observed SPDs. When multiple dates are present from a single site, they are aggregated within each archaeological context and when their distance in ¹⁴C years is less than 100 years. The procedure applies a hierarchical cluster analysis using the complete linkage method and a cut-off value of 100 years to separate the observations. Although our selection of 100 years for binning is arbitrary, we performed a bin sensitivity analysis (Supplementary Figure 1), which shows this choice has no negative impact on the accuracy of results (all bin sizes fit within the 95% confidence simulated envelope—gray area) and is also well above the median error of 37 years in the dataset (our protocol follows those already applied in the literature; for more details (Bevan et al. Reference Bevan, Colledge, Fuller, Fyfe, Shennan and Stevens2017; Riris Reference Riris2018; Crema and Bevan Reference Crema and Bevan2021).

Dates were calibrated using the Northern Hemisphere Radiocarbon Age Calibration Curve (IntCal20) (Reimer et al. Reference Reimer, Austin, Bard, Bayliss, Blackwell, Ramsey, Butzin, Cheng, Edwards and Friedrich2020) and the rcarbon package (Bevan et al. Reference Bevan, Crema, Bocinsky, Hinz, Riris and Silva2020). Multiple dates within a bin are calibrated and summed “inside” the bin and subsequently divided by the number of dates so that each archaeological context contributes a single date distribution to the overall SPD. The probability distributions of the calibrated dates were summed over the entire KGK-VI period to produce empirically based SPDs using the entire data set, as well as for subsets for the two regions north and south of the Danube. Following detailed discussions in recent works (Weninger et al. Reference Weninger, Clare, Jöris, Jung and Edinborough2015; see also details in Bevan et al. Reference Bevan, Colledge, Fuller, Fyfe, Shennan and Stevens2017), we have not normalized the post-calibration distribution of each date (that ensures it sums to 1 under the curve) before summation of multiple dates. This ensures the reduction of creation of abrupt spikes in the final summed probability distributions, there where the calibration curve is steep (Weninger et al. Reference Weninger, Clare, Jöris, Jung and Edinborough2015; Bevan et al. Reference Bevan, Colledge, Fuller, Fyfe, Shennan and Stevens2017; Crema and Bevan Reference Crema and Bevan2021).

Model Testing

In order to differentiate SPD fluctuations that represent meaningful demographic change from those due to sampling error noise, we compare the observed SPDs against null models of simulated dates derived from hypothesized calendar age distributions of dates. These null (hypothesized) models assume increasing survival of datable radiocarbon material through time according to either an exponential or a logistic population growth. Only portions of the SPD curves that fall outside the 95% confidence interval (CI) of the null models are considered sufficiently significant demographic changes to be considered in our subsequent discussion. Model testing procedures are described in detail below.

We first evaluate the goodness-of-fit of the entire data set SPD, by first fitting the calibrated data to a generalized exponential model, with the help of modelTest function (Figure 3) (Timpson et al. Reference Timpson, Colledge, Crema, Edinborough, Kerig, Manning, Thomas and Shennan2014; Crema and Bevan Reference Crema and Bevan2021) (Supplemental Material for analysis in R). An exponential model had become common practice, as it is assumed to account for population growth with unlimited resources and taphonomic processes. We assessed whether the SPDs of the ¹⁴C dates for the entire region showed statistically relevant deviations when compared against the exponential model, following the procedure described in several other studies (Timpson et al. Reference Timpson, Colledge, Crema, Edinborough, Kerig, Manning, Thomas and Shennan2014; Bevan et al. Reference Bevan, Colledge, Fuller, Fyfe, Shennan and Stevens2017; Crema and Bevan Reference Crema and Bevan2021; Crema et al. Reference Crema, Habu, Kobayashi and Madella2016; Riris Reference Riris2018; Palmisano et al. Reference Palmisano, Bevan and Shennan2017; Roberts et al. Reference Roberts, Woodbridge, Bevan, Palmisano, Shennan and Asouti2018 and references therein).

Figure 3 Results of fitting and comparing the entire regional empirical SPD against the exponential null model of population growth. Monte Carlo 95% confidence null model gray envelope is based on 5000 runs. Observed SPD is shown with solid red line, while the positive and negative deviations from the null are marked in red and blue. (Please see online version for color figures.)

Null models were simulated for the entire KGK-VI region using assumptions of the exponential and logistic growth patterns in the following way, repeated for each type of model.

1. An exponential/logistic model was generated for the entire time period and fit to the empirical SPD produced by the dates we compiled.
2. A set of dates (equivalent in number to the bins used for the empirical SPD) was generated from the model and errors assigned randomly (within the range of empirical date errors).
3. An SPD was generated from the model dates and errors.
4. Steps 2–3 were repeated 5000 times to estimate the 95% CI around the model.

The empirical SPDs are then compared with the 95% CI around each of the two models (exponential and logistic). Portions of the observed regional SPD that fall outside the envelope were considered statistically significant local deviations above and below the null model (red and blue areas respectively). Following methods outlined by Timpson et al. (Reference Timpson, Colledge, Crema, Edinborough, Kerig, Manning, Thomas and Shennan2014) a global p-value can then be calculated from the total area of the empirical SPD curve that falls outside the 95% CI of each null model.

To evaluate how well the exponential model fit to our empirical data we employed Akaike Information Criterion (AIC) to select the most parsimonious fitted model for the entire region (Sakamoto et al. Reference Sakamoto, Ishiguro and Kitagawa1986). AIC suggested that the use of different model might be a better fit for our context. As such, we decided to use the logistic growth model as the null model for comparison and discussion of results (Figure 4), following the same procedures outlined above for the exponential growth model. We also used an extension of the global p-value to evaluate the point-to-point differences along the SPD curve. This test compares the empirically observed difference to the distribution of differences in the SPD curve between two points in time against the distribution of expected values under the null hypothesis (Edinborough et al. Reference Edinborough, Porčić, Martindale, Brown, Supernant and Ames2017). We also deployed a more recent alternative to SPDs, Composite Kernel Density Estimates (CKDEs) (Brown Reference Brown2017; McLaughlin Reference McLaughlin2019), which has the advantage of minimizing calibration artificial spikes, as well as, providing estimates of sampling and calibration-derived uncertainty over time (Figure 5). Based on the qualitative inspection of the SPDs, the CKDE curve and formal AIC test, we decided to also fit a suite of four composite models (see Rmarkdown document in the Zenodo repository) to the summed calibrated probability distribution for KGK-VI, representing potential demographic models (see Goldberg et al. Reference Goldberg, Mychajliw and Hadly2016; Arroyo-Kalin and Riris Reference Arroyo-Kalin and Riris2021; de Souza and Riris Reference de Souza and Riris2021). Assessing for the goodness of fit of these models was achieved following the same procedure as outlined above.

Figure 4 Results of fitting and comparing the entire regional empirical SPD against the logistic null model of population growth. Monte Carlo 95% confidence null model gray envelope is based on 5000 runs. Observed SPD is shown with solid red line, while the positive and negative deviations from the null are marked in red and blue.

Figure 5 Bootstrapped composite kernel density estimate, suggesting a composite model with the breakpoint at approximately 4400 BC should be tested.

Permutation Tests—Regional Comparison

We are obviously interested in empirically testing the variation between the northern and southern regions of the KGK-VI (Danube River is used as a geographical divide). In order to achieve this, each region was compared to a null model assuming no spatial differences. This null model can be obtained by pooling the radiocarbon dates from across the entire region and simulating from the pooled SPD (Figure 6).

Figure 6 The KGK-VI empirical SPD record fitted to logistic (5000 BC–4400 BC) and exponential (4400 BC–5750 BC) models, with significance envelope derived from 5000 Monte Carlo simulations.

Crema et al. (Reference Crema, Habu, Kobayashi and Madella2016) developed a permutation-based test to statistically compare two or more SPDs (Figure 7). The null hypothesis is generated by simulating multiple (e.g., 5000 here) SPDs whose dates are drawn randomly from both subregions (north and south of the Danube). These simulated SPDs are again combined, as above, to produce a 95% CI envelope against which the SPD from each region can be compared (see Crema et al. Reference Crema, Habu, Kobayashi and Madella2016; Crema and Bevan Reference Crema and Bevan2021).

Figure 7 Permutation test showing variation between regional population growth. Observed SPDs for each region are shown with a solid black line, while the dashed line represents the observed pan-regional SPD. Gray areas represent the 95% confidence envelope for the null model, red and blue bands represent areas where the observed SPD significantly positively (red) and negatively (blue) deviates from the pan regional null model.

This is a robust approach to inter-regional differences in the research intensity because the comparison is based on the shape of the SPD (the relative change in summed probabilities within each region) and not on differences in their absolute magnitudes. Maintaining the observed number of bins for each region and comparing population trajectories rather than absolute differences in density, the permutation test bypasses the problem. Moreover, sample size is taken into account in the width of the 95% CI envelope. Significant negative (or positive) deviations of the SPD in one region does not necessarily imply a lower (or higher) absolute population density, but that the drop in the proxy within the dynamics of that region was significantly stronger compared to rest of the data.

Spatial Permutation Test

The spatial permutation test is an extension of the permutation test described above, having the virtue of allowing for the assessment of variation without the imposition of a priori regions of analysis. The steps involved in the spatial permutation test are described in detail by Crema et al. (Reference Crema, Bevan and Shennan2017; see also Crema and Bevan Reference Crema and Bevan2021). Below are summarized the steps involved in the spatial permutation analytical protocol.

1. Produce local SPDs for each site with dates combining the date distribution at the site with date distributions at neighboring sites weighted as a function of their distance from the site. We selected a neighborhood radius of 100 km following a sensitivity analysis of different radii (see Supplemental Figures 2–3).
2. Divide the KGK-VI temporal span of 5050–3800 BC into equal transition blocks (e.g., time slices), 250 years each, which, given the length of our time span, we consider relevant in our endeavor to detect long term regional growth patterns (Figure 8).

Figure 8 Observed rate of growth at each transition computed from the SPD. I: 5050–4800 to 4800–4550; II: 4800–4550 to 4550–4300; III: 4550–4300 to 4300–4050; IV: 4300–4050 to 4050–3800 BC.
3. Calculate the overall growth rate (change in the SPD curve) between each temporal transition block and the subsequent one.

This allowed us to evaluate spatial patterns of demographic growth and decline in two ways. We compared the growth rates calculated for each local SPD with the overall growth rate for each transition block (Figure 9). For each transition block, we also compared the growth rates of local SPDs at each site with a simulated model generated by repeatedly (10,000 iterations) randomly shuffling the local SPDs spatially across all site locations and combining the growth rates of the shuffled SPDs at each site. This allowed us to identify “hot” and “cold” spots (areas of significance), defined as areas where the local growth exceeds the growth observed in the simulation (Figure 10). Following methods discussed in Crema et al. (Reference Crema, Bevan and Shennan2017), two measures of significance are produced in the course of the spatial permutation test. p-values are measures of significance between observed local growth and simulated growth rates. However, the use of multiple testing approach, increases the potential for compounding false positive results (e.g., some local SPDs will be higher or lower than the theoretical expectation by chance alone). A more robust q-values test is therefore also computed by adjusting p-values to account for false positive discovery rate. Thus, a p-value of 0.05, implies that 5% of the tests will result in false positives, a q-value of 0.05 means that 5% of the results that have a q-values less than 0.05 are false positives (see Crema et al. Reference Crema, Bevan and Shennan2017 for further details).

Figure 9 Local geometric growth rate for each transition block. I: 5050–4800 to 4800–4550; II: 4800–4550 to 4550–4300; III: 4550–4300 to 4300–4050; IV: 4300–4050 to 4050–3800 BC; V: Geographical reference map for the local geometric growth rate analysis, shown in transitional blocks I-IV.

Figure 10 Spatial permutation test showing where growth is significantly higher or lower than the null for each transition block. I:5050–4800 to 4800–4550; II:4800–4550 to 4550–4300; III:4550–4300 to 4300–4050; IV:4300–4050 to 4050–3800 BC; V: Geographical reference map for the spatial permutation test analysis, shown in transitional blocks I-IV. Significance is shown in terms of q-values (more robust against false positives) and p-values.