Present-day populations of Hungary do not appear to derive detectable ancestry from early medieval individuals from Longobard contexts, and are instead more similar to Scythian-related ancestry sources (Extended Data Fig. 6), consistent with the later impact of Avars, Magyars and other eastern groups.In fact, present-day Hungarians are overwhelmingly derived from West Slavic and German peasants, showing only minor ancestry from early Magyars (or rather Hungarian Conquerors). So in terms of genetic ancestry they're basically typical East Central Europeans. Scythians and Avars don't even deserve a mention in this context. The reason that Speidel et al. found present-day Hungarians to be broadly similar to Scythians is because they used so called Hungarian Scythians in their analysis. It's important to understand that these Hungarian Scythians are genetically fairly typical Central Europeans for their time, and, by and large, don't show any significant genetic relationship to Asian Scythians, Avars or early Magyars. So they're mostly either just acculturated Scythians or wrongly classified as Scythians by archeologists. That is, the broad similarity that Speidel et al. found between present-day Hungarians and Hungarian Scythians derives from the fact that both of these populations are genetically Central Europeans, rather than the ridiculously false idea that they show strong genetic links to Avars, Hungarian Conquerors and other eastern groups. Here's a Principal Component Analysis (PCA) of West Eurasian genetic variation, courtesy of the excellent Vahaduo:Global25 Views, that perfectly illustrates my point. If Speidel et al. were correct about the genetic origin of present-day Hungarians, then the Hungarian_Modern and Hungary_Scythian samples would be shifted away from other Europeans, much like many of the Hungary_Avar and Hungary_Conqueror individuals. But that's obviously not the case, and instead they cluster strongly with, say, present-day Germans from Hamburg. I emailed two of the authors of this paper, Leo Speidel and Pontus Skoglund, when they posted the preprint of the paper at bioRxiv to cordially discuss this issue (see here). But they totally ignored me. Citation... Speidel et al., High-resolution genomic history of early medieval Europe, Published online: 1 January 2025, https://doi.org/10.1038/s41586-024-08275-2
search this blog
Showing posts with label Central Europe. Show all posts
Showing posts with label Central Europe. Show all posts
Monday, January 6, 2025
Leo Speidel & Pontus Skoglund
This quote, from a new paper at Nature, High-resolution genomic history of early medieval Europe by Speidel et al., is the most idiotic take on the ancestry of present-day Hungarians that I've ever read.
Monday, March 25, 2024
High-resolution stuff
I just emailed this to the authors of High-resolution genomic ancestry reveals mobility in early medieval Europe, a new preprint at bioRxiv [LINK].
I appreciate that Polish population history is not the main focus of your preprint, and also that you're constrained by the lack of relevant and suitably high quality ancient genomes from East-Central and Eastern Europe. However, I must say that your analysis of the Medieval Polish population and resulting conclusions about Polish population history don't reflect reality. Your Poland_Middle_Ages genomic cluster is made up of just six samples that don't fully represent the genetic complexity of the core population of Medieval Poland. As a result, you classified PCA0148 as one of the Poland_Middle_Ages outliers, even though this sample isn't an outlier when analyzed within the context of the full set of published Polish Medieval genomes. Moreover, PCA0148 is very similar to several Polish Viking Age samples that show Scandinavian-specific genome-wide and Y-chromosome haplotypes, and probably likewise shows some Scandinavian-related ancestry. This is important to note when attempting to recapitulate Polish population history, because it suggests that Scandinavian-related ancestry played a formative role in the shaping of the core Polish Medieval genetic cluster. Thus, you might be correct when you claim that the six samples in your Poland_Middle_Ages cluster don't show any "detectable" Scandinavian-related ancestry, but this doesn't necessarily mean that this type of ancestry isn't a key part of the post-Iron Age Polish population history. Below is a self-explanatory Principal Component Analysis (PCA) plot that illustrates my points. Interestingly, Figure 3c in your preprint shows very similar outcomes in regards to the post-Iron Age Polish population history. But the style and scale of your figure makes it difficult to spot the subtle but likely genuine Northwest European-related genetic shifts shown by PCA0148, the Viking context samples and present-day Poles relative to the Poland_Middle_Ages cluster. However, I'm also skeptical that your Poland_Middle_Ages cluster doesn't carry any detectable or even significant Scandinavian-related ancestry. That's because I suspect that there might be some technical issues with your analysis that are masking this type of ancestry in the Polish samples. Your top mixture model for the Poland_Middle_Ages cluster is, in all likelihood, an extreme statistical abstraction of reality, rather than a close reflection of it. That's because, due to a combination of historical, geographical and genetic factors, neither Italy.Imperial(I).SG nor Lithuania.IronRoman.SG are realistic formative source populations for the Medieval Polish gene pool. One of the reasons why you ended up with such a surprising result is probably the lack of suitable samples from East-Central and Eastern Europe, especially those associated with plausibly the earliest Slavic-speaking populations. It's also possible that basing your mixture model on formal statistics played a key part. Formal statistics-based mixture models are known to be biased towards outcomes involving mixture sources from the extremes of mixture clines. If your analysis is affected by this problem, then this would help to explain why you characterized the Poland_Middle_Ages cluster as simply a two-way mixture between a Middle Eastern-related group from Imperial Rome and a Baltic population with a very high cut of European hunter-gatherer ancestry. I do note that on page 6 of your manuscript you consider the possibility that the Southern European-related signal in the Poland_Middle_Ages cluster might only be very distantly related to Italy.Imperial(I).SG, and that it may even have spread across Poland with early Slavic speakers. This is a great point, and I think it should be emphasized and expanded upon, because I suspect that the problem runs deeper than this. For instance, if the early Slavic ancestors of Poles carried substantially more Southern European-related ancestry than Lithuania.IronRoman.SG, and this ancestry was, say, more Balkan-related than Italian-related, then this might radically change your modeling of the Poland_Middle_Ages cluster. That's because these early Slavs would be positioned in a very different genetic space than Lithuania.IronRoman.SG, which could potentially require a significant signal of Scandinavian-related ancestry to get a robust mixture model. Finally, it might be useful to consider Isolation-by-Distance as a partial vector for the Italy.Imperial(I).SG-related signal in Medieval Poland. The full set of published Polish Medieval genomes includes a number of outliers with obvious ancestry from Western Europe and the Balkans. These people probably don't represent any large-scale migrations into Poland, but rather the movements of individuals and small groups. Over time, such small-scale mobility may have had a fairly significant impact on the genetic character of the Polish population.Update 26/03/2024: I sent another email to Speidel et al., this time in regards to their analysis of present-day Hungarians.
Your preprint also claims that present-day Hungarians are genetically similar to Scythians, and that this is consistent with the arrival of Magyars, Avars and other eastern groups in this part of Europe. However, present-day Hungarians are overwhelmingly derived from Slavic and German peasants from near Hungary. This is not a controversial claim on my part; it's backed up by historical sources and a wide range of genetic analyses. Hungarians still show some minor ancestry from Hungarian Conquerors (early Magyars), but this signal only reliably shows up in large surveys of Y-chromosome samples. The Scythians that you used to model the ancestry of present-day Hungarians are of local, Pannonian origin, and they don't show any eastern nomad ancestry. So they're either acculturated Scythians, or, more likely, wrongly classified as Scythians by archeologists. And since these so-called Scythians lack eastern nomad ancestry, the similarity between them and present-day Hungarians is not a sign of the impact from Avars, Hungarian Conquerors and the like, but rather a lack of significant input from such groups in present-day Hungarians.Citation... Speidel et al., High-resolution genomic ancestry reveals mobility in early medieval Europe, bioRxiv, Posted March 19, 2024, doi: https://doi.org/10.1101/2024.03.15.585102 See also... Wielbark Goths were overwhelmingly of Scandinavian origin
Labels:
ancient ancestry,
ancient DNA,
Ancient Rome,
Central Europe,
Eastern Europe,
Goths,
Imperial Rome,
Medieval,
Middle Ages,
Migration Period,
Poland,
Polish,
Slavic,
Slavs,
Viking,
Viking Age,
Vikings,
Wielbark
Sunday, November 13, 2022
A reappraisal of Ashkenazic maternal ancestry
Kevin Brook, who occasionally comments on this blog, has published a peer-reviewed book titled The Maternal Genetic Lineages of Ashkenazic Jews.
The book focuses on 129 mitochondrial (mtDNA) haplogroups that are found in present-day Ashkenazic Jews, and reveals that these lineages can be traced back to a wide range of places, such as Israel, Italy, Poland, Germany, North Africa, and China.
Ergo, it argues that both Israelites and converts to Judaism from a variety of gentile groups made lasting contributions to the Ashkenazic maternal gene pool. In Kevin's own words, the book also:
- shows that all Ashkenazim remain genetically linked to a significant degree to other types of Jewish populations, not only paternally but maternally as well - disproves the myth that Cossack rapists were responsible for any of the non-Israelite DNA in Ashkenazim - presents new DNA evidence in favor of a small contribution of Khazarian and Alan converts to Judaism to the Ashkenazic gene pool.That makes good sense based on what I've learned over the years from studying modern and ancient genome-wide Ashkenazic DNA. More information about Kevin's book is available at the Khazaria.com website HERE. See also... My take on the Erfurt Jews
Tuesday, June 21, 2022
My take on the Erfurt Jews
I had a quick look at the genotype data from the recent Waldman et al. preprint focusing on the ancestry of early Jews from Erfurt, Germany. My impression is that the genetic origins of these Jews are somewhat more complex than claimed in the manuscript.
Indeed, I'd say the Waldman et al. characterization of the Erfurt Jews as a three-way mixture between populations similar to present-day Lebanese, South Italians and Russians doesn't exactly reflect reality.
Unlike Waldman et al., I designed an ADMIXTURE analysis that separated East Asian ancestry into East Asian and Siberian clusters, and also included Mediterranean and North African clusters. The output is available in a spreadsheet HERE. Below is a bar graph based on some of the output.
Now, keeping in mind that ADMIXTURE is not a formal mixture test, and that it estimates ancestry proportions from inferred populations, as opposed to ancient groups that actually existed, here are some key observations:
- in terms of fine scale ancestry, the Erfurt Jews show enough variation to be divided into three or four clusters, as opposed to just two as per Waldman et al. - some of the Erfurt Jews show excess "Mediterranean" ancestry, while others excess "North African" ancestry, and this cannot be explained with ancestral populations similar to Lebanese and/or South Italians, but rather with significant gene flow from the western Mediterranean and possibly North Africa - several of the Erfurt Jews show relatively high levels of "East Asian" ancestry that cannot be explained by admixture from Russians, or even any Russian-like populations, because such populations almost lack this type of ancestry, and instead show significant "Siberian" admixture - as far as I can see, there are no correlations between any of the observations above and the quality of the samples. That is, low coverage doesn't appear to be causing the aforementioned excess "Mediterranean", "North African" and/or "East Asian" ancestry proportions.Investigating this in more detail with, say, formal statistics will take some time. But I was able to reproduce the results from the above ADMIXTURE run using several somewhat different datasets, so that's something. It seems to me that Waldman et al. want a simple and elegant model to explain the data, which is understandable, but I do think they should at least expand their ADMIXTURE analysis to include "Siberian", "Mediterranean" and "North African" clusters, and go from there depending on what they find. Citation... Waldman et al., Genome-wide data from medieval German Jews show that the Ashkenazi founder event pre-dated the 14th century, bioRxiv, posted May 16, 2022, doi: https://doi.org/10.1101/2022.05.13.491805 See also... Mediterranean PCA update
Labels:
admixture,
ancient ancestry,
ancient DNA,
Ashkenazi,
Central Europe,
East Asia,
Eastern Europe,
Erfurt Jews,
Germany,
Jewish,
Jews,
Medieval,
Mediterranean,
Middle Ages,
North Africa,
Poland,
Russia,
Slavic,
Slavs
Sunday, January 23, 2022
Para-Turbo-Balto-Slavic?
I'm seeing increasing numbers of Bronze and Iron Age samples from Central Europe and surrounds with this peculiar set of traits:
- shared genetic drift with present-day Balto-Slavic speakers to the exclusion of most other Europeans - and yet, an unusually low level of Yamnaya-related steppe ancestry - so much so, in fact, that they're often outside the range of modern European genetic variation.As far as I can tell, currently the best examples of this unusual population are HUN_Mako_EBA_o:I1502 (Mathieson et al. Nature 2015) and HUN_EIA_Prescythian_Mezocsat_o1:I18241 (Patterson et al. Nature 2021). Both are from the Carpathian Basin in what is now Hungary. I ran a series of qpAdm mixture models to try and learn more about their origins. The most robust outcomes, out of about 50 different attempts, are these:
right pops: CMR_Shum_Laka_8000BP MAR_Taforalt IRN_Ganj_Dareh_N Levant_PPNB TUR_Barcin_N Iberia_Southeast_Meso UKR_Meso England_Meso RUS_Karelia_HG RUS_West_Siberia_HG MNG_North_N TWN_Hanben BRA_LapaDoSanto_9600BP HUN_Mako_EBA_o Baltic_LTU_Narva 0.149 ∓0.028 POL_Globular_Amphora 0.613 ∓0.028 Yamnaya_RUS_Samara 0.238 ∓0.029 chisq 10.836 tail prob 0.370463 Full output HUN_EIA_Prescythian_Mezocsat_o1 Baltic_LTU_Narva 0.186 ∓0.028 POL_Globular_Amphora 0.592 ∓0.027 Yamnaya_RUS_Samara 0.222 ∓0.029 chisq 12.492 tail prob 0.253499 Full outputCombining the two genomes produces a very similar result:
HUN_EBA-EIA_o Baltic_LTU_Narva 0.160 ∓0.023 POL_Globular_Amphora 0.612 ∓0.023 Yamnaya_RUS_Samara 0.227 ∓0.023 chisq 14.653 tail prob 0.14524 Full outputImportantly, when I move RUS_Karelia_HG from the right pops to the left pops, to test whether HUN_EBA-EIA_o really has steppe ancestry, as opposed to closely related hunter-gatherer ancestry, I still get a very similar outcome:
HUN_EBA-EIA_o Baltic_LTU_Narva 0.158 ∓0.027 POL_Globular_Amphora 0.605 ∓0.033 RUS_Karelia_HG 0.014 ∓0.038 Yamnaya_RUS_Samara 0.223 ∓0.053 chisq 10.461 tail prob 0.234171 Full outputSo these largely Globular Amphora-related individuals do harbor as much as a quarter of steppe ancestry, which is to be expected considering the massive genetic turn-over that most of Europe experienced just before their time as a result of population expansions from the Pontic-Caspian steppe. Nevertheless, this is ~20% less steppe ancestry than in the present-day populations of the region, and it clearly shows in any decent Principal Component Analysis (PCA) of West Eurasia. For instance: At the same time, the relatively close genetic relationship between these ancients and present-day Balto-Slavic speaking populations shows up in fine-scale intra-European PCA. The origins and implications of this population are still a mystery to me. I don't think it's native to the Carpathian Basin. Indeed, my qpAdm models suggest that it may have moved into this region from somewhere to the northeast, because its ancestry is best modeled with ancient groups from present-day Lithuania, Poland and Russia. I'm adamant that these people weren't Balto-Slavic speakers, and certainly not proto-Slavs. Rather, I suspect that much like the Welzin warriors of Bronze Age North-Central Europe, they were closely related to a contemporaneous group that eventually gave rise to proto-Slavs. At best, they may have somehow contributed to the ethnogenesis of Balto-Slavs. By the way, using the Global25 to model their ancestry is highly problematic, because of the strong Balto-Slavic genetic drift that affects some of the dimensions. So be careful when you try it, or better yet, don't try it at all, and stick to formal stats in this particular instance. See also... Tollense Valley Bronze Age warriors were very close relatives of modern-day Slavs
Friday, August 27, 2021
R1a vs R1b in third millennium BCE Central Europe (Papac et al. 2021)
R1a-M417 and R1b-L51 are by far the most important Y-chromosome haplogroups in Europe today. More precisely, R1a-M417 dominates in Eastern Europe, while R1b-L51 in Western Europe.
It's been obvious for a while now, at least to me, that both of these Y-haplogroups are closely associated with the men of the Late Neolithic Corded Ware culture (CWC). Indeed, in my mind they're the main genetic signals of its massive expansion, probably from a homeland somewhere north of the Black Sea in what is now Ukraine.
I'm still not exactly sure how the east/west dichotomy between R1a and R1b emerged in Europe, but, thanks to a new paper by Papac et al. at Science Advances, at least now I have a working hypothesis about that. Below is a quote from the said paper, emphasis is mine:
In addition to autosomal genetic changes through time, we observe a sharp reduction in Y-chromosomal diversity going from five different lineages in early CW to a dominant (single) lineage in late CW (Fig. 4A). We used forward simulations to explore the demographic scenarios that could account for the observed reduction in Y-chromosomal diversity. Performing 1 million simulations of a population with a starting frequency of R1a-M417(xZ645) centered around the observed starting frequency in Bohemia_CW_Early (3 of 11, 0.27), we assessed the plausibility of this lineage reaching the observed frequency in Bohemia_CW_Late (10 of 11, 0.91) in the time frame of 500 years under a model of a closed population and random mating (Materials and Methods). We reject the “neutral” hypothesis, i.e., that this change in frequency occurred by chance, given a wide range of plausible population sizes. Instead, our results suggest that R1a-M417(xZ645) was subject to a nonrandom increase in frequency, resulting in these males having 15.79% (4.12 to 44.42%) more surviving offspring per generation relative to males of other Y-haplogroups. We also find that this change in Y chromosome frequency is extreme compared to the changes in allele frequencies at fully covered autosomal 1240k sites within the same males, suggesting a process that disproportionately affected Y-chromosomal compared to autosomal genetic diversity, ruling out a population bottleneck as the likely cause. Our results suggest that the Y-lineage diversity in early CW males was supplanted by a nonrandom process [selection, social structure, or influx of nonlocal R1a-M417(xZ645) lineages] that drove the collapse in Y-chromosomal diversity. A simultaneous decline of Y-chromosomal diversity dating to the Neolithic has been observed across most extant Y-haplogroups (64), possibly due to increased conflict between male-mediated patrilines (65). We view that changes in social structure (e.g., an isolated mating network with strictly exclusive social norms) could be an alternative cause but would be difficult to distinguish in the underlying model parameters.Right, so even though the CWC was clearly a community of closely related groups, there must have been some competition between its different clans. And since these clans were highly patriarchal and patrilineal, this competition probably led to different paternal lineages dominating different parts of the CWC horizon, with M417 becoming especially common in the east and L51 in the west. Of course, the expansions of post-Corded Ware groups, such as the M417-rich Slavs in Eastern Europe and L51-rich Celts in Western Europe, were also instrumental in creating Europe's R1a/R1b dichotomy, but obviously these groups were in large part the heirs of the CWC. By the way, most of the samples from Papac et al. are already in the Global25 datasheets linked here. Look for the labels listed here. Below is a plot made from the Global25 data courtesy of regular commentator Matt. Citation: L. Papac, M. Ernée, M. Dobeš, M. Langová, A. B. Rohrlach, F. Aron, G. U. Neumann, M. A. Spyrou, N. Rohland, P. Velemínský, M. Kuna, H. Brzobohatá, B. Culleton, D. Daněček, A. Danielisová, M. Dobisíková, J. Hložek, D. J. Kennett, J. Klementová, M. Kostka, P. Krištuf, M. Kuchařík, J. K. Hlavová, P. Limburský, D. Malyková, L. Mattiello, M. Pecinovská, K. Petriščáková, E. Průchová, P. Stránská, L. Smejtek, J. Špaček, R. Šumberová, O. Švejcar, M. Trefný, M. Vávra, J. Kolář, V. Heyd, J. Krause, R. Pinhasi, D. Reich, S. Schiffels, W. Haak, Dynamic changes in genomic and social structures in third millennium BCE central Europe. Sci. Adv. 7, eabi6941 (2021). See also... On the origin of the Corded Ware people Understanding the Eneolithic steppe Conan the Barbarian probably belonged to Y-haplogroup R1a
Labels:
ancient DNA,
Balto-Slavic,
Celtic,
Central Europe,
Corded Ware,
Corded Ware Culture,
CWC,
Eastern Europe,
Germanic,
Indo-European,
Northern Europe,
Proto-Indo-European,
R1a-M417,
R1b-L151,
R1b-L51,
Slavic,
Slavs
Saturday, November 7, 2020
Slavic-like Medieval Germans
The samples labeled DEU_MA_Krakauer_Berg in the Principal Component Analysis (PCA) plot below are from a recent paper by Parker et al. at Scientific Reports. Their remains were excavated from a Medieval cemetery in the now abandoned village of Krakauer Berg in eastern Germany.
Krakauer sounds sort of like Kraków, doesn't it? That's probably not a coincidence, especially considering how these people behave in my analysis. To see an interactive version of the plot, paste the coordinates from the text file here into the relevant field here.
See also...
Yamnaya-related ancestry proportions in present-day Poles
Warriors from at least two different populations fought in the Tollense Valley battle
Viking world open analysis and discussion thread
Tuesday, September 8, 2020
Warriors from at least two different populations fought in the Tollense Valley battle
I can't get the genotype data from the Burger et al. paper. The lead authors, Joachim Burger and Daniel Wegmann, aren't replying to my emails.
But they were gracious enough to release the BAM files for each of their samples, and these files can be converted to genotype data. So I've included ten of the Tollense Valley warriors (DEU_Tollense_BA) in the Global25 datasheets (see here).
The claim in the paper that these warriors "represent an unstructured population" is absolutely false and extremely naive.
Below are a couple of Principal Component Analysis (PCA) plots produced with Vahaduo Global25 views. The samples are labeled according to their Y-chromosome haplogroups. To see interactive versions of the same plots, paste the Global25 coordinates from the text file here into the relevant fields here.
These warriors are not a single unstructured population, because they cover too much ground in the above plots for that to be possible. It's clear to me that they represent at least two different groups from Central Europe and surrounds.
Of course, this would be a lot easier to work out if Burger et al. cared to supply more information about each of the warriors, such as their attire, weapons, circumstances of death, and so on. It's a complete mystery to me why this wasn't included in the paper, and the authors are refusing to talk to me, so it's unlikely that I'll ever be able to get it from them.
In the absence of such crucial archeological and anthropological data, I don't want to speculate too much, and get overly creative, but here are a couple of possible scenarios to explain the ancient DNA results:
- this may have been a battle between two Central European armies, one rich in Y-haplogroup R1b and the other rich in Y-haplogroup I2a, as well as their allies or hired help, including warriors from Eastern Europe belonging to Y-haplogroup R1a - or perhaps it was an invasion from the east by warriors rich in Y-haplogroup R1a, and it was a success, with the local armies, rich in Y-haplogroups R1b and I2a, losing the battle and suffering most of the casualties.I'm sure that one day someone will attempt to undertake a decent multidisciplinary study of this epic battle, and we'll at least have a rough idea about what happened. Or not. Citation... Burger et al., Low Prevalence of Lactase Persistence in Bronze Age Europe Indicates Ongoing Strong Selection over the Last 3,000 Years, Current Biology, Available online 3 September 2020, https://doi.org/10.1016/j.cub.2020.08.033 See also... Genetic and linguistic structure across space and time in Northern Europe
Labels:
ancient DNA,
Balto-Slavic,
battle,
Bronze Age,
Central Europe,
dead,
Eastern Europe,
Germany,
I2a,
Northern Europe,
Poland,
R1a,
R1a-Z283,
R1b,
R1b-P312,
Tollense Valley,
victims,
warlike
Sunday, September 6, 2020
Low prevalence of lactase persistence in Bronze Age Europe (Burger et al. 2020)
Over at Current Biology at this LINK. Unfortunately, this is the long-awaited Tollense Valley battle paper. Despite the obvious presence of some very interesting genetic substructures among the Tollense Valley warriors (see here), the authors have the audacity to claim that these individuals represent a "single unstructured Central/Northern European population".
One of the warriors, labeled WEZ56, belongs to Y-haplogroup R1a and shows an exceedingly Balto-Slavic-like genome-wide genetic structure. But none of this is even mentioned in passing in the paper. Indeed, according to Burger at al., WEZ56 is best classified as belonging to R1, even though the R1a classification is quite secure based on the raw data that the authors posted online.
Be extremely wary of what you read in this paper, and anything else that these scientists have published in the past and will publish in the future. Below is the paper summary:
Lactase persistence (LP), the continued expression of lactase into adulthood, is the most strongly selected single gene trait over the last 10,000 years in multiple human populations. It has been posited that the primary allele causing LP among Eurasians, rs4988235-A [1], only rose to appreciable frequencies during the Bronze and Iron Ages [2, 3], long after humans started consuming milk from domesticated animals. This rapid rise has been attributed to an influx of people from the Pontic-Caspian steppe that began around 5,000 years ago [4, 5]. We investigate the spatiotemporal spread of LP through an analysis of 14 warriors from the Tollense Bronze Age battlefield in northern Germany (∼3,200 before present, BP), the oldest large-scale conflict site north of the Alps. Genetic data indicate that these individuals represent a single unstructured Central/Northern European population. We complemented these data with genotypes of 18 individuals from the Bronze Age site Mokrin in Serbia (∼4,100 to ∼3,700 BP) and 37 individuals from Eastern Europe and the Pontic-Caspian Steppe region, predating both Bronze Age sites (∼5,980 to ∼3,980 BP). We infer low LP in all three regions, i.e., in northern Germany and South-eastern and Eastern Europe, suggesting that the surge of rs4988235 in Central and Northern Europe was unlikely caused by Steppe expansions. We estimate a selection coefficient of 0.06 and conclude that the selection was ongoing in various parts of Europe over the last 3,000 years.Burger et al., Low Prevalence of Lactase Persistence in Bronze Age Europe Indicates Ongoing Strong Selection over the Last 3,000 Years, Current Biology, Available online 3 September 2020, https://doi.org/10.1016/j.cub.2020.08.033 See also... Warriors from at least two different populations fought in the Tollense Valley battle
Labels:
ancient DNA,
Balto-Slavic,
battle,
Bronze Age,
Central Europe,
dead,
Eastern Europe,
Germany,
I2a,
lactase persistence,
Northern Europe,
R1a,
R1a-Z283,
R1b,
R1b-P312,
Tollense Valley,
victims,
warlike
Tuesday, July 14, 2020
First taste of Early Medieval DNA from the Ural region (Csaky et al. 2020 preprint)
Over at bioRxiv at this LINK. From the preprint:
The ancient Hungarians originated from the Ural region of Russia, and migrated through the Middle-Volga region and the Eastern European steppe into the Carpathian Basin during the 9th century AD. Their Homeland was probably in the southern Trans-Ural region, where the Kushnarenkovo culture disseminated. In the Cis-Ural region Lomovatovo and Nevolino cultures are archaeologically related to ancient Hungarians. In this study we describe maternal and paternal lineages of 36 individuals from these regions and nine Hungarian Conquest period individuals from today's Hungary, as well as shallow shotgun genome data from the Trans-Uralic Uyelgi cemetery. We point out the genetic continuity between the three chronological horizons of Uyelgi cemetery, which was a burial place of a rather endogamous population. Using phylogenetic and population genetic analyses we demonstrate the genetic connection between Trans-, Cis-Ural and the Carpathian Basin on various levels. The analyses of this new Uralic dataset fill a gap of population genetic research of Eurasia, and reshape the conclusions previously drawn from 10-11th century ancient mitogenomes and Y-chromosomes from Hungary. ... Majority of Uyelgi males belonged to Y chromosome haplogroup N, and according to combined STR, SNP and Network analyses they belong to the same subclade within N-M46 (also known as N-tat and N1a1-M46 in ISOGG 14.255). N-M46 nowadays is a geographically widely distributed paternal lineage from East of Siberia to Scandinavia 33 . One of its subclades is N-Z1936 (also known as N3a4 and N1a1a1a1a2 in ISOGG 14.255), which is prominent among Uralic speaking populations, probably originated from the Ural region as well and mainly distributed from the West of Ural Mountains to Scandinavia (Finland). Seven samples of Uyelgi site most probably belong to N-Y24365 (also known as N-B545 and N1a1a1a1a2a1c2 in ISOGG 14.255) under N-Z1936, a specific subclade that can be found almost exclusively in todays’ Tatarstan, Bashkortostan and Hungary 17 (ISOGG, Yfull).Csaky et al., Early Medieval Genetic Data from Ural Region Evaluated in the Light of Archaeological Evidence of Ancient Hungarians, bioRxiv, Posted July 13, 2020, doi: https://doi.org/10.1101/2020.07.13.200154 See also... Hungarian Conquerors were rich in Y-haplogroup N On the association between Uralic expansions and Y-haplogroup N More on the association between Uralic expansions and Y-haplogroup N Ancient DNA confirms the link between Y-haplogroup N and Uralic expansions
Labels:
Carpathian Basin,
Central Europe,
Finno-Ugric,
Hungarian Conquerors,
Hungarian Plain,
Indo-European,
N-tat,
N1a1-M46,
N1a1-Y24365,
N1c,
N3a,
Proto-Uralic,
Ugric,
Ural Mountains,
Uralian,
Uralic,
Uyelgi cemetery
Tuesday, July 7, 2020
On the exotic origins of the Hungarian Arpad Dynasty (Nagy et al. 2020)
Hungarians speak a Uralic and Finno-Ugric language. However, the founders of the Medieval Hungarian state, the Arpad Dynasty, probably had Irano-Turkic paternal origins. There's a very interesting new paper on this topic at the European Journal of Human Genetics (see here). From the paper, emphasis is mine:
The phylogenetic origins of the Hungarians who occupied the Carpathian basin has been much contested [40]. Based on linguistic arguments it was proposed that they represented a predominantly Finno-Ugric speaking population while the oral and written tradition of the Árpád dynasty suggests a relationship with the Huns. Based on the genetic analysis of two members of the Árpád Dynasty, it appears that they derived from a lineage (R-Z2125) that is currently predominantly present among ethnic groups (Pashtun, Tadjik, Turkmen, Uzbek, and Bashkir) speaking Iranian or Turkic languages. However, their closest kin, the Bashkirs live in close proximity with Finno-Ugric speaking populations with the N-B539 haplogroup. A recent study shows that this haplogroup is also found in modern Hungarians [41]. Intriguingly, the most recent separation of the N-B539 derived lineages found in Hungarians and Bashkirs is estimated to have occurred ~2000 years before present [42]. This would suggest that a group of people consisting of a Turkic (R-SUR51) component and a Finno-Ugric (N-B539) component left the Volga Ural region about 2000 years ago and started a migration that eventually culminated in settlement in the Carpathian Basin.Citation... Nagy, P.L., Olasz, J., Neparáczki, E. et al. Determination of the phylogenetic origins of the Árpád Dynasty based on Y chromosome sequencing of Béla the Third. Eur J Hum Genet (2020). https://doi.org/10.1038/s41431-020-0683-z See also... Hungarian Conquerors were rich in Y-haplogroup N On the association between Uralic expansions and Y-haplogroup N More on the association between Uralic expansions and Y-haplogroup N Ancient DNA confirms the link between Y-haplogroup N and Uralic expansions
Labels:
Arpad dynasty,
Carpathian Basin,
Central Europe,
Finno-Ugric,
Hungarian Plain,
Hungary,
Indo-European,
King Bela,
N-B539,
N1c,
N3a,
Proto-Indo-European,
Proto-Uralic,
R1a-Z2125,
R1a-Z93,
R1b-M269,
Ugric,
Uralic
Friday, April 17, 2020
Corded Ware cultural and genetic complexity (Linderholm et al. 2020)
Open access at Scientific Reports at this LINK. Although very useful and broadly accurate, I'm really not sure what to make of this paper yet, especially in regards to its more nuanced inferences. I'll need to look at the genotype data at some point. Worthy of note is that most of the Corded Ware males sampled by the authors belong to Y-haplogroup R1b-M269, rather than R1a-M417, which is the dominant Y-haplogroup in previously published Corded Ware samples. From the paper:
During the Final Eneolithic the Corded Ware Complex (CWC) emerges, chiefly identified by its specific burial rites. This complex spanned most of central Europe and exhibits demographic and cultural associations to the Yamnaya culture. To study the genetic structure and kin relations in CWC communities, we sequenced the genomes of 19 individuals located in the heartland of the CWC complex region, south-eastern Poland. Whole genome sequence and strontium isotope data allowed us to investigate genetic ancestry, admixture, kinship and mobility. The analysis showed a unique pattern, not detected in other parts of Poland; maternally the individuals are linked to earlier Neolithic lineages, whereas on the paternal side a Steppe ancestry is clearly visible. We identified three cases of kinship. Of these two were between individuals buried in double graves. Interestingly, we identified kinship between a local and a non-local individual thus discovering a novel, previously unknown burial custom. ... The PCA revealed that despite geographical proximity there is a distinct genetic separation between CWC and BBC individuals from southern Poland. The genetic variation of CWC individuals from southern Poland overlaps with the majority of previously published CWC individuals from Germany while the eight published CWC individuals from the Polish lowland [10,11] more closely resemble BBC individuals (Fig. S21). This fact is not unexpected if we consider the CWC communities in Polish lowlands as representatives of north-western parts of the CWC world called as the Single-Grave culture (see supplementary information). The genetic variation of BBC individuals from south-eastern Poland overlaps with the broad variation of BBC individuals from Central Europe (Bohemia, Moravia, Germany, south-western Poland and Hungary) (Fig. S22) which corresponds well with archaeological data.Linderholm, A., Kılınç, G.M., Szczepanek, A. et al. Corded Ware cultural complexity uncovered using genomic and isotopic analysis from south-eastern Poland. Sci Rep 10, 6885 (2020). https://doi.org/10.1038/s41598-020-63138-w See also... The Battle Axe people came from the steppe Is Yamnaya overrated? Single Grave > Bell Beakers
Tuesday, January 14, 2020
Hungarian Conquerors were rich in Y-haplogroup N (Fóthi et al. 2020)
Open access at Archaeological and Anthropological Sciences at this LINK. Below is the paper abstract. Emphasis is mine:
According to historical sources, ancient Hungarians were made up of seven allied tribes and the fragmented tribes that split off from the Khazars, and they arrived from the Eastern European steppes to conquer the Carpathian Basin at the end of the ninth century AD. Differentiating between the tribes is not possible based on archaeology or history, because the Hungarian Conqueror artifacts show uniformity in attire, weaponry, and warcraft. We used Y-STR and SNP analyses on male Hungarian Conqueror remains to determine the genetic source, composition of tribes, and kin of ancient Hungarians. The 19 male individuals paternally belong to 16 independent haplotypes and 7 haplogroups (C2, G2a, I2, J1, N3a, R1a, and R1b). The presence of the N3a haplogroup is interesting because it rarely appears among modern Hungarians (unlike in other Finno-Ugric-speaking peoples) but was found in 37.5% of the Hungarian Conquerors. This suggests that a part of the ancient Hungarians was of Ugric descent and that a significant portion spoke Hungarian. We compared our results with public databases and discovered that the Hungarian Conquerors originated from three distant territories of the Eurasian steppes, where different ethnicities joined them: Lake Baikal-Altai Mountains (Huns/Turkic peoples), Western Siberia-Southern Urals (Finno-Ugric peoples), and the Black Sea-Northern Caucasus (Caucasian and Eastern European peoples). As such, the ancient Hungarians conquered their homeland as an alliance of tribes, and they were the genetic relatives of Asiatic Huns, Finno-Ugric peoples, Caucasian peoples, and Slavs from the Eastern European steppes.Fóthi, E., Gonzalez, A., Fehér, T. et al., Genetic analysis of male Hungarian Conquerors: European and Asian paternal lineages of the conquering Hungarian tribes, Archaeol Anthropol Sci (2020) 12: 31. https://doi.org/10.1007/s12520-019-00996-0 See also... On the association between Uralic expansions and Y-haplogroup N More on the association between Uralic expansions and Y-haplogroup N Ancient DNA confirms the link between Y-haplogroup N and Uralic expansions
Labels:
Carpathian Basin,
Central Europe,
Finnic,
Finno-Ugric,
Hungarian,
Hungarian Plain,
Indo-European,
N1c,
N3a,
N3a4-B539,
Northeastern Europe,
Proto-Indo-European,
Proto-Uralic,
Ugric,
Ural Mountains,
Uralic
Friday, May 24, 2019
More on the association between Uralic expansions and Y-haplogroup N
Genes don't speak languages, people do. Thus, associations between genetic markers and languages may often not be easy to discern, especially with time. This is the case when it comes to Y-chromosome haplogroup N and the Hungarian language.
I briefly discussed this problem not long ago in the context of new ancient DNA samples from medieval Hungary (see here). Today, a detailed paper on the topic by Post et al. was published at Scientific Reports (open access here). It brings together evidence from modern and ancient DNA, linguistics and archeology to argue that Hungarian was introduced into the Carpathian Basin during the Middle Ages by migrants from near the Ural Mountains rich in Y-haplogroup N3a4-B539. Below is the paper abstract, emphasis is mine:
Hungarians who live in Central Europe today are one of the westernmost Uralic speakers. Despite of the proposed Volga-Ural/West Siberian roots of the Hungarian language, the present-day Hungarian gene pool is highly similar to that of the surrounding Indo-European speaking populations. However, a limited portion of specific Y-chromosomal lineages from haplogroup N, sometimes associated with the spread of Uralic languages, link modern Hungarians with populations living close to the Ural Mountain range on the border of Europe and Asia. Here we investigate the paternal genetic connection between these spatially separated populations. We reconstruct the phylogeny of N3a4-Z1936 clade by using 33 high-coverage Y-chromosomal sequences and estimate the coalescent times of its sub-clades. We genotype close to 5000 samples from 46 Eurasian populations to show the presence of N3a4-B539 lineages among Hungarians and in the populations from Ural Mountain region, including Ob-Ugric-speakers from West Siberia who are geographically distant but linguistically closest to Hungarians. This sub-clade splits from its sister-branch N3a4-B535, frequent today among Northeast European Uralic speakers, 4000–5000 ya, which is in the time-frame of the proposed divergence of Ugric languages.Post et al., Y-chromosomal connection between Hungarians and geographically distant populations of the Ural Mountain region and West Siberia, Scientific Reports 9, Article number: 7786 (2019), DOI: https://doi.org/10.1038/s41598-019-44272-6 See also... Hungarian Conquerors were rich in Y-haplogroup N (Fóthi et al. 2020) On the association between Uralic expansions and Y-haplogroup N Ancient DNA confirms the link between Y-haplogroup N and Uralic expansions
Labels:
Carpathian Basin,
Central Europe,
Finnic,
Finno-Ugric,
Hungarian,
Hungarian Plain,
Indo-European,
N1c,
N3a,
N3a4-B539,
Northeastern Europe,
Proto-Indo-European,
Proto-Uralic,
Ugric,
Ural Mountains,
Uralic
Thursday, February 15, 2018
Modeling genetic ancestry with Davidski: step by step
There are many different ways to model your genetic ancestry but I prefer the Global25/nMonte method. This is a step by step guide to modeling ancient ancestry proportions with this simple but powerful method using my own genome.
As far as I know, the vast majority of my recent ancestors came from the northern half of Europe. This may or may not be correct, but it gives me somewhere to start, so that I can come up with a coherent model. If you don't have this sort of information, because, perhaps, you were adopted, then just look in the mirror, and work from there. Like I say, it's not imperative that you know anything whatsoever about your ancestry, because your genetic data will do the talking, but you do need a model when modeling.
In scientific literature nowadays Northern Europeans are often described as a three-way mixture between Yamnaya-related pastoralists, Anatolian-derived early farmers, and Western European Hunter-Gatherers (WHG). So let's see if this model works for me. Obviously, if it does, then it'll confirm the information that I have about my origins, but it might also reveal finer details that I'm not aware of. The datasheet that I'm using for this model is available here.
[1] distance%=6.9025 / distance=0.069025 Davidski Yamnaya_Samara 53.9 Barcin_N 30.75 Rochedane 15.35 Tepecik_Ciftlik_N 0Yep, the model does work, with a fairly reasonable distance of almost 7%. The ancestry proportions more or less match those from scientific literature and the plethora of analyses that I've featured at this blog on the topic. Please note that I've kept things very simple, using only four reference populations and individuals as proxies for four distinct streams of ancestry. But I've put my own twist on this Neolithic/Bronze Age model by including two populations from Neolithic Anatolia (Barcin_N and Tepecik_Ciftlik_N), just to see what would happen. The WHG proxy is Rochedane. Admittedly, though, my Yamnaya cut of ancestry appears somewhat bloated at over 53%, and the model's distance is a little higher than what I normally see for really strong models. So let's check if I can get a better fitting and more sensible result by adding a slightly more easterly forager proxy than Rochedane: Narva_Lithuania.
[1] distance%=5.9331 / distance=0.059331 Davidski Yamnaya_Samara 45.75 Barcin_N 31.45 Narva_Lithuania 22.8 Rochedane 0 Tepecik_Ciftlik_N 0The statistical fit does improve, and when given a choice between Rochedane and Narva_Lithuania, the algorithm picks the latter as the only source of extra forager input in my genome. What could this mean? It might mean that a large part of my ancestry derives from the Baltic region. Actually, I know for a fact that this is true. But even if I had no idea about my genealogy, this result would be a very strong hint about my genetic origins. Indeed, let's follow this trail and try to further improve the fit of the model by adding a more relevant Yamnaya-related proxy, such as early Baltic Corded Ware (CWC_Baltic_early).
[1] distance%=5.444 / distance=0.05444 Davidski CWC_Baltic_early 54.95 Barcin_N 26.7 Narva_Lithuania 18.35 Rochedane 0 Tepecik_Ciftlik_N 0 Yamnaya_Samara 0Holy shit! To be honest, I wasn't expecting this sort of resolution and accuracy, and I can't promise that everyone using the Global25/nMonte method will see such incredibly nuanced outcomes, but this isn't a fluke. It can't be, because it gels so well with everything that I know about my ancestry. Please note also that I belong to Y-chromosome haplogroup R1a-M417, which is a lineage intimately associated with the Corded Ware expansion across Northern Europe (for instance, see here). But of course, the Baltic and nearby regions haven't been isolated from migrations and invasions since the Corded Ware times. For instance, at some point, probably during the Bronze Age, Uralic-speaking groups moved west across the forest zone of Northeastern Europe and into the East Baltic and northern Scandinavia. It's generally accepted that they brought Siberian admixture with them (see here). Moreover, from the Iron Age to the Middle Ages, East Central Europe was under intense pressure from a wide range of nomadic steppe groups with complex ancestry, such as the Sarmatians, Avars, Huns, and Mongolians. Did any of these peoples leave their mark on my genome? At the risk of overfitting the model, let's explore this possibility by adding a few more reference populations.
[1] distance%=5.444 / distance=0.05444 Davidski CWC_Baltic_early 54.95 Barcin_N 26.7 Narva_Lithuania 18.35 Han 0 Mongolian 0 Nganassan 0 Rochedane 0 Sarmatian_Pokrovka 0 Tepecik_Ciftlik_N 0 Yamnaya_Samara 0Nothing changes when I add the Han Chinese, Mongolians, Nganassans (a Uralic group from Siberia), and Sarmatians to the model. But what about if I throw in the only ancient Slav in my datasheet?
[1] distance%=2.9904 / distance=0.029904 Davidski Slav_Bohemia 85.9 CWC_Baltic_early 7.7 Narva_Lithuania 6.4 Barcin_N 0 Rochedane 0 Tepecik_Ciftlik_N 0 Yamnaya_Samara 0Considering that the vast majority of my recent ancestors were Poles, thus a Slavic-speaking people from near the Baltic, this outcome makes perfect sense. And check out the new distance! But the problem now is that I'm overfitting the model by using two very similar and probably very closely related references, CWC_Baltic_early and Slav_Bohemia. And overfitting should be avoided at all costs. So it might be useful to break up this effort into two models: one focusing on the Neolithic and Bronze Age, and the other on the Iron Age and Middle Ages. I'll do that soon, but not just yet, because there are still too few Iron Age and Medieval samples available from the Baltic region and surrounds for meaningful analyses of this type. See also... Genetic ancestry online store (to be updated regularly)
Labels:
admixture,
ancestry test,
ancient DNA,
Central Europe,
Corded Ware Culture,
Eastern Europe,
Eurasia,
Eurogenes,
genetic ancestry,
genetic genealogy,
Global25,
nMonte,
PCA,
Slavic,
Yamnaya
Wednesday, December 31, 2008
Best of 2008: Corded Ware DNA from Germany
One of the biggest hits of the year for this blogger was the discovery of Y-DNA haplogroup R1a among three Corded Ware skeletons from a burial site in Eulau, eastern Germany. It's an important result, because it links one of Europe's most dominant Y-haplogroups to a major Late Neolithic archeological complex.
All three individuals were confirmed to be paternally related via their shared Y-STR haplotype. Nevertheless, the outcome appears far from a random coincidence. Consider that in Europe today R1a shows its highest frequencies in Poland and Western Russia, which are both located in former Corded Ware territory, and where the Eulau R1a haplotype appears to have its closest modern matches. Moreover, the Corded Ware culture is often classified as an Indo-European culture by archeologists and linguists, while at the same time R1a has been posited as a marker of the early Indo-Europeans by some geneticists. Needless to say, I'm expecting R1a to be a common, and perhaps dominant marker among Corded Ware samples when more of them make it to the lab.
The consensus haplotype of the three individuals (based on most complete profile) gave two exact matches in in an European population sample of 11,213 haplotypes in a set of 100 populations (as of July 2008, Release ‘‘23’’ from 2008–01-15 14:44:25): one individual from Poland (1/939 from Gdansk) and one from Russia (1/48 from Tambov).
...
The Y haplotype was predicted using the Web-based program Haplotype Predictor (9). The three individuals of grave 99 belong to haplotype R1a, with a probability of 100% based on the Y-STR profile of individual 3 (10). To confirm haplogroup status, we further amplified an 85-bp fragment covering the Y-SNP marker SRY10831.2 characteristic for R1a (11). Primer sequences are given in Table S6. Sequences and sequenced clones from independent extract of all three individuals show the specific G to A transition identifying R1a (Fig. S5).
The mitochondrial DNA (mtDNA) lineages of the Eulau skeletons belonged to haplogroups K1b (3), X2 (2), H, I, K1a2, and U5b. Most of these maternal markers aren't particularly common in Europe today, and the overall result appears decidedly unusual compared to the mtDNA frequencies of modern European populations, largely because of the low frequency of H.
I'm quite certain this is at least partly due to the small sample size and presence of several related individuals skewing some of the frequencies. However, it's interesting to note that this pattern of discontinuity between mtDNA gene pools from different time periods has also been reported in other studies, some with larger samples, and focusing on different regions of Europe. So it might well be a signal of significant shifts in mtDNA frequencies during European prehistory and early history, possibly as a result of major migrations leading to significant population replacements.
Interestingly, one of the ancient K1b lineages most closely matched a haplotype shared by two modern Shugnans from Tajikistan. Exactly how the Corded Ware individual is related to these two Central Asians isn't clear yet, but Shugni is an Indo-Iranian language, so some kind of early Indo-European relationship is possible.
Citation...
Wolfgang Haak et al, Ancient DNA, Strontium isotopes, and osteological analyses shed light on social and kinship organization of the Later Stone Age, PNAS, Published online before print November 17, 2008, doi:10.1073/pnas.0807592105
Subscribe to:
Posts (Atom)