search this blog

Showing posts with label Germanic. Show all posts
Showing posts with label Germanic. Show all posts

Friday, November 10, 2023

Wielbark Goths were overwhelmingly of Scandinavian origin


When used properly, Principal Component Analysis (PCA) is an extraordinarily powerful tool and one of the best ways to study fine-scale genetic substructures within Europe.

The PCA plot below is based on Global25 data and focuses on the genetic relationship between Wielbark Goths and Medieval Poles, including from the Viking Age, in the context of present-day European genetic variation.


I'd say that it's a wonderfully self-explanatory plot, but here are some key observations:

- the Wielbark Goths (Poland_Wielbark_IA) and Medieval Poles (Poland_Middle_Ages) are two distinct populations

- moreover, the Wielbark Goths form a relatively compact Scandinavian-related cluster and must surely represent a homogenous population overwhelmingly of Scandinavian origin

- on the other hand, the Medieval Poles form a more extensive and heterogeneous cluster that overlaps with present-day groups all the way from Central Europe to the East Baltic, and that's because they are likely to be in large part of mixed origin

- I know for a fact that at least some of these early Poles harbor recent admixture, because their burials are similar to those of Vikings and their haplotypes have been shown to be partly of Scandinavian origin (see here)

- one of the Wielbark females is an obvious genetic outlier (Poland_Wielbark_IA_outlier), and basically looks like a first generation mixture between a Goth and a Balt.

Please note that the PCA is only based on relatively high quality genomes, so as not to confuse the picture with spurious results and noise. Also, all outliers with potentially significant ancestry from outside of Central, Eastern and Northern Europe were removed from the analysis. The relevant datasheet is available here.

However, sanity checks are always important when studying complex topics like fine-scale genetic ancestry. To that end I've prepared a graph based on f3-statistics of the form f3(X,Cameroon_SMA,Estonia_BA)/(X,Cameroon_SMA,Ireland_Megalithic), that reproduces the key features of my PCA. The relevant datasheet is available here.

Polish groups from the Middle Ages are marked with the MA suffix, while the Iron Age Wielbark Goths are marked with the IA suffix.

If you're wondering why I plotted the f3-statistics that I did, take a look at this (all groups largely of Scandinavian origin are emboldened):

f3(X,Estonia_BA,Cameroon_SMA)
Poland_Legowo_MA 0.226406
Poland_Ostrow_Lednicki_MA 0.225996
Poland_Plonsk_MA 0.225017
Poland_Trzciniec_Culture 0.224215
Poland_Lad_MA 0.224142
Poland_Viking 0.223838
Poland_Niemcza_MA 0.223659
Poland_Weklice_IA 0.223549
Poland_Kowalewko_IA 0.222584
Poland_Pruszcz_Gdanski_IA 0.222324
Sweden_Viking 0.222091
Russia_Viking 0.222042
Poland_Maslomecz_IA 0.221914
Norway_Viking 0.221825
Denmark_EarlyViking 0.221257
Denmark_Viking 0.221174
England_Viking 0.220979

f3(X,Ireland_Megalithic,Cameroon_SMA)
Poland_Maslomecz_IA 0.219816
Poland_Weklice_IA 0.219501
Denmark_Viking 0.2192
Poland_Kowalewko_IA 0.219176
Poland_Ostrow_Lednicki_MA 0.218916
Norway_Viking 0.218854
Poland_Pruszcz_Gdanski_IA 0.218684
Sweden_Viking 0.218626
Denmark_EarlyViking 0.218529
England_Viking 0.218308
Russia_Viking 0.217999
Poland_Viking 0.217914
Poland_Plonsk_MA 0.217756
Poland_Lad_MA 0.217719
Poland_Legowo_MA 0.21765
Poland_Niemcza_MA 0.217001
Poland_Trzciniec_Culture 0.216551

Interestingly, the Middle Bronze Age samples associated with the Trzciniec Culture (Poland_Trzciniec_Culture) show a closer genetic relationship to Medieval Poles than to Wielbark Goths or Northwestern Europeans. This is indeed the case both in terms of genome-wide and uniparental markers, including some very specific lineages under Y-chromosome haplogroup R1a.

But that's a much more complex issue that I'll leave for another time. So please stay tuned.

See also...

Slavs have little, if any, Scytho-Sarmatian ancestry

Sunday, November 13, 2022

A reappraisal of Ashkenazic maternal ancestry


Kevin Brook, who occasionally comments on this blog, has published a peer-reviewed book titled The Maternal Genetic Lineages of Ashkenazic Jews.

The book focuses on 129 mitochondrial (mtDNA) haplogroups that are found in present-day Ashkenazic Jews, and reveals that these lineages can be traced back to a wide range of places, such as Israel, Italy, Poland, Germany, North Africa, and China.

Ergo, it argues that both Israelites and converts to Judaism from a variety of gentile groups made lasting contributions to the Ashkenazic maternal gene pool. In Kevin's own words, the book also:

- shows that all Ashkenazim remain genetically linked to a significant degree to other types of Jewish populations, not only paternally but maternally as well

- disproves the myth that Cossack rapists were responsible for any of the non-Israelite DNA in Ashkenazim

- presents new DNA evidence in favor of a small contribution of Khazarian and Alan converts to Judaism to the Ashkenazic gene pool.

That makes good sense based on what I've learned over the years from studying modern and ancient genome-wide Ashkenazic DNA. More information about Kevin's book is available at the Khazaria.com website HERE.

See also...

My take on the Erfurt Jews

Friday, August 27, 2021

R1a vs R1b in third millennium BCE Central Europe (Papac et al. 2021)


R1a-M417 and R1b-L51 are by far the most important Y-chromosome haplogroups in Europe today. More precisely, R1a-M417 dominates in Eastern Europe, while R1b-L51 in Western Europe.

It's been obvious for a while now, at least to me, that both of these Y-haplogroups are closely associated with the men of the Late Neolithic Corded Ware culture (CWC). Indeed, in my mind they're the main genetic signals of its massive expansion, probably from a homeland somewhere north of the Black Sea in what is now Ukraine.

I'm still not exactly sure how the east/west dichotomy between R1a and R1b emerged in Europe, but, thanks to a new paper by Papac et al. at Science Advances, at least now I have a working hypothesis about that. Below is a quote from the said paper, emphasis is mine:

In addition to autosomal genetic changes through time, we observe a sharp reduction in Y-chromosomal diversity going from five different lineages in early CW to a dominant (single) lineage in late CW (Fig. 4A). We used forward simulations to explore the demographic scenarios that could account for the observed reduction in Y-chromosomal diversity. Performing 1 million simulations of a population with a starting frequency of R1a-M417(xZ645) centered around the observed starting frequency in Bohemia_CW_Early (3 of 11, 0.27), we assessed the plausibility of this lineage reaching the observed frequency in Bohemia_CW_Late (10 of 11, 0.91) in the time frame of 500 years under a model of a closed population and random mating (Materials and Methods). We reject the “neutral” hypothesis, i.e., that this change in frequency occurred by chance, given a wide range of plausible population sizes. Instead, our results suggest that R1a-M417(xZ645) was subject to a nonrandom increase in frequency, resulting in these males having 15.79% (4.12 to 44.42%) more surviving offspring per generation relative to males of other Y-haplogroups. We also find that this change in Y chromosome frequency is extreme compared to the changes in allele frequencies at fully covered autosomal 1240k sites within the same males, suggesting a process that disproportionately affected Y-chromosomal compared to autosomal genetic diversity, ruling out a population bottleneck as the likely cause. Our results suggest that the Y-lineage diversity in early CW males was supplanted by a nonrandom process [selection, social structure, or influx of nonlocal R1a-M417(xZ645) lineages] that drove the collapse in Y-chromosomal diversity. A simultaneous decline of Y-chromosomal diversity dating to the Neolithic has been observed across most extant Y-haplogroups (64), possibly due to increased conflict between male-mediated patrilines (65). We view that changes in social structure (e.g., an isolated mating network with strictly exclusive social norms) could be an alternative cause but would be difficult to distinguish in the underlying model parameters.

Right, so even though the CWC was clearly a community of closely related groups, there must have been some competition between its different clans. And since these clans were highly patriarchal and patrilineal, this competition probably led to different paternal lineages dominating different parts of the CWC horizon, with M417 becoming especially common in the east and L51 in the west.

Of course, the expansions of post-Corded Ware groups, such as the M417-rich Slavs in Eastern Europe and L51-rich Celts in Western Europe, were also instrumental in creating Europe's R1a/R1b dichotomy, but obviously these groups were in large part the heirs of the CWC.

By the way, most of the samples from Papac et al. are already in the Global25 datasheets linked here. Look for the labels listed here. Below is a plot made from the Global25 data courtesy of regular commentator Matt.
Citation: L. Papac, M. Ernée, M. Dobeš, M. Langová, A. B. Rohrlach, F. Aron, G. U. Neumann, M. A. Spyrou, N. Rohland, P. Velemínský, M. Kuna, H. Brzobohatá, B. Culleton, D. Daněček, A. Danielisová, M. Dobisíková, J. Hložek, D. J. Kennett, J. Klementová, M. Kostka, P. Krištuf, M. Kuchařík, J. K. Hlavová, P. Limburský, D. Malyková, L. Mattiello, M. Pecinovská, K. Petriščáková, E. Průchová, P. Stránská, L. Smejtek, J. Špaček, R. Šumberová, O. Švejcar, M. Trefný, M. Vávra, J. Kolář, V. Heyd, J. Krause, R. Pinhasi, D. Reich, S. Schiffels, W. Haak, Dynamic changes in genomic and social structures in third millennium BCE central Europe. Sci. Adv. 7, eabi6941 (2021).

See also...

On the origin of the Corded Ware people

Understanding the Eneolithic steppe

Conan the Barbarian probably belonged to Y-haplogroup R1a

Thursday, June 17, 2021

Balto-Slavic drift


A few years ago I began using the term "Balto-Slavic genetic drift" to describe the fine-scale genetic signal that is shared by the speakers of Baltic and Slavic languages to the exclusion of Europeans without significant Balto-Slavic ancestry.

As a result, nowadays, many people online use the term "Balto-Slavic drift" when referring to this phenomenon.

The easiest way to prove that Balto-Slavic drift exists is to run a fine-scale Principal Component Analysis (PCA) of European genetic variation with a lot of Balto-Slavic samples in the mix. Indeed, my Global25 PCA analysis does a great job of illustrating the impact of Balto-Slavic drift on the population structure of Europe both in PCA plots and mixture models (for instance, see here).

It's also possible to tease out Balto-Slavic drift with formal statistics. I showed this indirectly in a recent blog post about Greek population structure (see here). In this post I'm going to demonstrate how to explicitly and formally test for Balto-Slavic drift both in ancient and present-day samples.

To do this we need to find stats that basically split Baltic and Slavic speakers from other Europeans, such as f4(Outgroup,Test;Bell_Beaker_NDL,Baltic_LVA_BA). In this f4-stat, Baltic_LVA_BA is the ancient reference population with an unusually high level of Balto-Slavic drift, while Bell_Beaker_NDL is a fairly similar population overall in terms of ancient ancestry components, but with practically zero Balto-Slavic drift.

Note that the statistics with the most significant Z scores (>3) involve populations that speak Baltic or Slavic languages, or their neighbors who plausibly harbor significant Baltic and/or Slavic ancestry. Among the ancient, mostly Scandinavian, populations (from Margaryan et al. 2020 and marked with the VK2020 prefix), significant Balto-Slavic drift only appears in the more easterly and/or later groups from the Viking Age (VA).


Unfortunately, one of the problems with this analysis is that Baltic_LVA_BA and Bell_Beaker_NDL aren't identical in terms of their ancient ancestry proportions. For one, the latter has significantly more Neolithic farmer ancestry. No wonder then, that Greeks, who are mostly of early farmer stock, don't show a significant Z score, despite probably packing a significant amount of Balto-Slavic ancestry dating to the Middle Ages.

In the near future, as more ancient samples become available, it might be possible to find better reference populations for the job and create more accurate, finer-scaled tests.

See also...

Uralian genes

That old chestnut: Northeast vs Northwest Euros

Sunday, January 17, 2021

That old chestnut: Northeast vs Northwest Euros


In the last comment thread reader Greg put forth this question:

David, when are you going to explain the genetic discrepancy between Northeastern and Northwestern Europeans? You know, the one that people believe is due to Baltic Hunter-Gatherer admixture, whereas you believe it is due to genetic drift? You ought to make a post about this issue at some point, because a lot of people are wondering what's causing the differences.

Well, Greg, this issue has been discussed to the proverbial death here and elsewhere. In fact, there were two posts and rather lengthy comment threads on the same topic at this blog just a few months ago. See here and here.

Nevertheless, it seems that a fair number of people are still befuddled, so I'm going to try to explain this one last time, as briefly as a I can using just a handful of f4-stats.

Admittedly, Northeast Europeans generally do pack higher levels of indigenous European hunter-gatherer ancestry than Northwest Europeans. This is especially true of Balts, who show more of this type of ancestry than even Scandinavians in practically every type of analysis.

The f4-stats below back this up unambiguously. Note the significantly positive (>3) Z scores, which suggest that Latvians and Lithuanians harbor more Baltic hunter-gatherer-related ancestry than Norwegians and Swedes.

Chimp Baltic_HG Norwegian Latvian 0.001301 7.114
Chimp Baltic_HG Swedish Latvian 0.001017 4.205
Chimp Baltic_HG Norwegian Lithuanian 0.001023 7.341
Chimp Baltic_HG Swedish Lithuanian 0.000763 3.408

Greg, I know what you're thinking: the naysayers are right! But wait, because there's a twist to this tale. Check out these f4-stats:

Chimp Baltic_HG Norwegian Belarusian 0.000265 1.934
Chimp Baltic_HG Swedish Belarusian 0.000152 0.7
Chimp Baltic_HG Norwegian Polish 6.4E-05 0.519
Chimp Baltic_HG Swedish Polish -0.000235 -1.074

Please note, Greg, that none of the Z scores reach significance, which means that these Northwest Europeans and Slavs are symmetrically related to Baltic_HG. They're also symmetrically related to other relevant ancient groups such as the Yamnaya steppe herders. This, of course, suggests that they harbor very similar levels of basically the same ancient genetic components.

Chimp Karelia_HG Norwegian Belarusian 0.000136 0.844
Chimp Karelia_HG Swedish Belarusian 7.9E-05 0.32
Chimp Karelia_HG Norwegian Polish -4.7E-05 -0.304
Chimp Karelia_HG Swedish Polish -0.000134 -0.54

Chimp Yamnaya_Samara Norwegian Belarusian -0.000134 -1.085
Chimp Yamnaya_Samara Swedish Belarusian -6.6E-05 -0.34
Chimp Yamnaya_Samara Norwegian Polish -0.000225 -1.995
Chimp Yamnaya_Samara Swedish Polish -0.000311 -1.574

Chimp Barcin_N Norwegian Belarusian -0.000335 -2.809
Chimp Barcin_N Swedish Belarusian -0.000284 -1.491
Chimp Barcin_N Norwegian Polish -0.000222 -2.057
Chimp Barcin_N Swedish Polish -0.000318 -1.662

Chimp Baikal_N Norwegian Belarusian 0.000186 1.3
Chimp Baikal_N Swedish Belarusian -7E-05 -0.33
Chimp Baikal_N Norwegian Polish -4.6E-05 -0.351
Chimp Baikal_N Swedish Polish -0.000477 -2.277

Interestingly, pairing up Ukrainians with English samples from Cornwall and Kent produces similar outcomes. But that's because most ancient ancestry proportions in Europe show a closer correlation with latitude than longitude.

Chimp Baltic_HG English_Cornwall Ukrainian 0.000282 2.242
Chimp Baltic_HG English_Kent Ukrainian 0.000225 1.748

Chimp Karelia_HG English_Cornwall Ukrainian 0.000323 2.175
Chimp Karelia_HG English_Kent Ukrainian 0.000239 1.634

Chimp Yamnaya_Samara English_Cornwall Ukrainian -6.6E-05 -0.569
Chimp Yamnaya_Samara English_Kent Ukrainian -0.000112 -0.977

Chimp Barcin_N English_Cornwall Ukrainian -0.000519 -4.641
Chimp Barcin_N English_Kent Ukrainian -0.000598 -5.232

Chimp Baikal_N English_Cornwall Ukrainian 0.000385 2.874
Chimp Baikal_N English_Kent Ukrainian 0.00036 2.836

Now, Greg, if at least in terms of genetic ancestry, Latvians, Lithuanians, Belarusians, Poles and Ukrainians all qualify as Northeast Europeans, then what makes them different, as a group, from Northwest Europeans? Do you believe that the key factor is admixture from Baltic hunter-gatherers? Or is it genetic drift?

Of course, considering all of the f4-stats above, logic dictates that it must be relatively recent genetic drift.

Keep in mind, however, that this only applies to Balto-Slavic speaking Northeast Europeans without significant Uralian ancestry. Overall, Uralic speakers have a more complex population history, and indeed genetic differences between them and Northwest Europeans are in large part due to somewhat different ancestry proportions and also Siberian admixture.

See also...

So who's the most (indigenous) European of us all?

Saturday, November 7, 2020

Slavic-like Medieval Germans


The samples labeled DEU_MA_Krakauer_Berg in the Principal Component Analysis (PCA) plot below are from a recent paper by Parker et al. at Scientific Reports. Their remains were excavated from a Medieval cemetery in the now abandoned village of Krakauer Berg in eastern Germany.

Krakauer sounds sort of like Kraków, doesn't it? That's probably not a coincidence, especially considering how these people behave in my analysis. To see an interactive version of the plot, paste the coordinates from the text file here into the relevant field here.

See also...

Yamnaya-related ancestry proportions in present-day Poles

Warriors from at least two different populations fought in the Tollense Valley battle

Viking world open analysis and discussion thread

Monday, July 13, 2020

Don't believe everything you read in peer reviewed papers


Case in point, here's a quote from a recent paper at the Journal of Human Genetics (emphasis is mine):

The Mordovian and Csango samples have a moderate to slight orientation toward the Central-Asian and Siberian Turkic groups. This could suggest the more significant East Eurasian or Turkic ancestry of these populations, which should be further investigated. German samples are inhomogeneous, and some of the German samples also show this tendency, which can be the result of the recent 20th century Turkish immigration into Germany [42].

Nope, these German samples don't show anything even remotely resembling recent Turkish ancestry. The authors of the paper, Ádám, V., Bánfai, Z., Maász, A. et al., should've been able to figure this out, even with the standard analyses that they ran. Failing that, the peer reviewers at the Journal of Human Genetics should've noticed that the authors were confused.

Moreover, if the authors and peer reviewers actually bothered to take a closer look at metadata for these samples, which were sourced from the Estonian Biocentre, they'd see that they're not even from Germany. In fact, they represent self-reported ethnic Germans from Russia.

My own quick and dirty analysis of these individuals suggests that many of them harbor East Slavic and/or Volga Finnic ancestries. Indeed, only some of them can pass genetically for run of the mill Germans from Germany. The Principal Component Analysis (PCA) below is self-explanatory. It was plotted with the Vahaduo Custom PCA tools freely available here. The relevant PCA datasheet can be gotten here.


That's not to say, of course, that some Germans don't have recent Turkish ancestry, because an increasing number of Germans nowadays do, nor that people with German heritage in Russia shouldn't identify as Germans, because that's entirely their choice.

This blog post isn't about what it takes to be German, and this is not something that I ever want to discuss for obvious reasons. The point I'm making here is that the authors and peer reviewers of the said paper at the Journal of Human Genetics were sloppy and half-arsed in their approach. And, sadly, this isn't an isolated case in peer reviewed scientific literature dealing with human population genetics.

I feel that the Estonian Biocentre is also partly to blame for this cock up, due to its somewhat peculiar sampling and labelling strategies. For instance, its scientists rely solely on self-reported identity to establish the ethnic origins of their samples, and they apparently never remove genetic outliers from their datasets or even try to identify them.

Unfortunately, I fear that this relaxed approach will eventually lead to basic errors and even unusual conclusions in a number of so called peer reviewed papers.

I first raised this issue with the Estonian Biocentre about five years ago, when I noticed that some of the supposedly Polish individuals in its dataset were genetically more similar to various groups from northern Russia than to Poles from Poland. These individuals also showed significant Siberian ancestry, which was very unusual indeed. Where the hell did the Estonian Biocentre find Poles who resembled people from near the Arctic Circle, you might ask? Apparently in Estonia.

OK, I can imagine that sampling ethnic Poles from Estonia may have been easier for the Estonian Biocentre than sampling Poles from Poland. And Estonian Poles certainly make for interesting and useful data points. However, as you can see in the PCA below, some of these individuals (labeled Polish_Estonia by me) aren't representative of the native Polish population, and yet the Estonian Biocentre not only lumps them with their Poles from Poland, but even labels them with the word "Poland". The relevant PCA datasheet can be gotten here.


However, based on my communications with some of the scientists at the Estonian Biocentre, including head honcho Mait Mestpalu, it seems that nothing will ever change there in regards to this issue. Who knows, perhaps some day we'll see a paper based on Estonian Biocentre data in the Journal of Human Genetics claiming that Poles originated near the Arctic Circle? I wouldn't be shocked if that actually happened.

Citation...

Ádám, V., Bánfai, Z., Maász, A. et al. Investigating the genetic characteristics of the Csangos, a traditionally Hungarian speaking ethnic group residing in Romania. J Hum Genet (2020). https://doi.org/10.1038/s10038-020-0799-6

See also...

Like three peas in a pod

Saturday, December 14, 2019

Avalon vs Valhalla revisited


Pictured below is a new version of my Celtic vs Germanic genetic map. It's based on the same Principal Component Analysis (PCA) as the original (which can be seen here), but more focused on Northwestern Europe and produced with a different program.


To see the interactive online version, navigate to Vahaduo Custom PCA and copy paste the text from here into the empty space under the PCA DATA tab. Then press the PLOT PCA button under the PCA PLOT tab. For more guidance, refer to the screen caps here and here.

To include a wider range of populations in the key, just edit the data accordingly. For instance, to break up the ancient grouping into more specific populations, delete the Ancient: prefix in all of the relevant rows. This is what you should see:


Conversely, you can leave the ancient sample set intact and instead reorder the present-day linguistic groupings into, say, geographic groupings. To achieve this just delete all of the linguistic prefixes, such as Celtic:, Germanic:, and so on. You should end up with a datasheet like this and plot like this.

Of course, you can design your own plot by using any combination of the ancient and present-day individuals and populations that I've already run in this PCA. Their coordinates are listed here. Indeed, if you're in the possession of your own Celtic vs Germanic PCA coordinates, you can add yourself to the plot. And if you're not, see here.

It's also possible to re-process PCA data via the SOURCE tab. But I don't recommend doing this with the Celtic vs Germanic data, which are derived from a fine scale analysis and don't pack much variation. On the other hand, Global25 data are ideal for such re-processing. I made the plots below from subsets of Global25 coordinates available in a zip file here. To see how, refer to the screen caps here and here.




See also...

Modeling your ancestry has never been easier

Getting the most out of the Global25

Modeling genetic ancestry with Davidski: step by step

Monday, November 25, 2019

Viking Age Iceland


I finally managed to get some of the Icelandic ancients from Ebenesersdóttir et al. 2018 into the Global25 datasheets (see here). Better late than never. Look for the"ISL_Viking_Age" prefix. Below is a screen cap of a Principal Component Analysis (PCA) with the new samples. It was done with an online Global25 PCA runner freely available here.


The individuals classified as unadmixed Gaels and Norse by Ebenesersdóttir et al. generally also look like it based on their Global25 coordinates.

The mixture models below, using all of the populations from the Global25 "modern pop averages scaled" datasheet, were run with an online tool freely available here. Note that the ADD DIST COL option is set to 1X. This is a useful feature for modeling the fine scale ancestry of samples that are derived from very similar populations.






See also...

They came, they saw, and they mixed

Commoner or elite?

Who were the people of the Nordic Bronze Age?

Monday, September 2, 2019

Commoner or elite?


I recently started looking at the correlations between Y-chromosome haplogroups and social standing in ancient Europe, and was surprised by what I learned about the five currently sampled prehistoric Scandinavians belonging to Y-haplogroup R1b. I certainly wasn't expecting to uncover these stories about a mass human sacrifice, a bog body, and an Arctic circle warrior:

- The earliest Scandinavian in the ancient DNA record belonging to R1b comes from a grave site in what is now northern Norway (VK531, Margaryan et al. 2019). This individual has a genome-wide profile similar to that of local Mesolithic hunter-gatherers, but is dated to just ~2,400 BCE. During this time, Scandinavia was dominated by a "new" population associated with the Battle-Axe culture (BAC), with high levels of ancestry from the steppes of Eastern Europe. Since VK531 wasn't buried with any BAC grave goods, and indeed with no grave goods at all, it's possible that he may have been from a remnant forager population that was displaced and ultimately forced into extinction.

- R1b-U106 is today by far the most common R1b subclade in Scandinavia, but it's not yet clear how it managed to attain this status. Was it perhaps through elite dominance? The earliest ancient individual belonging to R1b-U106 is dated to 2275-2032 calBCE and comes from a Late Neolithic, likely post-BAC burial ground in what is now Sweden (RISE98, Lilla Beddinge, grave 49, southern skeleton, Allentoft et al. 2015). However, RISE98 wasn't buried in any way that would suggest he was an individual of high social standing. In fact, he was found in a mass grave, along with two other adults and two infants, possibly representing a human sacrifice. The only artefact in the grave was a bone needle. More details are available here.

- During the Nordic Bronze Age it became customary for Scandinavian elites to be laid to rest in richly furnished barrows, while commoners were buried in flat graves with few or no offerings. Human remains recovered from a "commoner" flat grave cemetery dated to the Early Bronze Age near the present-day city of Aalborg, northern Denmark, included the skeleton of a male belonging to Y-haplogroup R1b-M269 (RISE47, grave 3, skeleton 8, Allentoft et al. 2015). Keep in mind, however, that this might have been another case of an ancient Scandinavian R1b-U106 if not for missing data. A flint dagger was found alongside one of the skeletons in this cemetery, but RISE47 wasn't accompanied by any grave goods (see here).

- One of the most amazing archeological discoveries made in Scandinavia is the Trundholm Sun Chariot. Found in a peat bog on the island of Zealand, Denmark, in 1902, it's thought to be an Indo-European religious artefact dating back to the Nordic Bronze Age; a representation of a horse pulling the sun and perhaps also the moon in a spoked wheel chariot. Another important discovery in a peat bog near Trundholm dating to the Nordic Bronze Age was the body of a man belonging to R1b-M269 (RISE276, Trundholm mose II, bog find 1940, Allentoft et al. 2015). However, chances are slim that RISE276 was a charioteer or, say, a spiritual guru who accidentally drowned in the bog. Most Danish bog bodies are thought to have belonged to sacrificial victims or executed criminals.

- Interestingly, the earliest likely Scandinavian warrior belonging to R1b, and also R1b-U106, is from an early Iron Age burial in present-day northwestern Norway (VK418, Margaryan et al. 2019). This site isn't quite as far north as the grave of the above mentioned VK531, but it's still well within the Arctic circle. Apparently, VK418 was buried with some impressive weapons, potentially of "eastern origin", including a shield, spearheads and a sword. Who knows, he may even have been an elite warrior for his time and place?

The other two main Scandinavian Y-haplogroups, I1a and R1a, haven't yet been found in prehistoric Nordic remains from such, shall we say, depressing burials. That's not to say, of course, that they won't be sooner or later. RISE175, from Allentoft et al. 2015, is currently the only individual who fits the bill as a representative of the Nordic Bronze Age elite. He was buried in a barrow grave in what is now southwest Sweden and probably belongs to Y-haplogroup I1a. That's not much to go on, but perhaps it's a sign of things to come?


See also...

Isotopes vs ancient DNA in prehistoric Scandinavia

Who were the people of the Nordic Bronze Age?

They came, they saw, and they mixed

Wednesday, July 17, 2019

Viking invasion at bioRxiv


A new preprint featuring hundreds of Viking Age genomes has appeared at bioRxiv [LINK]. Titled Population genomics of the Viking world, it looks like a solid effort overall, although I'm skeptical about its conclusions. I might elaborate on that in the comments below, but I'll have a lot more to say on the topic if and when I get to check out the ancient genomes with my own tools. Details about the new samples, including their Y-chromosome haplogroup assignments, are available here. Below is the abstract, emphasis is mine:

The Viking maritime expansion from Scandinavia (Denmark, Norway, and Sweden) marks one of the swiftest and most far-flung cultural transformations in global history. During this time (c. 750 to 1050 CE), the Vikings reached most of western Eurasia, Greenland, and North America, and left a cultural legacy that persists till today. To understand the genetic structure and influence of the Viking expansion, we sequenced the genomes of 442 ancient humans from across Europe and Greenland ranging from the Bronze Age (c. 2400 BC) to the early Modern period (c. 1600 CE), with particular emphasis on the Viking Age. We find that the period preceding the Viking Age was accompanied by foreign gene flow into Scandinavia from the south and east: spreading from Denmark and eastern Sweden to the rest of Scandinavia. Despite the close linguistic similarities of modern Scandinavian languages, we observe genetic structure within Scandinavia, suggesting that regional population differences were already present 1,000 years ago. We find evidence for a majority of Danish Viking presence in England, Swedish Viking presence in the Baltic, and Norwegian Viking presence in Ireland, Iceland, and Greenland. Additionally, we see substantial foreign European ancestry entering Scandinavia during the Viking Age. We also find that several of the members of the only archaeologically well-attested Viking expedition were close family members. By comparing Viking Scandinavian genomes with present-day Scandinavian genomes, we find that pigmentation-associated loci have undergone strong population differentiation during the last millennia. Finally, we are able to trace the allele frequency dynamics of positively selected loci with unprecedented detail, including the lactase persistence allele and various alleles associated with the immune response. We conclude that the Viking diaspora was characterized by substantial foreign engagement: distinct Viking populations influenced the genomic makeup of different regions of Europe, while Scandinavia also experienced increased contact with the rest of the continent.

Margaryan et al., Population genomics of the Viking world, bioRxiv, posted July 17, 2019, doi: https://doi.org/10.1101/703405

See also...

They came, they saw, and they mixed

Who were the people of the Nordic Bronze Age?

Asiatic East Germanics

Monday, July 15, 2019

Asiatic East Germanics


Around a third of the ancient individuals in my dataset associated with East Germanic-speaking cultures show obvious ancestry from Central and/or West Asia.

This shouldn't be too surprising, considering, for instance, the well documented contacts between East Germanic tribes and the Avars, Huns, Sarmatians and other nomadic groups that streamed into Europe from the Asian steppes during the Migration Period. It's a topic that I've raised before at this blog (see here).

But the curious thing is that very little, if any, of this ancestry has percolated down to present-day Europeans.

The easiest way to show this is with a Principal Component Analysis (PCA) based on my Global25 data. The relevant PCA datasheet can be downloaded here. Basic details about the ancient samples in the analysis are available here.

Some of the Northeastern European populations, particularly the Uralic speakers, appear to be attracted to the Hunnic cluster. However, this is mostly an artifact of pre-Migration Period east to west population expansions in the far north of Europe, probably including those of the Proto-Uralians (see here).

So how is it that, despite ruling over vast areas of Europe for hundreds of years, the East Germanics appear not to have contributed significantly to the present-day European gene pool? My theory is that, much like the Avars and Huns, they were militarily and demographically overwhelmed by the ascending groups around them, such as the Slavs, and they simply went extinct.

To wrap things up, here's a basic qpAdm mixture model designed to test for Hunnic-related ancestry in a few Eastern and Northern European populations of interest. Note the significant slice of this type of ancestry in the likely early Goths of the Chernyakhiv culture. Is it real? Feel free to share your thoughts in the comments below.

UKR_Chernyakhiv
DEU_MA 0.863±0.038
Hun_Tian_Shan 0.137±0.038
chisq 12.525
tail prob 0.325466
Full output

Swedish
Baltic_EST_IA 0.126±0.078
DEU_MA 0.849±0.073
Hun_Tian_Shan 0.025±0.020
chisq 8.338
tail prob 0.595877
Full output

Ukrainian
Baltic_EST_IA 0.121±0.064
DEU_MA 0.857±0.060
Hun_Tian_Shan 0.022±0.017
chisq 11.458
tail prob 0.322956
Full output

Estonian
Baltic_EST_IA 0.597±0.069
DEU_MA 0.373±0.064
Hun_Tian_Shan 0.030±0.017
chisq 15.739
tail prob 0.107361
Full output

See also...

Conan the Barbarian probably belonged to Y-haplogroup R1a

More on the association between Uralic expansions and Y-haplogroup N

Uralic-specific genome-wide ancestry did make a signifcant impact in the East Baltic

Saturday, June 1, 2019

They came, they saw, and they mixed


Y-chromosome haplogroup N is strongly associated with Uralic-speaking populations. That's probably because it was a salient feature of the gene pool of the earliest Uralic speakers, and it went with them as they migrated across northern Eurasia. However, some of its younger subclades appear to have spread with the speakers of Indo-European and Turkic languages.

For instance, N-Y10931 seems to be a marker of the Rurikids, a Varangian dynasty that, according to most sources, ruled the Kievan Rus in what are now Russia and Ukraine. And the Kievan Rus was a lose medieval political federation in which Slavic, Finnic (west Uralic) and Germanic languages were probably spoken. The latest on the genetic genealogy of the Rurikids was presented a couple of days ago at the Centenary of Human Population Genetics conference in Moscow, and there's an abstract of the talk available here (download the PDF and scroll down to page 84).

I'm not aware of any Rurikids among the thousands of ancients in my dataset, or even of any samples belonging to N-Y10931. But I do have the genome of someone who belongs to N-Y4339, which, as per the abstract linked to above, is proximally ancestral to N-Y10931. Not only does this person come from Viking Age Scandinavia, but he was buried in a crouched position typical of Slavic funerary customs of the time.

The individual in question is vik_84001. His genome was published recently along with a paper on the population structure of the Swedish town of Sigtuna way back when it was a Viking stronghold (see here). This is where his Y-chromosome sequence, labeled ERS2540883, is positioned on the YFull Y-chromosome phylogenetic tree. Click on the image to go to YFull.


However, the result is likely to be compromised to some extent by missing data. If so, it's possible that vik_84001 does indeed belong to N-Y10931 and ought to be sitting near or even among that cluster of Russian samples (Rurik descendants?) at the bottom of the page.

In any case, vik_84001 seems to be the closest individual in the ancient DNA record to a Rurikid. The Principal Component Analysis (PCA) below is based on my Global25 data. It features 18 other Viking Age individuals from Sigtuna alongside vik_84001 (look for the black dots). The relevant datasheet is available here. Interestingly, despite his eastern Y-haplogroup, vik_84001 is one of the few Sigtuna ancients who clusters strongly with present-day Swedes.
But here's what happens when I model his ancestry proportions with the Global25/nMonte method using a wide range of reference populations from Northern and Eastern Europe. The Swedes in this model are the same as those in the PCA.

vik_84001
Swedish,84.6
Ingrian,9.2
Russian_Tver,6.2

Belarusian,0
Estonian,0
Finnish,0
Finnish_East,0
Karelian,0
Latvian,0
Mordovian,0
Russian_Kostroma,0
Russian_Kursk,0
Russian_Orel,0
Russian_Pinega,0
Russian_Smolensk,0
Russian_Voronez,0
Ukrainian,0
Vepsian,0

[1] "distance%=2.3778"

Yep, despite his position in the PCA, vik_84001 shows a strong signal of ancestry related to the present-day populations of northwestern Russia. I'm not sure what this means exactly, but it's certainly fascinating stuff. And, by the way, I usually wouldn't use so many similar reference populations in a single Global25/nMonte model because of the problem of "overfitting", but in some cases it's OK to do so if the nMonte algorithm has enough recent genetic drift to latch onto.

See also...

More on the association between Uralic expansions and Y-haplogroup N

Fresh off the sledge

Uralic-specific genome-wide ancestry did make a signifcant impact in the East Baltic

It was always going to be this way

Conan the Barbarian probably belonged to Y-haplogroup R1a

Tuesday, May 7, 2019

The execution


Around 2,800 BCE, in what is now southern Poland, a family group of fifteen individuals associated with the Globular Amphora culture (GAC) were massacred. They were probably captured and executed, because each victim was killed with a blow to the head from the same type of weapon, possibly a stone axe, and lacked defensive wounds. The dead were mostly women and children. They were buried in a mass grave, but with great care and very likely by someone who knew them well.

This Late Neolithic mass grave is the focus of a new ancient DNA and archeological research paper at PNAS by Schroeder et al. (see here). The authors tentatively attribute the massacre to the Corded Ware culture (CWC) people, who were expanding rapidly at the time across much of Europe from their homeland on the Pontic-Caspian steppe.


The CWC people may or may not have been responsible; we'll never know for sure. The perpetrators could just as easily have been a competing GAC family group.

In any case, it's interesting to see that the GAC males belong to Y-chromosome haplogroup I2a-L801. This is today a rather uncommon subclade of I2, and almost exclusively found in Germanic-speaking populations, especially Scandinavians. To me this suggests that some Polish GAC males were incorporated into Indo-European-speaking CWC populations that ended up in Scandinavia, and their paternal lineages eventually became a part of the Proto-Germanic gene pool. Admittedly, though, that's just one of many possible scenarios.

See also...

Late PIE ground zero now obvious; location of PIE homeland still uncertain, but...

Corded Ware people =/= Proto-Uralics (Tambets et al. 2018)

Inferring the linguistic affinity of long dead and non-literate peoples: a multidisciplinary approach

Sunday, May 5, 2019

Conan the Barbarian probably belonged to Y-haplogroup R1a


A fresh batch of Iron Age genomes from across the Eurasian steppes is about to be published along with a new paper at Current Biology. The manuscript, titled Shifts in the Genetic Landscape of the Western Eurasian Steppe Associated with the Beginning and End of the Scythian Dominance, is still under review but freely available here.

Most of the male ancients, including two Cimmerians from the North Pontic steppe, in what is now Ukraine, belong to Y-chromosome haplogroup R1a. Wasn't Conan the Barbarian supposed to be a Cimmerian? From the preprint, emphasis is mine:

The Early Iron Age nomadic Scythians have been described as a confederation of tribes of different origins, based on ancient DNA evidence [1-3]. It is still unclear how much of the Scythian dominance in the Eurasian Steppe was due to movements of people and how much reflected cultural diffusion and elite dominance. We present new whole-genome sequences of 31 ancient Western and Eastern Steppe individuals including Scythians as well as samples pre- and postdating them, allowing us to set the Scythians in a temporal context (in the Western/Ponto-Caspian Steppe). We detect an increase of eastern (Altaian) affinity along with a decrease in Eastern Hunter-Gatherer (EHG) ancestry in the Early Iron Age Ponto- Caspian gene pool at the start of the Scythian dominance. On the other hand, samples of the Chernyakhiv culture postdating the Scythians in Ukraine have a significantly higher proportion of Near Eastern ancestry than other samples of this study. Our results agree with the Gothic source of the Chernyakhiv culture and support the hypothesis that the Scythian dominance did involve a demic component.

...

Out of the 31 samples of this study, 16 are male, and with sufficient Y-chromosome coverage for haplogroup assignment (Table S2). R1a (43%) and I (27%) are the two most frequent Y- chromosome hgs in present-day Ukrainians [142]. R1a is also the predominant lineage among Cimmerians, Scy_Ukr and ScySar_SU in our data, and present among Scy_Kaz as well. Thus, although acknowledging our small sample size, the individuals sampled from archaeological context associated with Scythian identity do not appear to stand out from the context of other groups living in the region before and after them. One notable difference from the present is the absence of hg N, nowadays widespread in the Volga-Uralic region and West Siberia as well as among Mongols and Altaians [165-167]; however, this result is consistent with the absence of hg N among Bronze Age and Eneolithic males from the Steppe [168]. In context of their claimed Altaian homeland it is interesting to note that one Scy_Ukr and the single Sar_Cau sample belong to the Q1c-L332 lineage which is a sub-clade of hg Q1c-L330 that today has peak frequency of 68% in Western Mongolians [169] and occurs at 17% in South Altaians [170] while being very rare (<1%) in East European populations and absent elsewhere (https://www.yfull.com/tree/Q-L330/).


Järve et al., Shifts in the Genetic Landscape of the Western Eurasian Steppe Associated with the Beginning and End of the Scythian Dominance, Current Biology (preprint), Posted: 6 Mar 2019, http://dx.doi.org/10.2139/ssrn.3346985

Update 12/07/2019: The paper has just been published and is freely available at Current Biology [LINK].

See also...

The mystery of the Sintashta people

On the association between Uralic expansions and Y-haplogroup N

Late PIE ground zero now obvious; location of PIE homeland still uncertain, but...

Sunday, April 7, 2019

On the association between Uralic expansions and Y-haplogroup N


Almost all present-day populations speaking Uralic languages show moderate to high frequencies of Y-chromosome haplogroup N. I reckon there are two likely explanations for this:

- the speakers of Proto-Uralic were rich in N because they lived in an area, probably somewhere around the Ural Mountains, where it was common, and they spread it with them as they expanded from their homeland

- Uralic languages often came to be spoken in areas of North Eurasia where N was already found at moderate to high frequencies

The major exception to this rule are Hungarians, whose language belongs to the Ugric branch of Uralic. Their frequency of N is close to zero and they don't differ much in terms of overall genetic structure from their Indo-European-speaking neighbors in East Central Europe.


This is an issue that has generated much debate over the years about the nature of Uralic expansions, who the Hungarians really were, and how the Hungarian language came to be spoken in the heart of Europe.

But I never understood what the fuss was about, because based on historical sources alone it seemed rather obvious that Hungarian was introduced into the Carpathian Basin during the Middle Ages by a relatively small number of invaders from the east, probably from somewhere around the Ural Mountains, who imposed it on local Indo-European-speaking populations.

As far as I can remember, this has always been the academic consensus, and the results from one of the first ancient DNA studies of human remains soundly corroborated it. Back in 2008, Csányi et al. reported that two out of four skeletons from elite Hungarian conqueror graves dating to the 10th century carried the Tat C allele, which meant that they belonged to Y-haplogroup N (see here).

We've since had to wait over a decade to get a more comprehensive look at the Y-chromosome haplogroups of medieval Hungarians. The most useful effort to date, a manuscript courtesy of Neparáczki et al., was posted this week at bioRxiv (see here).

The results in the preprint suggest a much more complex picture than simply a migration of an obviously Uralic-speaking population rich in Y-haplogroup N into the medieval Carpathian Basin. But they do confirm the presence of N in Hungarian conqueror elites, and, in fact, of very specific subclades of N that link them to the present-day speakers of Uralic languages from around the Ural Mountains. Here are some pertinent quotes from the prepint:

Three Conqueror samples belonged to Hg N1a1a1a1a2-Z1936, the Finno-Permic N1a branch, being most frequent among northeastern European Saami, Finns, Karelians, as well as Komis, Volga Tatars and Bashkirs of the Volga-Ural region. Nevertheless this Hg is also present with lower frequency among Karanogays, Siberian Nenets, Khantys, Mansis, Dolgans, Nganasans, and Siberian Tatars 23.

...

It is generally accepted that the Hungarian language was brought to the Carpathian Basin by the Conquerors. Uralic speaking populations are characterized by a high frequency of Y-Hg N, which have often been interpreted as a genetic signal of shared ancestry. Indeed, recently a distinct shared ancestry component of likely Siberian origin was identified at the genomic level in these populations, modern Hungarians being a puzzling exception 36. The Conqueror elite had a significant proportion of N Hgs, 7% of them carrying N1a1a1a1a4-M2118 and 10% N1a1a1a1a2-Z1936, both of which are present in Ugric speaking Khantys and Mansis 23.

...

Population genetic data rather position the Conqueror elite among Turkic groups, Bashkirs and Volga Tatars, in agreement with contemporary historical accounts which denominated the Conquerors as “Turks” 38. This does not exclude the possibility that the Hungarian language could also have been present in the obviously very heterogeneous, probably multiethnic Conqueror tribal alliance.

Indeed, a large proportion of the 44 males from elite Hun, Avar and Hungarian Conqueror burials analyzed in the study belonged to Y-haplogroups that can't be plausibly associated with the earliest Uralic speakers, but rather with those of various Indo-European languages, such as I1 and R1b-U106 (these are Germanic-specific markers), I2a-L621 and R1a-CTS1211 (obviously Slavic) and R1a-Z2124 (largely Eastern Iranian).

If most of these results aren't due to contamination, then it's likely that both the early Hungarian commoners and elites were, by and large, derived from Indo-European-speaking populations. No wonder then, that present-day Hungarians are basically indistinguishable genetically from their Indo-European-speaking neighbors and, like them, show hardly any Y-haplogroup N.

See also...

Hungarian Conquerors were rich in Y-haplogroup N (Fóthi et al. 2020)

More on the association between Uralic expansions and Y-haplogroup N

Ancient DNA confirms the link between Y-haplogroup N and Uralic expansions

Sunday, September 16, 2018

Celtic vs Germanic Europe


I have a feeling that ancient DNA from post-Bronze Age Northwestern Europe will be coming thick and fast from now on. To get the most out of such data I've designed a new Principal Component Analysis (PCA) that does a better job of separating the Celtic- and Germanic-speaking populations of Europe than my previous efforts of this sort (see here and here). Below are two different versions of the same PCA. The relevant datasheet is available here.

And here's a Discrimination Analysis (LDA) plot based on the 25 principal components. It further differentiates many of the populations along the east > west cline of genetic diversity.


The difference between the Germanic Anglo-Saxons and the Celtic and Roman Britons of what is now eastern England is obvious. The Anglo-Saxons could pass for Scandinavians, while the Celts and Romans both cluster between the Irish and French. This makes good sense, and is exactly what I was looking for. It's also interesting to see the presumably Celtic-speaking Hallstatt samples from Bylany, Czechia, clustering with the Belgians.

Update 14/12/2019: Pictured below is a new version of my Celtic vs Germanic genetic map. It's based on the same Principal Component Analysis (PCA) as the original, but more focused on Northwestern Europe and produced with a different program.


To see the interactive online version, navigate to Vahaduo Custom PCA and copy paste the text from here into the empty space under the PCA DATA tab. Then press the PLOT PCA button under the PCA PLOT tab. For more guidance, refer to the screen caps here and here.

To include a wider range of populations in the key, just edit the data accordingly. For instance, to break up the ancient grouping into more specific populations, delete the Ancient: prefix in all of the relevant rows. This is what you should see:


Conversely, you can leave the ancient sample set intact and instead reorder the present-day linguistic groupings into, say, geographic groupings. To achieve this just delete all of the linguistic prefixes, such as Celtic:, Germanic:, and so on. You should end up with a datasheet like this and plot like this.

Of course, you can design your own plot by using any combination of the ancient and present-day individuals and populations that I've already run in this PCA. Their coordinates are listed here. Indeed, if you're in the possession of your own Celtic vs Germanic PCA coordinates, you can add yourself to the plot. And if you're not, see here.

It's also possible to re-process PCA data via the SOURCE tab. But I don't recommend doing this with the Celtic vs Germanic data, which are derived from a fine scale analysis and don't pack much variation. On the other hand, Global25 data are ideal for such re-processing. I made the plots below from subsets of Global25 coordinates available in a zip file here. To see how, refer to the screen caps here and here.




See also...

Modeling your ancestry has never been easier

Getting the most out of the Global25

Modeling genetic ancestry with Davidski: step by step