Eurogenes Blog: Fennoscandia


Tuesday, September 29, 2020

Viking world open analysis and discussion thread

Global25 and Celtic vs Germanic coordinates for most of the samples from the recent Margaryan et al. Viking paper are now available HERE and HERE, respectively. Look for the VK2020 prefix.

Feel free to put them through their paces and let me know what you find. Below are a couple of examples of what can be done with these coordinates using Vahaduo Global25 Views.

Monday, December 9, 2019

The BOO people: earliest Uralic speakers in the ancient DNA record?

N-L1026 is the Y-chromosome haplogroup most closely associated with the speakers of Uralic languages. Thus far, the oldest published instances of N-L1026 are in two Siberian-like samples dating to 1523±87 calBCE from the site of Bolshoy Oleni Ostrov (BOO), located within the Arctic Circle in the Kola Peninsula, northern Russia.

So does this mean that the BOO people were Uralic speakers? I'm now thinking that it probably does, even though, as the scientists who published the BOO samples a year ago pointed out, they predate most estimates of the spread of extant Uralic languages into the Kola Peninsula (see Lamnidis et al. here).

Hundreds of ancient human samples from across Eurasia have been sequenced since last year. In fact, thousands if we count unpublished data. But only a handful of them belong to N-L1026.

Indeed, as far as I know, the next oldest instance of N-L1026 from Europe after those at BOO is still in an Iron Age sample from what is now Estonia published earlier this year as 0LS10. Of course, this individual was in all likelihood an early west Uralic (Finnic) speaker (see Saag et al. here).

Moreover, consider these comments by Murashkin et al. in regards to the BOO site (referred to as KOG in their paper, available here):

Most of the bodies had been buried in wooden, boat-shaped, lidded caskets, which looked like small boats or traditional Sámi sledges (Ru. kerezhka).

...

The morphological characteristics of the skull series of the KOG are not like those of any other ancient or modern series from the Kola Peninsula, including the Sámi people. Instead, the series shows closer biological affinities with ancient Altai Neolithic and modern, Ugric-speaking Siberian groups (Moiseyev & Khartanovich 2012). It has earlier been suggested that modern Ugric-speaking Siberians, together with Samoyeds and Volga Finnic populations, share some common morphological characteristics that indicate their common origin (Alekseyev 1974; Bunak 1956; Gokhman 1992).

...

Based on the materials from the grave field, we can argue that there were direct or indirect contacts between the inhabitants of the Kola Peninsula and southern and western Scandinavia (Murashkin & Tarasov 2013).

Thus, the BOO people may have spoken an early west Uralic language related to Sami languages. It's also possible that they are in part ancestral to the N-L1026-rich Sami people.

Another intriguing thing about these mysterious ancients is that individual BOO003 belongs to the rare mitochondrial haplogroup T2d1b1. Now, this clearly is not a lineage native to Europe or indeed any part of North Eurasia. Its ultimate source is probably West or Central Asia. So how did this pioneer polar explorer end up with such an unusual and exotic mtDNA marker, and might the answer be an important clue about the origins of the BOO people?

The most plausible explanation is that the ancestors of BOO003 were associated with the Seima-Turbino phenomenon, which stretched from the taiga zone to the oases of what is now western China along the Ob-Irtysh river system, and probably facilitated cultural, linguistic and genetic exchanges between the populations of North Eurasia and Central Asia.

In other words, considering all of the clues, it would seem that the BOO people came from some part of the Ob-Irtysh basin, which might thus be the best place to look for the population with the oldest and phylogenetically most basal N-L1026 lineages. And if we find that, then we've probably found the proto-Uralians and their homeland.

Below is a Principal Component Analysis (PCA) based on Global25 data featuring the earliest likely Uralic speakers in the ancient DNA record. It was produced with an online PCA runner freely available here. EST_IA includes the above-mentioned 0LS10, while FIN_Levanluhta_IA is largely made up of Saami-related samples from western Finland. See anything interesting? Feel free to let me know about it in the comments below.

Sunday, December 1, 2019

Big deal of 2019: ancient DNA confirms the link between Y-haplogroup N and Uralic expansions

The academic consensus is that Indo-European languages first spread into the Baltic region from the Eastern European steppes along with the Corded Ware culture (CWC) and its people during the Late Neolithic, well before the expansion of Uralic speakers into Fennoscandia and surrounds, probably from somewhere around the Ural Mountains.

On the other hand, the views that the Uralic language family is native to Northern Europe and/or closely associated with the CWC are fringe theories usually espoused by people not familiar with the topic or, unfortunately it has to be said, mentally unstable trolls.

The likely close relationship between the CWC expansion and the early spread of Indo-European languages was discussed in several papers in recent years (for instance, see here). This year, we saw the first ancient DNA paper focusing on the transition from the Bronze Age to the Iron Age in the East Baltic, including the likely first arrival of Uralic speech in what is now Estonia.

Published in Current Biology courtesy of Saag et al., the paper showed that the genetic structure of present-day East Baltic populations largely formed in the Iron Age (see here). It was during this time, the authors revealed, that the region experienced a sudden influx of Y-chromosome haplogroup N, which is today common in many Uralic speaking populations and often referred to as a Proto-Uralic marker. Little wonder then that Saag et al. linked this genetic shift in the East Baltic to the westward migrations of early Uralic speakers.

The table below, based on data from the Saag et al. paper, surely doesn't leave much to the imagination about what happened.

Unfortunately, I have to say that the genome-wide analysis in the paper was less informative than it could have been. The authors focused their attention on rather broad genetic components, and, as a result, missed an interesting fine scale distinction between their Bronze Age and Iron Age samples. The spatial maps below, based on my Global25 data for most of the ancients from Saag et al., show what I mean. The hotter the color the higher the genetic similarity between them and present-day West Eurasian populations.

Note that the Bronze Age (Baltic_EST_BA) samples are most similar to the Baltic-speaking, and thus also Indo-European-speaking, Latvians and Lithuanians, rather than the Uralic-speaking Estonians, even though they're from burial sites in Estonia. On the other hand, the Iron Age (Baltic_EST_IA) samples show strong similarity to a wider range of populations, including Estonians and many other Uralic-speaking groups.

Wednesday, July 17, 2019

Viking invasion at bioRxiv

A new preprint featuring hundreds of Viking Age genomes has appeared at bioRxiv. Titled Population genomics of the Viking world, it looks like a solid effort overall, although I'm skeptical about its conclusions. I might elaborate on that in the comments below, but I'll have a lot more to say on the topic if and when I get to check out the ancient genomes with my own tools. Details about the new samples, including their Y-chromosome haplogroup assignments, are available here. Below is the abstract, emphasis is mine:

The Viking maritime expansion from Scandinavia (Denmark, Norway, and Sweden) marks one of the swiftest and most far-flung cultural transformations in global history. During this time (c. 750 to 1050 CE), the Vikings reached most of western Eurasia, Greenland, and North America, and left a cultural legacy that persists till today. To understand the genetic structure and influence of the Viking expansion, we sequenced the genomes of 442 ancient humans from across Europe and Greenland ranging from the Bronze Age (c. 2400 BC) to the early Modern period (c. 1600 CE), with particular emphasis on the Viking Age. We find that the period preceding the Viking Age was accompanied by foreign gene flow into Scandinavia from the south and east: spreading from Denmark and eastern Sweden to the rest of Scandinavia. Despite the close linguistic similarities of modern Scandinavian languages, we observe genetic structure within Scandinavia, suggesting that regional population differences were already present 1,000 years ago. We find evidence for a majority of Danish Viking presence in England, Swedish Viking presence in the Baltic, and Norwegian Viking presence in Ireland, Iceland, and Greenland. Additionally, we see substantial foreign European ancestry entering Scandinavia during the Viking Age. We also find that several of the members of the only archaeologically well-attested Viking expedition were close family members. By comparing Viking Scandinavian genomes with present-day Scandinavian genomes, we find that pigmentation-associated loci have undergone strong population differentiation during the last millennia. Finally, we are able to trace the allele frequency dynamics of positively selected loci with unprecedented detail, including the lactase persistence allele and various alleles associated with the immune response. 
We conclude that the Viking diaspora was characterized by substantial foreign engagement: distinct Viking populations influenced the genomic makeup of different regions of Europe, while Scandinavia also experienced increased contact with the rest of the continent.

Margaryan et al., Population genomics of the Viking world, bioRxiv, posted July 17, 2019, doi: https://doi.org/10.1101/703405

See also...

They came, they saw, and they mixed

Who were the people of the Nordic Bronze Age?

Asiatic East Germanics

Thursday, May 9, 2019

It was always going to be this way

The native peoples of the East Baltic - Estonians, Latvians and Lithuanians - are genetically alike and their paternal gene pools are dominated by the same two Y-chromosome haplogroups: R1a and N3a.

Linguistically, however, Estonians are a world apart from Latvians and Lithuanians. That's because the Estonian language belongs to the Uralic language family, which has an obvious North Eurasian character. On the other hand, Latvian and Lithuanian are both classified as Indo-European languages, along with the vast majority of other European languages.

The Uralic and Indo-European language families may or may not descend from the same ancestral tongue, but even if they do, their relationship is very distant.

So how is it that Estonians came to speak a Uralic language? As far back as I can remember, the basic explanation accepted by most people was that Uralic speech arrived in what is now Estonia and neighboring Finland during the Bronze Age with migrants, or perhaps invaders, rich in N3a from somewhere around the Ural Mountains. Conversely, Latvians and Lithuanians were generally assumed to have retained the Indo-European speech of their R1a-rich forefathers from the Pontic-Caspian steppe, who colonized much of Eastern Europe north of the steppe during the Late Neolithic.

Ancient DNA has now uncannily corroborated these theories (for instance, see Mittnik et al. 2018 and, published today, Saag et al. 2019). All it took was a handful of samples from a few relevant sites. I think that's awesome; I love it when sensible, long-standing hypotheses are validated by cutting edge science.

I'll have a lot more to say about the spread of Uralic languages and Uralian genes to the East Baltic when I get my hands on the genotype data from the new Saag et al. paper. I also have a post coming soon about the Nordic Bronze Age. Stay tuned.

Update 10/05/2019: Uralic-specific genome-wide ancestry did make a significant impact in the East Baltic

See also...

Late PIE ground zero now obvious; location of PIE homeland still uncertain, but...

Corded Ware people =/= Proto-Uralics (Tambets et al. 2018)

Inferring the linguistic affinity of long dead and non-literate peoples: a multidisciplinary approach

Monday, December 3, 2018

On the trail of the Proto-Uralic speakers (work in progress)

Historical linguists have long posited that Fennoscandia was a busy contact zone between early Germanic and Uralic languages. The first ancient DNA samples from what is now Finland have corroborated their inferences, by showing that during the Iron Age the western part of the country was inhabited by a genetically heterogeneous population closely related to both the Uralic-speaking Saami and Germanic-speaking southern Scandinavians.

The samples were sequenced and analyzed by two different teams of researchers, and their findings were published recently in Lamnidis et al. and Sikora et al. (see here and here, respectively).

This is how most of these ancients, whose remains were excavated from the Levanluhta burial site dated to 300–800 CE, behave in a Principal Component Analysis (PCA) based on my Global25 data. Levanluhta_IA are the Saami-related samples, while Levanluhta_IA_o is a Scandinavian-like outlier. Baltic_IA is an Iron Age individual from what is now Lithuania from the recent Damgaard et al. paper (see here). Note the accuracy of the Global25 data in pinpointing their genetic affinities and also the trajectory of the Levanluhta_IA cluster, which seems to be "pulling" towards Levanluhta_IA_o.

The Saami and Levanluhta_IA are clear outliers from the main Northern European cluster. There are two reasons for this: excess East Asian/Siberian-related ancestry and Saami-specific genetic drift. However, this eastern admixture and genetic drift are shared in varying degrees by other North European populations, especially those that also speak Uralic languages, and this is why they appear to be "pulling" towards the Saami/Levanluhta_IA clusters in my PCA. Thus, what this suggests is that the expansion of Uralic languages across Northeastern Europe was intimately linked with the spread of Siberian-related ancestry into the region.

This idea has been around for a long time and is now becoming even more widely accepted (see here). However, Lamnidis et al. also featured samples from a likely pre-Uralic (1523±87 calBCE) burial site at Bolshoy Oleni Ostrov in the Kola Peninsula, present-day northern Russia, and, perhaps surprisingly, found that they showed even more Siberian-related ancestry than Levanluhta_IA. So what's going on?

I'm confident that this discrepancy can be explained by multiple waves of migrations from the east into Northeastern Europe, possibly before, during and after the time of the people buried at Bolshoy Oleni Ostrov, by pre-Uralic, para-Uralic and/or Proto-Uralic-speaking populations.

Consider the following qpAdm output, in which Levanluhta_IA is just barely modeled successfully as a two-way mixture between Levanluhta_IA_o and Bolshoy_Oleni_Ostrov. The statistical fit improves significantly with the addition of Glazkovo_EBA as a third mixture source. This is an ancient population from near Lake Baikal dated to 4597-3726 BC from the aforementioned Damgaard et al. paper.

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.468±0.036
Levanluhta_IA_o 0.532±0.036
chisq 19.129
tail prob 0.0854706
Full output

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.241±0.092
Glazkovo_EBA 0.162±0.059
Levanluhta_IA_o 0.597±0.046
chisq 7.756
tail prob 0.734966
Full output
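For readers who want to check how the "tail prob" lines relate to the "chisq" lines, here is a minimal sketch. It treats the tail probability as the upper tail of a chi-square distribution. The degrees of freedom are not shown in the output quoted above, so the values used here (12 for the two-way model, 11 for the three-way model) are assumptions, chosen because they give results close to the reported tail probabilities. The last line compares the two nested models by taking the drop in chisq on 1 degree of freedom. That is one conventional way to judge whether an extra source improves the fit, and not necessarily the reasoning used in the post.

```python
import math


def chi2_tail(chisq: float, df: int) -> float:
    """Upper-tail probability P(X >= chisq) for a chi-square variable with df degrees of freedom."""
    if df < 1:
        raise ValueError("df must be a positive integer")
    x = chisq / 2.0
    # Closed-form starting values of the regularized upper incomplete gamma Q(a, x)
    if df % 2:
        a, q = 0.5, math.erfc(math.sqrt(x))  # odd df: Q(1/2, x)
    else:
        a, q = 1.0, math.exp(-x)             # even df: Q(1, x)
    # Recurrence: Q(a + 1, x) = Q(a, x) + x**a * exp(-x) / Gamma(a + 1)
    while a < df / 2.0:
        q += x ** a * math.exp(-x) / math.gamma(a + 1.0)
        a += 1.0
    return q


# chisq values copied from the two runs above.
# ASSUMPTION: df = 12 and df = 11 are inferred, not taken from the post.
two_way = chi2_tail(19.129, 12)
three_way = chi2_tail(7.756, 11)

# Nested-model comparison: chisq difference tested on 1 degree of freedom,
# since the three-way model adds exactly one source.
improvement = chi2_tail(19.129 - 7.756, 1)

print(f"two-way tail prob:   {two_way:.6f}")
print(f"three-way tail prob: {three_way:.6f}")
print(f"p-value for the drop in chisq: {improvement:.6f}")
```

Under these assumed degrees of freedom, the two-way model sits just above the usual 0.05 mark, and the drop in chisq from adding Glazkovo_EBA is far too large to put down to chance.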

For the sake of completeness, I also tested whether Levanluhta_IA_o could be substituted by other similar ancient samples from the neighborhood, including those associated with the Battle-Axe and Corded Ware cultures. There's not much to report; qpAdm returned poor statistical fits and/or implausible ancestry proportions (for the full output from my runs, see here). Baltic_IA did produce a statistically sound model, but with excess Glazkovo_EBA-related ancestry. I also had to drop Bolshoy_Oleni_Ostrov from the analysis to make things work, which suggests to me that the result shouldn't be taken too literally.

Levanluhta_IA
Baltic_IA 0.677±0.034
Glazkovo_EBA 0.323±0.034
chisq 8.547
tail prob 0.741095
Full output

So as far as I can see, the western ancestry in Levanluhta_IA is likely to be mostly of Germanic origin, and thus Indo-European, meaning that it's logical to look east, perhaps far to the east, for the source of its Uralic ancestry. This might seem like a complicated and uncertain task, considering that Levanluhta_IA could well be at least a thousand years younger than the first entry of Uralic speakers into Fennoscandia. However, take a look at what happens when I substitute Glazkovo_EBA with a variety of Uralic-speaking populations from around the Ural Mountains, which is where the Proto-Uralic homeland is generally considered to have been located.

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.210±0.091
Khanty 0.283±0.090
Levanluhta_IA_o 0.507±0.035
chisq 7.007
tail prob 0.798532
Full output

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.193±0.098
Levanluhta_IA_o 0.495±0.035
Mansi 0.312±0.100
chisq 7.884
tail prob 0.7237
Full output

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.300±0.065
Levanluhta_IA_o 0.337±0.072
Mari 0.363±0.121
chisq 8.393
tail prob 0.677705
Full output

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.238±0.084
Levanluhta_IA_o 0.553±0.036
Nenets 0.209±0.067
chisq 7.210
tail prob 0.78181
Full output

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.302±0.069
Levanluhta_IA_o 0.324±0.081
Udmurt 0.373±0.135
chisq 9.195
tail prob 0.60393
Full output

All of these models look great, and easily rival the best model with Glazkovo_EBA. Moreover, they make good sense in terms of linguistics. The only problem is that they're anachronistic, because the Uralic-speaking reference populations are younger than Levanluhta_IA. So I can't be certain that they reflect reality without corroboration from ancient DNA. It might turn out, for instance, that a Glazkovo_EBA-like population was already present somewhere deep in Europe before or during the time of Bolshoy_Oleni_Ostrov, while no such population existed around the Ural Mountains until the time of Levanluhta_IA.
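To line up the five models above side by side, a small sketch like the following can help. The coefficients, standard errors and tail probabilities are copied from the output above. The 0.05 pass mark and the coefficient-to-standard-error ratio are assumptions added here as rough reading aids, and they are not thresholds named in the post.

```python
# Mixture coefficients (coefficient, standard error) and reported tail
# probabilities, copied from the five qpAdm runs for Levanluhta_IA above.
MODELS = {
    "Khanty": ({"Bolshoy_Oleni_Ostrov": (0.210, 0.091),
                "Khanty": (0.283, 0.090),
                "Levanluhta_IA_o": (0.507, 0.035)}, 0.798532),
    "Mansi": ({"Bolshoy_Oleni_Ostrov": (0.193, 0.098),
               "Levanluhta_IA_o": (0.495, 0.035),
               "Mansi": (0.312, 0.100)}, 0.7237),
    "Mari": ({"Bolshoy_Oleni_Ostrov": (0.300, 0.065),
              "Levanluhta_IA_o": (0.337, 0.072),
              "Mari": (0.363, 0.121)}, 0.677705),
    "Nenets": ({"Bolshoy_Oleni_Ostrov": (0.238, 0.084),
                "Levanluhta_IA_o": (0.553, 0.036),
                "Nenets": (0.209, 0.067)}, 0.78181),
    "Udmurt": ({"Bolshoy_Oleni_Ostrov": (0.302, 0.069),
                "Levanluhta_IA_o": (0.324, 0.081),
                "Udmurt": (0.373, 0.135)}, 0.60393),
}

# ASSUMPTION: 0.05 is a conventional pass mark for the tail probability;
# the post itself does not state a threshold.
TAIL_THRESHOLD = 0.05


def summarize(sources, tail_prob):
    """Return the coefficient total, the least firmly supported source, and a pass flag."""
    total = sum(coef for coef, _ in sources.values())
    # coefficient / standard error: a rough guide to how firmly a source is supported
    weakest = min(sources, key=lambda s: sources[s][0] / sources[s][1])
    ratio = sources[weakest][0] / sources[weakest][1]
    return {"total": total, "weakest": weakest, "ratio": ratio,
            "passes": tail_prob > TAIL_THRESHOLD}


summaries = {name: summarize(src, p) for name, (src, p) in MODELS.items()}
ranked = sorted(MODELS, key=lambda name: MODELS[name][1], reverse=True)

for name in ranked:
    s = summaries[name]
    print(f"{name:7s} tail={MODELS[name][1]:.3f} sum={s['total']:.3f} "
          f"weakest={s['weakest']} ({s['ratio']:.2f} SE)")
```

Sorting by tail probability is only a convenience. The five fits are all comfortably above the assumed pass mark, and the gaps between them are too small to single out one reference population over the others.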

By the way, it might be important to note that the present-day Finnish samples in my dataset can't be modeled as a mixture between Levanluhta_IA and Levanluhta_IA_o. But they can be modeled as a mixture between Baltic_IA and Levanluhta_IA. I don't know which part of Finland they're from exactly; probably all over the place, so it'd be useful to test regional Finnish populations to see how they behave in such models. Of course, Finns aren't Saamic speakers, they're Finnic speakers, and they're probably the result of a more recent Uralic expansion into Fennoscandia than the one that gave rise to the Saami.

Finnish
Baltic_IA 0.671±0.076
Levanluhta_IA 0.329±0.076
chisq 14.114
tail prob 0.293508
Full output

Damgaard et al. didn't report the Y-haplogroup for Baltic_IA, but the word round the campfire is that this individual belonged to N1c, which is today the most common Y-haplogroup among Uralic speakers. Obviously, we need a lot more ancient DNA to sort all of this out, but things are already looking pretty much as expected. Stay tuned for new posts in this series following the publication of more ancient DNA relevant to this fascinating topic.

See also...

How did Y-haplogroup N1c get to Bolshoy Oleni Ostrov?

The Uralic cline in the Global25

Late PIE ground zero now obvious; location of PIE homeland still uncertain, but...
