search this blog

Monday, December 3, 2018

On the trail of the Proto-Uralic speakers (work in progress)


Historical linguists have long posited that Fennoscandia was a busy contact zone between early Germanic and Uralic languages. The first ancient DNA samples from what is now Finland have corroborated their inferences, by showing that during the Iron Age the western part of the country was inhabited by a genetically heterogeneous population closely related to both the Uralic-speaking Saami and Germanic-speaking southern Scandinavians.

The samples were sequenced and analyzed by two different teams of researches, and their findings published recently in Lamnidis et al. and Sikora et al. (see here and here, respectively).

This is how most of these ancients, whose remains were excavated from the Levanluhta burial site dated to 300–800 CE, behave in a Principal Component Analysis (PCA) based on my Global25 data. Levanluhta_IA are the Saami-related samples, while Levanluhta_IA_o is an Scandinavian-like outlier. Baltic_IA is an Iron Age individual from what is now Lithuania from the recent Damgaard et al. paper (see here). Note the accuracy of the Global25 data in pinpointing their genetic affinities and also the trajectory of the Levanluhta_IA cluster, which seems to be "pulling" towards Levanluhta_IA_o.



The Saami and Levanluhta_IA are clear outliers from the main Northern European cluster. There are two reasons for this: excess East Asian/Siberian-related ancestry and Saami-specific genetic drift. However, this eastern admixture and genetic drift are shared in varying degrees by other North European populations, especially those that also speak Uralic languages, and this is why they appear to be "pulling" towards the Saami/Levanluhta_IA clusters in my PCA. Thus, what this suggests is that the expansion of Uralic languages across Northeastern Europe was intimately linked with the spread of Siberian-related ancestry into the region.

This idea has been around for a long time and is now becoming even more widely accepted (see here). However, Lamnidis et al. also featured samples from a likely pre-Uralic (1523±87 calBCE) burial site at Bolshoy Oleni Ostrov in the Kola Peninsula, present-day northern Russia, and, perhaps surprisingly, found that they showed even more Siberian-related ancestry than Levanluhta_IA. So what's going on?

I'm confident that this discrepancy can be explained by multiple waves of migrations from the east into Northeastern Europe, possibly before, during and after the time of the people buried at Bolshoy Oleni Ostrov, by pre-Uralic, para-Uralic and/or Proto-Uralic-speaking populations.

Consider the following qpAdm output, in which Levanluhta_IA is just barely modeled successfully as a two-way mixture between Levanluhta_IA_o and Bolshoy_Oleni_Ostrov. The statistical fit improves significantly with the addition of Glazkovo_EBA as a third mixture source. This is an ancient population from near Lake Baikal dated to 4597-3726 BC from the aforementioned Damgaard et al. paper.

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.468±0.036
Levanluhta_IA_o 0.532±0.036
chisq 19.129
tail prob 0.0854706
Full output

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.241±0.092
Glazkovo_EBA 0.162±0.059
Levanluhta_IA_o 0.597±0.046
chisq 7.756
tail prob 0.734966
Full output

For the sake of being complete, I also tested whether Levanluhta_IA_o could be substituted by other similar ancient samples from the neighborhood, including those associated with the Battle-Axe and Corded Ware cultures. There's not much to report; qpAdm returned poor statistical fits and/or implausible ancestry proportions (for the full output from my runs, see here). Baltic_IA did produce a statistically sound model, but with excess Glazkovo_EBA-related ancestry. I also had to drop Bolshoy_Oleni_Ostrov from the analysis to make things work, which suggests to me that the result shouldn't be taken too literally.

Levanluhta_IA
Baltic_IA 0.677±0.034
Glazkovo_EBA 0.323±0.034
chisq 8.547
tail prob 0.741095
Full output

So as far as I can see, the western ancestry in Levanluhta_IA is likely to be mostly of Germanic origin, and thus Indo-European, meaning that it's logical to look east, perhaps far to the east, for the source of its Uralic ancestry. This might seem like a complicated and uncertain task, considering that Levanluhta_IA could well be at least a thousand years younger than the first entry of Uralic speakers into Fennoscandia. However, take a look what happens when I substitute Glazkovo_EBA with a variety of Uralic-speaking populations from around the Ural Mountains, which is where the Proto-Uralic homeland is generally considered to have been located.

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.210±0.091
Khanty 0.283±0.090
Levanluhta_IA_o 0.507±0.035
chisq 7.007
tail prob 0.798532
Full output

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.193±0.098
Levanluhta_IA_o 0.495±0.035
Mansi 0.312±0.100
chisq 7.884
tail prob 0.7237
Full output

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.300±0.065
Levanluhta_IA_o 0.337±0.072
Mari 0.363±0.121
chisq 8.393
tail prob 0.677705
Full output

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.238±0.084
Levanluhta_IA_o 0.553±0.036
Nenets 0.209±0.067
chisq 7.210
tail prob 0.78181
Full output

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.302±0.069
Levanluhta_IA_o 0.324±0.081
Udmurt 0.373±0.135
chisq 9.195
tail prob 0.60393
Full output

All of these models look great, and easily rival the best model with Glazkovo_EBA. Moreover, they make good sense in terms of linguistics. The only problem is that they're anachronistic, because the Uralic-speaking reference populations are younger than Levanluhta_IA. So I can't be certain that they reflect reality without corroboration from ancient DNA. It might turn out, for instance, that a Glazkovo_EBA-like population was already present somewhere deep in Europe before or during the time of Bolshoy_Oleni_Ostrov, while no such population existed around the Ural Mountains until the time of Levanluhta_IA.

By the way, it might be important to note that the present-day Finnish samples in my dataset can't be modeled as a mixture between Levanluhta_IA and Levanluhta_IA_o. But they can be modeled as a mixture between Baltic_IA and Levanluhta_IA. I don't know which part of Finland they're from exactly; probably all over the place, so it'd be useful to test regional Finnish populations to see how they behave in such models. Of course, Finns aren't Saamic speakers, they're Finnic speakers, and they're probably the result of a more recent Uralic expansion into Fennoscandia than the one that gave rise to the Saami.

Finnish
Baltic_IA 0.671±0.076
Levanluhta_IA 0.329±0.076
chisq 14.114
tail prob 0.293508
Full output

Damgaard et al. didn't report the Y-haplogroup for Baltic_IA, but the word round the campfire is that this individual belonged to N1c, which is today the most common Y-haplogroup among Uralic speakers. Obviously, we need a lot more ancient DNA to sort all of this out, but things are already looking pretty much as expected. Stay tuned for new posts in this series following the publication of more ancient DNA relevant to this fascinating topic.

See also...

How did Y-haplogroup N1c get to Bolshoy Oleni Ostrov?

The Uralic cline in the Global25

Late PIE ground zero now obvious; location of PIE homeland still uncertain, but...

132 comments:

Gabriel said...

Kinda off topic but how likely is it and if so in which proportion that Germanic groups are a mixture of Corded Ware and Bell Beaker groups, and where would I1 fit into this?

Unknown said...

Pangermanists want to tie themselves to all cultures and people possible. I fail to see how can CWC be ancestral to both Balto-Slavs, Indo-Iranics(both groups are related by genes and language) and Germanics, which are totally unrelated to both.

Davidski said...

@twój stary

The Germanic R1a-Z284 is part of the R1a-Z282 clade, same as Balto-Slavic R1a-Z280 and Slavic R1a-M458. Indo-Iranian R1a-Z93 is a parallel clade to R1a-Z282.

Also...

Shared maternal ancestry between Slavs and Germanics probably dates to the Metal Ages

Mouthful said...

Baltic_IA sample is dated to 2nd-5th centuries, so I'd hold your horses on him being Finnic. From what I can tell the line between Balts and Estonians is very murky what seems to separates them is few percent of "Siberian" admixture, and the Tarand Graves leaks judging from PCA don't seem very different from the current Baltic populations.

My guess since I'm no expert in this that NW Russia from around St.Petersburg south towards Moscow wasn't very different, knowing that there were Iron age Finnic and Baltic cultures there,who imo largely formed on CWC substrate. Hence the shared the high degree of similarity. But anyway that's just my guess.

Davidski said...

@Mouthful

Baltic_IA sample is dated to 2nd-5th centuries, so I'd hold your horses on him being Finnic.

I have no idea what language he spoke, especially since he was dug up in Lithuania. He may have spoken some early dialect of Balto-Slavic.

But that's not the point. The point is that he's a good proxy for the non-Saami input in Finns, so he's likely to be similar to Finnics or Proto-Finnics at some level.

In fact, I think you'll see that he will turn out to be very similar to the N1c-rich samples from the Estonian Tarand graves.

These samples, as well as Baltic_IA, differ fundamentally from Corded Ware in that they show excess Northeast European forager ancestry, as well as probably some Siberian admixture. I don't know what this means exactly, but for now I'm guessing that the Proto-Uralics who came to Northeast Europe passed on their language and paternal ancestry to locals rich in local forager ancestry, and that's how the Proto-Finnics came to be.

Mouthful said...

> Corded Ware in that they show excess Northeast European forager ancestry, as well as probably some Siberian admixture.

Yes, what I meant to say that still the highest share of overall ancestry is still from CWC like source, minus the later input of HG's, farmers and the Siberian ancestry. I mean how different are Baltic_BA samples like Turlojiske who are all R1a compared from Baltic_IA who's N1c? Is there anything that sets them apart?

Davidski said...

@Mouthful

I mean how different are Baltic_BA samples like Turlojiske who are all R1a compared from Baltic_IA who's N1c? Is there anything that sets them apart?

Compared to Corded Ware, Baltic_BA shows more local forager ancestry and a specific signal of Baltic genetic drift.

Baltic_IA is like a more extreme version of Baltic_BA, but, as far as I can see, also with some extra affinity to Finnics.

Joey Beno said...

There's also an Iron Age genome from Valkeakoski Finland coming out soon. According to an interview the researcher gave the Finnish newspaper Helsingin Sanomat, he belonged to N1c, more specifically the Savonian-Karelian CTS1737 subclade. Genome wide it was said that he was closest to Baltic HGs and not at all the Sami, which I guess means he lacked much of the Siberian related ancestry found in Finland today. The article is behind a paywall sadly.

Davidski said...

It's not that surprising, because during the Bronze and Iron Ages the early Uralic speakers would've been moving west across the forests of Northeastern Europe, and at that time there would've still been some largely unadmixed forager populations living there.

Anthony Hanken said...

Levanluhta_IA being modeled by BOO and Levanluhta_IA_o alone would suggest that BOO is not some sort of Pre-Uralic but is instead (Pre)Proto-Saami who spread with Lovozero Ware (1900-700BC) and came into contact with costal Germanic speakers. No need for a second Saamic migration.

Like mentioned on the other post it seems like BOO is a mix of two populations with the BOO006 outlier.

Also on Anthrogenica Shaikorth showed Mordovians take Hun_Tian_Shan over Bolshoy_Oleni_Ostrov which is not the case for East Finns.

Kristiina said...

An amateur on Anthrogenica posted an admixture run with samples from Oleni Ostrov, Čalmn-Varre, Leväluhta, Botai, Latvia HG, several Latvian and Estonian ancient samples and some modern samples. In his run Leväluhta samples form several different clusters.

In this admixture run, only two Leväluhta samples are close to a modern Saami sample and Čalmn-Varre samples that were Saami speakers with 99% certainty. The outlier and two other samples align with BA Kivutkalns samples and BA Turlojiske samples. Leväluhta JK1968 sample with a mtDNA typical of Uralic populations around the Urals (U5a1a1a'b) comes out as Kunda Donkalnis and Narva Kretuonas samples with excess hg ancestry.

The yDNA N bearing IA sample from Lithuania aligns with BA Kivutkalns samples and BA Turlojiske samples although they form their different cluster compared to previously mentioned BA Kivutkalns samples and BA Turlojiske samples. It is worth mentioning that this ‘blue’ component appears also in the modern Saami sample, Čalmn-Varre samples as well as in two Leväluhta samples. This component could form part of the Western Uralic ancestry.

https://anthrogenica.com/showthread.php?15941-First-ancient-DNA-from-mainland-Finland-reveals-origins-of-Siberian-ancestry-in-regio/page11

MOCKBA said...

Finns have migrated from the Baltic region to the South and gradually spread North and East from the coast, displacing the Sami-related peoples and displaying a classic serial founder effect with hardly any Sami admixture. It has major implications for clinical genetics and thus it is a rather thoroughly studied process

Gabriel said...

@twój stary

That’s why I asked if they are a mixture, duh. Also I’m not a pan-Germanist, whatever that means (Germanocentrist?) though to me it’s clear as day that Germanic people’s have CWC ancestry. Now the question to me is do Germanic people have Bell Beaker ancestry, and how much? I presume they do and that it is most of their ancestry anyway...

Shaikorth said...

Two-way fits of Levanluhta_IA_o and Bolshoy for the ancient Saamis work badly, but doesn't look like Levanluhta_IA or Chalmny Varre need extra ENA beyond what's explained by Bolshoy in general (and EHG), as these fits work well:

Levanluhta_IA

Bolshoy 0.607
EHG 0.072
LBK 0.321
p-value 0.698406

ChalmnyVarre

Bolshoy 0.492
WHG 0.114
Yamnaya 0.159
LBK 0.235
p-value 0.814186

The problem is, non-Bolshoy part of Saami is hard to model with just Northwest European-like ancestry which is what the Levanluhta outlier represents. G25 will demonstrate this well (Baltic_IA works but is suboptimal because pre-Saamis were never in the Baltics). A better solution will be available soonish, likely next year.

Synome said...

Those models with modern day Volga Uralic peoples look great.

Now we need ancient forest Volga and Kama samples to corroborate.

Samuel Andrews said...

Iberian Neolithic drift in G25 PCA is not recent. It formed in ancient Anatolia before farmers arrived in Iberia.

One early Neolithic Iberian farmer is set apart from all other Euro farmers. He has this drift as well as much less IranNeo & Natufian-related ancestry.

Iberian farmers have ancestry from people like him, people like normal European farmers (which includes IranNeo & Natufian-like ancestry), and Iberian hunter gatherer admix.

2.0394"

Iberia_ChL

Iberia_N:I0410,60
Balkans_N:I2533,17.4
WHG:Rochedane,16
WHG:I1875,3.5
ElMiron,2.6
Iron_Gates_HG,0.5
Tepecik_Ciftlik_N,0
Krepost_N:I0679_d,0
Romania_HG,0

Samuel Andrews said...

IMO, a unique stream of Anatolian ancestry is why Iberian farmers have unique mtDNA. mHG H1, H3, K1a4a, K1b1a1 probably originated in these yet unsampled Anatolians.

Kristiina said...

@ Shaikorth

When I look at the above-mentioned admixture run posted on Anthrogenica, it catches my attention that modern Saamis or Finns cannot be modelled as a mixture of Bolshoy and Leväluhta outlier. Both Finns and Saamis carry
EHG Sidelkino and
WHG Latvia HG/Kretuonas Narva_Lithuania
which do not appear in Leväluhta samples or in Leväluhta outlier.

The percentage of Leväluhta outlier-like ancestry in Finns is c. 10% and in Saamis a little bit less than that.

Chalmny-Varre sample is in fact surprisingly different from modern Saamis.

Davidski said...

@Kristiina

Some of those clusters in that analysis are largely defined by similarly low counts of markers and/or high levels of ancient DNA damage.

Shaikorth said...

@Kristiina
As Davidski said, low SNP count is messing with that run. One of the Kivutkalns samples is 100% in the Sidelkino cluster there and that would have been noticed long ago in other tests.

Kristiina said...

Yes, but in spite of this, the western Uralic samples probably turn out to have excess EHG and Latvia HG compared to Leväluhta outlier and Bolshoy. We will see soon.

Gabriel said...

How much R1 do Saami have and what subclades do they show? Curious because that can give us a clue about their non-Siberian ancestry and plus I strongly doubt they have any significant Indo-Iranian ancestry like some people have tried to suggest.

Davidski said...

@Them meee

Apart from N1c, Saami mostly carry the Germanic I1, R1a-Z284 and R1b-M269. They probably have some Baltic R1a-Z280 as well, because Finns do.

But I don't know of any cases of R1a-Z93 in Saami or Finns. So they don't share any paternal ancestry with Indo-Iranians or even the Corded Ware people who became Indo-Iranians, except obviously what was mediated via their Germanic R1a-Z284 lines.

Gabriel said...

@Davidski

The fact they have significant I1, R1b and R1a-Z284 to me is evidence that populations like Levanluhta_IA_o and not just Baltic_IA were significantly ancestral to Saami, so Germanic ancestry is the only thing that can explain at least a huge part of IE ancestry in Saami.

That and the linguistic contacts between Germanic and Uralic why I find it hard to believe Mezhovskaya or even Baltic_IA were the most or even only significant contributors of IE ancestry in Saami or that Germanic fits must be wrong, unless a non-Germanic IE population had all of these particular haplogroups, the probability of which is 0.

Davidski said...

@Them meee

The western ancestry in Levanluhta_IA is largely of Germanic origin. No doubt about that.

Present-day Saami are similar, but they might have more Baltic_IA type of ancestry because of recent contacts with Finns.

In any case, the people claiming that Saami are in any significant way of Sintashta, Mezhovskaya and/or Indo-Iranian origin are talking out of their behinds, but this actually happens a lot in these sorts of discussions. Some people just lose most of their IQ points all of a sudden and talk crap.

André de Vasconcelos said...

@Sam

What do you know of mtDNA H20 in Iberia?

Kristiina said...

YDNA is not so important from the autosomal point of view. YDNA changes easily, is subject to drift and elimination. In his article ”An essay on Saami ethnolinguistic prehistory”, Ante Aikio has identified a Palaeo-Laplandic substrate in the Saami language that is missing in Finnish. We know that the Saami burial in Kola Peninsula Chalmny-Varre yielded a male with yDNA I2a1 and mtDNA V7a1 and a female with mtDNA U5b1b1a3. V and U5b1b1a3 cover 80-95% of the mtDNA of modern saamis. IMO, the paleo-Laplandic substrate is probably related to I2a2, and this means that there is also a Palaeo-Scandinavian genetic substrate in Saamis that is different from Bolshoy. No mtDNA V or U5b was found in Bolshoy. V and U5b haplotypes are particularly frequent in Saamis and Finns and also found in some Uralic populations.

Davidski said...

@Kristiina

Considering the ancient autosomal DNA, formal statistics models, Saami Y-haplogroup data and pre-ancient DNA conclusions based on linguistics data, your comment sounds like special pleading.

Kristiina said...

There is an infinite amount of possible modellings. To get a more balanced view, you should take into account M. Myllylä's post: http://terheninenmaa.blogspot.com/2018/12/finnish-and-north-russian-ancestries.html#comment-form

Davidski said...

@Kristiina

There is an infinite amount of possible modellings.

That's a ridiculous statement.

Samuel Andrews said...

If U5b1b1a is found in Uralic(s) in Russia & Siberia that would be important.

@Andre,

I haven't looked at mtDNA in a while. But I do know, H20a1a is unique to Iberia. The oldest of H20a1a example is from Neolithic Spain and dates to 4100 BC. H20a1a is defined by the 16362c mutation.

H20a is non-existent in Europe (outside of Iberia). It exists in every part of SW Asia (except Iran) from the Caucasus to Egypt.

Kristiina said...

@ Samuel

To my knowledge, U5b1b1a (16144C) is found in smaller amounts also in Mordvins, Maris, Komis, Bashkirs, Chuvashes, Lipka Tatars, Yakuts and Buryats. The oldest U5b1b1a to date is from Narva in Lithuania, Kretuonas4, c. 4400 BC. It has not been found in any presumed IE samples.

Gabriel said...


@Samuel Andrews

Interesting. Are there any other haplogroups that are or seem to be unique to Iberia that you know of?

@Kristiina

There is no reason to think Saami are significantly remnants of an ancient European hunter-gatherer population. The EEF/CHG ancestry in them is just too high to justify that idea.

Samuel Andrews said...

@Kritiina,

The Narva sample is U5b1b1*. The authors misreported it as U5b1b1a which is annoying. When the mtDNA was published it showed no 16144c mutation. There's also a U5b1b1* in Mesolithic Poland. U5b1b1* is present throughout Europe today but especially in eastern Europe.

Ok, then U5b1b1a must be a Uralic lineage in Saami.

Samuel Andrews said...

@Them Mee,
"Interesting. Are there any other haplogroups that are or seem to be unique to Iberia that you know of?"

There's many. I'll post about mtDNA again sometime.

H.J.B. said...

@Anthony Haken BOO and 1900-700BC are way too early for (Pre)Proto-Saami, see [1]. Maybe some non/para-Uralic substrate, I don't know. The spread of Saamic languages is relatively recent, likely from the Lakeland northwards.

[1] Ánte, L. S. S. (2012). An essay on Saami ethnolinguistic prehistory. A Linguistic Map of Prehistoric Northern Europe. Mémoires de la Société Finno-Ougrienne, 266, 63-117.

Mauri said...

@Davidski

"Apart from N1c, Saami mostly carry the Germanic I1, R1a-Z284 and R1b-M269. They probably have some Baltic R1a-Z280 as well, because Finns do."

That is right. It is also likely that that Saamis carry new Siberian-Fennoscandinavian admixtures apart from BA, IA, Baltic Finns etc. Therefor only isolated admixture proportions, like Nganasans, EHG and WHG works well enough to map them.

Davidski said...

@M. Myllylä

Nganasans work so well in models for Uralic populations probably because they have some EHG, but not too much.

Despite of what someone at your blog said, Bolshoy can't explain the eastern ancestry in Levanluhta_IA/Saami very well, probably because its EHG ratio is way off.

But one of the Bolshoy samples, the female outlier BOO006 does seem to have the right ratio of EHG, because she can be used successfully instead of the Nganasans.

So it's an interesting question what BOO006 represents and whether such a population already existed within Europe's borders when she was alive.

Here's a qpAdm model for BOO006...

BOO006
Bolshoy_Oleni_Ostrov 0.795±0.039
Nganasan 0.205±0.039
chisq 13.899
tail prob 0.307225
Full output

IMHO these are the sorts of details you should be investigating at your blog.

Anthony Hanken said...

@H.J.B
"Maybe some non/para-Uralic substrate, I don't know. The spread of Saamic languages is relatively recent, likely from the Lakeland northwards"

http://www.academia.edu/34795141/Carpelan_Christian_and_Asko_Parpola_2017._On_the_emergence_contacts_and_dispersal_of_Proto-Indo-European_Proto-Uralic_and_Proto-Aryan_in_an_archaeological_perspective._Pp._77-87_in_Adam_Hyllested_and_al._eds_Language_and_Prehistory_of_the_Indo-European_Peoples._Copenhagen

Check out page 81 for what I was reffering to about the Saami. This of course only works if Levanlhuta can be modeled well by BOO and Germanic alone.

Leron said...

There's no mystery as to Uralic. The equations are simple: language from the west, genes from the east. People are racking their brains trying to makes both go to one of either direction.

From a historical perspective, this usually happens when a subservient group suddenly gains dominance after having adopted the language of their former leaders. An EHG related group is the most likely candidate of the original pre-Proto-Uralic speakers.

Language: IE-influence + Proto-Uralic => Uralic

Genes: EHG <= West Siberia + Nganasan

Shaikorth said...

Here's a G25 model (scaled and without penalization) I posted a while ago using BOO6...

Levanluhta_IA

Bolshoy_Oleni_Ostrov_BOO006,49
Baltic_IA,28.4
Levanluhta_IA_o,22.6

Glazkova_BA,0
CWC_Baltic,0

Since then I tested the BOO samples themselves. The paper offers a convincing explanation of why the BOO samples are not homogenous. They suggest the BOO samples had a recent mixture date in their time and that there was a contact between them and Yakutia and Taymyr:

"Additionally, the nearest counterparts of Vardøy ceramics, appearing in the area around 1,600-1,300 BCE, can be found on the Taymyr peninsula, much further to the East. Finally, the Imiyakhtakhskaya culture" from Yakutia spread to the Kola Peninsula during the same period."

Therefore the Bolshoy samples can be seen simply as a mixture in progress, with BOO6 having the most Taymyr-related ancestry. And as expected with that conclusion, fitting rest of the Bolshoy samples as mixtures of BOO6 and other populations results in a considerable distance improvement and a reduction in overfitting, for instance:

Bolshoy_Oleni_Ostrov:BOO003
"distance%=5.6257"
EHG,42.4
Glazkova_BA,42.4
Comb_Ceramic,5
Sintashta_MLBA_o3,4.6
Barcin_N,3
Shamanka_N,2
CWC_Baltic,0.6

West_Siberia_N,0

With BOO6 added to sources:

Bolshoy_Oleni_Ostrov:BOO003
"distance%=2.1907"
Bolshoy_Oleni_Ostrov:BOO006,71.8
EHG,11.2
Comb_Ceramic,8.6
Sintashta_MLBA_o3,6.2
Barcin_N,2.2

CWC_Baltic,0
Glazkova_BA,0
Shamanka_N,0
West_Siberia_N,0

Mauri said...

@Kristiina,

"There is an infinite amount of possible modellings. To get a more balanced view, you should take into account M. Myllylä's post: http://terheninenmaa.blogspot.com/2018/12/finnish-and-north-russian-ancestries.html#comment-form"

We have two point of views here, in one we can explain more recent populations by using EHG, WHG, CHG, ENA and early farmers. This helps us to map a large number of populations over Eurasian in a way we can compare results, but of cource that kind of results don't explain the whole history. Already Bronze Age populations were more mixed and present-day populations are mixtures of Bronze Age and Iron Age populations. In a best practice we should use contemporary popelations as admixtures, but for more practical reason it is not possible. Anyway, if the time frame between used admixture populations is big we have risk to fail, because same admixtures are hidden.

EastPole said...

A map credited to Max Planck Institute shows that Finland was populated from Siberia and from Poland:

https://knowridge.com/2018/12/ancient-dna-shows-the-sami-and-finns-share-identical-siberian-genes/

On your PCA map it would look like this:

https://i.postimg.cc/XY0X1229/Finland.png

I wonder what they mean by these arrows from Poland and Siberia? Balto-Slavic Corded Ware and Uralic Siberian HG?
I hope they don’t follow Carlos theory.

Davidski said...

@EastPole

It seems to me that Lamnidis et al. didn't notice how extremely Germanic Levanluhta_IA_o was.

And also that Levanluhta_IA was significantly Germanic.

EastPole said...

@Davidski

“It seems to me that Lamnidis et al. didn't notice how extremely Germanic Levanluhta_IA_o was.

And also that Levanluhta_IA was significantly Germanic.”

Yes, I see. Germano-Siberian_Finnic (GS_F) model looks very probable, in view of the presented evidence more probable than Balto-Siberian_Finnic model (BS_F):

https://i.postimg.cc/VLMVXcK9/Finland2.png

That’s why I wondered why on Max Planc’s map they put an arrow from Poland and not from Sweden.

Davidski said...

@Leron

There's no mystery as to Uralic. The equations are simple: language from the west, genes from the east.

An EHG related group is the most likely candidate of the original pre-Proto-Uralic speakers.


Not as far as Saamis and Baltic Finns are concerned. For them it's language from the east, a lot of the genes from the west, and some from the far east.

Take a look at the mixture models in Lamnidis et al. and also my analyses above. EHG doesn't explain anything for Uralic.

Gabriel said...

@EastPole

I wouldn’t be surprised if Max Planck adopted Carlos’ theories about CWC being Uralic. If anything they’re contrarian at best and extremely biased at worst; it’s almost like they get their researchers off a mental ward. At this point they should stop trying.

Davidski said...

@All

Please note that I generally discourage discussing Carlos and his theories at this blog.

There's a dedicated post and thread for his CWC = Uralic nonsense here...

Indo-European crackpottery

Davidski said...

Glazkovo_EBA is practically the same thing as Baikal_HG.

Shaikorth said...

The results of the Tarand Grave samples might "bring back" some CWC=Uralic theory but only at an amateur level which it has never left in the first place, not in academic papers. Besides them having N1c1 CWC lacks the cultural correlate will be Tarand <- Dyakovo <- Textile Ware <- Volosovo, not CWC. The origin theory they're going to present will reflect that.

Anthony Hanken said...

Considering both BOO men are N-L1026 I would bet they carry a decent amout of whatever admixture PU speakers carried. The problem is BOO is seemingly a mix of two respectively eastern and western populations. As far as I know until we have more aDNA we can't know which part of BOO was Uralic.

That being said it seems like what happened in Finland is becoming more clear.

Davidski said...

Here are a couple more models for Levanluhta_IA this time using the Bronze Age samples from the Mongolian steppe. Hovsgol_BA_o1 is the extremely Siberian-like outlier, while Hovsgol_BA_o2 is the half Steppe_MLBA one.

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.288±0.069
Hovsgol_BA_o1 0.149±0.045
Levanluhta_IA_o 0.563±0.039
chisq 6.499
tail prob 0.83806
Full output

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.233±0.084
Hovsgol_BA_o2 0.335±0.103
Levanluhta_IA_o 0.432±0.044
chisq 7.395
tail prob 0.766224
Full output

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.242±0.082
Hovsgol_BA 0.184±0.058
Levanluhta_IA_o 0.575±0.039
chisq 9.033
tail prob 0.618867
Full output

Michalis Moriopoulos said...

Would it be possible to run qpAdm to estimate the amount of ANE or EHG in Nganasans? I guess Shamanka Neolithic or Devil's Gate would suffice to model the East Asian ancestry. Since Nganasans are the closest thing we've got to the Siberian component in Uralics, it'd be nice to know what they're actually made of.

capra internetensis said...

@JuanRivera

None of those are CT*(xC, D, E, F) or BT*(xB, CT), they are just low coverage so the downstream haplogroup cannot be identified. You have to look carefully at what hgs have actually been excluded.

According to genetiker the Malawian BT*s are B2b and the Natufian CT* is E, which is exactly what you'd expect.

Davidski said...

@Mike

I don't have Shamanka_N in this dataset (although it is in the G25 datasheets), but Hovsgol_BA_o1 should be a very good stand in...

Nganasan
EHG 0.072±0.016
Hovsgol_BA_o1 0.928±0.016
chisq 19.047
tail prob 0.060247
Full output

Not sure what to make of that. I might look at this issue in more detail soon and write something up.

Leron said...

@Davidski

What your chart shows is Saami as representatives of Iron Age Nganasan-like people coming westwards along the Artic coast. In contrast, the Finno-Ugric are descendants of the older Mesolithic Karelian group that were EHG rich (R1a), with very little Eastern genetic contribution except perhaps a some females. This is an indication that (proto) Uralic was present in this region prior to any major entry of Nganasan/Easterners (ydna N carriers). This is further strengthened by the fact that Uralic is found as a substrate in Germanic, although a relatively faint one which suggests a very ancient presence. Between the Baltic Sea and the Urals during the Mesolithic existed the Kunda culture. They would best represent a group that spoke an ancestor language to Uralic. To their south Proto-IE was developing, and eventually reached north by way of Iranic which influenced Uralic directly.

If Uralic was coming directly from the east we would see at least a faint trail of Uralic substrates instead of a sprachbund with northeast Asian languages. Uralic terminates in the east with the Nganasan proper. Their genetic brothers to the east speak completely unrelated languages while those to their west speak other branches of Uralic. This implies a western source for Uralic that was predominately EHG with some older Eastern components, which later mixed with newer immigrant tribes (Nganasan-like) who adopted their language and expanded further during the Iron Age after being enriched with eastern ancestry.

edit: meant EHG

Davidski said...

@Leron

If EHG was the Proto-Uralic population, then how do you explain the almost total lack of EHG Y-haplogroups in Uralics, like basal R1a, R1b and J?

Leron said...

@Davidski

EHG as whole was very likely parent to more languages than merely Uralic. In the EHG northeastern zone, where J-carriers never reached to any significant degree, would be where Proto-Uralic develops. It is also here where R1a predominates, in particular during the Mesolithic (Karelia). As the Neolithic approached the Pit–Comb Ware culture enters into the landscape from the east, where Paleosiberian speaking (and N carrying) tribes are expanding from their ancient home in Northeast China (Xinglongwa culture).

These migrants would be relatively small tribes far from their ancestral lands and would naturally begin to adopt the culture and language of the more numerous EHG groups surrounding them without having to thoroughly mix with them. Except in the case of the Uralic speaking Selkups and Khanty, since a small amount of R1a came into them.

Later, Iron Age cultural innovations from the eastern steppe began to invigorate the N-carrying branch of Uralic speakers to carve a bigger settlement for themselves (Saami and Ugric expansions) and carry on the Uralic mantle lost by most EHG descendants. Eventually the western groups took hold again, so the N-carriers were relegated to the less hospitable areas (Samoyeds). In recent times, these remote isolated tribes are being erroneously equated with the progenitors of Uralic. Despite the language itself having once had a more extended presence into areas that would later become Germanic and Baltic.

Davidski said...

@Leron

There's one each of basal R1a and J in the EHG from Karelia.

But these markers are extremely rare in modern Uralics. Practically all of their R1a is the same as in Sredny Stog on the steppe and Corded Ware, which is R1a-M417.

In fact, most of the R1a in Uralics is the Balto-Slavic subclade of R1a-M417, which is R1a-Z280.

Anonymous said...

Linguistically it is a long term consensus that Proto-Uralic already had absorbed many IE words. So, it was Pre-Uralic X plus some IE (Yamna, CW) like adstrate at their point of origin.
All branches of FU (not that sure of Uralic stage itself due to discussion re Samoyeds) share proto-Indo-Iranian loanwords that mostly are dated to proto-FU stage. I can’t believe this happened without at least some adstrate Sintashta genes.
Later West Uralic (Finnic) would absorb lot of words of Balto-Slavic, later North Baltic origin; plus almost at same stage or bit later a lot of words of pre-Germanic, then proto-Germanic, then North Germanic origin.
These Baltic and Germanic layers would not be present at very Eastern Uralic speakers, but IndoIranian would.

That is from current understanding re IE loans to Uralic, at least via my own prism of whatever I have read what made sense to me.

Michalis Moriopoulos said...

@Dave

Thanks, that's very helpful.

Toby_P said...

Where are we at with Uralic ? Linguists sometimes choose a western origin. Is the Siberian admixture seen in FUs from a single source, or various different ones ?

Davidski said...

@Toby_P

Where are we at with Uralic? Linguists sometimes choose a western origin.

How western and which linguists?

The academic consensus is that the Proto-Uralic homeland was somewhere around the southern Urals.

Samuel Andrews said...

I don't know about the Germanic thing. It's always been obvious Saami's non-Asian side has the most European HG and least Anatolian farmer. ANE-admixed, EHG-admixed North Asians can explain the excess ANE signal we've always seen in Saami. But an Iron age Scandinavian pop can't explain all of the WHG in Saami.

More or less I guess the recent hunter gatherers who roamed Fennoscandia, unlike the ones in the Baltic states, left almost no genetic trace because Saami's extra ANE can mostly explained by Uralic migration. This reminds me of how recent hunter gatherers of the Asian Steppe also "disappeared."

Davidski said...

I'm pretty sure that the 0.241±0.092 of Bolshoy_Oleni_Ostrov-related admix in Levanluhta_IA explains its high/low HG/EEF ratios.

Obviously, Bolshoy_Oleni_Ostrov is not all Siberian, and in my model not necessarily Uralic either.

EastPole said...

@Shaikorth

“Besides them having N1c1 CWC lacks the cultural correlate will be Tarand <- Dyakovo <- Textile Ware <- Volosovo, not CWC.”


If Tarand population turns out to be similar to modern East Europeans and contains some EEF component then following chain of genetic influences should be taken into account:

CWC > Fatyanovo–Balanovo/Abashevo > Volosovo > Textile Ware > Dyakovo > Tarand

Queequeg said...

@ Shaikorth and re:

Levanluhta_IA

Bolshoy_Oleni_Ostrov_BOO006,49
Baltic_IA,28.4
Levanluhta_IA_o,22.6

Glazkova_BA,0
CWC_Baltic,0

So, in practical terms you're saying that BOO006 represent the people who spoke Pre Proto Saami, a West Uralic language like Baltic Finnic. Would you say that BOO006 also represents the whole West Uralic speech community or connect just some building parts of BOO006 to West Uralic speakers?

Davidski said...

The model with BOO006 relies on the assumption that there were whole populations like this female in Europe during the Bronze Age, and the more typical Bolshoy_Oleni_Ostrov samples don't represent anything.

Sort of hard to believe.

Toby_P said...

@ Davidski

By ''west'', I meant west of the Urals, in the Russian forests. Linguists such as Haakinen amongst them.
Do we agree there are multiple sources of ''Siberian'' type ancestry in Uralics, over time ?
If so, then something like Leron's scenario seems possible & nuanced.

Shaikorth said...

@Eastpole
Not necessarily because the tarand Estonians are not genetic Volosovo transplants. Dyakovo and Textile ceramics probably have higher CWC than Volosovo or the eastern Comb Ceramic horizon though.

@Huck Finn

Didn't say anything about linguistic side, and if we go by the view of pretty much any linguist BOO and Bronze Age Taymyr populations spoke some unknown language. However again in line with linguistics both CWC and full EHG look very unlikely to be Uralic on the genetic side regardless of what the Southern Urals or Kama region were like. Inspires by Sein and Arza, the non-Evenk like parts of Samoyedic speakers and Ob-Ugrians look like this in G25 terms of deep structure:


Khanty-Evenk
distance%=16.6166
West_Siberia_N,43.6
EHG,41.4
Barcin_N,15
CWC_Baltic,0
Shamanka_N,0


Nenets-Evenk
"distance%=21.9837"
West_Siberia_N,57.6
EHG,29.2
Barcin_N,13.2
CWC_Baltic,0
Shamanka_N,0

Davidski said...

@Toby_P

Do we agree there are multiple sources of ''Siberian'' type ancestry in Uralics, over time?

Sure, if Bolshoy_Oleni_Ostrov and related groups didn't speak languages from now dead, early Uralic language families. They may have.

If so, then something like Leron's scenario seems possible & nuanced.

Not sure where you got the nuanced part? Leron's theory in large part hinges on the R1a in Uralics being from EHG from what is now Karelia and surrounds.

This is obviously impossible, since 99% of the R1a in Uralics is derived from Indo-Europeans, and only distantly related to the R1a(xM417) in the northern EHG.

So once we take this away, Leron's theory relies on EHG admixture in Uralics, which isn't exactly a specific signal, and linguistics.

Queequeg said...

@ Shaikorth: looks very good, IMO.

Anthony Hanken said...

Please read the
Formation of the Indo-European and Uralic (Finno-Ugric) language families in the light of archaeology by Asko Parapola
if you believe CWC was Uralic and keep in mind we have found Ugric N1c in Sargat kurgans and N1c in Tarand grave, the only Uralic cultures mentioned that have been actually tested.

Anthony Hanken said...

@Davidski

What do you think about WSHG or related being a Proto-Uralic marker? Obviously it is missing the Nganassan component but it seems high among Uralic speakers and Neolithic West Siberia is the right place and time.

Synome said...

The link between the Ymmyakhtakh culture and BOO suggests they may have spoken a language related to Yukaghir.

Similarly, Nganasans may be Uralicized Yukaghirs.

The Sikora 2018 supplement discusses the links between Uralic and Yukaghir. The time depth is probably too great to ever be sure of a direct relationship, but there is evidence of contact between the two languages.

Gabriel said...

@Davidski
@Toby_P

Isn’t Leron’s theory based on the outdated idea Uralics were Mesolithic Northern Europeans?

Davidski said...

@Anthony Haken

What do you think about WSHG or related being a Proto-Uralic marker?

It doesn't appear to me that the expansions of Uralic languages and N1c were accompanied by a spread of the WSHG genetic component. But I might take a closer look at that.

Leron said...

I see Early EHG with basal R1a as an indicator for a population that spoke a proto language ("Eurasian") that existed prior to IE, Uralic, and perhaps others, splitting off from it. This early stage is detected linguistically by correspondence between Uralic and IE that exclude later innovations from IE daughter languages. The proto-IE and proto-Uralic branches diverged into different areas, although not that far apart geographically.

After the IE-Uralic split but before the expansion of IE languages is when an eastern foreign people (ydna N/progenitors of Pit Comb Ware), came to interact and shift languages to proto-Uralic. Their eastern ancestral languages are possibly hidden in the substrate of Pre-Finno-Ugric and Saami. The substrate is sometimes assumed to be "Old European" but not found in Germanic or Baltic. However, other linguists propose the substrate being connected to Mongol-Tungusic languages. In addition, Samoyedic is found to have a Far East Paleosiberian substrate. This subject is discussed mostly in Russian, so it's not easily accessible to English readers.

The IE line diversifies with Caucasian people in contrast to the Proto-Uralic one. When IE speakers expand north they replace the Proto-Uralic tongue of people in proximity to them (explaining the Uralic substrate) and enrich the region with their own R1a lineages over the older ones. The people further away from them keep the Uralic language but are influenced linguistically by various branches of IE.

By my estimation CWC people were not speaking Uralic, but instead primarily had a language closer to IE or even IE itself.

Anthony Hanken said...

@Synome

Interestingly modern Yakuts do not belong to N-L1026 like BOO but are instead N-M2019.

Nganassans are thought to be Paleo-Siberians who were assimalated by Samoyeds migrating north.

Shaikorth said...

@Anthony Haken
Karafet 2018 mentions the theory and concurs re:Nganasans and to a lesser degree Nenets. Low L1026 there is also an indicator. For its part, Glazkova doesn't have it either, they're mainly Q with one N1c which is autosomally the most western of them. However that N1c is B187 according to Russian Molgen, present in Okunevo and some modern Siberian Turkics but not in Uralic speakers.

EastPole said...

@Anthony Haken

“if you believe CWC was Uralic”

CWC was the Indo-European culture and spoke the Indo-European language by definition because this is the only certain linguistic, genetic and archeological link between Europe and India.

All the rest like PIE reconstruction and origin is BS speculation, it is like the language of Neanderthals, we will never know anything certain about it.

CWC language is certain and it is IE by definition because we have related R1a in Eastern European CWC and in India and thousands almost identical words (which do not require reconstruction) in Indo-Iranian and Balto-Slavic languages.
I just don’t understand how anybody who knows anything about languages and genetics can doubt it.

Gabriel said...

@Leron

1. But again damn near all R1a in modern Uralic populations is of Indo-European origin, be it Indo-Iranian, Balto-Slavic, Germanic or CWC. Even more northern Uralic populations tend to derive most of their R1a from Indo-Europeans and who knows what’s the source for some of the more basal R1a clades. So I highly doubt R1a had anything significant to do with Uralics.

2. Related to the last point, CWC was indeed Indo-European and the source of Balto-Slavic, possibly Indo-Iranian too and surely an eastern IE or even Balto-Slavic-related substrate in Germanic.

George said...

Hi,

Off topic:

Emergence and Spread of Basal Lineages of Yersinia pestis during the Neolithic Decline. Cell, 2018; DOI: 10.1016/j.cell.2018.11.005

From the Abstract: "We investigated the history of this strain by combining phylogenetic and molecular clock analyses of the bacterial genome, detailed archaeological information, and genomic analyses from infected individuals and hundreds of ancient human samples across Eurasia. These analyses revealed that multiple and independent lineages of Y. pestis branched and expanded across Eurasia during the Neolithic decline, spreading most likely through early trade networks rather than massive human migrations."

https://www.cell.com/action/showPdf?pii=S0092-8674%2818%2931464-8

From Caption of Fig 5B:
"Geographic distribution of the use of animal traction and wheeled transport across Neolithic and Bronze Age populations in Eurasia, which broadly expanded during the period of 5,500 and 5,000 BP. The expansion of these technological innovations overlaps the predicted period for the expansion of the basal Y. pestis strains."

George

EastPole said...

“The Genetic Ancestry of Modern Indus Valley Populations from Northwest India”

https://i.postimg.cc/6QXkVYh5/screenshot-459.png

https://www.cell.com/ajhg/pdfExtended/S0002-9297(18)30398-7#back-bib1

Leron said...

Them meee:

I think we just need to be more flexible with attributing all these R1a lineages as strictly IE. Due to only certain languages having survived, absence of written records and limited ancient genetic sampling, it's easy to overlook the diversity of languages that must have existed before even proto-IE. I'm done with the thread on Uralic. I will wait until newer studies get released to get more insight.

Davidski said...

@Leron

I think we just need to be more flexible with attributing all these R1a lineages as strictly IE.

All of the modern R1a lineages under M417 are strictly IE (and that's 99% of the R1a in the world today), because there are certain subclades that peak in specific Indo-European language families and both Turkic and Uralic populations carry younger subsets of these lineages.

There are no Uralic-specific R1a lineages that reach significant frequencies anywhere today.

You need to have a close look at the R1a in Corded Ware and how it relates to the subclades now found in Balts, Germanics, Slavs and Indo-Iranians. Unless you do that, you can't keep pushing your R1a = Proto-Uralic theory, because that's just sloppy and ignorant.

Synome said...

@East Pole

Something looks very weird with that admixture graph. Paniya are more Steppe than Iran Neolithic? Suspect.

Davidski said...

@Olympus Mons

You can't post here because you're banned and the ban is forever. See here...

Banned commentators list

And as for your charge that the comments at this blog are nothing but a so called echo chamber, you and the other banned commentators of your ilk obviously don't understand that to make a real contribution and an impact here you must offer coherent arguments based on accurate interpretations of data, not some fantasies pulled out of thin air.

All the best in one day understanding what I just said.

Davidski said...

@Synome

Paniya are more Steppe than Iran Neolithic? Suspect.

Not really.

The Paniya may or may not have some steppe ancestry, or at least steppe-related ancestry, like perhaps from Central Asian foragers such as the Kelteminar people.

But Iran Neolithic certainly has zero steppe ancestry.

Gabriel said...

Olympus Mons? Didn’t know he was still lurking here. I wonder what he’s up to these days...

@Leron

So is R1a-M417 Uralic, Indo-European or both? Can’t see them being both. Do R1a English people or Scandinavians derive from Uralics? Are Uralics from the Chalcolithic and R1a-M417 is from a pure EHG population that mixed with CHG at the same time to form Indo-European? Your whole argument sounds like a mess. But who am I to judge, when the data speaks for itself: R1a and steppe ancestry go hand-in-hand in almost all ancient DNA samples.

Davidski said...

@Them meee

Olympus Mons? Didn’t know he was still lurking here. I wonder what he’s up to these days...

He has his own blog now, but obviously feels the need to pop in here for old times' sake, or something.

Toby_P said...

Fair enough
Maybe Shamanka, Lokomotiv will do will to explain at least part of the Siberian signals in Uralics ?

Davidski said...

@Toby_P

Glazkovo_EBA is another name for Shamanka_EBA. But I'm just using it as a proxy for the main East Eurasian signal in Levanluhta_IA and thus Saami.

I don't actually believe that the Proto-Uralics, or even Pre-Proto-Uralics, came from near Lake Baikal.

It seems to me that at some point, probably during the Bronze Age, this Glazkovo_EBA-like ancestry became widespread all the way from far Northeastern Europe to East Asia, and somehow it became a feature of the N1c-rich Proto-Uralics before their main early expansions.

Leron said...

Them meee:

R1a > West Eurasian
R1a1 > Start of IE/Uralic split
R1a-M17 > End of IE/Uralic split
R1a-M417 > Pre-Proto-IE
R1a-Z283 > IE

Davidski said...

@Leron

Some Corded Ware individuals belong to R1a-L664, which is xZ283. And Sintashta is mostly R1a-Z93, which is also xZ283. So the Corded Ware paternal marker is R1a-M417 not R1a-Z283.

And like I said, practically all Uralic R1a is very young and obviously derived from extant Indo-European linguistic groups, mostly Balts and Slavs. Some of it might be from Srubnaya.

So it ain't from any start of the IE/Uralic split or end of such a split. It's from the Middle Bronze Age onwards, and even mostly the historic period.

Leron said...

The northeastern R1a1* lineages that spoke the language that sourced Uralic would have been replaced by immigrants carrying N. As I said previously about Uralic: language moved west to east, genes moved east to west. All these would be small tribes that did hunting and fishing and were bound to be erased by more vigorous populations coming one after another. Hence there's almost no survivals in present times of the older lines while the "young" ones were more successful.

I consider CWC to be speakers of a sister language to IE (Centum IE substrate). Indo-European would developed from the contact zone between Cucuteni–Trypillia and western steppe groups, that infused Caucasian-like culture/language elements to Proto-IE and allowed them to dominate a big part of the continent.

Davidski said...

@Leron

The northeastern R1a1* lineages that spoke the language that sourced Uralic would have been replaced by immigrants carrying N.

How convenient for you.

I consider CWC to be speakers of a sister language to IE (Centum IE substrate). Indo-European would developed from the contact zone between Cucuteni–Trypillia and western steppe groups, that infused Caucasian-like culture/language elements to Proto-IE and allowed them to dominate a big part of the continent.

And where, in your estimation, does CWC come from? Because R1a-M417 comes exactly from the contact zone between Cucuteni–Trypillia and western steppe groups.

So are you saying CWC just happens to be almost 100% R1a-M417 by coincidence?

Never mind, I'm not interested. You're not making any sense.

Gabriel said...

@Leron

“The northeastern R1a1* lineages that spoke the lenguaje that sourced Uralic would have been replaced by immigrants carrying N.”

Sounds like you’re trying to reconcile the genetic evidence, the Indo-Uralic hypothesis and the Uralic = Paleo-European theory with each other, but okay.

Queequeg said...

@ D and re:

"It seems to me that at some point, probably during the Bronze Age, this Glazkovo_EBA-like ancestry became widespread all the way from far Northeastern Europe to East Asia, and somehow it became a feature of the N1c-rich Proto-Uralics before their main early expansions."

Glazkovo_EBA or anything close to that is apparently way too eastern, if you're willing to connect that to Proto Uralic stage. Besides, based on that what we already know, there's no way to derive European N1c1 clades, such as those of Finns (and ancient Hungarians etc.) from BA Baikal.

Either way, in the next couple of months we'll know more.

Davidski said...

I never claimed that the Glazkovo_EBA-like ancestry was from near Lake Baikal. I said that it probably wasn't.

Queequeg said...

From Baikal or not, Glazkova_EBA like ancestry is according to my understanding related to parallel or secondary expansions vs. Proto Uralic. But, as said, some time next year we should know more.

Davidski said...

Well, BOO006 has a lot of Glazkovo_EBA-like ancestry.

Queequeg said...

Yes, related to a parallel expansion from Taymyr and/or Yakutia. Again and based on that what we already know, there's no way you can connect Proto Uralic, from linguistic point of view, to those places.

I'd however say that Proto Uralic speakers were possibly still sort of Siberians i.e. West Siberians, most probably based on some kind of a WSHG/EHG heavy mixture with a little bit neolithic farmer. Just like Shaik's model was implying.

Davidski said...

@Huck Finn

I'm not seeing a huge signal of WSHG in Levanluhta_IA when Bolshoy and Glazkovo_EBA are there, and then if given a choice the algorithm mostly picks Glazkovo_EBA, and gets a significantly better fit with it.

Do you have any ideas how I can model Levanluhta_IA with significant WSHG? Should I try leaving out Bolshoy? But why would I do that?

Levanluhta_IA
Bolshoy_Oleni_Ostrov 0.167±0.264
Glazkovo_EBA 0.188±0.095
Levanluhta_IA_o 0.617±0.079
West_Siberia_N 0.029±0.111
chisq 7.579
tail prob 0.577029
Full output

Queequeg said...

@ D: Relevant samples not being currently available, I think the combined amount of WSHG, naturally including that of BOO samples, is crucial. As Shaik said, BOO seems to be a mixture in process, including a West Siberian element, i.e. WSHG (and N1c1).

Kristiina said...

If there is one female sample in Bolshoy that is more eastern than the others and takes more Glazkovo than the others, we could indeed imagine that this female belongs to an other population on a more northern route and the yDNA of this population would be the same as we have in Glazkovo, i.e. Q1b1b1-YP4004 (Q-L53). In this model, Bolshoy N1c would represent the West Siberian part of the Bolshoy ancestry which would also contain some Baikal ancestry.

Anthony Hanken said...

@Kristiina

Or N-P43 instead of Q-L53. Considering it is down stream from the N-L666 found in BHGs and common in modern Samoyedic speakers.

Some people seem to be confused but modern testing and the little aDNA we have points to L1026 being born west of the Urals not in Yakutia.

Anonymous said...

Can Estonians be modeled with Glazkovo?

Davidski said...

@Parastais

Estonian
Glazkovo_EBA 0.023±0.007
Latvian 0.977±0.007
chisq 7.911
tail prob 0.792076
Full output

Chad said...

Maybe use Baltic BA. Latvians may be eating up some Eastern stuff.

Davidski said...

They are, but it's harder to model Estonians as Baltic_BA, because they have a bit of extra admix from different directions, including probably east, west and south. So using Latvians was quick and easy.

Queequeg said...

According to some models I've seen in the Finnish forum the amount of WSHG in BOO samples is quite sizeable (and there really is WSHG in BOO, as confirmed by Narasimhan). Roughly one fourth (N1c-man), one third (the woman with Iranic type of lineage) and even the most eastern sample i.e the woman coded as BOO006 is rather WSHG, one fourth part of the whole genebase. Then there's EHG, on top of that. Taking BHG (probably coming from Taymyr/Yakutia) away from the samples, i.e. by doing that creating a hypotethic West Uralic zombie, would naturally considerably increase the level of both WSHG and EHG in the samples.

Speculation, of course, but then again: the new IA sample of Finnish Tavastia (not published yet) is heavily biased towards Eastern European/Scandinavian HG's and apparently the amount of BHG is minimal, if it even exists.

Queequeg said...

P.S. BOO006: roughly one fifth part, not fourth. My apologies.

Davidski said...

@Huck Finn

Can you tell me how these Finnish forum members found so much WSHG in BOO, because this is what I'm getting...

Bolshoy_Oleni_Ostrov
CWC_Baltic 0.133±0.025
EHG 0.346±0.048
Nganasan 0.468±0.019
West_Siberia_N 0.054±0.044
chisq 12.861
tail prob 0.116738
Full output

Bolshoy_Oleni_Ostrov
Kivutkalns_BA 0.130±0.024
EHG 0.334±0.047
Nganasan 0.473±0.019
West_Siberia_N 0.063±0.044
chisq 12.186
tail prob 0.143075
Full output

Bolshoy_Oleni_Ostrov
Baltic_IA 0.099±0.035
EHG 0.411±0.068
Nganasan 0.451±0.028
West_Siberia_N 0.039±0.060
chisq 2.707
tail prob 0.951392
Full output

Queequeg said...

@ D: not willing to get involved into lengthy discussions concerning right method, relevant outpops etc., I'd however say that even 4- 6 % of WSHG in BOO is enough, for the sake of argument. By excluding Nganasan you will anyways, just by relying on your own model, end up having lots of EHG and yet quite a lot of WSHG in BOO.

So, in practical terms we anyways end up discussing the right amount of BHG in Proto/West Uralic speakers, before BOO took place. My question is: if there was BHG, why it is not clearly visible in Estonian Tarands or in the new Finnish IA sample? Where did it go?

Disclaimer: I'm deeply convinced that at least Pre Proto Uralic speakers were Siberians, but West and definitely not East Siberians.

Davidski said...

@Huck Finn

Excluding Nganasan doesn't work. You can see that in the full output that I posted. It results in failed models.

So I'm still not seeing any significant WSHG in BOO.

My question is: if there was BHG, why it is not clearly visible in Estonian Tarands or in the new Finnish IA sample? Where did it go?

This is a good question.

On a related note, did you notice that in my model BOO is modeled as part Baltic_IA which belongs to N1c, and in the Lamnidis et al. models BOO requires significant EEF?

Hmm...interesting.

Queequeg said...

@ D: obviously you can't exclude Nganasan if you're looking at BOO, which is an outcome of a BHG related admixture event. What I meant was that by excluding Nganasan i.e. taking it off the model outcome, you'll get a hypotetical Proto/West Uralic speaker, kind of Proto/West Uralic Zombie, if you like.

Baltic_IA shouldn't be too far off, when we're discussing Proto Baltic Finns, still I'd guess it is too Baltic (too much WHG, too much EEF). That being said, I'd guess that this hypotethical Proto/West Uralic Zombie does need some EEF, not just WSHG/EHG.

Gabriel said...

@Davidski

“On a related note, did you notice that in my model BOO is modeled as part Baltic_IA which belongs to N1c, and in the Lamnidis et al. models BOO requires significant EEF?”

What could be this source of N1c then, in regards to that Baltic_IA-like population?

Davidski said...

Seems to me like N1c was introduced into BOO from the south, not from the east.

Gabriel said...

@Davidski

Yeah that’s why I asked. What population could have given this Baltic_IA-like population their N1c?

Also that’s a kinda radical shift in evidence... but maybe it simplifies the question of N1c in Balts a bit?

Anthony Hanken said...

@Them meee

N1c obviously orginally came from the east. BOO is just too late to represent the original N1c expansion west of the urals and doesn't make sense for the wave that brought it to Finland.

Imo N1c into Baltic_IA goes Volosovo-Netted Ware- Tarand graves or other related post Netted Ware cultures- Baltic_IA. Basically whatever PU looked like plus CWC/European HG as they moved west.

Gabriel said...

@Anthony Haken

Yes, I know.

But I wonder, in addition to what kind of Siberian ancestry they have, if any archaeological culture could be linked to it and their N1c.

Though it’s definítely interesting BOO likely got their N1c from a mostly CWC-derived population.

Davidski said...

@Them meee

Though it’s definitely interesting BOO likely got their N1c from a mostly CWC-derived population.

This is rather unlikely. More likely is that a population similar to Baltic_IA, but with a much higher ratio of EHG, spread N1c to the west and north from somewhere just west of the Urals around 1700 BC.

Otherwise we're looking at an almost total population replacement by such migrants in the East Baltic from the Bronze Age to the Iron Age, which doesn't make any sense.

Queequeg said...

@ Them meee and re:"What population could have given this Baltic_IA-like population their N1c?"

This is a good question. I'd guess that migrating N1c's were different from the local population in the same way as Baltic_IA is different from populations of the preceding periods, especially compared to local CW.

Anonymous said...

Re Baltic_IA I recall in anthrogenica forums I asked Shai if it could be modeled using Baltic_BA Kivutkalns to find out the ‘non local component of it’, but I think it was difficult (due to low coverage?) to do so. Instead it preferred modern Latvians, which is a bit strange. And it was like 95% Latvian + EHGish something.

Davidski said...

Baltic_IA is basically the same as Baltic_BA, but this doesn't mean that there was no population movement into the East Baltic at this time bringing N1c.

There probably was, and much of the ancestry of Baltic_IA might not be local, it just looks local.

Taking into account the N1c in Baltic_IA and BOO, the most parsimonious scenario is that there was an EHG-rich population somewhere in between them that received N1c from the east and expanded both west and north, taking Uralic languages with it.

So it's possible that BOO was partly of Uralic stock.


Queequeg said...

@ D: agreed, makes a lot of sense. However, I wouldn't forget about WSHG yet, pls remember Shaik's Khanty/Mansi minus Evenki model. Both Khanty and Mansi are Uralic speakers too.

And, who knows, maybe Narasimhan is not as hopeless as you seem to think? People have their bright moments.

Anonymous said...

Those Proto-Finns who arrived into Baltics would do that via former Fatyanovo (CW) territory, and it is often said in archeological literature that Fatyanovo served as significant substrate for later Finnic cultures. So, CW Fatyanovo substrate (possibly very similar to Baltic BA, perhaps more EHG shifted) for incoming N population is very expected. That Fatyanovo impact would be post-proto-Uralic however, but pre-proto-Finnic.

On other hand I expect some CW substrate for proto-Uralic as well. Every Uralic language contains archaic PII loanwords, at least that used to be consensus, dunno if that has changed.

Anthony Hanken said...

@Them meee

I was giving you archealogical cultures that are likely responsible for N1c in Baltic_IA. Netted Ware related cultures froming along Volga and then migrating west into the Baltic. (D'yakovo-Tarand graves, Akozino-Akhmylovo, etc). Probably responsible for different branches N-Z1936 and N-VL29 for example did not separate in Finland or the Baltic but are both found there today.

I don't think Baltic_IA carries any more Siberian then a modern Lithuanian but if it does it could be explained by a lot of things. Proto-Uralic, CCC, north to south migrations from post-BOO Finland which explains modern Estonians and probably many more scenarios.

Simon_W said...

Mouthful: "Baltic_IA sample is dated to 2nd-5th centuries, so I'd hold your horses on him being Finnic."

Davidski: "I have no idea what language he spoke, especially since he was dug up in Lithuania. He may have spoken some early dialect of Balto-Slavic."


AFAIK Baltic_IA sample DA171 is dated to 350-650 AD. There can be very very little doubt that he spoke a variety of Baltic. At such a late date, right in the middle of the historical Baltic area, and with an ultra-Baltic position in the PCA. But yeah, as Davidski said, it's not the point indeed.

Davidski said...

Oops, yeah the date for Baltic_IA DA171 is BP not BC.

But this doesn't affect my blog post.

Rob M. said...

Can I get a source on the H20a1a sample from neolithic spain?