search this blog

Showing posts with label Central Asia. Show all posts
Showing posts with label Central Asia. Show all posts

Thursday, February 22, 2024

Berkeley, we have a problem


A new preprint at bioRxiv by Kerdoncuff et al. makes the following, somewhat surprising, claim:

One of the individuals, referred to Sarazm_EN_1 (I4290) described above that was discovered with shell bangles showing affiliation with South Asia, has significant amount AHG-related ancestry, while a model without AHG-related ancestry provides the best fit for Sarazm_EN_2 (I4210) (Table S4.5).

First of all, the authors are actually referring to sample ID I4910 not I4210.

The aforementioned table, based on qpAdm output, shows that I4290 has 15.9% AHG-related ancestry and basically no Anatolian farmer-related ancestry. It also shows that I4910 has no AHG-related ancestry but 17.9% Anatolian farmer-related ancestry.

AHG stands for Andaman hunter-gatherer. The authors are using it as a proxy for South Asian hunter-gatherer ancestry.

However, I've looked at I4290 and I4910 in great detail over the years using ADMIXTURE, Principal Component Analysis (PCA), and qpAdm. And I'm quite certain that they do not show any obvious, above noise level South Asian ancestry. Indeed, I'd say that if they do have some minor South Asian ancestry, then I4910 probably has more of it than I4290.

Kerdoncuff et al. used the following "right pops" or outgroups: Ethiopia_4500BP.SG, WEHG, EEHG, ESHG, Dai.DG, Russia_Ust_Ishim_HG.DG, Iran_Mesolithic_BeltCave and Israel_Natufian.

This means they mixed data that were generated in very different ways (DG, SG and capture) and included some poor quality samples. For instance, the highest coverage version of Iran_Mesolithic_BeltCave offers just ~50K SNPs.

Mixing different types of data and relying on low coverage samples, even in part, often has negative consequences when using qpAdm. So I suspect that the above mentioned mixture results for I4290 are skewed by a poor choice of outgroups.

When I run qpAdm I try to stick to one type of data and avoid low quality singletons in the outgroups. This is the best qpAdm model that I can find for Sarazm_EN:

right pops:
Cameroon_SMA
Morocco_Iberomaurusian
Israel_Natufian
Levant_N
Iran_GanjDareh_N
Turkey_N
Russia_Karelia_HG
Russia_WestSiberia_HG
Mongolia_North_N
Brazil_LapaDoSanto_9600BP

Sarazm_EN
Kazakhstan_Botai_Eneolithic 0.113±0.017
Turkmenistan_C_Geoksyur_subset 0.887±0.017
P-value 0.06392

Sarazm_EN_1 (I4290)
Kazakhstan_Botai_Eneolithic 0.129±0.021
Turkmenistan_C_Geoksyur_subset 0.871±0.021
P-value 0.11019

Sarazm_EN_2 (I4910)
Kazakhstan_Botai_Eneolithic 0.104±0.021
Turkmenistan_C_Geoksyur_subset 0.896±0.021
P-value 0.07427

Also...

Sarazm_EN
Andaman_hunter-gatherer -0.018±0.020
Kazakhstan_Botai_Eneolithic 0.123±0.019
Turkmenistan_C_Geoksyur_subset 0.895±0.020
P-value 0.0298403
(Infeasible model)

Please note that Turkmenistan_C_Geoksyur_subset is made up of just three relatively high quality individuals: I8504, I12483 and I12487. That's because it's not possible to model the ancestry of Sarazm_EN using the full Geoksyur set, probably due to subtle genetic substructures within the latter.

Below is a PCA plot that, more or less, reflects my qpAdm model. I4290 and I4910 are sitting right next to each other in a cluster of ancient Central and Western Asians, and it's actually I4910 that is shifted slightly towards the South Asian pole of the PCA. Indeed, I can confidently say that there's no way to design a PCA in which I4290 is shifted significantly towards South Asia relative to I4910.

Citation...

Kerdoncuff et al., 50,000 years of Evolutionary History of India: Insights from ∼2,700 Whole Genome Sequences, bioRxiv, posted February 20, 2024, doi: https://doi.org/10.1101/2024.02.15.580575

See also...

The Nalchik surprise

A comedy of errors

Thursday, February 10, 2022

Mainstream media BS: Europeans owe their height to Asian nomads


From a recent Daily Mail article by some clown named Sam Tonkin:

Present day Europeans owe their blue eyes to hunter gatherers, their height to Asian nomads and their blonde hair to Anatolian Neolithic farmers, a new study suggests.

...

Most of the contemporary European genetic makeup was shaped by movements that occurred in the last 10,000 years when local hunter gatherers mixed with incoming Anatolian farmers — from present-day Turkey — and Asian nomads, or Pontic Steppe pastoralists.

The latter originated from what is now parts of Bulgaria, Romania, Moldova, Ukraine, Russia and Kazakhstan.

Haha.

Bulgaria, Romania, Moldova and Ukraine are European countries. The relevant parts of Russia and Kazakhstan are also located in Europe.

Obviously, the author is referring to the Yamnaya herders who lived on the Pontic-Caspian steppe, which is obviously in Eastern Europe.

I blame Johannes Krause for this.

See also...

Matters of (basic) geography

Blond hair is only indirectly associated with Anatolian ancestry in Estonia...duh

Ancient ancestry and complex traits in Estonians (Marnetto et al. 2022)

Sunday, March 14, 2021

A comedy of errors


A couple of years ago, the authors of a paper about a group of Iron Age nomads from the site of Shirenzigou, in the eastern Tian Shan, made a mistake. They wrongly assigned two of these nomads to Y-haplogroup R1b-M269.

This faux pas made them believe that the Shirenzigou nomads were closely related to the M269-rich population associated with the Afanasievo culture.

Indeed, since the Afanasievo culture was often credited with the spread of Tocharian languages to the Tarim Basin, these authors, led by Chao Ning, also concluded that the Shirenzigou nomads were potentially the missing link between the Afanasievo culture and the Tocharians (see here).

Moreover, Ning et al. used formal statistics to argue that the Shirenzegou nomads harbored Afanasievo-related genome-wide ancestry, rather than Sintashta-related genome-wide ancestry, despite the fact that the latter ancestry was widespread in the Tian Shan and surrounds during the Bronze and Iron ages. Soon after, another group of authors, led by Chuan-Chao Wang, also went out of their way to link the Shirenzigou nomads to the Afanasievo people with genome-wide DNA using formal statistics (see here).

Interestingly, one of the Shirenzigou nomads belongs to Y-haplogroup R1a-Z93, which is an obvious Sintashta-related lineage. Both Ning et al. and Wang et al. missed this important fact.

They also missed the key fact that the R1b lineage found in the Shirenzigou nomads actually belongs to an Inner Asian subclade, which is only very distantly related to the originally Eastern European R1b-M269.

Now, formal stats are a very useful tool for studying genome-wide ancestry. But they're not infallible, and that's actually something of an understatement. Indeed, if you don't run sanity checks when using formal stats, you're likely to come to some unusual, even arse about face, conclusions. Uniparental markers, like Y-chromosome haplogroups, can provide a robust sanity check when running formal stats on genome-wide data.

One problem with formal stats is that Sintashta-related ancestry often looks very much like Afanasievo-related ancestry when it's mixed with indigenous Central Asian ancestry. Basically, the reason why this happens is that the Central Asian ancestry dampens the Early European Farmer (EEF) signal in the Sintashta-related ancestry.

This is an artifact that once caused scientists at Harvard to believe that Central Asian Scythians and present-day South Asians lacked Sintashta-related ancestry.

Unfortunately, since the publication of the Ning et al. paper, a consensus has emerged in academia that the Shirenzigou nomads are indeed the missing link between the Afanasievo culture and the Tocharians. But, let's be objective and honest here, it's a consensus based on nothing more than a comedy of errors.

On the other hand, me and most of the commentators at this blog have formed opinions about the Shirenzigou nomads that are totally at odds with the academic consensus, that:

- they're a complex mixture of Sintashta-related, indigenous Central Asian and Tibetan-related ancestries, with no clear, unambiguous signal of Afanasievo-related ancestry

- they weren't the speakers of Proto-Tocharian or even related in any specific way to the Tocharians

- they were probably the speakers of a now extinct Indo-Iranian language, and, at least based on geographic proximity, possibly related to the Yuezhi.

Feel free to make up your own mind. But for me, the question of how Tocharian languages ended up in the Tarim Basin remains wide open. I admit though, I'm currently quite partial to the idea floated here by commentator Copper Axe that the Chemurchek culture may have had something to do with it.

See also...

Don't believe everything you read in peer reviewed papers

Tuesday, July 21, 2020

The oldest R1a to date


My popular map of the oldest instances of Y-haplogroup R1a in the ancient DNA record has a new entry: PES001 from the recent Saag et al. preprint. PES001 comes from a burial site in what is now northwestern Russia and is dated to a whopping 10785–10626 calBCE.


Indeed, I'm not aware of any R1a samples older than PES001 among the treasure trove of thousands of ancient samples waiting to be published. So it's likely that this individual will remain the oldest member of our R1a clan for some years to come.

See also...

Y-haplogroup R1a and mental health

Like three peas in a pod

The mystery of the Sintashta people

Friday, December 20, 2019

A note on Steppe Maykop


I'm reading a new book titled Dispersals and Diversification: Linguistic and Archaeological Perspectives on the Early Stages of Indo-European (see here). One of the chapters is authored by archeologist David Anthony, in which he makes the following claims:

A previously unknown genetic population actually was identified in Wang et al. (2019), but it was a peculiar relict-seeming group related to Paleo Siberians and American Indians (Kennewick) that had survived isolated somewhere in the Caspian steppes or perhaps in the North Caucasus Mountains. The Maykop people did admix with this previously isolated Siberian/Kennewick population in graves labeled "Steppe Maykop" in Wang et al. (2019).

But this just makes it clearer that a cultural choice motivated the Maykop people to exclude marriages with Yamnaya and pre-Yamnaya people specifically, even while exchanges of material goods, ideas, technologies continued. Neither the Maykop nor the North Caucasus/Siberian/Kennewick population can be the source of most of the CHG [Caucasus hunter-gatherer] ancestry in Yamnaya. In order to narrow down when and where CHG ancestry entered the steppes, we must widen our geographic frame beyond the Caucasus.

Unfortunately, this is way off the mark. Especially unsound is his inference that the CHG-related ancestry in the Yamnaya population may have come from beyond the Caucasus.

In fact, the chances that the Steppe Maykop people were derived from a relict Siberian/Kennewick-related group that survived into the Maykop era in the Caspian steppes or the North Caucasus are exactly zero.

The real story was surely more complicated. In my opinion, it initially involved the migration during the Eneolithic or earlier of a people rich in CHG ancestry from the southernmost steppes into the Volga Delta and surrounds, and then the back-migration during the Early Bronze Age (EBA) of their descendants with around 50% admixture from Central Asian foragers. If so, these foragers were very similar to indigenous West Siberians and also relatively closely related to Native Americans.


I don't know why such an exotic people migrated into the North Caucasus steppes to form the bulk of the Steppe Maykop population, but I'm certain they did, and one interesting possibility is that they were recruited by Maykop chiefs to create a buffer zone against hostile Yamnaya-related groups trying to push into the Caucasus, possibly from the lower Don region.

Of course, the same ancient northward migration of the CHG-rich population that may have eventually given rise to the Steppe Maykop people might also explain the deep origins of the Yamnaya people.

The key sample in all of this is VJ1001 from the Wang et al. paper. This female comes from an Eneolithic (4332-4238 calBCE) kurgan burial in the North Caucasus steppes. But despite her early date, she's genetically very similar to most Yamnaya individuals. And she's also a perfect proxy for half of the ancestry of three out of the six Steppe Maykop individuals. Here's a mixture model that I put together using the Broad MIT/Harvard software qpAdm:

RUS_Steppe_Maykop (3/6)
RUS_Eneolithic_steppe_VJ1001 0.452±0.023
RUS_Tyumen_HG 0.548±0.023
chisq 7.494
tail prob 0.874914
Full output

Indeed, these Steppe Maykop samples don't harbor any Maykop ancestry. They're simply a two-way mixture between a population closely resembling VJ1001 and another one similar to hunter-gatherers from Tyumen, West Siberia.

Importantly, a couple of Steppe Maykop-related populations were inadvertently discovered by Narasimhan et al. northeast of the Caspian Sea in what is now Kazakhstan. One of these groups is labeled Kumsay_EBA, after the location of its cemetery. It's roughly contemporaneous with Steppe Maykop and basically identical to the aforementioned Steppe Maykop trio.

KAZ_Kumsay_EBA
RUS_Eneolithic_steppe_VJ1001 0.440±0.022
RUS_Tyumen_HG 0.560±0.022
chisq 10.573
tail prob 0.646513
Full output

I suppose it's possible that Kumsay_EBA represents the migration of Steppe Maykop people into the Kazakh steppes. But even if this is true, then there had to have been an earlier migration of a group from the Kazakh steppes or West Siberia that mixed with the VJ1001-related natives of the North Caucasus steppes to give rise to Steppe Maykop.

I'm assuming that the Yamnaya-like VJ1001 and her people were the indigenous population of the North Caucasus steppes because there are no indications that they or their ancestors migrated there within any reasonable time frame from anywhere else, and certainly not from as far afield as, say, what is now Iran.

The other three Steppe Maykop individuals, who are genetic outliers in varying degrees from the main Steppe Makyop cluster, show variable levels of Maykop ancestry, with an average of about 50%. But they too harbor significant VJ1001-related ancestry. So despite the fact that there was some irregular mixing between the Maykop and Steppe Maykop peoples, this is not what created the typical Steppe Maykop genetic profile.

RUS_Steppe_Maykop_o
RUS_Eneolithic_steppe_VJ1001 0.234±0.074
RUS_Maykop_Novosvobodnaya 0.461±0.046
RUS_Tyumen_HG 0.305±0.033

chisq 7.378
tail prob 0.831667
Full output

And, of course, it should be obvious by now that the ancestry of the vast majority of Yamnaya individuals is better modeled without any input whatsoever from the Maykop or Steppe Maykop samples.

In fact, early indications are that the Yamnaya people flooded into Steppe Maykop territory from the north and completely replaced its population (see here). Despite this, in Dispersals and Diversification archeologist Kristian Kristiansen makes the following claim: "steppe Maykop expanded north, leading to the formation of the Yamnaya Culture and Proto-Indo-European". Not a chance in hell Professor.

See also...

A final note for the year

The PIE homeland controversy: August 2019 status report

Some myths die hard

An exceptional burial indeed, but not that of an Indo-European

Wednesday, September 11, 2019

Y-haplogroup R1a and mental health


I've updated my map of pre-Corded Ware culture R1a samples with a couple of new entries from Central and South Asia (the original is still here). However, before any of you get overly excited, please note that these samples aren't older than the Corded Ware culture. The reason I added them to my map is to counter the ongoing absurd claims online that South Asian R1a isn't derived from European R1a.


Just in case the map can't be viewed in all of its glory in some devices, here's what the fine print says:

The oldest example of R1a in ancient DNA from Central Asia is dated to 2132-1940 calBCE (ID I3770, Narasimhan 2019). Moreover, this sequence is closely related to much older R1a samples from Central, Eastern and Northern Europe, and phylogenetically nested within their diversity. Thus, it must surely represent a population expansion from Europe to Central Asia. Indeed, it's also associated with the Bronze Age Andronovo archeological culture, which is usually seen as an offshoot of the Corded Ware culture (CWC) of Late Neolithic Europe. The vast majority of present-day R1a lineages in Central Asia are closely related to that of I3770, and so must also ultimately derive from Europe.

The oldest instance of R1a in ancient DNA from South Asia is dated to just 1044-922 calBCE (ID I12457, Narasimhan 2019). This sequence, as well as the vast majority of present-day South Asian R1a lineages, are closely related to much older R1a samples from Central, Eastern and Northern Europe, and phylogenetically nested within their diversity. Thus, they must surely represent a population expansion from Europe to South Asia via Central Asia, in all likelihood during the Bronze Age. Even if R1a existed in South Asia before the Bronze Age, which is extremely unlikely, because it's found in samples from indigenous European hunter-gatherers, the vast majority of present-day R1a lineages in South Asia must be ultimately from Europe.

The idea that most, if not all, South Asian R1a is derived from European R1a seriously scares a lot of people. This is obvious in many online discussions on the topic. I suspect they're so frightened by it because, in their minds, it has the potential to encourage discrimination and even racism, perhaps by re-defining the colonization of much of the world by European nations in the recent past as the natural order of things?

In any case, clearly we're dealing with some sort of mass phobia here. I've got advice for those of you suffering from this problem: if you're honestly worried that the geographic provenance and expansion history of some Y-haplogroup is going to negatively impact on your life in any meaningful way, then it's time to find yourself a quality mental health professional. All the best with that.

See also...

The mystery of the Sintashta people

The Poltavka outlier

Yamnaya isn't from Iran just like R1a isn't from India

Saturday, August 17, 2019

A surprising twist to the Shirenzigou nomads story


Remember those potentially Afanasievo-derived and Tocharian-related Shirenzigou nomads from the Ning et al. paper? Well, in my opinion, they're probably neither. The genotypes and other data for these Iron Age individuals from the eastern Tian Shan are available here.

Below are a few successful and not so successful qpAdm mixture models for them. Note that I tried to use a wide range of relevant "right pops", but also retain a lot of markers, specifically to be able to discriminate between different types of steppe and steppe-derived sources of gene flow (refer to the full output). Admittedly, the Shirenzigou nomads can be modeled with Afanasievo-related ancestry, but...

CHN_Shirenzigou_IA
KAZ_Botai 0.161±0.023
KAZ_Wusun 0.490±0.023
NPL_Mebrak_2125BP 0.349±0.019

chisq 5.793
tail prob 0.926172
Full output

CHN_Shirenzigou_IA
KAZ_Botai 0.143±0.022
NPL_Mebrak_2125BP 0.295±0.019
Saka_Tian_Shan 0.562±0.024

chisq 6.796
tail prob 0.870794
Full output

CHN_Shirenzigou_IA
KAZ_Botai 0.185±0.023
NPL_Mebrak_2125BP 0.428±0.021
RUS_Sintashta_MLBA 0.270±0.026
TJK_Sarazm_En 0.117±0.027

chisq 11.351
tail prob 0.414345
Full output

CHN_Shirenzigou_IA
KAZ_Botai 0.032±0.027
KAZ_Zevakinskiy_LBA 0.567±0.025
NPL_Mebrak_2125BP 0.401±0.019

chisq 15.157
tail prob 0.232961
Full output

CHN_Shirenzigou_IA
NPL_Mebrak_2125BP 0.452±0.031
RUS_Afanasievo 0.435±0.025
RUS_Okunevo_BA 0.114±0.049

chisq 19.808
tail prob 0.0708003
Full output

CHN_Shirenzigou_IA
NPL_Mebrak_2125BP 0.409±0.031
RUS_Okunevo_BA 0.173±0.050
Yamnaya_RUS_Caucasus 0.418±0.026

chisq 20.453
tail prob 0.0589872
Full output

CHN_Shirenzigou_IA
NPL_Mebrak_2125BP 0.464±0.033
RUS_Okunevo_BA 0.104±0.053
Yamnaya_RUS_Samara 0.432±0.027

chisq 27.189
tail prob 0.0072566
Full output

Both the Wusun and Saka are generally accepted to have been the speakers of Indo-Iranian languages. So it's possible that the Shirenzigou nomads were Indo-Iranian speakers too, or at least derived from such peoples.

Surprisingly, NPL_Mebrak_2125BP was the key to obtaining the best statistical fits. This is a trio of samples, roughly contemporaneous with the Shirenzigou nomads, from a burial site high up in the Himalayas in what is now Nepal (see here).

To be honest, I'm not quite sure why the Himalayan ancients work so well in my models. Perhaps they're just a really good proxy for an Iron Age population from the northern edge of the Tibetan Plateau?

By the way, most of the Shirenzigou nomads made it into the latest Global25 datasheets (see here). They can be analyzed in a variety of ways described in this blog post: Getting the most out of the Global25. Below is a screen cap of me comparing the effectiveness of Afanasievo, Sintashta and Wusun samples as proxies for the steppe ancestry in the Shirenzigou nomads with an online tool freely available here. As expected, the algorithm picks Sintashta ahead of Afanasievo, and the Wusun ahead of both.


See also...

They mixed up Huns with Tocharians

Some myths die hard

The mystery of the Sintashta people

Sunday, July 28, 2019

They mixed up Huns with Tocharians


I don't yet have the genomes from the recent Ning et al. paper on the Iron Age nomads from the Shirenzigou site in the eastern Tian Shan. But I do have most of the previously published data featured in the paper, including the Damgaard et al. 2018 Hun and Saka samples from the western Tian Shan.

After reading the Ning et al. paper between the lines and running a few analyses of my own, it's clear to me that most of the supposedly Tocharian-related Shirenzigou individuals actually share a very close relationship with the Tian Shan Huns, and indeed may have been their ancestors.

For instance, Ning et al. found that a large part of the ancestry of the Shirenzigou ancients could be modeled with the Tian Shan Huns, which was an anachronistic approach because the former are older than the latter. They also found that Ulchi-related ancestry was a key part of the genetic structure of eight out of the ten Shirenzigou individuals, and this likewise appears to be an important part of the genetic structure of the Tian Shan Huns.

Note the strong statistical fits in the Global25/nMonte and qpAdm mixture models below, respectively, which characterize these Huns as a two-way mixture between the Ulchi and the earlier Tian Shan Saka. And keep in mind that the Saka also harbor significant Ulchi-related ancestry.

Hun_Tian_Shan
Saka_Tian_Shan,92
Ulchi,8

distance%=1.2553

Hun_Tian_Shan
Saka_Tian_Shan 0.928±0.009
Ulchi 0.072±0.009

chisq 4.409
tail prob 0.992464
Full output

Moreover, the Shirenzigou males belong to Y-haplogroups Q1a and R1b (two instances of each), and they share the latter with one of the Tian Shan Huns. Judging by the data from the relevant BAM files, it's also possible that the Shirenzigou males share a very rare subclade of R1b with the Hun, defined by the PH155 mutation (see here). The Y-haplogroup assignments for the other Tian Shan Huns end at R and R1, but that's almost certainly due to missing data.

On the other hand, two Tian Shan Sakas belong to Y-haplogroup R1a but none to R1b, which fits with the pattern from currently available ancient DNA that R1a was more common than R1b in Saka-related groups, such as the Scythians and Sarmatians (see here).

This is all very interesting, because the Huns replaced the Saka in the western Tian Shan, and, considering their R1b and excess Ulchi-related ancestry, very likely moved into the region from the direction of Shirenzigou. Indeed, in my opinion a strong argument can now be made that the Iron Age population from the Shirenzigou region took part in the formation of the Hunnic confederacy.

So where does that leave the theory presented by Ning et al. that the Shirenzigou ancients may have been closely related, and perhaps even ancestral, to the Tocharians, simply because they packed a lot of Yamnaya-related and possibly proto-Tocharian Afanasievo ancestry, and were living close to the Tarim Basin, where Tocharian languages were subsequently first attested?

I'm not sure, but I now find it difficult to reconcile this theory with the fact that they were closely related, and probably ancestral, to the Tian Shan Huns. As far as I'm aware, Huns cannot be linked to Tocharians in any meaningful way.

Of course it's possible that different Afanasievo-derived groups were living in the Tarim Basin and surrounds, and, as some merged with new populations pushing into the region from the east and adopted non-Indo-European languages, others retained their Tocharian speech and eventually split into communities speaking Tocharian A, B and apparently also C (see here).

But this has to be demonstrated directly with ancient DNA from archeological sites where Tocharian languages were attested. Till then, I'll keep thinking that Ning et al. wrote a paper about Tocharians that really should've been a paper about Huns.

Here's a famous wall painting of Tocharian princes from the cave of the sixteen sword-bearers in the Tarim Basin, dated to 432–538 AD. They don't look like guys with a lot of Ulchi-related admixture to me, but I might be wrong. Feel free to let me know what you think in the comments below.


Update 08/17/2019: The Shirenzigou nomads are now in my dataset. Below are a few successful and not so successful qpAdm mixture models for them. Note that I tried to use a wide range of relevant "right pops", but also retain a lot of markers, specifically to be able to discriminate between different types of steppe and steppe-derived sources of gene flow (refer to the full output). Admittedly, the Shirenzigou nomads can be modeled with Afanasievo-related ancestry, but...

CHN_Shirenzigou_IA
KAZ_Botai 0.161±0.023
KAZ_Wusun 0.490±0.023
NPL_Mebrak_2125BP 0.349±0.019

chisq 5.793
tail prob 0.926172
Full output

CHN_Shirenzigou_IA
KAZ_Botai 0.143±0.022
NPL_Mebrak_2125BP 0.295±0.019
Saka_Tian_Shan 0.562±0.024

chisq 6.796
tail prob 0.870794
Full output

CHN_Shirenzigou_IA
KAZ_Botai 0.185±0.023
NPL_Mebrak_2125BP 0.428±0.021
RUS_Sintashta_MLBA 0.270±0.026
TJK_Sarazm_En 0.117±0.027

chisq 11.351
tail prob 0.414345
Full output

CHN_Shirenzigou_IA
KAZ_Botai 0.032±0.027
KAZ_Zevakinskiy_LBA 0.567±0.025
NPL_Mebrak_2125BP 0.401±0.019

chisq 15.157
tail prob 0.232961
Full output

CHN_Shirenzigou_IA
NPL_Mebrak_2125BP 0.452±0.031
RUS_Afanasievo 0.435±0.025
RUS_Okunevo_BA 0.114±0.049

chisq 19.808
tail prob 0.0708003
Full output

CHN_Shirenzigou_IA
NPL_Mebrak_2125BP 0.409±0.031
RUS_Okunevo_BA 0.173±0.050
Yamnaya_RUS_Caucasus 0.418±0.026

chisq 20.453
tail prob 0.0589872
Full output

CHN_Shirenzigou_IA
NPL_Mebrak_2125BP 0.464±0.033
RUS_Okunevo_BA 0.104±0.053
Yamnaya_RUS_Samara 0.432±0.027

chisq 27.189
tail prob 0.0072566
Full output

Both the Wusun and Saka are generally accepted to have been the speakers of Indo-Iranian languages. So it's possible that the Shirenzigou nomads were Indo-Iranian speakers too, or at least derived from such peoples.

Surprisingly, NPL_Mebrak_2125BP was the key to obtaining the best statistical fits. This is a trio of samples, roughly contemporaneous with the Shirenzigou nomads, from a burial site high up in the Himalayas in what is now Nepal (see here).

To be honest, I'm not quite sure why the Himalayan ancients work so well in my models. Perhaps they're just a really good proxy for an Iron Age population from the northern part of the Tibetan Plateau? By the way, most of the Shirenzigou nomads made it into the latest Global25 datasheets (see here).

See also...

Almost everything you ever wanted to know about the Xiaohe-Gumugou cemeteries

The mystery of the Sintashta people

Late PIE ground zero now obvious; location of PIE homeland still uncertain, but...

Friday, July 26, 2019

Afanasievo people may well have been proto-Tocharian speakers (Ning et al. 2019)


Update 17/08/2019: A surprising twist to the Shirenzigou nomads story

...

During the Early Bronze Age, around 2,900 BCE, a population associated with the Yamnaya archeological culture migrated from the Pontic-Caspian steppe in Eastern Europe deep into Asia, as far as the Minusinsk Basin in South Siberia.

This rapid, long-range expansion was likely to have been the first significant migration of a Yamnaya-related group far to the east of the Ural Mountains, and it resulted in the formation of the Afanasievo archeological culture (see here).

The appearance of Tocharian languages in the Tarim Basin, in what is now western China, is often associated with the Afanasievo culture, mainly because of the confirmed presence of European-related populations in the Tarim Basin during the Bronze Age, as well as the likely highly divergent position of the Tocharian node in the Indo-European language phylogeny.

But the Afanasievo people were separated by considerable distance in space and time from the Tocharians, and can't yet be reliably linked to them with archeological or genetic data. So even though the inference that the former are linguistically ancestral to the latter is quite plausible, it's far from certain.

However, thanks to a new paper at Current Biology by Ning et al., at least we now know that a population with significant Yamnaya/Afanasievo-related ancestry was living in the eastern Tian Shan Mountains just a few hundred years before Tocharian languages were attested nearby [LINK]. Below is the paper summary, emphasis is mine:

Recent studies of early Bronze Age human genomes revealed a massive population expansion by individuals-related to the Yamnaya culture, from the Pontic Caspian steppe into Western and Eastern Eurasia, likely accompanied by the spread of Indo-European languages [1, 2, 3, 4, 5]. The south eastern extent of this migration is currently not known. Modern-day human populations from the Xinjiang region in northwestern China show a complex population history, with genetic links to both Eastern and Western Eurasia [6, 7, 8, 9, 10]. However, due to the lack of ancient genomic data, it remains unclear which source populations contributed to the Xinjiang population and what was the timing and the number of admixture events. Here, we report the first genome-wide data of 10 ancient individuals from northeastern Xinjiang. They are dated to around 2,200 years ago and were found at the Iron Age Shirenzigou site. We find them to be already genetically admixed between Eastern and Western Eurasians. We also find that the majority of the East Eurasian ancestry in the Shirenzigou individuals is-related to northeastern Asian populations, while the West Eurasian ancestry is best presented by ∼20% to 80% Yamnaya-like ancestry. Our data thus suggest a Western Eurasian steppe origin for at least part of the ancient Xinjiang population. Our findings furthermore support a Yamnaya-related origin for the now extinct Tocharian languages in the Tarim Basin, in southern Xinjiang.


Ning et al., Ancient Genomes Reveal Yamnaya-Related Ancestry and a Potential Source of Indo-European Speakers in Iron Age Tianshan, Current Biology, July 25, 2019, DOI: https://doi.org/10.1016/j.cub.2019.06.044

See also...

It was always going to be this way

The mystery of the Sintashta people

Late PIE ground zero now obvious; location of PIE homeland still uncertain, but...

Monday, July 15, 2019

Asiatic East Germanics


Around a third of the ancient individuals in my dataset associated with East Germanic-speaking cultures show obvious ancestry from Central and/or West Asia.

This shouldn't be too surprising, considering, for instance, the well documented contacts between East Germanic tribes and the Avars, Huns, Sarmatians and other nomadic groups that streamed into Europe from the Asian steppes during the Migration Period. It's a topic that I've raised before at this blog (see here).

But the curious thing is that very little, if any, of this ancestry has percolated down to present-day Europeans.

The easiest way to show this is with a Principal Component Analysis (PCA) based on my Global25 data. The relevant PCA datasheet can be downloaded here. Basic details about the ancient samples in the analysis are available here.

Some of the Northeastern European populations, particularly the Uralic speakers, appear to be attracted to the Hunnic cluster. However, this is mostly an artifact of pre-Migration Period east to west population expansions in the far north of Europe, probably including those of the Proto-Uralians (see here).

So how is it that, despite ruling over vast areas of Europe for hundreds of years, the East Germanics appear not to have contributed significantly to the present-day European gene pool? My theory is that, much like the Avars and Huns, they were militarily and demographically overwhelmed by the ascending groups around them, such as the Slavs, and they simply went extinct.

To wrap things up, here's a basic qpAdm mixture model designed to test for Hunnic-related ancestry in a few Eastern and Northern European populations of interest. Note the significant slice of this type of ancestry in the likely early Goths of the Chernyakhiv culture. Is it real? Feel free to share your thoughts in the comments below.

UKR_Chernyakhiv
DEU_MA 0.863±0.038
Hun_Tian_Shan 0.137±0.038
chisq 12.525
tail prob 0.325466
Full output

Swedish
Baltic_EST_IA 0.126±0.078
DEU_MA 0.849±0.073
Hun_Tian_Shan 0.025±0.020
chisq 8.338
tail prob 0.595877
Full output

Ukrainian
Baltic_EST_IA 0.121±0.064
DEU_MA 0.857±0.060
Hun_Tian_Shan 0.022±0.017
chisq 11.458
tail prob 0.322956
Full output

Estonian
Baltic_EST_IA 0.597±0.069
DEU_MA 0.373±0.064
Hun_Tian_Shan 0.030±0.017
chisq 15.739
tail prob 0.107361
Full output

See also...

Conan the Barbarian probably belonged to Y-haplogroup R1a

More on the association between Uralic expansions and Y-haplogroup N

Uralic-specific genome-wide ancestry did make a signifcant impact in the East Baltic

Monday, July 1, 2019

Almost everything you ever wanted to know about the Xiaohe-Gumugou cemeteries


I'm reading an interesting and very comprehensive new archeological thesis about the Tarim Basin mummies. It's freely available via Uppsala University's DiVA portal here:

Shifting Memories: Burial Practices and Cultural Interaction in Bronze Age China: A study of the Xiaohe-Gumugou cemeteries in the Tarim Basin

The author, Yunyun Yang, has some suggestions for the future direction of research on the topic:

1. Analysis of Y chromosomal DNA on the males from 4th-1st layers of the Xiaohe cemetery: it is not clear if they were genetically distinct from the Afanasievo (and Yamnaya) males, and consistent to the Andronovo males.

2. More research on ancient DNA of the six males buried in type I the sun-radiating-spokes graves: the six males were so different in the Gumugou cemetery, and we don't know who they were. In this study, it has been suggested that they came from the parallel Andronovo horizon, and preserved some of their original social identities.

3. Analysis of the white sticky materials painted on the dead’s hair, faces, and bodies: it is not clear what this material is. It might be application of dairy/milk products with some holy functions. And the interesting point is why the dead was painted on such materials, for holy reasons, and/or was embalmed that way for preventing decay of the dead bodies?

4. Research on the use of Ephedra plants: Ephedra twigs were common and important in both cemeteries. Were they related to the “Soma” in ancient India (Vedas) and/or “Haoma” in ancient Iran (Avesta)? Were the Ephedra twigs related to the body painting (whitish sticky materials painting on skins of the dead)? Was there a common use of Ephedra plant in more nomadic groups in the Eurasian Steppe?

5. Research on the comparisons between the Andronovo burials and the stone circular-kerbs with stone-pits in Xinjiang: a major obstacle to such research is the language barriers, with the material published in English, Chinese and Russian. Such research is, however, essential to understand the conjunction of the geographical areas, the expansion of nomadic groups, the spreading of horses and wagons (linked to the noble groups of the Shang Dynasty (1600-1046 BCE) in central China), the formation of the Silk Road in this area (till the expansion of Han Dynasty (206 BCE-220 CE)), the moving of Indo-Iranians, the expansion of Scythians (900 BCE-400 CE), etc.

I agree, but I'd also add that we need a good number of ancient Y-chromosome and genome-wide samples from across space and time in the Tarim Basin, including and especially from attested Tocharian-speaking communities. That's really the only way to figure out whether the Tarim Basin mummies belonged to the speakers of Indo-Iranian or Tocharian languages, and whether the latter were introduced into the region by migrants from the Afanasievo culture.

Citation...

Yang, Yunyun, Shifting Memories: Burial Practices and Cultural Interaction in Bronze Age China: A study of the Xiaohe-Gumugou cemeteries in the Tarim Basin, URN: urn:nbn:se:uu:diva-386612

Update 26/7/2019: Afanasievo people may well have been proto-Tocharian speakers (Ning et al. 2019)

See also...

Another look at the ancient mtDNA from Xiaohe, Tarim Basin

On the doorstep of India

The mystery of the Sintashta people

Late PIE ground zero now obvious; location of PIE homeland still uncertain, but...

Thursday, April 25, 2019

Some myths die hard


Ancient DNA tells us that the Bronze Age wasn't kind to the indigenous populations of Central Asia. It seems to have wiped them out totally. Indeed, Central Asia might well be the only major world region in which native hunter-gatherers failed to make a perceptible impact on the genetics of any extant populations.

Before the Neolithic transition, much of Central Asia was home to hunter-gatherers closely related to those of nearby western Siberia. During the Neolithic, agriculturalists and pastoralists from the Near East gradually moved into the more arable parts of southern and eastern Central Asia, eventually giving rise to the Bactria Margiana Archaeological Complex, or BMAC, and other similar communities.

It's not clear what their relationship was like with the native hunter-gatherers in these areas. But they did mix with them in varying degrees. This is obvious because genome-wide genetic ancestry characteristic of the Botai people, who hunted and eventually domesticated horses on the Kazakh steppe during the 4th millennium BCE, and were probably the archetypal Central Asians for their time, is found at significant levels in a number of later samples from Central Asian farmer and pastoralist sites, such as Dali, Gonur Tepe and Sarazm.

Thus, even though the Neolithic transition did have a big impact on Central Asia, and clearly led to large scale population replacements in some parts of the region, this was just the beginning of these population shifts. Moreover, in some cases the expanding farmer and pastoralist populations seem to have acquired significant indigenous Central Asian ancestry and spread it with them.

The precise geographic extent of the relatively unique Botai-related ancestry in prehistoric Eurasia is still something of a mystery. But to give you a general picture of where it was found from around 6,000 BCE to 2,000 BCE, here's a map with info about samples with significant levels of this type of ancestry from a wide range of sites in space and time.


Going by this map, I'd say it's safe to infer that the Botai-related ancestry was a major feature of practically all forager populations living between the Caspian Sea and the Altai Mountains. It was also present in the Early Bronze Age (EBA) pastoralist population associated with the Steppe Maykop archeological culture of Eastern Europe, so it may have already been in Europe as early as 3,800 BCE, because that's when the Steppe Maykop culture first appeared.

It's an interesting question where the ancestors of the Steppe Maykop herders came from. I once simply assumed that they were closely related to the Maykop people who lived in the Caucasus Mountains. But it's now clear that the populations associated with these two similar cultures were starkly different, with the Maykop people being basically of Near Eastern origin and lacking any discernible Botai-like ancestry. My guess for now is that the Steppe Maykop herders were in large part the descendants of the Kelteminar culture population from just east of the Caspian Sea, but we'll see about that when more ancient DNA comes in.

The other great mystery is what eventually happened to the Steppe Maykop people. Around 3,000 BCE, their culture vanished from the archeological record and their particular genetic signature disappeared from the steppe ancient DNA record. Where did they go? Did they migrate back east?

I don't know, but at about that time other Eastern European steppe herders, those associated with the Yamnaya and Corded Ware archeological cultures, began to stir and migrate in big numbers in basically all directions, including into Steppe Maykop territory. Indeed, unlike the Steppe Maykop population, these groups weren't closely related to any contemporaneous or earlier Central Asians. But they ended up moving into Central Asia, and in a big way too.

Their impact all the way from the Ural Mountains to what are now China and India was profound. For instance, not only did they end up totally replacing the Botai people, but also their horses. For more details on this topic check out the Youtube clip here. I have a strong suspicion that the same sort of thing happened to the aforementioned Steppe Maykop people. In other words, they may have been forced out from the Eastern European steppe, and perhaps sought shelter in the Caucasus Mountains?

Admittedly, I'm not offering anything new here. I just wanted to emphasize a few key points, because I'm still seeing some confusion online about the population history of Central Asia, and especially how it relates to the population history of Europe, and also the Proto-Indo-European homeland question. Make no mistake, thanks to the ancient DNA already available from Central Asia, we can confidently infer the following:

- the chance that the ancient European populations associated with the Yamnaya, Corded Ware and other closely related archeological cultures formed as a result of migrations from Central Asia is zero

- the chance that the Proto-Indo-European homeland was located in Central Asia is zero

- the chance that present-day Europeans, by and large, derive from any ancient Central Asian populations is zero

See also...

Central Asia as the PIE urheimat? Forget it

The Steppe Maykop enigma

Late PIE ground zero now obvious; location of PIE homeland still uncertain, but...

Sunday, October 7, 2018

The resistance crumbles


Over the years some scientists from the Estonian Biocentre have been among the staunchest opponents of the idea that Bronze Age pastoralists originating in the steppes of Eastern Europe had a significant genetic and linguistic impact on South Asia (for instance, see here).

But this week they put out a review paper titled The genetic makings of South Asia [LINK] featuring the figure below. It's a nice visualization of the current state of understanding of the peopling of South Asia, and does acknowledge the major role that the said steppe pastoralists had in this process.


However, there's not a single mention of Y-haplogroup R1a in the review. This is surprising, considering the once common, but now no longer valid, claims that this paternal marker may have originated in India. I guess the grieving process will continue for a little longer for some.

My long-held opinion about the claims that R1a was native to India, Iran, Central Asia, or, indeed, anywhere but its actual homeland, which is certainly Eastern Europe, can be summarized as such: LOL!

See also...

Thursday, July 19, 2018

An early Iranian, obviously


Today, the part of Asia between the Caspian Sea and the Altai Mountains, known as Turan, is largely a Turkic-speaking region. But during the Iron Age it was dominated by Iranian speakers. Throughout this period it was the home of a goodly number of attested and inferred early Iranic peoples, such as the Airya, Dahae, Kangju, Massagetae, Saka and Sogdians.

Indeed, the early Iron Age Yaz II archaeological culture, located in southwestern Turan, is generally classified as an Iranian culture, and even posited to have been the Airyanem Vaejah, aka home of the Iranians, from ancient Avestan literature.

That's not to say that Iranian speakers weren't present in this part of the world much earlier. They probably were, and it's likely that we already have their genomes (see here). But the point I'm making is that Turan can't be reliably claimed to have been an Iranian realm until the Iron Age.

Ergo, any ancient DNA samples from Turan dating to the Iron Age, as opposed to, say, the Bronze Age, are very likely to be those of early Iranian speakers. One such sample is Turkmenistan_IA DA382 from Damgaard et al. 2018.

Below is a screen cap of the "time map" from homeland.ku.dk, with the slider moved to 847 BC, showing the location of the burial site where the remains of DA382 were excavated. The site is marked with the Z93 label because DA382 belongs to the Eastern European-derived Y-chromosome haplogroup R1a-Z93. Interestingly, his burial was located in close proximity to archaeological sites associated with the above mentioned and contemporaneous Yaz II culture.


DA382 didn't get much of a run in the Damgaard et al. paper, and little wonder because the authors also analyzed 73 other ancient samples. So let's take a close look at this individual's genetic structure to see whether there's anything particularly Iranian about it.

Damgaard et al. did mention that DA382 was partly of Middle to Late Bronze Age (MLBA) steppe origin. And indeed, my own mixture models using qpAdm confirm this finding with very consistent results and strong statistical fits. Here are a couple of two-way examples...

Turkmenistan_IA
Namazga_CA 0.528±0.040
Srubnaya_MLBA 0.472±0.040
taildiff: 0.561330411
Full output

Turkmenistan_IA
Dzharkutan1_BA 0.530±0.037
Srubnaya_MLBA 0.470±0.037
taildiff: 0.485083377
Full output

The fact that the MLBA Srubnaya samples from the Pontic-Caspian steppe can be used to model DA382's ancestry (alongside Bronze and Copper Age populations from Turan) with such ease shouldn't be surprising, considering the he belongs to R1a-Z93, which is the dominant Y-haplogroup in the Srubnaya and all other closely related MLBA steppe peoples.

Now, Srubnaya is generally regarded to be the proto-Iranian archaeological culture. How awesome is that considering those qpAdm fits? But, admittedly, this is just an inference, even if a robust one, based on genetic, archaeological and historical linguistics data. So apart from the fact that DA382 comes from Iron Age Turan, an Iranian-speaking realm, is there any other way to link him directly to Iranians?

Well, he's very similar in terms of overall genetic structure to some of the least Turkic-admixed Iranian speakers still living in Turan, and might well be ancestral to them.

For instance, below is a Principal Component Analysis (PCA) featuring a wide range of ancient and present-day West Eurasian samples. Note that, in line with the qpAdm models, DA382 clusters about half-way between the populations of the MLBA steppe and pre-Kurgan expansion Turan, and amongst present-day Yaghnobi and Pamiri Tajiks. In fact, he clusters at the apex of a southeast > northwest cline made up of Tajiks that appears to be pulling towards Europeans.


Needless to say, Tajiks, especially Pamiri Tajiks, also pack a lot of Srubnaya-related ancestry. I've talked about this plenty of times at this blog (for instance, see here). But what happens if I try to model Pamiri and Yaghnobi Tajiks with DA382?

Tajik
Turkmenistan_IA 0.892±0.023
Han 0.108±0.023
taildiff: 0.794566182
Full output

Wow, it's an awesome fit! My mind's made up: DA382 was probably an Iranian speaker and, more specifically, an Eastern Iranian speaker. Who disagrees and why? Feel free to let me know in the comments (unless you're banned, in which case, f*ck off).

See also...

A Mycenaean and an Iron Age Iranian walk into a bar...

Late PIE ground zero now obvious; location of PIE homeland still uncertain, but...

New PCA featuring Botai horse tamers, Hun and Saka warriors, and many more...

Tuesday, September 26, 2017

The beast among Y-haplogroups


A lot has been written about Y-haplogroup R1a over the years. Sadly, most of it was wrong, such as its posited Pleistocene origin in the Indian subcontinent and subsequent migration to Europe.

In all likelihood, R1a was born somewhere in North Eurasia. More importantly, its R1a-M417 subclade, which encompasses almost 100% of modern-day R1a lineages, no doubt came into existence somewhere on the Pontic-Caspian (or Western) steppe in what is now Ukraine and southern Russia just 7,000-6,000 years ago.

And within a couple of thousand years it expanded in almost all directions, probably on the back of the early Indo-European dispersals (see here), to cover a massive range from Scandinavia to South Asia. It is the beast among Y-haplogroups.


The most common subclade of R1a-M417 in South Asia today is R1a-Z93, and, realistically, it couldn't have arrived there earlier than about 2,000BC. So much for the Pleistocene.

See also...

Y-haplogroup R1a and mental health

The Poltavka outlier

Yamnaya isn't from Iran just like R1a isn't from India