search this blog

Showing posts with label Dear Iosif. Show all posts
Showing posts with label Dear Iosif. Show all posts

Wednesday, December 4, 2024

The PIE homeland controversy: December 2024 open thread


It seems like we're getting close to the moment when Iosif Lazaridis has to finally admit that the Proto-Indo-European (PIE) homeland was located in Eastern Europe, and also that the ancestors of the Hittites and other Anatolian speakers entered Anatolia via the Balkans.

Let's discuss.


However, please note that comments from total morons, trolls and/or mentally unstable people will not be approved.

See also...

Indo-European crackpottery

Monday, February 12, 2024

The Nalchik surprise


If, like Iosif Lazaridis, you subscribe to the idea that the Yamnaya people carry early Anatolian farmer-related admixture that spread into Eastern Europe via the Caucasus, then I've got great news for you.

We now have a human sample from the Eneolithic site of Nalchik in the North Caucasus, labeled NL122, that packs well over a quarter of this type of ancestry (see here). Below is a quick G25/Vahaduo model to illustrate the point (please note that Turkey_N = early Anatolian farmers).

Target: Nalchik_Eneolithic:NL122
Distance: 2.1934% / 0.02193447
60.8 Russia_Steppe_Eneolithic
26.2 Turkey_N
13.0 Georgia_Kotias

On the other hand, if, again like Iosif Lazaridis, you subscribe to the idea that the Indo-European language spread into Eastern Europe via the Caucasus in association with this early Anatolian farmer-related admixture, then I've got terrible news for you.

That's because NL122 is apparently dated to a whopping 5197-4850 BCE (see here). This dating might be somewhat bloated, possibly due to what's known as the reservoir effect, because the Nalchik archeological site is generally carbon dated to 4840–4820 BCE.

However, even with the younger dating, this would still mean that early Anatolian farmer-related ancestry arrived in the North Caucasus, and thus in Eastern Europe, around 4,800 BCE at the latest. That's surprisingly early, and just too early to be relevant to any sort of Indo-European expansion from a necessarily even earlier Proto-Indo-Anatolian homeland somewhere south of the Caucasus.

This means that NL122 effectively debunks Iosif Lazaridis' Indo-Anatolian hypothesis. Unless, that is, Iosif can provide evidence for a more convoluted scenario, in which there are at least two early Anatolian farmer-related expansions into Eastern Europe via the Caucasus, and the expansion relevant to the arrival of Indo-European speech came well after 5,000 BCE.

I haven't done any detailed analyses of NL122 with formal stats and qpAdm. But my G25/Vahaduo runs suggest that it might be possible to model the ancestry of the Yamnaya people with around 10% admixture from a population similar to NL122.

Target: Russia_Samara_EBA_Yamnaya
Distance: 3.4123% / 0.03412328
72.6 Russia_Progress_Eneolithic
18.2 Ukraine_N
9.2 Nalchik_Eneolithic

However, I don't subscribe to the idea that the Yamnaya people carry early Anatolian farmer-related admixture that spread into Eastern Europe via the Caucasus (on top of what is already found in Progress Eneolithic). Based on basic logic and a wide range of my own analyses, I believe that they acquired this type of ancestry from early European farmers, probably associated with the Trypillia culture. For instance...

Target: Russia_Samara_EBA_Yamnaya
Distance: 3.2481% / 0.03248061
80.2 Russia_Progress_Eneolithic
13.6 Ukraine_Neolithic
6.2 Ukraine_VertebaCave_MLTrypillia
0.0 Nalchik_Eneolithic

Another way to show this is with a Principal Component Analysis (PCA) that highlights a Yamnaya cline made up of the Yamnaya, Steppe Eneolithic and Ukraine Neolithic samples. As you can see, dear reader, there's no special relationship between the Yamnaya cline and Nalchik_Eneolithic. The Yamnaya samples, which are sitting near the eastern end of the Yamnaya cline, instead seem to show a subtle shift towards the Trypillian farmers.

Indeed, I also don't exactly understand the recent infatuation among many academics, especially Iosif Lazaridis and his colleagues, with trying to put the Proto-Indo-Anatolian homeland somewhere south of the Caucasus. Considering all of the available multidisciplinary data, I'd say it still makes perfect sense to put it in the Sredny Stog culture of the North Pontic steppe, in what is now Ukraine.

Please note that all of the G25 coordinates used in my models and the PCA are available HERE.

See also...

The Caucasus is a semipermeable barrier to gene flow

Saturday, August 12, 2023

Indo-European crackpottery


I've now had the chance to read and digest the following two papers in Science about the origin of Indo-European languages:

Language trees with sampled ancestors support a hybrid model for the origin of Indo-European languages, Heggarty et al.

The genetic history of the Southern Arc: A bridge between West Asia and Europe, Lazaridis, Alpaslan-Roodenberg et al.

The Heggarty et al. paper is pure fluff. It offers nothing useful or even remotely interesting.

For instance, the authors derive some Indo-European languages in Europe from Anatolian farmers and others from Caucasus hunter-gatherers (see here). This is not just exceedingly far fetched, but also obviously forced.

Wolfgang Haak and Johannes Krause, you should be deeply ashamed of yourselves.

I've already commented extensively about the Lazaridis, Alpaslan-Roodenberg et al. paper (for example, see here). But the one thing I need to add is that this paper is what it is due to the inherent bias of some of the lead authors to push the Indo-Anatolian homeland into West Asia. I won't even bother mentioning their names, because we all know who they are.

See also...

Crazy stuff

Dear David, Nick, Iosif...let's set the record straight

The story of R-V1636

Monday, February 13, 2023

Dear David, Nick, Iosif...let me tell you about Yamnaya


Lazaridis, Alpaslan-Roodenberg et al. recently claimed that the Yamnaya people of the Pontic-Caspian (PC) steppe carried "substantial" ancestry from what is now Armenia or surrounds.

However, this claim is essentially false.

Only one individual associated with the Yamnaya culture shows an unambiguous signal of such ancestry. This is a female usually labeled Ukraine_Yamnaya_Ozera_o:I1917. The "o" suffix indicates that she is an outlier from the main Yamnaya genetic cluster.

Unlike I1917, typical Yamnaya individuals carry a few per cent of ancient European farmer admixture. This ancestry is only very distantly Armenian-related via Neolithic Anatolia (see here).

It's difficult for me to understand how Lazaridis, Alpaslan-Roodenberg et al. missed this. I suspect that they relied too heavily on formal statistics and overinterpreted their results.

Formal statistics are a very useful tool in ancient DNA work. Unfortunately, they're also a relatively blunt tool that often has problems distinguishing between similar sources of gene flow.

There are arguably better methods for studying fine scale ancestry, such as Principal Component Analysis (PCA).

Below is a somewhat special PCA featuring a wide range of ancient populations that plausibly might be relevant to the genetic origins of the Yamnaya people. Unlike most PCA with ancient samples, this PCA doesn't rely on any sort of projection, so that all of the actors are interacting with each other and directly affecting the outcome.


Here's another version of the same plot with a less complicated labeling system. Note that I designed this PCA specifically to differentiate between European populations and those from the Armenian highlands, the Iranian plateau and surrounds.


And here's a close up of the part of the plot that shows the Yamnaya cluster. This cluster is made up of samples associated with the Afanasievo, Catacomb, Poltavka and Yamnaya cultures. All of the individuals in this part of the plot are closely related, which is why they're so tightly packed together. The differentiation between them is caused by admixture from different groups mostly from outside of the PC steppe.


The Yamnaya cluster can be broadly characterized as a population that formed along the genetic continuum between the Eneolithic groups of the Progress region and Neolithic foragers from the Dnieper River valley (Progress_Eneolithic and Ukraine_N, respectively). However, this cluster also shows a slight western shift that is increasingly more pronounced in the Corded Ware samples. This shift is due to the aforementioned admixture from early European farmers.

Indeed, the plot reveals two parallel clines extending west from the Progress samples. One of the clines is made up of the Yamnaya cluster and the Corded Ware samples, and pulls towards the ancient European farmers. The other cline includes Ukraine_Yamnaya_Ozera_o:I1917 and pulls towards samples from the Armenian highlands and surrounds.

Being aware of these two clines and knowing how they came about is important to understanding the genetic prehistory of the PC steppe and indeed of much of Eurasia.

At some point, probably during the late Eneolithic, a Progress-related group experienced gene flow from the west and became the Yamnaya and Corded Ware populations. Sporadically, admixture from the Armenian highlands and the Iranian plateau also entered the PC steppe, giving rise to people like the Steppe Maykop outliers and Ukraine_Yamnaya_Ozera_o:I1917.


Unfortunately, this sort of PCA doesn't offer output suitable for mixture modeling, basically because the recent genetic drift shared by many of the samples creates significant noise.

However, to check that my inferences based on the plot are correct I can create composites with specific ancestry proportions to see how they behave. In the plot below Mix1 is 80% Progress_Eneolithic and 20% Iran_Hajji_Firuz_N, Mix2 is 80% Progress_Eneolithic and 20% Armenia_EBA_Kura_Araxes, while Mix3 is 80% Progress_Eneolithic, 15% Ukraine_N and 5% Hungary_MN_Vinca (Middle Neolithic farmers from the Carpathian Basin).


Obviously, we can't get Yamnaya by mixing Progress_Eneolithic with any ancients from the Armenian highlands or the Iranian plateau. On the other hand, Mix3 works quite well, at least in the first two dimensions. In some of the other dimensions genetic drift specific to Ukraine_N pulls it away from the Yamnaya cluster, but this is to be expected.

By the way, the plots were created with the excellent Vahaduo Custom PCA tool freely available here. It's well worth trying the interactive 3D option using my PCA data. The relevant datasheet is available here.

See also...

Dear David, Nick, Iosif...let's set the record straight

The Caucasus is a semipermeable barrier to gene flow

Friday, January 13, 2023

Dear David, Nick, Iosif...let's set the record straight


Almost a decade ago scientists at the David Reich Lab extracted DNA from the remains of three men from the Khvalynsk II cemetery at the northern end of the Pontic-Caspian (PC) steppe.

These Eneolithic Eastern Europeans showed significant genetic heterogeneity, with highly variable levels of Eastern Hunter-Gatherer (EHG) and Near Eastern-related ancestry components.

As a result, the people at the David Reich Lab concluded that the Eneolithic populations of the PC steppe formed from a relatively recent admixture between local hunter-gatherers and Near Eastern migrants.

Unfortunately, this view has since become the consensus among scientists working with ancient DNA.

I say unfortunately because there's a more straightforward and indeed obvious explanation for the genetic heterogeneity among the samples from Khvalynsk II. It's also the only correct explanation, and it doesn't involve any recent gene flow from the Near East.

Here it is, in point form, as simply as I can put it:

- EHG is best represented by samples from Karelia and Lebyazhinka, which are modern-day Russian localities in the forest zone and on the border between the steppe and the forest-steppe, respectively

- Khvalynsk II is also located on the boundary between the steppe and the forest-steppe, and very far from the Near East

- so the genetic structure of the people buried at Khvalynsk II does represent an admixture event

- however, this admixture event simply involved an EHG population from the forest-steppe and a very distantly Near Eastern-related group native to the steppe (that is, two different Eastern European populations).

I've written this blog post because I think David Reich, Nick Patterson, Iosif Lazaridis and colleagues should finally admit that they didn't quite get this right. And it'd be nice if they could put out a paper sometime soon in which they set the record straight.

See also...


Tuesday, November 1, 2022

The story of R-V1636


Who wants to bet against this map? Keep in mind that ART038 (~3000 calBCE) remains the oldest sample with the V1636 and R1b Y-chromosome mutations in the West Asian ancient DNA record. Ergo, there's nothing to suggest that V1636 or R1b entered Eastern Europe from West Asia.

See also...

A tantalizing link

How relevant is Arslantepe to the PIE homeland debate?

Thursday, October 13, 2022

The Kura-Araxes people deserve better


When discussing the Kura-Araxes culture and its people it's important to understand these key points:

- there is Eastern European steppe ancestry in Kura-Araxes samples, and if you're not seeing it then you're not looking hard enough

- Armenian Kura-Araxes samples are mainly a mixture between three different groups currently best represented in the ancient DNA record by ARM_Areni_C, IRN_Hajji_Firuz_C and RUS_Darkveti-Meshoko_En

- ergo, most of the steppe ancestry in the Kura-Araxes population of what is now Armenia must have been mediated via local Chalcolithic groups like ARM_Areni_C

- Kura-Araxes samples show Mesopotamian-related ancestry, and this mustn't be ignored.

Oh, you don't believe it because you just read a big paper in Science claiming otherwise?

Well, the authors of that paper, Lazaridis, Alpaslan-Roodenberg et al., used distal mixture models to study the ancestry of their Kura-Araxes samples, and such models can miss important details.

Consider these three proximate mixture models for a relatively high quality and very homogenous Kura-Araxes sample set from the aforementioned paper. They were done with the qpAdm software

ARM_Kura-Araxes_Berkaber
ARM_Areni_C 0.239±0.068
IRN_Hajji_Firuz_C 0.379±0.068
RUS_Darkveti-Meshoko_En 0.382±0.054
P-value 0.285122 (Pass)
Full output

ARM_Kura-Araxes_Berkaber
IRN_Hajji_Firuz_C 0.569±0.051
RUS_Darkveti-Meshoko_En 0.363±0.058
RUS_Progress_En 0.068±0.020
P-value 0.20306 (Pass)
Full output

ARM_Kura-Araxes_Berkaber
IRN_Hajji_Firuz_C 0.531±0.060
RUS_Darkveti-Meshoko_En 0.469±0.060
P-value 0.0132579 (Fail)
Full output

Some caveats apply. For instance, the pass threshold (P-value ≥0.05) is arbitrary. But the point is that the models look much better with steppe-related and steppe reference populations (ARM_Areni_C and RUS_Progress_En, respectively).

Moreover, the unique and vital Darkveti-Meshoko population is represented by just one individual. I also have the genotypes of his brother and sister, but relatives aren't allowed in these sorts of tests.

Including a singleton in the analysis means that I can't use the inbreed: YES option, which apparently can be a bad thing. Nevertheless, these models do look very solid.

Indeed, I can also model ARM_Kura-Araxes_Berkaber as practically 100% RUS_Maykop_Novosvobodnaya, perhaps with some excess ARM_Areni_C-related input.

ARM_Kura-Araxes_Berkaber
ARM_Areni_C 0.094±0.087
RUS_Maykop_Novosvobodnaya 0.906±0.087
P-value 0.284259 (Pass)
Full ouput

This makes good sense, because RUS_Maykop_Novosvobodnaya can also be modeled solidly as a mixture between IRN_Hajji_Firuz_C, RUS_Darkveti-Meshoko_En and RUS_Progress_En.

RUS_Maykop_Novosvobodnaya
IRN_Hajji_Firuz_C 0.614±0.056
RUS_Darkveti-Meshoko_En 0.307±0.064
RUS_Progress_En 0.080±0.022
P-value 0.141468 (Pass)
Full output

I don't know whether the genetic relationship between ARM_Kura-Araxes_Berkaber and RUS_Maykop_Novosvobodnaya shown in my model is due to Maykop ancestry in the former. It might just be a coincidence in the sense that the same or similar processes led to the formation of both groups. Feel free to let me know your thoughts about that in the comments.

The fact that the Kura-Araxes people harbored steppe ancestry might be very important in the debate over the location of the so called Indo-Anatolian homeland. For instance, it's possible that the proto-Anatolian language spread from the North Caucasus into Anatolia via the Kura-Araxes culture.

But, admittedly, such a solution doesn't have strong support from historical linguistics data, which suggest that the Indo-Anatolian homeland was located in what is now Ukraine and that Anatolian speakers entered West Asia via the Balkans:

Indo-European cereal terminology suggests a Northwest Pontic homeland for the core Indo-European languages

See also...

R-V1636: Eneolithic steppe > Kura-Araxes?

Dear Iosif...Yamnaya

But Iosif, what about the Phrygians?

Monday, September 19, 2022

Dear Iosif...Yamnaya


Even though the Yamnaya culture probably originated in what is now Ukraine, the earliest Yamnaya samples currently available are from the modern-day Samara region of Russia. They mostly date to around 3,000 BCE. I can analyze their ancestry using Principal Component Analysis (PCA) data.

Target: RUS_Yamnaya_Samara
Distance: 3.2816% / 0.03281581
81.0 RUS_Progress_En
14.4 UKR_N
4.6 HUN_Vinca_MN
0.0 ARM_Aknashen_N
0.0 ARM_Masis_Blur_N
0.0 AZE_Caucasus_lowlands_LN
0.0 BGR_C
0.0 BGR_Dzhulyunitsa_N
0.0 IRN_Ganj_Dareh_N
0.0 IRN_Hajji_Firuz_C
0.0 IRN_Seh_Gabi_C
0.0 IRN_Tepe_Abdul_Hosein_N
0.0 IRN_Wezmeh_N
0.0 RUS_Darkveti-Meshoko_En
0.0 RUS_Maykop
0.0 RUS_Maykop_Late
0.0 RUS_Maykop_Novosvobodnaya

The above results show exactly zero ancestry from West Asia. Admittedly, both RUS_Progress_En and HUN_Vinca_MN are European ancients with significant West Asian-related ancestry. However, this ancestry is very distantly West Asian-related, and, for instance, it almost certainly has no relevance to the Indo-Anatolian homeland debate.

The Afanasievo culture of Central Asia is regarded to have been an early offshoot of the Yamnaya culture. A good number of Afanasievo samples are available, so let's have a look if their results match those of the Yamnaya folks. And indeed they do, since BGR_C is very similar to HUN_Vinca_MN.

Target: RUS_Afanasievo
Distance: 3.4055% / 0.03405499
84.0 RUS_Progress_En
11.4 UKR_N
4.6 BGR_C
0.0 ARM_Aknashen_N
0.0 ARM_Masis_Blur_N
0.0 AZE_Caucasus_lowlands_LN
0.0 BGR_Dzhulyunitsa_N
0.0 HUN_Vinca_MN
0.0 IRN_Ganj_Dareh_N
0.0 IRN_Hajji_Firuz_C
0.0 IRN_Seh_Gabi_C
0.0 IRN_Tepe_Abdul_Hosein_N
0.0 IRN_Wezmeh_N
0.0 RUS_Darkveti-Meshoko_En
0.0 RUS_Maykop
0.0 RUS_Maykop_Late
0.0 RUS_Maykop_Novosvobodnaya

To try this at home, stick the PCA data in the text file here into the relevant fields here and cranck up the "Cycles" to 4X. You should see exactly zero ancestry from West Asia every time.

I can, more or less, reproduce these results with tools that are routinely used in peer reviewed papers. Below is a table of mixture models produced with the qpAdm software. I set the pass threshold to P ≥0.05, which is an arbitrary value, but the pattern is clear. The full output from each qpAdm run is available here.


Importantly, qpAdm needs to be fed the relevant "right pop" outgroups to be able to discriminate accurately between reference populations.

right pops:
CMR_Shum_Laka_8000BP
MAR_Taforalt
Levant_Natufian
IRN_Ganj_Dareh_N
Levant_PPNB
TUR_Marmara_Barcin_N
HUN_Starcevo_N
HUN_Koros_N
SRB_Iron_Gates_HG
Iberia_Southeast_Meso
RUS_Karelia_HG
RUS_West_Siberia_HG
RUS_Boisman_MN
MNG_North_N
TWN_Hanben
BRA_LapaDoSanto_9600BP

So, for instance, if one were to use in this role the modern-day Mbuti people, as opposed to, say, the ancient hunter-gatherers of Shum Laka, one might find that many models look statistically better than they should. And then one might also find that the Yamnaya samples carry significant West Asian ancestry.

Actually, I'm not opposed to the idea of some West Asian ancestry in Yamnaya. Indeed, considering the extraordinary mobility of the Yamnaya people and their Eneolithic predecessors on the Pontic-Caspian steppe, it would be unusual if they didn't come into close contact and mix, to some degree, with their neighbors from West Asia.

However, based on everything I've seen, from uniparental markers to different types of autosomal genetic tests, it's clear to me that there's no substantial West Asian ancestry in any Yamnaya samples, except for an outlier female from modern-day Ozera, Ukraine (see here).

Admittedly, ancient DNA does have a habit of throwing curveballs, so I'm eagerly awaiting new Eneolithic samples from the Pontic-Caspian steppe, particularly those associated with the Yamnaya-like Sredni Stog culture, to help finally settle this issue.

Believe it or not, a contact recently sent me a supposedly unpublished female sample from a ~4,200 BCE Sredni Stog burial in modern-day Igren, east central Ukraine. So what the hell, let's assume for the time being that this sample is genuine. This is how Miss Sredni Stog behaves in my PCA mixture test.

Target: UKR_Sredni_Stog
Distance: 4.0769% / 0.04076877
75.6 RUS_Progress_En
17.8 UKR_N
6.6 HUN_Vinca_MN
0.0 ARM_Aknashen_N
0.0 ARM_Masis_Blur_N
0.0 AZE_Caucasus_lowlands_LN
0.0 BGR_C
0.0 BGR_Dzhulyunitsa_N
0.0 HUN_Vinca_MN
0.0 IRN_Ganj_Dareh_N
0.0 IRN_Hajji_Firuz_C
0.0 IRN_Seh_Gabi_C
0.0 IRN_Tepe_Abdul_Hosein_N
0.0 IRN_Wezmeh_N
0.0 RUS_Darkveti-Meshoko_En
0.0 RUS_Maykop
0.0 RUS_Maykop_Late
0.0 RUS_Maykop_Novosvobodnaya

Wow, just wow. Have we actually found Miss Proto-Yamnaya? What does qpAdm have to say in the matter?

UKR_Sredni_Stog
HUN_Vinca_MN 0.034±0.028
RUS_Progress_En 0.796±0.045
UKR_N 0.170±0.034
P-value 0.41088

Again, this is an excellent match with the results from my PCA test, especially if we take into account the standard errors. However, with qpAdm it's also possible to model this individual's ancestry as part West Asian.

UKR_Sredni_Stog
AZE_Caucasus_lowlands_LN 0.056±0.039
RUS_Progress_En 0.761±0.061
UKR_N 0.183±0.036
P-value 0.465667

As I pointed out above, it's plausible for such people to harbor some West Asian ancestry, but I'm very sceptical that this is really the case here, despite the rather solid qpAdm statistical fit. That's because UKR_Sredni_Stog is not a high quality sample, and, from my experience, qpAdm often has problems analyzing fine scale ancestry in singletons or even small groups that show excess DNA damage and/or offer much less than a million markers.

See also...

Dear Iosif, about that ~2%

But Iosif, what about the Phrygians?

Friday, September 9, 2022

Dear Iosif, about that ~2%


The debate over the location of the so called Indo-Anatolian homeland won't be decided by the persistence of any type of genetic ancestry in ancient Anatolia.

It'll be decided by a multidisciplinary study on the interactions between the ancient peoples of the North Pontic steppe, the eastern Balkans, and western Anatolia.

If such a study finds a pulse of steppe-related gene flow from the Balkans into Anatolia sometime during the early metal ages, it'll corroborate the linguistic hypothesis that a language ancestral to Hittite, Luwian and related tongues moved into Anatolia from Eastern Europe.

Why do we only need a pulse of gene flow, you might ask? Obviously, because:

- language and genetic ancestry can start with a strong association but, since they're not linked, they can eventually follow very different trajectories

- the dilution of genetic ancestry is an important factor, especially in ancient West Asia, and it must be taken into account in models of language spread, rather than ignored in favor of simple, elegant models that do not reflect reality.

Here's my favorite quote from the recent Lazaridis, Alpaslan-Roodenberg et al. paper, because, probably unbeknownst to the authors, it's exceptionally revealing about the spread of a wide range of Indo-European speakers into Anatolia.

However, in individuals from Gordion, a Central Anatolian city that was under the control of Hittites before becoming the Phrygian capital and then coming under the control of Persian and Hellenistic rulers, the proportion of Eastern hunter-gatherer ancestry is only ~2%, a tiny fraction for a region controlled by at least four different Indo-European–speaking groups.

Indeed, this is exactly what the Lazaridis, Alpaslan-Roodenberg et al. paper should've been about. That is, the authors should've given us a painstaking account of the spread of different ancient Indo-European speaking groups into Anatolia and explained how, overall, their DNA was rapidly diluted to a trace amount.

However, instead they treated us to a make-believe tale about a so called Indo-Anatolian homeland in what is now Armenia.

See also...

Dear Iosif...Yamnaya

But Iosif, what about the Phrygians?

Dear Iosif...

Dear Iosif #2

Dear Iosif #3

Sunday, September 4, 2022

But Iosif, what about the Phrygians?


A paper in Science co-authored by around 200 scientists from some of the world's top academic institutions surely must mean something, right? Not necessarily.

In this short blog post I'll try to explain, as simply as I can, why the Lazaridis, Alpaslan-Roodenberg et al. paper doesn't get us any closer to solving the riddle of the so called Indo-Anatolian homeland.

However, it must be said that the paper does include many interesting and valuable samples. I'll be using six of these samples, labeled TUR_C_Gordion_Anc, to argue my case.

The TUR_C_Gordion_Anc sample set is from Gordion, the capital of ancient Phrygia, and thus, in all likeliness, it represents Phrygian speakers.

Phrygian is an Indo-European language and the leading hypothesis is that it originated in the Balkans.

In terms of fine scale ancestry, TUR_C_Gordion_Anc can be reliably divided into two genetic clusters. In the Principal Component Analysis (PCA) below these clusters are labeled TUR_C_Gordion_Anc1 and TUR_C_Gordion_Anc2.

Note that TUR_C_Gordion_Anc1 is obviously pulling away from TUR_C_Gordion_Anc2 towards samples from the Balkans. I've used ancient samples from what is now North Macedonia, labeled MKD_Anc, to represent the Balkans. To see an interactive version of the plot, paste the PCA coordinates from here into the relevant field here.

Visually, this is not an especially dramatic outcome, but it's an incredible result nonetheless, because it shows that even a few ancient samples can help to solve an age old mystery.

Across many dimensions of genetic variation, the shift in the PCA from TUR_C_Gordion_Anc1 to TUR_C_Gordion_Anc2 represents about 20% admixture from the Balkans, and about 8% from the Eastern European steppe. That's plenty enough to corroborate the linguistic hypothesis that the Phrygians originated in the Balkans, and that some of their ancestors came from the steppe. The mixture models below were done with the tools here.

Target: TUR_C_Gordion_Anc1
Distance: 1.6634% / 0.01663373
40.6 Kura-Araxes_ARM_Kaps
22.2 Anatolia_Barcin_N
21.8 MKD_Anc
13.6 Levant_PPNB
1.4 IRN_Ganj_Dareh_N
0.4 Han

Target: TUR_C_Gordion_Anc1
Distance: 1.7109% / 0.01710904
40.2 Kura-Araxes_ARM_Kaps
37.8 Anatolia_Barcin_N
12.4 Levant_PPNB
8.0 Yamnaya_RUS_Samara
1.2 IRN_Ganj_Dareh_N
0.4 Han

Target: TUR_C_Gordion_Anc2
Distance: 2.0293% / 0.02029339
51.0 Kura-Araxes_ARM_Kaps
26.8 Anatolia_Tepecik_Ciftlik_N
17.6 Anatolia_Barcin_N
4.6 Levant_PPNB

Surprisingly, Lazaridis, Alpaslan-Roodenberg et al. didn't have much to say about this topic. This quote basically sums it up:

However, in individuals from Gordion, a Central Anatolian city that was under the control of Hittites before becoming the Phrygian capital and then coming under the control of Persian and Hellenistic rulers, the proportion of Eastern hunter-gatherer ancestry is only ~2%, a tiny fraction for a region controlled by at least four different Indo-European–speaking groups.

I have no doubt that Lazaridis, Alpaslan-Roodenberg et al. can run a very decent PCA, and then blow it up to a size big enough to show that the Gordion samples represent two genetically somewhat distinct groups. I'm also sure that, if they really try, they can locate significant levels of proximate and relevant European ancestry in some of these samples.

They don't have to use my methods; they can use any methods they like. My point is that they won't find much if they're just looking for genetic signals from the Upper Paleolithic or Mesolithic.

Now, considering the way that the Phrygian question was treated by Lazaridis, Alpaslan-Roodenberg et al., despite the fact that they managed to sequence a few likely Phrygian speakers from none other than the Phrygian capital, let's not pretend that their paper brought us any closer to understanding the genetic origins of Anatolian speakers or pinpointing their ancestral homeland.

In order to even try to solve these problems with ancient DNA, we need a wide range of samples from Hittite, Luwian and other key sites where Anatolian languages were spoken. And then we must analyze them properly.

I'm guessing that Lazaridis, Alpaslan-Roodenberg et al. went out of their way to get such samples, but for one reason or another they failed. If so, that's OK, but I have a feeling that even if they got them, they wouldn't know what to do with them, because at best these samples would only show ~2% Eastern hunter-gatherer ancestry. Haha.

For what it's worth, I believe that the ancient data in the Lazaridis, Alpaslan-Roodenberg et al. paper point to the North Pontic steppe as the Indo-Anatolian homeland, and I'll lay out my arguments in an upcoming blog post.

See also...

Dear Iosif...Yamnaya

Dear Iosif, about that ~2%

Dear Iosif...

Dear Iosif #2

Dear Iosif #3

Thursday, September 1, 2022

Dear Iosif #3


Back in 2016 I made this prediction about the origins of the Yamnaya people (Steppe_EMBA):

But here's my prediction: Steppe_EMBA only has 10-15% admixture from the post-Mesolithic Near East not including the North Caucasus, and basically all of this comes via female mediated gene flow from farming communities in the Caucasus and perhaps present-day Ukraine.

The relevant blog post is still here. Looking back, my analysis is a bit sloppy and I didn't articulate my ideas too well. But that was a pretty good prediction for its time, and I believe it still has a chance of being confirmed, more or less.

On the other hand, the widely publicized hypothesis that the Yamnaya population is a ~50/50 mixture between indigenous Eastern European hunter-gatherers and Near Eastern or West Asian migrants never looked right to me. So I'm glad that it's now dead and buried.

Those of you not up to date with this topic, all you need to know is that the Yamnaya genotype existed in Eastern Europe at least a thousand years before Yamnaya, and, moreover, the Yamnaya people are largely derived from Eastern European foragers already rich in Near Eastern-related ancestry. The relevant ancient genomes are on the way (for instance, see here).

Nevertheless, the narrative that waves of Near Eastern migrants moved into prehistoric Eastern Europe, leading to the emergence of the Yamnaya culture and even the Proto-Indo-European language, is still being pushed by some notable scientists working with ancient DNA.

My hope is that, considering the latest revelations about the genetic origins of the Yamnaya people, these scientists can embrace a more nuanced view. How about something like this?

- people moved around, and they were especially mobile on the Eastern European steppe from the Eneolithic onwards

- when they made contact they sometimes mixed, so there was admixture between far flung steppe groups

- since population densities on the steppe were low until the Yamnaya period, minor admixture that entered the steppe during the Neolithic and Eneolithic wasn't dilluted easily.

See also...

Dear Iosif #2

But Iosif, what about the Phrygians?

Dear Iosif, about that ~2%

Dear Iosif...Yamnaya