Sunday, April 17, 2016

Estimating Basal Eurasian ancestry?


Basal Eurasians (BE) are a hypothetical ghost population that apparently split from other Eurasians no later than 45,000 years ago. If they actually existed, they had a significant impact on the ancestry of early Neolithic farmers, and thus all present-day West Eurasians.

Testing ancestry proportions from ghost populations isn't easy. However, Haak et al. 2015 made use of an f4 equation that seemingly gave an accurate estimate of BE admixture in LBK farmer Stuttgart: f4(Stuttgart,Loschbour;Onge,MA1)/f4(Mbuti,MA1;Onge,Loschbour) = 44%. The other LBK farmers scored an average of 40% BE, which also made sense.
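For readers unfamiliar with the notation, here is a minimal Python sketch of what an f4 statistic and an f4 ratio of this shape compute. It assumes per-SNP allele frequencies are already available as equal-length lists. The function and parameter names (`f4`, `f4_ratio`, `hg_a`, `hg_b`) and the toy frequencies are invented for illustration, and the sketch leaves out the standard errors that a real run would report.

```python
from statistics import fmean

def f4(a, b, c, d):
    """f4(A,B;C,D): mean over SNPs of (pA - pB) * (pC - pD)."""
    return fmean((pa - pb) * (pc - pd) for pa, pb, pc, pd in zip(a, b, c, d))

def f4_ratio(x, hg_a, ena, hg_b, outgroup):
    """Ratio of the form f4(X,HG_A;ENA,HG_B) / f4(Outgroup,HG_B;ENA,HG_A).

    With the populations named in the post: X=Stuttgart, HG_A=Loschbour,
    ENA=Onge, HG_B=MA1, Outgroup=Mbuti.
    """
    return f4(x, hg_a, ena, hg_b) / f4(outgroup, hg_b, ena, hg_a)

# Toy allele frequencies at four SNPs (invented numbers, not real data).
stuttgart = [0.40, 0.55, 0.30, 0.70]
loschbour = [0.20, 0.60, 0.10, 0.90]
onge      = [0.50, 0.30, 0.60, 0.40]
ma1       = [0.30, 0.50, 0.20, 0.80]
mbuti     = [0.70, 0.40, 0.65, 0.35]

print(f4_ratio(stuttgart, loschbour, onge, ma1, mbuti))
```

A population identical to the hunter-gatherer reference in the first slot gives a numerator of exactly zero, which is why such a group scores 0% under this setup.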

Unfortunately, this equation doesn't appear to work too well for Caucasus Hunter-Gatherers (CHG) Kotias and Satsurblia. They both score around 25% BE, which, as far as I can see, seems way too low. Perhaps using MA1 in the equation is messing things up because CHG harbor significant MA1-related ancestry?

I tinkered around with Haak's equation and came up with this: f4(X,Iberia_Mesolithic;Dai,Karelia_HG)/f4(Mbuti,Karelia_HG;Dai,Iberia_Mesolithic). The results look solid, at least in relative terms (see image below). But is the equation actually valid?

My main worry is using both Iberia Mesolithic and Karelia HG. They share a lot of drift, much more than Loschbour and MA1. Also, even though both Dai and Onge belong to the so-called Eastern non-African (ENA) clade, they're quite distinct, with Dai a lot less basal in the context of ENA diversity. Any thoughts? Suggestions?


Update 04/18/2016: Interestingly, my f4 equation essentially fails for most post-Neolithic Europeans, particularly those with relatively high levels of Karelia HG-related ancestry. For instance, Yamnaya Kalmykia scores just 2.9% BE, which can't be right. Yamnaya Samara shows -2.2%, which is obviously wrong.
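Whether such small values differ from zero at all can be read off the full result lines quoted in the comments, which appear to give the estimate, its standard error and a Z-score, in that order. A minimal Python sketch, assuming that column order (the helper name `parse_result` is hypothetical):

```python
def parse_result(line):
    """Split a result line into (populations, estimate, std_err, z).

    Assumes the last three fields are estimate, standard error and Z-score,
    and that everything before them is population labels plus a ':' separator.
    """
    fields = line.replace("result:", "").split()
    est, se, z = (float(v) for v in fields[-3:])
    pops = [f for f in fields[:-3] if f != ":"]
    return pops, est, se, z

# Line as quoted in the comments for Yamnaya Kalmykia.
line = ("Yamnaya_Kalmykia Iberia_Mesolithic Dai Karelia_HG : "
        "Mbuti Karelia_HG Dai Iberia_Mesolithic 0.029006 0.052593 0.552")

pops, est, se, z = parse_result(line)
# The reported Z should match estimate / standard error up to rounding.
print(pops[0], f"{est:.1%}", round(est / se, 3), z)
```

For Yamnaya Kalmykia the estimate sits roughly half a standard error from zero, so on this reading the 2.9% cannot be told apart from no BE signal at all in this particular test.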

But I tried several combinations of reference samples and found that by replacing Karelia HG with Hungary HG and Dai with Ust-Ishim I was able to obtain coherent results for a wider range of groups, including Yamnaya.


To be honest, I still don't know what the hell I'm testing here exactly. The results appear to reflect the existence of two components within West Eurasia: one representing ancient hunter-gatherers from Europe and probably surrounding areas of the Near East, and another closely related to present-day Near Eastern populations. The latter might well be a signal of the so-called Basal Eurasians, or perhaps a number of as yet unsampled metapopulations from the ancient Near East?

35 comments:

  1. I agree that there are a lot of questions here. Look at Ust_Ishim, according to these three ways of doing things. I've found that having a stronger drift with ENA, in pop A, can really inflate BE according to this. I'm not sure what we're really finding here, or if we can say there is one simple Basal Eurasian. Another thing about these below, is that West and East Eurasians share significant drift after Ust_Ishim. All but Kostenki.

    Ust_Ishim Iberia_HG Dai Karelia_HG : Mbuti Karelia_HG Dai Iberia_HG 1.248710 0.063351 19.711

    Ust_Ishim Loschbour Onge MA1 : Mbuti MA1 Onge Loschbour 1.171282 0.128513 9.114

    Ust_Ishim Loschbour Onge Kostenki : Mbuti Kostenki Onge Loschbour 0.877311 0.077901 11.262

  2. I'm not sure if anyone with any ENA can have reliable results. It makes this "BE" signal go up. Put an East Asian in there, and you see them get 196% Basal Eurasian. West Asians having recent ENA will get inflated results too.

  3. Certainly, the equation I posted won't work well, or at all really, for any group with significant East Asian, South Asian and/or Sub-Saharan ancestry.

    It only seems to be viable for basically fully West Eurasian farmer/hunter mixture populations.

  4. It looks difficult to find f4 ratios that would work well for all the populations, though those figures are probably not far off.

    I do wonder if, under the assumption of the estimates of BE for EEFs (which should be the most reliable), someone could calculate the theoretical coordinates for a ghost Basal Eurasian, for example from the latest PCA 9 (or any other of the datasheets). Maybe Matt knows a possible way to do that using PAST 3, and while it would still be a very theoretical approach, it could be interesting for calculating BE admixture in "difficult" populations (with ENA, SSA,...) and making some interesting models.

  5. This comment has been removed by the author.

  6. Basal Eurasian may have never existed. The argument used by Laz 2013 that it did exist was that Loschbour and MA1 have the same relationship to East Asians, despite being so separated in space and time. However, the same could be said about CHG and EEF.

    So, which is it? East Asian admixture in Kostenki/MA1/WHG/EHG or Basal Eurasian in CHG/EEF? Is there any way to know which one? Basal Eurasian makes the most sense, but I'm hesitant to say it's that simple.

  7. I made a ghost Basal Eurasian just based on Stuttgart being 56% Loschbour and 44% Basal_Eurasian (for the latest PCA data with 9 dimensions posted by Davidski).

    It's just for a quick test to see how this could work. With Stuttgart it works as expected:

    Stuttgart:LBK380
    "Loschbour:Loschbour" 56
    "Basal_Eurasian:Ghost1" 44
    "Bichon:Bichon" 0
    "Hungary_HG:I1507" 0
    "Iberia_Mesolithic:I0585" 0
    "Karelia_HG:I0061" 0
    "MA1:MA1" 0
    "Dai:HGDP01308" 0
    "Yoruba:HGDP00920" 0
    Distance: 0

    With others it's harder to say how good it might be:

    Anatolia_Neolithic:I0707
    "Loschbour:Loschbour" 51.35
    "Basal_Eurasian:Ghost1" 48.1
    "Karelia_HG:I0061" 0.55
    "Bichon:Bichon" 0
    "Hungary_HG:I1507" 0
    "Iberia_Mesolithic:I0585" 0
    "Kostenki14_UP:Kostenki14" 0
    "MA1:MA1" 0
    "Dai:HGDP01308" 0
    "Paniya:PNYD1" 0
    "Yoruba:HGDP00920" 0
    distance=0.009987

    Druze:HGDP00559
    "Basal_Eurasian:Ghost1" 42.3
    "Iberia_Mesolithic:I0585" 31.55
    "MA1:MA1" 23.2
    "Yoruba:HGDP00920" 2.2
    "Dai:HGDP01308" 0.75
    "Bichon:Bichon" 0
    "Loschbour:Loschbour" 0
    "Hungary_HG:I1507" 0
    "Karelia_HG:I0061" 0
    distance=0.036596

    Kotias is complicated:

    Kotias:KK1
    "MA1:MA1" 52.55
    "Basal_Eurasian:Ghost1" 30.6
    "Iberia_Mesolithic:I0585" 15.45
    "Yoruba:HGDP00920" 1.4
    "Bichon:Bichon" 0
    "Loschbour:Loschbour" 0
    "Hungary_HG:I1507" 0
    "Karelia_HG:I0061" 0
    "Dai:HGDP01308" 0
    distance=0.101346

    Adding Kostenki14:

    Kotias:KK1
    "MA1:MA1" 38.55
    "Kostenki14_UP:Kostenki14" 28.05
    "Basal_Eurasian:Ghost1" 25.75
    "Hungary_HG:I1507" 7.65
    "Bichon:Bichon" 0
    "Loschbour:Loschbour" 0
    "Iberia_Mesolithic:I0585" 0
    "Karelia_HG:I0061" 0
    "Dai:HGDP01308" 0
    "Yoruba:HGDP00920" 0
    distance=0.098612

    Adding Paniya:

    Kotias:KK1
    "Karelia_HG:I0061" 34.45
    "Basal_Eurasian:Ghost1" 30.2
    "Paniya:PNYD1" 25.7
    "Hungary_HG:I1507" 8.9
    "Yoruba:HGDP00920" 0.75
    "Bichon:Bichon" 0
    "Loschbour:Loschbour" 0
    "Iberia_Mesolithic:I0585" 0
    "Kostenki14_UP:Kostenki14" 0
    "MA1:MA1" 0
    "Dai:HGDP01308" 0
    distance=0.084395

    Not sure what to think. Maybe with a ghost based on more samples from HGs and EEFs it could be interesting.

    Here is the data for this ghost:

    Basal_Eurasian:Ghost1,0.01995,-0.3607192,0.046804818,0.004713,0.03119,-0.08396363,0.1048888,-0.02823418,0.34029281

  8. Ryu,

    I'm not sure about that formula, or what it's capturing.

    Either way, it's going negative.

    result: Stuttgart Onge MA1 Mbuti : Chimp Ust_Ishim MA1 Mbuti -0.032068 0.014474 -2.216
    result: Stuttgart Onge Loschbour Mbuti : Chimp Ust_Ishim Loschbour Mbuti -0.197678 0.014160 -13.960
    result: Anatolia_Neolithic Onge MA1 Mbuti : Chimp Ust_Ishim MA1 Mbuti -0.037357 0.010465 -3.570
    result: Anatolia_Neolithic Onge Loschbour Mbuti : Chimp Ust_Ishim Loschbour Mbuti -0.173997 0.010316 -16.867
    result: Kotias Onge MA1 Mbuti : Chimp Ust_Ishim MA1 Mbuti -0.055225 0.013913 -3.969
    result: Kotias Onge Loschbour Mbuti : Chimp Ust_Ishim Loschbour Mbuti -0.104456 0.013704 -7.622
    result: Stuttgart Ust_Ishim MA1 Mbuti : Chimp Ust_Ishim MA1 Mbuti -0.061164 0.017688 -3.458
    result: Stuttgart Ust_Ishim Loschbour Mbuti : Chimp Ust_Ishim Loschbour Mbuti -0.208354 0.018985 -10.975
    result: Anatolia_Neolithic Ust_Ishim MA1 Mbuti : Chimp Ust_Ishim MA1 Mbuti -0.065635 0.014418 -4.552
    result: Anatolia_Neolithic Ust_Ishim Loschbour Mbuti : Chimp Ust_Ishim Loschbour Mbuti -0.185258 0.015859 -11.682
    result: Kotias Ust_Ishim MA1 Mbuti : Chimp Ust_Ishim MA1 Mbuti -0.088006 0.017845 -4.932
    result: Kotias Ust_Ishim Loschbour Mbuti : Chimp Ust_Ishim Loschbour Mbuti -0.117491 0.017352 -6.771
    result: Stuttgart Ust_Ishim Karelia_HG Mbuti : Chimp Ust_Ishim Karelia_HG Mbuti -0.179750 0.018753 -9.585
    result: Anatolia_Neolithic Ust_Ishim Karelia_HG Mbuti : Chimp Ust_Ishim Karelia_HG Mbuti -0.173216 0.015347 -11.286
    result: Kotias Ust_Ishim Karelia_HG Mbuti : Chimp Ust_Ishim Karelia_HG Mbuti -0.172484 0.018718 -9.215

  9. This comment has been removed by the author.

  10. The results are still the same. Go to Haak et al. (2015). There is a section on the f4-ratio for Basal Eurasian. See if you can come up with something else.

  11. This comment has been removed by the author.

  12. The results are still infeasible. You have to have the West Eurasian and East Eurasian pops flipped. I've run all those. Kostenki seems to be okay, from between Ust_Ishim and West Eurasians. I'm not sure about using EHG, due to reasons you know from our talks. It would only complicate things. Another thing I've noticed from these formulas is that MA1 comes out about 40% BE, which is odd, considering he is basically as distant from Ust_Ishim as WHG is. We may need a whole new formula.

    I still don't think we can completely throw out ENA into WHG, less into EEF, with some SSA, and toss out BE. Kostenki is equally distant from Ust_Ishim as WHG and ENA, yet further from ENA.

    It could be that the Near East and farmers have some Kostenki-like ancestry, plus minor SSA.

  13. Interestingly, the equation I posted totally fails for Yamnaya. It must be because of too much shared drift with Karelia HG.

    Yamnaya_Kalmykia Iberia_Mesolithic Dai Karelia_HG : Mbuti Karelia_HG Dai Iberia_Mesolithic 0.029006 0.052593 0.552

    Yamnaya_Samara Iberia_Mesolithic Dai Karelia_HG : Mbuti Karelia_HG Dai Iberia_Mesolithic -0.022191 0.050312 -0.441

  14. Kostenki gives them 42%, Karelia 29%, and Kotias 49%, but MA1 40%. BE may be an illusion. No matter the sample, someone won't make sense. I can't come up with an alternative yet.

  15. I'm just looking for a test that can estimate Basal Eurasian proportions in ancient farmers, ancient Near Easterners and also modern Near Easterners who don't have too much Sub-Saharan and South/East Asian ancestry.

    All of these groups have very similar ancestry, and I think that's why my equation works. Although it'd be nice to find an alternative to Karelia HG just in case potential shared ancestry between CHG and Karelia HG is underestimating the Basal score in the CHGs.

    I wouldn't worry about anyone else. There's just so much a single f4 test can analyze. It can't cover all the bases, especially for more complicated mixtures.

    Kostenki14 works OK in place of Karelia HG, but the estimates look a little low across the board IMO.

  16. Here are a few ways to look at it. This is supposed to be capturing ancestry not descended from Ust_Ishim; however, it seems this and the inverse are actually only capturing what is descended from Ust_Ishim, rather than anything deeper. Although, we see other HG and MN get lower amounts, which seems to imply that Ust_Ishim is BE, but not the same as what farmers have.

    result: Ust_Ishim Loschbour Onge MA1 : Mbuti MA1 Onge Loschbour 1.171282 0.128513 9.114
    result: Ust_Ishim Loschbour Onge Kostenki : Mbuti Kostenki Onge Loschbour 0.877311 0.077901 11.262
    result: Ust_Ishim Iberia_HG Dai Karelia_HG : Mbuti Karelia_HG Dai Iberia_HG 1.248710 0.063351 19.711
    result: Ust_Ishim Onge Loschbour Han : Mbuti Han Loschbour Onge 1.101166 0.106606 10.329

  17. Very interesting questions, and the output seems very clean/solid, assuming that "Basal Eurasian" is real.

    Although, I'm not too sure about this anymore, considering that K14 doesn't have any ancestry basal to Ust-Ishim (I recall a set of stats which demonstrated this), yet is much more distant from ENA compared to ANE/EHG/WHG. Like Chad stated, just having substantial ENA ancestry in West Eurasian hunter-gatherers, with much less in ENF/CHG, and perhaps some African ancestry in ENF/CHG, probably makes more sense considering the data we have post-Lazaridis et al.

    I guess we'll just have to wait and see how things play out with more aDNA.

    Anyway, Alberto's experiment seems to work great with EEF and SHG:

    LBK_EN
    55.70% Loschbour
    43.95% Basal_Eurasian:Ghost1
    0.25% Karelia_HG
    0.10% Yoruba

    Starcevo_EN
    53.65% Loschbour
    46.35% Basal_Eurasian:Ghost1

    Motala_HG
    55.8% Loschbour
    31.3% Samara_HG
    11.9% Karelia_HG
    1.0% Basal_Eurasian:Ghost1

    This is pretty cool. If we assume the Basal Eurasian/Crown Eurasian scenario, ENF/CHG are the mixed ones, so it's great to see how they stack up with unadmixed populations (again, unadmixed as per the Basal Eurasian/Crown Eurasian idea). These were done with the same reference populations as the EEF/SHG ones:

    Lithuanian
    54.3% Loschbour
    30.4% Srubnaya_outlier:I0354
    15.3% Basal_Eurasian:Ghost1

    Albanian
    43.50% Loschbour
    31.20% Basal_Eurasian:Ghost1
    20.95% Srubnaya_outlier:I0354
    4.35% Paniya

    Lebanese_Christian
    43.35% Basal_Eurasian:Ghost1
    23.00% Loschbour
    21.90% Srubnaya_outlier:I0354
    9.30% Paniya
    2.45% Yoruba

    Georgian
    35.35% Basal_Eurasian:Ghost1
    32.25% Srubnaya_outlier:I0354
    20.25% Loschbour
    12.15% Paniya

    Kurdish
    34.15% Basal_Eurasian:Ghost1
    33.60% Srubnaya_outlier:I0354
    16.20% Loschbour
    15.45% Paniya
    0.60% Yoruba

    Kalash
    50.80% Srubnaya_outlier:I0354
    33.20% Paniya
    14.50% Basal_Eurasian:Ghost1
    1.35% Loschbour
    0.15% Yoruba

    Kotias
    47.60% Srubnaya_outlier:I0354
    26.30% Basal_Eurasian:Ghost1
    18.10% Paniya
    6.45% Loschbour
    1.55% Yoruba

    I don't think we really know what the Paniya percentages represent in the Balkans, West Asia, the Caucasus, South Central Asia, and in CHG. Also, Srubnaya_outlier is basically an EHG/ANE sample, which is why I used it.

    A side note, but this estimate of BEA for Kotias matches what David found using the Haak et al. equation.

  18. result: Iberia_HG Ju_hoan_North Stuttgart Yoruba : Iberia_HG Ju_hoan_North Loschbour Yoruba 0.664114 0.010259 64.733
    result: Iberia_HG Ju_hoan_North Iberia_EN Yoruba : Iberia_HG Ju_hoan_North Loschbour Yoruba 0.691873 0.009134 75.745
    result: Iberia_HG Ju_hoan_North Anatolia_Neolithic Yoruba : Iberia_HG Ju_hoan_North Loschbour Yoruba 0.662611 0.007468 88.724
    result: Iberia_HG Ju_hoan_North Kotias Yoruba : Iberia_HG Ju_hoan_North Loschbour Yoruba 0.613168 0.010476 58.531

  19. Paniyas and Srubnaya have BE, if it's real. So it really clouds the picture.

  20. Really not sure if it's possible to test BE with PCA/nMonte.

    Anyway, I updated the post with new results.

  21. I always had problems with the Basal/Crown Eurasian concept. I feel it is (a) oversimplifying things and (b) possibly labelling groups the wrong way, i.e. "Crown" being the more ancient and widespread, and "basal" the later and more geographically restricted group.

    Let's run through our sketchy knowledge of paleolithic population movements and genetic admix:

    1. Hominids: Neandertal as "basal" (all Eurasian), Denisova as a "crown", mostly SEA population.
    Quite early, ca. 100 kya, Neandertals had differentiated into a West and an East Eurasian line. The latter, represented by the Altai Neandertal, received early AMH genetic inflow, which European Neandertal samples are lacking. So far, we lack comprehensive information on the extent to which these two Neandertal clades may have acted differently on West/East Eurasian populations, but I am confident that S. Pääbo's team at Leipzig is working on it.

    2. Neandertal admixture shows regionally differentiated patterns, see
    http://www.sciencedirect.com/science/article/pii/S0002929715004863
    "Of the three putatively introgressed core haplotypes, III and IV are most similar to the Altai Neandertal genome (..). Core haplotype III is present in all non-African populations. (..) Core haplotype IV is restricted to specific Asian groups."
    So, on top of Denisova, we have another "crown" East Eurasian component reflected by specific Altai Neandertal gene flow.

    3. yDNA: yDNA K, or K2, estimated to have split off around 45 kya, appears to represent well, in Dave's words, "a hypothetical ghost population that apparently split from other Eurasians no later than 45,000 years ago". In fact, most West Eurasian yDNA hgs (G/H/I/J) come from upstream of that split, East Eurasian ones from downstream. The exception, of course, is yDNA R.
    Confusing here is the tree phylogeny, since Australian K-M256 and Papuan M and S are found downstream of K, yet should in principle represent more ancient, "basal" populations than the West Eurasian G/H/I/J. To put it differently: typical West Eurasian yDNA is indeed "basal" to the yDNA tree, East Eurasian and Oceanian yDNA more downstream ("crown"), but we have little indication that those "basal Eurasians" in prehistoric times ever settled more than West Eurasia. Apparently, there is still much we have to learn about how the second "Out of Africa" wave some 45-50 kya worked. In any case, I suppose that "basal" vs. "crown" Eurasian is closely linked to yDNA pre- vs. post-K(2).

    4. mtDNA: Here, the case is less clear-cut. The M-N split appears to predate the emergence of "Basal Eurasian" at around 45kya. More fitting is mtDNA U, which dominates West Eurasian UP mtDNA. Intriguing here is the case of U2, which is predominantly South Asian, but includes West Eurasian U2e. South Asian U2 thus may represent an Upper Paleolithic West Eurasian migration into South Asia; note in this context also Kostenki's mtDNA U2.
    Possibly, U2 is related to the "Paniya" component showing up in MA1 and EHG.

    In summary, we still have a lot to learn about Paleolithic population movement. Whether a sizeable "common Eurasian" layer has survived at all seems doubtful. Rather, we may be dealing with a quite early east-west split. The eastern side would be represented by Denisova, Altai Neandertal, yDNA K(2), mtDNA M, and Ust-Ishim; the western side by European Neandertals, yDNA GHIJ, mtDNA U (plus possibly others, e.g. HV), and Kostenki. Already during the UP, there should have been several cross-overs.

  22. Hi All,

    Can anyone explain to me why the ratio is

    f4(X,Iberia_Mesolithic;Dai,Karelia_HG)/f4(Mbuti,Karelia_HG;Dai,Iberia_Mesolithic)

    rather than

    f4(X,Iberia_Mesolithic;Dai,Karelia_HG)/f4(Mbuti,Iberia_Mesolithic;Dai,Karelia_HG)?

    (modeled on the same in Haak with Loschbour rather than Iberia and MA-1 rather than Karelia).

    Also wouldn't

    f4(X,Iberia_Mesolithic;Ust_Ishim,Hungary_HG)/f4(Mbuti,Hungary_HG;Ust_Ishim,Iberia_Mesolithic)

    give more basal results for Yamnaya simply because

    f4(X,Iberia_Mesolithic;Ust_Ishim,Hungary_HG) is going to be more similar to f4(Mbuti,Hungary_HG;Ust_Ishim,Iberia_Mesolithic) for them*, because they don't have much / any WHG admixture?

    (the opposite problem seems to have been attacking the ratio using Karelia). Doesn't seem like it can be valid for EHG+Basal Mixtures. Even for Anatolian farmers, they are in theory WHG+UHG+Basal.

    * As f4(X,Iberia_Mesolithic;Ust_Ishim,Hungary_HG) approaches equal to f4(Mbuti,Hungary_HG;Ust_Ishim,Iberia_Mesolithic) the ratio of the two approaches 1.

    @ Alberto: Yes, like Ryu is mentioning, if you have a set of stable BE estimates for populations that you know to be simple two-way mixes of BE and WHG (at least more than 3 populations, including WHG with 0), then you can do a regression equation in PAST to predict a 100% BE zombie in the PCA space. Then run the zombie data through nMonte for more complex populations. I think you've done something like that?

    One thing I would say about whatever Basal Eurasian construct is produced by this method is that the definition of Basal Eurasian is a population that is equally related to WHG, EHG and ENA. So it should meet this definition of being equally distant, or it is not really Basal Eurasian. However, I'm not sure how you would test this, as greater and lesser drift in the WHG and ENA branches would interfere with measuring this.

    @ Ryu, apologies for not replying to your post in the other thread. I'm not on Anthrogenica, unfortunately. Did you and Alberto resolve what it was you were looking for?

    @ Krefter, well observed and stated re: CHG and EEF. Also note Kostenki and EEF and CHG also have the same relationship to East Asians (but not to Ust Ishim or Oase1!).

  23. FrankN.

    Good summary. A lot of the time I think it's all political or ethnic fervor. Otherwise, how can yDNA GH become western?

  24. Matt,

    As you point out, the ratio in the first test above is...

    Stuttgart Iberia_Mesolithic Dai Karelia_HG : Mbuti Karelia_HG Dai Iberia_Mesolithic

    That's because, unless I'm losing my grip on reality, the ratio in the Haak supp info on page 79 is...

    Stuttgart Loschbour Onge MA1 : Mbuti MA1 Onge Loschbour

    But the ratio you proposed, which puts Karelia_HG in a different place in the second part of the equation, does seem to work for groups that don't have significant Karelia HG admix.

    Iranian_Jew Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.516748 0.028777 17.957
    Druze Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.502651 0.026115 19.248
    Armenian Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.501704 0.028977 17.314
    Georgian_Jew Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.478387 0.029196 16.385
    Cypriot Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.454031 0.029802 15.235
    Georgian Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.453603 0.030366 14.938
    Satsurblia Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.442465 0.048693 9.087
    Kotias Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.438867 0.044729 9.812
    Stuttgart Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.434508 0.04474 9.712
    Anatolia_Neolithic Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.405771 0.031206 13.003
    Sardinian Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.385938 0.029733 12.98
    LBK_EN Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.382318 0.032203 11.872
    Iberia_EN Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.375653 0.034828 10.786
    Iceman_MN Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.354782 0.045951 7.721
    Remedello_BA Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.344965 0.04524 7.625
    Hungary_EN Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.340293 0.033698 10.098
    Esperstedt_MN Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.339333 0.050863 6.672
    Hungary_CA Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.313537 0.049338 6.355
    Iberia_Chalcolithic Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.294134 0.035349 8.321
    Iberia_MN Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.282007 0.037217 7.577
    Hungary_BA Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.174216 0.04601 3.786
    Unetice_EBA Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.130044 0.039846 3.264
    Bell_Beaker_Germany Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.125737 0.039357 3.195
    Andronovo Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.08975 0.043884 2.045
    Corded_Ware_Germany Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.041574 0.043202 0.962
    Afanasievo Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.03894 0.049053 0.794
    Srubnaya Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.038541 0.042581 0.905
    Yamnaya_Kalmykia Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.026593 0.047872 0.555
    Poltavka Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG 0.007984 0.047997 0.166
    Yamnaya_Samara Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG -0.020148 0.045976 -0.438
    Samara_Eneolithic Iberia_Mesolithic Dai Karelia_HG : Mbuti Iberia_Mesolithic Dai Karelia_HG -0.363544 0.070432 -5.162

  25. @Frank

    2. Neandertal admixture shows regionally differentiated patterns, see
    http://www.sciencedirect.com/science/article/pii/S0002929715004863
    "Of the three putatively introgressed core haplotypes, III and IV are most similar to the Altai Neandertal genome (..). Core haplotype III is present in all non-African populations. (..) Core haplotype IV is restricted to specific Asian groups."
    So, on top of Denisova, we have another "crown" East Eurasian component reflected by specific Altai Neandertal gene flow.

    ...

    In summary, we still have a lot to learn about Paleolithic population movement. Whether a sizeable "common Eurasian" layer has survived at all seems doubtful. Rather, we may be dealing with a quite early east-west split. The eastern side would be represented by Denisova, Altai Neandertal, yDNA K(2), mtDNA M, and Ust-Ishim; the western side by European Neandertals, yDNA GHIJ, mtDNA U (plus possibly others, e.g. HV), and Kostenki. Already during the UP, there should have been several cross-overs.


    The Dannemann et al. 2015 paper you linked to only uses the Altai Neanderthal genome as a Neanderthal genome, so you cannot make conclusions specific to the Altai Neanderthal based on that paper. Indeed, the Kuhlwilm et al. 2016 paper (the very paper that found the 100,000 yo modern human introgression into the Altai Neanderthal) also found that the Neanderthal introgression into all non-African modern humans (and not just the West Eurasian ones) was from western Neanderthals, and not from eastern Neanderthals such as the Altai Neanderthal:

    "When we refine our estimates of gene flow by adding the chromosome 21 sequences of the European Neanderthals to our genome-wide data, G-PhoCS infers significant rates of gene flow from Neanderthals into modern humans outside Africa only for El Sidrón and Vindija Neanderthals (0.3–2.6%) (Fig. 3a), suggesting that Neanderthals from Europe are more closely related than the Altai Neanderthal to the population that interbred with modern humans outside Africa 47,000–65,000 years ago."

  26. Chad,

    Good points, although this Srubnaya_outlier is actually an EHG/ANE sample, quite distinct from other Srubnaya samples. For example:

    (Chimp, MA1, Srubnaya, Srubnaya_Outlier) d=0.0498
    (Chimp, MA1, Yamnaya_Samara, Srubnaya_Outlier) d=0.0321
    (Chimp, MA1, Samara_Eneolithic, Srubnaya_Outlier) d=0.0011
    (Chimp, MA1, Karelia_HG, Srubnaya_Outlier) d= -0.0133

    Or one can examine the PCA David posted, in which the Srubnaya_Outlier clusters near EHG, at quite a distance from other Srubnaya samples.

    And when it comes to the Paniya, assuming Basal Eurasian is real (which is a very complex/interesting discussion in itself), they are probably (at most) around 5% BEA. So the Paniya percentages (from the Balkans to western Pakistan) probably reflect (at least in part) some sort of Crown Eurasian ancestry (for which we don't have any ancient samples). But who really knows. As always, we need aDNA.

    Odd detail: despite the presence of MA1, Samara hunter, Karelia hunter, and Afontova Gora (in the PCA nMonte fits I tried), modern populations from Lithuania to India prefer Srubnaya_outlier, while ancient samples do often tend towards the Karelia/Samara hunters. Not sure what that could mean, if anything.

    But as David noted, perhaps the PCA-based approach is wholly inadequate in this case, although it does work great with EEF and SHG.

  27. @Matt

    Yes, I basically did that to create the Basal Eurasian ghost, but manually and with only one sample. It seems to work ok for Europe and the Levant (Basal Eurasian peaking at around 52% in BedouinB), but it's more complicated when it comes to CHG-rich populations (Kotias itself being some 30% BE, Satsurblia a bit less). Maybe moving the position of this ghost to the "east" a bit would work better.

    Re: distances of BE to WHG, EHG and East Asian, the theory is that they should be equidistant. But I'm not sure how feasible that is on a PCA. Using BE as the only source population in nMonte (as you did before with 4mix) we can calculate the total distance for any given target population (using the Ncycles=1 parameter saves a lot of time):

    Loschbour <-> Basal_Eurasian:Ghost1 = 0.620881
    Karelia_HG <-> Basal_Eurasian:Ghost1 = 0.626815
    Dai <-> Basal_Eurasian:Ghost1 = 0.753895
    MA1 <-> Basal_Eurasian:Ghost1 = 0.605049

    So clearly Dai look quite a bit further away from BE than the HGs are. Checking with Ust-Ishim to see if it's equidistant from East and West Eurasian:

    Loschbour <-> Ust_Ishim = 0.400614
    Dai <-> Ust_Ishim = 0.355974

    Also not exactly, though I think D-stats do show that Ust-Ishim is slightly closer to East Asian (IIRC). Interestingly, checking the distance for Dai to Western populations:

    Loschbour <-> Dai = 0.592068
    MA1 <-> Dai = 0.482459
    Anatolia_Neolithic <-> Dai = 0.594729
    Kotias <-> Dai = 0.526993

    So there is something strange there in Loschbour being too far from Dai relative to the others. I checked with the D-stats based datasheet to see if the distances there were more correct (I used the last one provided by Davidski with Scythian as a row):

    Western_HG <-> Dai = 0.350144
    MA1 <-> Dai = 0.244707
    Anatolia_Neolithic <-> Dai = 0.335807
    Caucasus_HG <-> Dai = 0.308551

    A similar pattern, though even a bit more strange to see Loschbour being even further away from Dai than Anatolia_Neolithic is.

  28. pure guess but just in case it nudges anything

    1) Personally I've always thought the idea of the Neanderthals being wiped out in one big sweep was unlikely. If they were particularly cold adapted then at the very least some in the high altitudes of high latitudes should have survived longer imo and in the most extreme case (north Himalayas?) potentially a lot longer. In which case the time since last 50/50 mixture may have been substantially different between the mid-east and northern mountain regions like the Himalayas.

    2) If highly divergent DNA like Neanderthal is generally selected against for some reason (except the particularly good bits) from the time of mixture and at the same rate then if (1) is true then the population that had Neanderthal mixture latest would still have more over time than the earliest (even if declining all the time) until a minimum percentage was reached (which may not have happened yet).

    3) I'm not a mathematician (probably obviously), but I've messed around with various equations as part of modding AI for games, so I know equations can throw up odd effects in unusual situations you hadn't thought of.

    so

    4) how do the various software packages deal with small quantities of highly divergent DNA?

    I'm wondering if BE is maybe an artifact revolving around how the software deals with populations whose Neanderthal ancestry was least recent relative to those in whom it's more recent.
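
Point (2) can be illustrated with a toy model. This is only a sketch under made-up assumptions: the exponential decay toward a floor, the function name `archaic_fraction`, and every number below (50% starting share, 2% floor, decay rate, admixture dates) are hypothetical and not estimates from any data.

```python
import math


def archaic_fraction(years_since_admixture, initial=0.5, floor=0.02, rate=1e-4):
    """Toy model: archaic share decays exponentially from `initial` toward `floor`.

    All parameter values are hypothetical placeholders.
    """
    if years_since_admixture < 0:
        return 0.0  # admixture has not happened yet
    return floor + (initial - floor) * math.exp(-rate * years_since_admixture)


# Hypothetical dates of the last 50/50 mixture, in years before present.
early_admixture = 50_000  # e.g. a Middle Eastern population
late_admixture = 35_000   # e.g. a northern mountain population

early_now = archaic_fraction(early_admixture)
late_now = archaic_fraction(late_admixture)

print(f"early-admixed population today: {early_now:.4f}")
print(f"late-admixed population today:  {late_now:.4f}")
```

With the same decay rate in both populations, the later-admixed one retains the larger share at any given time, and both converge on the floor in the long run.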

  29. "To be honest, I still don't know what the hell I'm testing here exactly. The results appear to reflect the existence of two components within West Eurasia; one representing ancient hunter-gatherers from Europe and probably surrounding areas of the Near East, and another closely related to present-day Near Eastern populations. The latter might well be a signal of the so called Basal Eurasians, or perhaps a number of as yet unsampled meta populations from the ancient Near East?" (Davidski)
    "Another thing about these below, is that West and East Eurasians share significant drift after Ust_Ishim" (Chad Rohlfsen).
    "I'm wondering if BE is maybe an artifact revolving around how the software deals with populations whose Neanderthal ancestry was least recent relative to those in whom it's more recent" (Grey).
    ...........................................
    After Copernicus arrived.

  30. It's impossible to find a commonality between groups in terms of things that are defined as the differences between groups.

  31. It appears that the Antelian culture came to an end through an abrupt change in stone technology, brought by the succeeding Kebaran culture. Given Eupedia's hypothesis that N1, N2, X and W are Basal Eurasian, and the fact that those mtDNA haplogroups were dominant in Mesolithic farmers, how likely is it that the Basal Eurasian gene came from the Antelian culture?

    The fact that Basal Eurasian is highest in Neolithic Middle Easterners could be a sign that Basal Eurasians were assimilated there.

  32. Hey Davidski, just thought of something. I was looking at your Basal Eurasian K7 D-stats and saw that La Brana hasn't been compared to Basal Eurasians yet. La Brana on GEDmatch is F999915. It's just a wild guess, but I'm wondering whether Mesolithic Europeans/the Gravettian culture somehow absorbed some Basal Eurasian before heading to Europe. :)

  33. La_Brana1 has been thoroughly tested for Basal Eurasian and it doesn't have any. But Villabruna might have a little bit.

  34. Wish you the best of luck :).
    Ultimately it comes down to the fact that, as far as prehistoric archaeology is concerned, you are way ahead of your time. Burials of modern Homo sapiens from the Upper Paleolithic, or the Paleolithic in general, are simply hard to come by. So there just aren't enough Paleolithic remains to DNA test for the sole purpose of finding Mr and Miss Basal. But despite the lack of Paleolithic skeletons, we do have our own Y-DNA and mtDNA; these two lineages can trace our direct paternal and direct maternal ancestors back to Paleolithic times.

    https://en.m.wikipedia.org/wiki/List_of_human_evolution_fossils

    Is there any way to download raw mtDNA/Y-DNA and test them on the Basal Eurasian K7? If none of the Y-DNA haplogroups A-T or mtDNA haplogroups A-Z show any sign of Basal Eurasian influence, then it might be safe to say that Basal Eurasian is a fluke.

  35. I thought that Lazaridis et al. disputed the idea that Kostenki 14 had Basal Eurasian, or at least the same BE that the EEF had. It does not make sense for Kostenki 14 to be heavy on Neanderthal genes when one of the hallmarks of BE is that it is inversely correlated with such admixture.

    Do the tests make more sense if you just use the more recent samples (<15K)? If the tests using only the more recent samples make sense but the tests with the older ones don't, maybe it's because the younger and older samples don't carry the same BE.

