search this blog

Friday, March 9, 2018

Ancient genomes from Southeast Asia (McColl et al. 2018 preprint)


Over at bioRxiv at this LINK. I'm still reading and trying to figure out what the 25 ancient genomes from this preprint say about the peopling of Eurasia and, in particular, South Asian population structure, including the so called Ancestral South Indian (ASI) genetic component. Any ideas? Below are the abstract and Figure 4 from the preprint.

Two distinct population models have been put forward to explain present-day human diversity in Southeast Asia. The first model proposes long-term continuity (Regional Continuity model) while the other suggests two waves of dispersal (Two Layer model). Here, we use whole-genome capture in combination with shotgun sequencing to generate 25 ancient human genome sequences from mainland and island Southeast Asia, and directly test the two competing hypotheses. We find that early genomes from Hoabinhian hunter-gatherer contexts in Laos and Malaysia have genetic affinities with the Onge hunter-gatherers from the Andaman Islands, while Southeast Asian Neolithic farmers have a distinct East Asian genomic ancestry related to present-day Austroasiatic-speaking populations. We also identify two further migratory events, consistent with the expansion of speakers of Austronesian languages into Island Southeast Asia ca. 4 kya, and the expansion by East Asians into northern Vietnam ca. 2 kya. These findings support the Two Layer model for the early peopling of Southeast Asia and highlight the complexities of dispersal patterns from East Asia.


McColl et al., Ancient Genomics Reveals Four Prehistoric Migration Waves into Southeast Asia, bioRxiv, Posted March 8, 2018, doi: https://doi.org/10.1101/278374

Update 10/03/2018: Harvard and friends strike back with their own preprint on the same topic (LINK). Here's the abstract:

Southeast Asia is home to rich human genetic and linguistic diversity, but the details of past population movements in the region are not well known. Here, we report genome-wide ancient DNA data from thirteen Southeast Asian individuals spanning from the Neolithic period through the Iron Age (4100-1700 years ago). Early agriculturalists from Man Bac in Vietnam possessed a mixture of East Asian (southern Chinese farmer) and deeply diverged eastern Eurasian (hunter-gatherer) ancestry characteristic of Austroasiatic speakers, with similar ancestry as far south as Indonesia providing evidence for an expansive initial spread of Austroasiatic languages. In a striking parallel with Europe, later sites from across the region show closer connections to present-day majority groups, reflecting a second major influx of migrants by the time of the Bronze Age.

Lipson et al., Ancient genomes document multiple waves of migration in Southeast Asian prehistory, bioRxiv, Posted March 10, 2018, doi: https://doi.org/10.1101/279646

124 comments:

Ryukendo K said...

Pots =/= people, and the same can be said for stone tools, but for what it's worth the Hoabinhian cultural complex associated with "group 1" genomes clustering with Onge and displaying South Asian affinities stretches to Nepal and Northeast India, but no further West.

Its likely that Mesolithic South Asians have multiple layers of ancestry, and ASI may be obsolete as a clade by the time we are done.

Rob said...

Great paper . Happy for David
I do wonder if some of the really old specimens might come soon too ?

Arza said...

https://s6.postimg.org/yfnz6vgcx/India_Model_Ami.png

I'm showing such plots since almost 1.5 year. ASI will be in the corner at the bottom of this triangle.

Chamar (best proxy for "Indian cline") = ASI + Iran + a little bit of Indo-Aryan
Bonda = ASI + proto-Autstorasiatic (in G25 Cambodians work best)
Austroasiatic cline = ASI + Iran on the one side and ASI + proto-Austroasiatics on the other.
Pulliyar (at least in G10) = Indian cline + Austroasiatic cline

I don't understand why so many people pushed for using Onge, Bonda or even Paniya as an ASI proxy. It just needs to take a quick look at G10 to see what the data really shows.

Nirjhar007 said...

C-Ydna from the oldest Hunter gatherer again :) .

Anonymous said...

I don't understand the paper well, but it looks like ASI, like ANI, is a actually a combination of things. Onge probably reflects an SE Asian population related to Palaeolithic Indians.

@Davidski, is it possible to use these samples and Onge to filter out the non-Onge/SEA hunter-gatherer component from South Asia? Is there a remnant that forms some kind of cline with Onge?

Kristiina said...

When the genomes are released, it would be interesting to test the pincer model of the colonization of East Asia, if possible:

https://www.semanticscholar.org/paper/Computer-simulation-of-human-leukocyte-antigen-gen-Di-Sanchez-Mazas/89368d2672022e7f73c90159aacd8569deb6fb10/figure/0

https://link.springer.com/article/10.1007/s12284-011-9066-y

Bronze said...

Indians dont have any real Onge, it was apparent from the beginning that south asians and ASI is extremely distant from Onge and all andaman islanders so calling them related is false. I suspect racists wanted to make the false claim that indians used to look like Onge before ANI admixture but we know thats 100% false.

Ryan said...

@Nirjhar007 - "C-Ydna from the oldest Hunter gatherer again :)"

Interesting that the same is true for Europe. The first wave of AMH to break out of the Levant?

Anonymous said...

Very interesting not-told story about East Asians (From Japan to Korea to China to Thailand).
Looking at the Uyghur, it seems that most of the Southern admixture went North very recently, maybe some ~500 years ago.
Now, about Southeast Asians, I don't think the Ancients of this study elucidate that much.

@Bronze
"Indians dont have any real Onge, it was apparent from the beginning that south asians and ASI is extremely distant from Onge and all andaman islanders so calling them related is false"
The Onge were only used due to a PCA device, for the lack of ASI, they did a very nice job.

"I suspect racists wanted to make the false claim that indians used to look like Onge before ANI admixture but we know thats 100% false."
Aren't you the one here being racist against the Onge? Again, people used the Onge because that was the best proxy to use, and no one can affirm nor deny that the ASI didn't look like the Onge.
Genetics isn't only about looks, Africans and Negritos look very much alike but no other groups have so much of a genetic distance from one another than themselves.

@Arza
Yes, you stand correct. It's okay to assume the ASI being in this position here?
https://imgur.com/a/lU9ke

Bronze said...

@namedguest

wrong, we can definitely confirm that ”Pure” ASI never looked anything like Onge since the tribals with the highest ASI dont resemble them, and the oldest human remains in south asia ie balangoda man dont look similar to them based on anthropological measurements either. And now since we know there is no genetic relationship between them either, we can establish that they didnt look like onge, otherwise you could make a similar argument without evidence and claim Yamnaya or paleolthic europeans looked like Onge.

Anonymous said...

@Samuel Andrews
Perhaps Rob doesn't know about the Doric invasions? The Mycenaeans were the offspring of only the first wave.

Anonymous said...

@Bronze
"wrong, we can definitely confirm that ”Pure” ASI never looked anything like Onge since the tribals with the highest ASI dont resemble them"
This here doesn't matter. Phenotypes aren't dependent of %share, but much more of sexual selection. That's why you can have Blue eyed blonde South Asians, who look very much like Europeans, but with only a small fraction of such ancestry.

"and the oldest human remains in south asia ie balangoda man dont look similar to them based on anthropological measurements either."
Also doesn't matter because of standard deviations and lack of a pool.

"And now since we know there is no genetic relationship between them either"
We don't know that, we don't have ASI genomes. If anything, they do share a history in the past based on what's shown in this study.

"we can establish that they didnt look like onge"
We cannot affirm nor deny that.

"otherwise you could make a similar argument without evidence and claim Yamnaya or paleolthic europeans looked like Onge."
But you can make this argument. Certainly.

Trinity said...

Bronze "I suspect racists wanted to make the false claim that indians used to look like Onge"

What would be racist about that? Unless of course you hold negative views about the Onge and their physical appearance.

Seinundzeit said...

Personally, I gravitate towards the view that during the Mesolithic era, India was likely to be home to populations of mixed ENA-West Eurasian ancestry, while South Central Asia was inhabited by populations which were essentially West Eurasian (basically, the same situation we see today).

With regard to the samples in this paper, I do see hints of a deconstruction of the ASI concept. 

Just as ANI was never a real phenomenon (but rather an abstraction, based on the confluence of ANE-related, Iran_N-related, and Steppe_MLBA-related streams of ancestry, with varying clines of these sorts of ancestry stretching from Tajikistan/Afghanistan/Northern Pakistan all the way to southern India), ASI will be shown to represent differential waves from Southeast Asia.

Looking at the bigger picture, in a genetic sense India proper is a meeting point for ancestral streams emanating from South Central Asia and Southeast Asia.

Anyway, it will be fun to see how these samples work for all those populations which tend to show percentages of Indian-modal ENA (Iranians, South Central Asians, and obviosly South Asians).

Regardless, I think these samples go a long way in clarifying deeper phylogenetic history for ENA populations.

I mean, East Asians as a mix of sister groups to Onge/Group 1 and Tianyuan!

Pretty cool.


Chad said...

Bronze,

You have no idea what you're talking about. All statistical analysis shows the Onge as the best modern representative of ASI. Maybe read papers on the issue or learn to run analysis before making assertions.

Seinundzeit said...

Bronze,

You can't expect the Paniya to resemble "Negrito" peoples, since they are 30%-40% West Eurasian, and 30%-40% admixture is enough to create a noticeable amount of phenotypic divergence.

Interestingly, there were people in southern India who resembled the Onge.

If I'm not mistaken, they were/are known as the Kadar.

Most of them had Negrito/Papuan/African hair texture, no facial/body hair, Negrito facial features, etc.

Not sure if the community still exists (even in the late 19th century, they were in the process of being absorbed by neighboring communities, and had a history of receiving admixture from neighbors).

Richard Rocca said...

@David... can you provide the Global 25 coordinates for female I1392 from France? She is a very early Bell Beaker from the French/German border and contains no steppe ancestry. She is likely a good proxy for the Copper Age French population that that interacted with Swiss and German Corded Ware.

Ryukendo K said...

People really aren't reading the paper too carefully man.

This paper solves a large number of issues for the linguistic prehistory of Southeast Asia:
1) Now we do know that an Onge-like population lived all across Sundaland, including into Vietnam, prior to Neolithic intrusions.
2) The first Southeast Asian farmers most resembled the Austro-asiatic speaking H'tin and Mlabri hunter-gatherers living in Thailand and Laos, surrounded on all sides by agricultural Thai/Lao-speakers. Also they contributed to the ancestry of Orang Asli Hunter-gatherers speaking Aslian (also AuAs) in the Malay peninsula, and form a ~50% genetic substrate across all of Sumatra, Malaysia, Borneo, Java and Bali. Cambodians are >50% this ancestry pool.
3) Austronesians came down south into island Southeast Asia, probably via island-hopping from Phillipines which we know since Lipson 2014. The distribution is very uneven, some groups (e.g. Mentawai islanders off Sumatra) are almost 100% Austronesian, but the population of the larger islands are at most 50% Austronesian and 50% First Southeast Asian Farmer and some isolated groups like Bidayuh in Borneo still have first Southeast Asian Farmer ancestry dominating.
4) Iron Age Thailand genomes receive 50% ancestry from Sino-tibetan-like East Asians on top of first farmer ancestry and cluster among present-day Mainland Southeast Asian Austroasiatics like Mon, Lawa, Plang, Palaung etc. Great as we know Mon-Khmer people used to dominate whole of MSEA.
5) Bronze Age Vietnamese genomes receive 50% ancestry from a group like Tai-Kradai Zhuang in China (which are 50% Hmong-like and 50% Austronesian-like) and cluster among present-day Thai and Lao and Daic hill tribes. So there was a genomic shift in Vietman in Bronze Age but the Austroasiatic language (Vietic) persisted, and we get a Tai-like genetic contribution in BA Vietnam 1000 years prior to the actual Tai migrations into MSEA.

So we know that the earliest linguistic layer in Southeast Asia is likely to be the Austroasiatics, and MSEA is distinguished by large number of introgressions onto the First Farmer substrate that still doesn't change the AuAs language affiliation, e.g. East Asian ancestry into the (Mon, Lawa, Wa, Plang, Palaung) group and Tai ancestry into Vietics.

Ryukendo K said...

The Hoabinhian ultimately has a genesis in S China and moved South and West, so its possible that Onge-like ancestry reached India on top of some HG groups that were already there. Certainly there are some uniparentals, like Haplogroup H, that are inconsistent with an 100% Hoabinhian-Onge derivation of South Asian HGs.

Likely some "impossible" ancestry may be thrown our way, like for the North African HGs (which are equidistant between WHG and Yoruba, not possible really by the conventional understanding of the African-Eurasian split)

That said we shouldn't understate the sharing between these Hoabinhian HGs and South Asians, if anything they share more drift with South Asians than Onge.

Seinundzeit said...

RK,

"Likely some "impossible" ancestry may be thrown our way, like for the North African HGs (which are equidistant between WHG and Yoruba, not possible really by the conventional understanding of the African-Eurasian split)"

Certainly; in fact, we may find some ancestry which has a specific relation to the non-basal streams seen in Iran_Hotu/Iran_N (and to a lesser extent, CHG).

Bronze said...

@all

What I said previously is true and you guys are wrong, there was a measurement from a previous study that showed the genetic distance (number of generations separated) between onge and ASI was almost as distant as between west eurasians and east eurasians.

So the only reason people make the stupid claim that ASI and onge is related is because No other currently living or studied cluster/population is as close, but thats just because Onge has been isolated for extremely long and most of its close relatives where wiped out/replaced.

So its quite obvious the original pure ASI did not resemble Onge/andaman islanders phenotypically. End of story.

zulla said...

@Chad
Basu et al 2016 give ANI ancestry of 97.9% in North Indian Khatris with ~1.5% ASI. Why does their model give such low ASI? Thanks.

zulla said...

@ Bronze

"What I said previously is true and you guys are wrong, there was a measurement from a previous study that showed the genetic distance (number of generations separated) between onge and ASI was almost as distant as between west eurasians and east eurasians."
Which study?

Spanked said...

@SeinundzeitInterestingly, there were people in southern India who resembled the Onge.
If I'm not mistaken, they were/are known as the Kadar.

They're still around.
http://www.geocities.ws/iris_com2002/images/tribal/b_kadar_tribe.jpg

I don't understand why Bronze is so horrified and disgusted by the Onge that he calls people who imply a relationship "racists". I think some look quite nice.
https://www.trekearth.com/gallery/Asia/India/East/Andaman_and_Nicobar_Islands/Little_Andaman/photo575208.htm

Anonymous said...

@Bronze
Nothing you said matters, the genetic distance between Modern Europeans and WHG is immense, the same as Modern Europeans compared to the Chinese, and this doesn't make the story any less true.

Michalis Moriopoulos said...

@ Bronze
"So its quite obvious the original pure ASI did not resemble Onge/andaman islanders phenotypically. End of story."

I do know one undated skull from Sri Lanka in W.W. Howell's collection had an Australoid appearance. He went so far as to remark in his notes: "Looks just like an Australian aborigine." Based on the way modern ASI-heavy people look, I wouldn't be surprised if "pure" ASIs looked similar to Australo-Melanesians.

Check these tribals out:
https://upload.wikimedia.org/wikipedia/commons/b/bc/CTSI-shl.jpg
https://i.pinimg.com/736x/b6/fe/4a/b6fe4aa1a071bee2760c6683e57063da--african-men-cgi.jpg

The Onge have a distinctive appearance all their own, but a lot of time has passed since the Mesolithic. Selection might have acted strongly on their appearance since that time. The authors of the new study point out that the Hoabinhian from the Pha Faen site in Laos was relatively tall (176 cm, which is about 5'9"). Obviously the Onge are much shorter than that. In any case, I would never be so foolish as to unequivocally declare that ASI did not resemble Onge. The Negritos continue to surprise in every new study in which they're featured.

Seinundzeit said...

Spanked,

"I don't understand why Bronze is so horrified and disgusted by the Onge..."

Well, probably because he is racist... not much we can do about that.

Jijnasu said...

Too much importance is given to phenotype. Far more than it deserves. I don't understand the bronze's problem with the ASI resembling andamanese islanders. However I don't think it is true that the andamanese, SE asian'negritos', melanesians etc. all lool the same (unless one believes that all dark skinned people look alike.), The ASI probably had physical features in common with peoples to the east but were likely unique looking in other ways as well.

Ryukendo K said...

@ Bronze

You're wrong, and misinterpreting research.

You are probably referencing Metspalu et al 2011, which found a South Asian ADMIXTURE component that is equidistant between West and East Eurasians. But thats an ADMIXTURE component, not an actual ancient population. Unless you think that South Indian Dravidian tribals like Irula are unmixed mesolithic descendants and thus represent "ASI". They don't.

The ASI-ANI model still stands, and is very robust. Here are the facts. There is a cline on PCA between Kalash and Irula on which all Indians fall, with Kalash pointing at Europeans and Irula pointing in the general direction of East and Southeast Asians plus Oceanians (the Onge are close to being at the end of the cline like how Europeans are at the Kalash end, but are somewhat offset as Arza shows).

As we move down the cline from Kalash to Irula, the following happens:
Relatedness to Europeans declines.
Relatedness to West Eurasians declines.
Relatedness to East and Southeast Asians and Oceanians increases.
Relatedness to Onge increases compared to Dai.
Relatedness to Onge increases compared to Papuan.

Tell me, how can you explain this without an Onge-like contribution in ASI?

qpGraph tells us that ASI split off almost immediately after Onge and East Asians split. East Asians look nothing like Onge today, so why should we expect South Asians with high ASI to look like Onge?

Reich et al 2009 has been cited hundreds and hundreds of times, they were the ones who introduced things like f4 stats and f4-ratio estimation. People who wanna talk abt S Asian genetics should familiarise themselves with the basic issues that need explaining. The likes of Moorjani, Thangaraj and so on are still referencing it.

Anonymous said...

@ryukendo kendow
Just to defend Bronze now in this continuous bashing:
What you said is all good, but I don't think the Onge influenced in the ASI - it looks like much more to be the case where the ASI and Onge had a recent common ancestor, and that they plot close/have similarities because of that, not because of flow to one population from another.

@Mike the Jedi
You were spot on with your images. The ASI in my mind looks like the first photo, but a little less rugged, maybe.
https://imgur.com/a/AZEdm

Seinundzeit said...

RK,

(Just to take the conversation away from the stupid discussion on phenotype)

What are your thoughts on the modelling of East Asians as Onge-related and Tianyuan-related?

Ryukendo K said...

@namedguest

No disagreements, the Onge-like part of ASI split almost immediately after Dai did, so the migration took place really long time ago, or they share distant common ancestor, or they influenced each other in ancient times etc, its all the same to me bro ¯\_(ツ)_/¯

@ Davidski

You might want to check out the Khonda_Dora genomes in the SGDP dataset in this paper, if their plotting position on PCA is accurate they might be one of the most extremely ASI populations ever found, much more ASI than even Irula.

Nirjhar007 said...

Ryan ,

Maybe :) .

all,

Again we don't have the aDNA from S Asia , so it is hard to say . My bet is that large proportion of the current ''West Eurasian'' ancestry in S Asia is also very deep ...

Davidski said...

@ryukendo kendow

You might want to check out the Khonda_Dora genomes in the SGDP dataset in this paper, if their plotting position on PCA is accurate they might be one of the most extremely ASI populations ever found, much more ASI than even Irula.

There's a Khonda_Dora individual from the SGDP in the Global25 datasheet.

Rob said...

RK
To go back to Mats comment- are you in fact one of the authors ?

Ryukendo K said...

No.

Bronze said...

@namedguest

everything I said matters, its fact. ASI did not look anything like Onge and is not related to them in any meaningful way. deal with it.

Bronze said...

@mike the jedi

Yep its quite obvious pure ASI is more similar and related to australoids in a greater sense even though the genetic distance makes it appropriate to give ASI its own classification, maybe veddoid. Probably similar to certain australoids but not the same in appearance given how asi heavy south asians look.

Onge and other negritos are obviously more related to south east asians and mongoloids in general considering their rounded heads and facial features. But even so they are distant from everyone and should be considered their own race (negrito)

Anonymous said...

@Bronze
Well, it doesn't matter, and you have run out of arguments, everyone here already explained why you're wrong, if you didn't understand by now, then you're just a waste of time.

Bronze said...

@ryokendo kendow

No im correct. Youre wrong.

Im talking about a graph which came from an relatively old study which showed the branching between ASI and Onge, I dont have time to look it up now but I remember it clearly, and the time depth since the branching time elapsed of Onge and ASI was almost as large as between east and west eurasians. Onge likely just represents some heavily drifted and isolated population that isnt closely related to anything else other than other negritos and south east asians to a lesser extent, which this new study supports with the genetic tests.

Bronze said...

No, you lost the debate since you didnt counter my argument with anything legitimate. So its you and the others who agreed with you who are wrong and wasting the time of others. ASI did not look like Onge and is not related to Onge in any meaningful way. Fact.

Rob said...

Oh my guess would have been you were, given you Segwayed your previous predictions with the abstract of the paper on AG.
But thanks for the summaries nonetheless- very useful !

Chad said...

Statistical analysis says you're wrong. Deal with it and piss off. You're a distraction from meaningful conversation.

Ryukendo K said...

@ Bronze

Why don't you pull out the study eh? Lets look at what it actually says.

Ryukendo K said...

@ Rob

Lol I wish. Maybe one day I'll get there :beerchug:

Ryukendo K said...

@ Sein

There are more and more incompatibilities, e.g. we "knew" that East Asians could be ANE + Onge, and now (with a very similar set of outgroups!) we "know" that East Asians are Tianyuan + Onge (Maybe because of the Tianyuan ---> Goyet thing?? Grasping at straws). We also "knew" that Onge and East Asians form a clade to the exclusion of Papuan but then we got a trifurcation, and now we have this.

Maybe we need a "total qpgraph" or "consensus qpgraph" with extreme amounts of reticulation, like what Chad was working on, and every time new genomes come out, once we are confident some subset of genomes form a clade (e.g. >0.99 correlation in outgroup f3 with all other populations) we keep adding metapopulations to this huge, unwieldy thing to reflect some kind of big pic consensus opinion.

Bronze said...

@Chad Rohlfsen
No it doesnt, it proves me correct and proves you wrong. Piss off yourself.

Bronze said...

Here you go:
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2842210/bin/nihms137159f4.jpg

Rob said...

So what do we think this suggests for what Mesolithic India might look like ?

Ryukendo K said...

@ Bronze

Well, you're a visual thinker probably, you were comparing the length of the lines weren't you?

Shared drift between ASI and Andamanese = 0.0286
Shared drift between ANI and Central European = 0.0320

ASI and Andamanese share (28.6/32) = 89% of the shared drift that ANI and CEU have with each other. Gotta read the numbers my man.

Davidski said...

This Khonda_Dora genome is interesting. Only shows 0.04% Iran_N admix in the Global25, and is preferred as the ASI proxy by all South and South Central Asian pops that I tried, over Onge, Dai, Han, Tianyuan, you name it.

Bronze said...

btw mike what do you have to say about this piece of info regarding the physical characteristics of mesolithic south asians?

”People from the Gangetic plains sites of Sarai Nahar Rai and Mahadaha were described by Oakley as looking like European Cro-Magnon skeletons, and by Kennedy as resembling other Mesolithic South Asians and Eurasians.

This stuffs from God-Apes and Fossil Men: Paleoanthropology of South Asia, by Kenneth A. R. Kennedy

Sarai Nahar Rai

- massive and robust skulls
- mescrany to hyperdolichocrany
- moderately elevated vaults
- prominent supraorbital region
- moderately large to very large mastoid processes with well-defined supramastoid crests
- sharp temporal lines that extend posteriorly to the parietal bones
- development of the nuchal crests varies from slight to pronounced, some male skulls exhibiting very large external occipital protuberances
- parietal eminences are very large for skulls of both sexes
- broad faces for males and females (hypereuryprosopic and hypereuryene)
- narrow to broad noses (leptorrhinic to chamaerrhinic)
- low and rectangular (chamaechonchic) orbits
- low foreheads, vertical n the males or slightly bulbous in the females
- deep nasal notches
- slight projection of the face (mesognathism)
- alveolar pronathism only in some individuals
- malar bones are robust but variable in size
- great interorbital distance
- deep broad (brachystaphaline) palates
- mandibles are massive with everted gonia and well marked pterygoid attachments
- prominent chins

Kenedy states that apart from the chin the mandibles resemble the Heidelberg and Ternefine (Tighenif) mandibles.

Mahadaha

- dolichocranic to ultrdolichocranic vaults
- moderate (metriocranic to tapeinocranic) cranial elevation though one female has a relatively high (acrocranic) vault
- males exhibit large supraorbital tori, sharp temporal lines, well-defined nuchal crests with prominent external orripital protuberance, and massive mastoid processes with well-marked supramastoid crests
- broad (euryprosopic) faces
- narrow to broad (leptorrhinic to hyperchamaerrhinic) noses
- high and rounded (hypsiconchic) orbits
- narrow (leptostaphyline) palates
- massive but narrow or compressed (dolichostenomandibular) lower jaws
- moderately expressed midfacial and alveolar prognathism”

Bronze said...

@ryukendo kendos

no, I was mainly referencing this piece of info that was also included in the study:

"2,000 gens ago Split of ANI and ASI ancestors, 1,700 gens ago Split of Asian populations (‘proto-East Asia’, ASI, and Onge)"

So regardless how you try to spin it, ASI is extremely distant from Onge, so I was correct from the beginning.

Bronze said...

Also, CEU and ANI have not been isolated and drifted like Onge has, which speaks even stronger in my favor.

Anonymous said...

@Davidski
Better than the Gond?

Kristiina said...

From the mtDNA poin-of-view, there is a difference between India and Southeast Asia that mirrors the earliest settlement patterns.

R is the oldest haplogroup in India. The age of R30 and R31 in India is c. 60 kya while the the age of R22 and R24 is only c. 39 kya and 23 kya in Jehai Negritos and Senoi in Malesia.

”Attending to the age of R and its main branches, there are not significant differences among the founder ages of macrohaplogroup R in the five major regions. However, mean radiation age in southeast Asia (17.6 ± 10.4) is significantly younger than those in West-Central Asia (51.2 ± 6.8), East Asia (48.8 ± 1.5) and Near Oceania (61.3 ± 6.4), but not different from the South Asia mean (47.0 ± 10.7). The comparatively late expansion of R in southeast Asia is against the southern coastal route hypothesis and contrasts with the deep age of macrohaplogroup M in the area and with its westward gradual decline from Near Oceania to South Asia.”

While R is the oldest haplogroup in India, M is older in Southeast Asia and Oceania than in India.

”Not only is the founder age of M in East Asia significantly greater than in South Asia. … In turn, the founder age for M in Oceania was significantly older than the ones estimated for East Asia and southeast Asia.”

Age of mtDNA M in South Asia 40.2 kya, in East Asia c. 52 - 55 kya, in Southeast Asia 60.7 kya and in Oceania 68.3 kya.

The both huntergatherers in this new paper carried M haplogroups: Hoabinhian hg Pha Faen Laos, La368 calibr. date 7888, M5 and Hoabinhian Phase 1 Gua Cha Malesia, Ma911, calibr. date 4319 M21b1a.

M5 is today found in Orissa, India, in particular in Chenchu and Koya (25%). M21 is the typical haplogroup of West Malaysia's Orang Asli, Semang and Batak, Senoi, Abor. Malay and Jehai.

Therefore, the oldest settlers in India may have been carriers of mtDNA R and yDNA H (?) and they were probably autosomally diffferent from the southeast Asian huntergatherers. Then, at some point, M started flowing to India and brought India and Southeast Asia autosomally closer to each other.

Ryukendo K said...

@ Bronze

Dude, they only used definite split times for their *coalescent simulation*.... this is not a validated result. This is a parameter they used for computational validation using simulated genomes of their f4-stat methods...

You obviously have no idea how to read scientific literature, first looking at lines in the picture and now this.

Also note that the splits between East Asians, Onge and Papuans likely ocurred extremely soon after that whole group split from West Eurasians, does that mean they are "not meaningfully related"? East and West Eurasians split very quickly after Mota, does that mean OOA populations are "not meaningfully related"? Not how this kinda stuff works...

Bronze said...

@Kristiina

Sounds accurate, good stuff and I agree completely.

Davidski said...

@namedguest

Better than the Gond?

No, but Gond is up to 25% more West Eurasian and something ANE-like than Khonda_Dora.

Gond
Khonda_Dora 74.4
Iran_N 18
MA1 7.6

So ASI is just Khonda_Dora with minor ANE-related ancestry.

Ryukendo K said...

Last thing, isolation and drift is not something that affects the *phylogenetic* relatedness of populations. Some South American hunter-gatherer groups are so isolated and drifted that the fst between them are some of the largest on Earth. Same for many groups of European hunter gatherers.

Does this mean they are not "meaningfully related"? You probably have very little idea of what "related" even means in pop gen...

Bowing out, its a waste of time to talk to you really.

Bronze said...

Its you who are dumb, listen to yourself, In that very same post you claimed that ”east asians papuans and Onge are all meaningfully related” which shows you are completely lost as to what you are talking about.

Those groups you mentioned are not reöated in any meaningful way other than determining huma migration patters after OOA. They are genetically extremely distant from each other and look completely different, just like how ASI and Onge are extremely distant in phenotype and genetic distance.

You obviously have a very different view sbout what a ”meaningful relation” is, which is hilarious that you dont see how much you are reaching with that perspective, especially since we are discussing phenotypes here.

Bronze said...

@ryukendo kendow

Yeah bow out, you lost the debate, ASI and Onge are not meaningfully related when discussing phenotype considering the timegap of branching and genetic distance is extremely large between them, none of the ancient human skeletal remains in south asia even resemble Onge and modern south asians do not resemble them either.

So what I said from the beginning was true.

Matt said...

Spanked: I don't understand why Bronze is so horrified and disgusted by the Onge that he calls people who imply a relationship "racists". I think some look quite nice.

Yeah, not sure about the "racists want ASI to look like Onge", as why they would want that as opposed to an equally small people (which is the overwhelming likelihood based on West Eurasian->ANI->ASI stature clines), who were still highly adapted to the tropics (dark skin, broad nasal aperture) who just cranially looked more like Australian Aborigines or Papuans (more robust).

(Whatever disputes about likely phenotype, the ASI (pre-Neolithic) were likely extremely short, extremely dark, extremely broad featured, beyond any present day populations, whether they had a more gracile cranial phenotype on top of that like the Onge or were more robust like Oceanians. Almost everything from present day information on populations from hot climates and our knowledge of clines of phenotype in India today supports this).

Arza said...

@ Davidski
This Khonda_Dora genome is interesting. Only shows 0.04% Iran_N admix in the Global25, and is preferred as the ASI proxy

Why East Asian shifted population without (or with a small) Iran_N admixture is a better proxy than Iran_N shifted population without East Asian admixture?

Why the total lack of Iran_N is required to be a good ASI proxy?

Khonda_Dora:Khonda_Dora_G251
Paniya:PNYD3 65.6%
Cambodian:HGDP00711 30%
Paniya:PNYD9 4.4%

Distance 3.4541%

Paniya:PNYD3
Khonda_Dora:Khonda_Dora_G251 88%
Iran_N:WC1 12%

Distance 10.2652%

zulla said...

@kristiina

Thanks for that. What is the hypothesis on the route taken by MtDna M from Africa to Oceania? Or did it not originate in Africa?

Kristiina said...

@Zulla

I do not believe in the origin of mtDNA M in Africa but this opinion of mine is not a proven fact.

There is a chapter about M1 in the paper I referred to in my comment: "About the origin of the North African haplogroup M1"
(https://bmcevolbiol.biomedcentral.com/articles/10.1186/s12862-016-0816-8)

There is a lot of uncertainty about the deep origin of macrohaplogroups but I find the graph on p. 54 and 55 of this recent paper "Carriers of mitochondrial DNA macrohaplogroup L3 basic
lineages migrated back to Africa from Asia around 70,000 years
ago" interesting (https://www.biorxiv.org/content/biorxiv/early/2017/12/13/233502.full.pdf)

Matt said...

A few basic PCA and plots using the f3 statistics from the table supplement:

Covariance PCA: https://imgur.com/a/uYW0q

(Note: Group1, the Hoabinhian, is not very powerful in covariance because modern populations share with them to a very low degree by f3 measurement relative the other groups.)

Group2 and Group6 are quite important as the seem to be the two populations who have contributed relatively strongly to modern populations in this PCA, but in an uncorrelated way.)

Correlation PCA: https://imgur.com/a/eItEQ

(Group1 is more important here, because although it shares very little with modern people, its patterns are not very correlated with the other groups, who are very correlated with each other.)

F3 plots of groups against each other: https://imgur.com/a/IMU5u

It may be worth noting that Onge share no more with the Hoabinhian group (Group1) by f3 than they do with any of ancient groups that look East Asian. Onge only high sharing with Group1 relative to what East Asians have with each other; Group1 not high with Onge compared to other groups.

I'm somewhat unsure of how that is explained if Group 1 is a clade with Onge. (Opposed to "ENA" groups that went a separate way).

Language groups can be quite diverse in affinity to the remaining East Asian ancient groups.

The overall levels of Yoruba outgroup f3 for the "ancient East Asians" do seem quite high for many of these populations compared with West Eurasian ancients->modern. This is probably consistent with lower diversity therefore more sharing in ancient East than West Eurasia, with a similar impact of the pooling of diversity from structured populations with Neolithic->Metal Ages->Recent transitions?

@Ryu, thanks for the summary also (I did read through the paper but the focus is slightly different at times, so your comment brings some other things into focus).

Arza said...

Assuming that Paniya:PNYD3 is a mixture of 70% ASI and 30% of averaged Iran_N we can get:

G25, scaled:
Paniya-30_ASI,-0.007414762488387,-0.282433238870025,-0.210099312886268,0.196421855636842,-0.013892709701752,0.070296382822842,-0.00805079300087,0.028357060140218,0.166523308282054,0.10434319017315,0.004041151903876,0.001738452645314,-0.008766733112704,0.027064481576187,-0.057208045572173,-0.061085919333479,0.028934082903548,0.00231659144399,-0.013593346361589,0.067371600817664,0.006691762742052,0.031563245005829,-0.006525086328931,0.027036524391411,-0.015457980599427

Paniya
ASI 66%
Iran_N:WC1 34%
Cambodian:HGDP00711 0%
Distance 2%

Santhal
ASI 61.4%
Iran_N:WC1 25.4%
Cambodian:HGDP00711 13.2%
Distance 2.4672%

Asur
ASI 60.2%
Iran_N:WC1 27.4%
Cambodian:HGDP00711 12.4%
Distance 2.5872%

Ho
ASI 58.6%
Cambodian:HGDP00711 21.6%
Iran_N:WC1 19.8%
Distance 2.1602%

Gadaba
ASI 54%
Cambodian:HGDP00711 28.2%
Iran_N:WC1 17.8%
Distance 2.1864%

Khonda_Dora:Khonda_Dora_G251
ASI 52.4%
Cambodian:HGDP00711 30%
Iran_N:WC1 17.6%
Distance 2.7708%

Bonda
ASI 51.8%
Cambodian:HGDP00711 33.6%
Iran_N:WC1 14.6%
Distance 1.7732%

Bronze said...

@Matt

Wrong, its not an overwhelming likelyhood at all that ASI where extremely small, look in the comments above where the current ancient skeletal remains are described.

And that ASI where extremely dark and had extremely broad features is not likely either, most likely they had very diverse facial features but more on the robust side and a varying range of brown skin. Not darker than that ancient briton that was revealed recently, and certainly not as dark as Onge.

Alberto said...

I hardly had time to look at the paper in detail, but I thought I'd share this experiment I did before. It uses an ASI ghost which is basically a straight line through the SC Asian cline on Global 25. And then I used the same setup as in Lazaridis 2016 for all the SC Asian populations (Onge + Han + Iran_LN + EHG, and the ASI ghost). The point was to see in which direction each population went in their ENA component (the West Eurasian side is irrelevant here, just used the most simple setup that works and is backed by literature).

https://docs.google.com/spreadsheets/d/1EmfmqSBzrbbiyyurpG6Vwu0uB7jCAtp6oqV6CGhtpMI/edit?usp=sharing

Austroasiatics clearly deviate, not just towards Han, but mostly towards Onge. While most Indo-Aryan and Dravidian populations stick to the ASI ghost (Pulliyar and Paniya being Onge-shifted, as noted by Arza).

On another side note, I don't care to debate about phenotype or whatever in ASI, though there is a paper about Mesolithic Indian remains for those interested. Stature is high, apparently:

https://muse.jhu.edu/article/48021/summary

Matt said...

@Bronze, clines within India populations suggest that stature falls and "nasal index" rises with ASI, even over what are a relatively small range of differences compared to what ancient people have. This is obvious: https://tinyurl.com/y7hx3o8s

A "pure ASI" would be beyond any of the averages on the list I've linked above. E.g. assume "Caste" has 40% ASI and stature 165cm, and "Scheduled Tribe" has 60% and stature 159.75 and ASI is then estimated 150cm (if you assume "Caste" 30% ASI and "Scheduled Tribe" 70%, then the 100% ASI 152). Andaman Islands estimate is about 148cm there.

Small stature makes sense, following Bergmann's Rule (though there have been exceptions). Unless anyone's totally obsessed with whether the ASI were 1 inch or 2 inches taller than the Onge, or exactly the same height, or whether they're ten "melanin units" lighter and so have a very, very dark brown tone rather than a cool blue-black look there doesn't seem much to debate (and there's barely any genetic angle in doing so).

(no more comments after this replying to you, you're not winning arguments here. you're ignoring what others say and then declaring you've won. what's the point of debating with such a person? and this kind of debate about physical characters can become very bad for reputation on comments on here unless it is conducted rationally).

Bronze said...

@matt

Thats a really bad argument, considering the fact that there are african americans (who are on average 15% european) with just as wide or wider noses compared to ”pure” africans in sub saharan africa.

The cline in India of the various sub populations is not only affected by ASI, also austro-asiatic affect certain populations in south asia more or less, so you cant just make sn assumption that pure ASI has wider noses compared to these varying populationd when they have many different components that ASI didnt have. Same thing applies to skin color, ASI where not even close to as dark as Onge.

Youre also wrong about height, we are not talking about one or two inches difference but the fact that ASI where much taller than Onge or any kind of negrito, as dsiplayed by mesolithic remains in South asia.

Ive won every argument here in the comment section, and this is no different, you lost as well and you ignored the facts I mentioned. get over it.

Karl_K said...

@Matt

"(no more comments after this replying to you, you're not winning arguments here. you're ignoring what others say and then declaring you've won. what's the point of debating with such a person? and this kind of debate about physical characters can become very bad for reputation on comments on here unless it is conducted rationally)."

Exactly. If people make illogical arguments to support their unwavering pointless stance, then what is the 'debate' really about? Nobody wins at that game, everyone loses.

zulla said...

@alberto

Interesting paper. The samples are from Mesolithic Haryana, 120miles from Rakhigarhi.

"Collectively, North Indian Mesolithic groups are significantly taller than Eastern or Western European Mesolithic samples."

zulla said...

@kristiina thanks for the links

Matt said...

All, Reich lab have dropped a new paper on the same subject: https://www.biorxiv.org/content/early/2018/03/10/279646.full.pdf+html
- "Ancient genomes document multiple waves of migration in Southeast Asian prehistory"

(These labs are very "keen" to get their results out at the same time... ;) )

Ron said...

"Collectively, North Indian Mesolithic groups are significantly taller than Eastern or Western European Mesolithic samples."

does that mean iran neolithic people were very tall people?

Matt said...

From the new Reich paper, I've taken the Dinka outgroup f3 stats and plotted them against each other and used for PCoA: https://imgur.com/a/eNB7G

(Removed most of the ancients as rows, because there seems to be weird ancient attraction / repulsion phenomena going on with them, and the Thailand_BA and Cambodia_IA seem to be producing straight up questionable stats).

Alberto said...

Hhmmm, what do I know about Y-DNA, but I2a2a2a (I2-P78?) in Neolithic Vietnam (1700 BCE)? Is that something expected?

Ryukendo K said...

Lol thanks for the link Matt. Usually the labs release simultaneously but it seems here one lab's hand was forced or smth.

Few interesting things:
1) The East Asian clade is bifurcated into a clade that contributes to Atayal and to the first Southeast Asian Farmers (lets use FSAF), i.e. the Mlabri- and Htin-like farmers. The other half contributes to Mixe and 10% to Dai, who otherwise is quite Southeast Asian. Quite interesting because here, northern East Asians are more related to the ENA of Native Americans than to their Southeast Asian cousins.

2) The 27%-30% of the ancestry of the FSAF that are not closely related to Ami is modeled by McColl as branching closely to MSEA HGs and as having 4-6 shared drift with Onge, but in Lipson they share 0-1 shared drift with Onge, so we need to resolve that at some point. Also for the EAs = Tianyuan + Onge, the % Tianyuan that goes into Ami in McColl varies from tree to tree (4-~30%), it seems like when the number of deg freedom is very large the trees become kinda unstable, so much for the "consensus tree" idea.

3) Lipson (2018) finds that 100% of Mlabri, 94% of the ancestry of Nicobarese, 60% ancestry in Borneo and 37% of the ancestry of Juang in India are identical to FSAF, and divide the sections into (Mlabri, Juang) and (Borneo, Nicobarese) with the FSAF sample (Vietnam Neolithic) as outgroup. Very interesting because, on the linguistic side, (Nicobarese, Mon, Aslian, Bornean substrate language), or Southern Mon-Khmer, are purported to be a clade in both of Diffloth's classifications.

4) Matt, you might wanna take a look at this, in Supp Matt ADMIXTURE in McColl et al at K=13 we get a component that maximises in Naxi, i.e. "Sino-Tibetan" component with high specificity for the first time! Whoooop whoop whoooo

75% in Naxi, >50% in Yi, Tujia, Tu, Han N China, Jinuo, Karen etc. 100% of the E Asian part of Pahari, Spiti and Kusunda belong to this component. So I was saying for some time that we are getting closer and closer to showing that Tibetoburman was also spread by some migration from the foothills of the Himalayas.

Now for the Oakaie ("Myanmar LNBA") genomes in Lipson et al we see that they are FSAF + new ancestry component in EAs, and that on PCA they fall between Burmese and Naxi!

Getting closer...

Ryukendo K said...

@ Matt

Thanks for the plots. The difference between Vietnam NE (FSAF) and Myanmar LNBA (Oakaie) loads most strongly on Naxi and Chinese Han Beijing (CHB), between Myanmar LNBA and present-day Burmese is the same, while the difference between Vietnam BA (Dong Son Culture most likely, like the BA samples from MacColl) and FSAF loads most strongly on Miao, Ami and Atayal, so Lispon reflects McColl's results very well indeed.

Ryukendo K said...

@ Kristiina

Methinks the Mesolithic Indians will come out like Native Americans, where the male uniparentals come from one parent population (H, weird F, weird C) while the mtDNA and autosome is dominated by clades related to East Asians and Oceanians, with only a minor fraction of the mtDNA of the still poorly-characterised earliest inhabitants of the subcontinent.

Nirjhar007 said...

does that mean iran neolithic people were very tall people?

Iran Neolithic ? lol .

Hhmmm, what do I know about Y-DNA, but I2a2a2a (I2-P78?) in Neolithic Vietnam (1700 BCE)? Is that something expected?

LOL.

Ebizur said...

Wibhu Kutanan et al. (2018), "Contrasting maternal and paternal genetic variation of hunter-gatherer groups in Thailand" (Scientific Reports, volume 8, Article number: 1536, doi:10.1038/s41598-018-20020-0) was published online on January 24, 2018. The authors found that all four of their male Maniq (Austroasiatic-speaking "negrito" in southern Thailand) samples from three different sampling locations shared identical Y-chromosomes that belonged to haplogroup K.

Haplogroup K2b1-P399, whose nearest known outgroup is haplogroup P, predominates in most indigenous populations of Near Oceania (although descendants of haplogroup C1b2-B477 also have been found among them with notable frequency). In the Philippines, too, it appears that most Y-DNA that belongs neither to any generically East/Southeast Asian subclade of NO-M214 nor to any clade that plausibly may be ascribed to recent colonial influences belongs to haplogroup K2-M526 (cf. Karafet et al. 2010 & Trejaut et al. 2014).

These facts should be considered before jumping to conclusions on the basis of the present study's finding of Y-DNA belonging to haplogroup C and haplogroup D in two ancient specimens associated with Hoabinhian artefacts. Even if not members of K2a-M2335, members of some subclade of haplogroup K2 must have been present in Palaeolithic Southeast Asia alongside members of haplogroups C and D.

Matt said...

Interesting stuff Ryu.

Quick comment on: The difference between Vietnam NE (FSAF) and Myanmar LNBA (Oakaie) loads most strongly on Naxi and Chinese Han Beijing (CHB), between Myanmar LNBA and present-day Burmese is the same

Comparing the two (https://i.imgur.com/Mwu5oDH.png) present day Burmese samples mostly look generally loaded away all other East Asians by f3, compared to Myanmar LNBA. However I'd guess this probably reflects enrichment of ancestry from India/Bangladesh (any admixture from a deeply divergent clade hits f3 hard).

Relatively speaking among East Asian groups shifts between the LNBA and recent look suggestively like they could be towards both Dai / Miao / CHB, away from Borneo / Naxi reflecting more drift from clines entering SE Asia from an Eastern direction? So yeah, tentatively this agrees with the results suggesting wider networks of admixture over time and the kind of predominant vector identified by McColl.

For the Burmese case, though, I'm actually wondering if there really *was* a first MSEA farmers (FSAF / Austroasiatic) -> later farmers dynamic, autosomally speaking.

It seems like Upper Burma is relatively sheltered by mountains (the Shan Plateau(?) to the east) and not really downstream of the Mekong route Austroasiatics may have followed the SE China.

So I wonder if the people from what looks like it could be the Sino-Tibetan urheimat might have actually been the first to ride the Neolithic wave to Upper Burma. Even though there are obviously later subsequent, substantial migrations and any first people from a Sino-Tibetan direction weren't actually necessarily the Pyu people, who are the first recorded people whose language we have a handle on.

At the same time proper Austroasiatic (FSAF) migrations enter via a route around the coast of Lower Burma, in the form of the Mons?

Matt said...

Ryu: Matt, you might wanna take a look at this, in Supp Matt ADMIXTURE in McColl et al at K=13 we get a component that maximises in Naxi, i.e. "Sino-Tibetan" component with high specificity for the first time! Whoooop whoop whoooo

75% in Naxi, >50% in Yi, Tujia, Tu, Han N China, Jinuo, Karen etc. 100% of the E Asian part of Pahari, Spiti and Kusunda belong to this component. So I was saying for some time that we are getting closer and closer to showing that Tibetoburman was also spread by some migration from the foothills of the Himalayas.


Yeah, it look like to get that to happen, first ADMIXTURE needed to split off a Japanese-Ryukyuan component (pale green) from the generalised China-Japan-Korea component (dark green), then a Hmong dominant component linked to Southeast China (purple) which picks up some Austronesian component, all of which makes sense.

The remaining southwest / Himalaya Foothills component is then the dark green Sino-Tibetan that maximizes in Naxi. I would not coincidentally, this also maximises as a % of total East Asian components in Pahari (Indo-Aryan speaking, mostly South Asian autosomally, Tibetan foothills).

Ryukendo K said...

@ Matt

For the Burmese thing, I think its parsimonious to think AuAs people stretched from Vietnam to India at one point since the Mundas are found there. And also the Juang split preferentially from Mlabri in Lipson et al.

Yep I noticed the Spiti, Kusunda, Pahari thing too.

Honestly, once someone gets their hands on this dataset plus the HGDP, SGDP and Human Origins datasets and add Onge, Sherpa and also write to the Japanese researchers for the Ainu genomes, one can use modern-day variation to reconstruct the Tibetan HGs using allele frequency algs like ADMIXTURE, which is what the High-Altitude Adaptation paper did, and then use formal stats to prove, incontrovertibly, a contribution from diverged highlander HGs to Han N China. Would attract quite a bit of attention even if this is just blogged, methinks.

Matt said...

RK: For the Burmese thing, I think its parsimonious to think AuAs people stretched from Vietnam to India at one point since the Mundas are found there. And also the Juang split preferentially from Mlabri in Lipson et al.

Good counterpoint, though I would say that it does seem pretty consistent if the Munda/Juang groups got to India from a fairly coast hugging migration along Arakan that avoided Upper Burma, and Austroasiatic and Tibeto-Burman speaking populations may have been somewhat separate within the Burma prior to reaching India.

I was thinking something about the antiquity or non-antiquity of Nagas might tell us something as well though (e.g. Neolithic / post-Neolithic branch off?)

One thing I would say about the proportions within McColl's models is that the models which find 4% Tianyuan-like have *colossal* f2 drift in the shared proto Tianyuan-East_Asian_T_Related branch, at 114 f2 units, while 32% has about 16 f2 units, and 6 f2 units in the private East_Asian_T_Related branch.

Functionally, this means that in either case, Ami get about 5-7 f2 units that are not shared with the Group1+Onge clade, while Group1+Onge share about 2-3 f2 units, and then Onge goes on to have about 75-79 f2 units of private drift.

Essentially, most (95%?) of the Onge drift from East Asians is private and not shared with the Group1 (Hoabinhian) samples.

Matt said...

Why I'm thinking about the Burma case so much I guess is that so far in mainland SE Asia, you kind of have all the language change really within history: the initial Austroasiatic SE Neolithic picks up ancestry from groups migrating from SE China after the Neolithic, but none of this catalzyes any language change until the expansion of the Tai-Kadai languages to Siam and Laos and the emerge of polities centered around their speakers (e.g. prior to this point the region Austroasiatic speaking and presumably migrants ultimately adopted language and were absorbed into groups).

All the change is within history and fairly easily explicable. We know there was a wider Austroasiatic speaking region, and we understand its regress due to expansion of Tai-Kadai.

But in Upper Burma we only really have the first language as Pyu, a Tibeto-Burman variant, and we don't really know when Tibeto-Burman / Sino-Tibetan began expanding. Could have been Neolithic - if the group started in Himalayan foothills and reached North China, it had to do so by at least 1200BC (3200 YBP) when we first Chinese inscriptions, and clearly it would seem like it should have been evolving separately from of languages in the group by that point, one way or another. One estimate of divergence time within group for Tibeto-Burman / Sino-Tibetan is 4000 YBP (to reach estimated divergences). Even the oldest Neolithic MSEA Asian samples that are plausibly representative of an Austroasiatic wave of migration are about 4100 YBP.

Matt said...

At the risk of beating a dead horse and be a bit obsessive, with regard the MSEA situation, I guess I just want to check assumptions before we are too quick to go with Lipson's 2018 analogy ("In a striking parallel with Europe, later sites from across the region show closer connections to present-day majority groups, reflecting a second major influx of migrants by the time of the Bronze Age.")

In the European Neolithic we have:
1) evidence of the spread of the Neolithic across Europe thousands of years prior to what appears to be archaeologically expansions linked to the steppe
2) the linguistic divergence times and actual shared root vocabulary of Indo-European are in contradiction with an early Neolithic dispersion or time of expansion (present day IE could not have been expanding at this time, contra Renfrew)
3) this is backed up by a wealth of ancient evidence showing that the early Neolithic reached to the almost all regions of Europe, barring the Baltic, in some form, well prior to the dates of linguistic Indo-European expansion
4) we now have genetic data showing that the first farmers everywhere roughly formed a clade (barring local HG admixture).

In mainland SE Asia:
1) the Neolithic expansion dates, and the basis of earliest evidence of agriculture, to around 4.5kya
2) the best guess divergence times for expansion of Austonasiatic language and Tibeto-Burman languages are both rooted around 6 kya
3) Upper Burma specifically is in a region that could well more accessible to expanding TB languages, assuming a Himalayan foothill or NW Chinese urheimat
4) we only have early Neolithic samples really in Vietnam.

It's akin to as if we had questions about who reached Spain first, IE speaking culture or cultures associated with the Cardial expansion, if Cardial were expanding at the same time as IE cultures, and we had samples of Cardial cultures in Italy at the same time as we had good reason to believe IE cultures could have been in France...

Ryukendo K said...

@ Matt

I like the way you think, its very careful bro. You're talking to a guy who's a little bit more handwavy haha ¯\_(ツ)_/¯

Putting tgt McColl and Lipson together, we see that MSAF reached Laos and peninsular Malaysia (granted the Laotian samples are very close to Vietnam) and their ancestry is widespread throughout all of MSEA and Sunda and is overprinted by different other ancestries in different places(AuN in island SEA, ST in Burmese, Tai in Eastern Mekong RV and Vietnam) leaving outliers like Nicobarese, Orang Asli, Htin and Mlabri and to a lesser extent Proto-Malays like Bidayuh and Semende. To me, this suggests an earlier spread of FSAF compared to Dai, ST or AuN ancestry. Since they appear fully formed in SEA this also implies AuAs ultimate origin somewhere in S China, i.e. Nicobarese like people on the banks of Yangtze (quite difficult to imagine) and the origin of the AuN, Hmong-Mien and ST gene pools even further north than is usually assumed, maybe even North of the Yangtze.

I agree 6kya beginning of spread for both ST and AuAs is weird, but originally the ages suggested were much older for AuAs, 8kya had been suggested, and there is a trend towards younger and younger ages more recently, like 4kya for Blench which seems clearly wrong to me (divergence btw AuAs branches clearly much more than among IE). Case of "revisionist acceleration" among younger and younger scholars I think.

ST is also intrusive to the region, both genetics-wise (the original Myanmar LNBA were more Naxi than Burmese, implying admixture with locals and prior external origin) and linguistics wise, where the centre of gravity of the phylum likes in the region from Meghalaya to Sichuan. So circumstantial evidence suggest ST overprinting AuAs to me. Looking at the AuAs presence in Yunnan-Burma-Thailand trifecta among ppl like Palaung and Wa, and their closeness to Thai AuAs, it may be that that region was 100% FSAF at some point too.

VOX said...

"(Whatever disputes about likely phenotype, the ASI (pre-Neolithic) were likely extremely short, extremely dark, extremely broad featured, beyond any present day populations, whether they had a more gracile cranial phenotype on top of that like the Onge or were more robust like Oceanians. Almost everything from present day information on populations from hot climates and our knowledge of clines of phenotype in India today supports this)."

You are right about ASI having some level of genetic relation with the Onge and them being somewhat gracile.

In terms of stature, ASI (At least in the Ganges Plane) were generally tall. Damdada specimens (ASI) were 178 cm (males) and 173 cm (females), compared to Chalcolithic Mehrgarh where males and females were 171 cm and 159 cm respectively. ASI may have been tall for a couple of reasons:

1. Indian hunters (ASI) had longer limbs and a linear body proportions for easy heat dissipation

2. Longer limbs are better suited for highly mobile hunters

3. Indian hunters had generally better nutrition than the farmers

With that said, there was probably much variation in the height of south asian hunters, with shorter hunters to be found elseware. Also there's precaution to be taken when relating ancient populatons with today’s climate. For example, note that mid Holocene India was more hyper arid and desert-like than it is today.

https://muse.jhu.edu/article/48021/summary

Nirjhar007 said...

Its funny how sure some are, that Meso India will only be ASI....

Matt said...

Thanks for the compliment man. (My knowledge is really pretty lacking here; as probably shows with the archaeology and linguistics I often don't tend to retain too much until I have an adna scaffold to start thinking about and questions I want to ask!)

Really all the evidence for language dispersal here is leaving me more and more confused; I was reading some of Blench yesterday to try and get a handle on the topic and he seems fairly vehement that the dispersal of the whole TB language clade had to have begun approx 8-9kya, and that Sinitic had to expand 4kya, to get to the diversity seen among TB from NE India to China, and among Sinitic languages. Though will say he disagrees about the presence of agriculture in earliest strata of TB expansion, and does seem to favour a later expansion of TB languages into Myanmar itself (and generally in agreement with Pyu speakers as intrusive over preceding AusAs groups).

All I can really take from him in brief, is I think there's a consensus the whole TB clade has to be expanding earlier than Austroasiatic (based on internal diversity) and terminology does not too much favour a late expansion?

Anyway, so you'd guess some of this is underestimation of the spread of the Neolithic and AuAus in MSEA? Otherwise Austronesian, Austroasiatic, Sinitic expansions happening at more or less similar time. I will probably have to play it conservative with dates here, but will keep listening out to see if adna or other evidence trumps the models of language diversity and the dates that are currently in play... (Really some kind of direct adna of the Austroasiatic signature from well prior 2ky to Man Bac would do it fairly decisively). I won't be too shockedif you are ultimately right!

(the original Myanmar LNBA were more Naxi than Burmese, implying admixture with locals and prior external origin)

Or, they *were* the locals and admixture with Austroasiatic populations who had migrated in along the southern route had yet to become as a major a thing as by the present day (interactions between TB speaking northern polities and Mon speaking polities in the south later in history?). A la Lithuanians today being more tilted to EEF than Baltic_BA or the other populations that preceded them ever were, because of continuing isolation-by-distance contacts with Central Europe.

(If anything I'm more surprised by how linear the f3 correlation to East Asian groups seems to be between the Myanmar LN to BA pair and present day Burmese. The relative shift also looks a bit like Burmese are shifting towards Austronesians and SE China, than actually too much like a straight story of absoprtion of Austroasiatic groups. That is Nicobarese and Borneans also subtly above the line for relatedness of Myanmar_LNBA relative to Burmese.

From sticking a few of these stats through PCA: https://i.imgur.com/OFPQzN6.png.

On the "wheel" between relatedness to the EA groups from Myanmar_LNBA to Cambodian from Naxi->Nicobarese, Burmese about 70-80% loaded towards Myanmar LNBA... Then on a PCA wheel where one end reflects Myanmar_LNBA and Cambodian having more affinity to Nicobarese and Naxi relative to Ami, and KHV being more loaded to Ami on the other end, Burmese about intermediate. Burmese look like they're shifting mostly towards populations entering SE Asia from SE China? More than further AusAs absorption?)

Nirjhar007 said...

What is the explanation regarding Y-DNA I2a2a2a in 1700 BC Vietnam? a Typo?.

Ebizur said...

Matt wrote,

"On the "wheel" between relatedness to the EA groups from Myanmar_LNBA to Cambodian from Naxi->Nicobarese, Burmese about 70-80% loaded towards Myanmar LNBA... Then on a PCA wheel where one end reflects Myanmar_LNBA and Cambodian having more affinity to Nicobarese and Naxi relative to Ami, and KHV being more loaded to Ami on the other end, Burmese about intermediate. Burmese look like they're shifting mostly towards populations entering SE Asia from SE China? More than further AusAs absorption?)"

The Tai people have expanded probably from roughly the watershed of the Pearl River beginning probably at some time in the first millennium CE into northern Vietnam, Laos, Thailand, southern Yunnan, Upper Burma, and Assam. Could intermarriage between Burmese and Tai people over the last millennium or so have been significant enough to effect a shift of Burmese autosomes toward populations of southeastern China and Taiwan? If such an influence has occurred, it does not seem to be well-attested in the Y-DNA of present-day Bamar judging from the samples of Min-Sheng Peng et al. (2013), but that would not necessarily disprove such a hypothesis because most of the gene flow could have been mediated by Tai women marrying into the Burmese population.

ryukendo wrote,

"I agree 6kya beginning of spread for both ST and AuAs is weird, but originally the ages suggested were much older for AuAs, 8kya had been suggested, and there is a trend towards younger and younger ages more recently, like 4kya for Blench which seems clearly wrong to me (divergence btw AuAs branches clearly much more than among IE)."

I concur. There is very little shared in common among all languages that have been grouped together according to the hypothesis of an Austroasiatic language family that really screams "singular genetic origin," especially when compared to the obvious similarities among Indo-European languages. Some of the terms that do appear at least superficially to be patent cognates (something like tay for "hand" or something like mat for "eye") are frankly very similar to words in other languages of coastal Eastern Eurasia (Austronesian, Japonic, Ainu, etc.), so I wonder how rigorous proponents of the Austroasiatic hypothesis have been in their identification of sound laws and exclusion of the possibility of loanwords or Wanderwörter.

Nirjhar007 said...

Okay the I2a2a2a is a typo , from a reliable source I am informed that it is actually O1b1a .

Sorry to disappoint some excited I2a dudes here ;).....

capra internetensis said...

thanks Nirjhar i was wondering about that

Ebizur, i hope no one will take a sample of 2 as representative of all pre-Neolithic Southeast Asian Y DNA. :D F2-M427 should also have been around there.

i gather there is quite a bit of this kind of deep lexical sharing between East Asian languages, which would not suprise us if they all started out in Neolithic China, where there is supposedly an interaction sphere reaching from well south of the Yangtze to north of the Yellow River considerably pre-dating these Neolithic MSEA samples.

Ryukendo K said...

@ Matt

One thing to note is those people who think Sino-Tibetan is more of a "trans-himalayan" language family like van Driem and Blench, also tend to think that the tribes of Meghalaya and the Indian side of the Himalayas in general contain the most divergent ST languages and place Lolo-Burmese, Karen, Bodic (tibetan and related) and Sinitic, i.e. all the Eastern, more extensive branches of ST as part of a single terminal branch that represents the most successful eastern scion of the family, so their dates likely have to do with the deeply diverged stuff in trans-himalaya region, with a later, more conventional date for the extensive eastern branches of ST. Personally unqualified to comment on the validity of this proposal, a bit skeptical because trans-himalaya region is just the sorta region to develop diversity-sparing effects (like the Caucasus) or substrate effects (c.f. highland east Asian HG autosomal signature preserved in Sherpa), and it seems lowland east asian contribution from agropastorals is ubiquitous even west of the Himalayas, including ppl like Kiratas, only ppl who don't have it are the Sherpa.

hahaha said...

So Kinh Vietnamese speak AA langiage but genetically they are Tai-Kadai?

Garvan said...

Hieu Phamnhu said...
So Kinh Vietnamese speak AA langiage but genetically they are Tai-Kadai?

By the bronze age, this area of Northern Vietnam may have been Kra speaking (See 'Kra-Dai and the Proto-History of South China and Vietnam - James R. Chamberlain'). It is before the expansion of Kinh into the area, so the Kinh must have absorbed the former populations as they expanded their civilization.

Garvan


Anonymous said...

Unreal. I was resigned to waiting for the Indo-European subject matter to be finally over and done with, and therefore expecting South and Central Asia and thereafter China to be covered, long before Southeast Asia would finally get a turn.

Now 2 papers on Southeast Asia aDNA have landed, with nearly 40 samples in all. Spectacular. Good I came to check here before beginning my reading, after seeing the Y-DNA I2a2a2a mention on anthrogenica. Someone there reasoned it may have been mistyped, with O intended for I, while another said he'd notified the authors' about it. Thanks Nirjhar for confirming it was in error.

hahaha said...

So that is the reason why i am curious about the closeness between Dai and Kinh people. Are they really close by Fst?

Ebizur said...

Hieu Phamnhu wrote,

"So Kinh Vietnamese speak AA langiage but genetically they are Tai-Kadai?"

Tai-Kadai is a language family, so it is not really correct to say that Kinh people are genetically Tai-Kadai.

However, it does appear to be true that Kinh people and Tai people share Y-DNA haplogroup O1b1a1a1a1a-M111/M88 with high frequency:

Kinh/Hanoi, Vietnam (He et al. 2012)
9/76 = 11.8% C2-M217
1/76 = 1.3% K-P131(xN-M231, O-P191, Q-P36, R-M207)
2/76 = 2.6% N-M231
5/76 = 6.6% O-P203(xM101)
9/76 = 11.8% O-M95(xM88)
23/76 = 30.3% O-M88
7/76 = 9.2% O-P200(xM121, M164, P201, JST002611)
2/76 = 2.6% O-P201(xM7, M134)
8/76 = 10.5% O-M7
7/76 = 9.2% O-M134
2/76 = 2.6% O-JST002611
1/76 = 1.3% R-M17

Hanoi, Vietnam (Trejaut et al. 2014)
2/24 = 8.3% C2-M217
1/24 = 4.2% D1a1-M15
2/24 = 8.3% NO-M214(xO-M175, N-M128, N-M178)
1/24 = 4.2% O1a-M119(xP203, M50)
1/24 = 4.2% O-PK4(xM95)
2/24 = 8.3% O-M95(xM88)
6/24 = 25.0% O-M88
1/24 = 4.2% O-JST002611
1/24 = 4.2% O-M159
3/24 = 12.5% O-M7(xM113)
3/24 = 12.5% O-M134(xM133)
1/24 = 4.2% O-M133(xM162)

Kinh/Ho Chi Minh City, Vietnam (KHV sample of the 1000 Genomes Project)
1/46 = 2.2% C2c1b-F845(xC2c1b1-K511, C2c1b2-F5477, C2c1b4-FGC39587)
1/46 = 2.2% F-Y27277*(xF2-M427/M428)
2/46 = 4.3% Q1a1a1a1a-Y529
1/46 = 2.2% N1b2a-M1811 [Y24355+]
1/46 = 2.2% N1a2a1b-Y23738

2/46 = 4.3% O1a1a-P203.1 > O1a1a1-F446(xO1a1a1a-F140) > O-CTS8920
TMRCA O1a1a1-F446 approx. 8,300 ybp

1/46 = 2.2% O1b1a2c-CTS9996 (O1b1a-K18(xM95))

1/46 O1b1a1a1b-M1283*(xO1b1a1a1b1-Y9322)
2/46 O1b1a1a1b1a2-M6661
3/46 = 6.5% O-M95(xM88/M111) total
[O1b1a1a1b-M1283 formed 10300 ybp, TMRCA 5800 ybp]

3/46 O1b1a1a1a1a2-F2890
2/46 O1b1a1a1a1a1a-Z24083*(xO1b1a1a1a1a1a1b-Z24091, O1b1a1a1a1a1a1a-F923)
2/46 O1b1a1a1a1a1a1b-Z24091
3/46 O1b1a1a1a1a1a1a-F923*(xO1b1a1a1a1a1a1a1-CTS2022)
1/46 O1b1a1a1a1a1a1a1-CTS2022*(xO1b1a1a1a1a1a1a1a-F3053)
1/46 O1b1a1a1a1a1a1a1a-F3053*(xO1b1a1a1a1a1a1a1a1-CTS3745)
12/46 = 26.1% O1b1a1a1a1a-M88/M111 total
[O1b1a1a1a1a-M88/M111 formed 9900 ybp, TMRCA 5700 ybp]
[TMRCA of 10300 ybp for O-M1310, which subsumes both O-M1283 and O-M88/M111]


1/46 = 2.2% O2b-CTS1754*(xF1024)

1/46 O2a1a1b1a1-CTS1936
1/46 O2a1a1a1a1-L599/M121
2/46 = 4.3% O2a1a1-F2159 or O2a1-KL2(xO2a1c-JST002611) total
[O-F2159 TMRCA 13,300 ybp]

2/46 O2a1c1a1a1a1c-F539
1/46 O2a1c1a1a1a1f1-CTS257
1/46 O2a1c1a1a1a1e1-Y15976*(xO2a1c1a1a1a1e1a1a-SK1686)
4/46 = 8.7% O2a1c-JST002611 > O2a1c1a1a1a-F11 > O2a1c1a1a1a1-F325 total
[O-F11 TMRCA 8,800 ybp]

4/46 O2a2a1a2a2b-V3237/Z25318 [O2a2a1a2a2-F1863*(xV3237) has been found in a CDX Dai in Xishuangbanna, Yunnan.]
2/46 O2a2a1a2a1a2-Z25398*(xF1100)
6/46 = 13.0% O2a2a1a2-M7 total
[O2a2a1a2a-F1276 TMRCA 15,200 ybp]

1/46 = 2.2% O2a2b2a1b-F4124 [This belongs to the "East Asian" branch of O2a2b2-N6. TMRCA with the "Austronesian" branch is 12,700 ybp.]

1/46 O2a2b1a2-F444 > O2a2b1a2b-F1725*(xO2a2b1a2b1-Y31255)
1/46 O2a2b1a2a1a1a-F700/CTS7300
2/46 O2a2b1a2a1a2a1-Z26108 > O-Z26156/YP5166/F14610
4/46 = 8.7% O-F444 total

1/46 O2a2b1a1a4a1-CTS1154 [Also found in two individuals in Fujian, China.]
[O2a2b1a1a4a-CTS5063 TMRCA 2,500 ybp. O-CTS1154 TMRCA 1,750 ybp. O-CTS5063*(xCTS1154) found in a CDX Dai in Xishuangbanna, Yunnan, China.]
1/46 O2a2b1a1a3a-F2188*(xO2a2b1a1a3a2-Y29861) [Also found in Cebu, Philippines.]
1/46 O2a2b1a1a3a2-Y29861 [Also found in Henan, China.]
[O2a2b1a1a3a-F2188 TMRCA 2,000 ybp]
1/46 O2a2b1a1a6b1-Y7080
[O-Y7080 TMRCA 5,600 ybp. Also found in Guangdong, China and in nine CDX Dai in Xishuangbanna, Yunnan, China.]
4/46 = 8.7% O2a2b1a1a-M133 total
[O2a2b1a1a-M133 TMRCA 8,200 ybp]

Ebizur said...

Vietnam (Karafet et al. 2010; the sample is from specifically "Southern Vietnam" according to He et al. 2012)
3/70 = 4.3% C2-M217
2/70 = 2.9% D-M15
1/70 = 1.4% J-M304(xJ1-M267?, J2-M172)
1/70 = 1.4% J2-M172(xM12?)
2/70 = 2.9% N-M231(xM178, M128)
2/70 = 2.9% O-P197(xP201, JST002611)
1/70 = 1.4% O-P201(xM7, M134)
4/70 = 5.7% O-M7
10/70 = 14.3% O-JST002611
11/70 = 15.7% O-M134
4/70 = 5.7% O-P203
1/70 = 1.4% O1b-P31(xM95, M176)
1/70 = 1.4% O1b2-M176(x47z)
2/70 = 2.9% O-47z
5/70 = 7.1% O-M95(xM111)
14/70 = 20.0% O-M111
5/70 = 7.1% Q-P36(xM346)
1/70 = 1.4% R-M17

Tai Khuen from Northern Thailand (Brunelli et al. 2017)
2/24 = 8.3% D-M15
4/24 = 16.7% N-M231
1/24 = 4.2% O-P203
6/24 = 25.0% O-PK4(xM111)
6/24 = 25.0% O-M111
1/24 = 4.2% O-M134(xM117)
4/24 = 16.7% O-M117

Tai Lue from Northern Thailand (Brunelli et al. 2017)
2/91 = 2.2% C2-M217
1/91 = 1.1% D1a1-M15
3/91 = 3.3% N-M231 [Two of these individuals had null results for M231, but they are NO-P195+ O1a-M119- O1b-P31- O2-M122-, so almost certainly belong to haplogroup N.]
6/91 = 6.6% O1a1a-P203
3/91 = 3.3% O1a2-M50
22/91 = 24.2% O1b1a1-PK4(xO1b1a1a1a1a-M111)
28/91 = 30.8% O1b1a1a1a1a-M111
1/91 = 1.1% O2-M122(xO2a-M324)
3/91 = 3.3% O2a-M324(xM7, M134)
6/91 = 6.6% O2a2a1a2-M7
16/91 = 17.6% O2a2b1a1-M117

Khon Mueang from Northern Thailand (Brunelli et al. 2017)
4/205 = 2.0% C-M130(xM8, M105, M38, M356, P55, M217)
1/205 = 0.5% C-M356
6/205 = 2.9% C2-M217
1/205 = 0.5% E1-P147
4/205 = 2.0% D-M15
1/205 = 0.5% J2-M172
1/205 = 0.5% K-M9/P128(xK2e-M147, K2b1a1-P60, K2b1a2-P79 (? A->G=A instead of T->C=T), K2c-P261, K2c-P263, L-M11, L-M20, L1a1-M76, L1b-M317, L1a2-M357, M-P256, M1-M106, M1-M5, M2-M353, M3-P118, NO-P195, NO-M214, N-M231, O-P191, O-P186, P1-M45, P1-P281, Q-M242, R1-M306, R1-M173, R1-P231, R2a-M124, R2a-P249, S-P204, S-P202, T1a-M70, T?-M320)
1/205 = 0.5% H-M52
2/205 = 1.0% L1a1-M76
9/205 = 4.4% N-M231
6/205 = 2.9% O-P203
2/205 = 1.0% O-M50
47/205 = 22.9% O-PK4(xM111)
38/205 = 18.5% O-M111
1/205 = 0.5% O2-M122(xM324)
16/205 = 7.8% O2a-M324(xM7, M134)
17/205 = 8.3% O-M7
46/205 = 22.4% O-M117
2/205 = 1.0% R1-M173

Tai Yuan from Northern & Central Thailand (Brunelli et al. 2017)
1/85 = 1.2% C-M130(xM8, M105, M38, M356, P55, M217)
5/85 = 5.9% C2-M217
1/85 = 1.2% D-M15
1/85 = 1.2% H-M52
6/85 = 7.1% N-M231
3/85 = 3.5% O-P203
19/85 = 22.4% O-PK4(xM111)
19/85 = 22.4% O-M111
5/85 = 5.9% O-M122(xM324)
5/85 = 5.9% O-M7
3/85 = 3.5% O-M134(xM117)
15/85 = 17.6% O-M117
2/85 = 2.4% R1-M173


Tai Yong from Northern Thailand (Brunelli et al. 2017)
1/26 = 3.8% D-M15
1/26 = 3.8% N-M231
2/26 = 7.7% O-P203
1/26 = 3.8% O-M50
2/26 = 7.7% O-PK4(xM111)
10/26 = 38.5% O-M111
4/26 = 15.4% O-M324(xM7, M134)
2/26 = 7.7% O-M7
3/26 = 11.5% O-M117

Shan from Northern Thailand (Brunelli et al. 2017)
1/20 = 5.0% CF-P143(xC-M130, F2-M428, G-P257, H1a-M69, H2-P96, IJ-P123, IJ-P127, I-M258, I-U179, J-P209, J-M304, K-M9, K-P128)
1/20 = 5.0% N-M231
1/20 = 5.0% O-P203
6/20 = 30.0% O-PK4(xM111)
3/20 = 15.0% O-M111
1/20 = 5.0% O-M7
7/20 = 35.0% O-M117

Ebizur said...

In connection with my previous comment about the Burmese, the Tai Khuen and the Shan should be most relevant as potential vectors of SE Chinese-like DNA into Burma at some time within the last two millennia.

Haplogroup O1b1a1a-M95 (former O2a-M95; formed 12400 ybp, TMRCA 10700 ybp) is currently known to have roughly seven branches:

O1b1a1a2-CTS350 (formed 10700 ybp; TMRCA unknown) Found in two Han individuals from Hunan in the CHS sample of the 1000 Genomes Project. Note that the CHS Han from Hunan have a very aberrant Y-DNA pool, and may not be "authentic" Han Chinese, but rather mainly descendants of some originally non-Chinese ethnic group that has been linguistically assimilated by the Chinese. This is the most basal known branch of O-M95 by a small margin.

O1b1a1a1b-F789/M1283 (formed 10300 ybp, TMRCA 5800 ybp) This branch of O-M95 has been found in populations ranging from the Dravidian-speaking Gond and Konda Dora and Munda-speaking Ho in the west to the Austronesian-speaking Lebbo and Dusun of Borneo in the east. It also has been found in three Kinh, three Dai in Yunnan, a Yi, and a Cambodian. Members of this clade seem most likely to have been responsible for effecting a linguistic connection between Munda-speaking peoples in India and other Austroasiatic populations in Southeast Asia.

O1b1a1a1a2-SK1630 (formed 9900 ybp, TMRCA unknown) This is a poorly known branch of O-M95 that has been observed in an individual with a patrilineal ancestor from Ryazan Oblast of Russia, just southeast of Moscow Oblast.

O1b1a1a1a1b1a-CTS7399 (formed 9900 ybp, TMRCA 5600 ybp) This subclade has been observed in a Japanese in Tokyo, an individual from Fujian, two Dai in Xishuangbanna, and an individual from Guangxi. The Japanese individual is basal to a branch that includes all the members from China; the South China branch has an estimated TMRCA of 3500 ybp.

O1b1a1a1a1b1b-CTS5267 (formed 9900 ybp, TMRCA unknown) This subclade has been observed in one Han in Beijing and four Dai in Xishuangbanna.

O1b1a1a1a1b2-Z23781 (formed 9900 ybp, TMRCA 8500 ybp) This subclade has been observed in an individual from Henan, an individual from Chongqing, and two Dai from Xishuangbanna, with the individual from Henan being basal to a branch that includes the other three individuals. Note the rather great TMRCA.

Haplogroup O1b1a1a1a1a-M111/M88 (formed 9900 ybp, TMRCA 5700 ybp) Members of this subclade are very common among modern Tai peoples, Kinh, and Akha-Hani peoples. They also have been observed with lower frequency among other Lolo-Burmese peoples, Qiangic peoples (e.g. 1/33 Qiang in the sample of Xue et al. 2006, 2/18 Horpa from Danba County, Sichuan), Tibetans (1/47 Khampa from Hekou Town, Yajiang County, Garze Tibetan Autonomous Prefecture, Sichuan), Kam-Sui peoples (e.g. 1/40 Sui from Fuyuan, Yunnan according to Yang et al. 2005, 1/21 Chadong), Hlai people (1/11 Hlai according to Li Dongna et al. 2010, 1/34 Li according to Xue et al. 2006), Hmong-Mien peoples, Tujia, Bai, southern Han, and Austronesians in Taiwan, the Philippines, Malaysia, Vietnam (i.e. Cham), Java, and Borneo. This subclade notably has not been found among speakers of Austroasiatic languages in India, but Li Dongna et al. 2010 have reported finding O-M88 in 3/26 = 11.5% of a sample of Cambodians and Cai et al. 2011 have reported finding O-M88 in various Austroasiatic-speaking populations in Laos.

Ebizur said...

In summary, there is one branch of O-M95 spread all over southern Asia from India to Borneo, one branch spread mainly in southern China, Vietnam, Laos, and Thailand, one branch split between southern China and Japan, one branch split between Beijing Han and Yunnan Dai, one branch split between roughly central China and Yunnan Dai, one branch in Russia, and one branch in Hunan, with the branch in Hunan being the most basal by a slight margin. The founder of O-M95 as a whole most likely has not lived any more recently than the earliest Neolithic culture in East Asia, but he probably does not predate the earliest Neolithic in East Asia by a great margin, either.

Ryukendo K said...

Hi Phieau and Garvan,

Thanks for bringing the paper to my attention. I'm once again not qualified to evaluate the argument, but I've gotta say that the idea that the red river valley was not Vietic at the beginning sounds unusual and exotic to me and I've not read anywhere, either in sources on Vietnamese protohistory or AuAs linguistics, that vietic peoples were intrusive to the Vietnamese heartland (then again you won't find this argument for Sinitics either until very recently, so the reluctance to recognize non-autochthonous origin in East Asia may just be cultural inertia Am on East Asians that then got diffused to Western linguists studying the languages).

Certainly a Tai-Kradai intrusion in the Red River valley that only got overlaid by a Vietic linguistic shift spreading North close to the second mil AD would explain why Vietnamese are so similar to Tais autosomally. (Phiaeu) Vietnamese look autosomally like Cambodian+Zhuang, which is virtually identical to Tai speakers in mainland SE Asia, in fact Northern Vietnamese are if anything even more Tai-shifted than Thais themselves. The Dong-son culture remains we have in McColl cluster with present day Thais and Laos, and slightly south of present-day day Viets, i.e. slightly more more Cambodian (or AuAs) like.

But then a Tai-Kradai speaking Red River Valley must be a postNeolithic development as well, as the Neolithic genomes we have here show quite clearly that the Neolithic inhabitants of Vietnam, all fall in "group 2" and all cluster with Mlabri, Orang Asli and Nicobarese. If we accept the congruence of genes and languages in SEA, which I think we generally should given what this paper has shown, a three-layer linguistic sandwich, with the Tai-kradai in the middle from the Bronze Age to the 1st mill CE, may be closer to the truth.


Ryukendo K said...

@Ebizur,

What do you make of the paper Garvan has introduced?

Anthony Hanken said...

@Ebizur

Just out of my own curiosity, where is the most likely place of origin for O? It seems like most have speculated Southeast Asia however it seems like O branches came from farther north in China during the Neolithic judging from this paper however I have little knowledge on this subject.

capra internetensis said...

there is also Phan's theory that the Red River plain spoke a Middle Chinese dialect (Assamese Middle Chinese as he calls it in his thesis "Lacquered Words") and shifted to Vietic at a later point, explaining the very high Sinitic content in Vietnamese (parallel to Norman French in English). but the region may have been Vietic before that anyway.

Matt said...

@Ebizur: Could intermarriage between Burmese and Tai people over the last millennium or so have been significant enough to effect a shift of Burmese autosomes toward populations of southeastern China and Taiwan?

This certainly seems like a fairly possible theory.

The other thing which occured to me shortly after my initial post (and I forgot to post at the time) was that the original Bamar themselves were a later intrusion from Yunnan and so may have carried a little offset towards groups from the Southeast Chinese Neolithic (Hmong / Austronesian like). Assuming generally widening admixture networks over time in the region. That is Bamar migration may not just be a "more Tibeto-Burman / Sino-Tibetan" into a Myanmar_LNBA substrate.

In support, the Naxi do have a slight offset towards Ami, Dai, Miao, Lahu and Burmese, but away from Nicobarese and equal to CHB, compared to the Myanmar_LBNA: https://imgur.com/a/XR1Zs (e.g. compared to Myanmar_LNBA, Naxi are more SE China like, less Austroasiatic like, same distance to North China).

So it could be both effects, though I think your effect might be more plausibly dominant as it would require less admixture fraction to the same ends.

hahaha said...

Did authors run the ADMIXTURE of the above populations?

Garvan said...

Blogger Hieu Phamnhu said...
Did authors run the ADMIXTURE of the above populations?

The Dai and Kinh look like Southern Chinese in Admixture graphs.

Have a look at the f statistics at the end of the 'Ancient genomes document multiple waves of migration in Southeast Asian prehistory' paper. These show shared drift between pairs of populations. We can clearly distinguish Dai and Kinh (KHV), with the Dai having more shared drift with Hmong, Lahu, Htin. The Kinh have more shared drift with Han.

Garvan

Garvan said...

From the McColl Paper:
"These results, together with the absence of significant Denisovan ancestry in these populations, suggest that the Denisovan admixture observed in Papuans occurred after their ancestors split from the ancestors of the Onge, Jehai and the ancient Hoabinhians. This is also consistent with the presence of substantial Denisovan admixture in the Mamanwa from the Philippines, which are best modeled as resulting from an admixture between Austronesians and Papuans, not Onge"

Given the proposed migration routes in the graphic copied above, where can the Denisovan admixture have taken place? Since the Reich paper in 2011 - looking at figure 3 in this paper showing the Denisovan admixture in the Mamanwa - I have thought the migration route for the Denisovan admixed population was from China via Taiwan and the Philippines. Yes, I know they would need boats.



Ebizur said...

ryukendo wrote,

"What do you make of the paper Garvan has introduced?"

I did not find anything very conclusive. The most memorable aspect of the paper IMHO is the author's claim that Southwestern Tai peoples use cognates of Kra to refer to some "hill tribes" in their vicinity, many of whom natively speak one or another Austroasiatic language, and that Kra is cognate with 楚 Chǔ of classical Chinese texts. We really need more ancient DNA from various parts of the PRC.

Ebizur said...

Anthony Haken wrote,

"Just out of my own curiosity, where is the most likely place of origin for O?"

Personally, I would say the most likely place of origin of haplogroup O is 中原 (the "central plains," roughly the flatlands of "China proper").

O1a-M119 (formed 30100 ybp, TMRCA 15200 ybp) may have matured in the southeastern portion of this area (roughly the YRD), eventually spreading southward mainly along the coast and into Taiwan at some time in the fourth millennium BCE. Much of the subsequent distribution of the haplogroup probably has been effected by the spread of peoples speaking Austronesian languages.

O1b1a2-Page59 (formed 22100 ybp, TMRCA 15900 ybp) seems to be comprised of three primary subclades, all of which may have been spread by Chinese people proper. (The Cao clan of Wei appears to belong to one of these subclades.) However, one of the three subclades, O1b1a2b-F417, is deeply bifurcated (TMRCA 11300 ybp). The O1b1a2b1-F840 branch has been found in Han Chinese and a Dai. Members of the other branch of O1b1a2b-F417 have been found so far only with very low frequency among Koreans and Japanese IIRC.

O1b1a1-PK4 (formed 22100 ybp, TMRCA 12400 ybp) appears most likely to have populated some part of the watershed of the Yangtze River, perhaps around its middle reaches (e.g. the Chu/Jingzhou/Huguang area), in the early Neolithic. It appears that one branch of this population (probably O1b1a1a1b-M1283: formed 10300 ybp, TMRCA 5800 ybp) eventually moved south to found one of the earliest Neolithic cultures (perhaps the very earliest) in mainland Southeast Asia. The O1b1a1a1a1b-CTS5854 subclade appears to be less bottlenecked, probably because it occupies more-or-less the ancestral Neolithic homeland of the O-PK4 clade, although many members may also have spread southwestward quite recently in tandem with the expansion of the Tai people. The O1b1a1a1a1a-M111 subclade (formed 9900 ybp, TMRCA 5700 ybp) may have started to spread into Southeast Asia around the same time as O1b1a1a1b-M1283, or it may have remained somewhere in southern China until a more recent era; I think this question cannot be resolved without more data from ancient DNA. It seems to be found at present mainly among Tais and Viets, but it is also found among the Bunun tribe in Taiwan and in the Philippines, so some branch of it may have spread via the coast of southern China.

O1b2-M176 (formed 28400 ybp, TMRCA 12100 ybp) is now found mainly in Japan and Korea, but a few cases have been reported in samples from southeastern Siberia, Manchuria, Mongolia, Xinjiang, Khams, China proper, Taiwan, and Vietnam. It is difficult to discern where it might have originated; perhaps southern Liaoning?

Ebizur said...

O2-M122 (formed 31500 ybp, TMRCA 29000 ybp) is phylogeographically a mess. I suppose O2a2b2a2b-AM01847/B451 probably has spread southward along the eastern coast of China together with O1a-M119 and subsequently dispersed widely with speakers of Austronesian languages. O2a2b1a1a6-CTS1642 (formed 8200 ybp, TMRCA 6200 ybp) may have expanded along with the spread of the proto-Tibeto-Burman language. O2a2a-M188 (formed 24600 ybp, TMRCA 20900 ybp) appears to be very old, and it may have populated some part of central or southern China prior to the Neolithic revolution; one branch of it is now common among speakers of Hmong-Mien languages. The other subclades (O2a1c-JST002611, O2a2b1a2-F114, etc.) seem to be spread all over East Asia (and into Southeast Asia and Central Asia) with some variation in the density of their distribution, but no really clear-cut pattern. I think this is ascribable to their having been present in the populations of the earliest Chinese states or assimilated into the nascent Chinese polities at a very early stage of their development, so that members of the various subclades have been moving around within the Chinese realm and gradually forming a sort of homogeneous suspension (and also relocating together wherever Chinese emigrants have settled).

For archaeological reasons, too, I think a roughly Chinese (in a geographical sense) origin of the various subclades of O-M175 is more plausible than a Southeast Asian (again, in a geographical sense) origin, but that does not preclude the possibility that the common ancestor of NO-M214 (formed 41500 ybp, TMRCA 36800 ybp) may have moved northward from Southeast Asia. (Though, again, K2a* has been found in Oase and Ust'-Ishim, so, even if the MRCA of NO-M214 has arrived in East Asia via Southeast Asia, some people who shared a common patrilineal ancestor with him approximately 5,000 years earlier must have traveled at least as far as what is now southwestern Romania.)

Ryukendo K said...

@ Hieu

Just noticed I got your name wrong, apologies.

@ Ebizur

Thanks for taking the time to read the paper man. Your answer about Haplogroup O is honestly the most useful thing I've read in the comments section in a while! Kudos!

Have you considered setting up a blog? Your systematic knowledge of East Asian uniparentals, if it were recorded somewhere, would be a great reference for many people.

Few questions here, wouldn't mind if you don't answer all of them, they're quite a lot:

Judging from the spread of the FSAF from the Nicobar islands and (I'm assuming pre-MLBA) Burma to the Red River valley in Vietnam, what do you make of the implications for the Austroasiatic homeland?

What do you make of the uniparentals of FSAF, which seem to be stereotypically "Austroasiatic" clades of O? What about the uniparentals for the Dongson sample and the fact that this YHap intruded with a combination of "Hmong-Mien" and "Austronesian" (i.e. Tai-Kradai-like) ancestry?

The FSAF definitely had fully-fledged rice agriculture from the archaeology, what do you make of the reconstructions that deny proto-Austroasiatic the privilege of being the linguistic stock with the first rice-farming vocab, and reconstruct it with a light horticultural vocab centering on Taro, yams etc?

Lastly you mentioned on AG that ~1/3 to 1/4 of Koreans and Japanese carry clades of O coalescing with mainland East Asian clades in a "recent" (i.e. Neolithic and post-Neolithic) timeframe. Could you talk about some of these clades, how deeply they coalesce with clades in Continental East Asia, and if they indicate population movements or splits at the Neolithic, Metal-Age/Protohistoric, or historic periods in Korea and Japan?

Hope that you could at least answer the last question if you don't wanna answer all of them.

@ Sam

If you are still reading, I think you can learn some things from Ebizur. I really appreciate what you are doing (i'm sure many do), but there has to be a scholarly method to it. Keeping track of references, systematising, such that you can back yourself up with example uniparental clades whenever you make an argument for a certain pattern and then reference all the way back to the lab, sample and paper when necessary, etc.

Ebizur said...

Not only subclades of O2-M122, but also subclades of C2-M217 are clearly shared among Chinese and Koreans more recently than the onset of the Neolithic in China. Sharing of O1a-M119 in a similar manner appears likely. O1b2a1a-F3356 is extremely frequent among Japanese and Korean males, but it is quite uncommon (< 1%) among Chinese males; however, the TMRCA of the entire clade seems to post-date the onset of the Neolithic in China, so it hypothetically could have derived from a Chinese Neolithic ancestor, although it is phylogenetically very deeply divergent from all other extant subclades of O-M175. High-resolution data regarding other Y-DNA clades found among modern Koreans (foremost among them being N-M231, to which about 4% of Korean males belong, but also D1b-M55, Q-M242, R-M207, J1-M267, L-M20, C-M130(xM105, M38, M217), etc.) is lacking, but I would not be surprised if most of them turned out to be related to Chinese (especially members of N-M231, Q-M242, and C-M130(xM105, M38, M217)) or Japanese (especially members of D1b-M55) at a similarly shallow chronological depth.

C2c1b-F845 (found in 17/706 = 2.4% of a sample collected in Seoul and Daejeon, South Korea (Kwon et al. 2015))
C-F845 formed 11700 ybp, TMRCA 7900 [95% CI 7200 <-> 8700] ybp [Shandong x 2, Anhui x 1, Zhejiang x 1, Henan x 1, Sichuan x 1, Guizhou x 1, Hubei x 1, Beijing x 1, Seoul x 2, Ho Chi Minh City x 1, Xishuangbanna Dai x 2, Tokyo x 1]
*C-FGC39579 formed 7900 ybp, TMRCA 5000 ybp [Henan x 1, Sichuan x 1]
*C-K511 [Xishuangbanna Dai x 2]
*C-F5477 formed 7900 ybp, TMRCA 7400 ybp [Guizhou x 1, Tokyo x 1, Seoul x 1, Hubei x 1, Beijing x 1]
**C-MF1017 formed 7400 ybp, TMRCA 5000 ybp [Seoul x 1, Hubei x 1, Beijing x 1]
***C-Y35928 formed 5000 ybp, TMRCA 3100 ybp [Hubei x 1, Beijing x 1]

C2c1a2-Z8440/F3880 (found in 17/706 = 2.4% of a sample collected in Seoul and Daejeon, South Korea (Kwon et al. 2015))
C-Z8440 formed 10200 ybp, TMRCA 9800 ybp [Beijing x 3, Chechnya x 1, Guizhou x 1, Henan x 1, Shandong x 1, Bangladesh x 1, unknown China x 1, unknown x 2]
*C-CTS3385 formed 9800 ybp, TMRCA 7000 ybp [unknown x 1, Beijing x 2, Chechnya x 1; a Korean member of C-CTS3385, surname reported as Jeong, has been tabulated by the Korea DNA Project at FTDNA, and may in fact be the same individual as the unknown x 1 on the YFull tree]
**C-FGC45548 formed 7000 ybp, TMRCA 4800 ybp [Beijing x 2, Chechnya x 1]
*C-F1319 formed 9800 ybp, TMRCA 9100 ybp [Guizhou x 1, Henan x 1, Shandong x 1, Bangladesh x 1, Beijing x 1, unknown China x 1, unknown x 1]
**C-Y35926 formed 9100 ybp, TMRCA 4900 ybp [Guizhou x 1, Henan x 1, Shandong x 1, unknown China x 1]
**C-F3777 formed 9100 ybp, TMRCA 9100 ybp [Bangladesh x 1, Beijing x 1, unknown x 1]
***C-F3759 formed 9100 ybp, TMRCA 8600 ybp [Beijing x 1, unknown x1]

C2c1a1-CTS2657 (found in 50/706 = 7.1% of a sample collected in Seoul and Daejeon, South Korea (Kwon et al. 2015))
C-CTS2657 formed 10200 ybp, TMRCA 9800 ybp [Beijing x 2, Jiangsu x 1, unknown South Korea x 1, Hubei x 1, unknown China x 1, Pakistan Hazara x 1, Fujian x 1, Shanghai x 1, Armenia x 1, Tokyo x 1, unknown x 1]
*C-A14895 formed 9800 ybp, TMRCA 6600 ybp [Beijing x 1, Jiangsu x 1, Hubei x 1, unknown South Korea x 1]
**C-A14912 formed 6600 ybp, TMRCA 6200 ybp [Beijing x 1, Jiangsu x 1]
**C-Y37069 formed 6600 ybp, TMRCA 2500 ybp [Hubei x 1, unknown South Korea x 1]
*C-Z18177 formed 9800 ybp, TMRCA 8100 ybp [unknown China x 1, Pakistan Hazara x 1, Fujian x 1, Beijing x 1, Shanghai x 1, Armenian x 1, Tokyo x 1, unknown x 1]
**C-CTS8579 formed 8100 ybp, TMRCA 7600 ybp [Pakistan Hazara x 1, Fujian x 1, Beijing x 1, Shanghai x 1, Armenia x 1, Tokyo x 1, unknown x 1]
***C-CTS4449 formed 7600 ybp, TMRCA 6200 ybp [Pakistan Hazara x 1, Fujian x 1, Beijing x 1]
****C-MF1028 formed 6200 ybp, TMRCA 6100 ybp [Fujian x 1, Beijing x 1]
***C-Y13856 formed 7600 ybp, TMRCA 7600 ybp
****C-M407 formed 7600 ybp, TMRCA 4100 ybp [Shanghai x 1, Armenia x 1, Tokyo x 1, unknown x 1]

Ebizur said...

O1a1a-P203 (found in 16/573 = 2.8% of a sample collected in Seoul and 3/133 = 2.3% of a sample collected in Daejeon, South Korea (Park et al. 2012). Also found in 12/55 = 21.8% of a sample from Fujian, 11/60 = 18.3% of a sample of Taiwan Minnan, 38/258 = 14.7% of a sample of miscellaneous Taiwan Han, 1/34 = 2.9% of a sample of Taiwan Hakka, 110/370 = 29.7% assimilated Taiwan plain aborigines, 101/112 = 90.2% northern Taiwan mountain aborigines (Atayal, Taroko, Saisiyat, Thao), 37/41 = 90.2% Tsou, 72/146 = 49.3% southern Taiwan mountain aborigines (Yami, Amis, Rukai, Paiwan, Puyuma), 20/146 = 13.7% Philippines, 32/192 = 16.7% West Indonesia (Java, Borneo, Sumatra), 6/40 = 15.0% East Indonesia (Sulawesi, Ambon, Timor), 2/75 = 2.7% Thailand general population as per Trejaut et al. 2014.)
O-M307/P203 formed 14400 ybp, TMRCA 9800 [95% CI 8400 <-> 11300] ybp [Shanghai x 2, Hubei x 1, Zhejiang x 1, Beijing x 4, Hunan x 1, Fujian x 3, Xishuangbanna Dai x 10, Hong Kong x 3, Ho Chi Minh City x 2, Philippines Cebu x 1, Philippines Mountain Province x 1, Jiangsu x 5, Shandong x 1, Anhui x 1, Sichuan x 1, Guangdong x 2]

O1b2a1a-F3356 (found in 219/706 = 31.0% of a sample collected in Seoul and Daejeon, South Korea (Kwon et al. 2015))
O-K10/F3356 formed 12100 ybp, TMRCA 8200 [95% CI 6300 <-> 10200] ybp [Japan x 19, Hunan x 2, Shanxi x 1, Shandong x 1, Beijing x 1, Hezhen x 1]

O2a1c1a1a1a-F11 (found in 56/706 = 7.9% of a sample collected in Seoul and Daejeon, South Korea (Kwon et al. 2015))
O-F11 formed 13300 ybp, TMRCA 8800 [95% CI 8100 <-> 9600] ybp [Pakistan Hazara x 1, Pakistan Brahui x 1, Gansu x 1, Beijing x 4, Jiangsu x 2, Anhui x 1, Hunan x 6, Fujian x 2, Guangdong x 5, unknown China x 1, unknown x 4, Ho Chi Minh City x 4, Gyeonggi x 1, Tokyo x 1, Armenian x 1]
(Note that the individual from Gyeonggi, the province surrounding Seoul, and the Brahui individual from Pakistan belong to the O-SK1686 subclade, whose TMRCA is estimated to be 3500 [95% CI 2600 <-> 4600] ybp. Their TMRCA with HG02047 of the KHV (Kinh in Ho Chi Minh City, Vietnam) sample is estimated to be 5300 [95% CI 3900 <-> 6800] ybp.)

O2a1c1a2a-F238 (found in 8/706 = 1.1% of a sample collected in Seoul and Daejeon, South Korea (Kwon et al. 2015))
O-F238 formed 13300 ybp, TMRCA 8800 [95% CI 7400 <-> 10200] ybp [Fujian x 1, Hunan x 1, Anhui x 1, Myanmar x 1, unknown x 1]

O2a2b1a1a-M133 (found in 64/573 = 11.2% of a sample collected in Seoul and 18/133 = 13.5% of a sample collected in Daejeon, South Korea (Park et al. 2012))
O-M133/M1706 formed 14700 ybp, TMRCA 8200 [95% CI 7000 <-> 9500] ybp [Beijing x 6, Shandong x 5, Henan x 2, Anhui x 1, Jiangsu x 1, Chongqing x 1, Hunan x 1, Fujian x 5, Guangdong x 3, Naxi x 1, Xishuangbanna Dai x 13, Ho Chi Minh City x 4, Philippines Cebu x 2, North Hamgyeong x 1, Japan x 6, Myanmar x 1, Tripura x 5, Bangladesh x 2]

O2a2b1a2-F444 (found in 68/706 = 9.6% of a sample collected in Seoul and Daejeon, South Korea (Kwon et al. 2015))
O-Y20/F444 formed 18100 ybp, TMRCA 14100 [95% CI 12700 <-> 15500] ybp [Ho Chi Minh City x 4, Beijing x 8, Liaoning x 1, Henan x 1, Shandong x 1, Anhui x 1, Jiangsu x 2, Sichuan x 1, Hunan x 4, Fujian x 6, Xishuangbanna Dai x 1, Singapore x 1, Mongola/Inner Mongolian x 1, Xibo x 1, South Korea x 1, Tokyo x 3, Thailand x 1]
(Note that the individual from South Korea belongs to the O-CTS2815 subclade. TMRCA with an individual from Tokyo, an individual from Hunan, and an individual from Ho Chi Minh City is estimated to be 3400 [95% CI 2200 <-> 5000] ybp. Their TMRCA with two Han from Beijing and a slightly more basal individual from Henan is estimated to be 6900 [95% CI 5000 <-> 9000] ybp.)

Ebizur said...

Also, an erratum: I posted that the MRCA of NO-M214 must have had a patrilineal relative who migrated as far as southwestern Romania (i.e. Peștera cu Oase) approximately 5,000 years earlier, but this should have been approximately 9,000 years earlier. Perhaps ancient DNA from Palaeolithic Eurasian specimens might allow scientists to elucidate the migration routes of early members of Y-DNA haplogroup K2a, including the MRCA of NO-M214 and his close kin.

Ryukendo K said...

Ebizur, thank you so much for this data!

Unfortunately, these clades fall within the timeframe of the East Asian Neolithic but there is no clear indication if they coalesce even later than that, except for a few clades. It doesn't seem we know these clades at high enough resolution to tell us when the period of gene flow was. Would you say that, so far, the most we can say is that the autosomal connection was due to some post-neolithic rice-farming migration from somewhere on Continental East Asia to Korea and Japan, and other than this we do not know if there was further gene flow in the protohistoric or historic periods from the uniparentals?

Ebizur said...

ryukendo,

I agree that the only conclusion that can be stated definitively at the current level of resolution is that significant percentages of Chinese, Vietnamese, Koreans, and Japanese (and other populations, too, such as Mongols and Thais) share common patrilineal ancestors at some time more recent than the oldest archaeologically attested Neolithic cultures in East Asia, e.g. Nanzhuangtou in North China and Pengtoushan in South-Central China. The precise number, timing, and direction of the episode or episodes of gene flow cannot yet be determined.

For example, in the case of the O-CTS2815 subclade of O-F444, the MRCA of the Korean and the Han individuals from Beijing and Henan may have lived as early as 9000 years ago, quite early in the Neolithic. However, the Korean is basal to a subclade that includes an individual from Hunan, China, an individual from Tokyo, Japan, and an individual from Ho Chi Minh City, Vietnam, and the MRCA of the Korean individual and the three members of that subclade is estimated to have lived sometime between 2200 and 5000 years ago. One's interpretation of this phylogeny in its current poor level of detail in relation to geography or ethnic history depends a great deal upon whether one might allow a post-Neolithic migration from the pre-Koreans to the pre-Chinese(/Vietnamese/Japanese), or whether the direction of gene flow is always assumed to be from China to the rest. If the latter, then the gene flow from China to Korea and especially that to Vietnam and Japan must be quite recent indeed, perhaps in the historical era.