Let's continue our admixture analysis of the first batch of Harappa participants.
Here are their ethnic backgrounds and their admixture analysis results.
You might want to refer to the admixture analysis of the reference dataset.
At K=10,
C1 | South Asian | C2 | Kalash |
---|---|---|---|
C3 | Southwest Asian | C4 | Southeast Asian |
C5 | European | C6 | Papuan |
C7 | Northeast Asian | C8 | Siberian |
C9 | West African | C10 | East African |
At K=11,
C1 | South Asian | C2 | Balochistan/Caucasus |
---|---|---|---|
C3 | Kalash | C4 | Southeast Asian |
C5 | Southwest Asian | C6 | European |
C7 | Papuan | C8 | Northeast Asian |
C9 | Siberian | C10 | West African |
C11 | East African |
Note the C2 component, it sounds a bit like ANI (Ancestral North Indian) of Reich et al, though hold off on your conclusions and your excitement for now.
Also, note that this split is different from the results of Reference I K=11 admixture run where the East African split happened. However, at K=12 we get similar components.
At K=12,
C1 | South Asian | C2 | Balochistan/Caucasus |
---|---|---|---|
C3 | Kalash | C4 | Southeast Asian |
C5 | Southwest Asian | C6 | European |
C7 | Papuan | C8 | Northeast Asian |
C9 | Siberian | C10 | East African Bantus |
C11 | West African | C12 | East African |
I am going to explore even higher values of K since the crossvalidation errors are still decreasing.
HP0007 and HP0009 don't look nearly as simlar as they did in some of the low K analyses. In particular, at K=12, HP0009 has C2+C3 = 20% and HP0007 has C2+C3 = 10%.
Anyway, great work, as always. Should we next expect to see a new batch at lower values of K, this first batch at even higher values of K, or intermediate batches at the current values of K?
I am working on a number of things. Stay tuned. 🙂
South Asian is mostly undented at K=12 - the anticipated split did not happen. It is all the other stuff being redistributed in intriguing ways. Very interesting!
While the South Asian component percentages are about the same for K=12 as they were for K=9, they have changed from K=7 for Punjabis etc. Let's see what happens later.
Great getting more and more interesting now.
Here is my Charts Participant K12 and the Reference pop K12