More Reference Admixture Runs

Posted by Zack on May 20, 2011

In addition to the removals and changes in the previous set of runs, I removed the Onge, Great Andamanese and Kalash for this set.

The admixture results of this dataset are in a spreadsheet as usual and the bar chart is below.

K=10, 11, 12 are the ones with the lowest cross-validation error.

I wonder if anyone is going to mind my calling C2 at K=9 Pakistani instead of Balochistan/Caucasus? 😉

I like K=12 here and K=12 or 13 in the previous run. So the question is which one of all these K runs with two different datasets should I use to replace the old reference I K=12 admixture runs?

Admixturereference

← Reference 3 PCA Clustering for South Asians

Harappa Participants Map →

3 Comments.

Ibra May 21, 2011 at 8:37 am

Thank you Zack, I found this run particularly interesting especially achieving the essential components with the lowest number of Ks. It seems like the Kalash component is just an instance of the Balochistan/Caucasus component and by removing Kalash samples' member register with Balochistan/Caucasus. So I was wondering if the same holds for the Gujarati component. That is by removing the Gujarati-a samples a south Asian related component still emerges. I think probably not but I'm still curious to see if this happens.
Ref4C Admixture | Harappa Ancestry Project - pingback on May 24, 2011 at 9:40 am
Ref4C Admixture | Harappa Ancestry Project - pingback on May 24, 2011 at 9:40 am

Trackbacks and Pingbacks:

Ref4C Admixture | Harappa Ancestry Project - Pingback on 2011/05/24/ 09:40
Ref4C Admixture | Harappa Ancestry Project - Pingback on 2011/05/24/ 09:40

Harappa Ancestry Project

Genetics and South Asia