In addition to the removals and changes in the previous set of runs, I removed the Onge, Great Andamanese and Kalash for this set.
The admixture results of this dataset are in a spreadsheet as usual and the bar chart is below.
K=10, 11, 12 are the ones with the lowest cross-validation error.
I wonder if anyone is going to mind my calling C2 at K=9 Pakistani instead of Balochistan/Caucasus? 😉
I like K=12 here and K=12 or 13 in the previous run. So the question is which one of all these K runs with two different datasets should I use to replace the old reference I K=12 admixture runs?
Thank you Zack, I found this run particularly interesting especially achieving the essential components with the lowest number of Ks. It seems like the Kalash component is just an instance of the Balochistan/Caucasus component and by removing Kalash samples' member register with Balochistan/Caucasus. So I was wondering if the same holds for the Gujarati component. That is by removing the Gujarati-a samples a south Asian related component still emerges. I think probably not but I'm still curious to see if this happens.