Some months ago, I decided to run a big ChromoPainter analysis of the Eurasian samples I have. I removed from my dataset not only all Sub-Saharan Africans, but also North Africans and anyone else with more than 2% African admixture (which unfortunately included me).
Since the number of samples was still too large, I picked 25 random individuals from each non-South-Asian ethnicity while keeping all South Asians. I also tried to remove all close relatives and those with a high missing genotyping rate.
In the end, I had 254,576 SNPs for 2,001 samples belonging to 197 ethnic groups.
I ran ShapeIT to phase their genomes and then ChromoPainter and fineStructure. The whole process took about 2 months.
Then I got busy and the results sat on my computer for more than a month.
Now let's look at the ChromoPainter/fineStructure analysis. Due to my time constraints, I am going to present them in several posts.
Today, let's look at the fineStructure clustering run on the chunkcount output of ChromoPainter. It divided the individuals into 203 populations. Here's the spreadsheet containing the group and individual population clustering.
And here is the dendrogram showing the relationship of the clusters/populations computed by fineStructure.
UPDATE: Better dendrograms
2 Comments.