Based on the Dodecad Oracle, here is Harappa Oracle using reference 3 admixture results.
I am using Dienekes' code with a couple of changes. One of them is using weighted distance based on Fst divergences between ancestral components. Because of that it is several times slower than DodecadOracle. I plan to offer an option soon to switch between Euclidean distance and Fst-weighted distance.
You need to install R to use it. Then unzip the Oracle zip file. Double-click on the file or use the following in R:
load('HarappaOracleR3fst.RData')
In R, you can look at the 385 populations included by typing:
X[,1]
To use it to find your closest populations, you need your Harappa Reference 3 admixture results. Use them separated by commas like this (for me):
HarappaOracle(c(44,12,0,24,14,1,2,0,0,1,2))
You will get a result, with the first column showing the closest populations and the 2nd column their distance to you.
[,1] [,2]
[1,] "balochi" "8.0242"
[2,] "bene-israel" "9.2843"
[3,] "brahui" "9.5158"
[4,] "pathan" "9.7034"
[5,] "makrani" "10.1014"
[6,] "sindhi" "10.9236"
[7,] "Bhatia" "11.8441"
[8,] "Sindhi" "12.1704"
[9,] "Kashmiri" "13.4229"
[10,] "punjabi-arain" "13.9192"
You can also find out the closest populations to one of the reference populations:
HarappaOracle("punjabi-arain")
By default, the Oracle shows the 10 closest populations. You can change that:
HarappaOracle("punjabi-arain",k=20)
Also, by default, the Oracle excludes the Pan-Asian dataset since the overlap is only 5,400 SNPs. You can include Pan-Asian populations:
HarappaOracle("punjabi-arain",panasian=T)
There is also a mixed mode where the individual (or mean reference population) is compared against all pairs of populations as ancestors.
HarappaOracle("Haryana Jatt",mixedmode=T)
which has the following output:
[1,] "Haryana Jatt" "0"
[2,] "15.4% lithuanians + 84.6% Punjabi Brahmin" "1.9553"
[3,] "10.6% russian + 89.4% Rajasthani Brahmin" "2.0626"
[4,] "14.7% finnish + 85.3% Punjabi Brahmin" "2.0863"
[5,] "9.2% finnish + 90.8% Rajasthani Brahmin" "2.1142"
[6,] "89.4% Rajasthani Brahmin + 10.6% mordovians" "2.1727"
[7,] "9.6% lithuanians + 90.4% Rajasthani Brahmin" "2.1989"
[8,] "10.1% belorussian + 89.9% Rajasthani Brahmin" "2.2938"
[9,] "16.8% russian + 83.2% Punjabi Brahmin" "2.3015"
[10,] "16.2% belorussian + 83.8% Punjabi Brahmin" "2.3656"
You can of course combine any or all of the options.
Think of Harappa Oracle as a tool to help you interpret your admixture results by comparing who you are closest to. Do not think of it as giving you your real ancestry.
Recent Comments