Prediction of an individual affiliation to a major population group based on information from a small set of autosomal STRs
New: PopAffiliator 2 is now available at http://cracs.fc.up.pt/~nf/popaffiliator2
To calculate the assignment of an individual to a major population group (Asia, Eurasia, sub-Saharan Africa) the values in the form bellow should be provided. The range for the allele size was restricted to the ones published in the Short Tandem Repeat DNA Internet DataBase.
The output will indicate the probability of assignment to the major population groups. The accuracy of individual population affiliation assignment is approximately 86%. The probabilities are computed using a machine learning model built as described in:
- Pereira et. al. PopAffiliator: online calculator for individual affiliation to a major population group based on 17 autosomal STR genotype profile. International Journal of Legal Medicine. 2010. (in press)
The STR collection database used to train and evaluate the machine learning model encompasses data gathered from more than 40 different studies and contains a total of 56,222 individuals, distributed by 7 major geographical locations: East Asia, Eurasia, sub-Saharan Africa, North Africa, Near East, Central-South America and North America. The data is available here.