Y-DNA

Comparative overview of distribution of Y-DNA haplogroups in Europe

We hope this can be useful to any layman or enthusiast of Genetic Geneology who wants to quickly learn about distribution of main haplogroups in Europe. Beside displaying distribution of main haplogroups, this charts are also intended for easy comparision of haplogroup frequencies among countries/peoples of Europe. Since frequencies for many countries/peoples are made with small number of available haplotypes,...

NevGen’s coefficients of fitting of subclade into another subclade

(last updated on January 10th, 2018.) Experienced genetic genealogists know that haplotypes of some subclades are relatively easy to be confidently predicted on some set of markers. For example, haplotypes belonging to R1b subclades L226 and M222 are easily recognisable even on 37 markers, and accordingly haplotypes of those two subclades are easy to distinguish from haplotypes of any other...

Plans for future

We plan to add in future many new subclades to some other haplogroups, like J, I, E, and some others. In future we might make deep separate levels for other haplogroups, like it is for R1b and R1a. This description of NevGen predictor and manual will be also updated from time to time. First edition of this description writen on...

R1b level of NevGen

R1a level for NevGen is added on June 11th 2016, more than 40 days after its brother subclade R1b. For now it has 71 subclade. Testings showed something people in GG already knew, that R1a deep subclades are much easier to predict than subclades of R1b. R1a level suffers even more than R1b level from low number of haplotypes for...

Incomplete values

Incomplete values (like 18.2, 24.3, 15.1 and such) are not supported in Internet version. Reason is simple: lists (one for every STR marker) does not have incomplete values in their contents, because it could be too heavy burden. Desktop edition of NevGen (not available on internet) does have support for incomplete values in its structures, code and data, and uses...

Priority in Athey’s predictor. Why there is no such thing in NevGen?

In Athey’s haplogroup predictor there is list box when user can select one of several areas: “Northwest Europe”, “South Europe”, “East Europe”, and others. Its effect on calculation of probability of some subclade is multiplying already multiplied frequencies of given haplotype’s alleles in subclade with subclade’s frequency in chosen area, before they are normalised with sum of them all. In...

False positives

Bayesian approach which NevGen predictor uses has one bad property, or problem: it assumes that entered haplotype belongs to one of subbranches supported by predictor. So it divides 100% of probability to its supported haplogroups. But this assumption need not be true in every case. No predictor is complete, it does not support every known haplogroup or subbranch. Beside that,...

About NevGen haplogroup predictor

First version of NevGen predictor was made in May of 2014. In that time it was available only as desktop application (and its existence was known only to small number of people in Serbia), and contained only part of today available subclades. Gradually during next two years many new subclades have been added, many existing subclades have been divided into...