Thanks to those who contributed their data for tests per haplotype. Here it is, summarized: Comparison Level Cruwys MacLaren Short Taylor Y-12 2.52 2.04 1.58 1.56 Y-25 1.66 1.42 1.36 1.16 Y-37 1.36 1.21 1.29 1.07 Y-67 1.00 1.18 1.00 1.03 What shows the pattern more clearly, however, is the inverse ratio -- haplotypes per test: Comparison Level Cruwys MacLaren Short Taylor Y-12 0.396 0.490 0.633 0.643 Y-25 0.604 0.704 0.735 0.860 Y-37 0.736 0.826 0.778 0.935 Y-67 1.000 0.847 1.000 0.971 For haplotypes per test, 1.000 represents "complete diversity"; every test produces a separate pattern. "Perfect uniformity", where each test produces the same haplotype, would approach zero (0). The data and a graph have been published at http://freepages.misc.rootsweb.ancestry.com/~taylorydna/diversity-dna.shtml. -rt_/) ---------------------------------------------------------------------- Message: 1 Date: Sun, 2 Jan 2011 15:47:11 -0700 From: "Ralph Taylor" <rt-sails@comcast.net> Subject: [Y-DNA-projects] Match Rates & Variety To: <y-dna-projects@rootsweb.com> Message-ID: <56DFEAE233AF4FC79871F0E065C4E56E@Ralphs> Content-Type: text/plain; charset="us-ascii" What does the number of unique haplotypes within a project say about the variety of the underlying target population's DNA? Or, more precisely, the ratio of the number tested to unique haplotypes. How, would this affect potential match rates? FTDNA's new GAP allows us to get some data from http://gap.familytreedna.com/project-statistics.aspx and http://gap.familytreedna.com/unique-haplotypes.aspx. (Secured sites -- log onto your GAP and go to the "Project Statistics" & "Unique Y-DNA Haplotypes" links.) We can combine these statistics to get inter-project relevant comparisons. For our Taylor project on 31 Dec 2010: Tests Number Haplotypes* Ratio Y-12 361 232 1.56 Y-25 272 234 1.16 Y-37 245 229 1.08 Y-67 139 135 1.03 *"Haplotypes" means unique haplotypes (also called "unique strings"). A difference on just one marker means a separate haplotype. Remember that everyone tested at a higher level is also tested at the lower levels, so gets included in the number tested and haplotypes. Naturally, there are fewer unique haplotypes at the lower levels, as there are fewer degrees of freedom. At the 12-marker comparison level, there are slightly more than 3 haplotypes for each 2 tested; at the 67-marker level, the ratio is slightly more than 1:1. A ratio of 1.03 67-marker tests (individuals) per haplotype would seem to indicate a wide variety in the target population's DNA; almost every man tested yields a different haplotype. Of course, many of the unique haplotypes will match another; matches allow some differences in haplotypes. Still, this seems to be a rough measure of underlying DNA variety within the project's population. Does anyone have any thoughts? How does your project's data compare? -ralpht_/) ------------------------------ Message: 2 Date: Mon, 3 Jan 2011 00:09:46 -0000 From: "Debbie Kennett" <debbiekennett@aol.com> Subject: Re: [Y-DNA-projects] Match Rates & Variety To: <rt-sails@comcast.net>, <y-dna-projects@rootsweb.com> Message-ID: <204EC91B0EFE40A4A7F318A309E27FBE@NEWGAMES> Content-Type: text/plain; charset="us-ascii" Ralph For what it's worth here are my figures for the Cruwys/Cruse project: Y-DNA12 53 21 2.52 Y-DNA25 53 32 1.65 Y-DNA37 53 39 1.35 Y-DNA67 16 16 1 I think I'm getting more diversity for my rarer surnames because of the higher proportion of project members from the UK who are less likely to be related to each other than their American counterparts, and also because I'm targeting specific lines for testing. Debbie Kennett http://www.familytreedna.com/public/CruwysDNA ------------------------------ Message: 3 Date: Sun, 2 Jan 2011 17:51:17 -0700 (GMT-07:00) From: fzsaund@ix.netcom.com Subject: Re: [Y-DNA-projects] Match Rates & Variety To: y-dna-projects@rootsweb.com Message-ID: <26932689.1294015877295.JavaMail.root@elwamui-darkeyed.atl.sa.earthlink.net> Content-Type: text/plain; charset=UTF-8 This is for the Short DNA Project, which is the largest I have. Number hapoltypes ratio Y-12 60 38 1.58 Y-25 49 36 1.36 Y-37 45 35 1.29 Y-67 13 13 1.00 Rick Saunders ------------------------------ Message: 4 Date: Mon, 3 Jan 2011 01:02:16 -0500 From: "robert mclaren" <bobmclaren@earthlink.net> Subject: Re: [Y-DNA-projects] Match Rates & Variety To: y-dna-projects@rootsweb.com Message-ID: <410-220111136216406@earthlink.net> Content-Type: text/plain; charset=US-ASCII For the Clan MacLaren Project as of 2 January 2011, I have: Tests Number Haplotypes* Ratio Y-DNA12 468 229 2.04 Y-DNA25 452 319 1.42 Y-DNA37 425 352 1.21 Y-DNA67 323 273 1.18 Yours aye, Bob McLaren