Have you ever wanted to make a kit combining all your DNA tests at the different companies so as to get the most SNPs for comparisons? Well GEDmatch GENESIS is now providing that for paid members. So of course I made one of these superkits for myself! I combined my LivingDNA with my V3 23andme and my current Ancestry kit. Now to investigate what I have gained from this.
The first thing I did was compare this new kit to my recent Ancestry kit. All looked fine. It has the expected small differences, many of which disappeared (including the black lines) when I checked the prevent hard breaks box on the form. The older 23andme kit comparison had more black spots and mismatches.
My next thought about my new superkit was that I might get a better comparison to cousins who tested more recently at 23andme but none of them have uploaded to GENESIS yet. So I checked how my comparison to an Ancestry tester, my second cousin once removed Jeanie, looked. The superkit gets the same result as my recent Ancestry kit. When I compared her to my 23andme kit and my Living DNA kit however, there were small differences.
The multiple kit analysis function works beautifully with tag groups. Another benefit of tag groups, is that when I don’t remember the kit number of a cousin whose results I wish to view, I can look it up quickly by displaying the people in that tag group (from the View/Change your profile (password, email, groups) on the top left)
My previous post about tag groups mentioned that tag groups are a quick way to see where a new match fits in by looking at their one to many page for your tag colors. However this is less useful for a distant cousin match (fewer colored tags) or an iffy paper trail match. In those cases I put the new person in my Unknown group (which only ever has the one person being analyzed) and then compare with all the relatives I expect a match to, by using their tag groups.
The main GEDmatch page has a box called Analyze Your Data and towards the bottom of that box you can see Multiple Kit Analysis with a big red NEW next to it. The “new” is because you can now use tag groups for this analysis. When you click Multiple Kit Analysis to get to that function, you will see a page like the one shown below. The old way of doing multiple kit analysis, by typing in each one, is still available from the Manual Kit Selection/Entry tab on this page or by checking boxes in various other functions like one-to-many.
My tag groups: note that I am using shades of aqua and blue for my Etne, Hordaland, Norway descended cousins
You can check the tag groups of interest and compare them to the new person (the Unknowns group for me) in all the wonderful ways the multiple kit analysis gives you (Click here for the slides on that from my most recent GEDmatch presentation).
Recently I have been searching for a “Lee Oleson” who is the grandfather of a third cousin match at Ancestry. He was only in town long enough to get my match’s grandmother with child. This third cousin’s one to many lights up with the colors of my Etne, Hordaland, Norway side relatives. So I set myself a project of tracing forward all the descendants of the eight children of my Etne great-great-grandparents to see if I could find Lee.
Perhaps this post needs the subtitle , “My Perfect Cousin Goes to GEDmatch.”
Most of us can keep track of information in spreadsheets. So how to do that with DNA? Well, the idea is to keep a list of matching DNA segments so that a new match can be compared to your known family members. That way you may be able to see where they fit in.
If you have tested at 23andme or Family Tree DNA, you can download your list of matches with their matching DNA segments either directly from your testing company or by using the tools at DNAgedcom. However AncestryDNA does not provide a list of matching segments.
Extract from my Dad’s Master DNA Segment Spreadsheet (click for a larger version)
Why would you want those? The short answer is to figure out which line a new DNA cousin belongs to. For the long answer, read on. For more posts about DNA spreadsheets click here or in the tag cloud, lower right hand column.
AncestryDNA testers can make a DNA segment spreadsheet by using any of a number of utilities at the GEDmatch web site. Start by uploading your raw DNA data (click here for that “how to” post). Your results will usually be ready for full comparisons the next day. Then buy the tier 1 utilities for at least one month ($10).
My preference for making a first spreadsheet is to use the Tier 1 GEDmatch Matching Segment Search. Then I go through the top matches from the ‘One-to-many’ matches report with that spreadsheet as a reference. I add notes on what I discover to my new spreadsheet.
Here is the step by step of what I did for my perfect cousin J.M. whose AncestryDNA results I blogged about in my previous post.
Recently I gave an updated talk about GEDmatch.com for my local DNA special interest group, DIG, here in San Diego. GEDmatch.com is a DNA geek’s playground, but many less computer inclined folk find it difficult at first.
It is the only place for those who have tested at Ancestry DNA to compare their results to a possible relative, chromosome by chromosome. It also has many tools that are unique such as ancestry composition calculators with more recent breakdowns and more categories than the main companies. I covered those in detail in my original talk about GEDmatch tools. Those slides are at http://slides.com/kittycooper/gedmatch#/5
The new talk – http://slides.com/kittycooper/gedmatch-10#/ – covered uploading your data, how to manage your kits and mark a kit for research, and much detail on the one-to-many function as well as all my other favorite tools (starred in the image to the left).
There is a new 23andme upload which is nice and fast as it uses the API so you actually log into your account there rather than uploading a file.
It makes sense to upload all your kits when you have tested at more than one company but please mark all but one kit as research only, so DNA relatives are not confused by seeing so many versions of the same person.
There are four exciting new utilities at GEDmatch.com which I plan to cover in depth over the next several days. These are only available to for people who have donated at least $10 (every additional $10 gets you these for another month). A good way for GEDmatch to pay for their extra server costs. The rest of the site will remain free. The utilities are:
- A Matching Segment Search – Get a list of all your segment matches suitable for cutting and pasting into a spreadsheet
- A Relationship Tree projection – calculates probable relationship paths based on Autosomal and X-DNA Genetic Distances. It is experimental, try it and give them feedback
- Lazarus – Construct a kit to represent a close ancestor, wow!
- Triangulation – takes your top 300 matches and finds which ones match each other with details. The format can be copied to a spreadsheet