Tag Archive | DNA clustering

Automated Tree Building with Genetic Affairs

Clustering has changed the way many of us work on genealogy mysteries and unknown parentage cases. Genetic Affairs was just one of the sites offering automated clustering (click here for my first clustering post), but then they added tree building. That’s right, they make tree diagrams for each cluster that has at least two people with trees that can be matched up. They even include a GEDCOM for those trees in the zip file they send.

One way I use these diagrams is to show cousins how we are related. Another is to solve unknown parentage cases. I use both DNA2Tree and Genetic Affairs and then go with whichever seems to have the more relevant-looking trees. The advantage of Genetic Affairs (GA) is that it will look at the unlinked trees and at your ThruLines. Also, once you are used to the format, the output is easy to glance over to see what is worth pursuing. Click here for my recent post on automated tree-building tools.

Above is the diagram GA built for the descendants of my gg-grandparents who are in the lonely box on the far left. Click on the image for a larger image in a new tab. My great grandparents lived on farm Skjold in Etne, Hordaland, Norway and had eight children, four of whom, plus the child of another, emigrated to the USA and have many tested descendants at Ancestry.

Here is a key to what you are seeing. The green box on the bottom line is me. The mustard yellow box means that the match’s unlinked tree was used from that person on down. The people in pink in the middle were determined from my ThruLines. Living people are shown as just id numbers, except for your matches who are shown by the name they have chosen to be seen as. All DNA matches are on the far right and are also colored pink with the source and the amount shared listed. Clicking on a match gets a little box to pop up in the lower right corner (as shown) with the name of their family tree, clickable to their Ancestry tree.

The purple box with the word ANCESTRY indicates the source of the tree information. Another GA feature is the ability to cluster both your Family Tree DNA matches and your Ancestry matches together.

When names are listed differently in other trees, they will be shown in these diagrams as separate people. Notice that in the second column from the left, the software could not tell that the A. Skjold who married L. Stephenson is the same person as the A. Halvorsdtr skjold who married L. Stephenson Fjaere. Norwegians did not have fixed surnames, so we usually use the farm name as a surname in our trees. Upon arriving in this country they often chose to use the patronymic, so Stephenson rather than Fjaere (click here for more on Norwegian naming). However, the other Anna Halvorsdtr Skjold listed between those two really is a different person, and she married a Thompson. Reusing first names is another bane of the Norwegian genealogist.
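To see why this is hard for software, here is a minimal sketch of a looser name comparison. Nothing here reflects how Genetic Affairs actually compares names; the rule used (same first initial plus one shared name part, ignoring case and periods) is purely an assumption for illustration, and the function names are made up.

```python
def name_parts(name):
    """Split a name into lowercase parts with trailing periods removed."""
    return [part.strip(".").lower() for part in name.split()]


def loosely_same(name_a, name_b):
    """Hypothetical rule: same first initial and at least one shared later part."""
    a, b = name_parts(name_a), name_parts(name_b)
    same_initial = a[0][0] == b[0][0]
    shared_part = bool(set(a[1:]) & set(b[1:]))
    return same_initial and shared_part


# An exact comparison treats the two spellings as different people.
print("A. Skjold" == "A. Halvorsdtr skjold")

# The loose rule links the two spellings...
print(loosely_same("A. Skjold", "A. Halvorsdtr skjold"))

# ...but it also links the other Anna, who is a different person.
print(loosely_same("A. Skjold", "Anna Halvorsdtr Skjold"))
```

The loose rule also matches the Anna who married a Thompson, so any real comparison would need to look at the spouse as well before merging two people.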

This tree building capability from Genetic Affairs recently helped me solve an unknown father mystery.

When “Amy” discovered her brother was only a half brother by doing an Ancestry DNA test, she was very surprised. She had heard that her mother was pregnant with her when marrying her late father, but everyone knew he was her Dad, or so she thought. Her mother was not willing to discuss this, so she asked for my help to figure out her biological father from the DNA.

More Automated DNA Match Clustering!

Have you been wondering why all your favorite bloggers are going crazy for automatic clustering? Well, it is a fun visual technique to see which matches belong to which family line: you make a chart with your matches across both the top and side, group them by who matches whom, and then color in each box where the two matches also match each other. This creates visual clusters which will roughly correspond to your great grandparents or their parents.
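For those who like to see the idea in code, the chart can be sketched in a few lines of Python. This is only an illustration, not how any of these tools work internally: the match names and the `shared` pairs are invented, and a plain `#` stands in for a colored box.

```python
# Hypothetical data: each pair means "these two matches also match each other".
matches = ["Ann", "Bob", "Cara", "Dan", "Eve"]
shared = {("Ann", "Bob"), ("Ann", "Cara"), ("Bob", "Cara"), ("Dan", "Eve")}


def build_matrix(names, pairs):
    """Return a square grid; True where the row match and column match are linked."""
    linked = set(pairs) | {(b, a) for a, b in pairs}  # matching works both ways
    return [[(r == c) or ((r, c) in linked) for c in names] for r in names]


def render(names, grid):
    """Draw the grid as text, one row per match, '#' for a filled box."""
    width = max(len(n) for n in names)
    lines = []
    for name, row in zip(names, grid):
        cells = " ".join("#" if filled else "." for filled in row)
        lines.append(f"{name:<{width}} {cells}")
    return "\n".join(lines)


grid = build_matrix(matches, shared)
print(render(matches, grid))
```

Because the names are already listed group by group, the filled boxes form two squares along the diagonal, one per cluster. The real tools also have to work out that ordering, which this sketch does not attempt.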

My perfect cousin has many matches on all her great grandparent lines (green is my Munson side) so I used her to showcase the new DNAgedcom clustering above. Notice how similar it is to her cluster from Genetic Affairs shown in my previous blog about that site and tool.

Here are all the new ways to cluster our DNA matches:

  • DNAgedcom now has a clustering tool in their client (DGC) which uses your Ancestry match list and ICW files (described in detail in the read more below)
  • Genetic Affairs has Ancestry clustering working again
  • DNA Painter created a tool to create a CSV from the Genetic Affairs HTML cluster file. Some of us love to use spreadsheets.
  • Andy Lee of Family History Fanatics figured out how to take an autosomal match matrix from GEDmatch and cluster it in a spreadsheet program. Click here for that video – the explanation starts just after 42 minutes and this is really fun!
  • Rumor has it that GEDmatch may add automatic clustering sometime in the new year…

All of this is based on the method developed by Dana Leeds to organize your matches which is easy and simple to do. Click here for her blog about that.
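Here is a rough sketch of the Leeds idea as it is usually described: go down your match list, give the first match without a color a new color, and give that same color to everyone on that match's shared-match list. The names and shared-match lists below are invented, and the real method is done by hand in a spreadsheet, so treat this as an approximation rather than Dana's exact procedure.

```python
def leeds_colors(ordered_matches, shared_with):
    """Assign color numbers to matches, Leeds style.

    ordered_matches: match names, strongest match first.
    shared_with: dict mapping each name to the names on its shared-match list.
    Returns a dict of name -> list of color numbers (a match can get several).
    """
    colors = {name: [] for name in ordered_matches}
    next_color = 1
    for name in ordered_matches:
        if colors[name]:
            continue  # already colored via an earlier match
        colors[name].append(next_color)
        for other in shared_with.get(name, []):
            if other in colors and next_color not in colors[other]:
                colors[other].append(next_color)
        next_color += 1
    return colors


# Hypothetical match list and shared-match lists.
ordered = ["Ann", "Bob", "Cara", "Dan", "Eve", "Finn"]
shared_with = {
    "Ann": ["Bob", "Cara"],
    "Bob": ["Ann", "Cara"],
    "Cara": ["Ann", "Bob", "Dan"],
    "Dan": ["Cara", "Eve"],
    "Eve": ["Dan"],
    "Finn": [],
}

result = leeds_colors(ordered, shared_with)
for name in ordered:
    print(name, result[name])
```

In this made-up data Cara ends up with two colors, which is the kind of match that may be related on more than one line.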

Read on for how I used the new DNAgedcom clustering tool for myself and my brother, where I know all our great grandparents.


Automatic Clustering from Genetic Affairs

My genealogy groups are buzzing with excitement about a new tool from Genetic Affairs to automate the clustering of your DNA matches. This takes the Leeds method concept to another level.

Everyone is posting pretty cluster pictures like the one below that I made for my perfect cousin, the star of many of my blog posts. This is a table where each DNA match is listed on the top and side; then if they match each other, the box is colored in with the color for that cluster. The chart is sorted by cluster. The idea is that each colored cluster shows descendants from a probable great grandparent couple of yours.

The gray boxes show where people match others outside their cluster. This can often happen when families intermarry more than once, or when matches are first cousins enough times removed to fall in the second or third cousin group by DNA but are related to more than one set of great grandparents.
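One way to picture the gray boxes: once every match has a cluster number, any pair who match each other but sit in different clusters would be drawn gray. A small sketch with invented names and cluster numbers (not the actual Genetic Affairs logic):

```python
# Hypothetical cluster assignments and shared-match pairs.
cluster_of = {"Ann": 1, "Bob": 1, "Cara": 1, "Dan": 2, "Eve": 2}
shared = [("Ann", "Bob"), ("Bob", "Cara"), ("Cara", "Dan"), ("Dan", "Eve")]


def split_pairs(pairs, cluster_of):
    """Separate pairs inside one cluster (colored) from pairs across clusters (gray)."""
    colored, gray = [], []
    for a, b in pairs:
        if cluster_of[a] == cluster_of[b]:
            colored.append((a, b))
        else:
            gray.append((a, b))
    return colored, gray


colored, gray = split_pairs(shared, cluster_of)
print("colored:", colored)
print("gray:", gray)
```

Here only the Cara and Dan pair crosses clusters, so that is the one box that would show up gray.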

Automated clustering is useful because it puts your DNA relatives who are related to each other into visual groups so that you can quickly see which line a new match is related on. The picture is pretty but the workhorses are the charts for each cluster shown below that image when you scroll down. Here is the privatized one for my “perfect” cousin showing our MUNSON cluster.

Each name can be clicked to go to that Ancestry match page. Much useful additional information is shown next to the username: how many cM are shared, how many matches are shared in the whole group, the cluster number, how many people are in their tree, and the notes you made for that match.

The image and charts are from the HTML file which arrived via email from Genetic Affairs after I requested automated clustering for my cousin’s Ancestry profile, which is shared with me there. You have to save the HTML file to your computer and then click on it to view it. When it first comes up, it is a mish-mosh sorted by name, but then it re-sorts itself by cluster. Fun to watch. Click here for the step by step of how to use this tool from the Intrepid Sleuth. It can also cluster matches from other sites like 23andMe.

I decided to try it on an unknown father case I had not gotten around to working on yet, to see if it would speed up the process, and it did: to under an hour! A new record.
