Using Stranger Matches in Visual Phasing

With the standard three-sibling visual phasing, it’s usually easy to tell who owns each crossover. However, if you’ve adapted the technique for two siblings plus others, such as a nephew (like I did) or a half sibling, sometimes you can’t. In those cases, stranger matches can be very helpful. In this post, I’m going to demonstrate the use of strangers in visual phasing, using an example from my own results.

In a previous post, I showed how I did visual phasing with two siblings and a nephew (the son of a 3rd sibling). On each chromosome, I start the same way, with my mother (my nephew’s grandmother) in the extra view. I the set the crossover points and label the ones I can.

On Chromosome 4 (figure 1), I could label all of them except for 2 – one at 27.3 Mb and one at 174.5 Mb (outlined in red). I know these crossovers belong to either L or J, but I can’t tell which.

Chr 4 - Fig 1
Figure 1

Following the same technique I used previously, and with the help of some known cousin matches, I was able to figure out most of the chromosome, with the exception of those two unlabelled crossovers.

Figure 2Time to bring in some strangers to help, starting at the crossover on the left, at 27.3 Mb.

Because I know this crossover belongs to L or J, and because I know it’s a paternal crossover, I need to find some paternal matches to L and J over that section of Chromosome 4.

Since our mother has been tested and her kit uploaded to GEDMatch, I’ve already used the Phasing utility on GEDMatch to create phased kits for both L and J. I run the GEDMatch Tier 1 Matching Segment Search utility on the two phased paternal kits and look for matches around the 27.3 mark on Chr 4.

Matching kits at 27I found 6 people who match L from about 22 Mb to 34 Mb. Those same people match J from 22 Mb to 27.3 Mb. So, even though I have no idea who these matches are (hence the term “stranger matches”), it doesn’t matter. This is enough to tell me that the crossover at 27.3 must belong to J. That enables me to complete the first section.

crossover 27

Let’s look at the other end, around 174.5.

crossover 174This one is less clear-cut. Neither of us have matches that cross 174.5. We have the same two matches (Karen and Mary B) from 155 Mb to 163 Mb, as expected. And after 174.5, we have different matches, as expected. But who has the Cooper matches and who has the Sharpe matches? To figure this out, I needed to dig into these matches a bit further.

First, I ran the Multiple Kit Analysis on GEDMatch for each kit against L, J & R, to see if any of them matched on other segments besides the one on Chr 4. The only one that did was John M. (the last line of the J table), who matches J and R on Chr 17, in addition to the match with J on Chr 4.

comparison 17Looking at the visually phased Chr 17, John M would appear to be a Sharpe match (paternal grandmother)

Chr 17.pngAs well, I looked up John M in J’s FTDNA matches (his last name appeared in GEDMatch – I removed it for privacy) and found that he has a tree attached to his DNA results. He has ancestors from the same small town in New Brunswick as my grandmother’s ancestors, including one with a surname in that line. Since our paternal grandfather and grandmother came from different countries, it would be highly unlikely that we match on Chr 4 on one line and Chr 17 on a different. And since both segments are of a decent size (12cM and 14.9cM), it’s unlikely that one is a false match.

So while I haven’t (yet) figured out exactly who our common ancestor is, I’m quite confident that John M. is a Sharpe match.

Therefore, the crossover at 174.5 also belongs to J.

complete 174.pngAnd Chromosome 4 is complete

Chr 4 complete.pngWith the help of some stranger matches.

X-Chromosome & Recombination – an Example

The X-chromosome is passed down differently than the other 22 chromosomes. Since men only have one X chromosome, which they get from their mothers, when a man passes his X onto his daughters, he passes an intact X. As women have two X-chromosomes, one from each parent, when she passes her X onto her children, she could either pass along her paternal X, her maternal X, or a mixture (recombination), like with any other chromosome.

According to a study done by Blaine T. Bettinger (The Genetic Genealogist), in 14% of cases, the X-chromosome is passed down intact, with no recombination. Most X-Chromosomes undergo one or two recombinations. Here are the percentages seen among the 250 samples in the study.

Recombinations
Since I’ve done Visual Phasing using my results, my sister’s and my nephew’s, I thought it would be interesting to see how our X-Chromosomes compare. Here’s the overall view. The bottom shows the match between each of us and my mother. As expected, my sister and I share a full X with our mother (if we didn’t, she couldn’t be our mother!). My nephew does not.
Chr 23 full.jpg

First up, me. As a female, I received two X chromosomes – one from each of my parents. Since my father only had one to give me, from his mother (Sharpe), of course I got his intact. From my mother, I happened to also receive an intact X chromosome with no recombination. I got her maternal (Hicks) X chromosome.

Chr 23 Leanne

Next is my sister, J. She too got intact chromosomes from both parents, but in her case, she received our mother’s paternal (Prowse) X chromosome. So while we each received an X with no recombination, we received different ones.

Chr 23 Jayne

And finally, my nephew, R. As a male, he only received one X chromosome, from his mother (my eldest sister). Because of the way I charted them, it looks like this:

Chr 23 Ryan

But in reality, it’s all on one chromosome, so it would look more like this:

Chr 23 Ryan complete

We can see that R’s X chromosome underwent 3 recombinations. From this, we can also see what my eldest sister’s chromosome must have been – something like this:

Chr 23 Debbie

The recombination point could be anywhere within the red square. Where exactly it occurs is unknown, but we know there is one, and probably only one.

So, of my mother’s three children, one received her complete maternal X chromosome, one received her complete paternal X chromosome, and the third received a combination.

DNA never ceases to fascinate me!

Benefits of Visual Phasing

Visual Phasing is a technique whereby the DNA of siblings is assigned to each of the four grandparents. It is usually done with 3 siblings, though you can adapt the technique, as I did, with 2 siblings and a nephew.  As I mentioned in that post, I knew I want to try it as soon as I heard about it. I wasn’t really sure whether it would be helpful for me, but I was curious about the technique. While I was waiting for my sister’s and nephew’s results, I was chatting with a genealogist I know who referred to Visual Phasing as a “party trick”. She didn’t feel that there was much to be gained from doing it.

Having now mapped most of my chromosomes using Visual Phasing, I respectfully disagree – at least for me. Your mileage may vary.

Following are a couple of ways that Visual Phasing has helped me:

 1. I can easily identify whether a match is on a line of interest

The vast majority of my DNA matches are on my maternal grandmother’s side, as this line has deep Colonial American roots. As discussed in a previous post, I’m currently particularly interested in matches on my maternal grandfather’s side, as that’s where I’m attempting to break down a brick wall.

When I get a new match on GEDMatch, MyHeritage or FTDNA, I can quickly and easily figure out which side the match is on. For example, on GEDMatch, when I get a new match, I run a “Multi Kit Analysis”, select “Manual Kit Selection/Entry”, enter the new kit number in the 1st box, then compare it against my mother (FC), me, my sister (JK) and my nephew (RM):

Multi Kit Analysis

I click on Visualization Options, and select 2-D Chromosome Browser

Match Chr 11

For this match, I see that on Chromosome 11, she or he matches my mother (so I know it’s on my maternal line), matches R from 115 to 129, and matches me (L) from 119 to 129. I compare this to the phased chromosome:

Compare v11

As you can see by the the section outlined in red, this must be a Prowse match (purple). Since that’s my mother’s paternal side, the line I’m particularly interested in, I add this match to my spreadsheet as a match of interest. Doing this systematically has helped me develop a subset of matches to work with.

2. I have a much better understanding of how DNA is passed down.

When I first got my DNA results and started working with matches, there was a lot I didn’t quite get, like:

  • Why, with some matches, do I share the exact same amount as my mother, with some I share about half, and with others still I don’t share any at all?
  • Why do I match some people that my sister doesn’t, and vice versa?
  • How can my nephew share more with a person than I do, when he’s one generation farther back from the match than I am?

I have since learned that these are extremely common questions. I quickly learned that the stock answer is “because of the randomness of how DNA is inherited”. It was only when I started doing visual phasing that I really got it. It makes sense to me now. For example, with the match above my mother and my nephew (her grandson) share about the same amount with this person, I share a bit less, and J is not a match at all. And that’s totally normal.

It took seeing it for me to really understand.

I know that not everybody can do Visual Phasing. If you don’t have siblings to work with, you’re out of luck. But if you do, it’s worth considering. Of course, only you can decide if there’s a value to you – it depends on what you’re trying to accomplish.

For me, it was worth it. Plus, it’s kinda fun – but then again, spreadsheets and graphics make me happy.

What do you think? Party trick, or valuable tool in the genetic genealogy toolbox?

Visual Phasing with Two Siblings and a Niece/Nephew

Are you familiar with Visual Phasing?

VP11 intro

I’ve always loved logic puzzles, so when I first heard about Visual Phasing as a way to determine which segments of my DNA I got from which grandparent, I knew I had to try it. Problem is, you need three siblings to do Visual Phasing, but as my oldest sister died over ten years ago, I only have two siblings to work with. But I do have a nephew, my oldest sister’s son. When I saw a blog post on Genealogy Lady about VP with 2 siblings, I figured I should be able to adapt the process to working with 2 siblings and a nephew. Besides, I also have my mother’s DNA results, so that would be a big help.

Both my sister and nephew agreed to be tested. While I waited for their results to come in, I started practising using a friend’s results, along with his two siblings. I did enough to get a feel for how it worked. That was very helpful before trying to adapt it to my situation.

Here’s the process I followed. I’m assuming that you understand the basics of Visual Phasing, so I won’t be explaining every step – only the ones that differ from the standard 3-sibling approach. See here for the basics, and if you’re going to attempt Visual Phasing, I highly recommend you join the Visual Phasing group on Facebook. I used Steven Fox’s fabulous Excel Visual Phasing spreadsheet (available through the Facebook group).

Here’s the setup:

  • L – me
  • J – my sister
  • R – my nephew
  • Paternal grandfather – Cooper (orange)
  • Paternal grandmother – Sharpe (blue)
  • Maternal grandfather – Prowse (purple)
  • Maternal grandmother – Hicks (green)

In the setup, I included my mother in the cousin table. While this wouldn’t be of much use with 3 siblings, since all would share a complete chromosome with her, including her is very useful for comparing to my nephew R (her grandson). While I don’t have any 1st or 2nd cousins to work with, I have a few 2C1Rs on my Cooper side, and lots of 3rd-4th cousins on my Hicks side. I have very few known cousins on my Sharpe or Prowse lines (at least not that have uploaded to GEDMatch, or anywhere else with a chromosome browser).

Figure 1: I started with my mother in the extra view, and set the crossover points

VP11 - Fig 1
Figure 1

Note that in addition to the crossovers that you see in the top section, there’s an additional one for R compared to his grandmother (outlined in red). I included that as well, even though it doesn’t appear when comparing R against L or J.

Figure 2: I added the megabase values and assigned the crossovers.

VP11 - Fig 2
Figure 2
  • Red circles – these crossovers clearly belong to R
  • Blue circles – the first one is J, 2nd is L
  • Green circles – these ones are either L or J, but I can’t tell which
  • Purple circles – the crossovers at either end are confusing. I’ll leave them unlabelled for now and come back to them later.

Figure 3: I shaded out the unnecessary parts of R’s chromosome.

VP11 - Fig 3
Figure 3

This is where it is different using a nephew instead of a sibling or a even a half sibling. When I first did this, I shaded out the whole paternal (upper) chromosome, since I only share R’s maternal chromosome with him (he’s my sister’s son). In the maternal (lower) chromosome, I used all four grandparents. When his mother (my sister) passed on her chromosome to him, recombination would mean that he would get segments from all 4 of her grandparents (his great-grandparents).

The spreadsheet gave me an error message every time I had more than two grandparents represented on the same line. I posted in the Visual Phasing Facebook group asking how I could turn off the warning. Someone suggested that I do it this way instead, which worked much better.

So in Figure 3 (above), where the bottom line is blacked out, it means that R received his mother’s paternal chromosome and there is no match to his maternal grandmother (seen in the extra view). Where the top line is blacked out, R received his mother’s maternal chromosome, as can be seen in the extra view.

Figure 4: Here’s where we start assigning segments. I found it easiest to always start with R, and since I knew whether the segment was maternal or paternal, I didn’t need to use G1, G2, G3, G4 at all, and could start straight away with P1, P2 and M1, M2.

VP11 - Fig 4
Figure 4

I picked a maternal segment of R’s, and labelled it M1. Since L&R match and J&R don’t, L=M1 and J=M2. And since L&J don’t match on that segment at all, I set the paternal segments to L=P1 and J=P2.

Figure 5: I could then extend to the next crossover points.

VP11 - Fig 5
Figure 5

Figure 6: There’s a crossover on either side of R’s M1. On the right, it clearly goes from M1 to M2. On the left, it switches from M to P. Since he matches L and not J, it must be P1.

VP11 - Fig 6
Figure 6

On the far left, there’s a crossover at 2.6. Before that crossover, R matches both L and J, but L and J don’t match each other. Could it be that L and J do match, but it doesn’t show up with the GEDMatch defaults of 7cM and 500 SNPs?

I ran a GEDMatch comparison of L & J and dropped the thresholds to 3cM and 300 SNPs:

Gedmatch 11

Sure enough, L & J do match at the beginning (and also at the end, we’ll come to this later). Therefore, the crossover at 2.6 must belong to J (Figure 7), and it’s a crossover on the paternal side.

VP11 - Fig 7
Figure 7

Figure 8: At 36, there’s a crossover that I already established belongs to J. Since we know that R is maternal at this point, and J&R changes from matching to not matching at this point, then this must be a maternal crossover, where J changes from M2 to M1.

VP11 - Fig 8
Figure 8

Figure 9: At 44.5, R changes from maternal to paternal (don’t forget, that’s not his paternal, it’s his mothers. We’re not dealing with his paternal chromosome at all). Since he matches L, it must be P1. Then, at 82, it changes from P1 to P2, and stays the same until R’s next crossover.

VP11 - Fig 9
Figure 9

Figure 10: At 92.7 and 107, there are crossovers belonging to L or J, but we don’t know which. It’s not on the paternal side, as there’s no change in the match of either to R. But which is it? Does L change from M1 to M2, or does J?

VP11 - Fig 10
Figure 10

We’re stuck. Without knowing which one changes here, we can’t go any further. Nor can we know whether R’s next section is M1 or M2. Time to start bringing in some cousins. Maybe that will help.

Figure 11: First up, is George, my 2C1R on my Cooper (paternal) side. Since he matches L but not J, we now know that P1 = Cooper (orange) and P2 = Sharpe (blue).

VP11 - Fig 11
Figure 11

Figure 12: Next up, is E. Hicks, a 3C1R on my Hicks (maternal) side.

VP11 - Fig 12
Figure 12

Because she matches in the new section, we know that the maternal side there is Hicks. Note that if you’re using Stephen Fox’s Excel spreadsheet to do this, you will get an error if you have more than 2 entries on the same line (like here, where I have M1, M2 and Hicks). You just need to click OK, and you can keep going.

Although it looks like the match between L & E. Hicks extends farther on the right than J, if you look at the numbers, they both end at 113,620,851.

However, since J’s match only begins at 107, we now know that that crossover must belong to J. As well, we can do some extending to the right (Figure 13).

VP11 - Fig 13
Figure 13

But what about the crossover at 92.9 (outlined in red)? It could be an L crossover, which would mean that M1 = Prowse. Or it could be a J crossover, which would mean that M1=Hicks. Which is it?

Fortunately, I have another known 4th cousin match on Chr. 11 (along with a larger segment on another chromosome). I hadn’t included him when I entered cousins into the spreadsheet, so I ran a GEDMatch Multi Kit Analysis on GW, FC (my mother) and the three of us.

GEDMatch - G.W.

G.W. is a match on the Prowse side. Therefore, we know that M1=Prowse and thus the crossover at 92.9 belongs to L (Figure 14). So M2=Hicks.

VP11 - Fig 14.png
Figure 14

Figure 15: The crossover at 119.2 belongs to L. And since there’s R and L match after that point, it must be a maternal crossover from Hicks to Prowse.

VP11 - Fig 15

Which brings us to the final crossover, after which it appears that nobody matches anyone else. However, it’s impossible for R to match neither L or J, if L & J don’t match each other. There must be some matching at a lower threshold.

L & J:

Gedmatch 11 L vs J

L & R: No match

J & R:

Gedmatch 11 J vs R

Therefore, we need a crossover that will result in L=J, L≠R, J=R

  • If the crossover belongs to L (paternal), then L=J, but L=R and J≠R
  • If the crossover belong to L (maternal), then L=J and L≠R, but J≠R
  • If the crossover belongs to J (paternal), then L=J, but L=R and J≠R
  • If the crossover belongs to J (maternal), then L=J and J=R, but L=R
  • If the crossover belongs to R, then J=R and L=R, but L≠J

I may like logic puzzles, but that just broke my brain. Maybe there are two crossovers. Maybe one of those matches is real and one isn’t. In any case, I looked at each of our matches on GEDMatch’s Matching Segment Search tool (Tier 1), and none of us have any matches at that part of the chromosome (at least not at the default thresholds), and it’s a very small segment (3.8cM) so I’m not going to worry about that little bit at the end, at least for now.

So, the last step is to merge the segments, and here we have it:

VP11 - Fig 16
Figure 16

Or to just look at the chromosomes:

VP11

You can see that on the paternal side, I (L) received a complete Cooper chromosome, whereas J received an almost complete Sharpe one. R’s mother got both. We all have some recombination on the maternal side.

Chromosome 11 was a fairly easy one to figure out (with the exception of that end bit), since I had some good cousin matches to work with. In a later post, I’ll share some of the more challenging ones I faced.  As well, I’ll do a post on how VP has helped me in my research.

Until then, if you have any thoughts or questions on this, or if you need any clarification, please leave a comment and I’d be happy to respond.