3.4 Conditional versus unconditional probability

Be careful to distinguish between conditional and unconditional probabilities.

Example 3.10 Consider a group of 5 people: Harry, Bella, Frodo, Anakin, Katniss. Suppose each of their names is written on a slip of paper and the 5 slips of paper are placed into a hat. The papers are mixed up and 2 are pulled out, one after the other without replacement.

What is the probability that Harry is the first name selected?
What is the probability that Harry is the second name selected?
If you were asked question (2) before question (1), would your answer change? Should it?
If Bella is the first name selected, what is the probability that Harry is the second name selected?
If Harry is the first name selected, what is the probability that Harry is the second name selected?
How is the probability that Harry is the second name selected related to the probabilities in the two previous parts?
If Bella is the second name selected, what is the probability that Harry was the first name selected?

Solution. to Example 3.10

Show/hide solution

The probability that Harry is the first name selected is 1/5, which is an answer we think most people would agree with. There are 5 names which are equally likely to be the first one selected, 1 of which is Harry.
The probability that Harry is the second name selected is also 1/5. Many people might answer this as 1/4, since after selecting the first person there are now 4 names left. But we show and discuss below that the unconditional probability is 1/5.
Your answer to question (2) certainly shouldn’t change depending on whether we ask question (1) first. But perhaps after seeing question (1) you are implicitly assuming that Harry has not been selected first? But there is nothing in question (2) that gives you any information about what happened on the first card.
If Bella is the first name selected, the probability that Harry is the second name selected is 1/4. We think most people find this intuitive. If Bella is first, there are 4 cards remaining, equally likely to be the next card, of which 1 is Harry.
If Harry is the first name selected, the probability that Harry is the second name selected is 0 since the cards are drawn without replacement.
The probabilities in the two previous parts are conditional probabilities. The probability in (2) is an unconditional probability. By the law of total probability, we know that the unconditional probability that Harry is the second name selected is the weighted average of the two conditional probabilities from the previous parts. Let $A$ be the event that Harry is first, $B$ be the event that Harry is second. So $\textrm{P}(A) = 1/5$ , $\textrm{P}(B|A) = 0$ , $\textrm{P}(B|A^c) = 1/4$ , and $\textrm{P}(B) = \textrm{P}(B|A)\textrm{P}(A) + \textrm{P}(B|A^c)\textrm{P}(A^c) = (0)(1/5) + (1/4)(4/5) = 1/5$ Claiming that $\textrm{P}(B)$ is 1/4 ignores the outcomes in which Harry is the first name selected.
If Bella is the second name selected, the probability that Harry was the first name selected is 1/4. It doesn’t really matter what is “first” and what is “second”, but rather the information conveyed. In (4), what’s important is that you know that one of the cards selected was Bella, so the probability that the other card selected is Harry is 1/4. But this part conveys the same information

Here is a two-way table of 1000 hypothetical draws; note that Harry is second in 200 of them.

	Harry first	Harry not first	Total
Harry second	0	200	200
Harry not second	200	600	800
Total	200	800	1000

Be careful to distinguish between conditional and unconditional probabilities. A conditional probability reflects “new” information about the outcome of the random phenomenon. In the absence of such information, we must continue to account for all the possibilities. When computing probabilities, be sure to only reflect information that is known. Especially when considering a phenomenon that happens in stages, don’t assume that when considering “what happens second” that you know what happened first.

In the example above, imagine shuffling the five cards and putting two on a table face down. Now point to one of the cards and ask “what is the probability that THIS card is Harry?” Well, all you know is that this card is one of the five cards, each of the 5 cards is equally likely to be the one you’re pointing to, and only one of the cards is Harry. Should it matter whether the face down card you’re pointing to was the first or second card you laid on the table? No, the probability that THIS card is Harry should be 1/5, regardless of whether you put it down first or second.

Now turn over the other card that you’re not pointing to, and see what name is on it. The probability that the card you’re pointing to is Harry has now changed, because you have some information about the outcome of the shuffle. If the card you turned over says Harry, you know the probability that the card you’re pointing to is Harry is 0. If the card you turned over is not Harry, then you know that the probability that the card you’re pointing to is Harry is 1/4. It is not “first” or “second” that matters; it is whether or not you have obtained new information by revealing one of the cards.

Another way of asking the question is: Shuffle the five cards; what is the probability that Harry is the second card from the top? Without knowing any information about the result of the shuffle, all you know is that Harry should be equally likely to be in any one of the 5 positions, so the probability that he is the second card from the top should be 1/5. It is only after revealing information about the result of the shuffle, say the top card, that the probability that Harry is in the second position changes.