14  Conditional Distributions

Example 14.1

Roll a fair four-sided die twice. Let \(X\) be the sum of the two rolls, and let \(Y\) be the larger of the two rolls (or the common value if a tie). We have previously found the joint and marginal distributions of \(X\) and \(Y\), displayed in the two-way table below.

\(p_{X, Y}(x, y)\)
\(x\) \ \(y\) 1 2 3 4 \(p_{X}(x)\)
2 1/16 0 0 0 1/16
3 0 2/16 0 0 2/16
4 0 1/16 2/16 0 3/16
5 0 0 2/16 2/16 4/16
6 0 0 1/16 2/16 3/16
7 0 0 0 2/16 2/16
8 0 0 0 1/16 1/16
\(p_Y(y)\) 1/16 3/16 5/16 7/16
  1. Compute \(p_{X|Y}(6|4) = \text{P}(X=6|Y=4)\).




  2. Construct a table, plot, and spinner to represent the conditional distribution of \(X\) given \(Y=4\).




  3. Construct a table, plot, and spinner to represent the conditional distribution of \(X\) given \(Y=3\).




  4. Construct a table, plot, and spinner to represent the conditional distribution of \(X\) given \(Y=2\).




  5. Construct a table, plot, and spinner to represent the conditional distribution of \(X\) given \(Y=1\).




  6. Compute \(p_{Y|X}(4|6) = \text{P}(Y=4|X=6)\).




  7. Construct a table, plot, and spinner to represent the distribution of \(Y\) given \(X=6\).




  8. Construct a table, plot, and spinner to represent the distribution of \(Y\) given \(X=5\).




  9. Construct a table, plot, and spinner to represent the distribution of \(Y\) given \(X=4\).




Warning: The labeller API has been updated. Labellers taking `variable` and
`value` arguments are now deprecated. See labellers documentation.

Example 14.2

We have already discussed two ways for simulating an \((X, Y)\) pair in the dice rolling example: simulate a pair of rolls and measure \(X\) (sum) and \(Y\) (max), or spin the joint distribution spinner for \((X, Y)\) once.

  1. Now describe another way for simulating an \((X, Y)\) pair using the spinners in Example 14.1. (Hint: you’ll need one more spinner in addition to the four from the previous example.)




  2. Describe in detail how you can simulate \((X, Y)\) pairs and use the results to approximate \(\text{P}(X = 6 | Y = 4)\).




  3. Describe in detail how you can simulate \((X, Y)\) pairs and use the results to approximate the conditional distribution of \(X\) given \(Y = 4\).




  4. Describe in detail how you can simulate values from the conditional distribution of \(X\) given \(Y=4\) without simulating \((X, Y)\) pairs.




(ref:cap-dice-mosaic) Mosaic plots for Example @ref(exm:dice-conditional), where \(X\) is the sum and \(Y\) is the max of two rolls of a fair four-sided die. The plot on the left represents conditioning on values of the sum \(X\); color represents values of \(Y\). The plot on the right represents conditioning on values of the max \(Y\); color represents values of \(X\).

N_rep = 16000

# first roll 
u1 = sample(1:4, size = N_rep, replace = TRUE)

# second roll
u2 = sample(1:4, size = N_rep, replace = TRUE)

# sum
x = u1 + u2

# max
y = pmax(u1, u2)
dice_sim = data.frame(1:N_rep, u1, u2, x, y)

dice_sim |>
  head() |>
  kbl(col.names = c("Repetition", "First roll", "Second roll", "X (sum)", "Y (max)")) |>
  kable_styling(fixed_thead = TRUE) |>
    row_spec(which(head(y) == 4), bold = TRUE, color = "white", background = "#FFA500")
Repetition First roll Second roll X (sum) Y (max)
1 1 2 3 2
2 2 4 6 4
3 1 3 4 3
4 4 2 6 4
5 4 3 7 4
6 2 1 3 2
# Joint distribution: counts
table(x, y)
   y
x      1    2    3    4
  2 1018    0    0    0
  3    0 2025    0    0
  4    0  990 1937    0
  5    0    0 2040 2005
  6    0    0  942 2056
  7    0    0    0 1944
  8    0    0    0 1043
# Joint distribution: proportions
table(x, y) / N_rep
   y
x           1         2         3         4
  2 0.0636250 0.0000000 0.0000000 0.0000000
  3 0.0000000 0.1265625 0.0000000 0.0000000
  4 0.0000000 0.0618750 0.1210625 0.0000000
  5 0.0000000 0.0000000 0.1275000 0.1253125
  6 0.0000000 0.0000000 0.0588750 0.1285000
  7 0.0000000 0.0000000 0.0000000 0.1215000
  8 0.0000000 0.0000000 0.0000000 0.0651875
# Conditional distribution of X given Y = 4: counts
table(x[y == 4])

   5    6    7    8 
2005 2056 1944 1043 
# Conditional distribution of X given Y = 4: proportions
table(x[y == 4]) / sum(y == 4)

        5         6         7         8 
0.2844779 0.2917140 0.2758229 0.1479852 
ggplot(dice_sim) +
  geom_mosaic(aes(x = product(x, y),
                  fill = x),
              offset = 0) +
  scale_fill_viridis(discrete = TRUE) +
  theme_mosaic() +
  theme(axis.text.y=element_blank())
Warning: `unite_()` was deprecated in tidyr 1.2.0.
Please use `unite()` instead.
This warning is displayed once every 8 hours.
Call `lifecycle::last_lifecycle_warnings()` to see where this warning was generated.
ggplot(dice_sim) +
  geom_mosaic(aes(x = product(y, x),
                  fill = y),
              offset = 0) +
  scale_fill_viridis(discrete = TRUE) +
  theme_mosaic() +
  theme(axis.text.y=element_blank())

14.1 Conditional Expected Value

Example 14.3

Roll a fair four-sided die twice. Let \(X\) be the sum of the two rolls, and let \(Y\) be the larger of the two rolls (or the common value if a tie).

\(p_{X, Y}(x, y)\)
\(x\) \ \(y\) 1 2 3 4 \(p_{X}(x)\)
2 1/16 0 0 0 1/16
3 0 2/16 0 0 2/16
4 0 1/16 2/16 0 3/16
5 0 0 2/16 2/16 4/16
6 0 0 1/16 2/16 3/16
7 0 0 0 2/16 2/16
8 0 0 0 1/16 1/16
\(p_Y(y)\) 1/16 3/16 5/16 7/16
  1. Compute and interpret \(\text{E}(Y)\). How could you find a simulation-based approximation?


  2. We have seen that the long run average value of \(Y\) is 3.125. Would you expect the conditional long run average value of \(Y\) given \(X = 8\) to be greater than, less than, or equal to 3.125? Explain without doing any calculations. What about given \(Y = 3\)?




  3. How could you use simulation to approximate the conditional long run average value of \(Y\) given \(X = 6\)?




  4. Compute and interpret \(\text{E}(Y|X=6)\).


  5. Find \(\text{E}(Y|X=x)\) for each possible value of \(x\) of \(X\).




  6. Compute and interpret \(\text{E}(X|Y = 4)\). How could you find a simulation-based approximation?


  7. Find \(\text{E}(X|Y = y)\) for each possible value \(y\) of \(Y\).




  • The conditional expected value (a.k.a. conditional expectation a.k.a. conditional mean), of a random variable \(Y\) given the event \(\{X=x\}\), defined on a probability space with measure \(\text{P}\), is a number denoted \(\text{E}(Y|X=x)\) representing the probability-weighted average value of \(Y\), where the weights are determined by the conditional distribution of \(Y\) given \(X=x\). \[\begin{align*} & \text{Discrete $X, Y$ with conditional pmf $p_{Y|X}$:} & \text{E}(Y|X=x) & = \sum_y y p_{Y|X}(y|x)\\ \end{align*}\]
  • Remember, when conditioning on \(X=x\), \(x\) is treated as a fixed constant. The conditional expected value \(\text{E}(Y | X=x)\) is a number representing the mean of the conditional distribution of \(Y\) given \(X=x\).
  • The conditional expected value \(\text{E}(Y | X=x)\) is the long run average value of \(Y\) over only those outcomes for which \(X=x\).
  • To approximate \(\text{E}(Y|X = x)\), simulate many \((X, Y)\) pairs, discard the pairs for which \(X\neq x\), and average the \(Y\) values for the pairs that remain.
# Approximate E(Y)
mean(y)
[1] 3.124812
# Approximate E(Y| X = 6)

mean(y[x == 6])
[1] 3.685791