24  Conditional Expected Value

Example 24.1 Roll a fair four-sided die twice. Let \(X\) be the sum of the two rolls, and let \(Y\) be the larger of the two rolls (or the common value if a tie).

\(p_{X, Y}(x, y)\)
\(x\) \ \(y\) 1 2 3 4 \(p_{X}(x)\)
2 1/16 0 0 0 1/16
3 0 2/16 0 0 2/16
4 0 1/16 2/16 0 3/16
5 0 0 2/16 2/16 4/16
6 0 0 1/16 2/16 3/16
7 0 0 0 2/16 2/16
8 0 0 0 1/16 1/16
\(p_Y(y)\) 1/16 3/16 5/16 7/16
  1. Find \(\text{E}(X)\). How could you find a simulation-based approximation?




  2. Find \(\text{E}(X|Y = 4)\). How could you find a simulation-based approximation?




  3. Find \(\text{E}(X|Y = y)\) for each possible value \(y\) of \(Y\).




  4. Find \(\text{E}(Y|X=6)\). How could you find a simulation-based approximation?




Example 24.2 Recall Example 23.2. Consider two spins of the Uniform(1, 4) spinner and let \(X\) be the sum of the two spins and \(Y\) the larger to the two spins (or the common value if a tie).

  1. Find \(\text{E}(X | Y = 3)\).




  2. Find \(\text{E}(X | Y = y)\) for each value \(y\) of \(Y\).




  3. Find \(\text{E}(Y | X = 3.5)\).




  4. Find \(\text{E}(Y | X = 6)\).




  5. Find \(\text{E}(Y | X = x)\) for each value \(x\) of \(X\).




24.1 Conditional expected value as a random variable

Example 24.3 Continuing Example 24.1. Let \(X\) be the sum of two rolls of a fair four-sided die, and let \(Y\) be the larger of the two rolls (or the common value if a tie).

Let \(\text{E}(X|Y)\) denote the random variable that takes value \(\text{E}(X|Y=y)\) when \(Y=y\). Find the distribution of \(\text{E}(X|Y)\).




  • The conditional expected value of \(Y\) given \(X\) is the random variable, denoted \(\text{E}(Y|X)\), which takes value \(\text{E}(Y|X=x)\) on the occurrence of the event \(\{X=x\}\). The random variable \(\text{E}(Y|X)\) is a function of \(X\).
  • For a given value \(x\) of \(X\), \(\text{E}(Y|X=x)\) is a number. Let \(\ell\) denote the function which maps \(x\) to the number \(\ell(x)=\text{E}(Y|X=x)\). The random variable \(\text{E}(Y|X)\) is a function of \(X\), namely \(\text{E}(Y|X)=\ell(X)\).
  • Roughly, \(\text{E}(Y|X)\) can be thought of as the “best guess” of the value of \(Y\) given only the information available from \(X\).
  • Since \(\text{E}(Y|X)\) is a random variable, it has a distribution. And since \(\text{E}(Y|X)\) is a function of \(X\), the distribution of \(X\) will be depend on the distribution of \(X\). However, remember that a transformation generally changes the shape of a distribution, so the distribution of \(\text{E}(Y|X)\) will usually have a different shape than that of \(X\).

Example 24.4 Continuing Example 24.2. Consider two spins of the Uniform(1, 4) spinner and let \(X\) be the sum of the two spins and \(Y\) the larger to the two spins (or the common value if a tie).

  1. Find an expression for \(\text{E}(X | Y)\).




  2. Find an expression for \(\text{E}(Y | X)\).




24.2 Law of total expectation

Example 24.5 Continuing Example 24.3. Let \(X\) be the sum of two rolls of a fair four-sided die, and let \(Y\) be the larger of the two rolls (or the common value if a tie).

Find the expected value of the random variable \(\text{E}(X|Y)\). That is, find \(\text{E}(\text{E}(X|Y))\). How does it relate to \(\text{E}(X)\)?




  • Law of total expectation. For any two random variables \(X\) and \(Y\) (defined on the same probability space) \[ \text{E}(Y) = \text{E}(\text{E}(Y|X)) \]
  • Analogous to the law of total probability, the law of total expectation provides a way of computing an expected value by breaking down a problem into various cases, computing the conditional expected value given each case, and then computing the overall expected value as a probability-weighted average of these case-by-case conditional expected values.

Example 24.6 Recall Example 11.4. You and your friend are playing the “lookaway challenge”. The game consists of possibly multiple rounds. In the first round, you point in one of four directions: up, down, left or right. At the exact same time, your friend also looks in one of those four directions. If your friend looks in the same direction you’re pointing, you win! Otherwise, you switch roles and the game continues to the next round — now your friend points in a direction and you try to look away. (So the player who starts as the pointer is the pointer in the odd-numbered rounds, and the player who starts as the looker is the pointer in the even-numbered rounds, until the game ends.) As long as no one wins, you keep switching off who points and who looks. The game ends, and the current “pointer” wins, whenever the “looker” looks in the same direction as the pointer.

We saw in Example 11.4 that the probability that the player who starts as the pointer wins the game is 4/7 = 0.571.

  1. Compute and interpret the expected number of rounds in a game.




  2. Compute and interpret the conditional expected number of rounds in a game given that the player who is the pointer in the first round wins the game. (Do you think this is greater than, less than, or equal to the value from the previous part?)




  3. Compute and interpret the conditional expected number of rounds in a game given that the player who is the looker in the first round wins the game.




Example 24.7 Suppose you construct a “random rectangle” as follows. The base \(X\) is a random variable with a Uniform(0, 1) distribution. The height \(Y\) is a random variable whose conditional distribution given \(X=x\) is Uniform(0, \(x\)). We are interested in \(\text{E}(Y)\) the expected value of the height of the rectangle.

  1. Find \(\text{E}(Y|X=0.5)\).




  2. Find \(\text{E}(Y|X=0.2)\).




  3. Find \(\text{E}(Y|X=x)\) for a generic \(x\in(0, 1)\).




  4. Identify the random variable \(\text{E}(Y|X)\).




  5. Use LTE to find \(\text{E}(Y)\).




  6. Sketch a plot of the joint distribution of \((X, Y)\).




  7. Sketch a plot of the marginal distribution of \(Y\). Be sure to specify the possible values. Is it Uniform?




  8. What would you need to do to find \(\text{E}(Y)\) using the definition of expected value?




24.3 Taking out what is known

Example 24.8 Continuing Example 24.7. Suppose you construct a “random rectangle” as follows. The base \(X\) is a random variable with a Uniform(0, 1) distribution. The height \(Y\) is a random variable whose conditional distribution given \(X=x\) is Uniform(0, \(x\)). We are interested in \(\text{E}(XY)\) the expected value of the area of the rectangle.

  1. Explain how you could use simulation to approximate \(\text{E}(XY)\).




  2. Find \(\text{E}(XY|X=0.5)\).




  3. Find \(\text{E}(XY|X=0.2)\).




  4. Find \(\text{E}(XY|X=x)\) for a generic \(x\in(0, 1)\). How does \(\text{E}(XY|X=x)\) relate to \(\text{E}(Y|X=x)\)?




  5. Identify the random variable \(\text{E}(XY|X)\). How does \(\text{E}(XY|X)\) relate to \(\text{E}(Y|X)\)?




  6. Use LTE to find \(\text{E}(XY)\).




  7. Find \(\text{Cov}(X, Y)\). Does the sign of the covariance make sense?




  • “Taking out what is known (TOWIK)” \[ \text{E}(g(X)Y|X) = g(X)\text{E}(Y|X) \]
  • In particular, \(\text{E}(XY|X) = X\text{E}(Y|X)\), \(\text{E}(X|X)=X\), and \(\text{E}(g(X)|X)=g(X)\).
  • Intuitively, when we condition on \(X\) we treat it as though its value is known, so it behaves like a non-random constant. For example, \(\text{E}(XY|X)=X\text{E}(Y|X)\) is the conditional, random variable analog of the unconditional, numerical relationship \(\text{E}(cY) = c\text{E}(Y)\) where \(c\) is a constant. But note that TOWIK is a relationship between random variables.