15  Joint Normal Distributions

\[ N\left(\mu_Y + \frac{\rho\sigma_Y}{\sigma_X}\left(x-\mu_X\right),\;\sigma_Y\sqrt{1-\rho^2}\right) \]

Example 15.1

Suppose that SAT Math (\(M\)) and Reading (\(R\)) scores of CalPoly students have a Bivariate Normal distribution. Math scores have mean 640 and SD 80, Reading scores have mean 610 and SD 70, and the correlation between scores is 0.7.

  1. Identify the distribution of Math scores. Find the probability that a student has a Math score above 700.




  2. Compute and interpret \(\text{E}(M|R = 700)\).




  3. Compute and interpret \(\text{SD}(M|R = 700)\).




  4. Identify the conditional distribution of Math scores given the Reading score is 700. Find the probability that a student has a higher Math than Reading score if the student scores 700 on Reading.




  5. Compute and interpret \(\text{E}(M|R = 550)\).




  6. Compute and interpret \(\text{SD}(M|R = 550)\).




  7. Identify the conditional distribution of Math scores given the Reading score is 550. Find the probability that a student has a higher Math than Reading score if the student scores 550 on Reading.




  8. Describe how you could simulate a single \((M, R)\) pair.




  9. Find and interpret \(\text{E}(M|R)\).




N_rep = 10000

R = rnorm(N_rep, 610, 70)
M = rnorm(N_rep, 640 + 0.7 * 80 * (R - 610) / 70, 80 * sqrt(1 - 0.7 ^ 2))

plot(R, M)

ggplot(data.frame(R, M), aes(x = R, y = M)) +
  stat_density_2d(aes(fill = ..level..), geom = "polygon", colour="white")

cor(R, M)
[1] 0.6867692
mean(R)
[1] 609.8806
sd(R)
[1] 69.73702
mean(M)
[1] 639.406
sd(M)
[1] 78.31964