15 Joint Normal Distributions
- Jointly continuous random variables
and have a Bivariate Normal distribution with parameters , , , , and if the joint pdf is { } - If the pair
has a BivariateNormal( , , , , ) distribution
- A Bivariate Normal Density has elliptical contours. For each height
the set is an ellipse. The density decreases as moves away from , most steeply along the minor axis of the ellipse, and least steeply along the major of the ellipse. - A scatterplot of
pairs generated from a Bivariate Normal distribution will have a rough linear association and the cloud of points will resemble an ellipse. - If
and have a Bivariate Normal distribution, then the marginal distributions are also Normal: has a Normal distribution and has a Normal . - If
and have a Bivariate Normal distribution and then and are independent. (Remember, in general it is possible to have situations where the correlation is 0 but the random variables are not independent.) and have a Bivariate Normal distribution if and only if every linear combination of and has a Normal distribution. That is, and have a Bivariate Normal distribution if and only if has a Normal distribution for all , , .
- If
and have a Bivariate Normal distribution then any conditional distribution is Normal. The conditional distribution of given is - The conditional expected value of
given is a linear function of , called the regression line of on :- The regression line passes through the point of means
and has slope - The regression line estimates that if the given
value is SDs above the mean of , then the corresponding values will be, on average, SDs away from the mean of - Since
, for a given value the corresponding values will be, on average, relatively closer to the mean of than the given value is to the mean of . This is known as regression to the mean.
- The regression line passes through the point of means
- For Bivariate Normal distributions, the conditional variance of
given does not depend on :
Example 15.1 Suppose that SAT Math (
- Identify the distribution of Math scores. Find the probability that a student has a Math score above 700.
- Compute and interpret
.
- Compute and interpret
.
- Identify the conditional distribution of Math scores given the Reading score is 700. Find the probability that a student has a higher Math than Reading score if the student scores 700 on Reading.
- Compute and interpret
.
- Compute and interpret
.
- Identify the conditional distribution of Math scores given the Reading score is 550. Find the probability that a student has a higher Math than Reading score if the student scores 550 on Reading.
- Describe how you could simulate a single
pair.
- Find and interpret
.
= 10000
N_rep
= rnorm(N_rep, 610, 70)
R = rnorm(N_rep, 640 + 0.7 * 80 * (R - 610) / 70, 80 * sqrt(1 - 0.7 ^ 2))
M
plot(R, M)
ggplot(data.frame(R, M), aes(x = R, y = M)) +
stat_density_2d(aes(fill = ..level..), geom = "polygon", colour="white")
Warning: The dot-dot notation (`..level..`) was deprecated in ggplot2 3.4.0.
ℹ Please use `after_stat(level)` instead.
cor(R, M)
[1] 0.6973201
mean(R)
[1] 609.4482
sd(R)
[1] 70.43551
mean(M)
[1] 639.8739
sd(M)
[1] 80.14485
- If the pair
has a joint Normal distribution then each of and has a Normal distribution. - But the converse is not true. That is, if each of
and has a Normal distribution, it is not necessarily true that the pair has a joint Normal distribution - However, if
and are independent and each of and has a Normal distribution, then the pair has a joint Normal distribution. (But joint Normal is much more general that two independent Normal random variables.)