16Expected Values of Linear Combinations of Random Variables
16.1 Linear rescaling
If is a random variable and are non-random constants then
16.2 Linearity of expected value
Example 16.1 Refer to the tables and plots in Example 5.29 here. Each scenario contains SAT Math () and Reading () scores for 10 hypothetical students, along with the total score () and the difference between the Math and Reading scores (, negative values indicate lower Math than Reading scores). Note that the 10 values are the same in each scenario, and the 10 values are the same in each scenario, but the values are paired in different ways: the correlation is 0.78 in scenario 1, -0.04 (essentially 0) in scenario 2, and -0.94 in scenario 3.
What is the mean of in each scenario? How does it relate to the means of and ? Does the correlation affect the mean of ?
What is the mean of in each scenario? How does it relate to the means of and ? Does the correlation affect the mean of ?
Linearity of expected value. For any two random variables and ,
That is, the expected value of the sum is the sum of expected values, regardless of how the random variables are related.
Therefore, you only need to know the marginal distributions of and to find the expected value of their sum. (But keep in mind that the distribution of will depend on the joint distribution of and .)
Whether in the short run or the long run, regardless of the joint distribution of and .
A linear combination of two random variables and is of the form where and are non-random constants. Combining properties of linear rescaling with linearity of expected value yields the expected value of a linear combination.
Linearity of expected value extends naturally to more than two random variables.
16.3 Variance of linear combinations of random variables
In which of the three scenarios is the largest? Can you explain why?
In which of the three scenarios is the smallest? Can you explain why?
In which scenario is roughly equal to the sum of and ?
In which of the three scenarios is the largest? Can you explain why?
In which of the three scenarios is the smallest? Can you explain why?
In which scenario is roughly equal to the sum of and ?
Variance of sums and differences of random variables.
Example 16.3 Assume that SAT Math () and Reading () scores follow a Bivariate Normal distribution, Math scores have mean 527 and standard deviation 107, and Reading scores have mean 533 and standard deviation 100. Compute and for each of the following correlations.
Example 16.4 Continuing the previous example. Compute and for each of the following correlations.
The variance of the sum is the sum of the variances if and only if and are uncorrelated.
The variance of the difference of uncorrelated random variables is the sum of the variances
If are non-random constants and and are random variables then
Example 16.5 Suppose that SAT Math () and Reading () scores of CalPoly students have a Bivariate Normal distribution. Math scores have mean 640 and SD 80, Reading scores have mean 610 and SD 70, and the correlation between scores is 0.7.
Find the probability that a student has a total score above 1500.
Find the probability that a student has a higher Math than Reading score.
and have a Bivariate Normal distribution if and only if every linear combination of and has a Normal distribution. That is, and have a Bivariate Normal distribution if and only if has a Normal distribution for all , , .
In particular, if and are independent and each has a Normal distribution then has a Normal distribution.