8 Averages and Standard Deviation
8.1 Averages
- One summary characteristic of a distribution is the long run average value of the random variable.
- We can approximate the long run average value by simulating many values of the random variable and computing the average (mean) in the usual way.
Example 8.1
Let
Example 8.2
Donny Don’t says: “Why bother creating columns for
- In general the order of transforming and averaging is not interchangeable.
- Whether in the short run or the long run, in general
Example 8.3
Recall your tactile simulation from Example Example 5.5. Let
- Donny Don’t says: “
, so I can find the average value of by finding the average value of , the average value of , and adding the two averages”. Do you agree? Explain.
- Donny Don’t says: “
and have the same distribution, so they have the same average value, so I can find the average value of by multiplying the average value of by 2”. Do you agree? Explain.
- Donny Don’t says: “
and have the same distribution, so has the same distribution as ”. Do you agree? Explain.
- In general the order of transforming and averaging is not interchangeable.
- However, the order is interchangeable for linear transformations.
- Linearity of averages.: If
and are random variables and and are non-random constants, whether in the short run or the long run, - The average of the sum of
and is the sum of the average of and the average of regardless of the relationship between and .
8.2 Standard deviation
- The long run average value is just one feature of a distribution.
- Random variables vary, and the distribution describes the entire pattern of variability.
- We can summarize this degree of variability by listing some percentiles (10th, 25th, 50th, 75th, 90th, etc); the more percentiles we provide the clearer the picture, but the less of a summary.
- Roughly, standard deviation measures overall degree of variability in ai single number, as the average distance from the mean.
- Technically, the variance is the long run average squared distance from the mean, and the standard deviation is the square root of the variance.
Example 8.4
We’ll compare long run average and standard deviation for the Uniform(0, 60) distribution and the Normal(30, 10) distribution. A Normal(30, 10) distribution mean (long run average) 30 and standard deviation 10.
- Make an educated guess for the long run average value of a Uniform(0, 60) distribution. Explain.
- Will the standard deviation for a Uniform(0, 60) distribution be greater than, less than, or equal to 10, the standard deviation for a Normal(30, 10) distribution? Explain without doing any calculations.
- Make an educated guess for the standard deviation of a Uniform(0, 60) distribution.
- Standard deviation provides a “ruler” by which we can judge a particular realized value of a random variable relative to the distribution of values.
- Standardization measures values in terms of “standard deviations away from the mean”
- This idea is particularly useful when comparing random variables with different measurement units but whose distributions have similar shapes.
Example 8.5
SAT scores have, approximately, a Normal distribution with a mean of 1050 and a standard deviation of 200. ACT scores have, approximately, a Normal distribution with a mean of 21 and a standard deviation of 5.5. Darius’s score on the SAT is 1500. Alfred’s score on the ACT is 31. Who scored relatively better on their test?
- Compute the deviation from the mean for Darius’s SAT score. How does this compare to the average deviation from the mean for SAT scores?
- Compute the deviation from the mean for Alfred’s ACT score. How does this compare to the average deviation from the mean for ACT scores?
- Who scored relatively better on their test?
- Any Normal distribution follows the “empirical rule” which determines the percentiles that give a Normal distribution its particular bell shape.
- For a Normal distribution:
- 68% of values are within 1 standard deviation of the mean
- 95% of values are within 2 standard deviations of the mean
- 99.7% of values are within 3 standard deviations of the mean
- The table below lists some percentiles of a Normal distribution.
- The “standard” Normal distribution is a Normal(0, 1) distribution, with a mean 0 and a standard deviation of 1.
Percentile | SDs away from the mean |
---|---|
0.1% | 3.09 SDs below the mean |
0.5% | 2.58 SDs below the mean |
1% | 2.33 SDs below the mean |
2.5% | 1.96 SDs below the mean |
10% | 1.28 SDs below the mean |
15.9% | 1 SDs below the mean |
25% | 0.67 SDs below the mean |
30.9% | 0.5 SDs below the mean |
50% | 0 SDs above the mean |
69.1% | 0.5 SDs above the mean |
75% | 0.67 SDs above the mean |
84.1% | 1 SDs above the mean |
90% | 1.28 SDs above the mean |
97.5% | 1.96 SDs above the mean |
99% | 2.33 SDs above the mean |
99.5% | 2.58 SDs above the mean |
99.9% | 3.09 SDs above the mean |