8  Variance and Standard Deviation

Example 8.1

A roulette wheel has 18 black spaces, 18 red spaces, and 2 green spaces, all the same size and each with a different number on it. Guillermo bets $1 on black. If the wheel lands on black, Guillermo wins his bet back plus an additional $1; otherwise he loses the money he bet. Let \(W\) be Guillermo’s net winnings (net of the initial bet of $1).

  1. Find the distribution of \(W\).




  2. Compute \(\text{E}(W)\).




  3. Interpret \(\text{E}(W)\) in context.




  4. An expected profit for the casino of 5 cents per $1 bet seems small. Explain how casinos can turn such a small profit into billions of dollars.




  5. The random variable \((W-\text{E}(W))^2\) represents the squared deviation from the mean. Find the distribution of this random variable and its expected value.




  6. Standard deviation is the square root of the variance. Why would we want to take the square root of the variance? Compute and interpret the standard deviation of \(W\).




  7. Compute \(\text{E}(W^2)\). (For this \(W\) you should be able to compute \(\text{E}(W^2)\) without any calculations; why?) Then compute \(\text{E}(W^2) - (\text{E}(W))^2\); what do you notice?




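library(kableExtra)  # provides kbl() and kable_styling()

# Simulate 10,000 $1 bets on black: lose $1 with probability 20/38, win $1 with probability 18/38
x = sample(c(-1, 1), size = 10000, prob = c(20, 18) / 38, replace = TRUE)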

data.frame(x,
           x - mean(x),
           (x - mean(x)) ^ 2) |>
  head() |>
  kbl(col.names = c("Value", "Deviation from mean", "Squared deviation")) |>
  kable_styling(fixed_thead = TRUE)
Value | Deviation from mean | Squared deviation
-1 | -0.9436 | 0.890381
-1 | -0.9436 | 0.890381
-1 | -0.9436 | 0.890381
-1 | -0.9436 | 0.890381
1 | 1.0564 | 1.115981
-1 | -0.9436 | 0.890381
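# Simulated average net winnings per bet
mean(x)
[1] -0.0564
# Average squared deviation from the sample mean (divisor n)
mean((x - mean(x)) ^ 2)
[1] 0.996819
# var() uses the divisor n - 1, so it is slightly larger than the average above
var(x)
[1] 0.9969187
# Standard deviation is the square root of the variance; sd() computes it directly
sqrt(var(x))
[1] 0.9984582
sd(x)
[1] 0.9984582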
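
The simulated values above can be compared with exact values computed directly from the distribution of \(W\). The following is a minimal sketch, not part of the original handout; the names `w`, `p_w`, `ev_w`, `var_w`, `var_w_shortcut`, and `sd_w` are introduced here for illustration only.

```r
# Possible values of W and their probabilities (20 non-black spaces, 18 black spaces)
w = c(-1, 1)
p_w = c(20, 18) / 38

# Expected value: probability-weighted average of the possible values
ev_w = sum(w * p_w)

# Variance: probability-weighted average of squared deviations from E(W)
var_w = sum((w - ev_w) ^ 2 * p_w)

# Shortcut formula: E(W^2) - (E(W))^2
var_w_shortcut = sum(w ^ 2 * p_w) - ev_w ^ 2

# Standard deviation: square root of the variance, measured in dollars
sd_w = sqrt(var_w)

c(ev_w, var_w, var_w_shortcut, sd_w)
```

With 10,000 simulated bets, the sample mean, variance, and standard deviation should be close to these exact values, but not exactly equal to them.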

Example 8.2

Continuing with roulette, Nadja bets $1 on number 7. If the wheel lands on 7, Nadja wins her bet back plus an additional $35; otherwise she loses the money she bet. Let \(X\) be Nadja’s net winnings (net of the initial bet of $1).

  1. Find the distribution of \(X\).




  2. Compute \(\text{E}(X)\).




  3. How do the expected values of the two $1 bets — bet on black versus bet on 7 — compare? Explain what this means.




  4. Are the two $1 bets — bet on black versus bet on 7 — identical? If not, explain why not.




  5. Before doing any calculations, determine if \(\text{SD}(X)\) is greater than, less than, or equal to \(\text{SD}(W)\). Explain.




  6. Compute \(\text{Var}(X)\) and \(\text{SD}(X)\).




  7. Which $1 bet — betting on black or betting on 7 — is “riskier”? How is this reflected in the standard deviations?
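
The simulation used for the bet on black can be adapted to the bet on 7 so that the two bets can be compared side by side. This is a sketch rather than part of the original handout: the names `n_sims`, `w_sim`, and `x_sim` are illustrative, and the seed is arbitrary and included only for reproducibility.

```r
set.seed(8)  # arbitrary seed, for reproducibility only

n_sims = 10000

# Bet on black: lose $1 (20 of 38 spaces) or win $1 (18 of 38 spaces)
w_sim = sample(c(-1, 1), size = n_sims, prob = c(20, 18) / 38, replace = TRUE)

# Bet on 7: lose $1 (37 of 38 spaces) or win $35 (1 of 38 spaces)
x_sim = sample(c(-1, 35), size = n_sims, prob = c(37, 1) / 38, replace = TRUE)

# Compare simulated averages and standard deviations of the two bets
data.frame(bet = c("black", "7"),
           mean = c(mean(w_sim), mean(x_sim)),
           sd = c(sd(w_sim), sd(x_sim)))
```

Because wins on 7 are rare, the simulated mean for that bet can vary noticeably from run to run; increasing `n_sims` should make the comparison of the two averages more stable.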




Example 8.3

The plots in Figure 8.1 summarize hypothetical distributions of quiz scores in six classes. All plots are on the same scale. Each quiz score is a whole number between 0 and 10 inclusive.

  1. Donny Dont says that C represents the smallest SD, since there is no variability in the heights of the bars. Do you agree that C represents “no variability”? Explain.




  2. What is the smallest possible value the SD of quiz scores could be? What would need to be true about the distribution for this to happen? (This scenario might not be represented by one of the plots.)




  3. Without doing any calculations, arrange the classes in order based on their SDs from smallest to largest.





  4. In one of the classes, the SD of quiz scores is 5. Which one? Why?




  5. Is the SD in F greater than, less than, or equal to 1? Why?




  6. Provide a ballpark estimate of SD in each case.




8.1 Standardization

  • Standardization measures values in terms of “standard deviations away from the mean”.
  • This idea is particularly useful when comparing random variables with different measurement units but whose distributions have similar shapes. \[ \text{Standardized value} = \frac{\text{Value} - \text{Mean}}{\text{Standard deviation}} \]
  • If \(X\) is a random variable with expected value \(\text{E}(X)\) and standard deviation \(\text{SD}(X)\), then the standardized random variable is \[ Z = \frac{X - \text{E}(X)}{\text{SD}(X)} \]
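
As a brief illustration of the formula, the sketch below defines a small helper and applies it to made-up numbers. The function name `standardize`, its arguments `mu` and `sigma`, and the values in `scores` are hypothetical and do not come from the handout.

```r
# Hypothetical helper: number of standard deviations a value lies from the mean
standardize = function(value, mu, sigma) {
  (value - mu) / sigma
}

# A value of 75 from a distribution with mean 70 and SD 10
# is 0.5 standard deviations above the mean
standardize(75, mu = 70, sigma = 10)

# Standardizing a sample using its own mean and SD
scores = c(4, 6, 7, 8, 10)  # made-up values
z = standardize(scores, mu = mean(scores), sigma = sd(scores))

# Standardized values have mean 0 and SD 1, up to rounding error
c(mean(z), sd(z))
```

A positive standardized value indicates a value above the mean, and a negative one indicates a value below the mean. The units cancel in the ratio, which is what allows values measured on different scales to be compared.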

Example 8.4

SAT scores have, approximately, a symmetric bell-shaped distribution with a mean of 1050 and a standard deviation of 200. ACT scores have, approximately, a symmetric bell-shaped distribution with a mean of 21 and a standard deviation of 5.5. Darius’s score on the SAT is 1500. Alfred’s score on the ACT is 31. Who scored relatively better on their test?

  1. Compute and interpret the standardized value for Darius’s SAT score.



  2. Compute and interpret the standardized value for Alfred’s ACT score.



  3. Who scored relatively better on their test?