16  Cumulative Distribution Functions

Example 16.1 Maggie and Seamus are babies who have just turned one. At their one-year visits to their pediatrician:

Explain what these percentiles mean.






Example 16.2 According to data on students who took the SAT in 2018-2019, 1400 was the 94th percentile of SAT scores, while 1000 was the 40th percentile. Let \(X\) be the SAT score of a randomly selected student (from this cohort), and let \(F_X\) be the cdf of \(X\). Evaluate the cdf for each of the following. For the purposes of this exercise, interpret these quantities in terms of actual SAT scores, which take values in 400, 410, 420, \(\ldots\), 1590, 1600.

  1. \(F_X(1400)\)

  2. \(F_X(1405)\)

  3. \(F_X(1000)\)

  4. \(F_X(1003.7)\)

  5. \(F_X(-3.1)\)

  6. \(F_X(390)\)

  7. \(F_X(399.5)\)

  8. \(F_X(1600)\)

  9. \(F_X(1610)\)

  10. \(F_X(2307.4)\)

  11. \(F_X(1400)-F_X(1000)\)

Example 16.3 Recall Example 15.1, where the series system resistance \(X\) (ohms) has pdf \[ f_X(x) = \begin{cases} \frac{1}{33}\left(1-\frac{1}{33}|x - 330|\right), & 297 < x < 363\\ 0, & \text{otherwise.} \end{cases} \]

  1. Find the cdf of \(X\) and sketch a plot of it. (Hint: consider \(297<x<330\) and \(330<x<363\) separately.)




  2. Evaluate and interpret \(F_X(340)\).




  3. Evaluate and interpret \(1 - F_X(340)\).




  4. Evaluate and interpret \(F_X(340) - F_X(330)\).




Example 16.4 Recall Example 15.3 where the waiting time, measured continuously in hours, from now until the next earthquake (of any magnitude) occurs in southern CA is a continuous random variable \(X\) with an Exponential distribution with rate parameter 2. The pdf of \(X\) is

\[ f_X(x) = 2 e^{-2x}, \; x \ge0. \]

  1. Find the cdf of \(X\), and sketch a plot of it.




  2. Evaluate and interpret \(F_X(0.25)\).




  3. Evaluate and interpret \(1 - F_X(0.25)\).




  4. Evaluate and interpret \(F_X(1) - F_X(0.5)\).




Example 16.5 Database queries to the Cal Poly data warehouse occur randomly throughout the day. During regular business hours, queries arrive at rate 0.8 per second on average, so that the average number of queries that arrive during any \(t\) second time interval is \(0.8t\). Suppose that the number of queries that arrive during any \(t\) second time interval follows a marginal Poisson distribution with mean \(0.8t\).

We are interested in the distribution of \(T\), the time (seconds) until the next query arrives.

  1. Interpret the event \(\{T>2\}\). How can you express this as an equivalent event involving the number of queries?




  2. Compute \(\text{P}(T > 2)\).




  3. Compute \(\text{P}(T > t)\) as a function of \(t>0\).




  4. Find the cdf of \(T\).




  5. Find the pdf of \(T\).




  6. What is the name of the distribution of \(T\)? What is its mean and SD?




Example 16.6 Let \(X\) be the number of heads in 3 flips of a fair coin. \(X\) takes values 0, 1, 2, 3, with respective probability 1/8, 3/8, 3/8, 1/8.

  1. Find the cdf of \(X\) and sketch a plot of it.




  2. Let \(Y\) be the number of tails in 3 flips of a fair coin. Find the cdf of \(Y\).