14 Discrete Random Variables: Probability Mass Functions
- The (probability) distribution of a random variable specifies the possible values of the random variable and a way of determining corresponding probabilities.
- A discrete random variable can take on only countably many isolated points on a number line. These are often counting type variables. Note that “countably many” includes the case of countably infinite, such as
. - We often specify the distribution of a discrete with a probability mass function.
- Certain common distributions have special names and properties.
- Do not confuse a random variable with its distribution.
- A random variable measures a numerical quantity which depends on the outcome of a random phenomenon
- The distribution of a random variable specifies the long run pattern of variation of values of the random variable over many repetitions of the underlying random phenomenon.
14.1 Probability mass functions
- The probability mass function (pmf) (a.k.a., density (pdf)) of a discrete RV
, defined on a probability space with probability measure , is a function which specifies each possible value of the RV and the probability that the RV takes that particular value: for each possible value of .
Example 14.1 Let
Example 14.2 Randomly select a county in the U.S. Let
This distribution1 is known as Benford’s law.
- Construct a table specifying the distribution of
, and the corresponding spinner.
- Find
14.2 Poisson distributions
Example 14.3 Let
This is known as the Poisson(2.3) distribution.
Verify that
is a valid pmf.
Compute
, and interpret the value as a long run relative frequency.
Construct a table and spinner corresponding to the distribution of
.
Find
, and interpret the value as a long run relative frequency. (The most home runs ever hit in a baseball game is 13.)
Use simulation to approximate the long run average value of
, and interpret this value.
Use simulation to approximate the variance and standard deviation of
.
- Poisson distributions are often used to model random variables that count “relatively rare events”.
- A discrete random variable
has a Poisson distribution with parameter if its probability mass function satisfies - The function
defines the shape of the pmf. The constant ensures that the probabilities sum to 1. - If
has a Poisson( ) distribution then
14.3 Binomial distributions
Example 14.4 Five messages of roughly equal length are to be transmitted across a noisy communication system. Assume the probability of any single message being transmitted successfully is 0.75, and that messages are transmitted independently of each other.
Let
Compute
.
Compute the probability that the first message is transmitted successfully but the rest are not.
Compute
.
Compute
.
Find the pmf of
.
Construct a table, plot, and spinner representing the distribution of
.
Make an educated guess for the long run average value of
.
What does the random variable
measure? What is the distribution of ?
- A discrete random variable
has a Binomial distribution with parameters , a nonnegative integer, and if its probability mass function is - If
has a Binomial( , ) distribution then - Imagine a box containing tickets
- With
representing the proportion of tickets in the box labeled 1 (“success”); the rest are labeled 0 (“failure”). - Randomly select
tickets from the box with replacement - Let
be the number of tickets in the sample that are labeled 1. - Then
has a Binomial( , ) distribution. - Since the tickets are labeled 1 and 0, the random variable
which counts the number of successes is equal to the sum of the 1/0 values on the tickets. - If the selections are made with replacement, the draws are independent, so it is enough to just specify the population proportion
without knowing the population size .
- With
- The situation in the previous bullets and the message example involves a sequence of Bernoulli trials.
- There are only two possible outcomes, “success” (1) and “failure” (0), on each trial.
- The unconditional/marginal probability of success is the same on every trial, and equal to
- The trials are independent.
- If
counts the number of successes in a fixed number, , of Bernoulli( ) trials then has a Binomial( ) distribution.