13 Equally Likely Outcomes, Counting Rules, and Uniform Probability Meaures
13.1 Equally likely outcomes
- For a sample space
with finitely many possible outcomes, assuming equally likely outcomes corresponds to a probabiliy measure which satisfies - Computing probabilities in the equally likely case reduces to just counting outcomes.
- But remember: even if the sample space outcomes are equally likely, the possible values of related random variables are usually not.
13.2 Some counting rules
Example 13.1 Suppose that to send an internet packet from the east coast of the US to the west coast, a packet must go through a major east-coast city (Boston, New York, Washington DC, or Atlanta), then a major midwest city (Chicago, St. Louis, or New Orleans), and then a major west-coast city (San Francisco or Los Angeles). How many possible routes are there?
- Multiplication principle for counting. Suppose that stage 1 of a process can be completed in any one of
ways. Further, suppose that for each way of completing the stage 1, stage 2 can be completed in any one of ways. Then the two-stage process can be completed in any one of ways. - This rule extends naturally to a
-stage process, which can then be completed in any one of ways. - In the multiplication principle it is not important whether there is a “first” or “second” stage. What is important is that there are distinct stages, each with its own number of “choices”.
Example 13.2 Suppose the board of directors of a corporation has identified 5 candidates — Ariana, Beyonce, Cardi, Drake, Elvis — for three executive positions: chief executive officer (CEO), chief financial officer (CFO), and chief operating officer (COO). In the interest of fairness, the board assigns 3 of the 5 candidates to the positions completely at random. No individual can hold more than one of the positions.
When calculating probabilities below, consider the sample space of all possible executive teams.
- How many executive teams are possible?
- What is the probability that Ariana is CEO, Beyonce is CFO, and Cardi is COO?
- What is the probability that Ariana is CEO and Beyonce is CFO?
- What is the probability that Ariana is CEO?
- Number of “ordered” arrangements. The number of “ordered” arrangements of
items, selected without replacement from a set of distinct items is - Recall the factorial notation:
. For example, . By definition, 0!=1.
Example 13.3 Your boss is forming a committee of 3 people for a new project team, and 5 people — Ariana, Beyonce, Cardi, Drake, Elvis— have volunteered to be on the committee. In the interest of fairness, 3 of the 5 people will be selected uniformly at random to form the committee.
How is this situation different from the executive team example?
How many possible committees consist of Ariana, Beyonce, Cardi? How many executive teams consisted of Ariana, Beyonce, Cardi?
How many different possible committees of 3 people can be formed from the 5 volunteers?
- The following is the relationship between “ordered” and “unordered” counting.
- “Ordered” and “unordered” are somewhat misnomers. It is not important whether there is a “first”, “second”, “third”, etc. What is important is that there are distinct stages, each with its own number of “choices”.
- In Example 13.2, it doesn’t matter if we pick the CEO first and the CFO second; what does matter is that choosing the CEO is a distinct stage from choosing the CFO.
- Number of permutations. The number of ways of arranging
items in order is - Number of combinations. The number of ways to choose
items without replacement from a group of distinct items where “order” does not matter, denoted , is - The quantity on the right is just a compact way of representing the quantity in the middle. But since factorials can be very large, it’s best to use the quantity in the middle to compute. In R:
choose(n, k)
. In Python:scipy.special.comb(n, k)
- The symbol
is by definition equal to the quantity in the middle above. It is read as “ choose ” and is referred to as a binomial coefficient.
Example 13.4 Continuous Example 13.3. Your boss is forming a committee of 3 people for a new project team, and 5 people — Ariana, Beyonce, Cardi, Drake, Elvis— have volunteered to be on the committee. In the interest of fairness, 3 of the 5 people will be selected uniformly at random to form the committee.
Find the probability that the committee consists of Ariana, Beyonce, and Cardi.
Find the probability that Ariana and Beyonce are on the committee.
Find the probability that Ariana is on the committee.
13.3 Hypergeometric distributions
Example 13.5 Capture-recapture sampling is a technique often used to estimate the size of a population. Suppose you want to estimate
In practice,
What are the possible values of
?
Describe in detail how you could use simulation to approximate the distribution of
.
Find
in two ways.
Find the probability that in the second sample the first butterfly selected is tagged but the rest are not.
Find the probability that in the second sample the first four butterflies selected are not tagged but the fifth is.
Find
in two ways.
Find
in two ways.
Suggest a formula for determining the distribution of
.
Suggest a simple shortcut formula for the long run average value of
.
Now suppose that
is unknown and that there are tagged butterflies (from the first sample). In a later sample of butterflies, are tagged. What is an intuitive estimate of based on this sample?
- A discrete random variable
has a Hypergeometric distribution with parameters , all nonnegative integers — with and — if its distribution satisfies1 - If
has a Hypergeometric( , , ) distribution - Imagine a box containing
tickets of which are labeled 1 (“success”) and of which are labeled 0 (“failure”).- Randomly select
tickets from the box without replacement and let be the number of tickets in the sample that are labeled 1. - Then
has a Hypergeometric( , , ) distribution. - Since the tickets are labeled 1 and 0, the random variable
which counts the number of successes is equal to the sum of the 1/0 values on the tickets.
13.4 Continuous Uniform probability measures
- The continuous analog of equally likely outcomes is a uniform probability measure.
- When the sample space is uncountable, size is measured continuously (length, area, volume) rather that discretely (counting).
Example 13.6 A circuit board is covered by a grid of squares, each square having sides 0.75mm in length. (Assume that the lines forming the sides of the squares having negligible width.) A single drop of solder lands, uniformly at random, on the circuit board. The diameter of the drop is 0.50mm. What is probability that the drop of solder lands entirely within a single square? Hint: focus on whichever square the center of the drop lands in and think of where the center needs to be so that the entire drop falls inside the square.
We must have
since there can’t be more successes in the sample than there are in the population. Similarly, we must have (that is, ) since there can’t be more failures in the sample than there are in the population. Often the population sizes and are large relative to the sample size in which case simply takes values .↩︎