Chapter 4 Probability
4.1 Overview
In this Chapter we introduce probability as a measure associated with a random experiment. After providing a short motivation for probability (Section 4.2), we begin in Section 4.3 with the notion of a Sample space (Section 4.3), the set of possible outcomes of a random experiment and Events (Section 4.4), the outcome(s) which occur. This enables us in Section 4.5 to define probability as a finite measure which uses the scale 0 (impossible) to 1 (certain) to define the likelihood of an event. We conclude the chapter by introducing the concept of conditional probability (Section 4.6), the probability of one event given (conditional upon) another event (or events) having occurred. We present the key results of the Theorem of total probability and Bayes’ formula. The discussion of conditional probability leads us naturally to consider the dependence between two (or more) events and the notion of independence, where the probability of an event occurring is not affected by whether or not another event has occurred and we explore this further in Section 4.7.
4.2 Motivation
There are many situations where we have uncertainty and want to quantify that uncertainty.
- Manchester United will win the Premier League this season.
- The Labour Party will win the next general election.
- The £ will rise against the $ today.
- Coin tossed repeatedly - a head will turn up eventually.
- In 5 throws of a dart, I will hit the Bull’s eye once.
- If I play the lottery every week I will win a prize next year.
(a)-(c) are subjective probabilities, whereas (d)-(f) are objective/statistical/physical probabilities.
The general idea is:
- A conceptual random experiment .
- List all possible outcomes for the experiment but don’t know which occurs, has occurred or will occur.
- Assign to each possible outcome a real number which is the probability of that outcome.
4.3 Sample Space
We begin by defining a set.
A set is a collection of objects. The notation for a set is to simply list every object separated by commas, and to surround this list by curly braces and . The objects in a set are referred to as the elements of the set.
There is no restrictions on what constitutes as an object in set. A set can have a finite or infinite number of elements. The ordering of the elements in a list is irrelevant. Two sets are equal if and only if they have the same collection of elements.
The sample space for a random experiment is the set of all possible outcomes of the random experiment.
Rolling a die.
The sample space for the roll of a die is
that is, the set of the six possible outcomes.
Dart in a target.
Dart into a circular target, radius 1:

Figure 4.1: Example: .
that is, the set of pairs of real numbers that are less than a distance 1 from the origin .
Note that the examples illustrate how may be discrete or continuous.
4.4 Events
An event relating to an experiment is a subset of .
Toss two coins.
The sample space for the toss of two fair coins is
Let be the event that at least one head occurs, then
Volcano eruption.
The sample space of time in years until a volcano next erupts is that is, all positive real numbers. Let event be the volcano erupting in the next years, then
We summarise key set notation, involving sets and , below:
- We use to denote that is an element of the set . Likewise, denotes that is not an element of ;
- The notation means that if , then . In this case, we say is a subset of ;
- - complement of , sometimes written .
consists of all points in that are not in . Thus, occurs if and only if does not occur, see Figure 4.2.

Figure 4.2: Complement example.
- The intersection of two sets and , denoted , is the set of all elements that belong to both and , see Figure 4.3.

Figure 4.3: Intersection example.
- If then and cannot both occur, i.e. and are disjoint (or exclusive) sets, see Figure 4.4.

Figure 4.4: Disjoint (exclusive) example.
- The union of two sets and , denoted , is the set of all elements that belong to either or to , see Figure 4.5.

Figure 4.5: Union example.
- The set with no elements in it is called the empty set and is denoted .
Note: and .
A summary of sets notation using the outcomes from a six sided die are presented in the Video 6.
Video 6: Set notation
4.5 Probability
There are different possible interpretations of the meaning of a probability:
Classical interpretation. Assuming that all outcomes of an experiment are equally likely, then the probability of an event , where is the number of outcomes satisfying and is the number of outcomes in (total number of possible outcomes).
Frequency interpretation. The probability of an event is the relative frequency of observing a particular outcome when an experiment is repeated a large number of times under similar circumstances.
Subjective interpretation. The probability of an event is an individual’s perception as to the likelihood of an event’s occurrence.
A probability (measure) is a real-valued set function defined on the events (subsets) of a sample space satisfying the following three axioms (see Kolmogorov, 1933):
- for any event ;
- ;
- If are disjoint events (i.e. for all ), then
If is infinite then 3. can be extended to:
3’. If is any infinite sequence of disjoint events (i.e. for all ), then
Note that all of the other standard properties of probability (measures) that we use are derived from these three axioms.
Using only the axioms above, prove:
for any event ;
where is the complement of ;
;
.
A summary of probability along with proofs of the results in Exercise 1 are provided in Video 7.
Video 7: Probability
Alternatively the solutions are available:
Solution to Exercise 1
which rearranges to give .
Special cases
If , then giving and it follows that .
Since , we have that and hence .
In many cases, consists of equally likely elements, i.e. with .
Then, for any event (i.e. subset of ), coinciding with the Classical interpretation of probability.
- Throw a die. .
The probability of throwing an odd number is
- Draw a card at random from a standard pack of 52.
for all .
If and , then there are 26 black cards, , 4 kings, and 2 black kings ( and ), ,
4.6 Conditional probability
The conditional probability of an event given an event is
Note if , then .
Moreover, since , we have that In other words, to compute the probability of both events and occurring, we can either:
- Consider first whether occurs, , and then whether occurs given that has occurred, ,
- Or consider first whether occurs, , and then whether occurs given that has occurred, .
Rolling a die.
Consider the experiment of tossing a fair 6-sided die. What is the probability of observing a 2 if the outcome was even?
Let event be observing a 2 and let event be the outcome is even. Find :
Two events and are independent if
If , two events, and , are independent if and only if .
Observations on Independence
- If and are NOT independent then
- and being independent is NOT the same as and being disjoint.
Independence: .
Disjoint (exclusive): .
Rolling a die.
Consider the experiment of tossing a fair 6-sided die.
Let , an even number is rolled on the die and , a multiple of 3 is rolled on the die.
Are and independent?
, so . Therefore and are independent.
A partition of a sample space is a collection of events in such that:
- for all (disjoint sets)
- .
We can set in Definition 7 and have infinitely many events constructed the partition.
Figure 4.6 presents an example of a partition of using six events .

Figure 4.6: Example of a partition of a sample space using six events.
For an event , This is illustrated in Figure 4.7 using the partition given in Figure 4.6.

Figure 4.7: The event F expressed in terms of the union of events.
Theorem of Total Probability.
Let be a partition of (i.e. for all and ) and let be any event. Then,
Since the ’s form a partition:
- .
- For each , .
Therefore
By the definition of conditional probability, for each ,
Tin can factory.
Suppose a factory uses three different machines to produce tin cans. Machine I produces 50% of all cans, machine II produces 30% of all cans and machine III produces the rest of the cans. It is known that 4% of cans produced on machine I are defective, 2% of the cans produced on machine II are defective and 5% of the cans produced on machine III are defective. If a can is selected at random, what is the probability that it is defective?
Let event be the can is produced by machine , . Let be the event that the can is defective. From the question, we know Therefore
Job interview problem.
A manager interviews 4 candidates for a job. The manager MUST make a decision offer/reject after each interview. Suppose that candidates are ranked (1 best) and are interviewed in random order.
The manager interviews and rejects the first candidate. They then offer the job to the first candidate that is better than the rejected candidate. If all are worse then they offer the job to the last candidate.
What is the probability that the job is offered to the best candidate?
Attempt Exercise 2 and then watch Video 8 for the solution.
Video 8: Job Interview
Alternatively the solutions are available:
Solution to Exercise 2
Let be the event that the best candidate is offered the job.
For , let be the event that candidate (the best candidate) is interviewed first. Note that s form a partition of the sample space and by randomness,We have that:
1. . If the ranked candidate is interviewed first they will be rejected and cannot be offered the job.
2. . If the ranked candidate is interviewed first then all candidates will be rejected until the best ( ranked) candidate is interviewed and offered the job.
3. . If the ranked candidate is interviewed first then whoever is interviewed first out of the ranked and ranked candidates will be offered the job. Each of these possibilities is equally likely.
4. . If the ranked (worst) candidate is interviewed first then the ranked candidate will only be offered the job if they are interviewed second.
Bayes Formula.
Let be a partition of (i.e. for all and ) such that for all , and let be any event such that . Then
If and , then by definition Since is a partition of such that for all , then by the Theorem of total probability we can rewrite and obtain
Tin can factory (continued).
Consider Example 8. Suppose now that we randomly select a can and find that it is defective.
What is the probability that it was produced by machine I?Guilty?
At a certain stage of a jury trial a jury member gauges that the probability that the defendant is guilty is .
The prosecution then produces evidence that fibres of the victim’s clothing were found on the defendant.
If the probability of such fibres being found is 1 if the defendant is guilty and if the defendant is not guilty, what now should be the jury member’s probability that the defendant is guilty?
Attempt Exercise 3 and then watch Video 9 for the solution.
Video 9: Guilty?
Alternatively the solutions are available:
Solution to Exercise 3
Let be the event that the defendant is guilty and let be the event that fibres are found on the victim’s clothing.
We want , the probability of being guilty given fibres are found on the victim’s clothing.
We have that , , and .
Therefore, by Bayes’ Theorem,
4.7 Mutual Independence
We can extend the concept of independence from two events to events.
Mutual independence.
Events are (mutually) independent if for any finite subset ,
Aircraft safety.
An aircraft safety system contains independent components. The aircraft can fly provided at least one of the components is working. The probability that the th component works is . Then
Task: Lab 3
Attempt the R Markdown file for Lab 3:
Lab 3: Probability in R
Student Exercises
Attempt the exercises below.
Question 1.
A card is drawn from a standard pack of 52. Let be the event ‘the card is black’, the event ‘the card is not an Ace’ and the event ‘the card is a Heart’. Calculate the following probabilities:
- ;
- ;
- ;
- ;
- .
Solution to Question 1.
- ;
- ;
- ;
- ;
- Since , we have that
.
Question 2.
A diagnostic test has a probability of giving a positive result when applied to a person suffering from a certain disease, and a probability of giving a (false) positive when applied to a non-sufferer. It is estimated that of the population are sufferers. Suppose that the test is applied to a person chosen at random from the population. Find the probabilities of the following events.
- the test result will be positive.
- the person is a sufferer, given a positive result.
- the person is a non-sufferer, given a negative result.
- the person is missclassified.
Solution to Question 2.
Let denote ‘test positive’ and denote ‘person is a sufferer’. Then question tells us that , , and .
- .
- Missclassification means either the person is a sufferer and the test is negative, or the person is not
a sufferer and the test is positive. These are disjoint events, so we require
Question 3.
In areas such as market research, medical research, etc, it is often hard to get people to answer embarrassing questions. One way around this is the following. Suppose that people are interviewed, where is even. Each person is given a card, chosen at random from cards, containing a single question. Half of the cards contain the embarrassing question, to which the answer is either ‘Yes’ or ‘No’. The other half of the cards contain the question ‘Is your birthday between January and June inclusive?’
Suppose that of the people interviewed, answer ‘Yes’ to the question that they received. Let be the event that a person gives a ‘Yes’ answer, the event that they received a card asking the embarrassing question. Assuming that half the population have birthdays between January and June inclusive, write down:
- ;
- ;
- .
Hence calculate the proportion of people who answered `Yes’ to the embarrassing question.
Hint: Try writing down an expression for using the Theorem of Total Probability.
Comment on your answer.
Solution to Question 3.
We have that:
- ;
- ;
- .
A little thought should help you see why.