Chapter 6 Joint Distribution Functions

6.1 Overview

In Section 5 we have introduced the concept of a random variable and a variety of discrete and continuous random variables. However, often in statistics it is important to consider the joint behaviour of two (or more) random variables. For example:

  1. Height, Weight.
  2. Degree class, graduate salary.

In this section we explore the joint distribution between two random variables X and Y.

6.2 Joint c.d.f. and p.d.f.

Joint c.d.f.

The joint (cumulative) probability distribution function (joint c.d.f.) of X and Y is defined by
FX,Y(x,y)=P({ω:X(ω)x and Y(ω)y})=P(Xx,Yy),

where x,yR.

Joint p.d.f.

Two r.v.’s X and Y are said to be jointly continuous, if there exists a function fX,Y(x,y)0 such that for every “nice” set CR2,
P((X,Y)C)=CfX,Y(x,y)dxdy.

The function fX,Y is called the joint probability density function (joint p.d.f.) of X and Y.

If X and Y are jointly continuous, then
FX,Y(x,y)=P(Xx,Yy)=yxfX,Y(u,v)dudv.
Hence we differentiate the c.d.f. FX,Y(x,y) with respect to both x and y to obtain the p.d.f.
fX,Y(x,y)=2xyFX,Y(x,y).

We note, as in the following example, that often the joint p.d.f. is non-zero on a subset of R2.


Suppose that
fX,Y(x,y)={24x(1xy)if x,y0 and x+y1,0otherwise.
  1. Find P(X>Y),
  2. Find P(X>12).
  1. Let C={(x,y):x>y} and write A={(x,y):fX,Y(x,y)>0}. Then,
CA={(x,y);x>0,y>0,x+y<1,x>y}.


Therefore
P(X>Y)=P((X,Y)C)=CfX,Y(x,y)dxdy=CA24x(1xy)dxdy=01/2y1y24x(1xy)dxdy=01/2[12x28x312yx2]y1ydy=01/2(412y+16y3)dy=[4y6y2+4y4]01/2=232+14=34.
  1. Let D={(x,y):x>1/2}, then
DA={(x,y);x>1/2,y>0,x+y<1}.
Therefore
P(X>1/2)=P((X,Y)D)=DfX,Y(x,y)dxdy=DA24x(1xy)dxdy=1/2101x24x(1xy)dydx=1/21[24xy(1x12y)]01xdx=1/2112x(1x)2dx=[122x2243x3+124x4]1/21=68+332+1316=516.


6.3 Marginal c.d.f. and p.d.f.

There are many situations with bivariate distributions where we are interested in one of the random variables. For example, we might have the joint distribution of height and weight of individuals but only be interested in the weight of individuals. This is known as the marginal distribution.

Marginal c.d.f.

Suppose that the c.d.f. of X and Y is given by FX,Y, then the c.d.f. of X can be obtained from FX,Y since FX(x)=P(Xx)=P(Xx,Y<)=limyFX,Y(x,y). FX is called the marginal distribution (marginal c.d.f.) of X.

Marginal p.d.f.

If fX,Y is the joint p.d.f. of X and Y, then the marginal probability density function (marginal p.d.f.) of X is given by fX(x)=fX,Y(x,y)dy.


Consider Example 6.2.3.

Find the marginal p.d.f. and c.d.f of Y.

fY(y)=fX,Y(x,y)dx={01y24x(1xy)dx0y1,0otherwise.={4(1y)30y1,0otherwise.

Note that marginal distribution of Y is a Beta(1,4) distribution.

Hence,
FY(y)={0,y<0,0y4(1u)3du=1(1y)4,0y1,1,y>1.



Find the p.d.f. of Z=X/Y, where
fX,Y(x,y)={e(x+y)0<x,y<,0otherwise.

Attempt Example 6.3.4 and then watch Video 14 for the solutions.

Video 14: Ratio of Exponentials

Solution to Example 6.3.4.

Clearly, Z>0. For z>0,

Therefore
FZ(z)=P(Zz)=P(X/Yz)={(x,y):x/yz}fX,Y(x,y)dxdy=00yze(x+y)dxdy=0ey(1+z)+eydy=111+z
and so
fZ(z)=dFZ(z)dz={1(1+z)2,z>0,0,z0.


Note that we can extend the notion of joint and marginal distributions to random variables X1,X2,,Xn in a similar fashion.

6.4 Independent random variables

Independent random variables

Random variables X and Y are said to be independent if, for all x,yR,

P(Xx,Yy)=P(Xx)P(Yy),

that is, for all x,yR, FX,Y(x,y)=FX(x)FY(y).

If X and Y are discrete random variables with joint p.m.f. pX,Y(x,y) and marginal p.m.f.’s pX(x) and pY(y), respectively, then X and Y are independent if and only if for all x,yR,
pX,Y(x,y)=pX(x)pY(y).
If X and Y are continuous random variables with joint p.d.f. fX,Y(x,y) and marginal p.d.f.’s fX(x) and fY(y), respectively, then X and Y are independent if and only if for all x,yR,
fX,Y(x,y)=fX(x)fY(y).
For example, in Example 6.3.4 X and Y have joint probability density function:
fX,Y(x,y)=exp({x+y})=exp(x)exp(y)=fX(x)fY(y),(x,y>0),

where both X and Y are distributed according to Exp(1). Thus the distribution Z given in Example 6.3.4 is the ratio of two independent exponential random variables with mean 1.

Note that we can easily extend the notion of independent random variables to random variables X1,X2,,Xn.

Independent and identically distributed

The random variables X1,X2,,Xn are said to independent and identically distributed (i.i.d.) if,

  • X1,X2,,Xn are independent.

  • X1,X2,,Xn all have the same distribution, that is, XiF for all i=1,,n.

Definition 6.4.2 extends the notion of i.i.d. given at the start of Section 5.4.2 for discrete random variables.

Random sample

The random variables X1,X2,,Xn are said to be a random sample if they are i.i.d.


Suppose X1,X2,,Xn are a random sample from the Poisson distribution with mean λ. Find the joint p.m.f. of X1,X2,,Xn.


If XiPo(λ), then its p.m.f. is given by

P(Xi=xi)=pXi(xi)={eλλxixi!if xi=0,1,2,,0otherwise.

Since X1,X2,,Xn are independent, their joint p.m.f. is given by,

pX1,X2,,Xn(x1,x2,,xn)=i=1npXi(xi)={i=1neλλxixi!if xi=0,1,2,,0otherwise.={enλλi=1nxii=1nxi!if xi=0,1,2,,0otherwise.


The joint p.m.f. of X=(X1,X2,,Xn) tells us how likely we are to observe x=(x1,x2,,xn) given λ. This can be used either:

  1. To compute P(X=x) when λ is known;
  2. Or, more commonly in statistics, to assess what is a good estimate of λ given x in situations where λ is unknown, see Parameter Estimation in Section 9.

Student Exercise

Attempt the exercise below.


A theory of chemical reactions suggests that the variation in the quantities X and Y of two products C1 and C2 of a certain reaction is described by the joint probability density function
fX,Y(x,y)=2(1+x+y)3x0,y0.

On the basis of this theory, answer the following questions.

  1. What is the probability that at least one unit of each product is produced?
  2. Determine the probability that the quantity of C1 produced is less than half that of C2.
  3. Find the c.d.f. for the total quantity of C1 and C2.
Solution to Exercise 6.1.
  1. The required probability is
    P(X1,Y1)=112(1+x+y)3dydx=1[1(1+x+y)3]y=1dx=11(2+x)2dx=[12+x]x=1=13.
  2. The required probability is
    P(X12Y)=00y/22(1+x+y)3dxdy=0[1(1+x+y)3]x=0y/2dy=0(1(1+y)21(1+3y/2)2)du=[11+y2/3(1+3y/2)]y=0=13.
  3. Since both X and Y are non-negative random variables, X+Y is non-negative. Thus P(X+Yz)=0 for z<0. For z0,
    P(X+Yz)=0z0zy2(1+x+y)3dxdy=0z[1(1+x+y)3]x=0zydy=0z(1(1+y)21(1+z)2)du=[11+yy(1+z)2]y=0z=11+zz(1+z)2+1+0=(z1+z)2.