Chapter 6 Joint Distribution Functions

6.1 Overview

In Section 5 we have introduced the concept of a random variable and a variety of discrete and continuous random variables. However, often in statistics it is important to consider the joint behaviour of two (or more) random variables. For example:

  1. Height, Weight.
  2. Degree class, graduate salary.

In this Section we explore the joint distribution between two random variables XX and YY.

6.2 Joint c.d.f. and p.d.f.

The joint (cumulative) probability distribution function (joint c.d.f.) of XX and YY is defined by
FX,Y(x,y)=P({ω:X(ω)x and Y(ω)y})=P(Xx,Yy),FX,Y(x,y)=P({ω:X(ω)x and Y(ω)y})=P(Xx,Yy),

where x,yRx,yR.

Two r.v.’s XX and YY are said to be jointly continuous, if there exists a function fX,Y(x,y)0fX,Y(x,y)0 such that for every “nice” set CR2CR2,
P((X,Y)C)=CfX,Y(x,y)dxdy.P((X,Y)C)=CfX,Y(x,y)dxdy.

The function fX,YfX,Y is called the joint probability density function (joint p.d.f.) of XX and YY.

If XX and YY are jointly continuous, then
FX,Y(x,y)=P(Xx,Yy)=yxfX,Y(u,v)dudv.FX,Y(x,y)=P(Xx,Yy)=yxfX,Y(u,v)dudv.

Hence we differentiate the c.d.f. FX,Y(x,y)FX,Y(x,y) with respect to both xx and yy to obtain the p.d.f. fX,Y(x,y)=2xyFX,Y(x,y).fX,Y(x,y)=2xyFX,Y(x,y).

Suppose that
fX,Y(x,y)={24x(1xy)if x,y0 and x+y1,0otherwise.fX,Y(x,y)={24x(1xy)if x,y0 and x+y1,0otherwise.
  1. Find P(X>Y)P(X>Y),
  2. Find P(X>12)P(X>12).
  1. Let C={(x,y):x>y}C={(x,y):x>y} and write A={(x,y):fX,Y(x,y)>0}A={(x,y):fX,Y(x,y)>0}. Then,
CA={(x,y);x>0,y>0,x+y<1,x>y}.CA={(x,y);x>0,y>0,x+y<1,x>y}.


Therefore
P(X>Y)=P((X,Y)C)=CfX,Y(x,y)dxdy=CA24x(1xy)dxdy=1/201yy24x(1xy)dxdy=1/20[12x28x312yx2]1yydy=1/20(412y+16y3)dy=[4y6y2+4y4]1/20=232+14=34.P(X>Y)=P((X,Y)C)=CfX,Y(x,y)dxdy=CA24x(1xy)dxdy=1/201yy24x(1xy)dxdy=1/20[12x28x312yx2]1yydy=1/20(412y+16y3)dy=[4y6y2+4y4]1/20=232+14=34.
  1. Let D={(x,y):x>1/2}D={(x,y):x>1/2}, then
DA={(x,y);x>1/2,y>0,x+y<1}.DA={(x,y);x>1/2,y>0,x+y<1}.
Therefore
P(X>1/2)=P((X,Y)D)=DfX,Y(x,y)dxdy=DA24x(1xy)dxdy=11/21x024x(1xy)dydx=11/2[24xy(1x12y)]1x0dx=11/212x(1x)2dx=[122x2243x3+124x4]11/2=68+332+1316=516.P(X>1/2)=P((X,Y)D)=DfX,Y(x,y)dxdy=DA24x(1xy)dxdy=11/21x024x(1xy)dydx=11/2[24xy(1x12y)]1x0dx=11/212x(1x)2dx=[122x2243x3+124x4]11/2=68+332+1316=516.


6.3 Marginal c.d.f. and p.d.f.

There are many situations with bivariate distributions where we are interested in one of the random variables. For example, we might have the joint distribution of height and weight of individuals but only be interested in the weight of individuals. This is known as the marginal distribution.

Suppose that the c.d.f. of XX and YY is given by FX,YFX,Y, then the c.d.f. of XX can be obtained from FX,YFX,Y since FX(x)=P(Xx)=P(Xx,Y<)=limyFX,Y(x,y).FX(x)=P(Xx)=P(Xx,Y<)=limyFX,Y(x,y). FXFX is called the marginal distribution (marginal c.d.f.) of XX.

If fX,YfX,Y is the joint p.d.f. of XX and YY, then the marginal probability density function (marginal p.d.f.) of XX is given by fX(x)=fX,Y(x,y)dy.fX(x)=fX,Y(x,y)dy.

Consider Example 1.

Find the marginal p.d.f. and c.d.f of Y.

fY(y)=fX,Y(x,y)dx={1y024x(1xy)dx0y1,0otherwise.={4(1y)30y1,0otherwise.fY(y)=fX,Y(x,y)dx={1y024x(1xy)dx0y1,0otherwise.={4(1y)30y1,0otherwise.
Hence,
FY(y)={0,y<0,y04(1u)3du=1(1y)4,0y1,1,y>1.FY(y)=⎪ ⎪ ⎪⎪ ⎪ ⎪0,y<0,y04(1u)3du=1(1y)4,0y1,1,y>1.


Find the p.d.f. of Z=X/YZ=X/Y, where
fX,Y(x,y)={e(x+y)0<x,y<,0otherwise.fX,Y(x,y)={e(x+y)0<x,y<,0otherwise.

Attempt Exercise 1 and then watch Video 14 for the solutions.

Video 14: Ratio of Exponentials

Alternatively the solutions are available:

Solution to Exercise 1

Clearly, Z>0Z>0. For z>0z>0,

Therefore
FZ(z)=P(Zz)=P(X/Yz)={(x,y):x/yz}fX,Y(x,y)dxdy=0yz0e(x+y)dxdy=0ey(1+z)+eydy=111+zFZ(z)=P(Zz)=P(X/Yz)={(x,y):x/yz}fX,Y(x,y)dxdy=0yz0e(x+y)dxdy=0ey(1+z)+eydy=111+z
and so
fZ(z)=dFZ(z)dz={1(1+z)2,z>0,0,z0.fZ(z)=dFZ(z)dz={1(1+z)2,z>0,0,z0.


Note that we can extend the notion of joint and marginal distributions to random variables X1,X2,,XnX1,X2,,Xn in a similar fashion.

6.4 Independent random variables

Random variables XX and YY are said to be independent if, for all x,yRx,yR,

P(Xx,Yy)=P(Xx)P(Yy),P(Xx,Yy)=P(Xx)P(Yy),

that is, for all x,yRx,yR, FX,Y(x,y)=FX(x)FY(y)FX,Y(x,y)=FX(x)FY(y).

If XX and YY are discrete random variables with joint p.m.f. pX,Y(x,y)pX,Y(x,y) and marginal p.m.f.’s pX(x)pX(x) and pY(y)pY(y), respectively, then XX and YY are independent if and only if for all x,yRx,yR,
pX,Y(x,y)=pX(x)pY(y).pX,Y(x,y)=pX(x)pY(y).
If XX and YY are continuous random variables with joint p.d.f. fX,Y(x,y)fX,Y(x,y) and marginal p.d.f.’s fX(x)fX(x) and fY(y)fY(y), respectively, then XX and YY are independent if and only if for all x,yRx,yR,
fX,Y(x,y)=fX(x)fY(y).fX,Y(x,y)=fX(x)fY(y).
For example, in Exercise 1 XX and YY have joint probability density function:
fX,Y(x,y)=exp({x+y})=exp(x)exp(y)=fX(x)fY(y),(x,y>0),fX,Y(x,y)=exp({x+y})=exp(x)exp(y)=fX(x)fY(y),(x,y>0),

where both XX and YY are distributed according to Exp(1)Exp(1). Thus the distribution ZZ given in Exercise 1 is the ratio of two independent exponential random variables with mean 1.

Note that we can easily extend the notion of independent random variables to random variables X1,X2,,XnX1,X2,,Xn.

The random variables X1,X2,,XnX1,X2,,Xn are said to independent and identically distributed (i.i.d.) if,

  • X1,X2,,XnX1,X2,,Xn are independent.

  • X1,X2,,XnX1,X2,,Xn all have the same distribution, that is, XiFXiF for all i=1,,ni=1,,n.

Definition 6 extends the notion of i.i.d. given at the start of Section 5.4.2 for discrete random variables.

The random variables X1,X2,,XnX1,X2,,Xn are said to be a random sample if they are i.i.d.

Suppose X1,X2,,XnX1,X2,,Xn are a random sample from the Poisson distribution with mean λλ. Find the joint p.m.f. of X1,X2,,XnX1,X2,,Xn.


If XiPo(λ)XiPo(λ), then its p.m.f. is given by

P(Xi=xi)=pXi(xi)={eλλxixi!if xi=0,1,2,,0otherwise.P(Xi=xi)=pXi(xi)={eλλxixi!if xi=0,1,2,,0otherwise.

Since X1,X2,,XnX1,X2,,Xn are independent, their joint p.m.f. is given by,

pX1,X2,,Xn(x1,x2,,xn)=ni=1pXi(xi)={ni=1eλλxixi!if xi=0,1,2,,0otherwise.={enλλni=1xini=1xi!if xi=0,1,2,,0otherwise.pX1,X2,,Xn(x1,x2,,xn)=ni=1pXi(xi)={ni=1eλλxixi!if xi=0,1,2,,0otherwise.={enλλni=1xini=1xi!if xi=0,1,2,,0otherwise.


The joint p.m.f. of X=(X1,X2,,Xn)X=(X1,X2,,Xn) tells us how likely we are to observe x=(x1,x2,,xn)x=(x1,x2,,xn) given λλ. This can be used either:

  1. To compute P(X=x)P(X=x) when λλ is known;
  2. Or, more commonly in statistics, to assess what is a good estimate of λλ given xx in situations where λλ is unknown.

Student Exercise

Attempt the exercise below.

Question.

A theory of chemical reactions suggests that the variation in the quantities XX and YY of two products C1C1 and C2C2 of a certain reaction is described by the joint probability density function
fX,Y(x,y)=2(1+x+y)3x0,y0.fX,Y(x,y)=2(1+x+y)3x0,y0.

On the basis of this theory, answer the following questions.

  1. What is the probability that at least one unit of each product is produced?
  2. Determine the probability that quantity of C1C1 produced is less than half that of C2C2.
  3. Find the c.d.f. for the total quantity of C1C1 and C2C2.
Solution to Question.
  1. The required probability is
    P(X1,Y1)=112(1+x+y)3dydx=1[1(1+x+y)3]y=1dx=11(2+x)2dx=[12+x]x=1=13.P(X1,Y1)=112(1+x+y)3dydx=1[1(1+x+y)3]y=1dx=11(2+x)2dx=[12+x]x=1=13.
  2. The required probability is
    P(X12Y)=0y/202(1+x+y)3dxdy=0[1(1+x+y)3]y/2x=0dy=0(1(1+y)21(1+3y/2)2)du=[11+y2/3(1+3y/2)]y=0=13.P(X12Y)=0y/202(1+x+y)3dxdy=0[1(1+x+y)3]y/2x=0dy=0(1(1+y)21(1+3y/2)2)du=[11+y2/3(1+3y/2)]y=0=13.
  3. Since both XX and YY are non-negative random variables, X+YX+Y is non-negative. Thus P(X+Yz)=0P(X+Yz)=0 for z<0z<0. For z0z0,
    P(X+Yz)=z0zy02(1+x+y)3dxdy=z0[1(1+x+y)3]zyx=0dy=z0(1(1+y)21(1+z)2)du=[11+yy(1+z)2]zy=0=11+zz(1+z)2+1+0=(z1+z)2.P(X+Yz)=z0zy02(1+x+y)3dxdy=z0[1(1+x+y)3]zyx=0dy=z0(1(1+y)21(1+z)2)du=[11+yy(1+z)2]zy=0=11+zz(1+z)2+1+0=(z1+z)2.