Chapter 15 Multivariate Normal Distribution

15.1 Introduction

In previous sections we have introduced joint distributions between random variables X and Y and considered their marginal and conditional distributions. In this section, we study the special case where the joint distribution of X1,X2,,Xn is a multivariate normal distribution. In this case both marginal and conditional distributions are (multivariate) normal distributions. We pay particular attention to the special case, n=2, the bivariate normal distribution. Multivariate normal distributions appear in many areas of statistics and being able to manipulate multivariate normal distributions is an important skill.

15.2 n-Dimensional Normal Distribution

Multivariate normal distribution

A random vector X=(X1,X2,,Xn)T is said to have an n-dimensional normal distribution with parameters μ and Σ if the joint p.d.f. of X is given by fX(x)=(2π)n2|Σ|12exp{12(xμ)TΣ1(xμ)}, where μ=(μ1,μ2,,μn)T and Σ=(σij) is an n×n real, symmetric, positive definite matrix with all positive eigenvalues. It is denoted by XNn(μ,Σ).

The multivariate normal distribution has the following important properties:

  • If D is a p×n matrix and XNn(μ,Σ), then Z=DXNp(Dμ,DΣDT).
  • The marginal distribution of each component Xi is normal with E[Xi]=μi and Var(Xi)=σii. Note that this is a direct consequence of the first property taking D=(0,,0,1,0,,0), that is, the ith component equal to 1.
  • The components X1,X2,,Xn of a multivariate normal random vector are independent of each other if and only if X1,X2,,Xn are uncorrelated, i.e. σij=Cov(Xi,Xj)=0 for all ij.
    That is, for normal random variables uncorrelated (zero covariance) implies independence.
  • Conditional distributions derived from joint normal distributions are normal.

Bivariate normal distribution

The bivariate normal distribution is the special case n=2 for the n-dimensional normal distribution.

Bivariate normal distribution

The random variables X1 and X2 are said to have a bivariate normal distribution with mean μ=(μ1,μ2) and variance-covariance matrix Σ=(σ12ρσ1σ2ρσ1σ2σ22) if their joint p.d.f. is given by fX1,X2(x1,x2)=12πσ1σ21ρ2exp{12(1ρ2)[(x1μ1σ1)22ρ(x1μ1σ1)(x2μ2σ2)+(x2μ2σ2)2]}.

For X1 and X2 with a bivariate normal distribution, we have:

  • X1N(μ1,σ12) and X2N(μ2,σ22) and Cov(X1,X2)=ρσ1σ2.

  • (X1|X2=x2)N(μ1+ρσ1σ2(x2μ2),σ12(1ρ2)).

  • (X2|X1=x1)N(μ2+ρσ2σ1(x1μ1),σ22(1ρ2)).

Suppose that Z=(Z1,Z2), where Z1 and Z2 are independent N(0,1) random variables, that is, ZN2(0,I2), where I2 is the 2×2 identity matrix.

Then for any μ=(μ1,μ2)R2 and variance-covariance matrix Σ=(σ12ρσ1σ2ρσ1σ2σ22), with σ1,σ2>0 and 1<ρ<1, XN2(μ,Σ) can be expressed as a transformation of Z. Specifically, X=μ+LZ, where L=(σ10ρσ21ρ2σ2). Note that Σ=LL and this is known as the Cholesky decomposition.

The Cholesky decomposition extends to n×n variance-covariance matrix, Σ and is a common approach to convert n independent univariate normal distributions into a multivariate normal distribution.

Trivariate normal.

Suppose X=(X1,X2,X3)TN3(0,Σ), where

Σ=(210140005).
  1. Find the distribution of Y=X1+X2.

  2. Determine the constant c such that Y1=2X1+cX2 and Y2=2X1+cX3 are independent.

Attempt Example 15.2.4: Trivariate Normal and then watch Video 23 for the solutions.

Video 23: Trivariate Normal

Solution to Example 15.2.4.
  1. Writing Y=X1+X2, in the form DX requires D=(1 1 0). By the properties of a multivariate normal distribution
    YN(D0,DΣDT),
    where D0=0 and
    DΣDT=(110)(210140005)(110)=(350)(110)=8.
    Therefore, YN(0,8).
  2. Let Y=(Y1Y2)=DX. Choose
    D=(2c020c).
    By the properties of a multivariate normal distribution, YN2(D0,DΣDT), where D0=0 and
    DΣDT=(2c020c)(210140005)(22c00c)=(4+c2+4c0425c)(22c00c)=(8+4c+4c28+2c8+2c8+5c2).

For Y1 to be independent of Y2, necessarily Cov(Y1,Y2)=8+2c=0. Therefore c=4.

Task: Session 8

Attempt the R Markdown file for Session 8:
Session 8: Transformations and Multivariate Normal Distribution