Chapter 2 Maximum Likelihood Estiamtion

2.1 Introduction

  • The Maximum Likelihood Estimation (MLE) is a method of estimating the parameters of a model. This estimation method is one of the most widely used.

  • The method of maximum likelihood selects the set of values of the model parameters that maximizes the likelihood function. Intuitively, this maximizes the “agreement” of the selected model with the observed data.

  • The Maximum-likelihood Estimation gives an unified approach to estimation.

2.2 The Principle of Maximum Likelihood

We take poisson distributed random variables as an example. Suppose that X1,X2,,XN are i.i.d. discrete random variables, such that XiPois(θ) with a pmf (probability mass function) defined as:

Pr(Xi=xi)=exp(θ)θxixi!

where θ is an unknown parameter to estimate.

Question: What is the probability of observing the particular sample {x1,x2,,xN}, assuming that a Poisson distribution with as yet unknown parameter θ generated the data?

This probability is equal to

Pr((X1=x1)(XN=xN))

Since the variables Xi are i.i.d., this joint probability is equal to the product of the marginal probabilities:

Pr((X1=x1)(XN=xN))=Ni=1Pr(Xi=xi)

Given the pmf of the Poisson distribution, we have:

Pr((X1=x1)(XN=xN))=Ni=1exp(θ)θxixi!=exp(θN)θNi=1xiNi=1xi!

This joint probability is a function of θ (the unknown parameter) and corresponds to the likelihood of the sample {x1,x2,,xN} denoted by

L(x1,,xN|θ)=Pr((X1=x1)(XN=xN))

Consider maximizing the likelihood function L(x1,,xN|θ) with respect to θ. Since the log function is monotonically increasing, we usually maximize lnL(x1,,xN|θ) instead. We call this as loglikelihood function: (x1,,xN|θ)=lnL(x1,,xN|θ), or simply (θ). In this case:

(x1,,xN|θ)=θN+ln(θ)Ni=1xiln(Ni=1xi!)

The simplest way to find the θ that maximizes (θ) is to take a derivative.

(θ)θ=N+1θNi=1xi

To make sure that we indeed maximize not minimize (θ), we should also check that the second derivative is less than 0:

2(θ)θ2=1θ2Ni=1xi<0

Therefore, the maximum likelihood estimator ˆθmle is:

ˆθmle=1NNi=1xi

For the Laplace model, the maximum-likelihood estimates are:

ˆμ=median(xt)ˆb=1nni=1|xtˆμ|

Note that they are different from the MOM results.

2.3 Example (Exponential Distribution)

Suppose that D1,D2,,DN are i.i.d. positive random variables (durations for instance), with DiExp(θ) with θ0 and

Li(di|θi)=fDi(di|θi)=1θexp(diθ)i(di|θi)=ln(fDi(di|θi))=ln(θ)diθ