7.4 Estimation

We can estimate model parameters using maximum likelihood. But first, we must write down the likelihood. Because the conditional intensity function fully specifies a point process, the likelihood function can be written entirely in terms of it.

Given a dataset of event times $t_1, t_2, \dots, t_n$ observed on the interval $[0, T]$, the log-likelihood for a model $\lambda(t \mid \mathcal{H}_t; \theta)$ with parameter vector $\theta$ is

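\[
\ell(\theta) = \sum_{i=1}^{n} \log \lambda(t_i \mid \mathcal{H}_{t_i}; \theta) - \int_0^T \lambda(t \mid \mathcal{H}_t; \theta)\, dt.
\]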

The log-likelihood has two parts:

  1. The first sum “rewards” a model for having high intensity where event times are located.

  2. The integral part, because it is subtracted off, rewards a model for having low intensity where the event times are not located.

With point process models, it is just as important to get it right where the points are not located as it is to model where the points are located.
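The two parts can be computed separately, which makes the trade-off visible. The following is a minimal sketch, not a definitive implementation: the function name `log_likelihood`, the calling convention `intensity(t, history, theta)`, the trapezoid-rule grid, and the event times are all assumptions made for illustration.

```python
import math
from bisect import bisect_left

def log_likelihood(event_times, T, intensity, theta, n_grid=2000):
    """Return (log-likelihood, sum term, integral term) on [0, T].

    `intensity(t, history, theta)` is a hypothetical interface: it returns
    the conditional intensity at time t given the event times before t.
    """
    times = sorted(event_times)

    # Part 1: sum of log-intensities at the observed event times.
    sum_term = 0.0
    for i, t in enumerate(times):
        sum_term += math.log(intensity(t, times[:i], theta))

    # Part 2: integral of the intensity over [0, T], approximated with the
    # trapezoid rule on an evenly spaced grid (an assumption; any
    # quadrature rule or a closed form could be used instead).
    h = T / n_grid
    values = []
    for k in range(n_grid + 1):
        t = k * h
        history = times[:bisect_left(times, t)]  # events strictly before t
        values.append(intensity(t, history, theta))
    integral_term = h * (sum(values) - 0.5 * (values[0] + values[-1]))

    return sum_term - integral_term, sum_term, integral_term

# Illustrative event times (made up, not real data) observed on [0, 10].
event_times = [0.8, 2.1, 3.5, 4.2, 5.0, 5.9, 6.4, 7.1, 7.7, 8.2, 8.8, 9.1, 9.6]
T = 10.0

# A constant intensity that ignores the history, evaluated at two rates.
constant = lambda t, history, theta: theta[0]
ll_low, sum_low, int_low = log_likelihood(event_times, T, constant, [1.3])
ll_high, sum_high, int_high = log_likelihood(event_times, T, constant, [5.0])
```

Raising the rate from 1.3 to 5.0 increases the sum term, but the integral term grows faster, so the overall log-likelihood is lower for the higher rate.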

Typically, the log-likelihood $\ell(\theta)$ will be a nonlinear function of $\theta$ and therefore will need to be maximized numerically using standard nonlinear optimization routines.
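As one illustration of such a routine, the sketch below minimizes the negative log-likelihood with `scipy.optimize.minimize`. The model is an assumption chosen for convenience rather than one taken from the text: a log-linear intensity $\lambda(t; \theta) = \exp(a + bt)$ that does not depend on the history, so the integral has a closed form. The event times and the function names are likewise hypothetical.

```python
import numpy as np
from scipy.optimize import minimize

# Illustrative event times (made up, not real data) observed on [0, T].
event_times = np.array([0.8, 2.1, 3.5, 4.2, 5.0, 5.9, 6.4,
                        7.1, 7.7, 8.2, 8.8, 9.1, 9.6])
T = 10.0

def integrated_intensity(theta, T):
    """Closed-form integral of exp(a + b*t) over [0, T]."""
    a, b = theta
    if abs(b) < 1e-10:
        # Limit as b -> 0: the intensity is the constant exp(a).
        return np.exp(a) * T
    return np.exp(a) * np.expm1(b * T) / b

def negative_log_likelihood(theta, event_times, T):
    a, b = theta
    # Sum of log-intensities at the event times: log(exp(a + b*t)) = a + b*t.
    sum_term = np.sum(a + b * event_times)
    return -(sum_term - integrated_intensity(theta, T))

# Start from the constant-rate fit: a = log(n / T), b = 0.
theta0 = np.array([np.log(len(event_times) / T), 0.0])
result = minimize(negative_log_likelihood, theta0,
                  args=(event_times, T), method="BFGS")
a_hat, b_hat = result.x
```

A useful sanity check for this particular model is that, at the optimum, the fitted intensity integrated over $[0, T]$ should be close to the number of observed events $n$, since that is the score equation for $a$.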

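For a stationary Poisson process with rate $\lambda$, the intensity is constant, so the sum contributes $n \log \lambda$ and the integral contributes $\lambda T$. The log-likelihood therefore reduces to the familiar

\[
\ell(\lambda) = n \log \lambda - \lambda T.
\]

The maximum likelihood estimate of $\lambda$ is therefore $\hat{\lambda} = n/T$.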
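For completeness, the estimate follows from a short calculation. Differentiating the stationary Poisson log-likelihood with respect to $\lambda$ and setting the derivative to zero gives

\[
\frac{d\ell}{d\lambda} = \frac{n}{\lambda} - T = 0
\quad \Longrightarrow \quad
\hat{\lambda} = \frac{n}{T},
\]

and, assuming $n > 0$, the second derivative

\[
\frac{d^2\ell}{d\lambda^2} = -\frac{n}{\lambda^2} < 0
\]

confirms that this stationary point is a maximum. In words, the estimated rate is the number of observed events divided by the length of the observation window.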