Homework 7

Problem 1

Assume a Poisson( $μ$ ) model for the number of home runs hit (in total by both teams) in a MLB game. Let $X_{1}, \dots, X_{n}$ be a random sample of home run counts for $n$ games.

Suppose we want to estimate $θ = μ e^{- μ}$ , the probability that any single game has exactly 1 HR (for Poisson( $μ$ ), $P (X = 1) = e^{- μ} u^{1} / 1! = μ e^{- μ}$ ). Consider two estimators of $θ$ :

$\hat{θ} = \bar{X} e^{- \bar{X}}$
$\hat{p} = sample proportion of 1s = \frac{number of games in the sample with 1 HR}{sample size}$

Compute the value of $\hat{θ}$ based on the sample (3, 0, 1, 4, 0). Write a clearly worded sentence reporting in context this estimate of $θ$ .
Compute the value of $\hat{p}$ based on the sample (3, 0, 1, 4, 0). Write a clearly worded sentence reporting in context your estimate of $θ$ .
Which of these two estimators is the MLE of $θ$ in this situation? Explain, without doing any calculations.
It can be shown that $\hat{p}$ is an unbiased estimator of $θ$ . Explain in words what this means.
Is $\hat{θ}$ an unbiased estimator of $θ$ ? Explain. (You don’t have to derive anything; just apply a general principle.)
Suppose $μ = 2.3$ and $n = 5$ . Explain in full detail how you would use simulation to approximate the bias of $\hat{θ}$ in this case.
Coding required. Conduct the simulation from the previous part and approximate bias of $\hat{θ}$ when $μ = 2.3$ and $n = 5$ .
Explain in full detail how you would use simulation to approximate the bias function of $\hat{θ}$ when $n = 5$ .
Coding required. Conduct the simulation from the previous part and plot the approximate bias function when $n = 5$ . For what values of $μ$ does $\hat{θ}$ tend to overestimate $μ$ ? Underestimate? For what values of $μ$ is the bias the worst?

Problem 2

Continuing Problem 1.

It can be shown that $Var (\hat{p}) = \frac{θ (1 - θ)}{n}$ . Compute $Var (\hat{p})$ when $μ = 2.3$ and $n = 5$ . Then write a clearly worded sentence interpreting this value.
Suppose $μ = 2.3$ and $n = 5$ . Explain in full detail how you would use simulation to approximate the variance of $\hat{θ}$ .
Coding required. Conduct the simulation from the previous part and approximate the variance of $\hat{θ}$ when $μ = 2.3$ and $n = 5$ . Then write a clearly worded sentence interpreting this value.
Which estimator has smaller variance when $μ = 2.3$ (and $n = 5$ )? Answer, but then explain why this information alone is not really useful.
Explain in full detail how you would use simulation to approximate the variance function of $\hat{θ}$ (if $n = 5$ ).
Coding required. Conduct the simulation from the previous part and plot the approximate variance function. Compare to the variance function of $\hat{p}$ . Based on variability alone, which estimator is preferred?

Problem 3

Continuing Problems 1 and 2

Compute $MSE (\hat{p})$ when $μ = 2.3$ and $n = 5$ . (You can do the next part first if you want, but it helps to work with specific numbers first.)
Derive the MSE function of $\hat{p}$ . (Hint: use facts from previous parts.)
Suppose $μ = 2.3$ (and $n = 5$ ). Explain in full detail how you would use simulation to approximate the MSE of $\hat{θ}$ .
Coding required. Conduct the simulation from the previous part and approximate the MSE of $\hat{θ}$ when $μ = 2.3$ (and $n = 5$ ).
Which estimator has smaller MSE when $μ = 2.3$ (and $n = 5$ )? Answer, but then explain why this information alone is not really useful.
Explain in full detail how you would use simulation to approximate the MSE function of $\hat{θ}$ (if $n = 5$ ).
Coding required. Conduct the simulation from the previous part and plot the approximate MSE function. Compare to the MSE function of $\hat{p}$ .
Compare the MSEs of the two estimators for $n = 5$ and a few other values of $n$ . Is there a clear preference between these two estimators? Discuss.