16.5 Statistical Properties of SI Model Estimates
To determine the statistical properties of the plug-in principle/least squares/ML estimators ˆαi, ˆβi and ˆσ2ϵ,i in the SI model, we treat them as functions of the random variables {(Ri,t,RMt)}Tt=1 where Rt and RMt are assumed to be generated by the SI model (16.1) - (16.5).
16.5.1 Bias
In the SI model, the estimators ˆαi, ˆβi and ˆσ2ϵ,i (with degrees-of-freedom adjustment) are unbiased: E[ˆαi]=αi,E[ˆβi]=βi,E[ˆσ2ϵ,i]=σ2ϵ,i. To shows that ˆαi and ˆβi are unbiased, it is useful to consider the SI model for asset i in matrix form for t=1,…,T: Ri=αi1+βiRM+ϵi=Xγi+ϵi, where Ri=(Ri1,…,RiT)′, RM=(RM1,…,RMT)′, ϵi=(ϵi1,…,ϵiT)′, 1=(1,…,1)′, X=(1RM), and γi=(αi,βi)′. The estimator for γi is ˆγi=(X′X)−1X′Ri. Pluging in Ri=Xγi+ϵi gives ˆγi=(X′X)−1X′(Xγi+ϵi)=(X′X)−1X′Xγi+(X′X)−1X′ϵi=γi+(X′X)−1X′ϵi. Then E[ˆγi]=γi+E[(X′X)−1X′ϵi]=γi+E[(X′X)−1X′]E[ϵi](becauseϵitisindependentofRMt)=γi(becauseE[ϵi]=0). The derivation of E[ˆσ2ϵ,i]=σ2ϵ,i is beyond the scope of this book and can be found in graduate econometrics textbooks such as Hayashi (1980).
16.5.2 Precision
Under the assumptions of the SI model, analytic formulas for estimates of the standard errors for ˆαi and ˆβi are given by: ^se(ˆαi)≈ˆσε,i√T⋅ˆσ2M⋅√1TT∑t=1r2Mt,^se(ˆβi)≈ˆσε,i√T⋅ˆσ2M, where “≈” denotes an approximation based on the CLT that gets more accurate the larger the sample size. Remarks:
- ^se(ˆαi) and ^se(ˆβi) are smaller the smaller is ˆσε,i. That is, the closer are returns to the fitted regression line the smaller are the estimation errors in ˆαi and ˆβi.
- ^se(ˆβi) is smaller the larger is ˆσ2M. That is, the greater the variability in the market return RMt the smaller is the estimation error in the estimated slope coefficient ˆβi. This is illustrated in Figure xxx. The left panel shows a data sample from the SI model with a small value of ˆσ2M and the right panel shows a sample with a large value of ˆσ2M. The right panel shows that the high variability in RMt makes it easier to identify the slope of the line.
- Both ^se(ˆαi) and ^se(ˆβi) go to zero as the sample size, T, gets large. Since ˆαi and ˆβi are unbiased estimators, this implies that they are also consistent estimators. That is, they converge to the true values αi and βi, respectively, as T→∞.
- In R, the standard error values (16.36) and (16.37) are computed using the
summary()
function on an “lm
” object.
There are no easy formulas for the estimated standard errors for ˆσ2ϵ,i, ˆσε,i and ˆR2. Estimated standard errors for these estimators, however, can be easily computed using the bootstrap.
to be completed
◼
16.5.3 Sampling distribution and confidence intervals.
Using arguments based on the CLT, it can be shown that for large enough T the estimators ˆαi and ˆβi are approximately normally distributed: ˆαi∼N(αi,^se(ˆαi)2),ˆβi∼N(βi,^se(ˆβi)2), where ^se(ˆαi) and ^se(ˆβi) are given by (16.36) and (16.37), respectively.