Chapter10 Measures of Missing data information
These measures are the Fraction of Missing information (FMI), the relative increase in variance due to nonresponse (RIV) and the Relative Efficiency (RE). They are derived from values of the between, and within imputation variance and the total variance. There exist two versions of the FMI, which are referred to as lambda and FMI.
10.1 Fraction of Missing Information - Lambda
The proportion of total variance due to missingness, lambda, (Van Buuren (2018); Raghunathan (2016)) can be derived from the between and total missing data variance as:
Lambda=VB+VBmVT
Where m is the number of imputed datasets and VB and VT are the between and total variance respectively. This value can be interpreted as the proportion of variation in the parameter of interest due to the missing data.
When we use the VB and VT values that were calculated in paragraph 5.1.2, lambda will be:
Lambda=0.040027+0.04002730.849084=0.06283485
This specific value for lambda is not reported by SPSS, but is reported by the mice package in R. Van Buuren (2018) and Enders (2010) use the same formula to calculate this type of missing data information, but van Buuren calls it lambda and Enders FMI.
10.2 Relative increase in variance
Another related measure is the relative increase in variance due to nonresponse. This value is calculated as:
RIV=VB+VBmVW
Where VB and VW are the between and within variance respectively. This value can be interpreted as the proportional increase in the sampling variance of the parameter of interest that is due to the missing data.
Filling in this formula with the values for VB and VW from paragraph 5.1.2 results in:
RIV=0.040027+0.04002730.7957147=0.06704779
This value is also presented in (Figure 9.1) in the column Relative Increase Variance. The relation between RIV and Lambda is defined as
RIV=Lambda1−Lambda.
10.3 Fraction of Missing Information - FMI
FMI=RIV+2df+31+RIV
Where RIV is the relative increase in variance due to missing data and df is the degrees of freedom for the pooled result. The degrees of freedom for the pooled result can be obtained in two ways: dfOld or dfAdjusted.
In SPSS, FMI is calculated using dfOld, which results in:
FMI=RIV+2df+31+RIV=0.06704779+2506.5576+31+0.06704779=0.0665132
In R package mice, FMI is calculated using the formula for dfAdjusted, that results in:
FMI=RIV+2dfAdjusted+31+RIV=0.06704779+2107.7509+31+0.06704779=0.0797587
The difference between lambda and FMI is that FMI is adjusted for the fact that the number of imputed datasets that are generated is not unlimitedly large. These measures differ for a small value of the df.
10.4 Relative Efficiency
The Relative Efficiency (RE) is defined as:
RE=11+FMIm
FMI is the fraction of missing information and m is the number of imputed datasets.
The RE value is only provided by SPSS and is calculated by filling in the values of (Figure 9.1) as follows:
RE=11+0.06651323=0.9783098
The RE gives information about the precision of the parameter estimate as the standard error of a regression coefficient.