7.8 Full information statistics- Test level (Cont’d)
Based on the full information statistics, classical goodness of fit statistics such as Pearson’s χ2 and likelihood ratio G2 are widely used and fundamental fit indices in CDM analysis(Han & Johnson, 2019).
Two full-information statistics can be defined as χ2=N∑y(py−ˆπy)2ˆπy and G2=2N∑ypypyˆπy,
where,
py is the observed proportion for response vector y.
^πy is the model- based expected proportion for response vector y.
Both statistics are χ2 distributed. The associated degrees of freedom is equal to the number of response patterns - the number of parameters - 1 (i.e., 2k−p−1).
- H0: The model fits data well
- H1: The model does not fit data well
Once you perform the χ2 and G2 tests, you will get a P−value. If the P−value>0.05, we may suggest that the model fits the data well.
7.8.1 Limitations:
Both χ2 and G2 suffer from the issues of Sparsity, if:
- The number of items is large, i.e., too many latent class.
- The sample size is low.
- With large item numbers, Pearson’s χ2 and G2 may not follow χ2 distribution(Han & Johnson, 2019)
We can still use full information statistics in case we have large number of items, and smaller sample size by employing resampling and bootstrapping procedures. However, these process are often not easy to implement and the computation may be prohibitively expensive.
To address the issue, Maydeu-Olivares & Joe (2005) introduced an alternative approach employing limited information statistics.