7.15 Relative Fit Measures in the Bayesian Approach to CDM Estimation

7.15.1 Deviance Information Criterion (DIC)

Spiegelhalter et al. (2002) introduced the Deviance Information Criterion (DIC) for comparing the performance of multiple competing models. The DIC is defined as

\[DIC = \bar{D}(\theta) + p_D,\] where \(\bar{D}(\theta)\) is the posterior mean deviance, a measure of how well the model predicts the observed data, and \(p_D\) is the effective number of parameters, which penalizes model complexity.
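
The effective number of parameters is obtained by contrasting the posterior mean deviance with the deviance evaluated at the posterior mean of the parameters,

\[p_D = \bar{D}(\theta) - D(\bar{\theta}).\]

When comparing competing models, the model with the smaller DIC is preferred.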

We can calculate the DIC easily when estimating the model with JAGS (the jags() function below comes from the R2jags package): simply set DIC = TRUE in the call.

Code
library(R2jags)  # provides the jags() wrapper used below

# Fit the DINA model in JAGS and request the DIC (DIC = TRUE);
# the small iteration numbers are for illustration only
jags.dina.mcmc <- jags(data = jags.data, inits = jags.inits,
                       parameters.to.save = jags.parameters,
                       model.file = jags.dina, n.chains = 2,
                       n.iter = 50, n.burnin = 25, n.thin = 1, DIC = TRUE)

jags.dina.mcmc_output <- jags.dina.mcmc$BUGSoutput  # BUGS-style output object
jags.dina.mcmc_output$DIC # DIC measure
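
Since DIC is a relative fit index, it is most useful for comparing competing models fitted to the same data. Below is a minimal sketch, assuming a competing model (here a hypothetical DINO specification, jags.dino.mcmc) has been fitted with the same jags() setup; the model with the smaller DIC is preferred.

Code
# jags.dino.mcmc is a hypothetical competing model fitted to the same data
dic.dina <- jags.dina.mcmc$BUGSoutput$DIC
dic.dino <- jags.dino.mcmc$BUGSoutput$DIC
c(DINA = dic.dina, DINO = dic.dino)  # smaller DIC indicates better relative fit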

7.15.2 Watanabe-Akaike Information Criterion (WAIC)

WAIC is also known as the Widely Applicable Information Criterion (Watanabe & Opper, 2010). It is a fully Bayesian approach for estimating out-of-sample prediction performance. In general,

\[WAIC = -2(\text{lppd} - p_{WAIC}).\]

The WAIC is calculated from the log pointwise predictive density (lppd) and \(p_{WAIC}\). The lppd is obtained by averaging the likelihood of each observation over the posterior distribution of the parameters and then summing the logs of these averages across observations, and \(p_{WAIC}\) is the effective number of parameters, estimated from the posterior variability of the pointwise log-likelihoods.
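
In terms of \(S\) posterior draws \(\theta^{(s)}\), these quantities can be written as

\[\text{lppd} = \sum_{i=1}^{n}\log\left(\frac{1}{S}\sum_{s=1}^{S} p(y_i \mid \theta^{(s)})\right), \qquad p_{WAIC} = \sum_{i=1}^{n}\operatorname{Var}_{s}\left[\log p(y_i \mid \theta^{(s)})\right],\]

where \(y_i\) is the response vector for observation \(i\).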

In practice, we usually report either DIC or WAIC as a relative fit measure.

NIMBLE can calculate WAIC directly during MCMC estimation by setting WAIC = TRUE in the nimbleMCMC() call.

Code
library(nimble, warn.conflicts = FALSE)
library(GDINA)    # provides the sim10GDINA data set and attributepattern()
library(MCMCvis)  # posterior summary utilities

# DINA model written in NIMBLE's BUGS-like modeling language
DINA.mcmc <- nimbleCode({
  for (n in 1:N) {
    for (j in 1:J) {
      # The comparison equals 1 when respondent n has mastered every attribute
      # required by item j, and 0 otherwise, so the success probability is
      # 1 - s[j] for masters and g[j] for non-masters
      prob[n, j] <- g[j] + (1 - s[j] - g[j]) * 1 *
        (sum(alpha[n, 1:K] * Q[j, 1:K]) >= sum(Q[j, 1:K]))
      score[n, j] ~ dbern(prob[n, j])
    }
    # Latent class membership and the corresponding attribute profile
    latent.group.index[n] ~ dcat(pi[1:C])
    alpha[n, 1:K] <- all.patterns[latent.group.index[n], 1:K]
  }
  pi[1:C] ~ ddirch(delta[1:C])  # class membership probabilities

  for (j in 1:J) {
    # Slipping and guessing parameters, with g[j] < 1 - s[j] (monotonicity)
    s[j] ~ dbeta(1, 1)
    g[j] ~ T(dbeta(1, 1), , 1 - s[j])
  }
})

# Simulated 10-item DINA data from the GDINA package
score <- sim10GDINA$simdat
Q <- sim10GDINA$simQ
J <- ncol(score)  # number of items
K <- ncol(Q)      # number of attributes
N <- nrow(score)  # number of respondents
all.patterns <- GDINA::attributepattern(K)  # all 2^K attribute patterns
C <- nrow(all.patterns)                     # number of latent classes

DINAconstants <- list(Q = Q, N = N, J = J, K = K, C = C, delta = rep(1, C))

DINAdata <- list(score = score, all.patterns = as.matrix(all.patterns))

# One set of initial values per chain
initials <- list(list(s = rep(0.2, J), g = rep(0.2, J), pi = rep(1/C, C),
                      latent.group.index = sample(1:C, N, replace = TRUE)),
                 list(s = rep(0.1, J), g = rep(0.1, J), pi = rep(1/C, C),
                      latent.group.index = sample(1:C, N, replace = TRUE)))

# Run the MCMC and request the WAIC (WAIC = TRUE)
nimbleMCMC_samples <- nimbleMCMC(code = DINA.mcmc, constants = DINAconstants,
                                 data = DINAdata, inits = initials,
                                 monitors = c("s", "g"),
                                 nburnin = 2500, niter = 5000, nchains = 2,
                                 setSeed = c(123, 456), WAIC = TRUE)

nimbleMCMC_samples$WAIC # WAIC measure
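
As with DIC, WAIC is only meaningful relative to competing models fitted to the same data. Below is a minimal sketch, assuming a second NIMBLE model (e.g., a hypothetical DINO specification) has been run through an analogous nimbleMCMC() call with WAIC = TRUE and stored as nimbleMCMC_samples_dino.

Code
# nimbleMCMC_samples_dino is a hypothetical competing model run
waic.dina <- nimbleMCMC_samples$WAIC
waic.dino <- nimbleMCMC_samples_dino$WAIC
waic.dina
waic.dino  # the model with the smaller WAIC is preferred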

References

Spiegelhalter, D. J., Best, N. G., Carlin, B. P., & Van Der Linde, A. (2002). Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 64(4), 583–639.
Watanabe, S., & Opper, M. (2010). Asymptotic equivalence of Bayes cross validation and widely applicable information criterion in singular learning theory. Journal of Machine Learning Research, 11(12).