6.15 Classification: Assessing Model Accuracy

  • Training error rate: the proportion of mistakes that are made if we apply estimate to the training observations
    • 1nni=1I(yiˆyi)
      • ˆyi: predicted class label for observation i
      • I(yiˆyi): indicator variable that equals 1 if yiˆyi and zero yi=ˆyi
      • If I(yiˆyi)=0 then the ith observation was classified correctly (otherwise missclassified)
      • Computes the fraction of incorrect classifications
  • Test error rate: Associated with a set of test observations of the form (x0,y0)
    • Ave(I(y0=ˆy0)) + ˆy0: predicted class label that results from applying the classifier to the test observation with predictor x0
    • Good classifier: One for which the test error is smallest
  • Source: James et al. (2013 Chap. 2.2.3)

References

James, Gareth, Daniela Witten, Trevor Hastie, and Robert Tibshirani. 2013. An Introduction to Statistical Learning: With Applications in R. Springer Texts in Statistics. Springer.