This document documents (no pun intended) my code and my workflow for the cancer dataset that Sean found a couple of days ago.

Using this dataset, I intend to build several supervised machine learning classifiers to predict the status of cancer patients. I will then evaluate the performance of these models using the content taught in week 9. Furthermore, if possible, I will also try to perform GO term analysis on some of the genes that are most differentially expressed between patients with and without cancer.
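
To make the evaluation step a bit more concrete, here is a minimal sketch of how the predictions of a binary classifier could be scored. The choice of metrics (accuracy, precision, recall, F1) is my assumption, since the week 9 material may cover others, and the function name `binary_metrics` and the toy labels are hypothetical and do not come from the dataset.

```python
def binary_metrics(y_true, y_pred, positive=1):
    """Return accuracy, precision, recall and F1 for binary labels.

    `positive` is the label treated as "has cancer" (an assumed encoding).
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    pairs = list(zip(y_true, y_pred))
    # Count the cells of the confusion matrix.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = len(pairs) - tp - fp - fn

    # Guard every ratio against a zero denominator.
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)

    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Toy labels for illustration only (1 = cancer, 0 = no cancer).
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
metrics = binary_metrics(y_true, y_pred)
```

The same function can be called once per classifier on a held-out test set, which gives a simple table for comparing the models side by side.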

