Part 1 What is the document about?

This document documents (no pun intended) my code and my workflow for the cancer dataset that Sean found a couple of days ago.

I intend to - using the dataset - build several supervised machine learning classifiers to predict the status of cancer patients. I will then evaluate the performance of these models using the content taught in week 9; Furthermore, if possible, I think I will also try to perform GO term analysis on some of the genes that are the most differentially expressed in patients with and without cancer.

As always, feel free to ask questions in the group chat (or you can send me a private message if that’s your cup of tea) if you have any questions regarding this document.

1.1 DISCLAIMER

This document is still unfinished - hence, certain hyperlinks may be broken!

I will rectify these broken hyperlinks over time, but for now, just bear this in mind.