Chapter 3 Analyzing word and document frequency

A central question in text mining and natural language processing is how to quantify what a document is about. This chapter presents two approaches of measuring the “keywords” of a particular document amid other, tf-idf and weighted log odds ratio.