A Minimal Book Example
1
Hello and Welcome!
2
Life Advice
2.1
Some Potentially Cool Readingsā¦
3
Introduction to Data Mining
3.1
Lesson 1: What is Data Mining?
3.1.1
How can big data help?
3.1.2
Activity
3.1.3
What are the techniques in data mining?
3.2
Lesson 2 & 3: Ethics of Data Mining
3.2.1
Activity
3.2.2
Bias in Data Mining (more to be added)
3.2.3
Activity
3.2.4
Bias in Data Mining, cont.
3.2.5
Activity
4
Exploratory Data Analysis, part 1
4.1
Lesson 1: Looking at data
4.2
Lesson 2: Basic EDA
4.2.1
Brief ethics consideration based on point (e)
4.2.2
Quick reminders of definitions for EDA
4.2.3
Variation
4.2.4
Data in Context
4.2.5
Data in Context, part 2
4.2.6
Ethics interjection
4.2.7
Activity
4.2.8
Covariation
4.3
Lesson 3: Missing Data
4.3.1
NA v NULL in R
4.3.2
So, what do we do?
4.3.3
Activity
4.4
Lesson 4: Imputations
4.4.1
Definitions
4.4.2
Imputation Examples
4.4.3
Activity
4.5
Lesson 5: Basic Statistics in R
4.5.1
Descriptive Statistics
4.5.2
Correlations
4.5.3
Visual Correlations using corrplot
4.5.4
t-tests
5
Data Visualization
5.1
Lesson 1: An introduction and very brief history
5.2
What makes a good graphic?
5.2.1
The ggplot way
5.2.2
Facets
5.3
Lesson 3: More Advanced Data Visualization
5.3.1
Bar Plots
6
Cluster Analysis, part 1
6.1
Lesson 1: Introduction to Cluster Analysis
6.1.1
Activity
6.2
Lesson 2: Partitional Clustering
6.2.1
K-Means
6.2.2
K-Medoids Clustering (PAM)
6.2.3
Clustering Large Applications (CLARA)
6.2.4
Activity
6.3
Lesson 3: Hierarchical Clustering
6.3.1
Agglomerative Hierarchical Clustering
6.3.2
Divisive Hierarchical Clustering
7
Cluster Analysis, part 2
7.1
Lesson 1: Density-based Clustering
7.1.1
Activity
7.2
Lesson 2: Soft Clustering
7.2.1
Activity
7.3
Lesson 3: Evaluation Techniques
7.3.1
Internal Clustering Validation Methods
7.3.2
External Clustering Validation Method
7.3.3
Activity
8
Text Mining, part 1
8.1
Lesson 1: Introduction to Text Mining
8.2
Lesson 2: Basic Sentiment Analysis
8.2.1
Activity
8.3
Lesson 3: Term-Frequency
8.3.1
How are these calculated?
8.3.2
Analyzing TF-IDF
9
Text Mining, part 2
9.1
Lesson 1: n-grams
9.1.1
Tokenizing ngrams
9.1.2
How can we analyze bigrams?
9.1.3
Activity
9.1.4
Bigrams and Sentiment
9.1.5
Network Visualization of ngrams
9.1.6
Activity
9.2
Lesson 2: Topic Modelling
9.2.1
Latent Dirichlet Allocation (LDA)
9.2.2
Data Prep and Cleaning
9.2.3
Activity
10
Supervised Machine Learning Hold Space
11
Bonus Content Reserve Space
11.1
Principle Components Analysis (PCA)
References
Published with bookdown
Data Analytics Living Texbook
Chapter 10
Supervised Machine Learning Hold Space