My Data Science Notes
Intro
1
Probability
1.1
Principles
2
Discrete Distributions
2.1
Bernoulli
2.2
Binomial
2.3
Poission
2.4
Multinomial
2.5
Negative-Binomial
2.6
Geometric
2.7
Hypergeometric
2.8
Gamma
3
Continuous Distributions
3.1
Normal
3.1.1
Example
3.1.2
Example
3.1.3
Example
3.1.4
Normal Approximation to Binomial
3.1.5
Example
3.1.6
Example
3.1.7
From Sample to Population
3.2
Join Distributions
3.3
Likelihood
4
Categorical Analysis - Nonmodel
4.1
Chi-Square Test
4.2
One-Way Tables
4.2.1
Chi-Square Goodness-of-Fit Test
4.2.2
Proportion Test
4.3
Two-Way Tables
4.3.1
Chi-Square Independence Test
4.3.2
Residuals Analysis
4.3.3
Difference in Proportions
4.3.4
Relative Risk
4.3.5
Odds Ratio
4.3.6
Partitioning Chi-Square
4.3.7
Correlation
4.4
K-Way Tables
4.4.1
Odds Ratio
4.4.2
Chi-Square Independence Test
5
Continuous Variable Analysis
5.0.1
Correlation
Supervised Machine Learning
6
Ordinary Least Squares
6.1
Linear Regression Model
6.2
Parameter Estimation
6.3
Model Assumptions
6.3.1
Linearity
6.3.2
Multicollinearity
6.3.3
Normality
6.3.4
Equal Variances
6.4
Prediction
6.5
Inference
6.5.1
t
-Test
6.5.2
F
-Test
6.6
Interpretation
6.7
Model Validation
6.7.1
Accuracy Metrics
6.7.2
Cross-Validation
6.7.3
Gain Curve
6.8
Reference
7
Generalized Linear Models
7.1
Logistic Regression
7.2
Multinomial Logistic Regression
7.3
Ordinal Logistic Regression
7.3.1
Assumptions
7.3.2
Modeling
7.3.3
Case Study
7.4
Poisson Regression
8
Linear Discriminant Analysis
9
Classification
10
Regularization
11
Decision Trees
11.1
Classification Tree
11.1.1
Confusion Matrix
11.1.2
ROC Curve
11.1.3
Caret Approach
11.2
Regression Trees
11.2.1
Caret Approach
11.3
Bagging
11.4
Random Forests
11.5
Gradient Boosting
11.6
Summary
11.7
Reference
12
Non-linear Models
12.1
Splines
12.2
MARS
12.3
GAM
13
Support Vector Machines
13.1
Maximal Margin Classifier
13.2
Support Vector Classifier
13.3
Support Vector Machines
13.4
Example
13.5
Using Caret
14
Principal Components Analysis
15
Clustering
16
Text Mining
17
Survival Analysis
Appendix
Publishing to BookDown
Shiny Apps
Packages
Create a package
17.0.1
Document Functions with roxygen
Create Data
Create Vignette
References
Published with bookdown
My Data Science Notes
Chapter 15
Clustering