3.3 Measurement: Contingency tables | Applied Causal Analysis (with R)

3.3 Measurement: Contingency tables

Discussion
1. Q: What do the tables show (start with Table 3.3 then 3.4)?
2. Q: What does a cell in those tables show?
3. Q: What would the univariate distribution for income look like behind Table 3.4?
4. Q: What is a missing value/NA? Do they appear in Table 3.3 and 3.4?
5. Q: How would the data underlying Table 3.3 and 3.4 look like in a dataframe?

Table 3.3: Contingency table of discipline (this class)
Economics	Sociology
4	7

Table 3.4: Contingency table of Income/Gender
	female	male
high income	5	0
low income	2	3

Notes

Lessons to be learned
- We can conceptualize data as units that have been assigned to cells through observation. Each cell represents a particular value on the variable (or a combination of values on several variables).
- “A contingency table (also known as a cross tabulation or crosstab) is a type of table in a matrix format that displays the (multivariate) frequency distribution of the variables.” (Wikipedia)
  - Displays how observations are distributed across the values of our variable(s).
- Contingency tables are “underused”.
- Ideally, display contingency tables with the missings (in R).