3.4 Two population proportions
Comparing two proportions, like comparing two means, is common
A hypothesis test can help determine if a difference in the estimated proportions reflects a difference in the two population proportions
The null hypothesis states that there is no difference between two population proportions p1 and p2
H0: p1=p2
- The alternative hypothesis can take one of three forms:
H1: p1≠p2two-tailed testH1: p1<p2left-tailed testH1: p1>p2right-tailed test
A distribution of the differences in samples proportions is found to be a normal distribution in a large samples (according to the central limit theorem CLT)
Therefore a test statistic follows a normal distribution withe zero mean and unit variance
Z=ˆp1−ˆp2√S2p(1n1+1n2)∼N(0, 1)S2p=ˉp(1−ˉp)
- Here is what each term means:
ˆp1=x1n1 is the proportion of the first sample (chosen from population 1)ˆp2=x2n2 is the proportion of the second sample (chosen from population 2)x1 is the the number of successes in the first samplex2 is the the number of successes in the second samplen1 is the size of the first samplen2 is the size of the second sampleS2p is the pooled (common) variance from both samplesˉp=x1+x2n1+n2 is the pooled (common) proportion from both samples
Example 3.7 A bank has recently acquired a new branch in location B. Bank wishes to test the hypothesis that default rate of new customres in the location B is different from default rate of current customers. They sample 200 current customers, and find that 20 have defaulted. In location B, another sample of 100 new customres shows that 6 have defaulted on their loans. Significance level is 5%. Compute the p-value in Excel using function =2*(1-NORM.S.DIST(ABS(z);TRUE))
.
Example 3.8 Use the same information from previous example 3.7 to test the hypothesis that default rate of new customres in the location B is lower than default rate of current customers. Compute the p-value in Excel using function =1-NORM.S.DIST(ABS(z);TRUE)
.