Statistics for Managers Using Microsoft Excel 5 th

  • Slides: 65
Download presentation
Statistics for Managers Using Microsoft® Excel 5 th Edition Chapter 12 Chi Square Tests

Statistics for Managers Using Microsoft® Excel 5 th Edition Chapter 12 Chi Square Tests and Nonparametric Tests Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. Chap 12 -1

Learning Objectives In this chapter, you learn: § How and when to use the

Learning Objectives In this chapter, you learn: § How and when to use the chi-square test for contingency tables § How to use the Marascuilo procedure for determining pair-wise differences when evaluating more than two proportions § How and when to use the Mc. Nemar test § How and when to use nonparametric tests Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 2

Contingency Tables § Useful in situations involving multiple population proportions § Used to classify

Contingency Tables § Useful in situations involving multiple population proportions § Used to classify sample observations according to two or more characteristics § Also called a cross-classification table. Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 3

Contingency Table Example Hand Preference vs. Gender Dominant Hand: Left vs. Right Gender: Male

Contingency Table Example Hand Preference vs. Gender Dominant Hand: Left vs. Right Gender: Male vs. Female § 2 categories for each variable, so the table is called a 2 x 2 table § Suppose we examine a sample of 300 college students Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 4

Contingency Table Example Sample results organized in a contingency table: Gender sample size =

Contingency Table Example Sample results organized in a contingency table: Gender sample size = n = 300: 120 Females, 12 were left handed 180 Males, 24 were left handed Hand Female Preference Male Left 12 24 36 Right 108 156 264 120 180 300 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 5

Contingency Table Example H 0: π1 = π2 (Proportion of females who are left

Contingency Table Example H 0: π1 = π2 (Proportion of females who are left handed is equal to the proportion of males who are left handed) H 1: π1 ≠ π2 (The two proportions are not the same – Hand preference is not independent of gender) § If H 0 is true, then the proportion of left-handed females should be the same as the proportion of left-handed males. § The two proportions above should be the same as the proportion of left-handed people overall. Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 6

The Chi-Square Test Statistic The Chi-square test statistic is: where: fo = observed frequency

The Chi-Square Test Statistic The Chi-square test statistic is: where: fo = observed frequency in a particular cell fe = expected frequency in a particular cell if H 0 is true 2 for the 2 x 2 case has 1 degree of freedom Assumed: each cell in the contingency table has expected frequency of at least 5 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 7

The Chi-Square Test Statistic The 2 test statistic approximately follows a chi-square distribution with

The Chi-Square Test Statistic The 2 test statistic approximately follows a chi-square distribution with one degree of freedom Decision Rule: If 2 > 2 U, reject H 0, otherwise, do not reject H 0 0 Do not reject H 0 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. Reject H 0 2 2 U 8

Computing the Average Proportion The average proportion is: 120 Females, 12 were left handed

Computing the Average Proportion The average proportion is: 120 Females, 12 were left handed § Here: 180 Males, 24 were left handed The proportion of left handers overall is 0. 12, that is, 12% Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 9

Finding Expected Frequencies § To obtain the expected frequency for left handed females, multiply

Finding Expected Frequencies § To obtain the expected frequency for left handed females, multiply the average proportion left handed (p) by the total number of females § To obtain the expected frequency for left handed males, multiply the average proportion left handed (p) by the total number of males If the two proportions are equal, then P(Left Handed | Female) = P(Left Handed | Male) =. 12 i. e. , we would expect (. 12)(120) = 14. 4 females to be left handed (. 12)(180) = 21. 6 males to be left handed Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 10

Observed vs. Expected Frequencies Gender Hand Preference Female Male Left Observed = 12 Expected

Observed vs. Expected Frequencies Gender Hand Preference Female Male Left Observed = 12 Expected = 14. 4 Observed = 24 Expected = 21. 6 36 Right Observed = 108 Observed = 156 Expected = 105. 6 Expected = 158. 4 264 120 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 180 300 11

The Chi-Square Test Statistic Gender Hand Preference Female Male Left Observed = 12 Expected

The Chi-Square Test Statistic Gender Hand Preference Female Male Left Observed = 12 Expected = 14. 4 Observed = 24 Expected = 21. 6 36 Right Observed = 108 Expected = 105. 6 Observed = 156 Expected = 158. 4 264 120 180 300 The test statistic is: Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 12

The Chi-Square Test Statistic Decision Rule: If 2 > 3. 841, reject H 0,

The Chi-Square Test Statistic Decision Rule: If 2 > 3. 841, reject H 0, otherwise, do not reject H 0 =. 05 0 Do not reject H 0 Reject H 0 2 2 U=3. 841 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. Here, 2 = 0. . 7576 < 2 U = 3. 841, so you do not reject H 0 and conclude that there is insufficient evidence that the two proportions are different. 13

 2 Test for The Differences Among More Than Two Proportions § Extend the

2 Test for The Differences Among More Than Two Proportions § Extend the 2 test to the case with more than two independent populations: H 0: π 1 = π 2 = … = π c H 1: Not all of the πj are equal (j = 1, 2, …, c) Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 14

The Chi-Square Test Statistic The Chi-square test statistic is: where: § § § fo

The Chi-Square Test Statistic The Chi-square test statistic is: where: § § § fo = observed frequency in a particular cell of the 2 x c table fe = expected frequency in a particular cell if H 0 is true 2 for the 2 x c case has (2 -1)(c-1) = c - 1 degrees of freedom Assumed: each cell in the contingency table has expected frequency of at least 1 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 15

Computing the Overall Proportion The overall proportion is: § Expected cell frequencies for the

Computing the Overall Proportion The overall proportion is: § Expected cell frequencies for the c categories are calculated as in the 2 x 2 case, and the decision rule is the same: Decision Rule: If 2 > 2 U, reject H 0, otherwise, do not reject H 0 Where 2 U is from the chi-square distribution with c – 1 degrees of freedom Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 16

 2 Test with More Than Two Proportions: Example § The sharing of patient

2 Test with More Than Two Proportions: Example § The sharing of patient records is a controversial issue in health care. A survey of 500 respondents asked whether they objected to their records being shared by insurance companies, by pharmacies, and by medical researchers. The results are summarized on the following table: Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 17

 2 Test with More Than Two Proportions: Example Organization Object to Record Sharing

2 Test with More Than Two Proportions: Example Organization Object to Record Sharing Yes No Insurance Companies Pharmacies Medical Researchers 410 295 335 90 205 165 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 18

 2 Test with More Than Two Proportions: Example The overall proportion is: Organization

2 Test with More Than Two Proportions: Example The overall proportion is: Organization Object to Insurance Record Sharing Companies Pharmacies Medical Researchers Yes fo = 410 fe = 346. 667 fo = 295 fe = 346. 667 fo = 335 fe = 346. 667 No fo = 90 fe = 153. 333 fo = 205 fe = 153. 333 fo = 165 fe = 153. 333 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 19

 2 Test with More Than Two Proportions: Example Organization Object to Record Sharing

2 Test with More Than Two Proportions: Example Organization Object to Record Sharing Insurance Companies Pharmacies Medical Researchers Yes No The Chi-square test statistic is: Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 20

 2 Test with More Than Two Proportions: Example H 0: π 1 =

2 Test with More Than Two Proportions: Example H 0: π 1 = π 2 = π 3 H 1: Not all of the πj are equal (j = 1, 2, 3) Decision Rule: If 2 > 2 U, reject H 0, otherwise, do not reject H 0 2 U = 5. 991 is from the chisquare distribution with 2 degrees of freedom. Conclusion: Since 64. 1196 > 5. 991, you reject H 0 and you conclude that at least one proportion of respondents who object to their records being shared is different across the three organizations Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 21

The Marascuilo Procedure The Marascuilo procedure enables you to make comparisons between all pairs

The Marascuilo Procedure The Marascuilo procedure enables you to make comparisons between all pairs of groups. First, compute the observed differences pj - pj’ among all c(c-1)/2 pairs. Second, compute the corresponding critical range for the Marascuilo procedure. Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 22

The Marascuilo Procedure § Critical Range for the Marascuilo Procedure: Statistics for Managers Using

The Marascuilo Procedure § Critical Range for the Marascuilo Procedure: Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 23

The Marascuilo Procedure § Compute a different critical range for each pair-wise comparison of

The Marascuilo Procedure § Compute a different critical range for each pair-wise comparison of sample proportions. § Compare each of the c(c - 1)/2 pairs of sample proportions against its corresponding critical range. § Declare a specific pair significantly different if the absolute difference in the sample proportions |pj – pj’| is greater than its critical range. Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 24

The Marascuilo Procedure Example Organization Object to Record Sharing Insurance Companies Pharmacies Medical Researchers

The Marascuilo Procedure Example Organization Object to Record Sharing Insurance Companies Pharmacies Medical Researchers Yes 410 P 1 = 0. 82 295 P 2 = 0. 59 335 P 3 = 0. 67 No 90 205 165 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 25

The Marascuilo Procedure Example MARASCUILO TABLE Proportions Absolute Differences Critical Range | Group 1

The Marascuilo Procedure Example MARASCUILO TABLE Proportions Absolute Differences Critical Range | Group 1 - Group 2 | 0. 23 0. 06831808 | Group 1 - Group 3 | 0. 15 0. 0664689 | Group 2 - Group 3 | 0. 08 0. 074485617 Conclusion: Since each absolute difference is greater than the critical range, you conclude that each proportion is significantly different that the other two. Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 26

 2 Test of Independence § Similar to the 2 test for equality of

2 Test of Independence § Similar to the 2 test for equality of more than two proportions, but extends the concept to contingency tables with r rows and c columns H 0: The two categorical variables are independent (i. e. , there is no relationship between them) H 1: The two categorical variables are dependent (i. e. , there is a relationship between them) Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 27

 2 Test of Independence The Chi-square test statistic is: § where: fo =

2 Test of Independence The Chi-square test statistic is: § where: fo = observed frequency in a particular cell of the r x c table fe = expected frequency in a particular cell if H 0 is true 2 for the r x c case has (r-1)(c-1) degrees of freedom Assumed: each cell in the contingency table has expected frequency of at least 1) Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 28

Expected Cell Frequencies § Expected cell frequencies: Where: row total = sum of all

Expected Cell Frequencies § Expected cell frequencies: Where: row total = sum of all frequencies in the row column total = sum of all frequencies in the column n = overall sample size Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 29

Decision Rule § The decision rule is If 2 > 2 U, reject H

Decision Rule § The decision rule is If 2 > 2 U, reject H 0, otherwise, do not reject H 0 Where 2 U is from the chi-square distribution with (r – 1)(c – 1) degrees of freedom Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 30

Example: Test of Independence § The meal plan selected by 200 students is shown

Example: Test of Independence § The meal plan selected by 200 students is shown below: Class Standing Fresh. Number of meals per week 20/week 10/week none Total 24 32 14 70 Soph. 22 26 12 60 Junior 10 14 6 30 Senior 14 16 10 40 70 88 42 200 Total Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 31

Example: Test of Independence § The hypothesis to be tested is: H 0: Meal

Example: Test of Independence § The hypothesis to be tested is: H 0: Meal plan and class standing are independent (i. e. , there is no relationship between them) H 1: Meal plan and class standing are dependent (i. e. , there is a relationship between them) Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 32

Example: Test of Independence Expected cell frequencies if H 0 is true: Example for

Example: Test of Independence Expected cell frequencies if H 0 is true: Example for one cell: Class Standing Number of meals per week 20/wk 10/wk none Total Fresh. 24. 5 30. 8 14. 7 70 Soph. 21. 0 26. 4 12. 6 60 Junior 10. 5 13. 2 6. 3 30 Senior 14. 0 17. 6 8. 4 40 70 88 42 200 Total Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 33

Example: Test of Independence § The test statistic value is: 2 U = 12.

Example: Test of Independence § The test statistic value is: 2 U = 12. 592 for α =. 05 from the chi-square distribution with (4 – 1)(3 – 1) = 6 degrees of freedom Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 34

Example: Test of Independence Decision Rule: If 2 > 12. 592, reject H 0,

Example: Test of Independence Decision Rule: If 2 > 12. 592, reject H 0, otherwise, do not reject H 0 =0. 05 0 Do not reject H 0 Reject H 0 2 2 U=12. 592 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. Here, 2 = 0. 709 < 2 U = 12. 592, so do not reject H 0 Conclusion: there is insufficient evidence that meal plan and class standing are related. 35

Mc. Nemar Test § Used to test for the difference between two proportions of

Mc. Nemar Test § Used to test for the difference between two proportions of related samples (not independent) § You need to use a test statistic that follows the normal distribution Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 36

Mc. Nemar Test Contingency Table Condition (Group) 2 Condition (Group) 1 Yes No Totals

Mc. Nemar Test Contingency Table Condition (Group) 2 Condition (Group) 1 Yes No Totals Yes A B A+B No C D C+D Totals A+C B+D n Where A = number of respondents who answered yes to condition 1 and condition 2 B = number of respondents who answered yes to condition 1 and no to 2 C = number of respondents who answered no to condition 1 and yes to 2 D = number of respondents who answered no to condition 1 and condition 2 n = number of respondents in the sample Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 37

Mc. Nemar Test Contingency Table Condition (Group) 2 Condition (Group) 1 Yes No Totals

Mc. Nemar Test Contingency Table Condition (Group) 2 Condition (Group) 1 Yes No Totals Yes A B A+B No C D C+D Totals A+C B+D n The sample proportions are: Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 38

Mc. Nemar Test Contingency Table Condition (Group) 2 Condition (Group) 1 Yes No Totals

Mc. Nemar Test Contingency Table Condition (Group) 2 Condition (Group) 1 Yes No Totals Yes A B A+B No C D C+D Totals A+C B+D n The population proportions are: π1 = proportion of the population who answer yes to condition 1 π2 = proportion of the population who answer yes to condition 2 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 39

Mc. Nemar Test Statistic To test the hypothesis: H 0: π1 = π2 H

Mc. Nemar Test Statistic To test the hypothesis: H 0: π1 = π2 H 1: π1 ≠ π2 Use the test statistic: Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 40

Mc. Nemar Test Example § Suppose you survey 300 homeowners and ask them if

Mc. Nemar Test Example § Suppose you survey 300 homeowners and ask them if they are interested in refinancing their home. In an effort to generate business, a mortgage company improved their loan terms and reduced closing costs. The same homeowners were again surveyed. Determine if change in loan terms was effective in generating business for the mortgage company. The data are summarized as follows: Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 41

Mc. Nemar Test Example Survey response after change Survey response before change Yes No

Mc. Nemar Test Example Survey response after change Survey response before change Yes No Totals Yes 118 2 120 No 22 158 180 140 160 300 Totals Test the hypothesis (at the 0. 05 level of significance): H 0: π1 ≥ π2: The change in loan terms was ineffective H 1: π1 < π2: The change in loan terms increased business Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 42

Mc. Nemar Test Example The critical value (. 05 significance) is Z = -1.

Mc. Nemar Test Example The critical value (. 05 significance) is Z = -1. 96 Survey response after change Survey response before Yes change No Totals Yes 118 2 120 No 22 158 180 140 160 300 Totals The test statistic is: Since Z = -4. 08 < -1. 96, you reject H 0 and conclude that the change in loan terms significantly increase business for the mortgage company. Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 43

Wilcoxon Rank-Sum Test § Test two independent population medians § Populations need not be

Wilcoxon Rank-Sum Test § Test two independent population medians § Populations need not be normally distributed § Distribution free procedure § Used when only rank data are available Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 44

Wilcoxon Rank-Sum Test § Can use when both n 1 , n 2 ≤

Wilcoxon Rank-Sum Test § Can use when both n 1 , n 2 ≤ 10 § Assign ranks to the combined n 1 + n 2 sample values § If unequal sample sizes, let n 1 refer to smaller-sized sample § Smallest value rank = 1, largest value rank = n 1 + n 2 § Assign average rank for ties § Sum the ranks for each sample: T 1 and T 2 § Obtain test statistic, T 1 (from smaller sample) Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 45

Checking the Rankings § The sum of the rankings must satisfy the formula below

Checking the Rankings § The sum of the rankings must satisfy the formula below § Can use this to verify the sums T 1 and T 2 where n = n 1 + n 2 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 46

Wilcoxon Rank-Sum Test Hypothesis and Decision M 1 = median of population 1; M

Wilcoxon Rank-Sum Test Hypothesis and Decision M 1 = median of population 1; M 2 = median of population 2 Test statistic = T 1 (Sum of ranks from smaller sample) Two-Tail Test H 0: M 1 = M 2 H 1: M 1 ≠ M 2 Reject Do Not Reject T 1 L Reject T 1 U Reject H 0 if T 1 ≤ T 1 L or if T 1 ≥ T 1 U Left-Tail Test H 0: M 1 ≥ M 2 H 1: M 1 < M 2 Do Not Reject T 1 L Reject H 0 if T 1 ≤ T 1 L Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. Right-Tail Test H 0: M 1 ≤ M 2 H 1: M 1 > M 2 Do Not Reject T 1 U Reject H 0 if T 1 ≥ T 1 U 47

Wilcoxon Rank-Sum Test Example with Small Samples § Sample data are collected on the

Wilcoxon Rank-Sum Test Example with Small Samples § Sample data are collected on the capacity § § rates (% of capacity) for two factories. Are the median operating rates for two factories the same? For factory A, the rates are 71, 82, 77, 94, 88 For factory B, the rates are 85, 82, 97 Test for equality of the population medians at the 0. 05 significance level Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 48

Wilcoxon Rank-Sum Test Example with Small Samples Ranked Capacity values: Tie in 3 rd

Wilcoxon Rank-Sum Test Example with Small Samples Ranked Capacity values: Tie in 3 rd and 4 th places Capacity Rank Factory A Factory B 71 1 77 2 82 3. 5 85 5 88 6 92 94 7 8 97 Rank Sums: Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 9 20. 5 24. 5 49

Wilcoxon Rank-Sum Test Example with Small Samples Factory B has the smaller sample size,

Wilcoxon Rank-Sum Test Example with Small Samples Factory B has the smaller sample size, so the test statistic is the sum of the Factory B ranks: T 1 = 24. 5 The sample sizes are: n 1 = 4 (factory B) n 2 = 5 (factory A) The level of significance is α =. 05 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 50

Wilcoxon Rank-Sum Test Example with Small Samples n Lower and Upper Critical Values for

Wilcoxon Rank-Sum Test Example with Small Samples n Lower and Upper Critical Values for T 1 from Appendix Table E. 8: T 1 L = 11 and T 1 U = 29 n 2 n 1 One. Tailed Two. Tailed 4 5 . 05 . 10 12, 28 19, 36 . 025 . 05 11, 29 17, 38 . 01 . 02 10, 30 16, 39 . 005 . 01 --, -- 15, 40 4 5 6 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 51

Wilcoxon Rank-Sum Test Example with Small Samples § =. 05 § n 1 =

Wilcoxon Rank-Sum Test Example with Small Samples § =. 05 § n 1 = 4 , n 2 = 5 Two-Tail Test H 0: M 1 = M 2 H 1: M 1 ≠ M 2 Reject Do Not Reject T 1 L=11 Reject T 1 U=29 Reject H 0 if T 1 ≤ T 1 L = 11 or if T 1 ≥ T 1 U = 29 Test Statistic (Sum of ranks from smaller sample): T 1 = 24. 5 Decision: Do not reject at a = 0. 05 Conclusion: There is not enough evidence to prove that the medians are not equal. Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 52

Wilcoxon Rank-Sum Test Using Normal Approximation § For large samples, the test statistic T

Wilcoxon Rank-Sum Test Using Normal Approximation § For large samples, the test statistic T 1 is approximately normal with mean and standard deviation: § Must use the normal approximation if either n 1 or n 2 > 10 § Assign n 1 to be the smaller of the two sample sizes § Can use the normal approximation for small samples Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 53

Wilcoxon Rank-Sum Test Using Normal Approximation § The Z test statistic is § Where

Wilcoxon Rank-Sum Test Using Normal Approximation § The Z test statistic is § Where Z approximately follows a standardized normal distribution Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 54

Wilcoxon Rank-Sum Test Using Normal Approximation Use the setting of the prior example: The

Wilcoxon Rank-Sum Test Using Normal Approximation Use the setting of the prior example: The sample sizes were: n 1 = 4 (factory B) n 2 = 5 (factory A) The level of significance was α =. 05 The test statistic was T 1 = 24. 5 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 55

Wilcoxon Rank-Sum Test Using Normal Approximation § The test statistic is § Z =

Wilcoxon Rank-Sum Test Using Normal Approximation § The test statistic is § Z = 1. 10 is less than the critical Z value of 1. 96 (for α = . 05) so you do not reject H 0 – there is not sufficient evidence that the medians are not equal Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 56

Kruskal-Wallis Rank Test § Tests the equality of more than 2 population medians §

Kruskal-Wallis Rank Test § Tests the equality of more than 2 population medians § Use when the normality assumption for one-way ANOVA is violated § Assumptions: § The samples are random and independent § variables have a continuous distribution § the data can be ranked § populations have the same variability § populations have the same shape Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 57

Kruskal-Wallis Rank Test § Obtain overall rankings for each value § In event of

Kruskal-Wallis Rank Test § Obtain overall rankings for each value § In event of tie, each of the tied values gets the average rank § Sum the rankings for data from each of the c groups § Compute the H test statistic Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 58

Kruskal-Wallis Rank Test § The Kruskal-Wallis H test statistic: (with c – 1 degrees

Kruskal-Wallis Rank Test § The Kruskal-Wallis H test statistic: (with c – 1 degrees of freedom) where: n = total number of values over the combined samples c = Number of groups Tj = Sum of ranks in the jth sample nj = Size of the jth sample Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 59

Kruskal-Wallis Rank Test § Complete the test by comparing the calculated H value to

Kruskal-Wallis Rank Test § Complete the test by comparing the calculated H value to a critical 2 value from the chi-square distribution with c – 1 degrees of freedom § Decision rule 0 Do not reject H 0 2 U Reject H 0 § Reject H 0 if test statistic H > 2 U 2 § Otherwise do not reject H 0 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 60

Kruskal-Wallis Rank Test Example § Do different branch offices have a different number of

Kruskal-Wallis Rank Test Example § Do different branch offices have a different number of employees? Office size (Chicago, C) Office size (Denver, D) Office size (Houston, H) 23 41 54 78 66 55 60 72 45 70 30 40 18 34 44 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 61

Kruskal-Wallis Rank Test Example § Do different branch offices have a different number of

Kruskal-Wallis Rank Test Example § Do different branch offices have a different number of employees? Office size Ranking (Chicago, C) 23 41 54 78 66 2 6 9 15 12 = 44 Office size (Denver, D) Ranking Class size (Houston, H) Ranking 55 60 72 45 70 10 11 14 8 13 30 40 18 34 44 3 5 1 4 7 = 56 Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. = 20 62

Kruskal-Wallis Rank Test Example § The H statistic is Statistics for Managers Using Microsoft

Kruskal-Wallis Rank Test Example § The H statistic is Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 63

Kruskal-Wallis Rank Test Example § Compare H = 6. 72 to the critical value

Kruskal-Wallis Rank Test Example § Compare H = 6. 72 to the critical value from the chi-square distribution for 3 – 1 = 2 degrees of freedom and =. 05: Since H = 6. 72 > reject H 0 , There is evidence of a difference in the population median number of employees in the branch offices Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 64

Chapter Summary In this chapter, we have § § § Developed and applied the

Chapter Summary In this chapter, we have § § § Developed and applied the 2 test for the difference between two proportions Developed and applied the 2 test for differences in more than two proportions Examined the 2 test for independence Used the Mc. Nemar test for differences in two related proportions Used the Wilcoxon rank sum test for two population medians § Small Samples § Large sample Z approximation Applied the Kruskal-Wallis H-test for multiple population medians Statistics for Managers Using Microsoft Excel, 5 e © 2008 Prentice-Hall, Inc. 65