ANOVA ANALYSIS OF VARIANCE AN ANOVA SITUATION Subjects
ANOVA: ANALYSIS OF VARIANCE
AN ANOVA SITUATION Subjects: 25 patients with blisters Treatments: Treatment A, Treatment B, Placebo (randomly assigned) Measurement: # of days until blisters heal Data [and means]: • A: 5, 6, 6, 7, 7, 8, 9, 10 • B: 7, 7, 8, 9, 9, 10, 11 • P: 7, 9, 9, 10, 10, 11, 12, 13 [7. 25] [8. 875] [10. 11] Are these differences in the means significant?
WHAT DOES ANOVA DO? At its simplest (there are extensions) ANOVA tests the following hypotheses: H 0: The means of all the groups are equal. Ha: Not all the means are equal • Doesn’t say how or which ones differ. • Can follow up with “multiple comparisons”
CONDITIONS OF ANOVA • Random • Normal/Large Sample • Standard deviations of each group are approximately equal q Rule of thumb: ratio of largest to smallest sample st. dev. must be less than 2: 1 (largest sd/ smallest sd < 2)
NORMALITY CHECK
STANDARD DEVIATION CHECK Variable days treatment A B P N 8 8 9 Mean 7. 250 8. 875 10. 111 Median 7. 000 9. 000 10. 000 St. Dev 1. 669 1. 458 1. 764 Compare largest and smallest standard deviations: • largest: 1. 764 • smallest: 1. 458 • 1. 764/1. 458 = 1. 21 < 2
HOW ANOVA WORKS (OUTLINE) ANOVA measures two sources of variation in the data and compares their relative sizes • variation BETWEEN groups • for each data value look at the difference between its group mean and the overall mean • variation WITHIN groups • for each data value we look at the difference between that value and the mean of its group
The ANOVA F-statistic is a ratio of the Between Group Variation divided by the Within Group Variation: A large F is evidence against H 0, since it indicates that there is more difference between groups than within groups.
DEGREES OF FREEDOM • The between group df (numerator) is one less than the number of groups • We have three groups, so df(numerator) = 3 – 1 = 2 • The within group df is the sum of the individual df’s of each group or Total number of observations – number of groups • The sample sizes are 8, 8, and 9 • df(denominator) = 7 + 8 = 22 or 25 – 3 = 22
OUR DATA Variable days treatment A B P N 8 8 9 Mean 7. 250 8. 875 10. 111 Median 7. 000 9. 000 10. 000 St. Dev 1. 669 1. 458 1. 764
CALCULATIONS 1. Calculate the overall mean for our data. 2. Calculate MSG 3. Calculate MSE 4. Calculate the F-statistic. F = MSG MSE
MINITAB ANOVA OUTPUT Analysis of Variance for days Source DF SS MS treatment 2 34. 74 17. 37 Error 22 59. 26 2. 69 Total 24 94. 00 F 6. 45 P 0. 006
SO HOW BIG IS F? Since F is Mean Square Between / Mean Square Within = MSG / MSE A large value of F indicates relatively more difference between groups than within groups (evidence against H 0) To get the P-value, we will need to use the F-distribution. Be thankful for technology!!!
OUR ORIGINAL SITUATION Subjects: 25 patients with blisters Treatments: Treatment A, Treatment B, Placebo (randomly assigned) Measurement: # of days until blisters heal Data [and means]: • A: 5, 6, 6, 7, 7, 8, 9, 10 • B: 7, 7, 8, 9, 9, 10, 11 • P: 7, 9, 9, 10, 10, 11, 12, 13 Put each treatment into a list. Run an ANOVA test on your calculator.
- Slides: 15