STAT 250 Dr Kari Lock Morgan Hypothesis Testing

STAT 250, Dr. Kari Lock Morgan
Hypothesis Testing: Cautions
Sections 4.3, 4.5
• Type I and II errors (4.3)
• Statistical versus practical significance (4.5)
• Multiple testing (4.5)
Statistics: Unlocking the Power of Data, Lock5

Errors
There are four possibilities:

                 Decision
Truth            Reject H0           Do not reject H0
H0 true          TYPE I ERROR        (correct)
H0 false         (correct)           TYPE II ERROR

• A Type I Error is rejecting a true null (false positive)
• A Type II Error is not rejecting a false null (false negative)

Red Wine and Weight Loss
• In the test to see if resveratrol is associated with food intake, the p-value is 0.035.
  o If resveratrol is not associated with food intake, a Type I Error would have been made
• In the test to see if resveratrol is associated with locomotor activity, the p-value is 0.980.
  o If resveratrol is associated with locomotor activity, a Type II Error would have been made

Analogy to Law
A person is innocent until proven guilty.
Evidence must be beyond the shadow of a doubt.
Types of mistakes in a verdict?
• Convict an innocent person: Type I error
• Release a guilty person: Type II error

Probability of Type I Error
Distribution of statistics, assuming H0 true:
If the null hypothesis is true:
• 5% of statistics will be in the most extreme 5%
• 5% of statistics will give p-values less than 0.05
• 5% of statistics will lead to rejecting H0 at α = 0.05
• If α = 0.05, there is a 5% chance of a Type I error

Probability of Type I Error
Distribution of statistics, assuming H0 true:
If the null hypothesis is true:
• 1% of statistics will be in the most extreme 1%
• 1% of statistics will give p-values less than 0.01
• 1% of statistics will lead to rejecting H0 at α = 0.01
• If α = 0.01, there is a 1% chance of a Type I error

Probability of Type I Error
• The probability of making a Type I error (rejecting a true null) is the significance level, α
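
The claim that the Type I error rate equals α can be checked by simulation. The sketch below is not from the slides: it assumes a two-sided z-test of H0: μ = 0 with known standard deviation 1, draws many samples for which H0 really is true, and counts how often the test rejects. The function names and the choice n = 30 are illustrative assumptions.

```python
import random
from statistics import NormalDist

def z_test_p_value(sample, mu0=0.0, sigma=1.0):
    """Two-sided p-value for H0: mu = mu0, assuming sigma is known."""
    n = len(sample)
    z = (sum(sample) / n - mu0) / (sigma / n ** 0.5)
    return 2 * (1 - NormalDist().cdf(abs(z)))

def type_one_error_rate(alpha, n=30, n_sims=10_000, seed=1):
    """Fraction of simulated tests that reject H0 when H0 is actually true."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(n_sims):
        # Data generated with mu = 0, so the null hypothesis is true
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        if z_test_p_value(sample) < alpha:
            rejections += 1
    return rejections / n_sims

for alpha in (0.05, 0.01):
    print(alpha, type_one_error_rate(alpha))
```

The printed rejection rates should land close to 0.05 and 0.01 respectively, up to simulation noise.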

Probability of Type II Error
• How can we reduce the probability of making a Type II Error (not rejecting a false null)?
a) Decrease the sample size
b) Increase the sample size

Larger sample size makes it easier to reject the null
H0: p = 0.5
Ha: p > 0.5
n = 100
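
To make the effect of sample size concrete, here is a minimal sketch that computes the exact binomial probability of rejecting H0: p = 0.5 in favor of Ha: p > 0.5 at α = 0.05. The true proportion 0.55 and the larger sample sizes (400 and 1000) are assumptions chosen for illustration; only n = 100 appears on the slide.

```python
from math import comb

def binom_pmf(n, p):
    """List of P(X = i) for i = 0..n, where X ~ Binomial(n, p)."""
    return [comb(n, i) * p**i * (1 - p)**(n - i) for i in range(n + 1)]

def power(n, p_true, p_null=0.5, alpha=0.05):
    """P(reject H0: p = p_null) for the one-sided test Ha: p > p_null."""
    null_pmf = binom_pmf(n, p_null)
    # Grow the rejection region {X >= k} downward from k = n for as long
    # as its probability under H0 stays at or below alpha.
    tail, k = 0.0, n + 1
    while k > 0 and tail + null_pmf[k - 1] <= alpha:
        k -= 1
        tail += null_pmf[k]
    # Power is the probability of landing in that region under the true p.
    return sum(binom_pmf(n, p_true)[k:])

for n in (100, 400, 1000):
    print(n, round(power(n, p_true=0.55), 3))
```

Under these assumptions the chance of rejecting a false null rises with n, which is the same as saying the chance of a Type II error falls.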

Probability of Type II Error
• How can we reduce the probability of making a Type II Error (not rejecting a false null)?
a) Decrease the significance level
b) Increase the significance level

Significance Level and Errors
α
• Reject H0
  o Could be making a Type I error if H0 true
  o α is the chance of a Type I error
  o Decrease α if a Type I error is very bad
• Do not reject H0
  o Could be making a Type II error if Ha true
  o α is related to the chance of making a Type II error
  o Increase α if a Type II error is very bad

Probability of Errors
• The probability of making a Type I error (rejecting a true null) if the null is true is the significance level, α
• The probability of making a Type II error (not rejecting a false null) if the alternative is true depends on the significance level and the sample size (among other things)
• α should be chosen depending on how bad it is to make a Type I or Type II error
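
The trade-off between α and the Type II error rate can be sketched numerically. The example below is an illustration under stated assumptions, not the course's method: a one-sided z-test of H0: μ = 0 against Ha: μ > 0 with known standard deviation, and a hypothetical true mean 0.3 standard deviations above 0.

```python
from statistics import NormalDist

def type_two_error(alpha, n, effect=0.3):
    """P(do not reject H0) when the true mean is `effect` SDs above 0.

    Assumes a one-sided z-test with known standard deviation.
    """
    std_normal = NormalDist()
    critical_z = std_normal.inv_cdf(1 - alpha)  # reject when z exceeds this
    # Under the alternative, the z statistic is centered at effect * sqrt(n)
    return std_normal.cdf(critical_z - effect * n ** 0.5)

for alpha in (0.01, 0.05, 0.10):
    row = [round(type_two_error(alpha, n), 3) for n in (20, 50, 100)]
    print(alpha, row)
```

Reading across a row shows the Type II error probability shrinking as n grows; reading down a column shows it shrinking as α grows, at the cost of a higher Type I error rate.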

Choosing α
• By default, usually α = 0.05
• If a Type I error (rejecting a true null) is much worse than a Type II error, we may choose a smaller α, like α = 0.01
• If a Type II error (not rejecting a false null) is much worse than a Type I error, we may choose a larger α, like α = 0.10

Significance Level
Come up with a hypothesis testing situation in which you may want to…
• Use a smaller significance level, like α = 0.01
• Use a larger significance level, like α = 0.10

Statistical vs Practical Significance
• With small sample sizes, even large differences or effects may not be significant
• With large sample sizes, even a very small difference or effect can be significant
• A statistically significant result is not always practically significant, especially with large sample sizes

Statistical vs Practical Significance
• Example: Suppose a weight loss program recruits 10,000 people for a randomized experiment.
• A difference in average weight loss of only 0.5 lbs could be found to be statistically significant
• Suppose the experiment lasted for a year. Is a loss of ½ a pound practically significant?
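
A rough calculation shows how a 0.5 lb difference can reach statistical significance with 10,000 participants. The slide gives no standard deviation or group sizes, so the sketch below assumes two equal groups of 5,000 and a standard deviation of 10 lbs in each group; both numbers are hypothetical, as is the normal approximation.

```python
from statistics import NormalDist

def two_sample_z_p_value(diff, sd, n_per_group):
    """Two-sided p-value for a difference in means, equal SDs and group sizes."""
    standard_error = sd * (2 / n_per_group) ** 0.5
    z = diff / standard_error
    return 2 * (1 - NormalDist().cdf(abs(z)))

# Same 0.5 lb difference, very different sample sizes (SD = 10 is assumed)
p_large = two_sample_z_p_value(diff=0.5, sd=10, n_per_group=5000)
p_small = two_sample_z_p_value(diff=0.5, sd=10, n_per_group=50)
print(p_large, p_small)
```

Under these assumptions the large study gives a p-value of roughly 0.01 while the small one is nowhere near significance, even though the half-pound effect is identical in both.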

Diet and Sex of Baby
• Are certain foods in your diet associated with whether or not you conceive a boy or a girl?
• To study this, researchers asked women about their eating habits, including asking whether or not they ate 133 different foods regularly
• A significant difference was found for breakfast cereal (mothers of boys eat more), prompting the headline “Breakfast Cereal Boosts Chances of Conceiving Boys”.
http://www.newscientist.com/article/dn13754-breakfast-cereals-boost-chances-of-conceiving-boys.html

“Breakfast Cereal Boosts Chances of Conceiving Boys”
I used to eat breakfast cereal every morning and have two boys. Do you think this helped to boost my chances of having boys?
a) Yes
b) No
c) Impossible to tell

Hypothesis Tests
For each of the 133 foods studied, a hypothesis test was conducted for a difference between mothers who conceived boys and girls in the proportion who consume each food
• If there are NO differences (all null hypotheses are true), about how many significant differences would be found using α = 0.05?
• How might you explain the significant difference for breakfast cereal?

Multiple Testing
When multiple hypothesis tests are conducted, the chance that at least one test incorrectly rejects a true null hypothesis increases with the number of tests.
If the null hypotheses are all true, about a proportion α of the tests will yield statistically significant results just by random chance.
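
For the 133 food tests, the arithmetic can be sketched as follows. The expected count needs no extra assumptions; the "at least one" probability assumes the tests are independent, which the slides do not state, and the simulation assumes p-values are uniform on (0, 1) when the null is true.

```python
import random

def expected_false_positives(n_tests, alpha):
    """Expected number of significant results when every null is true."""
    return n_tests * alpha

def prob_at_least_one_false_positive(n_tests, alpha):
    """Chance of one or more false positives, ASSUMING independent tests."""
    return 1 - (1 - alpha) ** n_tests

def simulate_false_positives(n_tests, alpha, seed=0):
    """Count p-values below alpha when each p-value is Uniform(0, 1)."""
    rng = random.Random(seed)
    return sum(rng.random() < alpha for _ in range(n_tests))

print(expected_false_positives(133, 0.05))          # about 6.65 tests
print(prob_at_least_one_false_positive(133, 0.05))  # above 0.99
print(simulate_false_positives(133, 0.05))          # one simulated study
```

So even with no real associations, a handful of "significant" foods is the expected outcome, and finding none at all would be the surprise.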

[Cartoon] www.causeweb.org, Author: JB Landers

Multiple Comparisons
• Consider a topic that is being investigated by research teams all over the world
Using α = 0.05, about 5% of teams are going to find something significant, even if the null hypothesis is true

Multiple Comparisons
• Consider a research team/company doing many hypothesis tests
Using α = 0.05, about 5% of tests are going to be significant, even if the null hypotheses are all true

Multiple Comparisons
• This is a serious problem
• The most important thing is to be aware of this issue, and not to trust claims that are obviously one of many tests (unless they specifically mention an adjustment for multiple testing)
• There are ways to account for this (e.g., Bonferroni’s correction), but these are beyond the scope of this class

Publication Bias
• Publication bias refers to the fact that usually only the significant results get published
• The one study that turns out significant gets published, and no one knows about all the non-significant results
• This, combined with the problem of multiple comparisons, can yield very misleading results

Jelly Beans Cause Acne!
http://xkcd.com/882/

Multiple Testing and Publication Bias
• THIS SHOULD SCARE YOU.
• Why most published research findings are false.

Cuckoo Birds
• Cuckoo birds lay their eggs in the nests of other birds
• When the cuckoo baby hatches, it kicks out all the original eggs/babies
• If the cuckoo is lucky, the mother will raise the cuckoo as if it were her own
• Do cuckoo birds found in nests of different species differ in size?
http://opinionator.blogs.nytimes.com/2010/06/01/cuckoo-cuckoo/

Length of Cuckoo Eggs [figure]

Cuckoo Eggs

Bird            Sample Mean   Sample SD   Sample Size
Pied Wagtail    22.90         1.07        15
Pipit           22.50         0.97        60
Robin           22.58         0.68        16
Sparrow         23.12         1.07        14
Wren            21.13         0.74        15
Overall         22.46         1.07        120

p-values

                Pied Wagtail   Pipit     Robin     Sparrow   Wren
Pied Wagtail    -              0.21      0.34      0.59      0.0001
Pipit           0.21           -         0.71      0.07      0.00003
Robin           0.34           0.71      -         0.13      0.00006
Sparrow         0.59           0.07      0.13      -         0.00006
Wren            0.0001         0.00003   0.00006   0.00006   -
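
The table holds 10 distinct pairwise comparisons, so multiple testing applies here too. As an illustration only, the sketch below applies a Bonferroni correction (compare each p-value to α divided by the number of tests) to the p-values in the table. The slides name this correction but do not apply it, so treating these 10 tests as one family at α = 0.05 is an assumption.

```python
ALPHA = 0.05

# Pairwise p-values copied from the table (each pair listed once)
p_values = {
    ("Pied Wagtail", "Pipit"): 0.21,
    ("Pied Wagtail", "Robin"): 0.34,
    ("Pied Wagtail", "Sparrow"): 0.59,
    ("Pied Wagtail", "Wren"): 0.0001,
    ("Pipit", "Robin"): 0.71,
    ("Pipit", "Sparrow"): 0.07,
    ("Pipit", "Wren"): 0.00003,
    ("Robin", "Sparrow"): 0.13,
    ("Robin", "Wren"): 0.00006,
    ("Sparrow", "Wren"): 0.00006,
}

# Bonferroni: divide the significance level by the number of tests
bonferroni_threshold = ALPHA / len(p_values)
significant = [pair for pair, p in p_values.items() if p < bonferroni_threshold]

print(bonferroni_threshold)
for pair in significant:
    print(pair, p_values[pair])
```

With this stricter threshold, the only comparisons that remain significant are the four involving the wren, so those differences are unlikely to be artifacts of running many tests.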

Summary
• Two types of errors: rejecting a true null (Type I) and not rejecting a false null (Type II)
• Statistical vs practical significance
• Using α = 0.05, about 5% of hypothesis tests will lead to rejecting the null even when all nulls are true

To Do
• Read Sections 4.3, 4.5
• Do HW 4.5 (due Friday, 3/20)