Hypothesis Testing
• A procedure for determining which of two (or more) mutually exclusive statements is more likely true
• We classify hypothesis tests in terms of
  – Parametric vs. Non-parametric
  – Directional vs. Non-directional

Hypothesis Tests
• Parametric Tests - tests about specific population parameters (μ, σ², etc.)
  – Is μ1 different from a predetermined value?
  – Is μ1 different from μ2?
• Non-Parametric Tests - tests about the shape of the population (medians?)
  – Is this population different from another population?

Hypothesis Tests
• Non-Directional Hypotheses
  – Is μ1 different from μ2?
  – Is the distribution of scores in group 1 different from that in group 2?
• Directional Hypotheses
  – Is μ1 greater than μ2?
  – Is μ1 less than μ2?
  – Is the distribution of scores in group 1 to the right of (greater than) that in group 2?

The Six Steps of Hypothesis Testing
1. State and Check Assumptions
2. Generate Null and Alternative Hypotheses
3. Choose the Sampling Distribution of the Test Statistic
4. Set Significance Level
5. Compute the Test Statistic
6. Draw Conclusions

1. State and Check Assumptions
• There are three requirements for hypothesis testing to work
  – Assumptions about the population
  – Assumptions about the sample

Assumptions about the Population
• “Assumption of Normality” - the population is normally distributed, or the sample size is sufficiently large so that the Central Limit Theorem (CLT) comes into play and
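The role of the CLT in the normality assumption can be illustrated with a small simulation. This is only a sketch: the exponential population, the sample sizes (2 and 50), and the helper names `sample_mean` and `skewness` are all assumptions chosen for illustration, not part of the slides. It draws many samples from a strongly skewed population and shows that the distribution of the sample mean becomes far more symmetric as the sample size grows.

```python
import random
import statistics

random.seed(0)  # fixed seed so the sketch is repeatable

def sample_mean(n):
    """Mean of n draws from an exponential population with mean 1 (assumed population)."""
    return statistics.fmean(random.expovariate(1.0) for _ in range(n))

def skewness(values):
    """Moment-based skewness; close to 0 for a symmetric distribution."""
    m = statistics.fmean(values)
    s = statistics.pstdev(values)
    return statistics.fmean(((v - m) / s) ** 3 for v in values)

# Sampling distributions of the mean for a small and a larger sample size
means_small = [sample_mean(2) for _ in range(5000)]
means_large = [sample_mean(50) for _ in range(5000)]

print("skewness of sample means, n = 2: ", round(skewness(means_small), 2))
print("skewness of sample means, n = 50:", round(skewness(means_large), 2))
```

The skewness of the sample means shrinks toward 0 as n grows, which is the sense in which a large sample can stand in for a normally distributed population.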

Assumptions about Variance
• Depending on the type of test, some important features of variance may come into play
  – Is σ or σ² known?
  – “Homogeneity of Variance” - the populations being compared have equal variances
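A quick descriptive look at homogeneity of variance might be sketched as follows. The two groups of scores are made-up data, and comparing the sample variances by their ratio is an informal check assumed here for illustration; it is not a formal test and is not taken from the slides.

```python
import statistics

# Hypothetical scores for two groups (illustrative data only)
group_1 = [12.1, 14.3, 13.8, 15.0, 12.9, 14.4]
group_2 = [11.5, 16.2, 13.0, 17.1, 10.8, 15.9]

# Sample variances (n - 1 in the denominator) estimate each population's variance
var_1 = statistics.variance(group_1)
var_2 = statistics.variance(group_2)

# Ratio of the larger to the smaller variance; 1.0 would mean identical sample variances
ratio = max(var_1, var_2) / min(var_1, var_2)

print("variance of group 1:", round(var_1, 2))
print("variance of group 2:", round(var_2, 2))
print("larger / smaller:   ", round(ratio, 2))
```

A ratio far from 1 hints that the variances may be heterogeneous; deciding whether the difference matters requires a formal test.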

Assumptions about the Sample
• The sample has been obtained using independent random sampling

What if the assumptions can’t be met?
• “Violation” - when assumptions are not met
  – Violation of Normality = “Non-normal”
  – Violation of Homogeneity of Variance = “Heterogeneous Variance”
• “Robust” - a test’s ability to “deal with” violations
  – “a t-test is robust to violations of normality”

BUT...
• No tests are robust to violations of the random sampling assumption.
• If you do not have a random sample, probability theory will not work and therefore inferential statistical techniques will fail.

2. Generate Null and Alternative Hypotheses
• These two hypotheses, designated H0 and HA (or H1), are mutually exclusive
  – mutually exclusive - they don’t overlap

Properties of H0
• Specifies no difference or no change from a standard or theoretical value
• Always specifies something about a particular population parameter
• Used in constructing a sampling distribution
  – For the subsequent quantitative work, the null hypothesis is assumed to be true

Properties of HA (or H1)
• About the same aspect of the population as H0
• Usually stated in general terms
• Mutually exclusive - no overlap with H0
• Used in making a decision
• Can be directional or non-directional

Directional vs. Non-directional HAs
• Non-directional HA - usually stated as “does not equal” or “is different than”
• Directional HA - stated as “greater than” or “less than”
  – note that a non-directional hypothesis is equivalent to the two directional hypotheses taken together: “greater than” or “less than”
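The relationship between a non-directional HA and the two directional ones can be sketched numerically. The example assumes a test statistic that follows the standard normal distribution under H0, and the observed value z = 1.80 is hypothetical. The non-directional probability is the sum of the “greater than” tail and the “less than” tail.

```python
from statistics import NormalDist

z = 1.80                  # hypothetical observed z statistic
std_normal = NormalDist() # standard normal: mean 0, standard deviation 1

# Directional "greater than": area above +|z|
upper_tail = 1 - std_normal.cdf(abs(z))
# Directional "less than": area below -|z|
lower_tail = std_normal.cdf(-abs(z))
# Non-directional: either direction counts as extreme, so the tails add
p_two_sided = upper_tail + lower_tail

print("upper tail:", round(upper_tail, 4))
print("lower tail:", round(lower_tail, 4))
print("two-sided: ", round(p_two_sided, 4))
```

Because the normal distribution is symmetric, the two tails are equal and the non-directional probability is twice either directional one.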

3. Choose the Sampling Distribution
• Depending on the type of data, assumptions, and hypotheses, a particular distribution of the test statistic must be selected
• The second half of this course will be devoted to making the “best” decision about which test statistic to choose (z, t, F, etc.)

4. Set Significance Level
• Hypothesis testing is sometimes called “significance testing”
• The significance level is the basis for making our decision
  – “rejection region” - the set of test-statistic values, specified by the significance level of the test, for which H0 is rejected
  – “critical value” - the value of the test statistic specified by the significance level that begins the rejection region
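As an illustration of how a significance level fixes a critical value, the sketch below assumes the test statistic is a z statistic with a standard normal sampling distribution and uses α = .05. The variable names are illustrative. The critical value is the point that cuts off a tail area of α (or α/2 in each tail for a non-directional test).

```python
from statistics import NormalDist

alpha = 0.05
std_normal = NormalDist()

# Non-directional test: alpha is split between the two tails
crit_two_sided = std_normal.inv_cdf(1 - alpha / 2)
# Directional "greater than" test: all of alpha sits in the upper tail
crit_upper = std_normal.inv_cdf(1 - alpha)

print("reject H0 if |z| >", round(crit_two_sided, 3))
print("reject H0 if  z  >", round(crit_upper, 3))
```

Under these assumptions the critical values are about 1.96 and 1.645, and the rejection region is everything beyond them.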

Significance Level
• The probability value associated with the decision rule is called the significance level of the test
• Significance level is represented by the Greek letter alpha (α)
• The actual value of α is up to you

What is the significance level?
• Hypothesis testing entails determining which of two hypotheses (H0 and H1) is more likely correct. But “more likely” is a subjective evaluation on your part.
• If you were to obtain a statistic that was unlikely if H0 were assumed to be true, would you be willing to accept H1? How “unlikely” does it need to be for you to be convinced?

Typical Significance Level
• A typical significance level: α = .05
• We consider results significantly different from H0 when they fall in the most extreme 5% (a proportion of .05) of all possible outcomes specified by H0
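One way to see what “the most extreme 5% of all possible outcomes” means is to simulate many samples for which H0 really is true and count how often a non-directional z-test at α = .05 rejects. The sketch assumes a normal population with hypothetical values μ0 = 100, σ = 15, and n = 25; none of these numbers come from the slides.

```python
import math
import random
from statistics import NormalDist, fmean

random.seed(1)  # fixed seed so the sketch is repeatable

alpha = 0.05
mu_0, sigma, n = 100.0, 15.0, 25           # hypothetical null value, known sigma, sample size
crit = NormalDist().inv_cdf(1 - alpha / 2)  # two-sided critical value

def z_from_null_sample():
    """Draw one sample from the population described by H0 and return its z statistic."""
    sample = [random.gauss(mu_0, sigma) for _ in range(n)]
    return (fmean(sample) - mu_0) / (sigma / math.sqrt(n))

trials = 10_000
rejections = sum(abs(z_from_null_sample()) > crit for _ in range(trials))
rejection_rate = rejections / trials

print("proportion of samples in the rejection region:", rejection_rate)
```

The proportion comes out close to .05: when H0 is true, about 5% of samples still land in the rejection region purely by chance.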

Using α
• With our significance level, we determine a decision rule
  – Critical Value - the sufficiently extreme value of the statistic such that if our statistic is more extreme, we reject H0
  – p-value - if the probability of obtaining a statistic at least as extreme as ours, assuming H0 is true, is less than α, we reject H0
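The two forms of the decision rule can be checked against each other. The sketch below assumes a non-directional z-test and a hypothetical observed statistic z = 2.30. It applies the critical-value rule and the p-value rule separately and shows that they reach the same decision.

```python
from statistics import NormalDist

alpha = 0.05
z = 2.30                   # hypothetical observed z statistic
std_normal = NormalDist()

# Rule 1: compare the statistic with the critical value
crit = std_normal.inv_cdf(1 - alpha / 2)
reject_by_critical_value = abs(z) > crit

# Rule 2: compare the p-value with alpha
# (probability, under H0, of a statistic at least as extreme as ours in either direction)
p_value = 2 * (1 - std_normal.cdf(abs(z)))
reject_by_p_value = p_value < alpha

print("critical value:", round(crit, 3), "-> reject:", reject_by_critical_value)
print("p-value:       ", round(p_value, 4), "-> reject:", reject_by_p_value)
```

The two rules are the same test viewed from different scales: one compares statistics, the other compares probabilities.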

5. Compute the Test Statistic
• From the random sample obtained, the test statistic is computed using various formulae
  – z-statistic
  – binomial probability
  – t-statistic
  – F-statistic
  – etc.
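Three of the formulae named above can be sketched for the one-sample case. The data, the null value μ0 = 50, and the “known” σ = 4 are hypothetical, and the function name `binomial_probability` is illustrative. The F-statistic is left out because it needs more than one group.

```python
import math
import statistics

sample = [52, 55, 49, 53, 51, 54, 50, 56]  # hypothetical scores
mu_0 = 50    # value specified by H0 (assumed)
sigma = 4    # population standard deviation, assumed known for the z statistic

n = len(sample)
x_bar = statistics.fmean(sample)

# z statistic: sigma is known
z_stat = (x_bar - mu_0) / (sigma / math.sqrt(n))

# t statistic: sigma is unknown and estimated by the sample standard deviation s
s = statistics.stdev(sample)
t_stat = (x_bar - mu_0) / (s / math.sqrt(n))

def binomial_probability(k, n_trials, p):
    """Probability of exactly k successes in n_trials independent trials with success chance p."""
    return math.comb(n_trials, k) * p**k * (1 - p) ** (n_trials - k)

print("z =", round(z_stat, 3))
print("t =", round(t_stat, 3), "with", n - 1, "degrees of freedom")
print("P(exactly 8 of 10 | p = .5) =", round(binomial_probability(8, 10, 0.5), 4))
```

The z and t statistics share the same numerator; they differ only in whether the standard error uses the known σ or the estimate s.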

6. Draw Conclusions
• Based on the test statistic just computed, we do one of two things:
  – Reject the null hypothesis and accept the alternative hypothesis, or
  – Do not reject the null hypothesis and do not accept the alternative hypothesis
• Note that we NEVER accept the null hypothesis; we only fail to reject it!

More on Decisions
• When completed, a significance test tells us the probability of obtaining our results when the null hypothesis is true: p(Results | H0 is true)
• If that probability is small, smaller than our significance level (α), it is likely that H0 is not true and we reject it
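A small worked case may make p(Results | H0 is true) concrete. The coin-flip setting is an assumption chosen for illustration: H0 says the coin is fair (p = .5), the directional HA says heads are more likely, and “results” is read as “at least this many heads”. The function names `upper_tail_p` and `conclude` are hypothetical.

```python
import math

def upper_tail_p(k, n, p_null):
    """P(X >= k) for X ~ Binomial(n, p_null): results at least this extreme if H0 is true."""
    return sum(
        math.comb(n, i) * p_null**i * (1 - p_null) ** (n - i)
        for i in range(k, n + 1)
    )

def conclude(p_value, alpha=0.05):
    """Apply the decision rule; note that H0 is never 'accepted'."""
    return "reject H0" if p_value < alpha else "fail to reject H0"

p_nine = upper_tail_p(9, 10, 0.5)   # 9 or more heads in 10 flips
p_seven = upper_tail_p(7, 10, 0.5)  # 7 or more heads in 10 flips

print("9+ heads:", round(p_nine, 4), "->", conclude(p_nine))
print("7+ heads:", round(p_seven, 4), "->", conclude(p_seven))
```

Nine or more heads would be rare for a fair coin, so H0 is rejected; seven or more is not rare enough, so we fail to reject H0 without claiming the coin is fair.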