Significance Testing Statistical testing of the mean z

  • Slides: 14
Download presentation
Significance Testing Statistical testing of the mean (z test)

Significance Testing Statistical testing of the mean (z test)

Binomial Distribution n Mathematicians have figured formulas to estimate long run relative frequencies for

Binomial Distribution n Mathematicians have figured formulas to estimate long run relative frequencies for simple events, like how many heads will appear for a given number of coin tosses. The binomial is one such. Recall dice Number of ‘heads’ in 10 flips of a coin.

Normal Distribution n We have already figured percentages of the normal. Percentages of the

Normal Distribution n We have already figured percentages of the normal. Percentages of the normal correspond to probabilities of finding individual cases in the distribution. The sampling distribution of the mean is normal if N is large. Middle 95 percent from going up and down 1. 96 SDs from the mean.

Significance Testing 1 n n n Significance testing is a ‘what if’ game. We

Significance Testing 1 n n n Significance testing is a ‘what if’ game. We make an assumption, and ask what will happen if the assumption is true. Assumption is null hypothesis. Significance testing is based on probabilities that come from the ‘what if’ scenario (from the null hypothesis). n n n What if the true mean height of students at USF is 66 inches and SD is 5 inches? What if we draw people from USF 100 at a time and plot the means? We can figure the sampling distribution from these assumptions and figure probabilities. Rejection region is place that is unlikely to occur if null is true.

Significance Testing 2 n n n Given: Result 1: Sampling distribution of mean is

Significance Testing 2 n n n Given: Result 1: Sampling distribution of mean is normal (mu =66). Result 2: Standard error of the mean is: What is a rejection region? Rejection Region

Review n n Suppose population mean is 500, population SD is 100 (SAT data),

Review n n Suppose population mean is 500, population SD is 100 (SAT data), and sample size is 100. Draw sampling distribution of means. q q q n What is the shape of this distribution? What is the mean of this distribution? What is the standard deviation of this distribution? Find, mark, and label the rejection regions.

Review Upper = 500+1. 96(10)=519. 6 Lower = 500 -1. 96(10) =480. 4 RR

Review Upper = 500+1. 96(10)=519. 6 Lower = 500 -1. 96(10) =480. 4 RR RR Shape is normal, mean is 500, SD is 10. RR > 519. 6 RR < 480. 4

Significance Testing 3 n n n Establish ‘what if’ Collect sample data. Examine probability

Significance Testing 3 n n n Establish ‘what if’ Collect sample data. Examine probability of sample result given the null. If probability is low, say that null is false, i. e. , reject the null hypothesis. This is a significance test. If we reject the null, we say result is statistically significant. Significance testing lets us make decisions about populations from sample data.

Example n Mean # beers at Skipper’s Smokehouse? Null (what if): n Data from

Example n Mean # beers at Skipper’s Smokehouse? Null (what if): n Data from Skipper’s: n Derive: Reject the null. Result is significant. Observed data are very unlikely if null is true. Null must be false. Lots of beer at Skipper’s. Note: data are fictitious.

Review n We want to know if a workbook helps with learning stats. We

Review n We want to know if a workbook helps with learning stats. We know from past classes that students average 75 percent on the final with a SD of 5. Our new class of 225 has a mean of 78. Did the workbook help? (Hint: sqrt(225) = 15. )

Review Upper = 75+1. 96(. 33) = 75. 65. This is far below 78.

Review Upper = 75+1. 96(. 33) = 75. 65. This is far below 78. The workbook helps.

Definition n The term probability refers to the long run q q 1 2

Definition n The term probability refers to the long run q q 1 2 3 4 Frequency of outcome Odds ratio Relative frequency Rolling of dice

Definition n The calculation of probabilities in hypothesis testing rests upon assumptions described in

Definition n The calculation of probabilities in hypothesis testing rests upon assumptions described in the ______. q q 1 Alternative hypothesis 2 Null hypothesis 3 Sampling distribution 4 Standard error

Definition n The rejection region is the place in the sampling distribution that is

Definition n The rejection region is the place in the sampling distribution that is _____ q q Close to the mean Obtained when the result is not significant Very unlikely if the null hypothesis is true Visited by losers