Part IV Significantly Different Using Inferential Statistics Chapter

  • Slides: 25
Download presentation
Part IV Significantly Different Using Inferential Statistics Chapter 15 Using Linear Regression Predicting Who’ll

Part IV Significantly Different Using Inferential Statistics Chapter 15 Using Linear Regression Predicting Who’ll Win the Super Bowl

What you will learn in Chapter 15 n How prediction works and how it

What you will learn in Chapter 15 n How prediction works and how it can be used in the social and behavioral sciences n How and why linear regression works n predicting one variable from another n How to judge the accuracy of predictions n The usefulness of multiple regression

What is Prediction All About? n Correlations can be used as a basis for

What is Prediction All About? n Correlations can be used as a basis for the prediction of the value of one variable from the value of another Correlation can be determined by using a set of previously collected data (such as data on variables X and Y) n calculate how correlated these variables are with one another n use that correlation and the knowledge of X to predict Y with a new set of data n

Remember… n The greater the strength of the relationship between two variables (higher the

Remember… n The greater the strength of the relationship between two variables (higher the absolute value of the correlation coefficient) the more accurate the predictive relationship n Why? ? ? n The more two variables share in common (shared variance) the more you know about one variable from the other.

The Logic of Prediction n Prediction is an activity that computes future outcomes from

The Logic of Prediction n Prediction is an activity that computes future outcomes from present ones n What if you wanted to predict college GPA based on high school GPA?

Scatter Plot

Scatter Plot

Regression Line n Regression line – reflects our best guess as to what score

Regression Line n Regression line – reflects our best guess as to what score on the Y variable would be predicted by the X variable. n Also known as the “line of best fit. ”

Prediction of Y given X = 3. 0

Prediction of Y given X = 3. 0

Error in Prediction is rarely perfect…

Error in Prediction is rarely perfect…

Drawing the World’s Best Line n Linear Regression Formula Y=b. X + a n

Drawing the World’s Best Line n Linear Regression Formula Y=b. X + a n Y = dependent variable n n the predicted score or criterion n X = independent variable n the score being used as the predictor n b = the slope n direction of the line n a = the intercept n point at which the line crosses the y-axis

Hasbro

Hasbro

Slope & Intercept n Slope – calculating b n Intercept – calculating a

Slope & Intercept n Slope – calculating b n Intercept – calculating a

Number of Complaints (y) by Reindeer Age (x)

Number of Complaints (y) by Reindeer Age (x)

Complaints by Reindeer Age: Intermediate Calculations

Complaints by Reindeer Age: Intermediate Calculations

SS Reg, SS Error, R 2, and Correlation

SS Reg, SS Error, R 2, and Correlation

Now You Try!! Participant Hours/Week Video Games College GPA 1 3 3. 8 2

Now You Try!! Participant Hours/Week Video Games College GPA 1 3 3. 8 2 15 2. 1 3 22 2. 5 4 30 0. 6 5 11 3. 1 6 25 1. 9 7 6 3. 9 8 12 3. 8 9 17 1. 7 Chapter 6 16

Printout: Slope Int, SS Reg, SS Error and R 2

Printout: Slope Int, SS Reg, SS Error and R 2

College GPA by SAT scores Slope 0. 003478 -1. 07148 Intercept 0. 000832 0.

College GPA by SAT scores Slope 0. 003478 -1. 07148 Intercept 0. 000832 0. 957866 Rsquare 0. 686069 0. 445998 F SS Regression 17. 48335 8 dfs SS 3. 477686 1. 591314 Residual

Severity of Injuries by # hrs per week strength training; Slope -0. 12507 6.

Severity of Injuries by # hrs per week strength training; Slope -0. 12507 6. 847277 Intercept Stand Error 0. 045864 1. 004246 R 2 0. 209854 2. 181672 7. 436476 28 SS Regression 35. 39532 SS 133. 2713 Residual

Using the Computer n SPSS and Linear Regression

Using the Computer n SPSS and Linear Regression

SPSS Output n What does it all mean?

SPSS Output n What does it all mean?

SPSS Scatterplot

SPSS Scatterplot

The More Predictors the Better? Multiple Regression n Multiple Regression Formula n Y =

The More Predictors the Better? Multiple Regression n Multiple Regression Formula n Y = b. X 1 + b. X 2 + a n Y = the value of the predicted score n X 1 = the value of the first independent variable n X 2 = the value of the second independent variable n b = the regression weight for each variable

The BIG Rule… n When using multiple predictors keep in mind. . . n

The BIG Rule… n When using multiple predictors keep in mind. . . n Your independent variables (X 1, , X 2 , , X 3 , etc. ) should be related to the dependent variable (Y)…they should have something in common n However…the independent variables should not be related to each other…they should be “uncorrelated” so that they provide a “unique” contribution to the variance in the outcome of interest.

Glossary Terms to Know n Regression line n Line of best fit n Error

Glossary Terms to Know n Regression line n Line of best fit n Error in prediction n Standard error of the estimate n Criterion n Independent variable n Predictor n Dependent variable n Y prime n Multiple Regression