Chapter 5 LSRL Bivariate Data x explanatory independent
Chapter 5 LSRL
Bivariate Data • x: explanatory (independent) variable • y: response (dependent) variable • Use x to predict y
– (y-hat) means predicted y-value b – slope Be sure when to put the hat by – the amount y increases x increases 1 unit on the y! a – y-intercept – height of the line when x = 0 – in some situations, y-intercept has no meaning
Least Squares Regression Line (LSRL) • Line that gives the best fit to the data • Minimizes the sum of the squared deviations from the line
(3, 10) y =. 5(6) + 4 = 7 4. 5 2 – 7 = -5 y=4 0 – 4 = -4 y =. 5(3) + 4 = 5. 5 -4 (0, 0) 10 – 5. 5 = 4. 5 -5 (6, 2) Sum of the squares = 61. 25
What is the sum of the deviations from the line? Will it always be zero? (3, 10) 6 The line that minimizes the sum of the squared deviations from the line is the LSRL -3 (0, 0) Use a calculator to find the line of best fit -3 (6, 2) Sum of the squares = 54
Interpretations Slope: For each unit increase in x, there is an approximate increase/decrease of b in y. Correlation coefficient: There is a strength, direction, linear relationship between x and y.
The ages (in months) and heights (in inches) of seven children are given. Age 16 24 42 60 75 102 120 Ht. 24 30 35 40 48 56 60 Find the LSRL. Interpret the slope and correlation coefficient in the context of the problem.
Slope: For each increase of one month in age, age there is an approximate increase of. 34 inches in heights of children. Correlation coefficient: There is a strong, positive, linear relationship between the age and height of children
The ages (in months) and heights (in inches) of seven children are given. x 16 24 42 60 75 102 120 y 24 30 35 40 48 56 60 Predict the height of a child who is 4. 5 years old. Predict the height of someone who is 20 years old.
Extrapolation • LSRL should not be used to predict y for x-values outside the data set • We don’t know if the pattern in the scatterplot continues
Age 16 24 42 60 75 102 120 Ht. 24 30 35 40 48 56 60 Calculate x & y. Find the point (x, y) on the scatterplot. This point is always on the LSRL
Both r and the LSRL are non-resistant measures
Formulas on green sheet
The following statistics are found for the variables posted speed limit and the average number of accidents. Find the LSRL & predict the number of accidents for a posted speed limit of 50 mph.
- Slides: 15