Residuals and Residual Plots How close is the

  • Slides: 8
Download presentation
Residuals and Residual Plots

Residuals and Residual Plots

How close is the Line of Best Fit? • One additional method to determine

How close is the Line of Best Fit? • One additional method to determine if a linear model is appropriate for a data set is to analyze the residuals. • Do this by comparing the actual data point to the predicted outcome using the equation • A residual is the vertical distance between an observed data value and its predicted value using the regression equation. • A residual plot is a scatter plot of the independent variable on the x-axis and the residuals on the y-axis. The Residual Value is the difference between the actual observed value and the value predicted by the equation Residual value = observed value – predicted value

Use Residual Plots to Interpret Data • The shape of the residual plot can

Use Residual Plots to Interpret Data • The shape of the residual plot can be useful to determine whether a linear model is a good fit for a data set or not • Linear: • If a residual plot results in no identifiable pattern or a flat pattern, then the data may be linearly related. • This means most of the data points were about the same distance from the line of best fit so the line had a strong correlation to the data

Use Residual Plots to Interpret Data • Non-Linear: • If there is a pattern

Use Residual Plots to Interpret Data • Non-Linear: • If there is a pattern in the residual plot, the data may not be linearly related. (it could have some type of non-linear relationship like quadratic, exponential, or none at all)

Residuals Example –Follow Along in Carnegie Book Lesson 3 pg 198 -200 The table

Residuals Example –Follow Along in Carnegie Book Lesson 3 pg 198 -200 The table below shows data of the speed at which a car is travelling and the distance it takes to brake to a complete stop.

Residuals Example –Follow Along in Carnegie Book Lesson 3 pg 198 -200 1. Construct

Residuals Example –Follow Along in Carnegie Book Lesson 3 pg 198 -200 1. Construct a scatterplot and line of best fit) 2. Correlation coefficient r =. 99, strong positive linear association 3. The linear regression equation is y = 5. 4 x - 134

Residuals Example pg 199 Now calculate the residuals First, use the linear regression equation

Residuals Example pg 199 Now calculate the residuals First, use the linear regression equation to calculate the predicted braking distance (y value / output) for each speed (x value / input) The linear regression equation is y = 5. 4 x - 134 Next, subtract: Residual value = observed value – predicted value The Residual Value is the difference between the actual observed value and the value predicted by the equation

Now Create a Residual Plot pg 200 • The residual plot is a scatter

Now Create a Residual Plot pg 200 • The residual plot is a scatter plot of the independent variable on the x-axis and the residuals on the y-axis. • It will show you how far away each actual data point is from the line of best fit • To Graph: keep the x axis the same (speed as the independent variable) and graph the residual value on the y axis