 2 Analyzing TwoVariable Data Lesson 2 2 Relationships

• Slides: 14 2 Analyzing Two-Variable Data Lesson 2. 2 Relationships Between Two Quantitative Variables Statistics and Probability with Applications, 3 rd Edition Starnes & Tabor Bedford Freeman Worth Publishers Relationships Between Two Quantitative Variables Learning Targets After this lesson, you should be able to: ü Distinguish between explanatory and response variables for quantitative data. ü Make a scatterplot to display the relationship between two quantitative variables. ü Describe the direction, form, and strength of a relationship displayed in a scatterplot, and identify outliers. Statistics and Probability with Applications, 3 rd Edition 2 Relationships Between Two Quantitative Variables Although there are many ways to display the distribution of a single quantitative variable, a scatterplot is the best way to display the relationship between two quantitative variables. Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. The values of one variable appear on the horizontal axis, and the values of the other variable appear on the vertical axis. Each individual in the data set appears as a point in the graph. Statistics and Probability with Applications, 3 rd Edition 3 Relationships Between Two Quantitative Variables Here is a scatterplot showing the relationship between the weight (in carats) and price (in \$1000 s) for 95 different round, clear, internally flawless, and excellently cut diamonds. It is fairly easy to make a scatterplot by hand. The first step is to determine which variable is the response variable and which variable is the explanatory variable. However, in some cases there isn’t a clear explanatory or response variable. Statistics and Probability with Applications, 3 rd Edition 4 What grade would you give this house? Identifying explanatory variables PROBLEM: Identify the explanatory variable for the following relationships if possible. Explain your reasoning. (a) The average hours of study time per week and the average hours of extracurricular activities per week for a sample of students. The explanatory variable could be either the average hours of study time per week or the average hours of extra-curricular activities per week, because either variable could be used to explain or predict the other. (b) The square footage of living space and the price in a sample of houses for sale. The explanatory variable is the square footage of living space, because the square footage of living space in a house helps explain its price. Statistics and Probability with Applications, 3 rd Edition 5 Relationships Between Two Quantitative Variables How to Make a Scatterplot 1. Label the axes. The explanatory variable is plotted on the horizontal axis and the response variable is plotted on the vertical axis. If there is no explanatory variable, either variable can go on the horizontal axis. 2. Scale the axes. Put the name of the explanatory variable under the horizontal axis and place equally spaced tick marks along the axis beginning at a “friendly” number just below the smallest value of the explanatory variable and continuing until you exceed the largest value. Do the same for the response variable along the vertical axis. 3. Plot individual data values. For each individual, plot a point directly above that individual’s value for the explanatory variable and directly to the right of that individual’s value for the response variable. Statistics and Probability with Applications, 3 rd Edition 6 Weight training Making a scatterplot PROBLEM: The table shows the number of pounds that 10 students in Gary Lang’s weight training class could bench press and squat. Make a scatterplot to display the relationship between the bench press and squat weights for this sample. Bench press (pounds) Squat (pounds) 95 111 113 125 135 145 165 180 196 95 111 141 208 100 218 284 275 338 420 141 208 SOLUTION: It’s unclear if the bench press weight or the squat weight is the explanatory variable. Thus, a scatterplot showing bench press versus squat or showing squat versus bench press could be created. Statistics and Probability with Applications, 3 rd Edition 7 Relationships Between Two Quantitative Variables How to Describe a Scatterplot To describe a scatterplot, make sure to address the following four characteristics in the context of the data: • Direction: A scatterplot can show a positive association, negative association, or no association. In a positive association, above-average values of the explanatory variable tend to be paired with above-average values of the response variable and below-average values tend to be paired with below-average values. In a negative association, above-average values of the explanatory variable tend to be paired with below-average values of the response variable and vice-versa. • Form: A scatterplot can show a linear form or a nonlinear form. The form is linear if the overall pattern follows a straight line. Otherwise, the form is nonlinear. • Strength: A scatterplot can show a weak, moderate, or strong association. An association is strong if the points don’t deviate much from the form identified. An association is weak if the points deviate quite a bit from the form identified. • Outliers: Individual points that fall outside the overall pattern of the relationship. Statistics and Probability with Applications, 3 rd Edition 8 How fast are barefoot runners and how long can you expect to live? Describing a scatterplot PROBLEM: Scatterplot (a) displays the time in seconds to run the 40 -meter dash without shoes versus the time to run the same distance with shoes for a sample of 82 high school students. The data were collected from the website http: //www. gapminder. org. Describe the relationships shown in the scatterplot. Direction: There is a positive association between time with shoes and time without shoes for this sample of students. Students who took longer to run 40 meters with shoes also took longer to run 40 meters without shoes. Form: There is a linear pattern in the scatterplot. That is, the overall pattern follows a straight line. Strength: Because the points do not vary much from the linear pattern, the association is strong. Outliers: There are two noticeable outliers in the data. One student had a time of around 8. 4 seconds with shoes and around 5. 9 seconds without shoes. The second outlier is a student who had a time of around 10. 4 seconds with shoes and around 8. 3 seconds without shoes. Statistics and Probability with Applications, 3 rd Edition 9 How fast are barefoot runners and how long can you expect to live? Describing a scatterplot PROBLEM: Scatterplot (b) displays the life expectancy (in years) and income person (in dollars) for 179 countries in a recent year. The data were collected from the website http: //www. gapminder. org. Describe the relationships shown in the scatterplot. Direction: There is a positive association between life expectancy and income person. Countries with longer life expectancies tended to greater incomes person, and vice versa. Form: There is a curved (nonlinear) pattern in the scatterplot. Strength: Because the points do not vary much from the curved pattern, the association is strong. Statistics and Probability with Applications, 3 rd Edition 10 LESSON APP 2. 2 More sugar, more calories? Is there a relationship between the amount of sugar (in grams) and the number of calories in movie-theater candy? Here are the data from a sample of 12 types of candy. 1. Make a scatterplot to display the relationship between amount of sugar and the number of calories in movie-theater candy. 2. Describe the relationship show in the scatterplot. Statistics and Probability with Applications, 3 rd Edition 11 LESSON APP 2. 2 More sugar, more calories? 1. Make a scatterplot to display the relationship between amount of sugar and the number of calories in movie-theater candy. Statistics and Probability with Applications, 3 rd Edition 12 LESSON APP 2. 2 More sugar, more calories? 2. Describe the relationship show in the scatterplot. Statistics and Probability with Applications, 3 rd Edition 13 Relationships Between Two Quantitative Variables Learning Targets After this lesson, you should be able to: ü Distinguish between explanatory and response variables for quantitative data. ü Make a scatterplot to display the relationship between two quantitative variables. ü Describe the direction, form, and strength of a relationship displayed in a scatterplot, and identify outliers. Statistics and Probability with Applications, 3 rd Edition 14