1 Analyzing OneVariable Data Lesson 1 9 Describing
1 Analyzing One-Variable Data Lesson 1. 9 Describing Location in a Distribution Statistics and Probability with Applications, 3 rd Edition Starnes, Tabor Bedford Freeman Worth Publishers
Describing Location in a Distribution Learning Targets After this lesson, you should be able to: ü Find and interpret a percentile in a distribution of quantitative data. ü Estimate percentiles and individual values using a cumulative relative frequency graph. ü Find and interpret a standardized score (z-score) in a distribution of quantitative data. Statistics and Probability with Applications, 3 rd Edition 2
Describing Location in a Distribution Here are the scores of all 25 students in Mr. Pryor’s statistics class on their first test: 79 81 80 77 73 83 74 93 78 80 75 67 73 77 83 86 90 79 85 83 89 84 82 77 72 The bold score is Jenny’s 86. How did she perform on this test relative to her classmates? Statistics and Probability with Applications, 3 rd Edition 3
Describing Location in a Distribution One way to describe Jenny’s location in the distribution of test scores is to calculate her percentile. Percentile An individual’s percentile is the percent of values in a distribution that are less than the individual’s data value. Because 21 of the 25 observations (84%) are below her score, Jenny is at the 84 th percentile in the class’s test score distribution. Be careful with your language when describing percentiles. Percentiles are specific locations in a distribution, so an observation isn’t “in” the 84 th percentile. Rather, it is “at” the 84 th percentile. Statistics and Probability with Applications, 3 rd Edition 4
Which salads are at Mc. Donald’s? Finding and interpreting percentiles PROBLEM: The dotplot below shows the number of calories in Mc. Donald’s salads in a recent year. (a) Find the percentile for the Premium Bacon Ranch Salad with Grilled Chicken, which contains 230 calories. 6 of the 11 salads have fewer than 230 calories. Because 6/11 ≈ 0. 55, or 55%, the Premium Bacon Ranch Salad with Grilled Chicken is at the 55 th percentile of this distribution. Statistics and Probability with Applications, 3 rd Edition 5
Which salads are at Mc. Donald’s? Finding and interpreting percentiles PROBLEM: The dotplot below shows the number of calories in Mc. Donald’s salads in a recent year. (b) The Premium Bacon Ranch Salad (without chicken) is at the 18 th percentile of the distribution. Interpret this value in context. How many calories does the Premium Bacon Ranch Salad contain? About 18 percent of the salads have fewer calories than the Premium Bacon Ranch Salad (without chicken). Because 18% of 11 is (0. 18)(11) = 1. 98 (or 2 due to rounding error), the Premium Bacon Ranch Salad (without chicken) had more calories than 2 of the 11 salads on the menu. Counting up from the minimum on the dotplot, we see that the Premium Bacon Ranch Salad (without chicken) has 140 calories. Note that one other salad on the menu has 140 calories. This salad is also at the 18 th percentile because 2 of the 11 salads have fewer calories than it does. Statistics and Probability with Applications, 3 rd Edition 6
Describing Location in a Distribution There are some interesting graphs that can be made with percentiles. One of the most common starts with a frequency table for a quantitative variable and expands it to include cumulative frequency and cumulative relative frequency. Cumulative Relative Frequency Graph A cumulative relative frequency graph plots a point corresponding to the cumulative relative frequency in each interval at the smallest value of the next interval, starting with a point at a height of 0% at the smallest value of the first interval. Consecutive points are then connected with a line segment to form the graph. Statistics and Probability with Applications, 3 rd Edition 7
Describing Location in a Distribution A cumulative relative frequency graph can be used to describe the position of an individual within a distribution or to locate a specified percentile of the distribution. Statistics and Probability with Applications, 3 rd Edition 8
How fast can you run? Interpreting cumulative relative frequency graphs PROBLEM: As part of a student project, high school students were asked to sprint 50 yards. Their times were recorded. The cumulative relative frequency graph of the sprint times is shown. (a) One student ran the 50 yards in 8 seconds. Is a sprint time of 8 seconds unusually slow? To find this student’s location in the distribution, draw a vertical line up from his sprint time (8) on the horizontal axis until it meets the graph. Then draw a horizontal line from this point to the vertical axis. See the figure below. This student’s sprint time places him at about the 74 th percentile. About 74% of all students who participated had lower times than this student. So this student’s time is somewhat slow. Statistics and Probability with Applications, 3 rd Edition 9
How fast can you run? Interpreting cumulative relative frequency graphs PROBLEM: As part of a student project, high school students were asked to sprint 50 yards. Their times were recorded. The cumulative relative frequency graph of the sprint times is shown. (b) Estimate and interpret the 25 th percentile of the distribution. The 25 th percentile of the distribution is the sprint time with cumulative relative frequency 25%. To find this value, draw a horizontal line across from the vertical axis at a height of 25% until it meets the graph. Then draw a vertical line from this point down to the horizontal axis. See the figure below. The 25 th percentile is about 6. 8 seconds. About 25% of the students who participated ran the sprint in a time less than 6. 8 seconds. Statistics and Probability with Applications, 3 rd Edition 10
Describing Location in a Distribution A percentile is one way to describe the location of an individual in a distribution of quantitative data. Another way is to give the standardized score (z-score) for the observed value. Standardized Score (z-score) The standardized score (z-score) for an individual value in a distribution tells us how many standard deviations from the mean the value falls, and in what direction. To find the standardized score (zscore), compute Statistics and Probability with Applications, 3 rd Edition 11
Describing Location in a Distribution Values larger than the mean have positive z-scores. Values smaller than the mean have negative z-scores. Let’s return to the data from Mr. Pryor’s first statistics test. Where does Jenny’s 86 fall within the distribution? Her standardized score (z-score) is That is, Jenny’s test score is 0. 99 standard deviations above the mean score of the class. Statistics and Probability with Applications, 3 rd Edition 12
Are Caimans affected by pesticides? Finding and interpreting z-scores • Variable Maximum Mass 17. 50 n 14 Mean 9. 81 Statistics and Probability with Applications, 3 rd Edition St. Dev 3. 93 Minimum 6. 00 Q 1 7. 20 Median 8. 15 Q 3 11. 00 Mass 7. 2 10 6 8. 1 17. 5 8. 2 6. 2 8 15 7. 9 6 11 17 9. 2 13
LESSON APP 1. 9 Which states are rich? The following cumulative relative frequency graph and the numerical summaries describe the distribution of median household incomes in the 50 states in a recent year. 1. At what percentile is North Dakota, with a median household income of $55, 766? 2. Estimate and interpret the first quartile Q 1 of the distribution. 3. Find and interpret the standardized score (z-score) for New Jersey, with a median household income of $66, 692. Statistics and Probability with Applications, 3 rd Edition 14
LESSON APP 1. 9 Which states are rich? Statistics and Probability with Applications, 3 rd Edition 15
Describing Location in a Distribution Learning Targets After this lesson, you should be able to: ü Find and interpret a percentile in a distribution of quantitative data. ü Estimate percentiles and individual values using a cumulative relative frequency graph. ü Find and interpret a standardized score (z-score) in a distribution of quantitative data. Statistics and Probability with Applications, 3 rd Edition 16
- Slides: 16