Five Number Summary and Box Plots Five Number
Five Number Summary and Box Plots
Five Number Summary and Box Plot Basics • The Five Number Summary consists of the minimum, lower quartile (Q 1), median, upper quartile (Q 3), and maximum. • It is used to determine the variability (or the differences in data) of a data set and to construct box plots. • Box plots are used as a visual representation of the data.
Five Number Summary Definitions • Minimum: the smallest value in a data set • Lower quartile (Q 1): the 25 th percentile; 25% of the information is less than this value • Median: the 50 th percentile or middle value • Upper quartile (Q 3): the 75 th percentile; 75% of the information is less than this value • Maximum: the largest value in a data set
Example • First, you’ll break up this information into quarters. • There will be the same amount of numbers in each section. 5 6 7 8 9 9 10 11 15 18 19 20 24
Example • Find the middle number • Since this list has 13 values, the middle number would be the 7 th number. Note: If there was an even amount of values, you would find the average of the two middle numbers. 5 6 7 8 9 9 10 11 15 18 19 20 24
Example • Now that you have your halfway point, find the middle of the top and bottom sections. • There are six values in the top section so the middle value would fall between the 3 rd and 4 th values. • Find the middle value for the bottom section. 5 6 7 8 9 9 10 11 15 18 19 20 24
Example • The middle value would fall between the 10 th and 11 th values. • Now that we have our intervals set up, we can find the values for our five number summary. 5 6 7 8 9 9 10 11 15 18 19 20 24
Example • The minimum is 5. • To find the lower quartile (Q 1), find the average of 7 and 8. Q 1 is 7. 5 • The median, or middle value, is 10. • To find the upper quartile (Q 3), find the average of 18 and 19. Q 3 is 18. 5. • The maximum is 24 5 6 7 8 9 9 10 11 15 18 19 20 24
Example • You would write the five number summary for this data set as follows: Minimum = 5 Q 1 = 7. 5 Median = 10 Q 3 = 18. 5 Maximum = 24 5 6 7 8 9 9 10 11 15 18 19 20 24
Making a Box Plot Minimum = 5 • Using the five number Q 1 = 7. 5 summary, you can easily Median = 10 construct a box plot. Q 3 = 18. 5 Maximum = 24 • First, we need to make a number line. • Our minimum is 5 and • Choose your minimum, maximum is 24. What maximum, and scale should we choose as based on your five our minimum, number summary maximum, and scale for our number line?
Making a Box Plot Minimum = 5 Q 1 = 7. 5 Median = 10 Q 3 = 18. 5 Maximum = 24 • 0 would be a good choice for our minimum, 25 for our maximum with a scale of 5. • Next, we need to mark off the values for each number in the five number summary. 0 5 10 15 20 25
Making a Box Plot • Make a small dot for the minimum and maximum. • Make tick marks for Q 1, median, and Q 3. • Connect the tick marks with two lines to form a box • Connect the dots with lines to the box in the center 0 Minimum = 5 Q 1 = 7. 5 Median = 10 Q 3 = 18. 5 Maximum = 24 5 10 15 20 25
Describing a Box Plot • In a box plot, each segment represents 25% of the information. • What can you tell about the way the information is grouped based on this graph? 0 Minimum = 5 Q 1 = 7. 5 Median = 10 Q 3 = 18. 5 Maximum = 24 25% 5 10 25% 15 25% 20 25
Describing a Box Plot • Intervals that are smaller (like from the minimum to Q 1) have information that is tightly packed together. • Intervals that are larger (like from the median to Q 3) have information that is more spread out. 0 Minimum = 5 Q 1 = 7. 5 Median = 10 Q 3 = 18. 5 Maximum = 24 25% 5 10 25% 15 25% 20 25
Describing a Box Plot Minimum = 5 Q 1 = 7. 5 Median = 10 Q 3 = 18. 5 Maximum = 24 • The length of the box (from Q 1 to Q 3) represents the IQR, or interquartile range. This is the middle 50% of the data 0 50% 5 10 15 20 25
Describing a Box Plot Minimum = 5 Q 1 = 7. 5 Median = 10 Q 3 = 18. 5 Maximum = 24 • This value will help you determine the variability of a data set or to compare variability of more than one set of data. 0 50% 5 10 15 20 25
Describing a Box Plot Minimum = 5 Q 1 = 7. 5 Median = 10 Q 3 = 18. 5 Maximum = 24 • The larger the IQR, the larger the variability of the data set. The smaller the IQR, the smaller the variability. The IQR for this box plot is 11 (Q 3 -Q 1). 18. 5 – 7. 5 = 11 0 50% 5 10 15 20 25
Describing a Box Plot Minimum = 5 Q 1 = 7. 5 Median = 10 Q 3 = 18. 5 Maximum = 24 • You can also look at the length of the box to help determine the variability of the data. 0 50% 5 10 15 20 25
• About what percentage of students scored between 70 and 90 on the test depicted in the box plot above? a) 40 b) 50 c) 75 d) 90 e) Cannot be determined from the information given
Describing a Box Plot Minimum = 5 Q 1 = 7. 5 Median = 10 Q 3 = 18. 5 Maximum = 24 • Along with the five number summary, you can also talk about whether the box plot is skewed or symmetric based on the size of each interval. • Do you think this box plot is skewed right, left, or symmetric? 0 5 10 15 20 25
Describing a Box Plot Minimum = 5 Q 1 = 7. 5 Median = 10 Q 3 = 18. 5 Maximum = 24 • This box plot is skewed to the right because the intervals between the median and Q 3 and the interval between Q 3 and the maximum are very spread out. 0 5 10 15 20 25
Describing a Box Plot • Here is the correct description of this box plot Minimum = 5 Q 1 = 7. 5 Median = 10 Q 3 = 18. 5 Maximum = 24 IQR = 11 Range = 19 Skewed to the right 0 5 10 15 20 25
Complete the following problem and turn in to the teacher • The following are scores from 20 students on a unit test in mathematics. 1. 2. 3. 4. 75 92 62 78 85 77 93 65 80 90 65 50 70 57 98 45 54 73 84 65 Find the 5 number summary Find the interquartile range (IQR). Make a box plot for this information. Describe the distribution.
- Slides: 23