Summarizing Quantitative Data • Frequency Distribution • Relative Frequency and Percent Frequency Distributions • Histogram • Cumulative Distributions • Ogive Constructing a Frequency Distribution for Quantitative Data 3 initial steps 1. Determine the number of nonoverlapping classes. 2. Determine the width of each class. 3. Determine the class limits Example: Hudson Auto Repair The manager of Hudson Auto would like to have a better understanding of the cost of parts used in the engine tune-ups performed in the shop. She examines 50 customer invoices for tune-ups. The costs of parts, rounded to the nearest dollar, are listed on the next slide. Example: Hudson Auto Repair Sample of Parts Cost for 50 Tune-ups 91 71 104 85 62 78 69 74 97 82 93 72 62 88 98 57 89 68 68 101 75 66 97 83 79 52 75 105 68 105 99 79 77 71 79 80 75 65 69 69 97 72 80 67 62 62 76 109 74 73 Frequency Distribution • Guidelines for Selecting Number of Classes • Use between 5 and 20 classes. • Data sets with a larger number of elements usually require a larger number of classes. • Smaller data sets usually require fewer classes Frequency Distribution • Guidelines for Selecting Width of Classes •Use classes of equal width. •Approximate Class Width = Largest Data Value Smallest Data Value Number of Classes Frequency Distribution •We decide that SIX (6) classes are appropriate for our purposes •Thus the approximate class width is given by: Approximate Class Width = (109 - 52)/6 = 9.5 10 Thus we have: Parts Cost($) Frequency 50-59 2 60-69 13 70-79 16 80-89 7 90-99 7 100-109 5 Total 50 Using Excel’s Frequency Function to Construct a Frequency Distribution Formula Worksheet (showing data entered) 1 2 3 4 5 6 7 8 A B C Parts Parts Cost Cost 91 50-59 71 60-69 104 70-79 85 80-89 62 90-99 78 100-109 69 Note: Rows 9-51 are not shown. D Frequency Using Excel’s Frequency Function to Construct a Frequency Distribution Formula Worksheet 1 2 3 4 5 6 7 8 A B C Parts Parts Cost Cost 91 50-59 71 60-69 104 70-79 85 80-89 62 90-99 78 100-109 69 D Frequency =FREQUENCY(A2:A51,{59,69,79,89,99,109}) =FREQUENCY(A2:A51,{59,69,79,89,99,109}) =FREQUENCY(A2:A51,{59,69,79,89,99,109}) =FREQUENCY(A2:A51,{59,69,79,89,99,109}) =FREQUENCY(A2:A51,{59,69,79,89,99,109}) =FREQUENCY(A2:A51,{59,69,79,89,99,109}) Note: Rows 9-51 are not shown. Relative Frequency and Percent Frequency Parts Cost ($) Relative Frequency Percent Frequency 50-59 .04 4 2/50 60-69 .26 70-79 .32 32 80-89 .14 14 90-99 .14 14 100-109 .10 10 1.00 100 Total 26 .04(100 ) Using Excel’s Frequency Function to Construct a Frequency Distribution Value Worksheet 1 2 3 4 5 6 7 8 A B C Parts Parts Cost Cost 91 50-59 71 60-69 104 70-79 85 80-89 62 90-99 78 100-109 69 Note: Rows 9-51 are not shown. D Frequency 2 13 16 7 7 5 Audit example •5 classes should be sufficient, given the size of our data set. •Approximate class width is given by: 33 - 12 Approximat e Class Width 4.2 5 We round up to the next highest integer, so our class interval is 5. Using Excel’s Frequency Function to Construct a Frequency Distribution 1. Step 1: Select cells D2:D6, the cells in which we want the frequencies to appear 2. Step 2: Type the following formula in the formula bar: =FREQUENCY(A2:A21,{14,19,24,29,34}) 3. Step 3: Press Ctrl-Shift-Enter The Audit Example Audit Time Frequency Relative Frequency Percent Frequency 10-14 4 0.2 20 15-19 8 0.4 40 20-24 5 0.25 25 25-29 2 0.1 10 30-34 1 0.05 5 20 1 100 Total The Histogram This is a bar graph of a frequency distribution for quantitative data. •The variable of interest is measured on the horizontal axis. •Frequency, relative frequency, or percent frequency for each class is measured on the vertical axis. •There are no spaces between the bars on a histogram—the rectangles are adjacent. Histogram Tune-up Parts Cost 18 16 Frequency 14 12 10 8 6 4 2 50 60 70 80 90 100 110 Parts 120 Cost ($) Using Excel’s Chart Wizard to Construct a Histogram: Audit Example 1. Step 1: Select cells C1:D6. 2. Click the Chart Wizard button. 3. When Chart Type dialog box appears, select Clustered Column from the Chart sub-type display. Click Next>. 4. At Step 2 —Chart Data Source —Click Next>. 5. At Step 3 – Chart Options – select Titles Tab and then type Histogram for Audit Time Data in the Chart title box. Type Audit Time in Days in the Category (X) axis box and type Frequency in the Value (Y) axis box. Select the Legend tab and then remove the check in the Show Legend box. Click Next>. 6. Step 6: Click Finish Removing the Gaps Between Rectangles (bars) • Step 1: Click right on any rectangle (bar) in the chart to produce a list of options. • Step 2: Select the Format Data Series option • Step 3: Select the Options tab and then enter 0 in the Gap width box. Click OK It looks like this: Histogram for Audit Time Data 10 Frequency 8 6 4 2 0 10-14 15-19 20-24 Audit Time in Days 25-29 30-34 Cumulative Distributions These show the number of data items with values less than or equal to the class limit of each class. •Cumulative relative frequency distributions show the proportion of data items with values less than or equal to the class limit of each class. •Cumulative frequency distributions show the proportion of data items with values less than or equal to the class limit of each class. Cumulative Distributions • Hudson Auto Repair Cost ($) < 59 < 69 < 79 < 89 < 99 < 109 Cumulative Cumulative Relative Frequency Frequency 2 15 31 38 45 50 .04 .30 2 + 13 .62 .76 .90 1.00 Cumulative Percent Frequency 4 30 15/50 62 76 90 100 .30(100) Ogive with Cumulative Percent Frequencies Cumulative Percent Frequency Tune-up Parts Cost 100 80 60 (89.5, 76) 40 20 50 60 70 80 90 100 110 Parts Cost ($) Ogive for Audit Example Ogive for Audit Time Data Frequency 25 20 15 10 5 0 1-14 15-19 20-24 25-29 Audit Time in Days 30-34 Cumulative Percent Frequency for the Audit Example Cumulative Percent Frequency 120 100 80 60 40 20 0 1-14 15-19 20-24 Audit Time in Days 25-29 30-34