Other Types of Graphs
Lecture 4, STAT 2246
Julien Dompierre
Département de mathématiques et d'informatique, Université Laurentienne
January 14, 2007, Sudbury

Outline

1 Other Types of Graphs
    Pareto Charts
    Time Series Graphs
    Pie Graphs
    Misleading Graphs
    Stem and Leaf Plots
    Paired Data and Scatter Plots

Vilfredo Pareto

Vilfredo Federico Damaso Pareto (b. July 15, 1848, Paris – d. August 19, 1923, Geneva) was an Italian sociologist, economist and philosopher. In 1906 he made the famous observation that twenty percent of the population owned eighty percent of the property in Italy. Joseph M. Juran and others later generalised this into the so-called Pareto principle (also termed the 80-20 rule), and it was generalised further into the concept of a Pareto distribution.

Pareto Chart (p. 64)

A Pareto chart can be used when the data can be placed in specific categories, that is, when the variable is qualitative or categorical. It represents a frequency distribution for a categorical variable: the frequencies are displayed by the heights of vertical bars, which are arranged in order from highest to lowest.

Example of a Pareto Chart (p.
64)

The table shown here gives the average cost per mile for passenger vehicles on state turnpikes.

State          Cost per mile
Indiana        2.9¢
Oklahoma       4.3¢
Florida        6.0¢
Maine          3.8¢
Pennsylvania   5.8¢

Drawing a Pareto Chart

1. Arrange the data from largest to smallest according to frequency.
2. Make the bars the same width.
3. Make the spaces between the bars equal.
4. Make the units that are used for the frequency equal in size.
5. Draw the bars corresponding to the frequencies.

Time Series Graph

A time series graph represents data that occur over a specific period of time.

Step 1: Draw and label the x and y axes.
Step 2: Label the x axis for the time and the y axis for the data.
Step 3: Plot each point according to the table.
Step 4: Draw line segments connecting adjacent points. Do not try to fit a smooth curve through the data points.

Example (Exercise 6, p. 78)

Draw a time series graph to represent the data for the number of airline departures (in millions) for the given years.

Year   Number of departures (millions)
1996   7.9
1997   9.9
1998   10.5
1999   10.9
2000   11.0
2001   9.8
2002   10.1

Compound Time Series Graph (p.
66)

Two data sets (or more) can be compared on the same graph (called a compound time series graph) if two lines (or more) are used.

(Exercise 20, p. 79) The number of successful space launches by the United States and Japan for the years 1993-1997.

Year   U.S.   Japan
1993   29     1
1994   27     4
1995   24     2
1996   32     1
1997   37     2

The Pie Graph (p. 67)

A pie graph is a circle that is divided into sections or wedges according to the percentage of frequencies in each category of the distribution.

The purpose of the pie graph is to show the relationship of the parts to the whole by visually comparing the sizes of the sections. Percentages or proportions can be used. The variable is nominal or categorical.

Steps to Construct a Pie Graph (p. 67–68)

Step 1: Since there are 360 degrees in a circle, the frequency for each class must be converted into a proportional part of the circle. This conversion is done by using the formula

\[ \text{Degrees} = \frac{f}{n} \cdot 360^\circ \]

where f = frequency for each class and n = sum of the frequencies. The degrees should sum to 360.

Steps to Construct a Pie Graph (p.
67–68)

Step 2: Each frequency must also be converted to a percentage using the formula

\[ \% = \frac{f}{n} \cdot 100\% \]

Step 3: Next, using a protractor and a compass, draw the graph using the appropriate degree measures found in Step 1, and label each section with the name and percentage.

Misleading Graphs (p. 70)

Graphs give a visual representation that enables readers to analyze and interpret data more easily than they could simply by looking at numbers. However, inappropriately drawn graphs can misrepresent the data and lead the reader to false conclusions.

When the graph is redrawn using a scale that goes from 0 to 100%, there is hardly a noticeable difference in the percentages. Thus, changing the units at the starting point on the y axis can convey a very different visual representation of the data.

Misleading Graphs (p. 71) [figure]

Misleading Graphs (p. 72) [figure]

Misleading Graphs – Bad Scaling (p.
72)

Another misleading graphing technique sometimes used involves exaggerating a one-dimensional increase by showing it in two dimensions.

Misleading Graphs – Bad Labelling (p. 72)

Another way to misrepresent data on a graph is by omitting labels or units on the axes of the graph. Graphs and charts should always be labeled and titled. On all graphs and charts, equal spaces should always represent equal amounts. Finally, all graphs should contain a source for the information presented. The inclusion of a source for the data will enable you to check the reliability of the organization presenting the data.

Stem and Leaf Plots (p. 73)

A stem and leaf plot is a data plot that uses part of the data value as the stem and part of the data value as the leaf to form groups or classes.

The stem and leaf plot is a method of organizing data and is a combination of sorting and graphing. It has the advantage over a grouped frequency distribution of retaining the actual data while showing them in graphical form.

Steps to Construct a Stem and Leaf Plot (p. 74)

Step 1: Arrange the data in order. (Note: Arranging the data in order is not essential and can be cumbersome when the data set is large; however, it is helpful in constructing a stem and leaf plot.)
The leaves in the final stem and leaf plot should be arranged in order.
Step 2: Separate the data according to the first digit.
Step 3: A display can be made by using the leading digit as the stem and the trailing digit as the leaf.

Back-to-Back Stem and Leaf Plot (p. 76)

Related distributions can be compared by using a back-to-back stem and leaf plot. The back-to-back stem and leaf plot uses the same digits for the stems of both distributions, but the digits that are used for the leaves are arranged in order out from the stems on both sides.

Atlanta        | Philadelphia
---------------+---------------
55 70 44 36 40 | 61 40 38 32 30
63 40 44 34 38 | 58 40 40 25 30
60 47 52 32 32 | 54 40 36 30 30
50 53 32 28 31 | 53 39 36 34 33
52 32 34 32 50 | 50 38 36 39 32
26 29          |

   Atlanta |   | Philadelphia
-----------+---+----------------
       986 | 2 | 5
8644222221 | 3 | 000022346668899
     74400 | 4 | 0000
    532200 | 5 | 0348
        30 | 6 | 1
         0 | 7 |

Paired Data and Scatter Plots (p. 86)

Many times researchers are interested in determining if a relationship between two variables exists.
In order to accomplish this, the researcher collects data consisting of two measures that are paired with each other. The first measure is called the independent variable and is designated as x; the second is called the dependent variable and is designated as y. Once a set of ordered pairs (x, y) of data values is collected, a graph can be drawn to represent the data. This graph is called a scatter plot or scatter diagram.

A scatter plot is a graph of ordered pairs of data values that is used to determine if a relationship exists between the two variables.

Analyzing the Scatter Plot (p. 87)

A positive linear relationship exists when the points fall approximately in an ascending straight line from left to right, so that the x and y values increase together.

A negative linear relationship exists when the points fall approximately in a descending straight line from left to right.

A nonlinear relationship exists when the points fall in a curved line. The relationship is described by the nature of the curve.

No relationship exists when there is no discernible pattern of the points.
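
Example: a rough text scatter plot

The idea of graphing ordered pairs (x, y) can be tried out without any plotting software. The Python sketch below is only an illustration and is not part of the textbook material: the function name scatter_rows, the grid size, and the data (hours studied against exam score) are all invented for this example. Each pair becomes one "*" on a character grid, with x increasing to the right and y increasing upward.

```python
def scatter_rows(pairs, width=21, height=11):
    """Render ordered pairs (x, y) as rows of text, top row first.

    Each data point becomes a '*' on a width-by-height character grid.
    The x values run left to right and the y values run bottom to top.
    """
    xs = [x for x, _ in pairs]
    ys = [y for _, y in pairs]
    x_min, x_max = min(xs), max(xs)
    y_min, y_max = min(ys), max(ys)

    def scale(value, low, high, cells):
        # Map value from [low, high] onto a cell index 0 .. cells - 1.
        if high == low:
            return 0
        return round((value - low) / (high - low) * (cells - 1))

    grid = [[" "] * width for _ in range(height)]
    for x, y in pairs:
        col = scale(x, x_min, x_max, width)
        row = scale(y, y_min, y_max, height)
        # Row 0 is printed first, so flip the y index to put y_max at the top.
        grid[height - 1 - row][col] = "*"

    # Draw a y axis on the left of every row and an x axis underneath.
    return ["|" + "".join(line) for line in grid] + ["+" + "-" * width]


# Hypothetical paired data, invented for illustration only:
# x = hours studied, y = exam score.
pairs = [(1, 52), (2, 58), (3, 61), (4, 70), (5, 74), (6, 83)]

for line in scatter_rows(pairs):
    print(line)
```

With this made-up data the stars climb from the lower left to the upper right, which is the pattern described above as a positive linear relationship. Replacing the pairs with data whose y values fall as x grows should produce a descending pattern instead.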