2.6 Before Now Why? Key Vocabulary • scatter plot • positive correlation • negative correlation • correlation Draw Scatter Plots and Best-Fitting Lines You wrote equations of lines. You will fit lines to data in scatter plots. So you can model sports trends, as in Ex. 27. A scatter plot is a graph of a set of data pairs (x, y). If y tends to increase as x increases, then the data have a positive correlation. If y tends to decrease as x increases, then the data have a negative correlation. If the points show no obvious pattern, then the data have approximately no correlation. y y coefficient • best-fitting line y x x Positive correlation x Approximately no correlation Negative correlation EXAMPLE 1 Describe correlation TELEPHONES Describe the correlation shown by each scatter plot. y 160 120 80 40 0 Cellular Phone Subscribers and Corded Phone Sales, 1995–2003 Corded phone sales (millions of dollars) Cellular service regions (thousands) Cellular Phone Subscribers and Cellular Service Regions, 1995–2003 0 40 80 120 160 x Subscribers (millions) y 550 450 350 250 0 40 80 120 160 x Subscribers (millions) Solution The first scatter plot shows a positive correlation, because as the number of cellular phone subscribers increased, the number of cellular service regions tended to increase. The second scatter plot shows a negative correlation, because as the number of cellular phone subscribers increased, corded phone sales tended to decrease. 2.6 Draw Scatter Plots and Best-Fitting Lines n2pe-0206.indd 113 113 10/20/05 10:06:57 AM CORRELATION COEFFICIENTS A correlation coefficient, denoted by r, is a number from 21 to 1 that measures how well a line fits a set of data pairs (x, y). If r is near 1, the points lie close to a line with positive slope. If r is near 21, the points lie close to a line with negative slope. If r is near 0, the points do not lie close to any line. r 5 21 Points lie near line with a negative slope. EXAMPLE 2 r50 r51 Points do not lie near any line. Points lie near line with positive slope. Estimate correlation coefficients Tell whether the correlation coefficient for the data is closest to 21, 20.5, 0, 0.5, or 1. a. b. y 150 c. y 150 y 150 100 100 100 50 50 50 0 0 2 4 0 6x 0 2 4 0 6x 0 2 4 6x Solution a. The scatter plot shows a clear but fairly weak negative correlation. So, r is between 0 and 21, but not too close to either one. The best estimate given is r 5 20.5. (The actual value is r ø 20.46.) b. The scatter plot shows approximately no correlation. So, the best estimate given is r 5 0. (The actual value is r ø 20.02.) c. The scatter plot shows a strong positive correlation. So, the best estimate given is r 5 1. (The actual value is r ø 0.98.) ✓ GUIDED PRACTICE for Examples 1 and 2 For each scatter plot, (a) tell whether the data have a positive correlation, a negative correlation, or approximately no correlation, and (b) tell whether the correlation coefficient is closest to 21, 20.5, 0, 0.5, or 1. 1. 2. y 3. y y 100 100 100 50 50 50 0 0 2 4 6x 0 0 2 4 6x 0 0 2 4 6x BEST-FITTING LINES If the correlation coefficient for a set of data is near 61, the data can be reasonably modeled by a line. The best-fitting line is the line that lies as close as possible to all the data points. You can approximate a best-fitting line by graphing. 114 n2pe-0206.indd 114 Chapter 2 Linear Equations and Functions 10/20/05 10:06:59 AM For Your Notebook KEY CONCEPT Approximating a Best-Fitting Line Draw a scatter plot of the data. STEP 1 STEP 2 Sketch the line that appears to follow most closely the trend given by the data points. There should be about as many points above the line as below it. STEP 3 Choose two points on the line, and estimate the coordinates of each point. These points do not have to be original data points. STEP 4 Write an equation of the line that passes through the two points from Step 3. This equation is a model for the data. EXAMPLE 3 Approximate a best-fitting line ALTERNATIVE-FUELED VEHICLES The table shows the number y (in thousands) of alternative-fueled vehicles in use in the United States x years after 1997. Approximate the best-fitting line for the data. x 0 1 2 3 4 5 6 7 y 280 295 322 395 425 471 511 548 Solution y 550 STEP 1 Draw a scatter plot of the data. (7, 548) STEP 2 Sketch the line that appears to STEP 3 Choose two points that appear to lie on the line. For the line shown, you might choose (1, 300), which is not an original data point, and (7, 548), which is an original data point. STEP 4 Write an equation of the line. First find the slope using the points (1, 300) and (7, 548). Number of vehicles (thousands) best fit the data. One possibility is shown. 500 450 400 350 300 250 0 (1, 300) 0 2 4 6 8 x Years since 1997 548 2 300 5 248 ø 41.3 m5} } 721 6 Use point-slope form to write the equation. Choose (x1, y1) 5 (1, 300). y 2 y1 5 m(x 2 x1) y 2 300 5 41.3(x 2 1) y ø 41.3x 1 259 Point-slope form Substitute for m, x1, and y1. Simplify. c An approximation of the best-fitting line is y 5 41.3x 1 259. "MHFCSB at classzone.com 2.6 Draw Scatter Plots and Best-Fitting Lines n2pe-0206.indd 115 115 10/20/05 10:07:00 AM EXAMPLE 4 Use a line of fit to make a prediction Use the equation of the line of fit from Example 3 to predict the number of alternative-fueled vehicles in use in the United States in 2010. Solution Because 2010 is 13 years after 1997, substitute 13 for x in the equation from Example 3. y 5 41.3x 1 259 5 41.3(13) 1 259 ø 796 c You can predict that there will be about 796,000 alternative-fueled vehicles in use in the United States in 2010. LINEAR REGRESSION Many graphing calculators have a linear regression feature that can be used to find the best-fitting line for a set of data. EXAMPLE 5 Use a graphing calculator to find a best-fitting line Use the linear regression feature on a graphing calculator to find an equation of the best-fitting line for the data in Example 3. Solution FIND CORRELATION If your calculator does not display the correlation coefficient r when it displays the regression equation, you may need to select DiagnosticOn from the CATALOG menu. STEP 1 Enter the data into two lists. Press and then select Edit. Enter years since 1997 in L1 and number of alternative-fueled vehicles in L 2. L1 L2 0 280 1 295 2 322 3 395 4 425 L1(2)=1 L3 STEP 3 Make a scatter plot of the data pairs to see how well the regression equation models the data. Press [STAT PLOT] to set up your plot. Then select an appropriate window for the graph. STEP 2 Find an equation of the bestfitting (linear regression) line. Press , choose the CALC menu, and select LinReg(ax1b). The equation can be rounded to y 5 40.9x 1 263. LinReg y=ax+b a=40.86904762 b=262.83333333 r=.9929677507 STEP 4 Graph the regression equation with the scatter plot by entering the equation y 5 40.9x 1 263. The graph (displayed in the window 0 ≤ x ≤ 8 and 200 ≤ y ≤ 600) shows that the line fits the data well. Plot1 Plot2 Plot3 On Off Type XList:L1 YList:L2 Mark: + c An equation of the best-fitting line is y 5 40.9x 1 263. 116 n2pe-0206.indd 116 Chapter 2 Linear Equations and Functions 10/20/05 10:07:01 AM ✓ GUIDED PRACTICE for Examples 3, 4, and 5 4. OIL PRODUCTION The table shows the U.S. daily oil production y (in thousands of barrels) x years after 1994. x 0 1 2 3 4 5 6 7 8 y 6660 6560 6470 6450 6250 5880 5820 5800 5750 a. Approximate the best-fitting line for the data. b. Use your equation from part (a) to predict the daily oil production in 2009. c. Use a graphing calculator to find and graph an equation of the best-fitting line. Repeat the prediction from part (b) using this equation. 2.6 EXERCISES HOMEWORK KEY 5 WORKED-OUT SOLUTIONS on p. WS1 for Exs. 9, 11, and 25 ★ 5 STANDARDIZED TEST PRACTICE Exs. 2, 16, 18, 21, and 28 5 MULTIPLE REPRESENTATIONS Ex. 27 SKILL PRACTICE 1. VOCABULARY Copy and complete: A line that lies as close as possible to a set of data points (x, y) is called the ? for the data points. 2. ★ WRITING Describe how to tell whether a set of data points shows a positive correlation, a negative correlation, or approximately no correlation. EXAMPLE 1 on p. 113 for Exs. 3–5 DESCRIBING CORRELATIONS Tell whether the data have a positive correlation, a negative correlation, or approximately no correlation. 3. y 5. y 60 30 6 40 20 4 20 10 2 4. y 0 0 2 4 6 0 8x 0 2 4 6 0 8x 0 2 4 8x 6 6. REASONING Explain how you can determine the type of correlation for a set of data pairs by examining the data in a table without drawing a scatter plot. EXAMPLE 2 on p. 114 for Exs. 7–9 CORRELATION COEFFICIENTS Tell whether the correlation coefficient for the data is closest to 21, 20.5, 0, 0.5, or 1. 7. y 60 8. y 60 9. y 60 40 40 40 20 20 20 0 0 2 4 6 8 x 0 0 2 4 6 8 x 0 0 2 4 6 2.6 Draw Scatter Plots and Best-Fitting Lines n2pe-0206.indd 117 8 x 117 10/20/05 10:07:01 AM EXAMPLES 3 and 4 on pp. 115–116 for Exs. 10–15 BEST-FITTING LINES In Exercises 10–15, (a) draw a scatter plot of the data, (b) approximate the best-fitting line, and (c) estimate y when x 5 20. 10. 12. 14. 11. x 1 2 3 4 5 y 10 22 35 49 62 x 12 25 36 50 64 y 100 75 52 26 9 x 5.6 6.2 7 7.3 8.4 y 120 130 141 156 167 13. 15. x 1 2 3 4 5 y 120 101 87 57 42 x 3 7 10 15 18 y 16 45 82 102 116 x 16 24 39 55 68 y 3.9 3.7 3.4 2.9 2.6 16. ★ MULTIPLE CHOICE Which equation best y models the data in the scatter plot? 20 A y 5 15 1 x 1 26 B y 5 2} 2 10 2 x 1 19 C y 5 2} 4 x 1 33 D y 5 2} 0 5 5 0 10 20 x 30 17. ERROR ANALYSIS The graph shows one y student’s approximation of the bestfitting line for the data in the scatter plot. Describe and correct the error in the student’s work. 40 20 2 4 6 x 8 18. ★ MULTIPLE CHOICE A set of data has correlation coefficient r. For which value of r would the data points lie closest to a line? A r 5 20.96 EXAMPLE 5 on p. 116 for Exs. 19–20 B r50 C r 5 0.38 D r 5 0.5 GRAPHING CALCULATOR In Exercises 19 and 20, use a graphing calculator to find and graph an equation of the best-fitting line. 19. 20. x 78 74 68 76 80 84 50 76 55 93 y 5.1 5.0 4.6 4.9 5.3 5.5 3.7 5.0 3.9 5.8 x 7000 7400 7800 8100 8500 8800 9200 9500 9800 y 56.0 54.5 51.9 50.0 47.3 45.6 43.1 41.6 39.9 21. ★ OPEN-ENDED MATH Give two real-life quantities that have (a) a positive correlation, (b) a negative correlation, and (c) approximately no correlation. 22. REASONING A set of data pairs has correlation coefficient r 5 0.1. Is it logical to use the best-fitting line to make predictions from the data? Explain. 23. CHALLENGE If x and y have a positive correlation and y and z have a negative correlation, what can you say about the correlation between x and z? Explain. 118 n2pe-0206.indd 118 5 WORKED-OUT SOLUTIONS Chapter 2 Linear and Functions on p. Equations WS1 ★ 5 STANDARDIZED TEST PRACTICE 5 MULTIPLE REPRESENTATIONS 10/20/05 10:07:02 AM PROBLEM SOLVING GRAPHING CALCULATOR You may wish to use a graphing calculator to complete the following Problem Solving exercises. EXAMPLES 3, 4, and 5 on pp. 115–116 for Exs. 24–28 24. POPULATION The data pairs (x, y) give the population y (in millions) of Texas x years after 1997. Approximate the best-fitting line for the data. (0, 19.7), (1, 20.2), (2, 20.6), (3, 20.9), (4, 21.3), (5, 21.7), (6, 22.1), (7, 22.5) GPSQSPCMFNTPMWJOHIFMQBUDMBTT[POFDPN 25. TUITION The data pairs (x, y) give U.S. average annual public college tuition y (in dollars) x years after 1997. Approximate the best-fitting line for the data. (0, 2271), (1, 2360), (2, 2430), (3, 2506), (4, 2562), (5, 2727), (6, 2928) GPSQSPCMFNTPMWJOHIFMQBUDMBTT[POFDPN 26. PHYSICAL SCIENCE The diagram shows the boiling point of water at various elevations. Approximate the best-fitting line for the data pairs (x, y) where x represents the elevation (in feet) and y represents the boiling point (in degrees Fahrenheit). Then use this line to estimate the boiling point at an elevation of 14,000 feet. 27. MULTIPLE REPRESENTATIONS The table shows the numbers of countries that participated in the Winter Olympics from 1980 to 2002. Year Countries 1980 1984 1988 1992 1994 1998 2002 37 49 57 64 67 72 77 a. Making a List Use the table to make a list of data pairs (x, y) where x represents years since 1980 and y represents the number of countries. b. Drawing a Graph Draw a scatter plot of the data pairs from part (a). c. Writing an Equation Write an equation that approximates the best-fitting line, and use it to predict the number of participating countries in 2014. 28. ★ EXTENDED RESPONSE The table shows manufacturers’ shipments (in millions) of cassettes and CDs in the United States from 1988 to 2002. Year 1988 1990 1992 1994 1996 1998 2000 2002 Cassettes 450.1 442.2 336.4 345.4 225.3 158.5 76.0 31.1 CDs 149.7 286.5 407.5 662.1 778.9 847.0 942.5 803.3 a. Draw a scatter plot of the data pairs (year, shipments of cassettes). Describe the correlation shown by the scatter plot. b. Draw a scatter plot of the data pairs (year, shipments of CDs). Describe the correlation shown by the scatter plot. c. Describe the correlation between cassette shipments and CD shipments. What real-world factors might account for this? 2.6 Draw Scatter Plots and Best-Fitting Lines n2pe-0206.indd 119 119 10/20/05 10:07:03 AM 29. CHALLENGE Data from some countries in North America show a positive correlation between the average life expectancy in a country and the number of personal computers per capita in that country. a. Make a conjecture about the reason for the positive correlation between life expectancy and number of personal computers per capita. b. Is it reasonable to conclude from the data that giving residents of a country more personal computers will lengthen their lives? Explain. MIXED REVIEW Solve the equation for y. Then find the value of y for the given value of x. (p. 26) 30. 2x 2 y 5 10; x 5 8 31. 6y 1 x 5 25; x 5 1 32. x 2 4y 5 3; x 5 23 33. 23x 1 4y 1 5 5 0; x 5 22 34. 20.5y 1 0.25x 5 2; x 5 4 35. xy 2 4x 5 9; x 5 6 Evaluate the function for the given value of x. (p. 72) PREVIEW Prepare for Lesson 2.7 in Exs. 36–41. 36. f(x) 5 2x 1 7; f(9) 37. f(x) 5 24x 2 11; f(25) 38. f(x) 5 14 2 x; f (22) 39. f(x) 5 x 2 10; f(10) 40. f(x) 5 26 2 x; f (4) 41. f(x) 5 2x 1 8 2 1; f (23) 42. y 5 x 1 8 43. y 5 2x 2 14 44. y 5 5x 1 9 45. 2x 1 y 5 1 46. 3x 2 2y 5 24 47. x 1 3y 5 15 Graph the equation. (p. 89) QUIZ for Lessons 2.4–2.6 Write an equation of the line that satisfies the given conditions. (p. 98) 1. m 5 25, b 5 3 2. m 5 2, b 5 12 3. m 5 4, passes through (23, 6) 4. m 5 27, passes through (1, 24) 5. passes through (0, 7) and (23, 22) 6. passes through (29, 9) and (29, 0) Write and graph a direct variation equation that has the given ordered pair as a solution. (p. 107) 7. (1, 2) 8. (22, 8) 9. (5, 216) 10. (12, 4) The variables x and y vary directly. Write an equation that relates x and y. Then find y when x 5 8. (p. 107) 11. x 5 4, y 5 12 12. x 5 23, y 5 28 13. x 5 40, y 5 25 14. x 5 12, y 5 2 15. CONCERT TICKETS The table shows the average price of a concert ticket to one of the top 50 musical touring acts for the years 1999–2004. Write an equation that approximates the best-fitting line for the data pairs (x, y). Use the equation to predict the average price of a ticket in 2010. (p. 113) Years since 1999, x Ticket price (dollars), y 120 n2pe-0206.indd 120 0 1 2 3 4 5 38.56 44.80 46.69 50.81 51.81 58.71 PRACTICE Lesson 2.6, p. 1011 Chapter 2EXTRA Linear Equations andfor Functions ONLINE QUIZ at classzone.com 10/20/05 10:07:04 AM