PubH 6414 Worksheet 4a: Relationships Between Two Numerical Variables 1 of 2 Example 1: Scatterplot and Correlation Coefficient Data below are the heights of mothers (X) and their daughters (Y) in inches. Source: U.S. Department of Health and Human Services, National Center for Health Statistics X 63 67 64 60 65 67 59 60 58 72 63 Y 59 65 65 61 65 67 61 63 60 71 62 1. Plot the (X,Y) points on the graph Daughter's height in inches 80 75 70 65 60 55 50 50 55 60 65 70 75 80 Mother's height in inches 2. Does there appear to be a linear relationship between mother and daughter heights? Yes—A positive linear relationship 3. Is this a deterministic relationship or a statistical relationship? Statistical—all of the points are not on a straight line 4. Will the correlation coefficient for the relationship between mother’s and daughter’s heights be positive or negative? Positive—increasing height of the mother is associated with increasing height of the daughter PubH 6414 Worksheet 4a: Relationships Between Two Numerical Variables 2 of 2 5. Which of the following correlation coefficients represents the strongest linear relationship? a. 0.76 b. 0.24 c. -0.89 this value is closest to either 1 or -1 d. -0.03 Example 2: Correlation Coefficient and Coefficient of Determination A study enrolling 40 women found that the correlation coefficient between cholesterol (y) and weight (x) was 0.493. A scatter plot of the data did not indicate any outliers or curvilinear relationship. 1. Describe the direction and strength of the linear relationship Positive with a fair degree of linear relationship 2. What proportion of variation in cholesterol is explained by the linear association between cholesterol and weight? the coefficient of determination = R2 = (0.493)2 = 0.243 24.3% of the variation in cholesterol is explained by the linear association between cholesterol and weight 3. As a result of the correlation coefficient or coefficient of determination, can we conclude that change in cholesterol is caused by change in weight? No—the correlation coefficient does not provide information about causality 4. Indicate the direction of the scatter of points on a scatter plot of the data for this analysis: a. upper left to lower right b. lower left to upper right (increasing weight is associated with increasing cholesterol) c. no evident pattern d. increasing from lower left, then decreasing