# Stat Facilitated Study Group (FSG) 14.10.2011 STAT FSG Times
```
Stat Facilitated Study Group (FSG)
14.10.2011
STAT FSG Times: Tuesdays 1 p.m. to 2 p.m.
Location: Room AC219
Fridays 11 a.m. to 12 p.m.
Location: Portable 04 Room 101
1. Make sure you know the meaning of:
a) Regression line
b) Residual
c) Correlation (r)
d) R square (R-sq)
e) Coefficient of determination (r²)
f) Least-squares regression line
g) Why square? What are the sum and the mean of the least-squares residuals?
h) Extrapolation
i) Lurking variable
j) Why can't we use a high correlation to conclude that one variable causes the other?
2. Which statement about the scatterplot below is true?
a) Its form is curved
b) It is right skewed
c) Correlation coefficient is negative
d) Correlation coefficient is 0.1
3. A researcher expects a relationship between two variables, but finds that the correlation
between them is close to zero. The researcher has plenty of data. What is a possible
explanation?
(a) The relationship is a curve
(b) The relationship is strongly linear but the correlation happened to come out close to
zero.
(c) The relationship is a curved upward trend.
(d) There cannot actually be a relationship between the variables if the correlation is
close to zero.
4. A researcher studies children in school and finds a strong positive linear association
between height and reading ability. What would the researcher's best conclusion be?
(a) The observed association was an accident.
(c) There is a lurking variable that explains the correlation.
(d) Height and reading ability are confounded.
5. You are given a set of data and the regression line Y = 5X - 10. Calculate the residual for X = 3.
X | … | -1 |  3 |  6 | …
Y | … |  0 | 12 | 24 | …
6. The following statistical output was obtained from a regression analysis of the relationship between two variables x and y:
Regression Analysis: y versus x
The regression equation is y = 104 - 0.887 x
S = 1.56848
R-Sq = 27.0%
What is the correlation between x and y? Choose the closest value.
A) 0.27
B) -0.89
C) -0.27
D) 0.52
E) -0.52
7. A researcher wishes to determine whether the rate of water flow (in liters per
second) over an experimental soil bed can be used to predict the amount of soil washed
away (in kilograms). The researcher measures the amount of soil washed away for
various flow rates, and from these data calculates the least-squares regression line to be
amount of eroded soil = 0.4 + 1.3 × (flow rate).
What do we know about the correlation between amount of eroded soil and flow rate?
A) r = 1/1.3
B) r = 0.4
C) r = 1.3 � 0.4
D) It should be positive, but we cannot determine the exact value with the information given.
E) It should be negative, but we cannot determine the exact value with the information given.
8. One of the flow rates used by the researcher in question 7 above was 0.3 liters per
second and for this flow rate the amount of eroded soil was 0.8 kilograms. These values
were used in the calculation of the least-squares regression line shown in question 7.
What is the residual corresponding to these values?
A) 0.01
B) -0.01
C) 0.50
D) -0.50
E) 0.37
```
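
The calculations this worksheet asks for (a residual from a fitted line, a correlation recovered from R-Sq, and the sum of least-squares residuals) can be checked with a short Python sketch. This is only one possible way to organise the arithmetic. The function names `residual`, `correlation_from_r_squared` and `fit_least_squares` are made up for illustration, and the small data set `xs`, `ys` is hypothetical and not taken from the worksheet.

```python
import math

def residual(observed_y, intercept, slope, x):
    """Residual = observed y minus the y predicted by the fitted line."""
    predicted_y = intercept + slope * x
    return observed_y - predicted_y

def correlation_from_r_squared(r_squared, slope):
    """For simple linear regression, |r| = sqrt(R-sq) and r has the sign of the slope."""
    return math.copysign(math.sqrt(r_squared), slope)

def fit_least_squares(xs, ys):
    """Return (intercept, slope) of the least-squares line through the points."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    slope = s_xy / s_xx
    intercept = y_bar - slope * x_bar
    return intercept, slope

# Question 8: flow rate 0.3, observed eroded soil 0.8, line 0.4 + 1.3 * (flow rate)
q8 = residual(observed_y=0.8, intercept=0.4, slope=1.3, x=0.3)

# Question 6: R-Sq = 27.0% and the fitted slope is -0.887
q6 = correlation_from_r_squared(r_squared=0.27, slope=-0.887)

# Hypothetical data (not from the worksheet) to check that the
# residuals of a least-squares line sum to zero, up to rounding error.
xs = [1, 2, 3, 4, 5]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]
intercept, slope = fit_least_squares(xs, ys)
residuals = [residual(y, intercept, slope, x) for x, y in zip(xs, ys)]

print(round(q8, 2), round(q6, 2), round(sum(residuals), 10))
```

Because the residuals of a least-squares line sum to zero, their mean is zero as well, which is why the fit is judged by the sum of squared residuals rather than by the plain sum.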