
Chelsea Hutto
PSYC 8000
Exam 1
50 points
Due 2/13/2014
1. If my experimental hypothesis were ‘Eating cheese before bed affects the number of
nightmares you have’, what would the null hypothesis be? (1 pt)
a. Eating cheese before bed gives you more nightmares.
b. Eating cheese is linearly related to the number of nightmares you have.
c. The number of nightmares you have is not affected by eating cheese before
bed.
d. Eating cheese before bed gives you fewer nightmares.
2. What does a significant test statistic tell us? (1 pt)
a. There is an important effect.
b. That the test statistic is larger than we would expect if there were no effect in
the population.
c. The null hypothesis is false.
d. All of the above.
3. Give one example of each of the following: (3 pts)
a. Nominal variable - The number assigned to a contestant in a competition
b. Ordinal variable - Contestants who won a competition, ranked in order such as
first, second, and third place.
c. Interval variable - Ratings of performance on a performance scale, which are
based on a five-point scale.
4. What is a Type I error? What is one way that researchers can reduce their risk of making
a Type I error? (2 pts)
A Type I error occurs when we conclude that there is a genuine effect in the
population when, in reality, there is not. One way to keep the familywise Type I
error rate at or below .05 across multiple tests is the Bonferroni correction, which
divides alpha by the number of comparisons (k). Researchers could also lower the
alpha level, but this in turn increases the likelihood of a Type II error.
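The Bonferroni arithmetic described above can be sketched in a few lines of Python. This is only an illustration: the function names and the choice of 10 comparisons are assumptions, not part of the exam, and the familywise formula assumes the tests are independent.

```python
def bonferroni_alpha(alpha, k):
    """Per-comparison alpha under the Bonferroni correction (alpha / k)."""
    if k < 1:
        raise ValueError("k must be at least 1")
    return alpha / k


def familywise_error(alpha_per_test, k):
    """Probability of at least one Type I error across k independent tests."""
    return 1 - (1 - alpha_per_test) ** k


# Hypothetical scenario: 10 comparisons, each tested at alpha = .05.
uncorrected = familywise_error(0.05, 10)
# The same 10 comparisons, each tested at the corrected alpha of .05 / 10.
corrected = familywise_error(bonferroni_alpha(0.05, 10), 10)

print(f"uncorrected familywise error: {uncorrected:.3f}")
print(f"corrected familywise error:   {corrected:.3f}")
```

Without the correction the familywise error rate is far above .05; with it, the rate stays just under .05.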
5. What is a Type II error? What is one way that researchers can reduce their risk of
making a Type II error? (2 pts)
A Type II error occurs when we conclude that there is no effect in the population
when, in reality, there is. Researchers can calculate the power of a test, which
gives the probability that a given test will find an effect, assuming that one exists
in the population. Using this measure (for example, to plan a large enough sample),
researchers can reduce the likelihood of making a Type II error.
6. In 1-2 sentences, explain what power is, and one way in which power can be increased.
(2 pts)
Power is the probability that a given test will find an effect assuming that one exists
in the population. Increasing the sample size increases the power of a test.
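As a rough illustration of how power grows with sample size, the sketch below computes the power of a two-tailed one-sample z-test. The z-test, the standardized effect size of 0.3, and the function name are assumptions chosen for simplicity; they are not taken from the exam.

```python
from statistics import NormalDist


def z_test_power(d, n, alpha=0.05):
    """Power of a two-tailed one-sample z-test.

    d is the assumed standardized effect size; n is the sample size.
    """
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha / 2)
    shift = d * n ** 0.5  # how far the true effect moves the test statistic
    upper_tail = 1 - std_normal.cdf(z_crit - shift)
    lower_tail = std_normal.cdf(-z_crit - shift)
    return upper_tail + lower_tail


# Same assumed effect size, three hypothetical sample sizes.
powers = {n: z_test_power(0.3, n) for n in (20, 50, 100)}
for n, power in powers.items():
    print(f"n = {n:3d}: power = {power:.2f}")
```

With no true effect (d = 0), the same function returns alpha, which is the Type I error rate.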
7. ‘Children can learn a second language differently before the age of 7 than after.’ Is this
statement: (1 pt)
a. A two-tailed hypothesis
b. A one-tailed hypothesis
c. A null hypothesis
d. A non-falsifiable hypothesis
8. Under a null hypothesis, a sample value yields a p-value of .15. Which of the following
statements is true? (1 pt)
a. This finding is statistically significant at the .01 level of significance.
b. This finding is not statistically significant.
c. This finding is statistically significant at the .05 level of significance.
d. This finding is statistically significant at the .001 level of significance.
9. Why is the standard error important? (1 pt)
a. It tells us the precise value of the variance within the population.
b. It gives you a measure of how well your sample parameter represents the
population value.
c. It is unaffected by outliers.
d. It is unaffected by the distribution of scores.
10. What is the basic equation for all statistical models? In words, what does this equation
mean? (2 pts)
The outcome is equal to the model plus error. This equation also means the data we
observe can be predicted from the model we choose to fit to the data plus some
amount of error.
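A minimal sketch of "outcome = model + error", assuming the simplest possible model (the mean) and made-up scores that are not from the exam:

```python
# Hypothetical scores, invented purely for illustration.
scores = [1, 3, 4, 3, 2]

# The simplest model of these data: a single value, the mean.
model = sum(scores) / len(scores)

# Error is whatever the model fails to capture for each observation.
errors = [score - model for score in scores]

# Each observed outcome is recovered as model + error.
reconstructed = [model + error for error in errors]

print("model (mean):", model)
print("errors:", [round(e, 2) for e in errors])
```

When the model is the mean, the errors sum to zero.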
11. A 95% confidence interval is: (1 pt)
a. The range of values of the statistic which probably contains the true value of
the statistic in the population.
b. The range of values of the statistic that we can be 95% confident contains a
significant effect in the population.
c. The range of values of the statistic which we can be 95% certain does not contain
the true population effect.
d. The range of values of the statistic which we can be 5% confident contains a
significant effect in the population.
12. What is an effect size? If you have a particularly large sample, why might it be important
to calculate effect sizes in addition to test statistics? (2 pts)
An effect size is an objective and usually standardized measure of the magnitude of
the observed effect. With a particularly large sample, even a trivially small effect
can produce a significant test statistic, so reporting the effect size alongside the
test statistic gives a more accurate picture because it is less affected by sample
size. Effect sizes are also standardized, which allows them to be compared across
studies.
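To illustrate why a large sample makes effect sizes worth reporting, the sketch below holds a tiny standardized effect fixed and changes only the sample size. The one-sample z-test, the effect size of 0.05, and the sample sizes are assumptions made for the example.

```python
from statistics import NormalDist


def two_tailed_p(d, n):
    """Two-tailed p-value of a one-sample z-test for standardized effect d."""
    z = abs(d) * n ** 0.5
    return 2 * (1 - NormalDist().cdf(z))


d = 0.05  # assumed effect size: tiny by any conventional benchmark

p_small_sample = two_tailed_p(d, 100)     # z = 0.5
p_large_sample = two_tailed_p(d, 10_000)  # z = 5.0

print(f"n = 100:    p = {p_small_sample:.3f}")
print(f"n = 10000:  p = {p_large_sample:.2e}")
```

The effect size is identical in both cases; only the test statistic and its p-value respond to the sample size.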
13. The Kolmogorov–Smirnov test can be used to test: (1 pt)
a. Whether group variances are equal.
b. Whether group means differ.
c. Whether scores are normally distributed.
d. Whether scores are measured at the interval level.
14. Which of these variables would be considered not to have met the assumptions of
parametric tests based on the normal distribution? (1 pt)
a. Reaction time (in seconds)
b. Cognitive ability score (scores range from 0-100)
c. Temperature
d. Gender
15. What does the graph below indicate about the normality of our data? (1 pt)
a. The P-P plot reveals that the data deviate substantially from normal.
b. The P-P plot reveals that the data are normal.
c. The P-P plot reveals that data is strongly negatively skewed.
d. The P-P plot reveals that data is strongly positively skewed
16. We predict an outcome variable from some kind of model. That model is described by
one or more _______ variables and ________ that tell us something about the
relationship between the predictor and outcome variable. (1 pt)
a. parameter, outcome variables
b. predictor, parameters
c. dependent, predictors
d. outcome, estimates
17. What does the assumption of independence mean? (1 pt)
a. This assumption means that none of your independent variables are correlated.
b. This assumption means that you must use an independent design rather than a
repeated-measures design.
c. This assumption means that the errors in your model are not related to each
other.
d. This assumption means that the residuals in your model are not independent.
18. Looking at the table below, which of the following statements is the most accurate? (1 pt)
a. For the level of musical skill, the data are heavily negatively skewed.
b. For the number of hours spent practising, there is an issue with kurtosis.
c. For the number of hours spent practising, the data are fairly positively skewed.
d. For the number of hours spent practising, there is not an issue with kurtosis.
19. Looking at the table below, which of the following statements is correct? (1 pt)
a. Levene’s test was significant, F(1, 118) = 0.93, p = .007, indicating that the
assumption of homogeneity of variance had been met.
b. Levene’s test was non-significant, F(1, 118) = 0.01, p = .93, indicating that the
assumption of homogeneity of variance had been met.
c. Levene’s test was non-significant, F(1, 118) = 0.01, p = .93, indicating that the
assumption of homogeneity of variance had been violated.
d. Levene’s test was significant, F(1, 118) = 0.01, p = .93, indicating that the
assumption of homogeneity of variance had been violated.
20. What does the central limit theorem tell us about the relationship between sample size
and the sampling distribution of a parameter? (2 pts)
The central limit theorem states that, regardless of the shape of the population,
parameter estimates of that population (such as sample means) will have an
approximately normal sampling distribution, provided the samples are big enough.
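A small simulation can make this concrete. The sketch below is illustrative only: the exponential population, the sample sizes, the number of replications, and the seed are all assumptions. It draws sample means from a strongly skewed population and compares the skewness of their distribution at a small and a larger sample size.

```python
import random


def sample_means(n, reps, rng):
    """Means of `reps` samples of size `n` from a skewed (exponential) population."""
    return [sum(rng.expovariate(1.0) for _ in range(n)) / n for _ in range(reps)]


def skewness(values):
    """Moment-based skewness: roughly 0 for a symmetric distribution."""
    m = sum(values) / len(values)
    var = sum((v - m) ** 2 for v in values) / len(values)
    third = sum((v - m) ** 3 for v in values) / len(values)
    return third / var ** 1.5


rng = random.Random(8000)  # fixed seed so the run is repeatable
skew_small_n = skewness(sample_means(2, 2000, rng))
skew_large_n = skewness(sample_means(50, 2000, rng))

print(f"skewness of sample means, n = 2:  {skew_small_n:.2f}")
print(f"skewness of sample means, n = 50: {skew_large_n:.2f}")
```

The sampling distribution of the mean is clearly skewed for n = 2 and much closer to symmetric for n = 50, even though the population itself has not changed.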
21. Kevin has 4 extreme scores at the positive end of his distribution that are causing his data
to be positively skewed. To fix this problem, Kevin deletes these 4 scores and carries on
with his analyses. Is this the appropriate way for Kevin to have handled these scores?
Why or why not? Name one other way that he could have handled these scores. (3 pt)
Outliers should not be removed automatically. Trimming should only be used if there
is reason to think the scores came from a different population or were entered
incorrectly, so deleting them was not the most appropriate way for Kevin to have
handled his outliers. Kevin could instead have winsorized his data, which substitutes
each outlier with the highest value that is not an outlier.
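Winsorizing as described above can be sketched as follows. The scores and the cutoff used to flag outliers are invented for illustration; in practice the cutoff would come from whatever outlier rule the researcher has adopted.

```python
def winsorize_upper(scores, cutoff):
    """Replace every score above `cutoff` with the highest score at or below it.

    `cutoff` is an assumed, researcher-chosen threshold for what counts
    as an outlier at the positive end of the distribution.
    """
    non_outliers = [s for s in scores if s <= cutoff]
    if not non_outliers:
        raise ValueError("every score exceeds the cutoff")
    ceiling = max(non_outliers)
    return [min(s, ceiling) for s in scores]


# Made-up data with four extreme scores at the positive end.
scores = [2, 3, 3, 4, 5, 5, 6, 19, 21, 22, 25]
winsorized = winsorize_upper(scores, cutoff=10)

print(winsorized)
```

Unlike deleting the four scores, this keeps the sample size the same while pulling the extreme values in.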
22. Which of the following statements about Pearson’s correlation coefficient is not true? (1
pt)
a. It can be used as an effect size measure.
b. It can be used on ranked data.
c. Its value ranges from 0 to 1.
d. It can be used on ranked data.
23. The correlation between two variables A and B is .12 with a significance of p < .01. What
can we conclude? (1 pt)
a. That there is a substantial relationship between A and B.
b. That there is a small relationship between A and B.
c. That variable A causes variable B.
d. None of the above
24. Aurelia wants to determine the correlation between two variables: cognitive ability and
job performance. Cognitive ability was measured using a well-established test, and
scores range from 0-200. Job performance was measured by having the general manager
rank employees from 1-100 in terms of their performance over the last 6 months. What
type of correlation should Aurelia use? Why? (2 pts)
Aurelia should use Spearman's rho, a non-parametric statistic based on ranked data
that is also useful when dealing with non-normal data; job performance here is
measured as ranks. Kendall's tau is preferred when there are many tied ranks, but
there is no indication of tied ranks in this sample, which suggests Spearman's rho
would be the better measure to use in this situation.
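Spearman's rho is Pearson's correlation computed on ranks, which the sketch below makes explicit. The five employees and their scores are made up for illustration, and the helper names are assumptions.

```python
def ranks(values):
    """Rank values from 1 (smallest) upward, averaging the ranks of ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def pearson_r(x, y):
    """Pearson's correlation coefficient for two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r applied to the ranks of x and y."""
    return pearson_r(ranks(x), ranks(y))


# Hypothetical data: test scores (0-200) and manager rankings (1 = best).
cognitive_ability = [150, 120, 180, 95, 160]
manager_rank = [2, 4, 1, 5, 3]

rho = spearman_rho(cognitive_ability, manager_rank)
print(f"Spearman's rho = {rho:.2f}")
```

The coefficient is negative here only because a rank of 1 denotes the best performer, so higher ability goes with a numerically lower rank.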
25. The relationship between two variables partialling out the effect that a third variable has
on one of those variables can be expressed using a: (1 pt)
a. Bivariate correlation
b. Point-biserial correlation
c. Partial correlation
d. Semi-partial correlation
26. List and explain 2 reasons why causality cannot be inferred from correlation. (2 pts)
Correlation does not imply causality; it can only show that two variables are
related. One reason is the third-variable problem: some other variable (one that
was not measured) may be responsible for the observed relationship. A second reason
is direction of causality: a correlation gives no way to determine which variable,
if either, influences the other.
27. Which correlation coefficient would you use to look at the correlation between gender
and time spent on the phone talking to your mother? (1 pt)
a. The point-biserial correlation coefficient, rpb
b. The biserial correlation coefficient, rb
c. Pearson’s correlation coefficient, r
d. Kendall’s correlation coefficient, τ
28. How do you determine if your data is curvilinear? If your data is curvilinear, can you
use a Pearson’s r correlation? Why or why not? (3 pts)
Pearson's r is based on the assumption of a linear relationship rather than a
curvilinear one; therefore, Pearson's r would not be an appropriate correlation to
use when dealing with curvilinear data. To determine whether your data are
curvilinear, plot them (for example, with a scatterplot) and inspect the pattern
visually.
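The sketch below shows why Pearson's r can mislead with curvilinear data. The data are invented: y is determined entirely by x through a U-shaped curve, yet the linear correlation is zero.

```python
def pearson_r(x, y):
    """Pearson's correlation coefficient for two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


# Made-up data: a perfect, but U-shaped, relationship.
x = [-3, -2, -1, 0, 1, 2, 3]
y = [value ** 2 for value in x]

r = pearson_r(x, y)
print(f"Pearson's r = {r:.2f}")
```

A scatterplot of these points would reveal the curve immediately, which is why plotting comes before choosing a coefficient.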
29. What do the results in the table below show? (1 pt)

                                                 Work productivity   Time spent on Facebook
Work productivity        Pearson’s correlation   1.000               –.94
                         Sig. (2-tail)           .                   .000
                         N                       100                 100
Time spent on Facebook   Pearson’s correlation   –.94                1.000
                         Sig. (2-tail)           .000                .
                         N                       100                 100

a. In a sample of 100 people, there was a strong negative but non-significant
relationship between work productivity and time spent on Facebook, r = –.94, p
> .001.
b. In a sample of 100 people, there was a non-significant negative relationship between
work productivity and time spent on Facebook, r = –.94, p < .001.
c. In a sample of 100 people, there was a strong negative relationship between work
productivity and time spent on Facebook, r = –.94, p < .001.
d. In a sample of 100 people, there was a weak negative relationship between work
productivity and time spent on Facebook, r = –.94, p < .001.
30. Looking at the table below, which variables were the most strongly correlated? (1 pt)

                                        Work ethic   Annual income   IQ
Work ethic      Pearson’s correlation   1.000        .72             .66
                Sig. (2-tail)           .            .001            .000
                N                       550          550             550
Annual income   Pearson’s correlation   .72          1.000           .47
                Sig. (2-tail)           .000         .               .03
                N                       550          550             550
IQ              Pearson’s correlation   .66          .47             1.000
                Sig. (2-tail)           .000         .03             .
                N                       550          550             550

a. Annual income and IQ
b. Work ethic and annual income
c. Work ethic and IQ
d. None of the variables were significantly correlated with one another
31. How is the coefficient of determination calculated? What does the coefficient of
determination tell us in terms of variance? (2 pts)
The coefficient of determination (R²) is a measure of the amount of variability in
one variable that is shared by the other. This value is calculated by squaring the
correlation between the two variables. R² tells us how much of the variability in
the two variables is shared (which can be expressed as a percentage by multiplying
by 100). R² is not to be mistaken for the variance in one variable accounted for by
another, which implies causality.
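As a short worked example of the calculation, the sketch below squares two correlations that appear in the tables earlier in this exam (.72 and –.94). The function name is an assumption.

```python
def shared_variance_percent(r):
    """Square a correlation and express the result as a percentage."""
    return (r ** 2) * 100


# Correlations taken from the tables in questions 29 and 30.
ethic_income = shared_variance_percent(0.72)
productivity_facebook = shared_variance_percent(-0.94)

print(f"r = .72  -> R^2 = {ethic_income:.2f}% shared variability")
print(f"r = -.94 -> R^2 = {productivity_facebook:.2f}% shared variability")
```

Squaring removes the sign, so R² says nothing about the direction of the relationship.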
32. A psychologist was interested in whether the amount of news people watch (minutes per
day) predicts how depressed they are (from 0 = not depressed to 7 = very depressed).
What does the standardized beta tell us in the output? (1 pt)
a. As news exposure increases by 1 standard deviation, depression decreases by
0.224 of a standard deviation.
b. As news exposure decreases by 0.224 standard deviations, depression increases by
1 standard deviation.
c. As news exposure increases by 1 minute, depression decreases by 0.224 units.
d. As news exposure decreases by 0.224 minutes, depression increases by 1 unit.
33. A consumer researcher was interested in what factors influence people's fear responses to
horror films. She measured gender and how much a person is prone to believe in things
that are not real (fantasy proneness). Fear responses were measured too. In this table, what
does the value 847.685 represent? (1 pt)
a. The reduction in the error in predicting fear scores when fantasy proneness is
added to the model
b. The total error in predicting fear scores when both gender and fantasy proneness
are included as predictors in the model
c. The improvement in prediction of fear resulting from including both gender and
fantasy proneness as predictors in the model
d. The improvement in prediction of fear resulting from adding fantasy proneness to
the model
34. A psychologist was interested in whether the amount of news people watch predicts how
depressed they are. In this table, what does the value 4.404 represent? (1 pt)
a. The ratio of how much the prediction of depression has improved by fitting the
model, compared to how much variability there is in depression scores
b. The ratio of how much error there is in the model, compared to how much
variability there is in depression scores
c. The proportion of variance in depression explained by news exposure
d. The ratio of how much the prediction of depression has improved by fitting
the model, compared to how much error still remains
35. Looking at this plot showing the zpred x zresid values for the outcome variable
(depression), does there appear to be a problem with homoscedasticity? Why or why not?
(2 pts)
There does appear to be a problem with homoscedasticity in this particular data
set. In this example, the spread of scores for depression was different at each
unit of news exposure.
36. Which of the following statements about the t-statistic in regression is not true? (1 pt)
a. The t-statistic provides some idea of how well a predictor predicts the outcome
variable.
b. The t-statistic can be used to see whether a predictor variable makes a
statistically significant contribution to the regression model.
c. The t-statistic is equal to the regression coefficient divided by its standard
deviation.
d. The t-statistic tests whether the regression coefficient, b, is equal to 0.
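For reference, the t-statistic for a slope in simple regression is conventionally the coefficient b divided by its standard error. The sketch below computes it by hand for made-up data; the function name and the data are assumptions, and only the one-predictor case is covered.

```python
def slope_t_statistic(x, y):
    """Return (b, standard error of b, t) for a simple linear regression of y on x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))

    b = sxy / sxx                   # regression coefficient (slope)
    intercept = mean_y - b * mean_x

    # Residual variance uses n - 2 degrees of freedom (slope and intercept).
    sse = sum((yi - (intercept + b * xi)) ** 2 for xi, yi in zip(x, y))
    residual_variance = sse / (n - 2)

    se_b = (residual_variance / sxx) ** 0.5
    return b, se_b, b / se_b


# Hypothetical data, invented for illustration.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

b, se_b, t = slope_t_statistic(x, y)
print(f"b = {b:.2f}, SE = {se_b:.3f}, t = {t:.3f}")
```

The resulting t tests whether b differs from 0, on n - 2 degrees of freedom.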