III - Crop and Soil Science

advertisement
Name:
KEY
CSS 590 – Experimental Design in Agriculture
Second Midterm
Winter 2007
10 pts
15 pts
1) While entering yield data from an experiment onto a spreadsheet, the value of 17796
g/plot was entered rather than the true value of 1796. Draw a picture showing how this
might appear on a residual plot. Include titles for the X and Y axes of your diagram and
the zero mark on the Y axis.
2) What is the purpose of conducting a Levene Test? If the test is significant, how would
you proceed with your data analysis and summarization?
A Levene test is conducted to determine if the assumption of homogeneity of variance
among groups is valid. If the test is significant, then the assumption is not valid, and
some form of transformation may be needed. To determine the appropriate
transformation, you would first consider the type of data that has been collected, which
might suggest an underlying distribution that would apply to this data set. For example,
you could determine if the mean is proportional to the standard deviation, or to the
variance. In the former case we would consider using a log transformation, and in the
latter, a square root transformation. Finally, we would want to recheck the residual plot
after analyzing the transformed data. For data summary, means would be back
transformed to the original scale. Alternatively, one could use a Generalized Linear
Model, or a nonparametric test, if no suitable transformation can be identified.
1
3) An animal scientist wishes to test the effects of 5 different feeding regimens on the
weight gain of pigs. Only five pigs are available for this research.
15 pts
a) Can you suggest a design that would permit her to address the research question of
interest given this limitation? Show the sources of variation and degrees of freedom for
the ANOVA for this design.
She could use a Latin Square Design and give all five feeding regimens to each pig. The
pigs would be one blocking factor (rows) and the sequence of the feeding regimen could
be a second blocking factor (columns).
Source
Total
p -1
24
Pigs
p-1
4
Sequence
p-1
4
Feeding Regimen
p-1
4
Error
10 pts
df
2
(p-1)(p-2)
12
b) Briefly describe how the feeding regimens would be assigned to the experimental units
in this experiment (you do not need to describe the randomization process step by step,
but describe the features of an appropriate randomization).

Each animal would receive each regimen once

The order in which the regimens are given would be unique for each pig

At any point in time, each of the five regimens would be assigned to one and only
one pig

A randomization process would be used to assign the regimens to pigs and
sequences
2
4) A Sugar Company conducted an experiment to compare two varieties of sugar cane in
combination with three levels of nitrogen (150, 210, and 270 lbs. N per acre
respectively). The experiment was run in 4 complete blocks.
a) Fill in the shaded cells to complete the following ANOVA. (There is an F table at the
end of this exam)
16 pts
ANOVA
Source
Blocks
Variety
Nitrogen
VxN
Error
Total
df
3
1
2
2
15
23
SS
249
216
16
400
672
1553
MS
83.0
216.0
8.0
200.0
44.8
F
F crit
4.82
0.18
4.46
4.54
3.68
3.68
Mean Yield (tons)
Variety 1
Variety 2
Mean
5 pts
Mean
69
63
66
b) Interpret the results from this experiment. What can you say about the response of
sugarcane to Nitrogen fertilizer? (you do not need to calculate contrasts, but discuss
trends).
c) Is there a significant difference between the varieties at the highest N level?
There is a significant interaction between variety and nitrogen, so conclusions should be
based on the two-way table of means. The yield of sugar cane in this study depended
upon the variety grown and the rate of applied nitrogen. The yield of Variety 2
decreased as the rate of application increased, while variety 1 showed a positive
response to the higher application rates.
Variety 1
Variety 2
sed = sqrt(2*44.8/4) = 4.73
75
t(0.05,15 df) = 2.131
70
LSD for interactions = 2.131*4.73
Sugar Yield (tons)
9 pts
N-150 N-210 N-270
65
69
73
69
63
57
67
66
65
= 10.09
Mean1-Mean2 = 73-57 = 16
16 > 10.09
65
60
55
Yes, the varieties differ at the highest N
level.
50
N-150
(could also compare a calculated t to the
critical t, or compare confidence intervals)
3
N-210
Nitrogen Application
N-270
5) You wish to compare the tolerance of 5 mint varieties to a new herbicide. A Randomized
Block Design is used, with 3 blocks. Initial plant densities are the same for all varieties,
and herbicide is applied to all plots in the experiment. After two months, numbers of mint
plants are counted in 4 quadrats in each plot.
The GLM Procedure
Dependent Variable: Count
Source
DF
Sum of
Squares
Mean Square
F Value
Pr > F
Model
14
10409.83333
743.55952
11.16
<.0001
Error
45
2997.50000
66.61111
Corrected Total
59
13407.33333
Source
Block
Variety
Block*Variety
R-Square
Coeff Var
Root MSE
count Mean
0.776428
16.43268
8.161563
49.66667
DF
Type III SS
Mean Square
F Value
Pr > F
2
4
8
470.233333
9035.333333
904.266667
235.116667
2258.833333
113.033333
3.53
33.91
1.70
0.0377
<.0001
0.1255
Tests of Hypotheses Using the Type III MS for Block*Variety as an Error Term
Source
Block
Variety
10 pts
DF
Type III SS
Mean Square
F Value
Pr > F
2
4
470.233333
9035.333333
235.116667
2258.833333
2.08
19.98
0.1873
0.0003
a) What is the appropriate F value for comparing the varieties? Explain your answer.
The F value for comparing varieties is 19.98. This ratio is obtained when we use the
experimental error (block*variety) as the error term. The default analysis uses the
sampling error in the denominator of the F test, which gives too large a value and
inflates the level of significance.
5 pts
b) Is the F test for blocks significant? what is your proof?
No. The observed Probability of 0.1873 is > than 0.05, so we fail to reject the null
hypothesis.
5 pts
c) Calculate the standard error for a variety mean.
standard error of a mean = sqrt(113.03/(3*4) = 3.07
4
sY 
MSE
r *n
F Distribution 5% Points
Denominator
df
1
1 161.45
2 18.51
3 10.13
4
7.71
5
6.61
6
5.99
7
5.59
8
5.32
9
5.12
10
4.96
11
4.84
12
4.75
13
4.67
14
4.6
15
4.54
16
4.49
17
4.45
18
4.41
19
4.38
20
4.35
21
4.32
22
4.3
23
4.28
24
4.26
25
4.24
26
27
28
29
30
Student's t Distribution
Numerator
(2-tailed probability)
2
3
4
5
6
7
199.5 215.71 224.58 230.16 233.99 236.77
19 19.16 19.25
19.3 19.33 19.36
9.55
9.28
9.12
9.01
8.94
8.89
6.94
6.59
6.39
6.26
6.16
6.08
5.79
5.41
5.19
5.05
4.95
5.88
5.14
4.76
4.53
4.39
4.28
4.21
4.74
4.35
4.12
3.97
3.87
3.79
4.46
4.07
3.84
3.69
3.58
3.5
4.26
3.86
3.63
3.48
3.37
3.29
4.1
3.71
3.48
3.32
3.22
3.13
3.98
3.59
3.36
3.2
3.09
3.01
3.88
3.49
3.26
3.1
3
2.91
3.8
3.41
3.18
3.02
2.92
2.83
3.74
3.34
3.11
2.96
2.85
2.76
3.68
3.29
3.06
2.9
2.79
2.71
3.63
3.24
3.01
2.85
2.74
2.66
3.59
3.2
2.96
2.81
2.7
2.61
3.55
3.16
2.93
2.77
2.66
2.58
3.52
3.13
2.9
2.74
2.63
2.54
3.49
3.1
2.87
2.71
2.6
2.51
3.47
3.07
2.84
2.68
2.57
2.49
3.44
3.05
2.82
2.66
2.55
2.46
3.42
3.03
2.8
2.64
2.53
2.44
3.4
3
2.78
2.62
2.51
2.42
3.38
2.99
2.76
2.6
2.49
2.4
5
df
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
0.4
0.05
0.01
1.376 12.706 63.667
1.061 4.303 9.925
0.978 3.182 5.841
0.941 2.776 4.604
0.920 2.571 4.032
0.906 2.447 3.707
0.896 2.365 3.499
0.889 2.306 3.355
0.883 2.262
3.25
0.879 2.228 3.169
0.876 2.201 3.106
0.873 2.179 3.055
0.870
2.16 3.012
0.868 2.145 2.977
0.866 2.131 2.947
0.865
2.12 2.921
0.863
2.11 2.898
0.862 2.101 2.878
0.861 2.093 2.861
0.860 2.086 2.845
0.859 2.080 2.831
0.858 2.074 2.819
0.858 2.069 2.807
0.857 2.064 2.797
0.856 2.060 2.787
0.856 2.056 2.779
0.855 2.052 2.771
0.855 2.048 2.763
0.854 2.045 2.756
0.854 2.042 2.750
Download