STAT 305: Chapter 14 – Randomized Complete Block Designs & Their Analysis Spring 2014 EXAMPLE 14.1 – Comparing Methods of Determining Blood Serum Level Data File: Serum-Meth.JMP The goal of this study was determine if four different methods for determining blood serum levels significantly differ in terms of the readings they give. Suppose we wish to have 6 readings for each method which we will use to make our comparisons. One approach we could take would be to find 24 volunteers and randomly allocate six subjects to each method and compare the readings obtained using the four methods. (Note: this is called a completely randomized design). There is one major problem with this approach, what is it? Instead of taking this approach it would clearly be better to use each method on the sample from the same subject. This removes subject to subject variation from the results and will allow us to get a clearer picture of the actual differences in the methods. Also if we truly only wish to have 6 readings for each method, this approach will only require the use of 6 subjects versus the 24 subjects the completely randomized approach discussed above requires, thus reducing the “cost” of the experiment. The experimental design where each patient’s serum level is determined using each method is called a randomized complete block (RCB) design. Here the patients serve as the blocks; the term randomized refers to the fact that the methods will be applied to the samples from the patients in a random order, and complete refers to the fact that each method is used on each subject. In some experiments where blocking is used it is not possible to apply each treatment to each block resulting in what is called an incomplete block design. These are less common and we will not discuss them in this class. The table below contains the raw data from the RCB experiment to compare the serum determination methods. Method Subject 1 2 3 4 1 360 435 391 502 2 1035 1152 1002 1230 3 632 750 591 804 4 581 703 583 790 5 463 520 471 502 6 1131 1340 1144 1300 220 STAT 305: Chapter 14 – Randomized Complete Block Designs & Their Analysis Spring 2014 Visualizing the need for Blocking Select Fit Y by X from the Analyze menu and place Serum Level in the Y, Response box and Method in the X, Factor box. The resulting comparative plot is shown below. Does there appear to be any differences in the serum levels obtained from the four methods? This plot completely ignores the fact that the same six blood samples were used for each method. We can incorporate this information visually by selecting Oneway Analysis > Matching Column... > then highlight Patient in the list. This will have the following effect on the plot. Now we can clearly see that ignoring the fact the blood samples were used for each method is a big mistake! On the next page we will show how to correctly analyze these data. 221 STAT 305: Chapter 14 – Randomized Complete Block Designs & Their Analysis Spring 2014 Correct Analysis of RCB Design Data in JMP First select Fit Y by X from the Analyze menu and place Serum Level in the Y, Response box, Method in the X, Factor box, and Patient in the Block box. The results from JMP are shown below. Notice the Y axis is “Serum Level – Block Centered”. These means that the results we are seeing in the display are the differences in the serum level readings adjusting for the fact that the readings for each method came from the same 6 patients. Examining the data in this way we can clearly see that the methods differ in the serum level reported when measuring blood samples from the same patient. The results of the ANOVA clearly show we have strong evidence that the four methods do not give the same readings when measuring the same blood sample (p < .0001). We do not test the blocking factor generally, so even though the patient p-value is highly significant we really don’t care – we know already that the patients differ, that is why we used them as blocks in the first place. However, the fact that much of the variation in the response is due to patient-topatient differences and reaffirms the fact that blocking on patients was a good idea! The tables below give the block corrected mean for each method and the block means used to make the adjustment. 222 STAT 305: Chapter 14 – Randomized Complete Block Designs & Their Analysis Spring 2014 As was the case with one-way ANOVA (completely randomized) we may still wish to determine which methods give significantly different means when measuring the same blood sample. Select Compare Means... > All Pairs, Tukey’s HSD. We can see that methods 4 & 2 differ significantly from methods 1 & 3 but not each other. The same can be said for methods 1 & 3 when compared to methods 4 & 2. The confidence intervals quantify the size of the difference we can expect on average when measuring the same blood samples. For example, we see that method 4 will give between 90.28 and 255.05 higher serum levels than method 3 on average when measuring the same blood sample. Other comparisons can be interpreted in similar fashion. 223 STAT 305: Chapter 14 – Randomized Complete Block Designs & Their Analysis Spring 2014 Example 14.2 – Glucose Concentrations During Four Stages of Labor Data File: Glucose Labor.JMP The purpose of a study conducted by Maheux et al. in their paper “Glucose Homeostasis During Spontaneous Labor in Normal Human Pregnancy” published in Journal of Clinical Endocrinology and Metabolism (1996) was to evaluate the effect of labor on glucose production and utilization. Subjects in the study were six normal pregnant women. Among the data collected were the following glucose concentrations during four stages of labor: latent (A1) and active (A2) phases of cervical dilatation, fetal expulsion (B), and placental expulsion (C). The raw data collected are presented below: Labor Phase Subject A1 A2 B C 1 3.60 4.40 5.30 6.20 2 3.53 3.70 4.10 3.80 3 4.02 4.80 5.40 5.27 4 4.90 5.33 6.30 6.20 5 4.06 4.65 6.10 6.90 6 3.97 5.20 4.90 4.60 These data entered into JMP are shown below. 224 STAT 305: Chapter 14 – Randomized Complete Block Designs & Their Analysis Spring 2014 Again we use Analyze > Fit Y by X with Y = Glucose Concentration and X = Labor Phase and place Subject in the Block box. The dialog box set up correctly is shown below. The results from JMP are shown below. Sample sizes are really too small to assess normality, so we will just have to assume that assumption is valid for these data. We can check equality of the variance of the glucose levels within each labor phase using the UnEqual Variances option as in one-way ANOVA. None of the p-values are less than = .05 so we will use ANOVA to test for differences in the mean glucose levels across labor phase. 225 STAT 305: Chapter 14 – Randomized Complete Block Designs & Their Analysis Spring 2014 The ANOVA table and multiple comparisons are shown below. Using Tukey’s HSD to perform the multiple comparisons. We can see that the mean glucose levels during fetal and placental expulsion differ significantly from the mean levels during the latent dilatation phase. The mean level during placental expulsion for example is estimated to between .612 and 2.35 units larger than the mean glucose level at the latent dilatation phase. 226