BALANCED INCOMPLETE BLOCK DESIGNS (BIBD) & RELATED

advertisement

STAT 512 Balanced Incomplete Block Designs

BALANCED INCOMPLETE BLOCK DESIGNS (BIBD) & RELATED

• As with CBD, b blocks, but now each with k units, k < t

• N = bk

• NOW all treatments cannot fit in any one block, so blocks are called “ incomplete ”

• Examples: b = 4 , k = 3 , t = 4

Usually a bad idea block treatment

3

4

1

2

1

1

1

4

2

2

2

4

3

3

3

4

Usually a better idea block treatment

3

4

1

2

1

1

1

2

2

2

3

3

3

4

4

4

1

STAT 512 Balanced Incomplete Block Designs

Rules for BALANCED Incomplete Block Designs (BIBD’s)

• “First-order balance” : Each treatment must appear the same number of times in the (entire) design: r = N/t = bk/t

• “Second-order balance” : Each PAIR of treatments must appear together in the same number of blocks:

λ = r ( k − 1) / ( t − 1)

– pick any specific treatment, say “1”

– numerator is number of “non-1” assignments in blocks containing a “1”

– denominator is number of “non-1” treatments

2

STAT 512 Balanced Incomplete Block Designs

• BIBD’s don’t exist for every t , b , and k < t

• Necessary, but not Sufficient, Condition: r = bk/t and λ = r ( k − 1) / ( t − 1) must be integers

• Example: t = 7 , k = 5

– r = bk/t = b [5 / 7] = integer

– λ = [ r ][( k − 1) / ( t − 1)] = b [5 / 7][4 / 6] = b [10 / 21] = integer

– smallest integer solution is b = 21 → λ = 10 , r = 15

– EXIST? Yes, in this case use all possible combinations of 7 treatments in groups of size 5:

 

7

 

= 21

5

3

STAT 512 Balanced Incomplete Block Designs

• Easy Case:

– any k < t , one block for each unique subset of k treatments

  t

∗ b =

  k

 t − 1

∗ r = b [ k/t ] = [ t !

/ (( t − k )!

k !)][ k/t ] =

 k − 1

 t − 1

∗ λ = r [( k − 1) / ( t − 1)] =

 k − 1

  t − 2

 k − 1 t − 1

=

 k − 2

– the second design on the first slide is an example of this:

∗ k = t − 1 , b = t , r = t − 1 , λ = t − 2

• In General: combinatorics, tables

4

STAT 512 Balanced Incomplete Block Designs

Model

• y i,j

= α + β i

+ τ j

+ i,j or β i

+ τ j

+ i,j

– as with CBD, but not all ( i, j ) combinations are included

• y = X

1

β + X

2

τ +

• [ bk × 1] = [ bk × b ][ b × 1] + [ bk × t ][ t × 1] + [ bk × 1]

Estimable Functions ... need to characterize ( I − H

1

) X

2

• H

1

= X

1

( X

0

1

X

1

)

− 1

X

0

1

=

1 k diag ( J k × k

...

J k × k

)

– like CBD, but now k = t

5

STAT 512 Balanced Incomplete Block Designs

• H

1

X

2

=

1 k

← columns indexed by j →

# of times trt j appears in blk 1

# of times trt j appears in blk 2

.

..

# of times trt j appears in blk b

( k rows )

( k rows )

.

..

( k rows )

• Not the same as a CRD with r units/trt group

# of times trt j appears in this row of X

2

-

• ( I − H

1

) X

2

=

1 k no. of times trt j appears in blk 1

.

..

# of times trt j appears in this row of X

2

-

1 k no. of times trt j appears in blk b

6

STAT 512 Balanced Incomplete Block Designs

• e.g. for k = 4 , t = 6 , treatments 1 - 4 in first block:

X

2 | 1

=

− 1

4

3

4

− 1

4

− 1

4

...

− 1

4

− 1

4

1

4

3

4

− 1

4

3

4

1

4

1

4

1

4

1

4

1

4

3

4

0 0

0

0

0

0

0

0

• Note all rows sum to zero, so all linear combinations of rows add to zero, so only contrasts between treatments are estimable

(again)

7

STAT 512 Balanced Incomplete Block Designs

Reduced Normal Equations: (Need X

0

2 | 1

X

2 | 1

= X

0

2

( I − H

1

) X

2

)

• X

0

2

X

2

= r I

• ( X

0

1

X

1

)

− 1

= k

− 1

I

• { X

0

1

X

2

} ij

= N ( i, j ) , an “incidence matrix”

– number of times treatment j appears in block i

– always 0 or 1

• { ( X

0

1

X

2

)

0

( X

0

1

X

2

) } j,j 0

= P i

N ( i, j ) N ( i, j

0

)

– number of blocks containing treatments j AND j

0

– always r for j = j

0

– always λ for j = j

0

– ( r − λ ) I + λ J

• ( X

0

2

X

1

)( X

0

1

X

1

)

− 1

( X

0

1

X

2

) = r − λ

I + k

λ

J k

8

STAT 512 Balanced Incomplete Block Designs

• Left side of R.N.E.:

– [ X

0

2

X

2

− X

0

2

H

1

X

2

– [ r I − ( r − λ ) /k I − λ/k J ]ˆ

λ k

[ t I − J ]ˆ = I

2 | 1

• Right side of R.N.E.:

– X

0

2

( I − X

1

( X

0

1

X

1

)

− 1 X

0

1

) y

– X

0

2 y − 1 k

X

0

2

X

1

X

0

1 y

– T − k

− 1

( X

0

2

X

1

) B

∗ T = t -element vector of treatment totals

∗ B = b -element vector of block totals

∗ { ( X

0

2

X

1

) B } j

= sum of block totals containing trt j

9

STAT 512 Balanced Incomplete Block Designs

• Multiply both Right and Left expressions above by k/λ

– [ t I − J ]ˆ = k/λ [ T − k

− 1

( X

0

2

X

1

) B ]

– Now divide both sides of this by t

– j th row: ˆ j

− τ

¯

.

= k/ ( λt ) [ y

.j

− k

− 1 P i

N ( i, j ) y i.

]

– Second term in brackets is the sum of block averages for

(only!) blocks that contain a unit with treatment j .

– Many books use Q j for [-], “adjusted treatment j total”

– If c

0

τ is estimable,

– c 0 τ = k/ ( λt ) P j c j

Q j

(since

¯ drops out)

Note that k

λt

“plays the role of”

1 r

... but they aren’t equal

10

STAT 512 Balanced Incomplete Block Designs

Variance of Estimable Functions:

• 0 τ = k/ ( λt ) c

0

[ X

0

2

− X

0

2

H

1

] y

• V ar [ c 0 τ ]

= k

2

σ

2

/ ( λt )

2 c

0

[ X

0

2

− X

0

2

H

1

][ X

2

− H

1

X

2

] c

= k 2 σ 2 / ( λt ) 2 c

0

[ X

0

2

X

2

− X

0

2

H

1

X

2

] c left-side matrix !

= k

2

σ

2

/ ( λt )

2 c

0

[ λt/k I − λ/k J ] c

J term drops out for estimable functions

= [ kσ

2

/ ( λt )] c

0 c

– again, k

λt

“plays the role of”

1 r

...

11

STAT 512 Balanced Incomplete Block Designs

When is it worth using?

• V ar

BIBD

[ 0 τ ] = [ kσ 2

BIBD

/ ( λt )] c

0 c

• V ar

CRD

[ 0 τ ] = [ σ

2

CRD

/ ( N/t )] c

0 c

• Use r = N/t and λ = ( N/t )[( k − 1) / ( t − 1)] ...

• V ar

BIBD

/V ar

CRD

= [ k/ ( k − 1)] / [ t/ ( t − 1)] [ σ

2

BIBD

2

CRD

]

• “Multiplying ratio” is greater than one if k < t

• Compare to earlier results, e.g.

V ar

LSD

/V ar

CRD

= σ

2

LSD

2

CRD

12

STAT 512 Balanced Incomplete Block Designs

• Multiplying ratio increases as k decreases relative to t , e.g.

t k ratio

10 10 1/1

10 8 72/70

10 6 54/50

10 4 36/30

10 2 18/10

• Expected squared CI length criterion becomes: v u u t k/ ( k − 1) t/ ( t − 1)

σ

BIBD

CRD

< t (1 t

(1 −

α/ 2;

α/

N

2;

N b

− t t )

+ 1)

13

STAT 512 Balanced Incomplete Block Designs

ANOVA: SS’s for blocks and treatments aren’t orthogonal ...

• SST( blocks

) = y

0

H

1 y = P b i =1 k (¯ i.

− ¯

..

)

2

• SST( blocks

, treatments ) = y

0

Hy

SST( treatments | blocks

) = SST( blocks

, treatments ) - SST( blocks

)

=

=

= k

λt

P j

Q 2 j

(slide 10) k

λt

P j

[ y

.j

( t − 1) k

( k − 1) t

P

1 k

P i

N ( i, j ) y i.

]

2 j r [¯

.j

− y

[ j ]

..

]

2 where ¯

[ j ]

..

is the average of all observations include a unit assigned to treatment j from blocks that

∗ : “ blocks ” includes the intercept here, which would ordinarily be removed first in “correction for the mean”

14

STAT 512 Balanced Incomplete Block Designs

Similar result for Test Power

• Hyp

0

: τ

1

= τ

2

= ...

= τ t

• Compare CRD with r = N/t , BIBD with same N

• For hypothetical τ , CRD Q ( τ ) = r P i

( τ i

− ¯ )

2

• For CRD, the distribution of the test statistic is:

F

0

( t − 1 , N − t,

Q (

τ

)

σ 2

CRD

)

• For BIBD, the distribution of the test statistic is:

F

0

( t − 1 , N − b − t + 1 , t/ ( t − 1) k/ ( k − 1)

Q (

τ

)

σ 2

BIBD

)

• i.e. noncentrality is decreased , and estimation variances increased by the same factor

15

STAT 512 Balanced Incomplete Block Designs

YOUDEN “SQUARE” DESIGNS

• Recall that in LSD, #(rows) = #(columns) = #(treatments)

• Now consider row-column designs in which

– ignoring rows leads to a CBD in columns

– ignoring columns leads to a BIBD in rows

• For example:

1 2 3

2 4 1

3 1 4

4 3 2

• Generally: t treatments, t rows, k < t columns

16

STAT 512 Balanced Incomplete Block Designs

• Structure:

– Columns are orthogonal to Treatments (this matters)

– Rows are orthogonal to Columns (this DOESN’T matter)

– Rows are not orthogonal to Treatments (this matters)

• H

1 is more complicated than it would be for a CBD or a BIBD, but

– H

1

X

2 is the same as for a BIBD with t blocks of size k .

– RNE’s are as with BIBD with row blocks (only)

– Point estimates, variance formulae, and non-centrality parameters can be computed ignoring row blocks

• Can replicate Youden Designs in any of the 3 ways discussed with

LSDs

17

STAT 512 Balanced Incomplete Block Designs

PARTIALLY BALANCED INCOMPLETE BLOCK DESIGNS

• For two associate classes ...

• t treatments, b blocks, k < t units/block

• Every treatment is applied to r units overall (as in BIBD’s)

• Everay pair of treatments are either:

– First associates , applied together in λ

1 blocks, or

– Second associates , applied together in λ

2 blocks

• Each treatment has the same number of 1st associates and the same number of 2nd associates

• For any two treatments that are i th associates, there are:

– p i

1 other treatments that are 1st associates of each

– p i

2

– q i other treatments that are 2nd associates of each other treatments that are 1st associates of one and 2nd associates of the other

18

STAT 512 Balanced Incomplete Block Designs

• For example:

1 2 4 5

2 3 5 6

1 3 4 6

– t = 6 , b = 3 , k = 4

– r = 2

– Treatments can be divided into two groups: (1,2,3) and (4,5,6)

– Any two treatments within a group are 1st associates, with

λ

1

= 1

– Any two treatments in different groups are 2nd associates, with

λ

2

= 2

• Analysis is somewhat messier than with BIBD’s, but there is far more flexibility in size for designs for given t and k

19

STAT 512 Balanced Incomplete Block Designs

BIBD’s: INTER- and INTRA-BLOCK INFORMATION

• Fixed-Effects analyses we’ve considered derive treatment information only from within-block comparisons (INTRA-block information)

• INTER-block comparisons are considered “sacrificed”; confounded with unknown block effects.

• Recall 0 τ = k

λt

P j c j

[1 × y

.j

− 1 k

P i

N ( i, j ) y i.

]

– for every block containing treatment j , weights for each data value in the estimate of 0 τ are

− 1 k

, − 1 k

, ..., 1 − k

λt

× c j times:

1 k

, − 1 k

, − 1 k

, ...

– for every block not containing treatment j , weights are all zero.

– in either case, 0 τ = P ij w ij y ij where P j w ij

= 0 for all i .

20

STAT 512 Balanced Incomplete Block Designs

• First, note that some of the INTRA-block info is not so obvious ...

for example: block treatment

3

4

1

2

1

1

1

2

2

2

3

3

3

4

4

4

• Is there trt 1-vs-trt 2 INTRA-block information contributed from blocks 3 and 4?

– Overall, V ar [ τ

1

− d

τ

2

] = c

0 c σ

2 k

λt

= 2 σ

2 k

λt

=

3

4

σ

2

– From just blocks 1 and 2,

V ar [ τ

1 d

τ

2

] = V ar [

1

2

[( y

11

− y

12

) + ( y

21

− y

22

)] ] = σ 2

– So the answer is “Yes, but indirect”, e.g.

[1-3 diff from block 3] - [2-3 diff from block 4] estimates 1-2

21

STAT 512 Balanced Incomplete Block Designs

“Recovery” of INTER-block information

• For example design above, if y ij

= β i

+ τ j

+ ij

,

+block

2

3

4

Expected Block Total Differences

-block

1 2 3

τ

4

− τ

3

+ 3 β

2

− 3 β

1

τ

4

− τ

2

+ 3 β

3

− 3 β

1

τ

3

− τ

2

+ 3 β

3

− 3 β

2

τ

4

− τ

1

+ 3 β

4

− 3 β

1

τ

3

− τ

1

+ 3 β

4

− 3 β

2

τ

2

− τ

1

+ 3 β

4

− 3 β

3

• Suppose block effects are assumed to be random :

β i

∼ i.i.d.

N (0 , δ 2 ) + µ

β

• Then differences between blocks also contain information about treatment differences

(but ... some of these differences are correlated ... common β ’s)

22

STAT 512 Balanced Incomplete Block Designs

• Practical, Realistic Assumption?

• Sometimes not:

– machinists ... perhaps some of those available were trained by same instructor, and that these are “systematically different” from others

– litters of animals ... perhaps some of those available were bred/raised in same facility, ...

• Sometimes maybe:

– autos actually selected randomly from a year’s production

– fields actually selected randomly from Iowa farms

23

STAT 512 Balanced Incomplete Block Designs

• y ij

= µ

β

+ τ j

+ ( β i

+ ij

) = µ

β

+ τ j

+

– E [

∗ ij

] = 0

– V ar [

∗ ij

] = σ 2 + δ 2

δ

2

, i = i

0

, j = j

0

– Cov [

∗ ij

,

∗ i 0 j 0

] =

0 , i = i

0

, j = j

0

∗ ij

• For block-ordered y :

– E [ y ] = µ

β

1 + X

2

τ

– V ar [ y ] = σ

2

I + δ

2 diag ( J k

, J k

, ..., J k

) = Σ

– Full, general treatment would involve generalized least squares with unknown Σ

24

STAT 512 Balanced Incomplete Block Designs

Neat Trick for BIBD

• Recall the INTRA-block estimator: c 0 τ = k/ ( λt ) P j c j

[ y

.j

− k

− 1 P i

N ( i, j ) y i.

]

• For random blocks, we can summarize INTER-block information using block totals:

– Let { z } i

= y i.

, i.e. the vector of block totals

– z

– b × 1

= kµ

β

Elements of

1 b × 1

+ U b × t

τ t × 1

+

∗ b × 1

∗ are i.i.d. w/ variance k

2

δ

2

+ kσ

2

– U = X

0

1

X

2

!

25

STAT 512 Balanced Incomplete Block Designs

– U = X

0

1

X

2

(again)

– U has:

∗ k 1’s in each row

∗ r 1’s in each column

∗ inner product of any two columns = λ

– One implication is that the sum of all columns in U is k 1

∗ ( 1 | U ) is of rank t (rather than t + 1 )

∗ only linear contrasts in τ ’s are estimable ...

– RNE’s: U

0

( I − 1 b

J ) U ˆ = U

0

( I − 1 b

J ) z

– 0 τ = k/ ( r − λ ) P j c j

[ k

− 1 P i

N ( i, j ) y i.

− r y ¯

..

]

– V ar [ 0 τ ] = [( k

2

δ

2

+ kσ

2

) / ( r − λ )] c

0 c

26

STAT 512 Balanced Incomplete Block Designs

Bottom Line:

• P d j c j

τ j and P j c j

τ j are independent statistics

• The weighted l.s. estimator (BLUE, given Σ ) is a weighted average of the two, with weights inversely proportional to their respective variances.

P d d j c j

τ j

= w

1 w

1

+ w

2

P d j c j

τ j

+ w

2 w

1

+ w

2

P g j c j

τ j

• w

1

= [ k

2

δ

2

+ kσ

2

] / ( r − λ ) (generally larger)

• w

2

= [ kσ 2 ] / ( tλ )

σ

2

• E [MSE] =

 k

2

δ

2

+ kσ

2

INTRA: fit of individual data

INTER: fit of block-total data

• Since, we ordinarily don’t know the weights, substitute:

– ˆ

1

= MSE

IN T ER

/ ( r − λ )

– ˆ

2

= k MSE

IN T RA

/ ( tλ )

27

STAT 512 Balanced Incomplete Block Designs

• If we knew σ and δ , we could write:

V ar [ τ ] = ( w

1 )

2

V ar [ 0 τ ] + (

= c

0 w

1

+ w

2 h c ( w

1 w

1

+ w

2

) 2 w

2

+ ( w

1

+ w

2 w

2 w

2 w

1

+ w

2

) 2

)

2 w

V ar [ 0 τ ]

1 i

= c

0 c w

1 w

2 w

1

+ w

2

• Again, can substitute estimates for σ

2 and k

2

δ

2

+ kσ

2

, although this does not necessarily lead to an estimator that is better than c 0 τ

28

Download