Title: Stat 232
1- Stat 232
- Experimental Design
- Spring 2008
2- Ching-Shui Cheng
- Office 419 Evans HallPhone 642-9968Email
cheng_at_stat.berkeley.edu - Office Hours Tu Th 200-300 and by appointment
3- Course webpage
- http//www.stat.berkeley.edu/cheng/232.htm
4- No textbook
- Recommended (for first half of the course)
- Design of Comparative Exeperiments by R. A.
Bailey, to appear in 2008 - http//www.maths.qmul.ac.uk/rab/DOEbook/
- Experiments Planning, Analysis, and Parameter
Design Optimization by C. F. J. Wu and M. Hamada - Statistics for Experimenters Design, Innovation
and Discovery by Box, Hunter and Hunter - A useful software GenStat
5Experimental Design
- Planning of experiments to produce valid
information as efficiently as possible
6Comparative Experiments
- Treatments (varieties)
- Varieties of grain, fertilizers, drugs, .
- Experimental units (plots) smallest division of
the experimental material so that different units
can receive different treatments - Plots, patients, .
-
7Design How to assign the treatments to the
experimental units
-
- Fundamental difficulty variability among the
units no two units are exactly the same. - Each unit can be assigned only one treatment.
- Different responses may be observed even if the
same treatment is assigned to the units. - Systematic assignments may lead to bias.
8- R. A. Fisher worked at the Rothamsted
Experimental Station in the United Kingdom to
evaluate the success of various fertilizer
treatments.
9- Fisher found the data from experiments going on
for decades to be basically worthless because of
poor experimental design. - Fertilizer had been applied to a field one year
and not in another in order to compare the yield
of grain produced in the two years. - BUT
- It may have rained more, or been sunnier, in
different years. - The seeds used may have differed between years as
well. - Or fertilizer was applied to one field and not to
a nearby field in the same year. - BUT
- The fields might have different soil, water,
drainage, and history of previous use. - ? Too many factors affecting the results were
uncontrolled.
10Fishers solution Randomization
- In the same field and same year, apply fertilizer
to randomly spaced plots within the field. - This averages out the effect of variation within
the field in drainage and soil composition on
yield, as well as controlling for weather, etc.
11- Randomization prevents any particular treatment
from receiving more than its fair share of better
units, thereby eliminating potential systematic
bias. Some treatments may still get lucky, but if
we assign many units to each treatment, then the
effects of chance will average out. - Replications
- In addition to guarding against potential
systematic biases, randomization also provides a
basis for doing statistical inference. - (Randomization model)
-
12Start with an initial design
Randomly permute (labels of) the experimental
units Complete randomization Pick one of the
72! Permutations randomly
134 treatments
Pick one of the 72! Permutations randomly
Completely randomized design
14blocking
- A disadvantage of complete randomization is that
when variations among the experimental units are
large, the treatment comparisons do not have good
precision. Blocking is an effective way to
reduce experimental error. The experimental
units are divided into more homogeneous groups
called blocks. Better precision can be achieved
by comparing the treatments within blocks.
15After randomization
Randomized complete block design
16Wine tasting
- Four wines are tasted and evaluated by each of
eight judges. - A unit is one tasting by one judge judges are
blocks. So there are eight blocks and 32 units. - Units within each judge are identified by order
of tasting.
17(No Transcript)
18-
- Block what you can and randomize what you
cannot. -
19- Randomization
- Blocking
- Replication
20Incomplete block design
21- Each of ten housewives does four washloads in an
experiment to compare five new detergents. - 5 treatments and 10 blocks of size 4.
-
22Incomplete block design
23Incomplete block design
-
- Balanced incomplete block design
- Randomize by randomly permuting the block labels
and independently permuting the unit labels
within each block. -
24Two simple block (unit) structures
- Nesting
- block/unit
- Crossing
- row column
25Two simple block structures
- Nesting
- block/unit
- Crossing
- row column
Latin square
26(No Transcript)
27Wine tasting
28- Simple block structures
- Iterated crossing and nesting
- cover most, though not all block structures
encountered in practice - Nelder (1965)
29Consumer testing
- A consumer organization wishes to compare 8
brands of - vacuum cleaner. There is one sample for each
brand. - Each of four housewives tests two cleaners in her
home - for a week. To allow for housewife effects, each
housewife - tests each cleaner and therefore takes part in
the trial for 4 - weeks.
- 8 treatments
- Block structure
30Trojan square
31Treatment structures
- No structure
- Treatments vs. control
- Factorial structure
- A fertilizer may be a combination of three
factors (variables) N (nitrogen), P (Phosphate),
K (Potassium)
32- Treatment structure
- Block structure (unit structure)
- Design
- Randomization
- Analysis
-
33Choice of design
- Efficiency
- Combinatorial considerations
- Practical considerations
34McLeod and Brewster (2004) Technometrics
- A company was experiencing problems with one of
its - chrome-plating processes in that when a
particular - complex-shaped part was being plated, excessive
pitting and cracking, as well as poor adhesion
and uneven deposition of chrome across the part,
were observed. With the goal being the
identification of key factors affecting the
quality of the process, a screening experiment
was planned. - In collaboration with the companys process
engineers, six - factors were identified for consideration in the
experiment. -
35- Hard-to-vary treatment factors
- A chrome concentration
- B Chrome to sulfate ratio
- C bath temperature
- Easy-to-vary treatment factors
- p etching current density
- q plating current density
- r part geometry
36- The responses included the numbers of pits and
cracks, in addition to hardness and thickness
readings at various locations on the part. - Suppose each of the six factors have two levels,
then there are 64 treatments. - A complete factorial design needs 64
experimental runs
37- Block structure 4 weeks/4 days/2 runs
- Treatment structure A B C p q r
- Each of the six factors has two levels
- Fractional factorial design
38Miller (1997) Technometrics
-
- Experimental objective Investigate methods of
reducing the wrinkling of clothes being laundered
39- Miller (1997)
- The experiment is run in 2 blocks and employs
- 4 washers and 4 driers. Sets of cloth samples
- are run through the washers and the samples
- are divided into groups such that each group
- contains exactly one sample from each washer.
- Each group of samples is then assigned to one
- of the driers. Once dried, the extent of
wrinkling - on each sample is evaluated.
40Treatment structure A, B, C, D, E, F
configurations of washersa,b,c,d
configurations of dryers
41-
-
- Block structure2 blocks/(4 washers4 dryers)
42 Block 1 Block 2
- 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 0 1 1 0
- 0 0 0 0 0 0 0 0 1 1 0 0 0 1 1 1 0 1 0 1
- 0 0 0 0 0 0 1 1 0 0 0 0 0 1 1 1 1 0 1 0
- 0 0 0 0 0 0 1 1 1 1 0 0 0 1 1 1 1 0 0 1
- 0 1 1 0 1 1 0 0 0 0 0 1 1 1 0 0 0 1 1 0
- 0 1 1 0 1 1 0 0 1 1 0 1 1 1 0 0 0 1 0 1
- 0 1 1 0 1 1 1 1 0 0 0 1 1 1 0 0 1 0 1 0
- 0 1 1 0 1 1 1 1 1 1 0 1 1 1 0 0 1 0 0 1
- 1 0 1 1 0 1 0 0 0 0 1 0 1 0 1 0 0 1 1 0
- 1 0 1 1 0 1 0 0 1 1 1 0 1 0 1 0 0 1 0 1
- 1 0 1 1 0 1 1 1 0 0 1 0 1 0 1 0 1 0 1 0
- 1 0 1 1 0 1 1 1 1 1 1 0 1 0 1 0 1 0 0 1
- 1 1 0 1 1 0 0 0 0 0 1 1 0 0 0 1 0 1 1 0
- 1 1 0 1 1 0 0 0 1 1 1 1 0 0 0 1
0 1 0 1 - 1 1 0 1 1 0 1 1 0 0 1 1 0 0 0 1
1 0 1 0 - 1 1 0 1 1 0 1 1 1 1 1 1 0 0 0 1
1 0 0 1
43- GenStat code
- factor nvalue32levels2 block,A,B,C,D,E,F,a,b,
c,d - levels4 wash, dryer
- generate block,wash,dryer
- blockstructure block/(washdryer)
- treatmentstructure
- (ABCDEF)(ABCDEF)
- (abcd)(abcd)
- (ABCDEF)(abcd)
44matrix rows10 columns5 values b r1
r2 c1 c2" 0, 0, 1, 0, 0, 0, 1, 0, 0, 0,
0, 1, 1, 0, 0, 1, 0, 1, 0, 0, 1, 1,
0, 0, 0, 1, 1, 1, 0, 0, 0, 0, 0, 0,
1, 1, 0, 0, 0, 1, 1, 0, 0, 1, 0,
0, 0, 0, 1, 0 Mkey
45- Akey blockfactorsblock,wash,dryer KeyMkey
- rowprimes!(10(2))colprimes!(5(2))
colmappings!(1,2,2,3,3) - Pdesign
- Arandom blocksblock/(washdryer)seed12345
- PDESIGN
- ANOVA
46Outline
- Introduction randomization and blocking
- Some mathematical preliminaries
- Linear models
- Block structures strata, null ANOVA
- Computation of estimates ANOVA table
- Orthogonal designs
- Non-orthogonal designs
- Factorial designs
- Response surface methodology
- Other topics as time permits