Stat 232 - PowerPoint PPT Presentation

About This Presentation

Title:

Stat 232

Description:

Stat 232 – PowerPoint PPT presentation

Number of Views:53

Avg rating:3.0/5.0

Slides: 47

Provided by: chingsh

Learn more at: https://www.stat.berkeley.edu

Category:

Tags: stat

more less

Transcript and Presenter's Notes

Title: Stat 232

1

Stat 232
Experimental Design
Spring 2008

Ching-Shui Cheng
Office 419 Evans HallPhone 642-9968Email
cheng_at_stat.berkeley.edu
Office Hours Tu Th 200-300 and by appointment

Course webpage
http//www.stat.berkeley.edu/cheng/232.htm

No textbook
Recommended (for first half of the course)
Design of Comparative Exeperiments by R. A.
Bailey, to appear in 2008
http//www.maths.qmul.ac.uk/rab/DOEbook/
Experiments Planning, Analysis, and Parameter
Design Optimization by C. F. J. Wu and M. Hamada
Statistics for Experimenters Design, Innovation
and Discovery by Box, Hunter and Hunter
A useful software GenStat

5
Experimental Design

Planning of experiments to produce valid
information as efficiently as possible

6
Comparative Experiments

Treatments (varieties)
Varieties of grain, fertilizers, drugs, .
Experimental units (plots) smallest division of
the experimental material so that different units
can receive different treatments
Plots, patients, .

7
Design How to assign the treatments to the
experimental units

Fundamental difficulty variability among the
units no two units are exactly the same.
Each unit can be assigned only one treatment.
Different responses may be observed even if the
same treatment is assigned to the units.
Systematic assignments may lead to bias.

R. A. Fisher worked at the Rothamsted
Experimental Station in the United Kingdom to
evaluate the success of various fertilizer
treatments.

Fisher found the data from experiments going on
for decades to be basically worthless because of
poor experimental design.
Fertilizer had been applied to a field one year
and not in another in order to compare the yield
of grain produced in the two years.
BUT
It may have rained more, or been sunnier, in
different years.
The seeds used may have differed between years as
well.
Or fertilizer was applied to one field and not to
a nearby field in the same year.
BUT
The fields might have different soil, water,
drainage, and history of previous use.
? Too many factors affecting the results were
uncontrolled.

10
Fishers solution Randomization

In the same field and same year, apply fertilizer
to randomly spaced plots within the field.
This averages out the effect of variation within
the field in drainage and soil composition on
yield, as well as controlling for weather, etc.

Randomization prevents any particular treatment
from receiving more than its fair share of better
units, thereby eliminating potential systematic
bias. Some treatments may still get lucky, but if
we assign many units to each treatment, then the
effects of chance will average out.
Replications
In addition to guarding against potential
systematic biases, randomization also provides a
basis for doing statistical inference.
(Randomization model)

12
Start with an initial design
Randomly permute (labels of) the experimental
units Complete randomization Pick one of the
72! Permutations randomly
13
4 treatments
Pick one of the 72! Permutations randomly
Completely randomized design
14
blocking

A disadvantage of complete randomization is that
when variations among the experimental units are
large, the treatment comparisons do not have good
precision. Blocking is an effective way to
reduce experimental error. The experimental
units are divided into more homogeneous groups
called blocks. Better precision can be achieved
by comparing the treatments within blocks.

15
After randomization
Randomized complete block design
16
Wine tasting

Four wines are tasted and evaluated by each of
eight judges.
A unit is one tasting by one judge judges are
blocks. So there are eight blocks and 32 units.
Units within each judge are identified by order
of tasting.

17
(No Transcript)
18

Block what you can and randomize what you
cannot.

Randomization
Blocking
Replication

20
Incomplete block design

7 treatments

Each of ten housewives does four washloads in an
experiment to compare five new detergents.
5 treatments and 10 blocks of size 4.

22
Incomplete block design

7 treatments

23
Incomplete block design

Balanced incomplete block design
Randomize by randomly permuting the block labels
and independently permuting the unit labels
within each block.

24
Two simple block (unit) structures

Nesting
block/unit
Crossing
row column

25
Two simple block structures

Nesting
block/unit
Crossing
row column

Latin square
26
(No Transcript)
27
Wine tasting
28

Simple block structures
Iterated crossing and nesting
cover most, though not all block structures
encountered in practice
Nelder (1965)

29
Consumer testing

A consumer organization wishes to compare 8
brands of
vacuum cleaner. There is one sample for each
brand.
Each of four housewives tests two cleaners in her
home
for a week. To allow for housewife effects, each
housewife
tests each cleaner and therefore takes part in
the trial for 4
weeks.
8 treatments
Block structure

30
Trojan square
31
Treatment structures

No structure
Treatments vs. control
Factorial structure
A fertilizer may be a combination of three
factors (variables) N (nitrogen), P (Phosphate),
K (Potassium)

Treatment structure
Block structure (unit structure)
Design
Randomization
Analysis

33
Choice of design

Efficiency
Combinatorial considerations
Practical considerations

34
McLeod and Brewster (2004) Technometrics

A company was experiencing problems with one of
its
chrome-plating processes in that when a
particular
complex-shaped part was being plated, excessive
pitting and cracking, as well as poor adhesion
and uneven deposition of chrome across the part,
were observed. With the goal being the
identification of key factors affecting the
quality of the process, a screening experiment
was planned.
In collaboration with the companys process
engineers, six
factors were identified for consideration in the
experiment.

Hard-to-vary treatment factors
A chrome concentration
B Chrome to sulfate ratio
C bath temperature
Easy-to-vary treatment factors
p etching current density
q plating current density
r part geometry

The responses included the numbers of pits and
cracks, in addition to hardness and thickness
readings at various locations on the part.
Suppose each of the six factors have two levels,
then there are 64 treatments.
A complete factorial design needs 64
experimental runs

Block structure 4 weeks/4 days/2 runs
Treatment structure A B C p q r
Each of the six factors has two levels
Fractional factorial design

38
Miller (1997) Technometrics

Experimental objective Investigate methods of
reducing the wrinkling of clothes being laundered

Miller (1997)
The experiment is run in 2 blocks and employs
4 washers and 4 driers. Sets of cloth samples
are run through the washers and the samples
are divided into groups such that each group
contains exactly one sample from each washer.
Each group of samples is then assigned to one
of the driers. Once dried, the extent of
wrinkling
on each sample is evaluated.

40
Treatment structure A, B, C, D, E, F
configurations of washersa,b,c,d
configurations of dryers

41

Block structure2 blocks/(4 washers4 dryers)

42
Block 1 Block 2

0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 0 1 1 0
0 0 0 0 0 0 0 0 1 1 0 0 0 1 1 1 0 1 0 1
0 0 0 0 0 0 1 1 0 0 0 0 0 1 1 1 1 0 1 0
0 0 0 0 0 0 1 1 1 1 0 0 0 1 1 1 1 0 0 1
0 1 1 0 1 1 0 0 0 0 0 1 1 1 0 0 0 1 1 0
0 1 1 0 1 1 0 0 1 1 0 1 1 1 0 0 0 1 0 1
0 1 1 0 1 1 1 1 0 0 0 1 1 1 0 0 1 0 1 0
0 1 1 0 1 1 1 1 1 1 0 1 1 1 0 0 1 0 0 1
1 0 1 1 0 1 0 0 0 0 1 0 1 0 1 0 0 1 1 0
1 0 1 1 0 1 0 0 1 1 1 0 1 0 1 0 0 1 0 1
1 0 1 1 0 1 1 1 0 0 1 0 1 0 1 0 1 0 1 0
1 0 1 1 0 1 1 1 1 1 1 0 1 0 1 0 1 0 0 1
1 1 0 1 1 0 0 0 0 0 1 1 0 0 0 1 0 1 1 0
1 1 0 1 1 0 0 0 1 1 1 1 0 0 0 1
0 1 0 1
1 1 0 1 1 0 1 1 0 0 1 1 0 0 0 1
1 0 1 0
1 1 0 1 1 0 1 1 1 1 1 1 0 0 0 1
1 0 0 1

GenStat code
factor nvalue32levels2 block,A,B,C,D,E,F,a,b,
c,d
levels4 wash, dryer
generate block,wash,dryer
blockstructure block/(washdryer)
treatmentstructure
(ABCDEF)(ABCDEF)
(abcd)(abcd)
(ABCDEF)(abcd)

44
matrix rows10 columns5 values b r1
r2 c1 c2" 0, 0, 1, 0, 0, 0, 1, 0, 0, 0,
0, 1, 1, 0, 0, 1, 0, 1, 0, 0, 1, 1,
0, 0, 0, 1, 1, 1, 0, 0, 0, 0, 0, 0,
1, 1, 0, 0, 0, 1, 1, 0, 0, 1, 0,
0, 0, 0, 1, 0 Mkey
45