Chapter 3: Producing Data - PowerPoint PPT Presentation

1 / 28
About This Presentation
Title:

Chapter 3: Producing Data

Description:

Fisher found the data from experiments going on for decades to be basically ... line 104. The next three to be selected are Moe, George, and Amy (13, 7, and 2) ... – PowerPoint PPT presentation

Number of Views:84
Avg rating:3.0/5.0
Slides: 29
Provided by: homeE
Category:

less

Transcript and Presenter's Notes

Title: Chapter 3: Producing Data


1
Chapter 3 Producing Data
  • Types of producing data
  • Design of Experiment randomization, blocking
  • Sampling SRS, bias
  • Statistical inference sampling distributions

2
Types of producing data
  • Available data are data that were produced in the
    past for some other purpose but that may help
    answer a present question inexpensively. The
    library and the Internet are sources of available
    data.
  • Some questions require data produced specifically
    to answer them. This leads to designing
    observational or experimental studies.

3
(No Transcript)
4
Terminologies
  • The individuals in an experiment are the
    experimental units. If they are human, we call
    them subjects.
  • In an experiment, we do something to the subject
    and measure the response. The something we do
    is called a treatment, or factor.
  • The factor may be the administration of a drug.
  • One group of people may be placed on a
    diet/exercise program for six months (treatment),
    and their blood pressure (response variable)
    would be compared with that of people who did not
    diet or exercise.

5
  • If the experiment involves giving two different
    doses of a drug, we say that we are testing two
    levels of the factor.
  • A response to a treatment is statistically
    significant if it is larger than you would expect
    by chance (due to random variation among the
    subjects). We will learn how to determine this
    later.
  • In a study of sickle cell anemia, 150 patients
    were given the drug hydroxyurea, and 150 were
    given a placebo (dummy pill). The researchers
    counted the episodes of pain in each subject.
    Identify
  • The subjects
  • The factors / treatments
  • And the response variable

6
The biggest medical experiment in history 1954
field trial of the Salk Poliomyelitis vaccine
  • Polio first appeared in the US in 1916
  • Caused by a virus
  • Responsible for 6 of deaths of children age 5-9
    by 1950s
  • Young children most vulnerable sometimes fatal,
    left many others crippled or dependent on
    respirators
  • Contagious and occurs in unpredictable epidemic
    waves

7
Vaccines
  • Stimulate the production of natural antibodies to
    fight the virus
  • Live-virus use a virus similar to the polio
    virus to cause production of antibodies. But what
    if the vaccine itself causes the disease?
  • Killed-virus safer, but might not stimulate
    sufficient production of antibodies
  • Jonas Salk produced a killed-virus solution that
    was safe and produced high levels of antibodies
    in laboratory tests. But would the vaccine
    actually prevent polio?

Suppose we vaccinate thousands of children across
the country, and the incidence of polio goes down
the following year. Does this provide evidence
that the vaccine prevents polio?
8
Comparative experiments
  • Experiments are comparative in nature We compare
    the response to a treatment to
  • Another treatment,
  • No treatment (a control),
  • A placebo
  • Or any combination of the above
  • A control is a situation when no treatment is
    administered. It serves as a reference mark for
    an actual treatment (e.g., a group of subject
    does not receive any drug or pill of any kind).
  • A placebo is a fake treatment, such as a sugar
    pill. This is to test the hypothesis that the
    response to the actual treatment is due to the
    actual treatment and not to how the subject is
    being taken care of.

9
About the placebo effect
  • The placebo effect is an improvement in health
    due not to any treatment but only to the
    patients belief that he or she will improve.
  • The placebo effect is not understood, but it is
    believed to have therapeutic results on up to 35
    of patients.
  • An opposite, or negative placebo effect, has
    been observed when patients believe their health
    will get worse.

The most famous, and maybe most powerful, placebo
is the kiss, blow, or hugwhatever your
technique. Unfortunately, the effect gradually
disappears once children figure out that they
sometimes get better without help and vice
versa.
10
Comparative experiments
  • The Treatment Group consists of those subjects in
    the study who receive the drug or other
    intervention under study.
  • The Control Group consists of those subjects in
    the study who do not receive the treatment.
  • Vaccinate children in New York city and compare
    them with unvaccinated children in Chicago, all
    in the same year.

11
NFIP experiment design
  • In 1954, National Foundation for Infantile
    Paralysis wanted to test the Salk vaccine.
  • Suppose we vaccinate only children whose parents
    consent, and use those children whose parents do
    not consent as the control group.
  • NFIP vaccinated all children in grade 2 whose
    parents consented, and used children in grades 1
    and 3 as a control.

12
  • We want to treatment and control to differ as
    little as possible, so that the difference in
    response is evidence of the effect of the
    treatment, and not some other lurking variable.
  • Some school districts found flaws in the NFIP
    design and used a different design, trying to
    make the treatment and control groups as similar
    as possible.
  • Children with consent were divided into treatment
    and control groups children without consent were
    excluded from the study.

13
Randomized assignment
  • Impartial and objective
  • With enough subjects, the treatment and control
    groups will be similar with respect to all
    variables (whether or not they have been
    identified)
  • We can quantify our uncertainty about the results
    using methods of statistical inference and
    probability

14
Logic behind the randomized comparative design
  • Randomization produces groups that should be
    similar in all respects before the treatment is
    applied
  • Comparative design ensures that influences other
    than treatment operate equally on both groups
  • Therefore, differences in response must be due
    either to the treatment or to the play of chance
    in the random assignments (use enough
    experimental units to reduce chance variation)
  • Randomization does not eliminate the effects of
    other factors it just allocates them equally to
    the treatment and control groups

15
Caution about experimentation
The design of a study is biased if it
systematically favors certain outcomes.
The best way to exclude biases in an experiment
is to randomize the design. Both the individuals
and treatments are assigned randomly.
16
  • Other ways to remove bias
  • A double-blind experiment is one in which neither
    the subjects nor the experimenter know which
    individuals got which treatment until the
    experiment is completed. The goal is to avoid
    forms of placebo effects and biases in
    interpretation.
  • The best way to make sure your conclusions are
    robust is to replicate your experimentdo it
    over. Replication ensures that particular results
    are not due to uncontrolled factors or errors of
    manipulation.

17
Designing controlled experiments
Sir Ronald FisherThe father of statistics He
was sent to Rothamsted Agricultural Station in
the United Kingdom to evaluate the success of
various fertilizer treatments.
  • Fisher found the data from experiments going on
    for decades to be basically worthless because of
    poor experimental design.
  • Fertilizer had been applied to a field one year
    and not in another in order to compare the yield
    of grain produced in the two years. BUT
  • It may have rained more, or been sunnier, in
    different years.
  • The seeds used may have differed between years as
    well.
  • Or fertilizer was applied to one field and not to
    a nearby field in the same year. BUT
  • The fields might have different soil, water,
    drainage, and history of previous use.
  • ? Too many factors affecting the results were
    uncontrolled.

18
Fishers solution
Randomized comparative experiments
  • In the same field and same year, apply fertilizer
    to randomly spaced plots within the field.
    Analyze plants from similarly treated plots
    together.
  • This minimizes the effect of variation within the
    field in drainage and soil composition on yield,
    as well as controlling for weather.

19
Randomization
  • One way to randomize an experiment is to rely on
    random digits to make choices in a neutral way.
    We can use a table of random digits (like Table
    B) or the random sampling function of a
    statistical software.
  • How to randomly choose n individuals from a group
    of N
  • We first label each of the N individuals with a
    number (typically from 1 to N, or 0 to N - 1)
  • A list of random digits is parsed into digits the
    same length as N (if N 233, then its length is
    3 if N 18, its length is 2).
  • The parsed list is read in sequence and the first
    n digits corresponding to a label in our group of
    N are selected.
  • The n individuals with these labels constitute
    our selection.

20
Using Table B
  • We need to randomly select five students from a
    class of 20.
  • List and number all members of the population,
    which is the class of 20.
  • The number 20 is two digits long.
  • Parse the list of random digits into numbers that
    are two digits long. Here we chose to start with
    line 103 for no particular reason.

45 46 71 17 09 77 55 80 00 95 32 86
32 94 85 82 22 69 00 56
21
45 46 71 17 09 77 55 80 00 95 32 86
32 94 85 82 22 69 00 56
52 71 13 88 89 93 07 46 02
1 Alison 2 Amy 3 Brigitte 4 Darwin 5 Emily 6
Fernando 7 George 8 Harry 9 Henry 10 John 11
Kate 12 Max 13 Moe 14 Nancy 15 Ned 16 Paul 17
Ramon 18 Rupert 19 Tom 20 Victoria
  • Randomly choose five students by reading through
    the list of two-digit random numbers, starting
    with line 103 and on.
  • The first five random numbers matching numbers
    assigned to students make our selection.
  • Remember that 1 is 01, 2 is 02, etc.
  • If you were to hit 17 again before getting five
    people, dont sample Ramon twicejust keep going.

22
Completely randomized designs
Completely randomized experimental
designs Individuals are randomly assigned to
groups, then the groups are randomly assigned to
treatments.
23
Reducing variability by Blocking
  • Responsettt effect random variation
  • Random variation due to other factors (age, sex,
    diet, etc.), measurement errors, etc.
  • A Block is a group of units or subjects that are
    known before the experiment to be similar in some
    way that is expected to affect the units
    response to the treatments. Blocking reduces
    random variation by reducing the random effects
    of one or more factors.

24
Block designs
In a block, or stratified, design, subjects are
divided into groups, or blocks, prior to
experiments to test hypotheses about differences
between the groups. The blocking, or
stratification, here is by gender.
25
Matched pairs designs
Matched pairs Choose pairs of subjects that are
closely matchede.g., same sex, height, weight,
age, and race. Within each pair, randomly assign
who will receive which treatment. It is also
possible to just use a single person, and give
the two treatments to this person over time in
random order. In this case, the matched pair
is just the same person at different points in
time.
26
Boys shoes
  • Measurements of the amount of wear of the soles
    of shoes worn by 10 boys. The shoe soles were
    made of two different synthetic materials, A and
    B.
  • The results overlap extensively and do not at
    first sight suggest that one material is better
    than the other.

27
Boys shoes
  • Important fact the experiments were run in
    pairs. Each boy wore a special pair of shoes, the
    sole of one shoe having been made with A and the
    sole of the other with B.
  • The decision as to whether the left or the right
    sole was made with A or B was determined by the
    flip of a coin. (random assignment)

The difference in wear (B-A) for each boy clearly
indicates that, for these 10 boys, material A
usually shows less wear than B.
28
What experimental design?
A researcher wants to see if there is a
significant difference in resting pulse rates for
men and women. Twenty-eight men and 24 women had
their pulse rate measured at rest in the lab.
  • One factor, two levels (male and female)
  • Stratified random sample (by gender)

Many dairy cows now receive injections of BST, a
hormone intended to spur greater milk production.
The milk production of 60 Ayrshire dairy cows was
recorded before and after they received a first
injection of BST.
  • SRS of 60 cows
  • Match pair design (before and after)
Write a Comment
User Comments (0)
About PowerShow.com