TwoSample Inference Procedures with Means - PowerPoint PPT Presentation

1 / 24
About This Presentation
Title:

TwoSample Inference Procedures with Means

Description:

The goal of these inference procedures is to ... (xM-xF) 5) = normalcdf(-,5,6,3.471/30) = .0573 ... (xM-xF) = invNorm(.7,6,3.471/30) = 6.33. Assumptions: ... – PowerPoint PPT presentation

Number of Views:40
Avg rating:3.0/5.0
Slides: 25
Provided by: plan168
Category:

less

Transcript and Presenter's Notes

Title: TwoSample Inference Procedures with Means


1
Two-Sample Inference Procedures with Means
2
Two-Sample Procedures with means
  • The goal of these inference procedures is to
    compare the responses to two treatments or to
    compare the characteristics of two populations.
  • We have INDEPENDENT samples from each treatment
    or population

3
Remember
We will be interested in the difference of means,
so we will use this to find standard error.
4
Suppose we have a population of adult
men with a mean height of 71 inches
and standard deviation of 2.6
inches. We also have a population of adult women
with a mean height of 65 inches and standard
deviation of 2.3 inches. Assume heights are
normally distributed. Describe the distribution
of the difference in heights between males and
females (male-female).
Normal distribution with mx-y 6 inches sx-y
3.471 inches
5
s 3.471
6
  • What is the probability that the height of a
    randomly selected man is at most 5 inches taller
    than the height of a randomly selected woman?
  • b) What is the 70th percentile for the difference
    (male-female) in heights of a randomly selected
    man woman?

P((xM-xF) lt 5) normalcdf(-8,5,6,3.471) .3866
(xM-xF) invNorm(.7,6,3.471) 7.82
7
Example
  • What is the probability that the mean height of
    30 men is at most 5 inches taller than the mean
    height of 30 women?
  • ((xM-xF)lt 5) normalcdf(-8,5,6,3.471/v30)
    .0573
  • b) What is the 70th percentile for the difference
    (male-female) in mean heights of 30 men and 30
    women?

(xM-xF) invNorm(.7,6,3.471/v30) 6.33
8
Assumptions
  • Have two SRSs from the populations or two
    randomly assigned treatment groups
  • Samples are independent
  • Both distributions are approximately normally
  • Have large sample sizes
  • Graph BOTH sets of data
  • ss known/unknown

9
Formulas
  • Since in real-life, we will NOT know both ss, we
    will do t-procedures.

10
Degrees of Freedom
  • Option 1 use the smaller of the two values n1
    1 and n2 1
  • This will produce conservative results higher
    p-values lower confidence.
  • Option 2 approximation used by technology

Calculator does this automatically!
11
Confidence intervals
Called standard error
12
Pooled procedures
  • Used for two populations with the same variance
  • When you pool, you average the two-sample
    variances to estimate the common population
    variance.
  • DO NOT use pooling!!!!!

We do NOT know the variances of the population,
so ALWAYS tell the calculator NO for pooling!
13
Two competing headache remedies claim to give
fast-acting relief. An experiment was performed
to compare the mean lengths of time required for
bodily absorption of brand A and brand B. Assume
the absorption time is normally distributed.
Twelve people were randomly selected and given an
oral dosage of brand A. Another 12 were randomly
selected and given an equal dosage of brand B.
The length of time in minutes for the drugs to
reach a specified level in the blood was
recorded. The results follow
mean SD n Brand A 20.1 8.7 12
Brand B 18.9 7.5 12 Describe the shape
standard error for sampling distribution of the
differences in the mean speed of absorption.
(answer on next screen)
14
Describe the sampling distribution of the
differences in the mean speed of
absorption. Find a 95 confidence interval
difference in mean lengths of time required for
bodily absorption of each brand. (answer on next
screen)
Normal distribution with S.E. 3.316
15
Assumptions Have 2 independent randomly assigned
treatments Given the absorption rate is
normally distributed ss unknown
State assumptions!
Think Price is Right! Closest without going
over
Formula calculations
Conclusion in context
From calculator df 21.53, use t for df 21
95 confidence level
We are 95 confident that the true difference in
mean lengths of time required for bodily
absorption of each brand is between 5.685
minutes and 8.085 minutes.
16
Note confidence interval statements
  • Matched pairs refer to mean difference
  • Two-Sample refer to difference of means

17
Hypothesis Statements
  • H0 m1 - m2 0
  • Ha m1 - m2 lt 0
  • Ha m1 - m2 gt 0
  • Ha m1 - m2 ? 0

H0 m1 m2
Be sure to define BOTH m1 and m2!
Ha m1lt m2
Ha m1gt m2
Ha m1 ? m2
18
Hypothesis Test
Since we usually assume H0 is true, then this
equals 0 so we can usually leave it out
19
The length of time in minutes for the drugs to
reach a specified level in the blood was
recorded. The results follow mean SD n
Brand A 20.1 8.7 12 Brand
B 18.9 7.5 12 Is there sufficient evidence
that these drugs differ in the speed at which
they enter the blood stream?
20
Have 2 independent randomly assigned treatments
Given the absorption rate is normally
distributed ss unknown
State assumptions!
Hypotheses define variables!
Where mA is the true mean absorption time for
Brand A mB is the true mean absorption time for
Brand B
Formula calculations
Conclusion in context
Since p-value gt a, I fail to reject H0. There is
not sufficient evidence to suggest that these
drugs differ in the speed at which they enter the
blood stream.
21
Suppose that the sample mean of Brand B is 16.5,
then is Brand B faster?
No, I would still fail to reject the null
hypothesis.
22
Robustness
  • Two-sample procedures are more robust than
    one-sample procedures
  • BEST to have equal sample sizes! (but not
    necessary)

REMEMBER An interval or test is robust if the
confidence level or p-value does not change very
much when the assumptions of the procedure are
violated.
23
A modification has been made to the process for
producing a certain type of time-zero film (film
that begins to develop as soon as the picture is
taken). Because the modification involves extra
cost, it will be incorporated only if sample data
indicate that the modification decreases true
average development time by more than 1 second.
Should the company incorporate the
modification? Original 8.6 5.1 4.5 5.4 6.3 6.6 5.
7 8.5 Modified 5.5 4.0 3.8 6.0 5.8 4.9 7.0 5.7
24
Assume we have 2 independent SRS of film
Both distributions are
approximately normal due to approximately
symmetrical boxplots
ss unknown
H0 mO- mM 1 HamO- mM gt 1
Where mO is the true mean developing time for
original film mM is the true mean developing
time for modified film
Since p-value gt a, I fail to reject H0. There is
not sufficient evidence to suggest that the
company incorporate the modification.
Write a Comment
User Comments (0)
About PowerShow.com