Title: Two-Sample Inference Procedures with Means
1Two-Sample Inference Procedures with Means
2Remember
We will be interested in the difference of means,
so we will use this to find standard error.
3Suppose we have a population of adult
men with a mean height of 71 inches
and standard deviation of 2.6
inches. We also have a population of adult women
with a mean height of 65 inches and standard
deviation of 2.3 inches. Assume heights are
normally distributed. Describe the distribution
of the difference in heights between males and
females (male-female).
Normal distribution with mx-y 6 inches sx-y
3.471 inches
4s 3.471
5- What is the probability that the height of a
randomly selected man is at most 5 inches taller
than the height of a randomly selected woman? - b) What is the 70th percentile for the difference
(male-female) in heights of a randomly selected
man woman?
P((xM-xF) lt 5) normalcdf(-8,5,6,3.471) .3866
(xM-xF) invNorm(.7,6,3.471) 7.82
6Do calculator simulation!
- a) What is the probability that the mean height
of 30 men is at most 5 inches taller than the
mean height of 30 women? - b) What is the 70th percentile for the difference
(male-female) in mean heights of 30 men and 30
women?
Â
7Two-Sample Procedures with means
When we compare, what are we interested in?
- The goal of these inference procedures is to
compare the responses to two treatments or to
compare the characteristics of two populations. - We have INDEPENDENT samples from each treatment
or population
8Assumptions
- Have two SRSs from the populations or two
randomly assigned treatment groups - Samples are independent
- Both distributions are approximately normal
- Have large sample sizes
- Graph BOTH sets of data
- ss unknown
9Formulas
- Since in real-life, we will NOT know both ss, we
will do t-procedures.
10Degrees of Freedom
- Option 1 use the smaller of the two values n1
1 and n2 1 - This will produce conservative results higher
p-values lower confidence. - Option 2 approximation used by technology
Calculator does this automatically!
11Confidence intervals
Called standard error
12Pooled procedures
- Used for two populations with the same variance
- When you pool, you average the two-sample
variances to estimate the common population
variance. - DO NOT use on AP Exam!!!!!
We do NOT know the variances of the population,
so ALWAYS tell the calculator NO for pooling!
13Two competing headache remedies claim to give
fast-acting relief. An experiment was performed
to compare the mean lengths of time required for
bodily absorption of brand A and brand B. Assume
the absorption time is normally distributed.
Twelve people were randomly selected and given an
oral dosage of brand A. Another 12 were randomly
selected and given an equal dosage of brand B.
The length of time in minutes for the drugs to
reach a specified level in the blood was
recorded. The results follow
mean SD n Brand A 20.1 8.7 12
Brand B 18.9 7.5 12 Describe the shape
standard error for sampling distribution of the
differences in the mean speed of absorption.
(answer on next screen)
14Describe the sampling distribution of the
differences in the mean speed of
absorption. Find a 95 confidence interval
difference in mean lengths of time required for
bodily absorption of each brand.
Normal distribution with S.E. 3.316
15Note confidence interval statements
- Matched pairs refer to mean difference
- Two-Sample refer to difference of means
16Hypothesis Statements
- H0 m1 - m2 0
- Ha m1 lt m2 lt 0
- Ha m1 gt m2 gt 0
- Ha m1 ? m2 ? 0
H0 m1 m2
Be sure to define BOTH m1 and m2!
17Hypothesis Test
Since we usually assume H0 is true, then this
equals 0 so we can usually leave it out
18The length of time in minutes for the drugs to
reach a specified level in the blood was
recorded. The results follow mean SD n
Brand A 20.1 8.7 12 Brand
B 18.9 7.5 12 Is there sufficient evidence
that these drugs differ in the speed at which
they enter the blood stream?
19Have 2 independent randomly assigned treatments
Given the absorption rate is normally
distributed ss unknown
State assumptions!
Hypotheses define variables!
Where mA is the true mean absorption time for
Brand A mB is the true mean absorption time for
Brand B
On Calc Go to Stat Test Input Statistics No
Pooling Calculate
Formula calculations
Conclusion in context
Since p-value gt a, I fail to reject H0. There is
not sufficient evidence to suggest that these
drugs differ in the speed at which they enter the
blood stream.
20Suppose that the sample mean of Brand B is 16.5,
then is Brand B faster?
No, I would still fail to reject the null
hypothesis.
21Robustness
- Two-sample procedures are more robust than
one-sample procedures - BEST to have equal sample sizes! (but not
necessary)
22A modification has been made to the process for
producing a certain type of time-zero film (film
that begins to develop as soon as the picture is
taken). Because the modification involves extra
cost, it will be incorporated only if sample data
indicate that the modification decreases true
average development time by more than 1 second.
Should the company incorporate the
modification? Original 8.6 5.1 4.5 5.4 6.3 6.6 5.
7 8.5 Modified 5.5 4.0 3.8 6.0 5.8 4.9 7.0 5.7
23Assume we have 2 independent SRS of film
Both distributions are
approximately normal due to approximately
symmetrical boxplots
ss unknown
H0 mO- mM 1 HamO- mM gt 1
Where mO is the true mean developing time for
original film mM is the true mean developing
time for modified film
Since p-value gt a, I fail to reject H0. There is
not sufficient evidence to suggest that the
company incorporate the modification.