Title: Estimating with Confidence!
1do first
2calculator
3Estimating with Confidence!
- date 1/30
- hw 10.1, 10.2, 10.3
4hw 10.1, 10.2, 10.3
5hw 10.1, 10.2, 10.3
6the Relay problem
7the CLT
8statistical confidence
- 25 NHS students were randomly interviewed about
their sleep patterns. Their mean sleep time was 6
hours. From this SRS of n25, what can be said
about the mean sleep time u of NHS students
(population) - the sample mean __ has a ___ distr.
- its mean is __
- the s.d. of ___ is ____ ____
9GRAPH the sampling distribution of mean sleep
time of SRS (n25)
- suppose the pop s.d. is 1 hr.
10the sampling distribution of mean sleep time of
SRS (n25)
- the 68-95-99.7 rule says that in about __ of all
samples, __ will be within __ s.d. of ___ ___ (u) - in 95 of all samples, the unknown u lies between
_____ and _____. - since our sample mean __ 6, we say we are 95
confident that the unknown NHS mean sleep time
lies between ___ and ___.
1195 confidence interval for u
- We got these numbers by a method that gives
correct results 95 of the time. - form estimate margin of error
- estimate (critical value)(standard error of the
estimate).
12applet
13calculator activity
6
6.2
6.3
6.4
6.5
6.6
5.8
5.7
5.6
5.5
5.4
6.7
6.1
5.9
- sample 25 from N(6,1) pop randNorm(6,1,25)
- find sample mean (stat calc 1)
- construct 95 interval for u
- construct nine more confidence intervals and use
Zinterval (Stat-Tests) - how many contain the true pop mean?
- how many would you expect?
14Confidence Intervals
- date 1/31
- hw 10.5, 10.6, 10.7
15homework
- 10.3
- 95 CI for the mean male score
- 95 of all men have scores between __ and __.
population
sampling distribution
95 of all samples give an interval that
contains the pop mean u
16confidence intervals
- A confidence interval has 2 parts
- an interval
- a confidence level, C
- a 95 confidence level C .95
C
17A 95 confidence interval
18confidence levels
Confidence Level Tail Area
90
95
99
19TABLE C
20Critical values
- is the upper p critical value
probability p
21confidence interval for a population mean
22example NAILS
- take SRS size 5 from Pop (class)
- expect nail length to be ____
- plot the data
- assume s.d. of population is __mm
- give a 90 CI for the mean nail length
- would your result differ is nails came from only
women?
23larger samples give shorter intervals
n 20
n 5
24quiz 10.1B
25LAB CIs
26data semester grades
27LAB CIs
- http//www.cvgs.k12.va.us/DIGSTATS/main/descriptv/
d_confidence.htm
28- Suppose we are in an orchard which has twenty
apple trees, and we place ten buckets underneath
each tree in order to catch falling apples. Let's
consider what happens to one tree.From the ten
buckets, we find a sample mean (X-bar total
number of apples/ten buckets) of 6 apples per
bucket, with a standard deviation (s) of 2.8
apples. If we want to be 95 confident that the
mean number of apples per bucket in the orchard
is within a range based upon our one tree,
assuming the collection of apples in each bucket
is normally distributed,
29we get a confidence interval of
- So based upon our sample and calculations, we are
95 confident that the mean number of apples per
bucket for all the buckets in the orchard is
between ___ and ___.
30Suppose we used the same level of confidence for
all calculations and calculated a confidence
interval for each tree in the orchard.
31LAB How CIs behave
- date 2/1
- hw 10.9, 10.12,
- 10.13, 10.14
32semester grades
- data set (semester 1 grades)
- plot the population
- pop mean ___ pop s.d.___
- describe the shape
- sample (SRS) 4 scores
- write the sample mean ___ ___
- write the confidence level ___ ____
- write the critical value _______
- write the standard error SE ____
- write the margin of error, ME ____
- write the 90 CI for the true pop mean is
- ___ _____ or ( , )
- when you increase the confidence level, the ME
______, so the CI gets ______ - when you increase the sample size (try n16), the
ME ____, so the CI gets _____
33penny activity what happened
34homework p519 10.6
35Confidence intervals
- you choose the confidence level
- we want high confidence and small margin of error
- high confidence
- small margin of error
36the margin of error gets smaller when
- gets smaller
- gets smaller
- n gets larger
37- Confidence intervals get wider as the confidence
increases - Confidence intervals get narrower as sample size
increases
C.90
n5
C.95
n20
C.99
n80
38(No Transcript)
39applet confidence intervals
40- I am 99 confident that the true mean is between
___ and ___.
41video CI
42- A guy notices a bunch of targets scattered over a
barn wall, and in the center of each, in the
"bulls-eye," is a bullet hole. "Wow," he says to
the farmer, "that's pretty good shooting. How'd
you do it?" "Oh," says the farmer, "it was easy.
I painted the targets after I shot the holes."
43- Confidence intervals are a little like that.
After we make a point estimate (a bullet hole),
we are going to draw a target (an interval)
around the point and state the probability that
real objective is in the target area. The wider
the target, the greater the probability, as you'd
expect. Also, the more accurate the shooting, the
greater the probability. - To avoid misleading you, we should re-phrase the
joke. Suppose there's only one target painted on
the barn, and all of the bullet holes are within
it, even though they're not all in the center.
It's still pretty good shooting, and the better
the shooting, the smaller the target needs to be
to encompass them all. - For the reasons we discussed in the last section,
the accuracy of the shooting depends on how many
samples we took and on how variable they are - If they're all over the place, there's no reason
to be very confident in their mean. - If they're concentrated in one area, there's good
reason to think that the real objective is in
that area.
44activity
6
6.2
6.3
6.4
6.5
6.6
5.8
5.7
5.6
5.5
5.4
6.7
6.1
5.9
6
6.2
6.3
6.4
6.5
6.6
5.8
5.7
5.6
5.5
5.4
6.7
6.1
5.9
6
6.2
6.3
6.4
6.5
6.6
5.8
5.7
5.6
5.5
5.4
6.7
6.1
5.9
45choosing the sample size
- wanted high confidence - small margin of error
- question how large a sample do I need to be
within ___ of the true mean with ___ confidence?
46(No Transcript)
47some algebra.
- A Subaru dealer wants to find out the age of
their customers (for advertising purposes).
They want the margin of error to be 3 years old.
If they want a 90 confidence interval, how many
people do they need to know about? - Hence the dealer should survey at least 52
people.
48some cautions
- date 2-2
- hw p526 10.16-10.18
49estimating a normal mean
- data from an SRS from pop
- outliers affect CI (sample mean is sensitive)
- if n is small and pop is not normal confidence
level C is different. - the pop s.d. is known (or sample is large)
- undercoverage, nonresponse may be present
50interpreting CI
- we are 95 confident that the true mean ___ lies
between ___ and ___ - 95 of the population lies between __ and __
- the method gives correct results in 95 of all
possible samples. - the probability is 95 that the true mean falls
between ___ and ___
51tests of significance
52- CI estimate u
- significance test check a claim about u
53- A coach says the average weight an NHS student
can lift is of 30 kilograms. You took a sample of
100 NHS students. The average weight lifted of
this sample was 28 kilos, with a standard
deviation of 12 kilos. Does this enable you to
dismiss the coaches claims?
http//www.bized.ac.uk/timeweb/crunching/crunch_ex
periment_reviewb.htm
54test of significance
- does our low sample mean __ mean the coach is
wrong - OR
-
- could we have gotten a sample mean __ easily by
chance
55the null hypothesis
- H0 u 30
- Ha u lt 30
- If the null is true, is our result surprisingly
low. If yes, thats evidence against the null and
in favor of the alternative hypothesis.
56hypothesis testing
- Most of you do not know that when Santa was a
young man he had to take a statistics course.
When the class started covering hypothesis tests,
he had a lot of trouble remembering where to put
the equal sign. He started repeating to himself
"The equal sign goes in the null hypothesis. The
equal sign goes in the null hypothesis. The equal
sign goes in the null hypothesis." - Eventually Santa had to shorten this phrase to
make it easier to remember. In fact to this day
you can still hear him say "Ho, Ho, Ho."
57If the null is true, is our result surprisingly
low.
- sampling distribution of ___
58the p-value
- sampling distr of __ when u is __
p-value
p-value is the prob of getting a result at least
as extreme as the one we got.
59statistically significant
- small p-values are evidence against H0
- RULE OF THUMB a p-value below .05 is called
statistically significant. - its unlikely to get a result like ours by chance
alone.
60one-sided vs. two-sidedalternatives
- Suppose we wanted to test a manufacturers claim
that there are, on average, 50 matches in a box.
it would be useful to know if there is likely to
be____ than 50 matches, on average, in a box. - Now lets say nothing specific can be said about
the average number of matches in a box only
that, if we could reject the null hypothesis in
our test, we would know that the average number
of matches in a box is likely to be less than or
greater than 50.
61(No Transcript)
62One-Sided vs. Two-Sided Tests
- Does the iron concentration of an iron ore exceed
30 ? - Is the pizza you ordered from the pizza service
well done (it could be raw, or burnt, or well
done)?
63activity
- http//exploringdata.cqu.edu.au/stem_ws1.htm
64P-values and statistical significance
65test statistic
- a stat that estimates the parameter in H0
- H0 u 30
- Ha u lt 30
- ___ estimates ___
p-value
p-value if the null were true
66comment on this p-value
67comment on his p-value
68- Ms. Lisa Monnin is the budget director for the
New Process Company. She would like to compare
the daily travel expenses for the sales staff and
the audit staff. She collected the following
sample information. - At the .10 significance level, can she conclude
that the mean daily expenses are greater for the
sales staff than the audit staff? What is the
p-value? - Sales () 131
135 146 165 136 142 - Audit () 130 102 129
143 149 120 139 - Having problems finding the p-value unsure of
the formula.Kathy
69the significance level
- significance level alpha level __
- how much evidence do we need (how low a p-value)
to reject a claim (reject the null) - common alpha levels
- __ .05
- __ .01
- if p-value __ alpha level, we say the data are
statistically significant and we have evidence to
___ the null.
70worksheet
- state the parameter
- state the null and alternative hypothesis
- is the alternative one-sided or 2 sided?
- write the test statistic
- graph the distribution when H0 is true
- the mean of sampling distr is ___ sd ___
- write the z test statistic
- find the p-value
- what is the alpha (significance) level?
- is the result statistically significant at alpha
.05? - is the result statistically significant at alpha
.01? - write your conclusion interpret the p-value
71PROBLEMS
- coffee sales 10.35
- executives blood pressure p573
72tests for a population mean
73form
- estimated value - hypothesized value.
- standard error of the estimate
74how to test
- define your parameter
- state your hypotheses
- calculate the test statistic
- find the p-value
- interpretation and conclusiton.
75one sided alternative
- hypotheses
- H0 u 30
- Ha u 30
- compute the z test statistic
- find the P-value P(Z z)
-
76two-sided alternative
- hypotheses
- H0 u 30
- Ha u 30
- compute the z test statistic
- find the P-value 2P(Z z)
-
77what tests do
- tests of significance assess the evidence against
H0. We never say we have clear evidence that H0
is true.
78(No Transcript)
79(No Transcript)
80(No Transcript)
81(No Transcript)
82(No Transcript)
83(No Transcript)
84(No Transcript)
85(No Transcript)
86(No Transcript)
87(No Transcript)
88tests with fixed significance levelstests from
CIs/
89how much evidence do I need to reject the null?
- a level of significance alpha tells you how much
- If P lt alpha the outcome of a test is significant
- you can decide whether a result is significant
without calculating p - compare your z test statistic to the critical
value from Table C
90CI and 2-sided tests
- a 2-sided significance test can be carried out
from a CI with C 1-alpha - p55510.44
radon
91- Example Students analyze data reported in The
1992 Statistical Abstract of the United States
that 30.5 of a sample of 40,000 American
households own a pet cat (see 19). We ask
whether this sample provides strong evidence that
less than one-third of the population of all
American households owns a cat and then whether
it provides evidence that much less than
one-third owns a cat. A significance test answers
the first question in the affirmative (p-value lt
.0001), but a confidence interval supplies the
additional information needed to answer the
second question in the negative (95 c.i. (.300,
.310)). Students discern that large sample sizes
can often lead to statistically significant
results that are not practically significant.
92using significance tests
93choosing a level of significance
- do this if you must make a decision
- if H0 is an assumption that people have held for
years, choose (small, large) alpha - if rejecting H0 will lead to dramatic changes in
your product, choose (small, large) alpha
94example SAT prep
95about p-value
- use the p-value if you want to describe the
strength of your evidence - P-value .0049 vs. P-value .051
- with large samples even tiny deviations from the
null will be significant - statistical significance is not the same as
practical significance.
96antibacterial cream
97beware multiple analyses
- running 1 test and reaching alpha .05 level is
ok evidence that you have found something
significant. Running 20 tests and finding 1
reaching the alpha .05 level is not.
98beware multiple analyses
- ESP test
- A standard run uses 25 cards. With only 25 cards,
you must obtain at least 9 hits (correct guesses)
to give a significant result (plt.05) showing
possible evidence of ESP.
99(No Transcript)
100(No Transcript)
101teaching methods
- http//www.rossmanchance.com/papers/topten.html
102old faithful
- Example We provide students with data on times
between eruptions (in minutes) for the Old
Faithful geyser, originally reported in 2 and
also presented in 4 and 12. If students
merely calculate a confidence interval for the
population mean intereruption time (95 c.i.
(70.73, 73.90)) without inspecting the data
first, they fail to notice the pronounced bimodal
nature of the data with peaks around 55 and 78
minutes. With this realization students are able
to describe the inter-eruption times more
effectively.
103monty hall
- Example In an activity illustrating the famous
"Monty Hall Problem," we give three playing
cards, two red and one black, to pairs of
students. The cards represent prizes behind doors
used in a game show, one a winner (black) and two
losers (red). One student (dealer) shuffles the
cards and holds them facing away from the other
(contestant). The contestant chooses a card and
the dealer reveals one of the two remaining cards
to be red. The contestant is then asked to either
switch to the remaining card or stay with the
original choice, in an effort to find the
(winning) black card. We have the students play
this game 20 times and perform a test of
significance to see if the proportion of wins
using the switching strategy is different from
1/2. Most students fail to reject this hypothesis
(power .152). Since 1/2 agrees with many
students intuition, they do not find this result
surprising. However, if they continue to play the
game or reason probabilistically, they discover
that the actual probability of winning with the
switch strategy is 2/3. Thus, they learn that the
original sample size of 20 was not large enough
to enable them to detect this difference.
104hand span activity
105inference as decision
106gamehttp//www.stat.psu.edu/resources/InLarge/wl
h_01.htm
107- significance tests H0
- 1. evidence against H0 p-value
- 2. make a decision based on a fixed level, alpha
if we reject H0 we accept Ha
108Type 1 and Type 2 errors
- Type 1 reject
- Type 2 not reject
- applet
109examples
- Type 1 In other words, it's the rate of false
alarms or false positives. - Type 2 In other words, it's the rate of failed
alarms or false negatives.
110(No Transcript)
111Type 1 and Type 2 errors
H0 is true
Ha is true
dec i s I o n
ACCEPT H0
REJECT H0
112(No Transcript)
113beans
- H0 the sample of beans meets standards
- Ha the sample of beans does not meet standards.
- explain the 2 types of incorrect decisions.
- we can accept a bad bunch
- we can reject a good bunch
- which is more serious?
- which side are you on? consumer? producer?
114example problem
115significance and type 1 error
- the significance level __ P (type I error)
- to find Type II error, we need to know the value
of a particular alternative - a high type II error for a particular alternative
means the test is not sensitive enough to detect
that alternative
116example, continued
117power of a test
- The power of a test is the probability of
choosing the alternative hypothesis when the
alternative hypothesis is correct. - power 1 P (type II error)
118factors Influencing Power
- Alpha (a). Thus, all other things being equal,
using an alpha of .05 will result in a more
powerful test than using an alpha of .01. - Sample Size (N). The bigger the sample (i.e., the
more work we do), the more powerful the test. - Variability. Generally speaking, variability in
the sample and/or population results in a less
powerful test.