Estimating with Confidence! - PowerPoint PPT Presentation

1 / 118
About This Presentation
Title:

Estimating with Confidence!

Description:

calculator Estimating with Confidence! date: 1/30 hw 10.1, 10.2, 10.3 the Relay problem the CLT statistical confidence 25 NHS students were randomly interviewed about ... – PowerPoint PPT presentation

Number of Views:247
Avg rating:3.0/5.0
Slides: 119
Provided by: JohnC209
Category:

less

Transcript and Presenter's Notes

Title: Estimating with Confidence!


1
do first
2
calculator
3
Estimating with Confidence!
  • date 1/30
  • hw 10.1, 10.2, 10.3

4
hw 10.1, 10.2, 10.3
5
hw 10.1, 10.2, 10.3
6
the Relay problem
7
the CLT
8
statistical confidence
  • 25 NHS students were randomly interviewed about
    their sleep patterns. Their mean sleep time was 6
    hours. From this SRS of n25, what can be said
    about the mean sleep time u of NHS students
    (population)
  • the sample mean __ has a ___ distr.
  • its mean is __
  • the s.d. of ___ is ____ ____

9
GRAPH the sampling distribution of mean sleep
time of SRS (n25)
  • suppose the pop s.d. is 1 hr.

10
the sampling distribution of mean sleep time of
SRS (n25)
  • the 68-95-99.7 rule says that in about __ of all
    samples, __ will be within __ s.d. of ___ ___ (u)
  • in 95 of all samples, the unknown u lies between
    _____ and _____.
  • since our sample mean __ 6, we say we are 95
    confident that the unknown NHS mean sleep time
    lies between ___ and ___.

11
95 confidence interval for u
  • We got these numbers by a method that gives
    correct results 95 of the time.
  • form estimate margin of error
  • estimate (critical value)(standard error of the
    estimate).

12
applet
13
calculator activity
6
6.2
6.3
6.4
6.5
6.6
5.8
5.7
5.6
5.5
5.4
6.7
6.1
5.9
  • sample 25 from N(6,1) pop randNorm(6,1,25)
  • find sample mean (stat calc 1)
  • construct 95 interval for u
  • construct nine more confidence intervals and use
    Zinterval (Stat-Tests)
  • how many contain the true pop mean?
  • how many would you expect?

14
Confidence Intervals
  • date 1/31
  • hw 10.5, 10.6, 10.7

15
homework
  • 10.3
  • 95 CI for the mean male score
  • 95 of all men have scores between __ and __.

population
sampling distribution
95 of all samples give an interval that
contains the pop mean u
16
confidence intervals
  • A confidence interval has 2 parts
  • an interval
  • a confidence level, C
  • a 95 confidence level C .95

C
17
A 95 confidence interval
18
confidence levels
Confidence Level Tail Area
90
95
99
19
TABLE C
20
Critical values
  • is the upper p critical value

probability p
21
confidence interval for a population mean
22
example NAILS
  • take SRS size 5 from Pop (class)
  • expect nail length to be ____
  • plot the data
  • assume s.d. of population is __mm
  • give a 90 CI for the mean nail length
  • would your result differ is nails came from only
    women?

23
larger samples give shorter intervals
n 20
n 5
24
quiz 10.1B
25
LAB CIs
26
data semester grades
27
LAB CIs
  • http//www.cvgs.k12.va.us/DIGSTATS/main/descriptv/
    d_confidence.htm

28
  • Suppose we are in an orchard which has twenty
    apple trees, and we place ten buckets underneath
    each tree in order to catch falling apples. Let's
    consider what happens to one tree.From the ten
    buckets, we find a sample mean (X-bar total
    number of apples/ten buckets) of 6 apples per
    bucket, with a standard deviation (s) of 2.8
    apples. If we want to be 95 confident that the
    mean number of apples per bucket in the orchard
    is within a range based upon our one tree,
    assuming the collection of apples in each bucket
    is normally distributed,

29
we get a confidence interval of
  • So based upon our sample and calculations, we are
    95 confident that the mean number of apples per
    bucket for all the buckets in the orchard is
    between ___ and ___.

30
Suppose we used the same level of confidence for
all calculations and calculated a confidence
interval for each tree in the orchard.
31
LAB How CIs behave
  • date 2/1
  • hw 10.9, 10.12,
  • 10.13, 10.14

32
semester grades
  • data set (semester 1 grades)
  • plot the population
  • pop mean ___ pop s.d.___
  • describe the shape
  • sample (SRS) 4 scores
  • write the sample mean ___ ___
  • write the confidence level ___ ____
  • write the critical value _______
  • write the standard error SE ____
  • write the margin of error, ME ____
  • write the 90 CI for the true pop mean is
  • ___ _____ or ( , )
  • when you increase the confidence level, the ME
    ______, so the CI gets ______
  • when you increase the sample size (try n16), the
    ME ____, so the CI gets _____

33
penny activity what happened
34
homework p519 10.6
35
Confidence intervals
  • you choose the confidence level
  • we want high confidence and small margin of error
  • high confidence
  • small margin of error

36
the margin of error gets smaller when
  • gets smaller
  • gets smaller
  • n gets larger

37
  • Confidence intervals get wider as the confidence
    increases
  • Confidence intervals get narrower as sample size
    increases

C.90
n5
C.95
n20
C.99
n80
38
(No Transcript)
39
applet confidence intervals
40
  • I am 99 confident that the true mean is between
    ___ and ___.

41
video CI
42
  • A guy notices a bunch of targets scattered over a
    barn wall, and in the center of each, in the
    "bulls-eye," is a bullet hole. "Wow," he says to
    the farmer, "that's pretty good shooting. How'd
    you do it?" "Oh," says the farmer, "it was easy.
    I painted the targets after I shot the holes."

43
  • Confidence intervals are a little like that.
    After we make a point estimate (a bullet hole),
    we are going to draw a target (an interval)
    around the point and state the probability that
    real objective is in the target area. The wider
    the target, the greater the probability, as you'd
    expect. Also, the more accurate the shooting, the
    greater the probability.
  • To avoid misleading you, we should re-phrase the
    joke. Suppose there's only one target painted on
    the barn, and all of the bullet holes are within
    it, even though they're not all in the center.
    It's still pretty good shooting, and the better
    the shooting, the smaller the target needs to be
    to encompass them all.
  • For the reasons we discussed in the last section,
    the accuracy of the shooting depends on how many
    samples we took and on how variable they are
  • If they're all over the place, there's no reason
    to be very confident in their mean.
  • If they're concentrated in one area, there's good
    reason to think that the real objective is in
    that area.

44
activity
6
6.2
6.3
6.4
6.5
6.6
5.8
5.7
5.6
5.5
5.4
6.7
6.1
5.9
6
6.2
6.3
6.4
6.5
6.6
5.8
5.7
5.6
5.5
5.4
6.7
6.1
5.9
6
6.2
6.3
6.4
6.5
6.6
5.8
5.7
5.6
5.5
5.4
6.7
6.1
5.9
45
choosing the sample size
  • wanted high confidence - small margin of error
  • question how large a sample do I need to be
    within ___ of the true mean with ___ confidence?

46
(No Transcript)
47
some algebra.
  • A Subaru dealer wants to find out the age of
    their customers  (for advertising purposes).
     They want the margin of error to be 3 years old.
     If they want a 90 confidence interval, how many
    people do they need to know about?
  • Hence the dealer should survey at least 52
    people.

48
some cautions
  • date 2-2
  • hw p526 10.16-10.18

49
estimating a normal mean
  • data from an SRS from pop
  • outliers affect CI (sample mean is sensitive)
  • if n is small and pop is not normal confidence
    level C is different.
  • the pop s.d. is known (or sample is large)
  • undercoverage, nonresponse may be present

50
interpreting CI
  • we are 95 confident that the true mean ___ lies
    between ___ and ___
  • 95 of the population lies between __ and __
  • the method gives correct results in 95 of all
    possible samples.
  • the probability is 95 that the true mean falls
    between ___ and ___

51
tests of significance
52
  • CI estimate u
  • significance test check a claim about u

53
  • A coach says the average weight an NHS student
    can lift is of 30 kilograms. You took a sample of
    100 NHS students. The average weight lifted of
    this sample was 28 kilos, with a standard
    deviation of 12 kilos. Does this enable you to
    dismiss the coaches claims?

http//www.bized.ac.uk/timeweb/crunching/crunch_ex
periment_reviewb.htm
54
test of significance
  • does our low sample mean __ mean the coach is
    wrong
  • OR
  • could we have gotten a sample mean __ easily by
    chance

55
the null hypothesis
  • H0 u 30
  • Ha u lt 30
  • If the null is true, is our result surprisingly
    low. If yes, thats evidence against the null and
    in favor of the alternative hypothesis.

56
hypothesis testing
  • Most of you do not know that when Santa was a
    young man he had to take a statistics course.
    When the class started covering hypothesis tests,
    he had a lot of trouble remembering where to put
    the equal sign. He started repeating to himself
    "The equal sign goes in the null hypothesis. The
    equal sign goes in the null hypothesis. The equal
    sign goes in the null hypothesis."
  • Eventually Santa had to shorten this phrase to
    make it easier to remember. In fact to this day
    you can still hear him say "Ho, Ho, Ho."

57
If the null is true, is our result surprisingly
low.
  • sampling distribution of ___

58
the p-value
  • sampling distr of __ when u is __

p-value
p-value is the prob of getting a result at least
as extreme as the one we got.
59
statistically significant
  • small p-values are evidence against H0
  • RULE OF THUMB a p-value below .05 is called
    statistically significant.
  • its unlikely to get a result like ours by chance
    alone.

60
one-sided vs. two-sidedalternatives
  • Suppose we wanted to test a manufacturers claim
    that there are, on average, 50 matches in a box.
    it would be useful to know if there is likely to
    be____ than 50 matches, on average, in a box.
  • Now lets say nothing specific can be said about
    the average number of matches in a box only
    that, if we could reject the null hypothesis in
    our test, we would know that the average number
    of matches in a box is likely to be less than or
    greater than 50.

61
(No Transcript)
62
One-Sided vs. Two-Sided Tests
  • Does the iron concentration of an iron ore exceed
    30 ?
  • Is the pizza you ordered from the pizza service
    well done (it could be raw, or burnt, or well
    done)?

63
activity
  • http//exploringdata.cqu.edu.au/stem_ws1.htm

64
P-values and statistical significance
65
test statistic
  • a stat that estimates the parameter in H0
  • H0 u 30
  • Ha u lt 30
  • ___ estimates ___

p-value
p-value if the null were true
66
comment on this p-value
67
comment on his p-value
68
  • Ms. Lisa Monnin is the budget director for the
    New Process Company. She would like to compare
    the daily travel expenses for the sales staff and
    the audit staff. She collected the following
    sample information.
  • At the .10 significance level, can she conclude
    that the mean daily expenses are greater for the
    sales staff than the audit staff? What is the
    p-value?
  • Sales ()                       131      
    135       146       165       136       142
  • Audit ()           130       102       129      
    143       149       120       139
  • Having problems finding the p-value unsure of
    the formula.Kathy

69
the significance level
  • significance level alpha level __
  • how much evidence do we need (how low a p-value)
    to reject a claim (reject the null)
  • common alpha levels
  • __ .05
  • __ .01
  • if p-value __ alpha level, we say the data are
    statistically significant and we have evidence to
    ___ the null.

70
worksheet
  • state the parameter
  • state the null and alternative hypothesis
  • is the alternative one-sided or 2 sided?
  • write the test statistic
  • graph the distribution when H0 is true
  • the mean of sampling distr is ___ sd ___
  • write the z test statistic
  • find the p-value
  • what is the alpha (significance) level?
  • is the result statistically significant at alpha
    .05?
  • is the result statistically significant at alpha
    .01?
  • write your conclusion interpret the p-value

71
PROBLEMS
  • coffee sales 10.35
  • executives blood pressure p573

72
tests for a population mean
73
form
  • estimated value - hypothesized value.
  • standard error of the estimate

74
how to test
  • define your parameter
  • state your hypotheses
  • calculate the test statistic
  • find the p-value
  • interpretation and conclusiton.

75
one sided alternative
  • hypotheses
  • H0 u 30
  • Ha u 30
  • compute the z test statistic
  • find the P-value P(Z z)

76
two-sided alternative
  • hypotheses
  • H0 u 30
  • Ha u 30
  • compute the z test statistic
  • find the P-value 2P(Z z)

77
what tests do
  • tests of significance assess the evidence against
    H0. We never say we have clear evidence that H0
    is true.

78
(No Transcript)
79
(No Transcript)
80
(No Transcript)
81
(No Transcript)
82
(No Transcript)
83
(No Transcript)
84
(No Transcript)
85
(No Transcript)
86
(No Transcript)
87
(No Transcript)
88
tests with fixed significance levelstests from
CIs/
89
how much evidence do I need to reject the null?
  • a level of significance alpha tells you how much
  • If P lt alpha the outcome of a test is significant
  • you can decide whether a result is significant
    without calculating p
  • compare your z test statistic to the critical
    value from Table C

90
CI and 2-sided tests
  • a 2-sided significance test can be carried out
    from a CI with C 1-alpha
  • p55510.44

radon
91
  • Example Students analyze data reported in The
    1992 Statistical Abstract of the United States
    that 30.5 of a sample of 40,000 American
    households own a pet cat (see 19). We ask
    whether this sample provides strong evidence that
    less than one-third of the population of all
    American households owns a cat and then whether
    it provides evidence that much less than
    one-third owns a cat. A significance test answers
    the first question in the affirmative (p-value lt
    .0001), but a confidence interval supplies the
    additional information needed to answer the
    second question in the negative (95 c.i. (.300,
    .310)). Students discern that large sample sizes
    can often lead to statistically significant
    results that are not practically significant.

92
using significance tests
93
choosing a level of significance
  • do this if you must make a decision
  • if H0 is an assumption that people have held for
    years, choose (small, large) alpha
  • if rejecting H0 will lead to dramatic changes in
    your product, choose (small, large) alpha

94
example SAT prep
95
about p-value
  • use the p-value if you want to describe the
    strength of your evidence
  • P-value .0049 vs. P-value .051
  • with large samples even tiny deviations from the
    null will be significant
  • statistical significance is not the same as
    practical significance.

96
antibacterial cream
  • p588

97
beware multiple analyses
  • running 1 test and reaching alpha .05 level is
    ok evidence that you have found something
    significant. Running 20 tests and finding 1
    reaching the alpha .05 level is not.

98
beware multiple analyses
  • ESP test
  • A standard run uses 25 cards. With only 25 cards,
    you must obtain at least 9 hits (correct guesses)
    to give a significant result (plt.05) showing
    possible evidence of ESP.

99
(No Transcript)
100
(No Transcript)
101
teaching methods
  • http//www.rossmanchance.com/papers/topten.html

102
old faithful
  • Example We provide students with data on times
    between eruptions (in minutes) for the Old
    Faithful geyser, originally reported in 2 and
    also presented in 4 and 12. If students
    merely calculate a confidence interval for the
    population mean intereruption time (95 c.i.
    (70.73, 73.90)) without inspecting the data
    first, they fail to notice the pronounced bimodal
    nature of the data with peaks around 55 and 78
    minutes. With this realization students are able
    to describe the inter-eruption times more
    effectively.

103
monty hall
  • Example In an activity illustrating the famous
    "Monty Hall Problem," we give three playing
    cards, two red and one black, to pairs of
    students. The cards represent prizes behind doors
    used in a game show, one a winner (black) and two
    losers (red). One student (dealer) shuffles the
    cards and holds them facing away from the other
    (contestant). The contestant chooses a card and
    the dealer reveals one of the two remaining cards
    to be red. The contestant is then asked to either
    switch to the remaining card or stay with the
    original choice, in an effort to find the
    (winning) black card. We have the students play
    this game 20 times and perform a test of
    significance to see if the proportion of wins
    using the switching strategy is different from
    1/2. Most students fail to reject this hypothesis
    (power .152). Since 1/2 agrees with many
    students intuition, they do not find this result
    surprising. However, if they continue to play the
    game or reason probabilistically, they discover
    that the actual probability of winning with the
    switch strategy is 2/3. Thus, they learn that the
    original sample size of 20 was not large enough
    to enable them to detect this difference.

104
hand span activity
105
inference as decision
  • date
  • hw 10.66, 10.67

106
gamehttp//www.stat.psu.edu/resources/InLarge/wl
h_01.htm
107
  • significance tests H0
  • 1. evidence against H0 p-value
  • 2. make a decision based on a fixed level, alpha
    if we reject H0 we accept Ha

108
Type 1 and Type 2 errors
  • Type 1 reject
  • Type 2 not reject
  • applet

109
examples
  • Type 1 In other words, it's the rate of false
    alarms or false positives.
  • Type 2 In other words, it's the rate of failed
    alarms or false negatives.

110
(No Transcript)
111
Type 1 and Type 2 errors
H0 is true
Ha is true
dec i s I o n
ACCEPT H0
REJECT H0
112
(No Transcript)
113
beans
  • H0 the sample of beans meets standards
  • Ha the sample of beans does not meet standards.
  • explain the 2 types of incorrect decisions.
  • we can accept a bad bunch
  • we can reject a good bunch
  • which is more serious?
  • which side are you on? consumer? producer?

114
example problem
115
significance and type 1 error
  • the significance level __ P (type I error)
  • to find Type II error, we need to know the value
    of a particular alternative
  • a high type II error for a particular alternative
    means the test is not sensitive enough to detect
    that alternative

116
example, continued
117
power of a test
  • The power of a test is the probability of
    choosing the alternative hypothesis when the
    alternative hypothesis is correct.
  • power 1 P (type II error)

118
factors Influencing Power
  • Alpha (a). Thus, all other things being equal,
    using an alpha of .05 will result in a more
    powerful test than using an alpha of .01.
  • Sample Size (N). The bigger the sample (i.e., the
    more work we do), the more powerful the test.
  • Variability. Generally speaking, variability in
    the sample and/or population results in a less
    powerful test.
Write a Comment
User Comments (0)
About PowerShow.com