Statistical Inference - PowerPoint PPT Presentation

1 / 34
About This Presentation
Title:

Statistical Inference

Description:

Statistical Inference – PowerPoint PPT presentation

Number of Views:41
Avg rating:3.0/5.0
Slides: 35
Provided by: university57
Category:

less

Transcript and Presenter's Notes

Title: Statistical Inference


1

Biostatistics Academic Preview Session 3
Statistical Inference
2
Outline
  • The what and why of statistical inference
  • Statistical estimation and confidence intervals
  • Statistical significance tests

3
Statistical Inference
4
Statistical Inference
  • Estimation the process by which sample data are
    used to indicate the value of an unknown quantity
    in the population. Results can be expressed as
  • Point estimate
  • Confidence intervals
  • Significance tests
  • P-values

5
Statistical Estimation
6
Confidence intervals (cont)
  • A range of values in which a population parameter
    may lie is a confidence interval.
  • The probability that a particular value lies
    within this interval is called a level of
    confidence.

7
90 and 95 confidence intervals for ?
8
Confidence intervals (cont)
  • Formula
  • Estimate margin of error
  • Steps
  • Calculate the sample statistic to use as an
    estimate of the population parameter
  • Calculate the margin of error based on the
    distribution of sample statistic
  • Calculate the lower (LL) and the upper limits
    (UL) of the confidence interval
  • Write out the confidence interval

LL lt population parameter lt UL
LL, UL
9
Understanding Confidence Intervals
  • A confidence interval for a population parameter
    is a set of plausible values of the parameter
    that could have generated the observed data as a
    likely outcome
  • The level of confidence tells the probability the
    method produced an interval that includes the
    unknown parameter
  • Narrow widths and high confidence levels are
    desirable, but these two things affect each other

Higher confidencewider confidence limits (all
else held constant)
10
Case Study 1
  • Construct a 95 confidence interval for the mean
    birth weight in grams
  • Construct a 95 confidence interval for the
    percent of women who deliver low birth weight
    babies

11
Case Study 1 Example 1
  • 95 Confidence interval for the mean birth weight
    in grams
  • Calculate
  • Calculate the margin of error m104.6
  • Calculate the lower and upper limit of the
    confidence interval
  • LL2944.7-104.62840.1
  • UL29447104.63049.3
  • Confidence interval
  • 2840.1, 3049.3
  • 2840.1 lt µ lt 3049.3

I am 95 confident that the mean birth weight for
omen in Springfield Massachusetts in 1986 is
between 2840.1 and 3049.3
12
Case Study 1 Example 2
  • 95 confidence interval for the percent of women
    who deliver low birth weight babies
  • Calculate
  • Calculate the margin of error m0.066
  • Calculate the lower and upper limit of the
    confidence interval
  • LL0.312-0.0660.246
  • UL0.3120.0660.378
  • Confidence interval 0.246, 0.378

I am 95 confident that the proportion of women
giving birth to low birth weight babies in
Springfield Massachusetts in 1986 is between
0.246 and 0.378
13
Case study 1
  • In order to determine if
  • There is a difference in the mean birth weight
    between smokers and non-smokers
  • There is a difference in the proportion of low
    birth weight babies between white women and black
    women
  • We need to calculate a confidence interval around
    the difference in the estimates between the two
    groups in the aforementioned examples

14
Confidence intervals (cont)
  • Formula
  • Estimate (group1) estimate (group 2) margin
    of error
  • Steps
  • Calculate the sample statistics to use as an
    estimate of the population parameter
  • Calculate the margin of error based on the
    distributions of the sample statistic
  • Calculate the lower (LL) and the upper limits
    (UL) of the confidence interval
  • Write out the confidence interval
  • LL lt population parameter lt UL or LL, UL

15
Case Study 1 Example 3
  • 95 confidence interval for the difference in the
    percent of women who deliver low birth weight
    babies between whites and blacks
  • Calculate
  • Calculate the margin of error m.2085
  • Calculate the lower and upper limit of the
    confidence interval
  • LL-.1835-.2085-.392
  • UL-.1835.2085.025
  • Confidence interval -.392, .025

I am 95 confident that the difference in the
proportion of white women and black women giving
birth to low birth weight babies in Springfield
Massachusetts in 1986 is between -.392 and .025
16
Significance /Hypothesis Testing
  • Statistical inference that allows one to test a
    claim about a population parameter.
  • Using information from your study sample you can
    test any desired claim

Next set of slides drafted from the following
reference Elementary Statistics by Larson and
Farber, 2nd edition
17
A statistical hypothesis is a claim about a
population.
Alternative hypothesis Ha contains a statement
of inequality such as lt , ¹ or gt
Null hypothesis H0 contains a statement of
equality such as ³ , or .
If I am false, you are true
If I am false, you are true
18
Writing Hypotheses
Write the claim about the population. Then, write
its complement. Either hypothesis, the null or
the alternative, can represent the claim.
  • A hospital claims its ambulance response time is
    less than 10 minutes.

H0
Ha
  • A consumer magazine claims the proportion of
    cell phone calls made during evenings and
    weekends is at most 60.

H0
Ha
19
Hypothesis Test Strategy
  • Begin by assuming the equality condition in the
    null hypothesis is true. This is regardless of
    whether the claim is represented by the null
    hypothesis or by the alternative hypothesis,
  • Collect data from a random sample taken from the
    population and calculate the necessary sample
    statistics.
  • If the sample statistic has a low probability of
    being drawn
  • from a population in which the null hypothesis is
    true, you will
  • reject H0 . (As a consequence, you will support
    the alternative
  • hypothesis.)
  • If the probability is not low enough, fail to
    reject H0 .

20
Underlying Rationale of Hypotheses Testing
  • If, under a given observed assumption, the
    probability of getting the sample is
    exceptionally small, we conclude that the
    assumption is probably not correct.
  • When testing a claim, we make an assumption
    (null hypothesis) that contains equality. We then
    compare the assumption and the sample results and
    we form one of the following conclusions

21
Underlying Rationale of Hypotheses Testing
  • If the sample results can easily occur when the
    assumption (null hypothesis) is true, we
    attribute the relatively small discrepancy
    between the assumption and the sample results to
    chance.
  • If the sample results cannot easily occur when
    that assumption (null hypothesis) is true, we
    explain the relatively large discrepancy between
    the assumption and the sample by concluding that
    the assumption is not true.

22
Errors and Level of Significance
Actual Truth of H0
H0 True
H0 False
Do not reject H0
Type II Error
Decision
Type I Error
Reject H0
A type I error Null hypothesis is actually true
but the decision is to reject it.
Level of significance, a Maximum probability of
committing a type I error.
23
Types of Hypothesis Tests
Right-tail test Ha m gt value
Left-tail test Ha m lt value
Two-tail test Ha m ¹ value
Hypothesis tests can also be one or two sampled
24
P-Values
The P-value is the probability of obtaining a
sample statistic with a value as extreme or more
extreme than the one determined by the sample
data.
P-value indicated area
If z is negative, twice the area in the left tail
If z is positive twice the area in the right tail
25
Test Decisions with P-values
The decision about whether there is enough
evidence to reject the null hypothesis can be
made by comparing the P-value to the value of
a, the level of significance of the test.
If P ? a reject the null hypothesis
If P gt a fail to reject the null hypothesis
26
Understanding p-values
  • A p-value tells you the chance of getting a
    statistics as extreme or more extreme than the
    one calculated for the sample
  • measures the strength of evidence against the
    null hypothesis
  • The smaller the p-value, the more convincing the
    evidence is against the null hypothesis
  • A p-value is not
  • the probability that the null hypothesis is
    true/false
  • The effect size
  • The significance of results

27
Interpreting the Decision
Claim
Claim is H0
Claim is Ha
There is enough evidence to reject the claim.
There is enough evidence to support the claim.
Reject H0
Decision
There is not enough evidence to reject the claim.
There is not enough evidence to support the claim.
Fail to reject H0
28
Steps in a Hypothesis Test
1. Write the null and alternative hypothesis
Write H0 and Ha as mathematical statements.
Remember H0 always contains the symbol.
2. State the level of significance
This is the maximum probability of rejecting the
null hypothesis when it is actually true. (Making
a type I error.)
3. Identify the sampling distribution
The sampling distribution is the distribution for
the test statistic assuming that the equality
condition in H0 is true and that the experiment
is repeated an infinite number of times.
29
4. Find the test statistic
Perform the calculations to standardize your
sample statistic.
5. Calculate the P-value for the test statistic
This is the probability of obtaining your test
statistic or one that is more extreme from the
sampling distribution.
30
6. Make your decision
If the P-value is less than ? (the level of
significance) reject H0. If the P value is
greater ?, fail to reject H0.
7. Interpret your decision
  • If the claim is the null hypothesis you will
    either reject
  • the claim or determine there is not enough
    evidence to
  • reject the claim.
  • If the claim is the alternative hypothesis you
    will either support the claim or determine there
    is not enough
  • evidence to support the claim.

31
Case study 1
  • Perform a hypothesis test to determine if the
    proportion of women giving birth to low birth
    weight babies is less than .50.
  • Perform a hypothesis test to determine if there
    is a difference in the mean birth weight between
    smokers and non-smokers

32
Case Study 1 Example 4
  • Step 1 Write out the null and alternative
    hypothesis
  • Ho p.05
  • Ha plt.05
  • Step 2 level of significance a.05
  • Step 3 skip (normal distribution)
  • Step 4 test statistic z.5.16
  • Step 5 p-valuelt.0001
  • Step 6 Decision-we reject Ho because
    p-valueltalpha
  • Step 7 Conclusion There is strong evidence to
    suggest that the percent of mothers giving birth
    to low birth weight babies in Springfield
    Massachusetts in 1986 is less than .50

33
Case Study 1 Example 5
  • Step 1 Write out the null and alternative
    hypothesis
  • Ho µsmokers- µnon-smokers0
  • Ha µsmokers- µnon-smokers ?0
  • Step 2 level of significance a.05
  • Step 3 skip (t-distribution)
  • Step 4 test statistic t-2.634
  • Step 5 p-value.0092
  • Step 6 Decision-we reject Ho because
    p-valueltalpha
  • Step 7 Conclusion There is strong evidence to
    suggest that there is a difference in the mean
    birth weight between smokers and non-smokers for
    women in Springfield Massachusetts in 1986.

34
Next session
  • Results of our in class survey
Write a Comment
User Comments (0)
About PowerShow.com