STAT 3120 Statistical Methods I - PowerPoint PPT Presentation

1 / 24
About This Presentation
Title:

STAT 3120 Statistical Methods I

Description:

A corporation manages a fleet of company cars. A random sample of 40 cars is examined. ... How does this compare to a Critical Z of 1.96? What is your decision? ... – PowerPoint PPT presentation

Number of Views:36
Avg rating:3.0/5.0
Slides: 25
Provided by: mauricelen
Category:

less

Transcript and Presenter's Notes

Title: STAT 3120 Statistical Methods I


1
STAT 3120Statistical Methods I
  • Lecture 3
  • Hypothesis Testing

2
STAT3120 Hypothesis Testing
  • Some Limitations of Confidence Interval Analysis
  • Descriptive, Not Decision-Oriented
  • Provides no analysis of potential errors
    (Impact/likelihood)
  • Hypothesis Testing is Decision Oriented
  • Is a Population Parameter less than, Equal to, or
    Greater than a specific Value (With Decision
    Making Implications)
  • Guides user to a particular choice
  • Provides User with quantitative information
    regarding probabilities of different outcomes
  • Highlights that Two Different Decision Making
    Errors Possible
  • TYPE I or ?-Error
  • TYPE II or ?-Error
  • ?-Level of significance predefines standard of
    proof (risk)
  • 1- ? indicating the power of the test (the
    probability of finding a significant effect)
  • p-Value (Prob-Value) Aids in Interpreting Results
  • Strength of the Evidence

3
STAT3120 Hypothesis Testing
  • The first step in Hypothesis Testing is to
    develop a claim to be tested. For example
  • Drug A will lower an individuals cholesterol
    level by a minimum of 10 points.
  • ltDrug A will not lower an individuals
    cholesterol level by a minimum of 10 pointsgt
  • Trashbag A has greater tensile strength than
    Trashbag B.
  • lt Trashbag A does not have greater tensile
    strength than Trashbag Bgt
  • Automobile A will average at least 35 miles to
    the gallon.
  • lt Automobile A will not average at least 35 miles
    to the gallongt
  • Note that every possible claim has an opposite
    statement the two statements together must be
    mutually exclusive and collectively exhaustive.

4
STAT3120 Hypothesis Testing
  • When conducting Hypothesis Testing, the process
    of testing includes five distinct parts
  • Statement of the Claim This is referred to as
    the Alternative Hypothesis and is designated as
    H1 or Ha
  • Statement of the Opposite of the Claim This
    is referred to as the Null Hypothesis and is
    designated as H0
  • Calculation of the appropriate Test Statistic
  • Identification of the Rejection Region (H0)
  • Assess Results and Draw Conclusions

5
STAT3120 Hypothesis Testing
  • Develop the appropriate Hypothesis Statements to
    test the claims
  • The Coca Cola Marketing Department wants to run a
    TV ad Diet Coke tastes better than Diet
    Pepsi. Assume some kind of taste scale of 1-10
    (10 is the best).
  • A Pharmaceutical company wants to claim in their
    marketing materials that a particular drug will
    lower cholesterol by at least 30.
  • A manufacturer of PVC pipes wants to become a
    supplier to a large civil engineering firm. The
    manufacturer claims that they can manufacture
    pipes to within 1mm of the engineering firms
    specifications.
  • The Milemaster Tire company has a new tire that
    they claim will go 100,000 before the treads wear
    out.

6
STAT3120 Hypothesis Testing
  • When conducting a hypothesis test, you should
    always develop the 2x2 matrix below which
    compares the statistically-supported decision to
    the true state of nature

True State of nature Decision Ho is true Ho is false
Reject Ho
Do not reject Ho
7
STAT3120 Hypothesis Testing
  • Descriptions of errors
  • Type I Error Reject Null Hypothesis When Null is
    actually true (?-Error)
  • Type II Error Accept Null Hypothesis which is
    false (?-Error)
  • Significance level, ? , is Maximum Risk of making
    Type I error that we are prepared to Live With.
    In other words, Probability of Type I Error ?
    (usually set at .05 or Less)
  • A Type II Error ? (not typically controlled)
  • Type I Error and Type II error cannot be
    controlled simultaneously

Type of Error DECISION CONSEQUENCES/COSTS
TYPE I Reject Null Null is true Market tire, but should not have done so may cause customer dissatisfaction, loss, claims for refund
TYPE II Accept Null Null is false Fail to market a good product opportunity cost
8
STAT3120 Hypothesis Testing
Tire is no good Ho is true Tire meets expectations Ho is false
Market Tire Reject Ho
Dont Market Tire Do not reject Ho
customer dissatisfaction, lose market share Type
1 Error - ?
Gain market share Valid Decision
Fail to capitalize on good product, opportunity
cost Type II Error - ?
No change Valid Decision
The calculated probability of this outcome is
1- ?. This is known as the Power of the test.
9
STAT3120 Hypothesis Testing
  • Descriptions of each outcome in English

Ho is true Ho is false
Reject Ho Typically the Worst Possible Mistakethis decision asserts that an effect is present when it is not a false positive. The Power of the test. There was an effect present and it was detected.
Do not reject Ho A Push. There was no effect present and this was correctly determined. Lost opportunity. There was an effect present and it was not detected a false negative
In a medical context, the False Negative is
often considered to be the worse mistake.
10
STAT3120 Hypothesis Testing
  • After the Hypothesis Statements have been
    developed, and the Type I and Type II errors have
    been evaluated, we establish the alpha level
    the highest probability we are willing to assume
    of committing a Type I error. This alpha value
    corresponds to a Critical or cut off Z-score
  • One tailed tests
  • alpha/Z-score
  • .01/2.33
  • .05/1.645
  • .10/1.28
  • Two tailed tests
  • alpha/Z-score
  • .01/2.575
  • .05/1.96
  • .10/1.645

11
STAT3120 Hypothesis Testing
  • At this point, we perform the calculation of the
    ACTUAL Z-score and compare this to the CRITICAL
    Z-score

(Example from Book (pg 211)) A corporation
manages a fleet of company cars. A random sample
of 40 cars is examined. The mean and std for the
sample are 2,752 and 350 miles, respectively.
Records for previous years indicated that the
average miles driven was 2,600. Use the sample
data to test the claim that the current mean is
different from the previous mean. Use alpha
.05. Z (2752-2600)/(350/SQRT(40))
2.75 How does this compare to a Critical Z of
1.96? What is your decision? What is the
implication if you are wrong?
12
STAT3120 Hypothesis Testing
The Z-scores can be compared directly. However,
we typically translate these Z-scores into
probabilities
Sample Conditions p-Value Significance level
Z-Score Associated with the Actual or Calculated Z-score Associated with the Critical Z-score
Range 0 lt p lt 1 0 lt ? lt 1
Definition Actual Probability of Making a Type I Error Maximum Probability of Making a Type I Error
How Determined and When Known From Sample Data After Analysis Set Before Analysis
13
STAT3120 Hypothesis Testing
Decision Rule If the p-value is less than the
established alpha value, REJECT the null
hypothesis and proceed with the claim. The
potential error is a Type I. If the p-value is
greater than the established alpha value, we FAIL
TO REJECT the null hypothesis and maintain the
status quo. The potential error is a Type II.

14
STAT3120 Hypothesis Testing
Fun and Exciting SAS and SPSS exercises for all
to enjoy!
15
STAT3120 Hypothesis Testing
A second calculation that is conducted when
conducting Hypothesis Testing is the evaluation
of the Power of the test. Recall that Power is
the probability of correctly rejecting the null
hypothesis when it should be rejectedin other
wordsthe probability of detecting a true effect
(1-Type II error). Question Why not set both
alpha and power at acceptable levels?
16
STAT3120 Hypothesis Testing
Answer Because the Type I and Type II errors are
inversely relatedas the probability of a Type I
error becomes more restricted, the probability of
a Type II error increases. Since Power is
1-probability of a Type II, increasing the Type
II error, decreases the Power of the test Type
II and Power ARE directly related. Statistical
Power is the probability of rejecting the null
hypothesis when it should be rejected. In other
words, it is the probability that if a true
difference exists, it will be discovered.
Statistical Power is heavily used in medicine,
clinical psychology and biology. Typically, a
test must have a Statistical Power of 80 or
greater to be considered valid.
17
STAT3120 Hypothesis Testing
  • Power is a function of three factors
  • Effect size i.e., the difference between the
    two groups or measurements. As the effect size
    goes up, the power increases.
  • Alpha As the chance of finding an incorrect
    significant effect is reduced (Type I error), the
    probability of correctly finding an effect is
    also reduced. Typically, alpha is set to be .01
    (most conservative and lowers power), .05 or .10
    (most risk tolerant and increases power).
  • Sample Size - Increased sample sizes will
    always produce greater power. But, increasing
    the sample size can also produce too much power
    smaller and smaller effects will be found to be
    significant until at a large enough sample size,
    any effect is considered to be significant.

18
STAT3120 Hypothesis Testing
When conducting a one tailed test, the power
calculation is executed as 1-?(?) 1-Pzltz?-
( ?0 - ?a/ ?) A two tailed test is conducted
similarly, with the alpha value associated with
the z score divided by two.
19
STAT3120 Hypothesis Testing
Fun In class examples 5.8, 5.9, 5.10
20
STAT3120 Hypothesis Testing
A Power Curve, is a plot of differing values of ?
versus the calculated Power. The slope of the
curve will become more steep as the sample size
increasesexamine the curves on pgs 218/219.
21
STAT3120 Hypothesis Testing
If you are starting an experiment from scratch,
how do you determine the appropriate sample
size? n ?2(Z ?Z ?)2/?2 A two tailed test
is conducted similarly, with the alpha value
associated with the z score divided by two. See
Example 5.11
22
STAT3120 Hypothesis Testing
Secret weapon for all of that computational
_at_ proc power onesamplemeans testt
mean 7 stddev 3
ntotal 50 power . run
23
STAT3120 Hypothesis Testing
Secret weapon for all of that computational
_at_ proc power twosamplemeans
testdiff meandiff 7 stddev
12 npergroup 50 power
. run
24
STAT3120 Hypothesis Testing
Secret weapon for all of that computational
_at_
proc power pairedmeans testdiff
pairedmeans 8 15 corr 0.4
pairedstddevs (7 12) npairs .
power 0.9 run
Write a Comment
User Comments (0)
About PowerShow.com