Hypothesis Testing - PowerPoint PPT Presentation

1 / 86

About This Presentation

Title:

Hypothesis Testing

Description:

Hypothesis An educated opinion What you think will happen, based on previous research anecdotal evidence reading the literature Body fat level of 8th graders National ... – PowerPoint PPT presentation

Number of Views:594

Avg rating:3.0/5.0

Slides: 87

Provided by: CASTIllin94

Category:

more less

Transcript and Presenter's Notes

Title: Hypothesis Testing

1
Hypothesis Testing
2
Hypothesis

An educated opinion
What you think will happen, based on
previous research
anecdotal evidence
reading the literature

3
Body fat level of 8th graders

National Norm
Mean 23, SD 7
postulated parameter (? and ?)
Your 8th grade PE program (N200)

How does my program compare??
4
Your gut feeling

You expect to find, you want to find, your
instincts tell you that your students are better.

5
Your gut feeling

You expect to find, you want to find, your
instincts tell you that your students are better.

But are they??
6
Question

Is any observed difference between your sample
mean (representative of your 8th grade population
mean) and the National Norm (population of all
8th graders) attributable to random sampling
errors, or is there a real difference?

7
Question

Is any observed difference between your sample
mean (representative of your 8th grade population
mean) and the National Norm (population of all
8th graders) attributable to random sampling
errors, or is there a real difference?
Is the mean of your class REALLY the same as the
National Norm?

8
How to determine this

Research Question
is my POPULATION mean really 23
Statistical Question
? 23
set the Null Hypothesis that the mean of YOUR
group is 23 (equal to the National Norm)
assume that your group is NOT REALLY different

9
Null Hypothesis

Ho ? 23
The true difference between your sample and the
population mean is 0.
There is NO real difference between your sample
mean and the population mean.
The performance of your students is not really
different from the national norm.

10
Null Hypothesis

In inferential statistics, we usually want to
reject the Null hypothesis
to say that the differences are more than what
would be expected by random sampling error
this was our initial gut feeling
our program is better

11
3 Possible Outcomes

No difference between groups
do not reject the null hypothesis

12
3 Possible Outcomes

No difference between groups
One specific group is higher than the other
directional hypothesis
What you EXPECT to happen when planning the
experiment/measurement

13
3 Possible Outcomes

No difference between groups
One specific group is higher than the other
Either group mean is higher
non-directional hypothesis
The possible outcome of the experiment/measurement

14
Alternative Hypothesis

Our research hypothesis (what we expect to see)
HA ? ? 23
non-directional hypothesis
interested to see if my grade body composition is
better than or worse than the national norm

15
Alternative Hypothesis

Our research hypothesis (what we expect to see)
HA ? lt 23 (HA ? gt 23)
directional hypothesis
expect to see my grade mean less than (better
than) the national norm
expect to see my grade mean greater than (worse
than) that of the national norm

16
Comparing My Class to the National Norm

My 8th grade PE program (N 200)
National Norm 23
postulated parameter
At the end of the semester, calculate the mean
body fat
Using a random sample ( n 25)
mean body fat of 20

Is my sample mean different from the National
Norm?
17
Need to Test Ho

Determine whether the observed difference is
means is attributable to random sampling error
rather than a true difference between the groups
(my class and the national norm)
treatment effect

Hypothesis Testing
18
Null Hypothesis

No true difference between two means (sample mean
and national norm)
Infers my sample is drawn from the identified
population
Nothing more than random sampling errors accounts
for any observed difference between the means.

An element of uncertainty is inherent in any act
of observation (Menards Philosophy)
19
Alternative Hypothesis

A true difference does exist between two means
Infers my sample is not drawn from the
identified population
Observed difference between the means is larger
than what we are willing to attribute to random
sampling error

20
Testing Ho

Test the probability that the observed difference
between means is attributable to random sampling
error alone
Evaluate the probability that Ho is not to be
rejected
reject or do not reject Ho

What amount of risk are you willing to take?
21
Weatherman Example

85 chance of rain
put up the sunroof
5 chance of rain
it may happen, but the chance is slight
not very likely to rain
willing to risk being wrong to avoid the
inconvenience of having to put up the sunroof.

22
If we do not put up the sunroof
We reject the hypothesis that it will rain
23
If we do not put up the sunroof
We could be right or We could be wrong
24
Wait for certainty means to wait forever
25
What risk are YOU willing to take 1?? 5??
10
26
Applied Research ? 0.10 ? 0.05 ? 0.01
27
? 0.05

With these observed conditions
5 times in 100 it will rain
5 times in 100 it will rain when we have kept the
sunroof down
95 times in 100 it will not rain
95 times in 100 it will not rain when we have
kept the sunroof down

28
? 0.05

Reject Ho if the observed mean difference is
greater than what we would expect to occur by
chance (random sampling error) less than 5 times
in 100 instances
reported in research as a statistically
significant difference

29
Testing Ho at ? 0.05

If p gt 0.05 do not reject Ho
difference is attributable to random sampling
error (expected variability in mean drawn from a
population)
If p ? 0.05 reject Ho
difference is attributable to something other
than random sampling error

30
Decision Table
DECISION
31
Decision Table
DECISION
R E A L I T Y
32
Decision Table Correct
DECISION
R E A L I T Y
33
Decision Table Incorrect (RT1)
DECISION
R E A L I T Y
34
Decision Table Incorrect (AFII)
DECISION
R E A L I T Y
35
(No Transcript)
36
Belief in God as Decision Table
Ho God does not exist
DECISION
R E A L I T Y
Lived life of hope
Life no hope
Eternal life
Lost out on Eternal life
37
To this juncture

Sampling involves error
Expect differences between samples

38
To this juncture

Sampling involves error
Expect differences between samples
If we expect a difference between
treatments/conditions, BUT we also expect a
difference because of random sampling error

39
To this juncture

Sampling involves error
Expect differences between samples
If we expect a difference between
treatments/conditions, BUT we also expect a
difference because of random sampling error
HOW do we determine if difference is
statistically significant (gt than RSE)?

40
Testing Ho requires

Mean value
measure of typical performance level
Standard deviation
measure of the variability
n of cases
known to affect
variability expected with the estimate of the
population mean

41
z test for one sample

Our beginning point
National Norm BF 23 (SD 7)
Our sample performance
n 25
Mean 20
SD 6

Do my students differ from the National Norm??
42
Our hypotheses

Research Hypothesis
Do my students differ from the national norm
want to know if better OR worse
Ho
There is no real difference in the BF of my
students and the national norm
? 0.05

43
Recall

z-score of gt 1.96 or lt -1.96 occurs less than 5
of the time
see table of the Normal Curve
That is, the probability of obtaining a z-score
value this extreme purely by chance is 5 (only 5
times in 100) (explain).

44
Relevance to Hypothesis Testing

Use the same general idea to evaluate the
probability of obtaining a sample mean score of
20 with n 25 if the true population mean is
23
Recall the concept of the distribution of
sampling means

45
Recall Z score equation
X - X
Z
SD
46
Introduce Z test equation
X -?
Z
SEm
47
Standard Error of the Mean
48
Z test equation
X - ?
Mean difference
Z
SEm
49
Z test equation
X - ?
Z
SEm
Expected variability in sample means
50
Our given required data

X 20
SD 6
n 25
? 23
? 7
SEm ???
X - ? ???
Z ???

X - ?
Z
SEm
51
Our given required data

X 20
SD 6
n 25
? 23
? 7
SEm 7/5 1.4
X - ? ???
Z ???

X - ?
Z
SEm
Use the population standard deviation (SDp)
52
Our given required data

X 20
SD 6
n 25
? 23
? 7
SEm 7/5 1.4
X - ? 20 - 23 -3
z ???

X - ?
Z
SEm
53
Our given required data

X 20
SD 6
n 25
? 23
? 7
SEm 7/5 1.4
X - ? 20 - 23 -3
Z -3 / 1.4 -2.14

-3
Z
1.4
54
Decision Making

What is the probability of obtaining a Z -2.14
IF the difference is attributable only to random
sampling error?
Is the observed probability (p) LESS THAN or
EQUAL TO the ? level set?
Is p ? ? ?

55
From the tables

Z gt 1.96 or Z lt -1.96 has a 5 chance of
occurring purely by chance (explain).
Since Zobserved -2.14, our statistical
conclusion is to reject Ho
the difference of -2.14 is not likely to have
occurred by chance
The data indicate/suggest (not prove) that our
class HAS less body fat than the norm.

56
Graphically, ? 0.05
Zcritical ? 1.96
1.96
-1.96
Z observed -2.14
57
Graphically, ? 0.05
Zcritical ? 1.96
Region of Non-Rejection
1.96
-1.96
Z observed -2.14
58
Graphically, ? 0.05
Zcritical ? 1.96
Region of Rejection
Region of Rejection
1.96
-1.96
Z observed -2.14
59
Graphically, ? 0.05
Zcritical ? 1.96
Region of Rejection
Region of Rejection
Region of Non-Rejection
1.96
-1.96
Z observed -2.14
60
Reporting the Results? 0.05
The observed mean of our treatment group of 25
students was 20 (? 6) body fat. The z-test for
one sample indicates that the difference between
the observed mean of 20 and the National Norm of
23 was statistically significant (Zobs -2.14,
p ? 0.05). These data suggest that our measured
percent body fat was less than the national norm.

61
Reporting the Results? 0.01
The observed mean of our treatment group was 20
(? 6) body fat. The z-test for one
sample indicates that the difference between the
observed mean of 20 and the National Norm of 23
was not statistically significant (Zobs -2.14,
p gt 0.01). Our measured percent body fat was not
significantly different from the national norm.
62
Reporting the Results, youset ? 0.01
The observed mean of our treatment group was 20
(? 6) body fat. With ? 0.01, the z-test
for one sample indicates that the difference
between the observed mean of 20 and the
National Norm of 23 was not statistically
significant (Zobs -2.14, p 0.028). Our
measured percent body fat was not significantly
different from the national norm.
63
Consider all possible reasonsfor your outcome
64
Statistics humour
What does a statistician call it when the heads
of 10 rats are cut off and 1 survives?
65
Statistics humour
What does a statistician call it when the heads
of 10 rats are cut off and 1 survives? Non-signi
ficant.
66
Do not reject H0 vs Accept H0
Accept infers that we are sure Ho is valid
67
Do not reject H0 vs Accept H0
Accept infers that we are sure Ho is valid Do
not reject reflects that this time we are unable
to say with a high enough degree of confidence
that the difference observed is attributable to
other than sampling error.
68
Examples

Zobs -3.45
? 0.05
Decision (statistical conclusion) ???

69
Examples

Zobs 1.45
? 0.01
Decision (statistical conclusion) ???

70
Examples

Zobs 1.96
? 0.05
Decision (statistical conclusion) ???

71
Examples

Zobs -1.96
? 0.01
Decision (statistical conclusion) ???

72
Examples

Zobs 1.96
? 0.01
Decision (statistical conclusion) ???

73
Examples

Zobs -1.95
? 0.05
Decision (statistical conclusion) ???

74
Z-test vs t-test

SPSS does not provide the z-test
Can only use z-test if you know population SD
Typically, all population parameter values are
estimated from sample statistics
Mean
Standard deviation
Standard error
SPSS uses t-test
Same concept, different assumptions
t-test more robust against departures from
normality (doesnt affect the accuracy of the
p-estimate as much)

75
When population mean is not knownchanging
distributions

The Z-test uses one sample statistic to estimate
population parameters
sample mean ? population mean
Population standard deviation is known
The t-test uses two sample statistics to estimate
population parameters
sample mean ? population mean
sample standard error? population SD

76
t-test equation

So the test statistic now becomes

77
Estimated population SD

To estimate pop SD from sample SD, the sample SD
is inflated a little

You may have noticed this modification earlier
78
SEm from estimated SD population

To estimate standard error from sample SD, use
the estimated SD again, thus

79
Recall factors affecting Sx

Size of estimated SE obviously depends on both SD
of sample, and sample size

80
When population mean is not knownchanging
distributions

The distribution used to evaluate calculated
ratio switches from the normal distribution to
the t-distribution
Sampling variation in Z-distribution reflected
variability with respect to sample mean
BUT sampling variation in t-distribution reflects
variability with respect to sample mean and
standard error of the mean
Soas the sample gets smaller (and the standard
error of the mean increases) the sampling
distribution of t differs from that of Z
The good old 1.96 for 95 is toast

81
Concept of Degrees of Freedom (df)

The number of independent pieces of information a
sample of observations can provide for purposes
of statistical inference
E.g. 3 numbers in a sample 2, 2, 5
Sample mean 3 deviations are 1, -1, 2
Are these independent?
No when you know two, youll know the other
because
For any sample of size n you have n-1 values
that are free to vary the last value is fixed

82
Sampling distribution of t
Large n ? t-dist pretty much like the
z-dist (because sample SD is a good estimate of
pop SD, sample SE is a good estimate of pop SE)
83
Sampling distribution of t

Because distribution gets flatter as n gets
smaller, this implies t for significance gets
bigger as n gets smaller
http//duke.usask.ca/rbaker/Tables.html

84
Work an example with SPSS