Title: STATISTICAL INFERENCE:
1STATISTICAL INFERENCE Making decisions or
drawing conclusions about a population.
Suppose we have collected a representative sample
that gives some information concerning a mean or
other statistical quantity.
2One question on the basis of this information
would be are the sample quantity and a
corresponding population quantity close enough
together so that it is reasonable to say that the
sample might have come from the population? Or
are they far enough apart so that they likely
represent different populations?
3Statistical inference may be divided into two
major areas parameter estimation hypothesis
testing
4Test of Hypothesis We are testing the hypothesis
that a sample is similar enough to a particular
population so that it might have come from that
population. If the hypothesis is true, we say
that the sample is consistent with the population
and all disagreement between the sample and the
population is due to random variation.
5Suppose that we are interested in the burning
rate of a solid fuel. Burning rate is a random
variable that can be described by a probability
distribution. We are interested in the mean
burning rate. To be specific, we are interested
in deciding whether or not the mean burning rate
is 70 cm/s. We may express this formally as
6(No Transcript)
7IMPORTANT
Hypotheses are always statements about the
population or distribution under study, not
statements about the sample.
8The value of the population parameter specified
in the null hypothesis (mean of 70 cm/s in our
example) is usually determined in one of the
three ways.
- Past experience or knowledge of the process
2. Theory or model regarding the process
- Engineering specifications or contractual
obligations
9Objective of Hypothesis Testing
(Parameter from Past experience or knowledge of
the process) To determine whether the parameter
value has changed
10Objective of Hypothesis Testing
(Parameter from Theory or model regarding the
process) To verify the theory or model
11Objective of Hypothesis Testing
(Parameter from Engineering specifications or
contractual obligations) To test conformance or
compliance
12A procedure leading to a decision about a
particular hypothesis is called a test of a
hypothesis. Hypothesis testing procedures rely
on using the information in a random sample from
the population of interest. If this information
is consistent with the hypothesis, then we will
conclude that the hypothesis is true. However,
if this information is inconsistent with the
hypothesis, we will conclude that the hypothesis
is false.
13The truth or falsity of a particular hypothesis
can never be known with certainty unless we can
examine the entire population. This is usually
impossible in most practical situations.
Therefore, a hypothesis testing procedure should
be developed with the probability of reaching a
wrong conclusion in mind.
14Test of Hypothesis
Consider the burning rate of the solid fuel. The
null hypothesis is that the mean burning rate is
70 cm/s, and the alternative hypothesis is that
it is not equal to 70 cm/s. We wish to test the
following
15The sample mean is an estimate of the true
population mean m.
16On the other hand, a sample mean that is
considerably different from 70 cm/s is evidence
in support of the alternative hypothesis H1.
17The sample mean can take on many different
values. Suppose that if
we will not reject the null hypothesis
18But if
or
we will reject the null hypothesis in favour of
the alternative hypothesis
19Decision criteria
20The boundaries between the critical regions and
acceptance region are called the critical values.
In our example the critical values are 68.5 and
71.5.
21The decision procedure can lead to either of two
wrong conclusions.
The true mean burning rate of the solid fuel
could be equal to 70 cm/s. However, the value of
the sample mean observed from the randomly
selected specimens may fall into the critical
region. We would then reject the null hypothesis
H0 in favour of the alternative H1 when in fact
H0 is really true. This type of wrong conclusion
is called a type I error.
22The true mean burning rate of the solid fuel
could be different from 70 cm/s. However, the
value of the sample mean observed from the
randomly selected specimens may fall into the
acceptance region. We would then fail to reject
the null hypothesis H0 when in fact H0 is false.
This type of wrong conclusion is called a type II
error.
23Definitions
Rejecting the null hypothesis H0 when it is true
is defined as a type I error.
Failing to reject the null hypothesis H0 when it
is false is defined as a type II error.
The probability of making a type I error is
called the significance level or size of the test.
24Example Consider the burning rate of the solid
fuel. The true mean burning rate is 70 cm/s and
the standard deviation of the burning rate is s
2.42 cm/s.
25What percentage of all random samples would lead
to the rejection of the null hypothesis when the
true mean burning rate is really 70 cm/s?
26Critical Region
27Standard deviation of the sample mean
a P(type I error)
28Critical Region
29The z-values that correspond to the critical
values 68.5 and 71.5 are
30Therefore,
5 of all samples would lead to the rejection of
the null hypothesis.
31CONFIDENCE INTERVAL
32If the population mean is m and the normal
distribution applies, the probability that a
random sample mean will fall by chance in the
region between z1 -1.96 and z2 1.96 is 95.
This is called 95 confidence interval.
33(No Transcript)
34(No Transcript)
35When a manufacturing process is operating
properly, the mean length of a certain part is
known to be 6.175 inches, and lengths are
normally distributed. The standard deviation of
this length is 0.008 inches. If a sample
consisting of 6 items taken from current
population has a mean length of 6.168 inches, is
there evidence at the 5 level of significance
that some adjustment of the process is required?
36(No Transcript)
37At the 5 level of significance, the critical
regions are
and
Standard deviation of the sample mean
38At the 5 level of significance z -2.12 falls
in the critical (rejection) region. Therefore,
some adjustment of the process is required.
39Carbon composition resistors with a mean
resistance of 560 W and a standard deviation of
51 W are produced by a factory. They are sampled
each hour in the quality control laboratory.
What sample size would be required so that there
is 95 probability (confidence) that the mean
resistance of the sample lies within 10 W of 560
W if the population mean has not changed.
40For a 95 confidence interval, a random sample
mean will fall in the region between z1 -1.96
and z2 1.96 if the manufacturing process has
not changed.
41(No Transcript)
42A sample size of 100 would be required.