Biostatistics - PowerPoint PPT Presentation

1 / 52

About This Presentation

Title:

Biostatistics

Description:

Biostatistics Unit 5 Samples The z-score The z-score for the sample proportion is Example In the mid seventies, according to a report by the National Center for ... – PowerPoint PPT presentation

Number of Views:110

Avg rating:3.0/5.0

Slides: 53

Provided by: Osb50

Category:

Tags: biostatistics

more less

Transcript and Presenter's Notes

Title: Biostatistics

1
Biostatistics

Unit 5 Samples

2
Sampling distributions

Sampling distributions are important in the
understanding of statistical inference.
Probability distributions permit us to answer
questions about sampling and they provide the
foundation for statistical inference procedures.

3
Definition

The sampling distribution of a statistic is the
distribution of all possible values of the
statistic, computed from samples of the same size
randomly drawn from the same population. When
sampling a discrete, finite population, a
sampling distribution can be constructed. Note
that this construction is difficult with a large
population and impossible with an infinite
population.

4
Construction of sampling distributions

1. From a population of size N, randomly draw
all possible samples of size n. 2. Compute the
statistic of interest for each sample.3. Create
a frequency distribution of the statistic.

5
Properties of sampling distributions

We are interested in the mean, standard deviation
and appearance of the graph (functional form) of
a sampling distribution.

6
Types of sampling distributions

We will study the following types of sampling
distributions.A) Distribution of the sample mean
B) Distribution of the difference between two
means
C) Distribution of the sample proportion D)
Distribution of the difference between two
proportions

7
Sampling distribution of

Given a finite population with mean (m) and
variance (s2). When sampling from a normally
distributed population, it can be shown that the
distribution of the sample mean will have the
following properties.

8
Properties of the sampling distribution

1. The distribution of will be normal2.
The mean , of the distribution of the
values of
, will be the same as the mean of the
population from which the samples were drawn
m.3. The variance, , of the
distribution of
, will be equal to the variance of the
population divided by the sample size
.

9
Standard error

The square root of the variance of the sampling
distribution is called the standard error of the
mean or the standard error.

10
Nonnormally distributed populations

When the sampling is done from a nonnormally
distributed population, the central limit theorem
is used.

11
The central limit theorem

Given a population of any nonnormal functional
form with mean (m) and variance (s2) , the
sampling distribution of , computed from
samples of size n from this population will have
mean, m, and variance, s2/n, and will be
approximately normally distributed when the
sample is large (30 or higher).

12
The central limit theorem

Note that the standard deviation of the sampling
distribution is used in calculations of z scores
and is equal to

13
Example

Given the information below, what is the
probability that x is greater than 53?
(1) Write the given information m 50
s 16 n 64 x 53

14
Example

(2) Sketch a normal curve

15
Example

(3) Convert x to a z score

16
Example

(4) Find the appropriate value(s) in the
table A value of z 1.5 gives an area of
.9332. This is subtracted from 1 to give the
probability P (z gt 1.5) .0668

17
Example

(5) Complete the answerThe probability that x
is greater than 53 is .0668.

18
Distribution of the difference between two means

It often becomes important to compare two
population means. Knowledge of the sampling
distribution of the difference between two means
is useful in studies of this type. It is
generally assumed that the two populations are
normally distributed.

19
Sampling distribution of

Plotting sample differences against frequency
gives a normal distribution with mean equal to
which is the difference between the two
population means.

20
Variance

The variance of the distribution of the sample
differences is equal to
Therefore, the standard error of the differences
between two means would be equal to

21
Converting to a z score

To convert to the standard normal distribution,
we use the formula
We find the z score by assuming that there is
no difference between the population means.

22
Sampling from normal populations

This procedure is valid even when Sampling from
normal populations the population variances are
different or when the sample sizes are
different. Given two normally distributed
populations with means, and , and
variances, and , respectively.
(continued)

23
Sampling from normal populations

The sampling distribution of the difference,
, between the means of independent samples
of size n1 and n2 drawn from these populations is
normally distributed with mean, ,
and
variance,

24
Example

In a study of annual family expenditures for
general health care, two populations were
surveyed with the following resultsPopulation
1 n1 40, 346
Population 2 n2 35, 300

25
Example

If the variances of the populations are
2800 and 3250, what is the
probability of obtaining sample results
as large as those shown if there is no
difference in the means of the two populations?

26
Solution
(1) Write the given informationn1 40,
346, 2800 n2 35,
300, 3250
27
Solution

(2) Sketch a normal curve

28
Solution

(3) Find the z score

29
Solution

(4) Find the appropriate value(s) in the
table A value of z 3.6 gives an area of
.9998. This is subtracted from 1 to give the
probability P (z gt 3.6) .0002

30
Solution

(5) Complete the answer The probability
that is as large as given is
.0002.

31
Distribution of the sample proportion ( )

While statistics such as the sample mean are
derived from measured variables, the sample
proportion is derived from counts or frequency
data.

32
Properties of the sample proportion

Construction of the sampling distribution of the
sample proportion is done in a manner similar to
that of the mean and the difference between two
means. When the sample size is large, the
distribution of the sample proportion is
approximately normally distributed because of the
central limit theorem.

33
Mean and variance

The mean of the distribution, , will be
equal to the true population proportion, p, and
the variance of the distribution, , will be
equal to p(1-p)/n.

34
The z-score

The z-score for the sample proportion is

35
Example

In the mid seventies, according to a report by
the National Center for Health Statistics, 19.4
percent of the adult U.S. male population was
obese. What is the probability that in a simple
random sample of size 150 from this population
fewer than 15 percent will be obese?

36
Solution

(1) Write the given information n 150
p .194 Find P( lt .15)

37
Solution

(2) Sketch a normal curve

38
Solution

(3) Find the z score

39
Solution

(4) Find the appropriate value(s) in the tableA
value of z -1.36 gives an area of .0869 which
is the probability P (z lt -1.36) .0869

40
Solution

(5) Complete the answer
The probability that lt .15 is .0869.

41
Distribution of the difference between two
proportions

This is for situations with two population
proportions. We assess the probability
associated with a difference in proportions
computed from samples drawn from each of these
populations. The appropriate distribution is the
distribution of the difference between two sample
proportions.

42
Sampling distribution of

The sampling distribution of the difference
between two sample proportions is constructed in
a manner similar to the difference between two
means.
(continued)

43
Sampling distribution of

Independent random samples of size n1 and n2 are
drawn from two populations of dichotomous
variables where the proportions of observations
with the character of interest in the two
populations are p1 and p2 , respectively.

44
Mean and variance

The distribution of the difference between two
sample proportions, , is
approximately normal. The mean is
The variance is
These are true when n1 and n2 are large.

45
The z score

The z score for the difference between two
proportions is given by the formula

46
Example

In a certain area of a large city it is
hypothesized that 40 percent of the houses are in
a dilapidated condition. A random sample of 75
houses from this section and 90 houses from
another section yielded difference, ,
of .09. If there is no difference between the
two areas in the proportion of dilapidated
houses, what is the probability of observing a
difference this large or larger?

47
Solution

(1) Write the given information n1 75, p1
.40
n2 90, p2 .40
.09Find P(
.09)

48
Solution

(2) Sketch a normal curve

49
Solution

(3) Find the z score

50
Solution

(4) Find the appropriate value(s) in the
table A value of z 1.17 gives an area of
.8790 which is subtracted from 1 to give the
probability P (z gt 1.17) .121

51
Solution