Title: Samples and populations
1Samples and populations
- Estimating with uncertainty
2Review - order of operations
3Review - order of operations
- Parentheses
- Exponents and roots
- Multiply and divide
- Add and subtract
4Review - order of operations
5Review - order of operations
6Review - types of variables
- Categorical variables
- For example, country of birth
- Numerical variables
- For example, student height
7Review - types of variables
- Categorical variables
- Numerical variables
Discrete
Continuous
8Review - types of variables
Nominal
- Categorical variables
- Numerical variables
Ordinal
Discrete
Continuous
9Review - types of variables
- Categorical variables
- Nominal - no natural order
- Ordinal - can be placed in an order
10Review - types of variables
- Categorical variables
- Nominal - no natural order
- Example - country of birth
- Ordinal - can be placed in an order
11Review - types of variables
- Categorical variables
- Nominal - no natural order
- Example - country of birth
- Ordinal - can be placed in an order
- Example - educational experience
- Some high school, high school diploma, some
college, college degree, masters degree, PhD
12Sampling from a population
- We often sample from a population
- Consider random samples
- Each individual has an equal and identical
probability of being selected
13Body mass of 400 humans
14(No Transcript)
15Random sample of 10 people
16(No Transcript)
17(No Transcript)
18Population mean ? 70.8 kg
19Population mean ? 70.8 kg
Sample mean x 76.7 kg
20Another sample
21(No Transcript)
22Population mean ? 70.8 kg
Sample mean x 69.2 kg
23What if we do this many times?
24n 20,290
25n 20,290 ? 2622.0 ? 2037.9
26Sample histogram
27n 100 Y 2675.4 s 1539.2
28Y 2767.2 s 2044.7
Y 2675.4 s 1539.2
Y 2588.8 s 1620.5
Y 2702.4 s 1727.1
29Sampling distribution of the mean
Y 2767.2 s 2044.7
Y 2675.4 s 1539.2
Y 2588.8 s 1620.5
Y 2702.4 s 1727.1
30Sampling distribution of the mean
1000 samples
31Sampling distribution of the mean
32Sampling distribution of the mean
33? 2622.0
Sampling distribution of the mean
Mean of means 2626.4
34Y 2767.2 s 2044.7
Y 2675.4 s 1539.2
Y 2588.8 s 1620.5
Y 2702.4 s 1727.1
35Sampling distribution of the standard deviation
s 2044.7
s 1539.2
s 1620.5
s 1727.1
36Sampling distribution of the standard deviation
37Sampling distribution of the standard deviation
100 samples Population ? 2036.9 Mean sample s
1962.6
38Sampling distribution of the standard deviation
1000 samples Population ? 2036.9 Mean sample s
1929.7
39Sampling distribution of the mean, n10
Sampling distribution of the mean, n100
Sampling distribution of the mean, n 1000
40Sampling distribution of the mean, n10
Sampling distribution of the mean, n100
Sampling distribution of the mean, n 1000
41(No Transcript)
42Larger sample size
43(No Transcript)
44Group activity 2
- Form groups of size 2-5
- Get out a blank sheet of paper
- Write everyones full name on the paper
45How many toes do aliens have?
46(No Transcript)
47(No Transcript)
48Instructions
- You have measurements from a population of 400
aliens - Use your random number table to select a sample
of ten measurements - Calculate your sample mean and, if you have a
calculator or a large brain, your sample standard
deviation - On your paper, answer the following
- What was your sample mean and standard deviation?
- How did you randomly choose your sample?
49Distribution of the sample mean
- No matter what the frequency distribution of the
population - The sample mean has an approximately bell-shaped
(normal) distribution - Especially for large n (large samples)
50How precise is any one estimated sample mean?
51 The standard error of an estimate is the
standard deviation of its sampling distribution.
The standard error predicts the sampling error of
the estimate.
52Standard error of the mean
53Estimate of the standard error of the mean
54Confidence interval
- Confidence interval
- a range of values surrounding the sample estimate
that is likely to contain the population
parameter - 95 confidence interval
- plausible range for a parameter based on the data
55The 2SE rule-of-thumb
56Confidence interval
57Pseudoreplication
The error that occurs when samples are not
independent, but they are treated as though they
are.
58Example The transylvania effect
A study of 130,000 calls for police assistance
in 1980 found that they were more likely than
chance to occur during a full moon.
59Example The transylvania effect
A study of 130,000 calls for police assistance
in 1980 found that they were more likely than
chance to occur during a full moon.
Problem There may have been 130,000 calls in
the data set, but there were only 13 full moons
in 1980. These data are not independent.