Title: Measures of Dispersion
1 2- Series I 70 70 70 70 70 70 70 70 70 70
- Series II 66 67 68 69 70 70 71 72 73 74
- Series III 1 19 50 60 70 80 90 100 110 120
-
- n 10, ?X 700 Mean 700/10 70
3Measures of Variability
- A single summary figure that describes the spread
of observations within a distribution.
4MEASURES OF DESPERSION
- RANGE
- INTERQUARTILE RANGE
- VARIANCE
- STANDARD DEVIATION
5Measures of Variability
- Range
- Difference between the smallest and largest
observations. - Interquartile Range
- Range of the middle half of scores.
- Variance
- Mean of all squared deviations from the mean.
- Standard Deviation
- Rough measure of the average amount by which
observations deviate from the mean. The square
root of the variance.
6Variability Example Range
- Hotel Rates
- 52, 76, 100, 136, 186, 196, 205, 150, 257, 264,
264, 280, 282, 283, 303, 313, 317, 317, 325, 373,
384, 384, 400, 402, 417, 422, 472, 480, 643, 693,
732, 749, 750, 791, 891 - Range 891-52 839
7Pros and Cons of the Range
- Pros
- Very easy to compute.
- Scores exist in the data set.
- Cons
- Value depends only on two scores.
- Very sensitive to outliers.
- Influenced by sample size (the larger the sample,
the larger the range).
8Inter quartile Range
- The inter quartile range is Q3-Q1
- 50 of the observations in the distribution are
in the inter quartile range. - The following figure shows the interaction
between the quartiles, the median and the inter
quartile range.
9Inter quartile Range
10Quartiles
Inter quartile IQR Q3
Q1
11(No Transcript)
12Pros and Cons of the Interquartile Range
- Pros
- Fairly easy to compute.
- Scores exist in the data set.
- Eliminates influence of extreme scores.
- Cons
- Discards much of the data.
13Percentiles and Quartiles
- Maximum is 100th percentile 100 of values lie
at or below the maximum - Median is 50th percentile 50 of values lie at
or below the median - Any percentile can be calculated. But the most
common are 25th (1st Quartile) and 75th (3rd
Quartile)
14Locating Percentiles in a Frequency Distribution
- A percentile is a score below which a specific
percentage of the distribution falls(the median
is the 50th percentile. - The 75th percentile is a score below which 75
of the cases fall. - The median is the 50th percentile 50 of the
cases fall below it - Another type of percentile The quartile lower
quartile is 25th percentile and the upper
quartile is the 75th percentile
15Locating Percentiles in a Frequency Distribution
25 included here
25th percentile
50 included here
50th percentile
80th percentile
80 included here
16(No Transcript)
17Five Number Summary
- Minimum Value
- 1st Quartile
- Median
- 3rd Quartile
- Maximum Value
18(No Transcript)
19- VARIANCE
- Deviations of each observation from
the mean, then averaging the sum of squares of
these deviations. - STANDARD DEVIATION
-
- ROOT- MEANS-SQUARE-DEVIATIONS
20Variance
- The average amount that a score deviates from the
typical score. - Score Mean Difference Score
- Average of Difference Scores 0
- In order to make this number not 0, square the
difference scores (no negatives to cancel out the
positives).
21Variance Computational Formula
22Variance
- Use the computational formula to calculate the
variance.
23Variability Example Variance
24Pros and Cons of Variance
- Pros
- Takes all data into account.
- Lends itself to computation of other stable
measures (and is a prerequisite for many of
them).
- Cons
- Hard to interpret.
- Can be influenced by extreme scores.
25Standard Deviation
- To undo the squaring of difference scores, take
the square root of the variance. - Return to original units rather than squared
units.
26Quantifying Uncertainty
- Standard deviation measures the variation of a
variable in the sample. - Technically,
27Standard Deviation
Rough measure of the average amount by which
observations deviate on either side of the mean.
The square root of the variance.
28Variability Example Standard Deviation
Mean 6 Standard Deviation 2
29Variability Example Standard Deviation
Mean 371.60 Standard Deviation
30(No Transcript)
31(No Transcript)
32(No Transcript)
33(No Transcript)
34Pros and Cons of Standard Deviation
- Cons
- Influenced by extreme scores.
- Pros
- Lends itself to computation of other stable
measures (and is a prerequisite for many of
them). - Average of deviations around the mean.
- Majority of data within one standard deviation
above or below the mean.
35Mean and Standard Deviation
- Using the mean and standard deviation together
- Is an efficient way to describe a distribution
with just two numbers. - Allows a direct comparison between distributions
that are on different scales.
36(No Transcript)
37WHICH MEASURE TO USE ?
- DISTRIBUTION OF DATA IS SYMMETRIC
-
- ---- USE MEAN S.D.,
- DISTRIBUTION OF DATA IS SKEWED
- ---- USE MEDIAN QUARTILES
38Mean, Median and Mode
39Distributions
- Bell-Shaped (also known as symmetric or
normal) - Skewed
- positively (skewed to the right) it tails off
toward larger values - negatively (skewed to the left) it tails off
toward smaller values
40(No Transcript)
41A 100 samples were selected. Each of the sample
contained 100 normal individuals. The mean
Systolic BP of each sample is presented
110, 120, 130, 90, 100, 140,
160, 100, 120, 120, 110,
130, etc
Mean 120 Sd., 10
Systolic BP level No. of samples 90 -
5 100 - 10 110 - 20
120 - 34 130 - 20
140 - 10 150 - 5 160 -
2
42Normal Distribution
Mean 120 SD 10
120
100
110
130
140
150
90
- The curve describes probability of getting any
range of values ie., P(xgt120), P(xlt100), P(110
ltXlt130) - Area under the curve probability
- Area under the whole curve 1
- Probability of getting specific number 0, eg
P(X120) 0
43 44