Measures of Dispersion - PowerPoint PPT Presentation

About This Presentation
Title:

Measures of Dispersion

Description:

Measures of Dispersion – PowerPoint PPT presentation

Number of Views:495
Avg rating:3.0/5.0
Slides: 45
Provided by: fami4169
Category:

less

Transcript and Presenter's Notes

Title: Measures of Dispersion


1
  • Measures of Dispersion

2
  • Series I 70 70 70 70 70 70 70 70 70 70
  • Series II 66 67 68 69 70 70 71 72 73 74
  • Series III 1 19 50 60 70 80 90 100 110 120
  • n 10, ?X 700 Mean 700/10 70

3
Measures of Variability
  • A single summary figure that describes the spread
    of observations within a distribution.

4
MEASURES OF DESPERSION
  • RANGE
  • INTERQUARTILE RANGE
  • VARIANCE
  • STANDARD DEVIATION

5
Measures of Variability
  • Range
  • Difference between the smallest and largest
    observations.
  • Interquartile Range
  • Range of the middle half of scores.
  • Variance
  • Mean of all squared deviations from the mean.
  • Standard Deviation
  • Rough measure of the average amount by which
    observations deviate from the mean. The square
    root of the variance.

6
Variability Example Range
  • Hotel Rates
  • 52, 76, 100, 136, 186, 196, 205, 150, 257, 264,
    264, 280, 282, 283, 303, 313, 317, 317, 325, 373,
    384, 384, 400, 402, 417, 422, 472, 480, 643, 693,
    732, 749, 750, 791, 891
  • Range 891-52 839

7
Pros and Cons of the Range
  • Pros
  • Very easy to compute.
  • Scores exist in the data set.
  • Cons
  • Value depends only on two scores.
  • Very sensitive to outliers.
  • Influenced by sample size (the larger the sample,
    the larger the range).

8
Inter quartile Range
  • The inter quartile range is Q3-Q1
  • 50 of the observations in the distribution are
    in the inter quartile range.
  • The following figure shows the interaction
    between the quartiles, the median and the inter
    quartile range.

9
Inter quartile Range
10
Quartiles
Inter quartile IQR Q3
Q1
11
(No Transcript)
12
Pros and Cons of the Interquartile Range
  • Pros
  • Fairly easy to compute.
  • Scores exist in the data set.
  • Eliminates influence of extreme scores.
  • Cons
  • Discards much of the data.

13
Percentiles and Quartiles
  • Maximum is 100th percentile 100 of values lie
    at or below the maximum
  • Median is 50th percentile 50 of values lie at
    or below the median
  • Any percentile can be calculated. But the most
    common are 25th (1st Quartile) and 75th (3rd
    Quartile)

14
Locating Percentiles in a Frequency Distribution
  • A percentile is a score below which a specific
    percentage of the distribution falls(the median
    is the 50th percentile.
  • The 75th percentile is a score below which 75
    of the cases fall.
  • The median is the 50th percentile 50 of the
    cases fall below it
  • Another type of percentile The quartile lower
    quartile is 25th percentile and the upper
    quartile is the 75th percentile

15
Locating Percentiles in a Frequency Distribution
25 included here
25th percentile
50 included here
50th percentile
80th percentile
80 included here
16
(No Transcript)
17
Five Number Summary
  • Minimum Value
  • 1st Quartile
  • Median
  • 3rd Quartile
  • Maximum Value

18
(No Transcript)
19
  • VARIANCE
  • Deviations of each observation from
    the mean, then averaging the sum of squares of
    these deviations.
  • STANDARD DEVIATION
  • ROOT- MEANS-SQUARE-DEVIATIONS

20
Variance
  • The average amount that a score deviates from the
    typical score.
  • Score Mean Difference Score
  • Average of Difference Scores 0
  • In order to make this number not 0, square the
    difference scores (no negatives to cancel out the
    positives).

21
Variance Computational Formula
  • Population
  • Sample

22
Variance
  • Use the computational formula to calculate the
    variance.

23
Variability Example Variance
  • Hotel Rates

24
Pros and Cons of Variance
  • Pros
  • Takes all data into account.
  • Lends itself to computation of other stable
    measures (and is a prerequisite for many of
    them).
  • Cons
  • Hard to interpret.
  • Can be influenced by extreme scores.

25
Standard Deviation
  • To undo the squaring of difference scores, take
    the square root of the variance.
  • Return to original units rather than squared
    units.

26
Quantifying Uncertainty
  • Standard deviation measures the variation of a
    variable in the sample.
  • Technically,

27
Standard Deviation
Rough measure of the average amount by which
observations deviate on either side of the mean.
The square root of the variance.
  • Population
  • Sample

28
Variability Example Standard Deviation
Mean 6 Standard Deviation 2
29
Variability Example Standard Deviation
Mean 371.60 Standard Deviation
30
(No Transcript)
31
(No Transcript)
32
(No Transcript)
33
(No Transcript)
34
Pros and Cons of Standard Deviation
  • Cons
  • Influenced by extreme scores.
  • Pros
  • Lends itself to computation of other stable
    measures (and is a prerequisite for many of
    them).
  • Average of deviations around the mean.
  • Majority of data within one standard deviation
    above or below the mean.

35
Mean and Standard Deviation
  • Using the mean and standard deviation together
  • Is an efficient way to describe a distribution
    with just two numbers.
  • Allows a direct comparison between distributions
    that are on different scales.

36
(No Transcript)
37
WHICH MEASURE TO USE ?
  • DISTRIBUTION OF DATA IS SYMMETRIC
  • ---- USE MEAN S.D.,
  • DISTRIBUTION OF DATA IS SKEWED
  • ---- USE MEDIAN QUARTILES

38
Mean, Median and Mode
39
Distributions
  • Bell-Shaped (also known as symmetric or
    normal)
  • Skewed
  • positively (skewed to the right) it tails off
    toward larger values
  • negatively (skewed to the left) it tails off
    toward smaller values

40
(No Transcript)
41
A 100 samples were selected. Each of the sample
contained 100 normal individuals. The mean
Systolic BP of each sample is presented
110, 120, 130, 90, 100, 140,
160, 100, 120, 120, 110,
130, etc
Mean 120 Sd., 10
Systolic BP level No. of samples 90 -
5 100 - 10 110 - 20
120 - 34 130 - 20
140 - 10 150 - 5 160 -
2
42
Normal Distribution
Mean 120 SD 10
120
100
110
130
140
150
90
  • The curve describes probability of getting any
    range of values ie., P(xgt120), P(xlt100), P(110
    ltXlt130)
  • Area under the curve probability
  • Area under the whole curve 1
  • Probability of getting specific number 0, eg
    P(X120) 0

43
  • ANY QUESTIONS

44
  • THANK YOU
Write a Comment
User Comments (0)
About PowerShow.com