Title: LSSG Green Belt Training
1LSSG Green Belt Training
Descriptive Statistics
2Describing Data Summary Measures
Measures of Central Location Mean, Median,
Mode Measures of Variation Range,
Variance and Standard Deviation
3Mean
- It is the Arithmetic Average of data values
- The Most Common Measure of Central Tendency
- Affected by Extreme Values (Outliers)
x
x
x
n
x
å
x
n
2
i
i
Sample Mean
1
i
n
n
0 1 2 3 4 5 6 7 8 9 10
0 1 2 3 4 5 6 7 8 9 10 12
14
Mean 5
Mean 6
4Median
- Important Measure of Central Tendency
- In an ordered array, the median is the middle
number. - If n is odd, the median is the middle number.
- If n is even, the median is the average of the 2
middle numbers. - Not Affected by Extreme Values
0 1 2 3 4 5 6 7 8 9 10
0 1 2 3 4 5 6 7 8 9 10 12
14
5Mode
- A Measure of Central Tendency
- Value that Occurs Most Often
- Not Affected by Extreme Values
- There May Not be a Mode
- There May be Several Modes
- Used for Either Numerical or Categorical Data
0 1 2 3 4 5 6
6Measures Of Variability
Range and Inter Quartile Range Variance and
Standard Deviation Coefficient of Variation
7 Range
- Measure of Variation
- Difference Between Largest Smallest
- Observations
- Range Highest Value Lowest Value
- Ignores How Data Are Distributed
8Inter Quartile Range
- Difference between the 75th percentile (3rd
Quartile) and the 25th percentile (1st Quartile) - Eliminates Effects of Outliers
- Captures how data are distributed around the
median (2nd Quartile)
Q2
Q3
Q1
IQR
9 Variance
- Important Measure of Variation
- Shows Variation About the Mean
- For the Population
- For the Sample
)
2
m
å
s
(
)
2
-
X
2
i
s
-
1
n
10 Standard Deviation
- Most Important Measure of Variation
- Shows Variation About the Mean
- For the Population
- For the Sample
For the Population use N in the denominator.
For the Sample use n - 1 in the denominator.
11 Sample Standard Deviation
For the Sample use n - 1 in the denominator.
Data 10 12 14 15 17 18
18 24
n 8 Mean 16
Sample Standard Deviation 4.24
12Comparing Standard Deviations
Data A
Mean 15.5 s 3.3
11 12 13 14 15 16 17 18
19 20 21
Data B
Mean 15.5 s .92
11 12 13 14 15 16 17 18
19 20 21
Data C
Mean 15.5 s 4.57
11 12 13 14 15 16 17 18
19 20 21
13Coefficient of Variation
- Relative Variation (adjusted for the mean)
- Measured as a
- Adjusts for differences in magnitude of data
- Comparison of variation across groups
-
14Comparing Coefficient of Variation
- Stock A Average Price last year 50
- Standard Deviation 5
- Stock B Average Price last year 100
- Standard Deviation 5
Coefficient of Variation Stock A CV
10 Stock B CV 5
15Shape of Distribution
- Describes How Data Are Distributed
- Measures of Shape
- Symmetric or skewed
16BOX PLOTS
Captures Many Statistics in One Chart
Mean
Max
Min
Median
Q1
Q3