Title: OPIM 5103 Statistics: Looking at Data, Computing Summary Statistics
1OPIM 5103 Statistics: Looking at Data, Computing Summary Statistics
2Topic 2: Looking at and Examining Data
- Looking at Data (Ch. 2)
  - Numerical Data
    - Histograms
    - Cumulative frequency distributions
    - Scatterplots
  - Categorical Data
    - Bar Charts, Pie Charts, Pareto diagrams
    - Contingency Tables (Excel's PivotTable)
- Computing Summary Statistics
  - Measures of Central Tendency
  - Measures of Dispersion
3Looking at Data
4Types of Data
5Displaying Numerical Data
6Displaying Numerical Data
7EXCEL Tutorial: Histograms
8Displaying Bivariate Numerical Data
9Displaying Categorical Data
- Example: bar chart, pie chart, Pareto diagram
10Displaying Categorical Data
- Example: bar chart, pie chart, Pareto diagram
11Displaying Categorical Data
- Example: bar chart, pie chart, Pareto diagram
12Displaying Bivariate Categorical Data
13EXCEL Tutorial: PivotTables
14Computing Summary Statistics from Data
- Central Tendency
  - mean
  - median
- Spread/Dispersion
  - variance
  - interquartile range
15Median
16Mean (Arithmetic Mean)
- The most common measure of central tendency
- Affected by extreme values (outliers)
[Figure: two number-line plots; the first has axis ticks 0 to 10 and Mean = 5, the second extends the axis to 14 and has Mean = 6]
Excel function: AVERAGE(range)
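A minimal Python sketch of the outlier effect described above. The two data sets are illustrative assumptions, chosen only so that their means come out to 5 and 6; they are not the values plotted on the slide.

```python
from statistics import mean

# Illustrative values (an assumption, not taken from the slide).
without_outlier = [1, 3, 5, 7, 9]
with_outlier = [1, 3, 5, 7, 14]  # largest value replaced by an extreme one

mean_without = mean(without_outlier)
mean_with = mean(with_outlier)

# One extreme value is enough to pull the mean upward.
print(f"mean without outlier: {mean_without:.1f}")
print(f"mean with outlier:    {mean_with:.1f}")
```

Python's `statistics.mean` plays the same role here as Excel's AVERAGE.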
17Median
- Robust measure of central tendency
- Not affected by extreme values
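- In an ordered array, the median is the middle number (with an even number of values, the average of the two middle numbers)
[Figure: two number-line plots; the first has axis ticks 0 to 10, the second extends the axis to 14; Median = 5 in both]
Excel function: MEDIAN(range)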
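The robustness of the median can be checked with a short Python sketch. The data sets are illustrative assumptions picked so that the median is 5 in both cases; the third list is added here only to show the even-count case.

```python
from statistics import median

# Illustrative values (an assumption, not taken from the slide).
without_outlier = [1, 3, 5, 7, 9]
with_outlier = [1, 3, 5, 7, 14]  # extreme value in the last position
even_count = [1, 3, 5, 7]        # no single middle value

median_without = median(without_outlier)
median_with = median(with_outlier)
median_even = median(even_count)  # average of the two middle values, 3 and 5

print(median_without, median_with, median_even)
```

`statistics.median` sorts the values internally, so the input does not need to be ordered first.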
18Quartiles
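- Split Ordered Data into 4 Quarters
- The second quartile is the median, a measure of central tendency
[Figure: ordered data divided into four quarters, each containing 25% of the observations]
Excel function: QUARTILE(range, number), where number is 0 for the minimum value, 1 for Q1, 2 for the median, 3 for Q3, and 4 for the maximum value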
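As a rough Python counterpart to the Excel call, the sketch below uses `statistics.quantiles` (Python 3.8+). The data are illustrative, and the choice of `method="inclusive"` is an assumption: it interpolates between order statistics in the way Excel's QUARTILE is generally understood to, but small differences between tools are possible.

```python
from statistics import quantiles

# Illustrative data (an assumption, not from the slides).
data = [1, 3, 5, 7, 9, 11, 13]

# n=4 asks for the three cut points that split the data into quarters.
q1, q2, q3 = quantiles(data, n=4, method="inclusive")

# Five values corresponding to QUARTILE(range, 0) through QUARTILE(range, 4).
five_numbers = [min(data), q1, q2, q3, max(data)]
print(five_numbers)
```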
19Interquartile Range
- Measure of spread/dispersion
- Also known as midspread
- Spread in the middle 50% of the data
- Difference between the third and first quartiles: IQR = Q3 - Q1
- Not affected by extreme values
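A small Python sketch of why the interquartile range is resistant to extreme values. The helper name `iqr`, the data, and the `method="inclusive"` quartile convention are all assumptions made for illustration.

```python
from statistics import quantiles

def iqr(values):
    """Interquartile range: third quartile minus first quartile."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    return q3 - q1

# Illustrative data (an assumption, not from the slides).
data = [1, 3, 5, 7, 9, 11, 13]
with_outlier = [1, 3, 5, 7, 9, 11, 130]  # largest value made extreme

iqr_data = iqr(data)
iqr_outlier = iqr(with_outlier)
range_data = max(data) - min(data)
range_outlier = max(with_outlier) - min(with_outlier)

# The IQR is unchanged, while the full range is inflated by the outlier.
print(iqr_data, iqr_outlier, range_data, range_outlier)
```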
20Variance
- Important measure of variation
- Shows variation about the mean
- Sample variance: roughly the average of the squared deviations from the mean (the sum of squares is divided by n - 1 rather than n)
- Standard deviation: the square root of the variance
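Written out, the two definitions take the usual textbook form. The notation is an assumption (it does not appear in the slide text): $x_1, \dots, x_n$ are the sample values, $\bar{x}$ is the sample mean, $s^2$ is the sample variance, and $s$ is the sample standard deviation.

```latex
\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i,
\qquad
s^2 = \frac{1}{n-1}\sum_{i=1}^{n} \left(x_i - \bar{x}\right)^2,
\qquad
s = \sqrt{s^2}
```

For example, for the values 1, 3, 5, 7, 9 the mean is 5, the squared deviations sum to 40, and so $s^2 = 40/4 = 10$ and $s = \sqrt{10} \approx 3.16$.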
21Excel functions
- Variance
- VAR(range)
- Standard Deviation
- STDEV(range)
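For readers working outside Excel, a Python sketch of the same two calculations follows. `statistics.variance` and `statistics.stdev` use the sample (n - 1) divisor, as Excel's VAR and STDEV do; the data are illustrative, and the manual calculation is included only to show what the library calls compute.

```python
from math import sqrt
from statistics import mean, stdev, variance

# Illustrative data (an assumption, not from the slides).
data = [1, 3, 5, 7, 9]

# Manual calculation: sum of squared deviations divided by n - 1.
x_bar = mean(data)
manual_var = sum((x - x_bar) ** 2 for x in data) / (len(data) - 1)
manual_sd = sqrt(manual_var)

# Library equivalents of VAR(range) and STDEV(range).
lib_var = variance(data)
lib_sd = stdev(data)

print(manual_var, lib_var)
print(round(manual_sd, 4), round(lib_sd, 4))
```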
22Comparing Standard Deviations
[Figure: three dot plots on a common axis from 11 to 21]
Data A: Mean = 15.5, s = 3.338
Data B: Mean = 15.5, s = 0.9258
Data C: Mean = 15.5, s = 4.57
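The comparison can be reproduced with a short Python sketch. The three lists below are an assumption: the plotted points are not given in the text, so these values were chosen because they reproduce the reported mean and s for each data set.

```python
from statistics import mean, stdev

# Assumed values, chosen to match the reported mean (15.5) and s of each set.
data_a = [11, 12, 13, 16, 16, 17, 18, 21]  # moderately spread out
data_b = [14, 15, 15, 15, 16, 16, 16, 17]  # tightly clustered around the mean
data_c = [11, 11, 11, 12, 19, 20, 20, 20]  # concentrated at the two extremes

summary = {
    name: (mean(values), stdev(values))
    for name, values in [("A", data_a), ("B", data_b), ("C", data_c)]
}

for name, (m, s) in summary.items():
    print(f"Data {name}: mean = {m:.1f}, s = {s:.4f}")
```

All three sets share the same mean, so the mean alone cannot distinguish them; the standard deviation grows as more of the values sit far from 15.5.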
23Coefficient of Correlation
- Measures the strength of the linear relationship between two quantitative variables
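One common way to write the sample coefficient of correlation is shown below. The notation is an assumption (it is not present in the slide text): $(x_i, y_i)$ are the $n$ paired observations, and $\bar{x}$, $\bar{y}$ are the sample means.

```latex
r = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}
         {\sqrt{\sum_{i=1}^{n} (x_i - \bar{x})^2 \; \sum_{i=1}^{n} (y_i - \bar{y})^2}}
```

The numerator is positive when large x values tend to pair with large y values and negative when they tend to pair with small ones; the denominator rescales the result so that it falls between -1 and 1.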
24Features of Correlation Coefficient
- Unit free
- Ranges between -1 and 1
- The closer to -1, the stronger the negative linear relationship
- The closer to 1, the stronger the positive linear relationship
- The closer to 0, the weaker the linear relationship
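These features can be checked numerically with a small Python sketch. The function name `pearson_r` and all data values are assumptions made for illustration; the function implements the usual sample correlation formula directly rather than relying on a library.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Sample coefficient of correlation between two equal-length lists."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Illustrative data (an assumption, not from the slides).
x = [1, 2, 3, 4, 5]
y_up = [2 * v + 1 for v in x]     # points exactly on an increasing line
y_down = [10 - 3 * v for v in x]  # points exactly on a decreasing line
y_noisy = [2, 1, 4, 3, 5]         # increasing trend with some scatter

r_up = pearson_r(x, y_up)
r_down = pearson_r(x, y_down)
r_noisy = pearson_r(x, y_noisy)

# Unit free: rescaling x (say, metres to centimetres) leaves r unchanged.
x_rescaled = [100 * v for v in x]
r_rescaled = pearson_r(x_rescaled, y_noisy)

print(r_up, r_down, round(r_noisy, 3), round(r_rescaled, 3))
```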
25Scatter Plots of Data with Various Correlation Coefficients
[Figure: five scatter plots of Y against X, with r = -1, r = -0.6, r = 0, r = 1, and r = 0.6]