Title: Quantitative Data Mining Tools
1Quantitative Data Mining Tools
Shannon Adams Katrina Thomas Visa Metcalf
- MIS 4093-Team 9B
- October 31, 2002
2DiamondDiamond is a quantitative
visualization package that was developed by the
Exploratory Visualization Group at IBM research
and is currently licensed to SPSS for
distribution.
3Formatting and Loading Data in Diamond
- The basic layout is to put variables denoted by
titles in columns with corresponding values for
each case in rows. - The maximum size of any cell is 20 characters.
- The worksheet is very simple in its construction
and usage because it simply requires you to
specify the titles of the columns followed by
rows of values separated by blanks or tabs.
4Data Manipulation Functions Supported by Diamond
- Casewise
- Color
- Distribution
- Miscellaneous
- Missing Value
- Ordering
- Periodic
- Primitive Arithmetic
- Statistical
- Tessellating
- Transforming
- Unique
5Figure 9.12
6Figure 9.13
7Figure 9.14
8Figure 9.15
9Figure 9.16
10CrossGraphs
- CrossGraphs is perhaps one of the most flexible
commercial data visualization systems available
for performing multidimensional quantitative
analyses. - The system was originally developed for the
government for use in clinical cancer research
studies.
11Platform Support and Development Environment of
CrossGraphs
- The CrossGraphs system configuration is supported
on a variety of platforms including UNIX, NT,
PowerPC MacIntosh, and Windows configurations, - CrossGraphs is written in CG, which is a
Belmont defined programming language similar to
Java.
12Importing and Processing the Data in CrossGraphs
- CrossGraphs has a variety of data access methods
that can be used to couple to data sources. - It is ODBC compliant and also has special
interfaces for SAS data sets, ASCII files, and
dBase which makes transferring data to
CrossGraphs a fairly straight forward process in
most cases.
13CrossGraph Design Window
14Generating Graphical Displays in CrossGraphs
- There are a number of ways to generate a design
in CrossGraphs they include - Box Plots Graphs
- Contingency Tables Graphs
- Counts Graphs
- Delta Graphs
- Histogram Graphs
15Sample set of Box Plots created within CrossGraphs
16Example of a Contingency Table Graph in
CrossGraphs
17Several different Counts Graphs produced by
CrossGraphs
18Set of CrossGraphs Delta Graphs
19Histograms in CrossGraphs
20Picket Fences in CrossGraphs
21Scatter Plots in CrossGraphs
22Spatial Maps in CrossGraphs
23Statistics Graphs in CrossGraphs
24Survival Curves in CrossGraphs
25Timeline Summaries in CrossGraphs
26Trend Graphs in CrossGraphs
27Other Quantitative Visualization Systems
28Features supported by Graf-FX
- You can drill-down on any table or query and
display it either through group or
pivot/crosstabs tables. - Graf-FX is a good after market add-on to the
Access environment because it makes the data come
alive and allows the analyst to interact with the
data set. - You can view the SQL statements or save the query
from any one of the drill-down exercises.
29Other Quantitative Visualization Systems (contd)
30Shannons QuestionWhat are the basic rules
for formatting and loading data in diamond?
31KaTrinas Qusestion
32Visas Question
- What are several features supported by Graf-Fx
that are useful during a data mining analysis?