Title: Data Mining Tools For ZLE
1. Data Mining Tools For ZLE
Copying and Use Restrictions: Material in this
presentation is the intellectual property of HP
Corporation and Genus Software. Any use of this
material, in part or whole, except in the
context of Genus Data Mining Integrator and Data
Mart Builder, without written permission from HP
and Genus is prohibited.
2. agenda
- data mining in ZLE solutions
- ZLE data mining toolkit
- toolkit demonstration
3. definitions of data mining
- Meta Group
  - process of identifying and/or extracting previously unknown, non-trivial, unanticipated, important information from large sets of data
- Gartner Group
  - process of discovering meaningful new correlations, patterns and trends by sifting through large amounts of data stored in repositories, using pattern recognition technologies, statistical and mathematical techniques
4. role of data mining in ZLE
- role
  - determine most effective responses to business events
- ZLE facilitates mining by providing
  - a rich, integrated, current data source
  - an integrated operational environment into which models can be deployed
- data mining helps to realize the full business value of a ZLE system
5. ZLE data mining process
- understand the opportunity
  - identify and define business opportunity
- prepare data
  - profile and understand data
  - derive attributes
  - transform data
  - create case set
- build models
  - train models
  - assess model performance
- use models
  - deploy model
  - monitor model performance
(process diagram: the steps above shown as a flow, with the annotation "typically about 75% of process")
6. agenda
- data mining in ZLE solutions
- ZLE data mining toolkit
- toolkit demonstration
7. the ZLE data mining toolkit
- goal
  - provide tools that facilitate ZLE data mining
  - reduce process cycle times dramatically
- three tools being developed by Genus Software
  - data preparation
  - data transfer
  - model deployment
- partners: Genus, MicroStrategy, SAS
- product names
  - Genus Mining Integrator for NonStop SQL (all three tools)
  - Genus Mart Builder for NonStop SQL (first two tools only)
8. ZLE data mining analytical cycle
9. agenda
- data mining in ZLE solutions
- ZLE data mining toolkit
- toolkit demonstration
10. toolkit demonstration
- credit card fraud detection example
- opportunity: use ZLE data store data to predict, in real time, which credit card purchases are likely to be fraudulent
- use tools to
  - build a case set table with one row describing each purchase
  - transfer table to SAS server for modeling
  - deploy predictive model to ZLE data store
  - execute model in real time to make fraud predictions
- steps described, including many tool screen shots
11. toolkit data preparation solution
- based on the MicroStrategy (MSI) Business Intelligence toolset; leverages GUI, logical data model support, SQL generation, etc.
- uses NonStop SQL/MX DBMS; leverages sampling, TRANSPOSE, statistical functions, etc.
- custom tool developed by Genus using MSI SDK for NonStop SQL operations and functionality not supported by MSI tools
12. two main ZLE data preparation tasks
- profile tables
  - column names and types
  - partitioning information, attributes, key structure, etc.
  - column values
- transform source tables
  - derive new attributes
  - aggregate to appropriate level
  - clean data
  - pivot
  - combine to form case set
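The transform tasks listed above can be illustrated with a small sketch. This is not the Genus or MicroStrategy tooling, which generates NonStop SQL; it is a plain Python illustration under assumed data. The table layout, the column names (`purchase_id`, `card_no`, `category`, `amount`) and the function `build_case_set` are all hypothetical.

```python
# Hypothetical source rows: one row per line item of a purchase.
line_items = [
    {"purchase_id": 1, "card_no": "A", "category": "fuel", "amount": 40.0},
    {"purchase_id": 1, "card_no": "A", "category": "food", "amount": 10.0},
    {"purchase_id": 2, "card_no": "B", "category": "electronics", "amount": 900.0},
    {"purchase_id": 2, "card_no": "B", "category": "food", "amount": None},
]
# Hypothetical label source: purchases already known to be fraudulent.
fraud_purchases = {2}


def build_case_set(items, fraud_ids):
    """Return one row per purchase, built from line-item rows."""
    categories = sorted({row["category"] for row in items})
    cases = {}
    for row in items:
        if row["amount"] is None:
            continue  # clean: drop line items with a missing amount
        case = cases.setdefault(row["purchase_id"], {
            "purchase_id": row["purchase_id"],
            "card_no": row["card_no"],
            "total_amount": 0.0,
            "item_count": 0,
            **{f"amt_{c}": 0.0 for c in categories},
        })
        case["total_amount"] += row["amount"]            # aggregate to purchase level
        case["item_count"] += 1
        case[f"amt_{row['category']}"] += row["amount"]  # pivot category into a column
    for case in cases.values():
        # derive a new attribute from the aggregates
        case["avg_item_amount"] = case["total_amount"] / case["item_count"]
        # combine with the fraud label to form the case set
        case["is_fraud"] = int(case["purchase_id"] in fraud_ids)
    return [cases[key] for key in sorted(cases)]


case_set = build_case_set(line_items, fraud_purchases)
```

Each element of `case_set` is one purchase with aggregated, pivoted and derived columns plus the `is_fraud` label, which is the "one row per purchase" shape the modeling step expects.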
13. the MicroStrategy desktop
14. MSI profile report: fraud vs. billing state
15. NonStop SQL/MX sampling
- source table sampling
    insert into CustSamp
      select * from Cust sample random 1 percent clusters of 10 blocks
      union
      select * from Cust where CardNo in (select CardNo from FrdFlg)
- enables interactive and exploratory data prep
- cleanly integrated into SQL
- performed efficiently in DP2
- easily accessible through Genus tool
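The sampling statement on this slide combines a small random sample of all customers with every customer that has a fraud flag, so the rare fraud cases are not lost. A minimal Python sketch of that idea follows. It assumes row-level random sampling rather than the block-cluster sampling that SQL/MX performs in DP2, and it assumes `card_no` is unique per customer row. The function `sample_with_fraud` and the field names are hypothetical.

```python
import random


def sample_with_fraud(customers, fraud_card_nos, percent=1.0, seed=0):
    """Random percent sample of customers, plus every fraud-flagged customer."""
    rng = random.Random(seed)  # seeded so the sketch is repeatable
    fraud = set(fraud_card_nos)
    sample = {}
    for row in customers:
        picked = rng.random() < percent / 100.0
        if picked or row["card_no"] in fraud:
            # Keying by card_no removes duplicates, like UNION does
            # (assumption: card_no is unique per customer row).
            sample[row["card_no"]] = row
    return list(sample.values())


# Hypothetical stand-ins for the Cust and FrdFlg tables.
customers = [{"card_no": n, "state": "CA" if n % 2 else "NY"} for n in range(10_000)]
fraud_flags = [7, 4242, 9999]

cust_samp = sample_with_fraud(customers, fraud_flags, percent=1.0, seed=42)
```

The resulting `cust_samp` is small enough for interactive exploration while still containing all flagged cards.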
16. creating a materialized sample table using the Genus Data Mart Builder
17. identifying source and sample method
18. specifying materialized sample table
19. transforming source data
20. result: a case set for modeling
- hundreds of attributes
- mix of fraud and no-fraud purchases
- one row per purchase
21. MSI Datamart report summarizing items
22. data transfer tool
- task: transfer case set from data store to mining mart
(diagram labels: Data Store (NonStop SQL/MX), ASCII files, Mining Mart (SAS data set))
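The slide names ASCII files as the exchange format between the data store and the mining mart. The sketch below shows only the general idea of writing a case set as delimited ASCII text. It is not the Genus transfer tool; the function `export_case_set`, the comma delimiter and the header row are assumptions made for illustration.

```python
import csv
import io


def export_case_set(rows, out):
    """Write rows (dicts sharing the same keys) as comma-delimited text."""
    if not rows:
        raise ValueError("case set is empty")
    fieldnames = list(rows[0].keys())
    writer = csv.DictWriter(out, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()          # first line names the columns
    for row in rows:
        writer.writerow(row)      # one line per case (purchase)
    return fieldnames


# Hypothetical two-row case set.
case_set = [
    {"purchase_id": 1, "total_amount": 50.0, "is_fraud": 0},
    {"purchase_id": 2, "total_amount": 900.0, "is_fraud": 1},
]

buffer = io.StringIO()            # stands in for a file on the transfer path
columns = export_case_set(case_set, buffer)
ascii_text = buffer.getvalue()
```

A header row is included so that whatever reads the file on the mining mart side can recover the column names.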
23. data transfer specification screen
24. transfer monitoring
25. modeling in SAS Enterprise Miner
26. model export
- score converter node generates Java model code
- reporter node exports code and HTML report to project directory
27. model deployment tool
- task
  - copy model information to a ZLE Data Store
(diagram label: Model export/registration)
28. starting the model deployment tool
29. connecting to a Data Store
30. a list of models in the Data Store
31. viewing a deployed model
32. selecting a SAS report directory
33. viewing available reports
34. viewing an Enterprise Miner report
35. deploying a model
36. deployment confirmation
37. real-time scoring using the Recommender
(diagram labels: Interaction Manager; Aggregation Engine, Aggregate Definitions, Customer Data, Model Aggregates; Scoring Engine, Deployed Models, Model Scores; Rules Engine, Business Rules, Offers/Advice)
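One plausible reading of the Recommender diagram is a three-stage flow: the Aggregation Engine turns customer data into model aggregates, the Scoring Engine applies a deployed model to produce a score, and the Rules Engine maps the score to an offer or advice. The Python sketch below follows that assumed flow. The aggregate definitions, the logistic model coefficients, the thresholds and the advice strings are all hypothetical, and the deployed models in the presentation are Java code generated by SAS Enterprise Miner, not Python.

```python
import math

# Hypothetical aggregate definitions: name -> function over a customer's purchases.
aggregate_definitions = {
    "purchase_count": lambda purchases: len(purchases),
    "max_amount": lambda purchases: max((p["amount"] for p in purchases), default=0.0),
}

# Hypothetical deployed model: a logistic model over the aggregates.
deployed_model = {
    "intercept": -4.0,
    "weights": {"purchase_count": 0.1, "max_amount": 0.004},
}

# Hypothetical business rules: (minimum score, advice), checked in order.
business_rules = [
    (0.8, "decline and call cardholder"),
    (0.5, "request additional verification"),
]


def aggregation_engine(purchases, definitions):
    """Compute model aggregates from customer data."""
    return {name: fn(purchases) for name, fn in definitions.items()}


def scoring_engine(model, aggregates):
    """Return a fraud score between 0 and 1 from the model aggregates."""
    z = model["intercept"] + sum(
        weight * aggregates[name] for name, weight in model["weights"].items()
    )
    return 1.0 / (1.0 + math.exp(-z))


def rules_engine(score, rules, default="approve"):
    """Return the advice of the first rule whose threshold the score meets."""
    for threshold, advice in rules:
        if score >= threshold:
            return advice
    return default


def recommend(purchases):
    """Run the assumed aggregate -> score -> rules flow for one interaction."""
    aggregates = aggregation_engine(purchases, aggregate_definitions)
    score = scoring_engine(deployed_model, aggregates)
    return score, rules_engine(score, business_rules)
```

Calling `recommend([{"amount": 20.0}, {"amount": 35.0}])` yields a low score and the default advice, while a history with several very large purchases crosses the higher threshold.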
38. how to get the data mining tools
- Product Names
  - Genus Mining Integrator for NonStop SQL (Data Preparation, Data Transfer, and Model Deployment tools)
  - Genus Mart Builder for NonStop SQL (first two tools only)
- Can be ordered through HP, support provided by Genus
- Availability: calendar Q4 2002
- For more information, contact
  - kevin.christian_at_hp.com (Product Manager)
  - susan.mcconkey_at_hp.com (Program Manager)
  - angad_at_genussoft.com (Development)