DATA MINING KNOWLEDGE DISCOVERY - PowerPoint PPT Presentation

1 / 9
About This Presentation
Title:

DATA MINING KNOWLEDGE DISCOVERY

Description:

... in the lean periods may be likely to buy at least once during Christmas period. ... and regression tree, memory-based reasoning, or nearest neighbor and ... – PowerPoint PPT presentation

Number of Views:35
Avg rating:3.0/5.0
Slides: 10
Provided by: Shi116
Category:

less

Transcript and Presenter's Notes

Title: DATA MINING KNOWLEDGE DISCOVERY


1
DATA MINING(KNOWLEDGE DISCOVERY)
  • Data mining is an attempt at knowledge discovery
    searching for patterns and structure in large
    data sets.
  • OLAP is about confirming the known, we might say
    that data mining is about exploring the unknown.
  • Data mining uses a blend of statistical analysis,
    AI, and computer graphics techniques

2
GOALS OF DATA MINING
  • Explanatory To explain some observed event or
    condition, such as why sales of pickup trucks
    have increased in Colorado
  • Confirmatory To confirm a hypothesis, such as
    whether two-income families are more likely to
    buy family medical coverage than single-income
    families
  • Exploratory To analyze data for new or
    unexpected relationships, such as what spending
    patterns are likely to accompany credit card fraud

3
  • TECHNIQUES USED IN DATA MINING
  • Association rules - e.g., whenever a customer
    buys video equipment, he or she also buys another
    electronic gadget.
  • Sequential patterns e.g., suppose a customer
    buys a camera, and within three months he or she
    buys photographic supplies, and within six months
    an accessory item. A customer who buys more than
    twice in the lean periods may be likely to buy at
    least once during Christmas period.
  • Classification trees e.g., customers may be
    classified by frequency of visits, by types of
    financing used, by amount of purchase.

4
Extraction Of Knowledge From Data
5
Four Phases of Data Mining
  • 1. Data Preparation
  • Identify and cleanse data sets.
  • Data Warehouse is usually used for data mining
    operations.
  • E.g a database consumer goods retailer.
  • Client data cust_name, zip code, phone number,
    date of purchase, item code, price, qty, total
    amt.

6
  • 2. Data Analysis and Classification
  • Identify common data characteristics or patterns
    using
  • Data groupings, classifications, clusters, or
    sequences.
  • Data dependencies, links, or relationships.
  • Data patterns, trends, and deviations.
  • E.g categories of items, from stores in a
    specific region

7
  • 3. Knowledge Acquisition
  • Select the appropriate modeling or knowledge
    acquisition algorithms.
  • Examples neural networks, decision trees, rules
    induction, genetic algorithms, classification and
    regression tree, memory-based reasoning, or
    nearest neighbor and data visualization.
  • e.g People who are in the salary range gtRM5
    per month are likely to spend RM1K for groceries.

8
  • 4. Prognosis
  • Predict future behavior and forecast business
    outcomes using the data mining findings.
  • E.g
  • 65 of customers who did not use a particular
    credit card in the last six months are 88 likely
    to cancel that account.
  • 82 of customers who bought a new TV 27 inches or
    larger are 90 likely to buy an entertainment
    center within the next four weeks

9
Data-Mining Phases
Write a Comment
User Comments (0)
About PowerShow.com