Data Mining and Decision Tree - PowerPoint PPT Presentation

1 / 12
About This Presentation
Title:

Data Mining and Decision Tree

Description:

... likely to happen if a particular type of event occurs ... Predict the future ... Helps making decision of what kind of group of people the company should target ... – PowerPoint PPT presentation

Number of Views:33
Avg rating:3.0/5.0
Slides: 13
Provided by: masumis
Category:

less

Transcript and Presenter's Notes

Title: Data Mining and Decision Tree


1
Data Miningand Decision Tree
  • CS157B Spring 2006
  • Masumi Shimoda

2
Outline
  • Brief introduction to data mining
  • Definition
  • Objective
  • Application
  • Decision tree

3
What is Data Mining?
  • Process of automatically finding the
    relationships and patterns, and extracting the
    meaning of enormous amount of data.
  • Also called knowledge discovery

4
Objective
  • Extracting the hidden, or not easily recognizable
    knowledge out of the large data Know the past
  • Predicting what is likely to happen if a
    particular type of event occurs Predict the
    future

5
Application
  • Marketing example
  • Sending direct mail to randomly chosen people
  • Database of recipients attribute data (e.g.
    gender, marital status, of children, etc) is
    available
  • How can this company increase the response rate
    of direct mail?

6
Application (Contd)
  • Figure out the pattern, relationship of
    attributes that those who responded has in common
  • Helps making decision of what kind of group of
    people the company should target

7
  • Data mining helps analyzing large amount of data,
    and making decisionbut how exactly does it work?
  • One method that is commonly used is decision tree

8
Decision Tree
  • One of many methods to perform data mining -
    particularly classification
  • Divides the dataset into multiple groups by
    evaluating attributes
  • Decision tree can be explained a series of nested
    if-then-else statements.

9
Decision Tree (Contd)
  • Each non-leaf node has a predicate associated,
    testing an attribute of data
  • Leaf node represents a class, or category
  • To classify a data, start from root node and
    traverse down the tree by testing predicates and
    taking branches

10
Example of Decision Tree
11
Advantages of Decision Tree
  • Easy to visualize the process of classification
  • Can easily tell why the data is classified in a
    particular category - just trace the path to get
    to the leaf and it explains the reason
  • Simple, fast processing
  • Once the tree is made, just traverse down the
    tree to classify the data

12
Decision Tree is for
  • Classifying the dataset which
  • The predicates return discrete values
  • Does not have an attributes that all data has the
    same value
Write a Comment
User Comments (0)
About PowerShow.com