The C4.5 Project - PowerPoint PPT Presentation

About This Presentation
Title:

The C4.5 Project

Description:

Title: C4.5 Presentation Author: LL003 Last modified by: Student Created Date: 12/2/2004 7:03:33 PM Document presentation format: On-screen Show Company – PowerPoint PPT presentation

Number of Views:23
Avg rating:3.0/5.0
Slides: 17
Provided by: LL09
Category:
Tags: project

less

Transcript and Presenter's Notes

Title: The C4.5 Project


1
The C4.5 Project
  • Overview of algorithm with results of
    experimentation

2
Summary
  • Terminology
  • C4.5 vs. ID3
  • Datasets
  • C4.5 results on datasets

3
Terminology
  • Training cases
  • Test cases
  • Unseen cases

4
Gain vs. Gain Ratio
  • ID3 creates complex trees using gain
  • C4.5 uses a different measure
  • Gain ratio considers what ID3 does not
  • Minimum number of instances per leaf node
  • Meaning C4.5 creates more useful models

5
Missing Data
  • ID3 does not make allowances
  • C4.5 adjusts the gain ratio to favor attributes
    with existing values
  • Classifying training and unseen cases
  • C4.5 uses probabilistic weights

6
Pruning
  • ID3 produces complex trees
  • C4.5 prunes trees
  • Pessimistic error prediction
  • Subtree raising
  • Subtree replacement

7
Subtree Raising
8
Subtree Replacement
9
Features of C4.5
  • Rules
  • Consulter
  • Categorical data
  • Windowing

10
Windowing
11
Iris Dataset
12
Wine Dataset
13
Results of C4.5 on Datasets
  • Iris dataset similar results

14
Results of C4.5 on Datasets
  • Wine dataset different results
  • Possible reasons for differences

15
Closing Summary
  • C4.5 vs. ID3
  • Gain vs. gain ratio
  • Missing data
  • Pruning
  • Features of C4.5
  • C4.5 Results
  • Iris similar results
  • Wine different results

16
The End
  • reasonable questions welcomed.
Write a Comment
User Comments (0)
About PowerShow.com