Transcript and Presenter's Notes

Title: Chameleon: A Hierarchical Clustering Algorithm Using Dynamic Modeling


1
Chameleon: A Hierarchical Clustering Algorithm
Using Dynamic Modeling
  • By George Karypis, Eui-Hong Han, Vipin Kumar
  • Presented by Prashant Thiruvengadachari (not an author)

2
Existing Algorithms
  • K-means and PAM
  • These algorithms assign K representative points to
    the clusters and form clusters based on a distance
    measure (see the sketch below).
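To make the representative-point idea concrete, here is a minimal k-means sketch in Python (illustrative toy code, not from the paper; PAM works similarly but uses actual data points, medoids, as representatives).

```python
import numpy as np

def kmeans(X, k, n_iters=100, seed=0):
    """Toy k-means: k centroids act as the cluster representatives."""
    rng = np.random.default_rng(seed)
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iters):
        # Assign each point to its nearest centroid (Euclidean distance).
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Recompute each centroid as the mean of its assigned points.
        new_centroids = np.array([X[labels == j].mean(axis=0)
                                  if np.any(labels == j) else centroids[j]
                                  for j in range(k)])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids
```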

3
More algorithms
  • Other algorithms include CURE, ROCK, CLARANS, etc.
  • CURE takes into account the distance between
    representative points.
  • ROCK takes into account inter-cluster aggregate
    connectivity.

4
Chameleon
  • Two-phase approach
  • Phase I
  • Uses a graph partitioning algorithm to divide the
    data set into a set of small sub-clusters.
  • Phase II
  • Uses an agglomerative hierarchical clustering
    algorithm to merge the sub-clusters (outlined below).
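A hypothetical outline of the two-phase flow (the helper names build_knn_graph, partition_graph, and agglomerative_merge are illustrative, not the authors' code; sketches of the first two follow the later slides on the k-NN graph and partitioning):

```python
def chameleon(points, k_neighbors, min_cluster_size):
    """High-level sketch of Chameleon's two phases (illustrative only)."""
    # Phase I: build a sparse k-NN graph and partition it into many
    # small, relatively homogeneous sub-clusters.
    graph = build_knn_graph(points, k_neighbors)
    sub_clusters = partition_graph(graph, min_cluster_size)

    # Phase II: repeatedly merge the sub-cluster pairs with the best
    # combined relative inter-connectivity and relative closeness.
    return agglomerative_merge(graph, sub_clusters)
```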

5
So, basically..
6
  • Why not stop with Phase I? We've got the
    clusters, haven't we?
  • Chameleon (Phase II) takes into account
  • Inter-connectivity
  • Relative closeness
  • Hence, Chameleon takes into account features
    intrinsic to a cluster.

7
Constructing a sparse graph
  • Using a k-nearest-neighbour (k-NN) graph (sketched below).
  • Data points that are far apart are never connected
    by an edge, which reduces the noise in the dataset.
  • Captures the concept of neighbourhood dynamically
    by taking into account the density of the region.
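A brute-force sketch of the k-NN graph construction (the paper does not prescribe a library; this illustrative version builds a weighted NetworkX graph that the later sketches reuse):

```python
import numpy as np
import networkx as nx

def build_knn_graph(X, k):
    """Sparse graph connecting each point only to its k nearest neighbours."""
    n = len(X)
    # Pairwise Euclidean distances (fine for a small illustrative dataset).
    dists = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    G = nx.Graph()
    G.add_nodes_from(range(n))
    for i in range(n):
        # Skip self (index 0 after argsort) and keep the k closest points;
        # far-away points never get an edge, which suppresses noise.
        for j in np.argsort(dists[i])[1:k + 1]:
            G.add_edge(i, int(j), weight=1.0 / (1.0 + dists[i, j]))
    return G
```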

8
What do you do with the graph?
  • Partition the k-NN graph such that the edge cut is
    minimized.
  • Reason: since the edge cut represents similarity
    between the points, a smaller edge cut means less
    similarity between the resulting parts.
  • Multi-level graph partitioning algorithms (the
    hMETIS library) are used to partition the graph
    (see the stand-in sketch below).
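The slides name the hMETIS library; as a rough stand-in, recursive bisection with NetworkX's Kernighan–Lin heuristic sketches the idea of repeatedly splitting while minimizing the weighted edge cut (hMETIS itself is a multi-level partitioner and will produce better cuts):

```python
import networkx as nx
from networkx.algorithms.community import kernighan_lin_bisection

def partition_graph(G, min_size):
    """Recursively bisect G, minimizing the weighted edge cut at each step."""
    if G.number_of_nodes() <= min_size:
        return [set(G.nodes)]
    part_a, part_b = kernighan_lin_bisection(G, weight='weight')
    return (partition_graph(G.subgraph(part_a).copy(), min_size) +
            partition_graph(G.subgraph(part_b).copy(), min_size))
```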

9
Example
10
Cluster Similarity
  • Models cluster similarity based on the relative
    inter-connectivity and relative closeness of the
    clusters.

11
Relative Inter-Connectivity
  • For clusters Ci and Cj:
  • RI(Ci, Cj) = AbsoluteIC(Ci, Cj) /
    ((internalIC(Ci) + internalIC(Cj)) / 2)
  • where AbsoluteIC(Ci, Cj) = sum of the weights of the
    edges that connect Ci with Cj.
  • internalIC(Ci) = weighted sum of the edges of the cut
    that partitions the cluster into roughly equal parts.
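A sketch of these quantities on the weighted NetworkX graph built earlier (the helper names are mine, not the paper's; internal_ic reuses a Kernighan–Lin bisection in place of the min-cut bisection the slides describe):

```python
from networkx.algorithms.community import kernighan_lin_bisection

def absolute_ic(G, Ci, Cj):
    """Sum of the weights of the edges connecting clusters Ci and Cj."""
    return sum(d.get('weight', 1.0) for u, v, d in G.edges(data=True)
               if (u in Ci and v in Cj) or (u in Cj and v in Ci))

def internal_ic(G, C):
    """Weight of the cut splitting cluster C into two roughly equal parts."""
    part_a, part_b = kernighan_lin_bisection(G.subgraph(C).copy(), weight='weight')
    return absolute_ic(G, part_a, part_b)

def relative_ic(G, Ci, Cj):
    return absolute_ic(G, Ci, Cj) / ((internal_ic(G, Ci) + internal_ic(G, Cj)) / 2)
```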

12
Relative Closeness
  • Absolute closeness normalized with respect to the
    internal closeness of the two clusters.
  • Absolute closeness is obtained as the average
    similarity between the points in Ci that are
    connected to the points in Cj,
  • i.e., the average weight of the edges from Ci to Cj.

13
Internal Closeness.
  • Internal closeness of a cluster is obtained as the
    average weight of the edges inside the cluster
    (sketch below).
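Continuing the sketch, relative closeness can be computed in the same style (a simplified reading of these slides; the paper's full formula also weights the internal terms by cluster size):

```python
def internal_closeness(G, C):
    """Average weight of the edges inside cluster C."""
    weights = [d.get('weight', 1.0) for _, _, d in G.subgraph(C).edges(data=True)]
    return sum(weights) / len(weights) if weights else 0.0

def relative_closeness(G, Ci, Cj):
    """Average cross-edge weight, normalized by the clusters' internal closeness."""
    cross = [d.get('weight', 1.0) for u, v, d in G.edges(data=True)
             if (u in Ci and v in Cj) or (u in Cj and v in Ci)]
    absolute_closeness = sum(cross) / len(cross) if cross else 0.0
    return absolute_closeness / ((internal_closeness(G, Ci) +
                                  internal_closeness(G, Cj)) / 2)
```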

14
So, which clusters do we merge?
  • So far, we have
  • a Relative Inter-Connectivity measure, and
  • a Relative Closeness measure.
  • Using them, we decide which clusters to merge.

15
Merging the clusters..
  • If the relative inter-connectivity measure and the
    relative closeness measure are the same, choose
    inter-connectivity.
  • You can also use a threshold test (sketched below):
  • RI(Ci, Cj) ≥ T(RI) and RC(Ci, Cj) ≥ T(RC)
  • This allows multiple clusters to merge at each level.
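A small sketch of the threshold scheme, reusing relative_ic and relative_closeness from the earlier sketches (t_ri and t_rc are user-chosen thresholds; every pair that passes both tests is a merge candidate, which is what allows several merges per level):

```python
def mergeable_pairs(G, clusters, t_ri, t_rc):
    """All cluster pairs passing both thresholds, best combined score first."""
    pairs = []
    for i in range(len(clusters)):
        for j in range(i + 1, len(clusters)):
            ri = relative_ic(G, clusters[i], clusters[j])
            rc = relative_closeness(G, clusters[i], clusters[j])
            if ri >= t_ri and rc >= t_rc:
                pairs.append((i, j, ri * rc))
    return sorted(pairs, key=lambda p: p[2], reverse=True)
```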

16
Good points about the paper
  • Nice description of the working of the system.
  • Reviews existing algorithms and explains why
    Chameleon is better.
  • Not specific to a particular domain.

17
yucky and reasonably yucky parts..
  • Not much information given about the Phase-I part
    of the paper (graph properties?).
  • Finding the complexity of the algorithm:
  • O(nm + n log n + m² log m)
  • Different domains require different measures for
    connectivity and closeness, ...

18
Questions?