Title: Hierarchical Clustering
1Hierarchical Clustering
- Dr. Bernard Chen
- Assistant Professor
2Outline
- Hierarchical Clustering
- Hybrid Hierarchical K-means Clustering
- DBSCAN
3Hierarchical Clustering
Venn Diagram of Clustered Data
Dendrogram
From http://www.stat.unc.edu/postscript/papers/marron/Stat321FDA/RimaIzempresentation.ppt
4Nearest Neighbor, Level 2, k = 1 cluster.
5Nearest Neighbor, Level 3, k = 2 clusters.
6Nearest Neighbor, Level 4, k = 3 clusters.
7Nearest Neighbor, Level 5, k = 2 clusters.
8Nearest Neighbor, Level 6, k = 2 clusters.
9Nearest Neighbor, Level 7, k = 2 clusters.
10Nearest Neighbor, Level 8, k = 1 cluster.
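The level-by-level nearest-neighbor merging shown on the slides above can be sketched roughly as follows. This is a minimal illustration, not the code behind the original figures: the function name `single_link_levels` and the sample points are made up for the example, and counting only clusters with more than one point is an assumption about how the slides report k.

```python
from itertools import combinations
from math import dist


def single_link_levels(points):
    """Repeatedly merge the two closest clusters; return the clusters at every level."""
    clusters = [[p] for p in points]  # level 1: every point is its own cluster
    levels = [[list(c) for c in clusters]]
    while len(clusters) > 1:
        # Nearest-neighbor (single-link) distance: closest pair of points across two clusters.
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: min(
                dist(a, b) for a in clusters[ij[0]] for b in clusters[ij[1]]
            ),
        )
        merged = clusters[i] + clusters[j]
        clusters = [c for n, c in enumerate(clusters) if n not in (i, j)] + [merged]
        levels.append([list(c) for c in clusters])
    return levels


# Hypothetical sample data, chosen so that no two merge distances tie.
points = [(0.0, 0.0), (0.0, 1.0), (5.0, 0.0), (5.0, 1.5), (9.0, 0.0)]
levels = single_link_levels(points)
multi_counts = []
for n, clusters in enumerate(levels, start=1):
    # Assumption: k counts only clusters that contain more than one point.
    multi = sum(1 for c in clusters if len(c) > 1)
    multi_counts.append(multi)
    print(f"Level {n}: {len(clusters)} clusters in total, {multi} with more than one point")
```

Under this counting, the number of multi-point clusters first rises and then falls as the merging proceeds, which matches the pattern of k values on the slides.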
11Typical Alternatives to Calculate the Distance between Clusters
- Single link: smallest distance between an element in one cluster and an element in the other, i.e., $dis(K_i, K_j) = \min(t_{ip}, t_{jq})$
- Complete link: largest distance between an element in one cluster and an element in the other, i.e., $dis(K_i, K_j) = \max(t_{ip}, t_{jq})$
- Average: average distance between an element in one cluster and an element in the other, i.e., $dis(K_i, K_j) = \mathrm{avg}(t_{ip}, t_{jq})$
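As a small worked illustration of the three alternatives, the sketch below computes each one for two toy clusters. It assumes Euclidean distance between elements, which the slide does not specify; the function names and the sample clusters `ki` and `kj` are made up for the example.

```python
from math import dist


def pairwise(ki, kj):
    """All distances between an element of cluster ki and an element of cluster kj."""
    return [dist(p, q) for p in ki for q in kj]


def single_link(ki, kj):
    return min(pairwise(ki, kj))  # smallest cross-cluster distance


def complete_link(ki, kj):
    return max(pairwise(ki, kj))  # largest cross-cluster distance


def average_link(ki, kj):
    d = pairwise(ki, kj)
    return sum(d) / len(d)  # mean of all cross-cluster distances


# Hypothetical clusters on a line; the cross-cluster distances are 4, 6, 3 and 5.
ki = [(0.0, 0.0), (1.0, 0.0)]
kj = [(4.0, 0.0), (6.0, 0.0)]

print(single_link(ki, kj))    # 3.0
print(complete_link(ki, kj))  # 6.0
print(average_link(ki, kj))   # 4.5
```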
12Functionally significant gene clusters
[Figure: two-way clustering, showing sample clusters and gene clusters]
13Outline
- Hierarchical Clustering
- Hybrid Hierarchical K-means Clustering
- DBSCAN
14Motivation
- Among clustering algorithms, hierarchical and K-means clustering are the two most popular and classic methods. However, both have inherent disadvantages.
- K-means clustering requires the number of clusters to be specified in advance and chooses its initial centroids randomly; in other words, you don't know how to start.
- With hierarchical clustering, it is hard to find a place to cut.
15Hybrid Hierarchical K-means Clustering (HHK) Algorithm
- The basic idea is to cluster roughly half of the data with hierarchical clustering and then let K-means take over for the remainder.
- In order to generate super-rules, we let hierarchical clustering terminate when it generates the largest number of clusters.
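One possible reading of the two steps above is sketched below. It is an interpretation, not the author's implementation: it assumes single-link merging, assumes that "the largest number of clusters" means the level with the most clusters containing more than one point, and assumes that K-means is then run over all points using the centroids of those clusters as its starting centers. The function names `hierarchical_seeds`, `kmeans`, and `hhk`, and the sample data, are hypothetical.

```python
from itertools import combinations
from math import dist


def centroid(cluster):
    """Coordinate-wise mean of the points in a cluster."""
    return tuple(sum(xs) / len(cluster) for xs in zip(*cluster))


def hierarchical_seeds(points):
    """Single-link merging; return the multi-point clusters of the level that has the most of them."""
    clusters = [[p] for p in points]
    best = []
    while len(clusters) > 1:
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: min(
                dist(a, b) for a in clusters[ij[0]] for b in clusters[ij[1]]
            ),
        )
        merged = clusters[i] + clusters[j]
        clusters = [c for n, c in enumerate(clusters) if n not in (i, j)] + [merged]
        multi = [c for c in clusters if len(c) > 1]
        if len(multi) > len(best):  # strict '>' keeps the first level that reaches the maximum
            best = [list(c) for c in multi]
    return best


def kmeans(points, centers, max_iter=100):
    """Plain Lloyd iterations starting from the given centers (no random initialization)."""
    for _ in range(max_iter):
        groups = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda c: dist(p, centers[c]))
            groups[nearest].append(p)
        # An empty group keeps its previous center.
        new_centers = [centroid(g) if g else centers[n] for n, g in enumerate(groups)]
        if new_centers == centers:
            break
        centers = new_centers
    return centers, groups


def hhk(points):
    """Hierarchical stage picks k and the starting centers; K-means assigns every point."""
    seeds = hierarchical_seeds(points)
    return kmeans(points, [centroid(c) for c in seeds])


# Hypothetical data: three visually separate groups.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.5),
          (8.0, 8.0), (8.0, 9.0), (9.0, 8.5),
          (20.0, 0.0), (20.0, 1.0)]
centers, groups = hhk(points)
print(len(centers), "clusters with sizes", [len(g) for g in groups])
```

In this reading, the hierarchical stage supplies both the number of clusters and the starting centroids, which are the two inputs that plain K-means would otherwise need to be given or pick at random.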
16Hybrid Hierarchical K-means Clustering (HHK)
Algorithm
17Hybrid Hierarchical K-means Clustering (HHK)
Algorithm Example