Title: Clustering Algorithms Meta Applier (CAMA) Toolbox
1Clustering Algorithms Meta Applier (CAMA) Toolbox
Dmitry S. Shalymov Kirill S. Skrygan Dmitry A.
Lyubimov
2Clustering
- Goals
- To detect the underlying structure in data
- To reduce data set capacity
- To extract unique objects
- Usage
- Data mining
- Machine learning
- Financial mathematics
- Optimization
- Statistics
- Pattern recognition
- Control strategies development
-
SYRCoSE09
3Clustering Problem
Clustering and Classification
SYRCoSE09
4Variety of Clustering Algorithms
- Hierarchical
- Aglomerative
- Partitioning
- Iterative
- Hard (K-means, SVM, SPSA)
- Fuzzy (FCM)
- Important parameters
- -Distance norm
- -Number of clusters
- -Initial values of cluster centers
SYRCoSE09
5Cluster Stability Algorithms
- Indexes
- Stability (similarity, merit) functions
- Probabilistic measures assessing the likelihood
of a decision - Density estimation approaches
SYRCoSE09
6Stochastic Approximation
Recursive stochastic approximation
FDSA
SPSA
SYRCoSE09
7SYRCoSE09
8Effectiveness of SPSA
SYRCoSE09
9Finding the number of clusters in data set
- Run the SPSA algorithm for different numbers of
clusters, K, and calculate the corresponding
distortions - Select a transformation power, Y
- Calculate the jumps in transformed distortion
- Estimate the number of clusters in the data set
by
SYRCoSE09
10Structure of data set detection
SYRCoSE09
11Examples
- Iris (3 clusters, 4 features, 150 instances)
- Wine (3 clusters, 13 features, 178 instances)
- Breast Cancer (2 clusters, 32 features, 569
instances) - Image Segmentation (7 clusters, 19 features, 2310
instances)
SYRCoSE09
12Software Tools for Clustering Analysis
- Research
- COMPACT
- DCPR (Data Clustering Pattern Recognition)
- FCDA (Fuzzy Clustering and Data Analysis Toolbox)
- ClusterPack Matlab Toolbox
- The Curve Clustering Toolbox
- SOM (Self-Organizing Map)
- Spectral Clustering Toolbox
- Yashil's FCM Clustering
- License software
- SPSS
- STATISTICA
- Characteristics
- Visualization
- Efectiveness analysis with patterns
- Tools to check performance
SYRCoSE09
13Clustering Algorithms Meta Applier
SYRCoSE09
14Clustering Algorithms Meta Applier
SYRCoSE09
15CAMA. Kernel
SYRCoSE09
16CAMA. Kernel
SYRCoSE09
17CAMA Toolboxhttp//ancient.punklan.net8084/CAMA2
/index.jsp
SYRCoSE09
18CAMA Toolbox
SYRCoSE09
19CAMA Toolbox
SYRCoSE09
20Thank you!
SYRCoSE09