Indexing Fingerprint Data with Clustering Approach - PowerPoint PPT Presentation

1 / 21

About This Presentation

Title:

Indexing Fingerprint Data with Clustering Approach

Description:

not only MaxSignal values. Basic Approach. Assigning one location into multiple clusters ... Two-step approach. Finding appropriate cluster. Search location ... – PowerPoint PPT presentation

Number of Views:43

Avg rating:3.0/5.0

Slides: 22

Provided by: woose

Category:

more less

Transcript and Presenter's Notes

Title: Indexing Fingerprint Data with Clustering Approach

1
Indexing Fingerprint Data with Clustering Approach

Wooseok Ryu
Database Lab.
September 22, 2007

2
Summary of Previous Seminar

2 approaches for indexing fingerprint data when
the number of Access Point is many
Multi-Dimensional Approach
Description
Assign dimension to each Access Point
The number of dimension is same as the number of
AP
Problem
Curse of dimensionality
Set of 1-D Index Approach
Description
Maintaining one 1-D Index per Access Point
Merge result of each index
Problem
Not faster than M-D Index Approach

3
Todays Topic

Comments at Previous Seminar
How to reduce dimensions in the Multi-Dimensional
Approach?
Index Pruning Approach
Clustering Approach
Todays Topic
Review of Clustering Approach
Problem of Previous Approach
Basic Idea for Clustering

4
Clustering Approach

Clustering
To reduce the computational cost of searching the
radio map
A set of locations sharing a common set of access
points

5
Related Work Clustering

Clustering at the offline phase
Group locations into clusters according to access
points that cover the locations
For a given location l, use the set of the q
strongest access points covering this location as
the cluster key
Use MaxSignal value of each access points
Do not use weak access points

WLAN Location Determination via Clustering and
Probability Distributions Youssef, Agrawala,
Shankar (PerCom 2003)
6
Related Work Clustering

Online Location Determination Phase
Determine cluster to search
Use q strongest access points to determine one
cluster
Find most probable location within a cluster
Use Bayes theorem to estimate the probability of
each location within the cluster
Compute each location one by one and choose one
with highest joint probability

7
Related Work Clustering

Experimental Testbed
Total 110 locations along the corridors (5 feet
apart)
Each location is covered averagely by 4 access
points

8
Related Work Clustering

Experimental Result
In the offline phase, number of clusters is 15
when q is 3
Average cluster size is about 6 7.
Clustering reduces the average number of
operations per location estimate by more than an
order of magnitude

9
Problem of Related Work

Problem of using only MaxSignal
Observed vector can fall into wrong cluster
Because of the characteristics of probability
distribution
Simply using q strongest access points to
determine one cluster cannot guarantee the
accuracy

Example of off-line Clustering
Example of on-line location determination
C1 AP1, AP2, AP3
C2 AP1, AP2, AP4
AP1
AP2
L1 L2 L3 L4 L5 L6 L7 L8
L9 L10 L11 L12 L13 L14 L15 L16
L17 L18 L19 L20 L21 L22 L23 L24
AP3
AP4
C3 AP1, AP3, AP4
C4 AP2, AP3, AP4
10
Problem of Related Work

Summary
When choosing q strongest access points at each
location
Using just MaxSignal value degrades accuracy
How to solve the problem?
we must consider probability distribution
not only MaxSignal values

11
Basic Approach

Assigning one location into multiple clusters
Compute possible combination of q strongest
access points from one locations

3 strongest AP based on Max Signal ? AP1, AP2,
AP3
P
0.5
Possible cases when q is 3
0.4
0.3
0.2
0.1
AP1
AP3
SS at Location 9
-90
-80
-70
-60
-50
-40
-30
-100
AP4
AP2
12
Basic Approach

Multiple Clustering Example
assign each location into multiple clusters
Consider all possible combinations

C1 AP1, AP2, AP3
C2 AP1, AP2, AP4
AP1
AP2
L1 L2 L3 L4 L5 L6 L7 L8
L9 L10 L11 L12 L13 L14 L15 L16
L17 L18 L19 L20 L21 L22 L23 L24
AP3
AP4
Location 9 is assigned to cluster C1, C2, C3
C3 AP1, AP3, AP4
C4 AP2, AP3, AP4
13
Problem Domain

Processing Real-Time Phase
1. find cluster that contains q strong access
points
2. compute probability distribution of each
location in the cluster

1. Find matching cluster for received signal
-45, -56, -65, -80
2. In a selected cluster, find best matching
location for signal -45, -56, -65, -80 ?
14
First Step Finding matching cluster

Naïve Approach
Sequential scan
Performance degrades when the number of cluster
is many

Received Signal AP1 -45, AP2 -56, AP3
-65, AP4 -80 3 strongest access points
AP1, AP2, AP3
15
First Step Finding matching cluster

Indexing Cluster Key
Assign each Access Point to unique number as
following
20, 21, 22, , 2k (k is the number of all access
points)
Example
AP1 20
AP2 21
AP3 22
AP4 23
Unique number can be assigned to each cluster

16
First Step Finding matching cluster

Indexing Cluster Key is same as indexing Cluster
Number
We can use well-known 1-D index structure
B-Tree for Disk Index
T-Tree for Main Memory Index

11, C2
7, C1
14, C3
13, C4
Index Representation using Binary Tree
17
Second Step Search a best matching location

Search best matching location in a cluster
Use q strongest access points, model q1
dimensional space

Joint Probability
1
0
-100
0
Signal Strength of AP1
-100
0
Signal Strength of AP2
Example value of q is 2, cluster key is AP1,
AP2
18
Second Step Search a best matching location