Title: Robust Similarity Measures for Mobile Object Trajectories
1Robust Similarity Measures for Mobile Object
Trajectories
-
- Michalis Vlachos (UCR), Dimitrios Gunopulos
(UCR), George Kollios (BU)
2Introduction
- Problem Discover similar trajectories of moving
objects - Examples
- Features extracted from video-clips
- Animal Mobility Experiments (GPS data)
- Sign Language Recognition, etc.
3Applications Requirements
Classification
Clustering
- What do we need?
- Similarity Measure (robust to noise)
- Indexing Scheme
4Outline
- Related Work (Euclidean Distance, Time Warping)
- Extension of LCSS model to 2d trajectories
- Algorithms for Computing the new similarity model
- Flexible Sigmoidal Matching
- Comparison with Lp-Norms and DTW distance
5Related Work Euclidean Distance
- LpNorm LP(S(xi-yi)p)1/p
- L2 Euclidean Distance
- L1 Manhattan Distance
- Disadvantages
- Small Robustness to outliers
- Sensitive to time axis displacement
- Does not support variable lengths
6Related Work DTW
- Time Warping
- Allows stretching in time axis
- Difficult Indexing
- Disadvantages
- Computationally intensive, O(nm)
- Has to match ALL elements
7Requirements for new Similarity Model (1)
We need to address the following issues
- Different Sampling Rates or Different Speeds
8Requirements for new Similarity Model (2)
We need to address the following issues
- Similar Motions in different space Regions
9Requirements for new Similarity Model (3)
We need to address the following issues
Non Recoverable Part
Noise Everywhere
Random Peaks
10Longest Common Subsequence (LCSS)
- Dynamic Programming Solution
- Arithmetic Example
- t10, 4, 6, 8, 7, 4, 6, 5, 6, 4, 6
- t20, 3, 4, 6, 7, 6, 3, 6, 4, 6
11Extending LCSS (1)
We extend the LCSS to 2-dimensions and add more
flexibility
Similarity of 2 seq/s with length n m
12Extending LCSS Example
2e
- Rigid matching
- Points marginally outside matching region are
ignored - Set parameter epsilon
13Extending LCSS Flexible Matching
14Sigmoidal Matching
15Computation Algorithms for new models (S1)
- Computing Similarity S1
- Lemma 1 Given two trajectories A and B, with
An and Bm, we can find the SigmoidSimd(?,?)
in O(d(nm)) time
16 Extending LCSS (2)
- S1 cannot detect parallel movements,
Time
B
Y
X
- S2 can detect parallel movements
- Better accuracy than simple normalization
- Distance D1 1-S1 distance D2 1-S2
17Exact Algorithm for similarity function S2
- For trajectories A, B with length n we want to
find - translation fc,d that maximizes SigmoidSim
between A and fc,d (B) - Not infinite translations.
- Each dimension separately
- A translation in 1D fc(bi) bi c (line with
slope 1) - fc(bi) will allow bi to be matched to all aj
i-jltd ai-e fc(bi) (bi, aje)
- Transform into a stabbing problem
Translations O(d2n2) LCSS O(dn) Total
O(d3n3)
yx2
yx
18Approximate Algorithm for similarity function S2
- A translation corresponds to a line fc(x) xc.
- Sort translations by c ? THEY DIFFER IN HOW MANY
SEGMENTS?
- If we can afford to be within ß of max(Sim) ? we
can afford to lose ßn elements
- Dont take all translations ? we can examine
every ßn translations each time - So, if we examine every ßn, we lose at most ßn
elements (1D) - So, for 2D, we can skip every ßn/2 translations
19Approximate Algorithm for similarity function S2
Theorem Given two trajectories A and B, with A
n and Bn, and a constant 0ltßlt1, we can find
an approximation AS2d,ß(A,B) of the similarity
S2(d,e,A,B) such that S2(d,e,A,B) - AS2d,ß(A,B) lt
ß in O(nd3/ ß2) time.
20Approximate Algorithm for similarity function S2
(cont/d)
Theorem Given two trajectories A and B, with A
n and Bn, and a constant 0ltßlt1, we can find
an approximation AS2d,ß(A,B) of the similarity
S2(e,A,B) such that S2(d,e,A,B) - AS2d,ß(A,B) lt
aß in O(nd3/ ß2) time, for a constant a.
21Clustering Accuracy
- Datasets
- MobileLong
- MobileShort
- MobileShort Noise
Test clustering accuracy using Hierarchical
Clustering
C1
C2
C3
C4
C5
22Clustering Accuracy
- LpNorm LP(S(xi-yi)p)1/p
- DTW Lp min((Head(A), B),
(A,Head(B)), (Head(A), Head(B))) - SigmoidSim without translation
23Clustering Accuracy (MobileLong)
- Number of Correct Clusterings out of 10
24Clustering Accuracy (MobileShort)
- Number of Correct Clusterings out of 21
25Clustering Accuracy (MobileShort Noise)
- Number of Correct Clusterings out of 21
26Conclusions, Future Work
- Sigmoid Similarity provides best results under
noise - Optimal translation can be found
- Approximate solutions with provable performance
bounds
- FUTURE WORK
- Improve LCSS performance
- Trajectory Segmentation
- Add Scaling Rotation
27(No Transcript)