Title: Change Detection in Data Streams by Testing Exchangeability
1Change Detection in Data Streams by Testing
Exchangeability
- Shen-Shyang Ho
- JPL/Caltech
The research is part of the authors PhD
dissertation (in computer science) at George
Mason University Conference travel is partially
sponsored by NASA Postdoctoral Program (NPP)
Travel Grant.
2Outline
- Introduction
- Previous Work (Statistics and Machine
Learning/Data Mining/Computer Vision) - Intuition
- Background (Exchangeability/Martingale)
- Methodology
- Comparison and Experimental Results
- Application I Adaptive Support Vector Machine
(Classification Model) - Application II Video Shot Change Detection
(Cluster Model)
3Introduction
Assumption Data vectors are observed
sequentially.
4Introduction
5Previous Work
- Statistics - Sequential Analysis is statistical
inference with the assumption that the number of
observations/samples required is not
pre-determined. - Sequential Probability Ratio Test A. Wald
(1945) - Application Quality Control (Military/Manufactur
ing) - CUSUM (Cumulative Sum) E. S. Page (1954)
- Refer to Sequential Analysis Design Methods
and Applications Journal for recent research. - Most recent issue (vol 27, no 2, 2008) papers
on structural change/minimax method for
change-point detection problems/multidecision
quickest change-point detection 3 out of 6
papers.
- Machine Learning/Data Mining
- Applications Concept Drift Problem, Adaptive
classifier, Anomaly in Internet Traffic,
Video-shot change detection - Proposed methodology is usually problem-specific
- Monitoring error, sliding window, weighted data,
ensemble classifier - Statistical method Likelihood ratio method,
Bayesian methods, Hypothesis Testing
6Related Data Mining/Machine Learning/Computer
Vision Research
- Xiuyao Song, Mingxi Wu, Christopher M. Jermaine,
Sanjay Ranka Statistical change detection for
multi-dimensional data. KDD 2007 667-676 - Kolter, J.Z. and Maloof, M.A. Dynamic Weighted
Majority An ensemble method for drifting
concepts. Journal of Machine Learning Research
82755--2790, 2007. - Klinkenberg, Ralf and Joachims, Thorsten
Detecting Concept Drift with Support Vector
Machines. Proceedings of the Seventeenth
International Conference on Machine Learning
(ICML) 487--494, 2000. - Bi Song, Namrata Vaswani, Amit K. Roy Chowdhury
Closed-Loop Tracking and Change Detection in
Multi-Activity Sequences. CVPR 2007 - Paul L. Rosin Thresholding for Change Detection.
ICCV 1998 274-279 - Balachander Krishnamurthy, Subhabrata Sen, Yin
Zhang, Yan Chen Sketch-based change detection
methods, evaluation, and applications. Internet
Measurement Conference 2003 234-247 - Tsuyoshi Idé, Keisuke Inoue Knowledge Discovery
from Heterogeneous Dynamic Systems using
Change-Point Correlations. SDM 2005 - Tsuyoshi Idé, Koji Tsuda Change-Point Detection
using Krylov Subspace Learning. SDM 2007 - Daniel Kifer, Shai Ben-David, Johannes Gehrke,
Detecting Changes in Data Streams, Proc. 30th
VLDB Conference, 2004. - ...
7Motivation
Lack of Exchangeability implies Change in Data
Distribution/Model
3/29/2016
7
8Intuition
9Background
- Vovk et als work on Testing Exchangeability
Online (ICML 2003) and Algorithmic Learning in
a random world (Springer) - - Testing exchangeability assumption in an online
mode. - Explicit Martingale for testing the hypothesis of
exchangeability
(Refer to http//www.vovk.net (conformal
prediction) )
3/29/2016
9
10Background
3/29/2016
10
11Background
12Methodology - Strangeness
- Strangeness measures how well one data point (for
each data point seen so far) is represented by a
data model compared to other points - Applicable to classification, regression or
cluster model - measure diversity / disagreements, i.e. the
higher the strangeness of a point, the less
likely it comes from the model
Condition for a valid strangeness measure A
strangeness value of a data point at a particular
time instance should be independent of the order
it is observed with respect to the other data
points.
13Classification Model
Strangeness (SVM) Lagrange Multiplier
14Classification Model
Strangeness (SVM) Lagrange Multiplier
3/29/2016
15Cluster Model
16Regression Model
(Papadopoulos et al., Inductive Confidence
Machines for Regression, ECML, LNAI 2430, pp
345-356, 2002)
17Methodology
18Methodology
19Methodology
20Methodology
21Methodology
3/29/2016
21
22Experimental Result Performance Measure
3/29/2016
22
23Experimental Result Varying
3/29/2016
23
24Experimental Result Varying Strangeness
25Experimental Result Varying
26Experimental Result
3/29/2016
26
27Experimental Result
3/29/2016
27
28Experimental Result Different Methods
29Application Adaptive SVM
30Application Adaptive SVM
Simulated USPS 3-Digit Image Data Stream
t
01120120034003340415655611577789987
3/29/2016
30
31Application Adaptive SVM
A (blue) True Change Point Known to the
SVM B(red) Adaptive SVM using martingale
method C(magenta) SVM using sliding window of
size 250 D(black) SVM using sliding window of
size 500 E(green) SVM using sliding window of
size 1000
32Application Video-Shot Change Detection
Martingale Change Detection using multiple
features (MVMT Multiple-view martingale test)
33Application Video-Shot Change Detection
- HI Histogram Intersection
- Chi-Square Measure
- Euclidean Distance (ED)
3/29/2016
33
34Reference
- S.-S. Ho and H. Wechsler, Detecting Change-Points
in Unlabeled Data Streams using Martingale, Proc.
20th Int. Joint. Conf. Artificial Intelligence
(IJCAI 2007), Hyderabad, India, Jan. 6 - 12,
2007. - S-S Ho, A Martingale Framework for Concept Change
Detection in Time-Varying Data Streams, Proc Int.
Conf. on Machine Learning (ICML 2005), Bonn,
Germany, Aug. 7 - 11, 2005 - S-S Ho and H. Wechsler, Adaptive Support Vector
Machine for Time-Varying Data streams Using the
Martingale, Proc. Int. Joint Conf. on Artificial
Intelligence (IJCAI 2005), Edinburgh, Scotland,
July 30 - Aug. 5, 2005 - S-S Ho and H. Wechsler, On the detection of
concept change in time-varying data streams by
testing exchangeability, Proc. Conference on
Uncertainty in Artificial Intelligence (UAI
2005), Edinburgh, Scotland, July 26 - 29, 2005 - http//shenshyang.googlepages.com/codes (matlab
codes datasets)
3/29/2016
34
35Acknowledgement
- Harry Wechsler, PhD Advisor (George Mason
University) - Volodya Vovk, (Royal Holloway, University of
London) - Alexander Gammerman (Royal Holloway, University
of London) - Oak Ridge Associated University (ORAU)
3/29/2016
35