Title: Dynamic%20Visualization%20of%20Transient%20Data%20Streams
1Dynamic Visualization of Transient Data Streams
- P. Wong, et al
- The Pacific Northwest National Laboratory
- Presented by John Sharko
- Visualization of Massive Datasets
2Characteristics of Data Streams
- Arrives continuously
- Arrives unpredictably
- Arrives unboundedly
- Arrives without persistent patterns
3Examples of Data Streams
- Newswires
- Internet click streams
- Network resource management
- Phone call records
- Remote sensing imagery
4Visualization Problem
- Fusing a large amount of previously analyzed
information with a small amount of new
information - Reprocess the whole dataset in full detail
5First Objective
- Achieve the best understanding of transient data
when influx rate exceed processing rate - Approach Data stratification to reduce data size
6Second Objective
- Incremental visualization technique
- Approach Project new information incrementally
onto previous data
7Primary Visualization OutputMultidimensional
Scaling
OJ Simpson trial
Oklahoma bombing
French elections
8Adaptive Visualization Using Stratification
9Methods for Adaptive Visualization
- Vector dimension reduction
- Vector sampling
10Vector Dimension Reduction
- Approach dyadic wavelets (Haar)
200 terms
100 terms
50 terms
11Results of Vector Dimension Reduction
50
200
100
Dimensions
12Results of Vector Sampling
3298
824
1649
Number of Documents
13Scatterplot Similarity Matching
14Scatterplot Similarity Matching
- Procrustes Analysis Results
200 100 50
All 0.0 (self) 0.022 0.084
1/2 0.016 0.051 0.111
1/4 0.033 0.062 0.141
15Incremental Visualization Using Fusion
- Reprocessing by projecting new items onto
existing visualization - Feature reprocessing the entire dataset is often
not required
16Hyperspectral Image Processing
- Apply MDS to scale pixel vectors
- K-mean process to assign unique colors
- Stratify the vectors progressively
17Robust Eigenvectors
- Generate three MDS scatter plots for each third
of the image
18Robust Eigenvectors (contd)
- Generate MDS scatterplot for entire dataset
19Robust Eigenvectors (contd)
- Extract points from cropped areas
20Using Multiple Sliding Windows
Sliding Direction
Data Stream
Long Window
Short Window
- Eigenvectors determined by the long window
- New vectors are projected using the Eigenvectors
of the long window
21Dynamic Visualization Steps
- 1. When influx rate lt processing rate, use MDS
- 2. When influx rate gt processing rate, halt MDS
- 3. Use multiple sliding windows for pre-defined
number of steps - 4. Use stratification approach for fast overview
- 5. Check for accumulated error using Procrustes
analysis - 6. If error threshold not reached, go to step 3
- If error threshold reached, go to step 1
22Conclusions
- The data stratification approach can
substantially accelerate visualization process - The data fusion approach can provide instant
updates
23(No Transcript)