Marti Hearst - PowerPoint PPT Presentation

About This Presentation
Title:

Marti Hearst

Description:

SIMS 247 Lecture 4 Graphing Multivariate Information January 29, 1998 – PowerPoint PPT presentation

Number of Views:119
Avg rating:3.0/5.0
Slides: 29
Provided by: hear54
Category:

less

Transcript and Presenter's Notes

Title: Marti Hearst


1
SIMS 247 Lecture 4Graphing Multivariate
Information
  • January 29, 1998

2
Follow-up previous lecture
  • Docuverse
  • length of arc is proportional to
  • number of subdirectories
  • radius for a given arc is long enough to contain
    marks for all the files in the directory
  • Nightingales coxcomb
  • keep arc length constant
  • vary radius length (proportional to sqrt(freq))

3
Today Multivariate Information
  • We see a 3D world
  • How do we handle more than 3 variables?
  • multi-functioning elements
  • Tufte examples
  • cinematography example
  • multiple views

4
Example Data Sets
  • How do we handle 9 variables?
  • Our web access dataset
  • Factors involved in alcoholism
  • ALCOHOL
  • USE
  • AVAILABILITY
  • CONCERN ABOUT USE
  • COPING MECHANISMS
  • PERSONALITY MEASURES
  • EXTROVERSION
  • DISINHIBITION
  • OTHER
  • GENDER
  • GPA

5
Graphing Multivariate Information
  • How do we handle cases with more than three
    variables?
  • Scatterplot matrices
  • Parallel coordinates
  • Multiple views
  • Overlay space and time
  • Interaction/animation across time

6
Multiple Variables Scatterplot Matrices(from
Wegman et al.)
7
Multiple Variables Scatterplot Matrices(from
Schall 95)
8
Multiple Views Star Plot(Discussed in Feinberg
79. Works better with animation. Example taken
from Behrans Yu 95.)
9
Multiple Dimensions Parallel Coordinates(earthqu
ake data, color indicates longitude, y axis
severity of earthquake, from Schall 95)
10
Multiple Dimensions Multivariate Star Plot(from
Behran Yu 95)
11
Chernoff Faces
  • Assumption people have built-in face recognizers
  • Map variables to features of a cartoon face
  • Example eyes
  • location, separation, angle, shape, width
  • Example entire face
  • area, shape, nose length, mouth location, smile
    curve
  • Originally tongue-in-cheek, but taken seriously
  • Sometimes seems to work for small numbers of
    points

12
Chernoff Example (Marchette)
  • Three groups of points
  • each drawn from a different distribution with 5
    variables
  • First show scatter-plot matrix
  • Then graph with Chernoff faces
  • vary faces overall
  • vary eyes
  • vary mouth and eyebrows
  • Which seems to be most effective?

13
Chernoff Experiment (Marchette)
14
Chernoff Experiment (Marchette)
15
Chernoff Experiment (Marchette)
16
Chernoff Experiment (Marchette)
17
Overlaying Space and Time(Minards graph of
Napoleans march through Russia)
18
A Detective Story(Inselberg 97)
  • Domain Manufacture of computer chips
  • Objectives create batches with
  • high yield (X1)
  • high quality (X2)
  • Hypothesized cause of problem
  • 9 types of defects (X3-X12)
  • Some physical properties (X13-X16)
  • Approach
  • examine data for 473 batches
  • use interactive parallel coordinates

19
Multidimensional Detective
  • Long term objectives
  • high quality, high yield
  • Logical approach given the hypothesis
  • try to eliminate defects
  • First clue
  • what patterns can be found among batches with
    high yield and quality?

20
Detectives arent intimidated!
X1 seems to be normally distributed X2 bipolar
21
High quality yields obtained despite defects
good batches
X15 breaks into two clusters (important physical p
roperty)
some low X3 defect batches dont appear here
at least one good batch with defects
22
Low-defect batches are not highest quality!
few defects
low yield, low quality
23
Original plot shows defect X6 behaves
differently exclude it from the 9-out-of-10
defects constraint the best batches return
24
Isolate the best batches.Conclusion defects are
necessary!
The very best batch has X3 and X6 defects
Ensure this is not an outlier -- look at top few
batches. The same result is found.
25
How to graph web page traversals?
26
References for this Lecture
  • Visualization Techniques of Different Dimensions,
    John Behrens and Chong Ho Yu, 1995
    http//seamonkey.ed.asu.edu/behrens/asu/reports/c
    ompre/comp1.html
  • Feinberg, S. E. Graphical methods in statistics.
    American Statisticians, 33, 165-178, 1979
  • Friendly, Michael, Gallery of Data Visualization.
    http//www.math.yorku.ca/SCS/Gallery
  • scan of Minards graph from Tufte 1983
  • multivariate means comparison
  • Wegman, Edward J. and Luo, Qiang. High
    Dimensional Clustering Using Parallel Coordinates
    and the Grand Tour., Conference of the German
    Classification Society, Freiberg, Germany, 1996.
    http//galaxy.gmu.edu/papers/inter96.html
  • Cook, Dennis R and Weisberg, Sanford. An
    Introduction to Regression Graphics, 1995.
    http//stat.umn.edu/rcode/node3.html
  • Schall, Matthew. SPSS DIAMOND a visual
    exploratory data analysis tool. Perspective, 18
    (2), 1995. http//www.spss.com/cool/papers/diamon
    dw.html
  • Marchette, David. An Investigation of Chernoff
    Faces for High Dimensional Data Exploration.
    http//farside.nswc.navy.mil/CSI803/Dave/chern.htm
    l
  • Chernoff, H. The use of Faces to Represent
    Points in k-Dimensional Space Graphically.
    Journal of the American Statistical Association,
    68, 361-368, 1973.

27
Next Time Brushing and Linking
  • An interactive technique
  • Brushing
  • pick out some points from one viewpoint
  • see how this effects other viewpoints
  • (Cleveland scatterplot matrix example)
  • Graphs must be linked together

28
Brushing and Linking Systems
  • VISAGE Roth et. al
  • Attribute Explorer Tweedie et. al
  • SpotFire (IVEE) Ahlberg et. al
Write a Comment
User Comments (0)
About PowerShow.com