ACE:A Fast Multiscale Eigenvectors Computation for Drawing Huge Graphs

1 / 44
About This Presentation
Title:

ACE:A Fast Multiscale Eigenvectors Computation for Drawing Huge Graphs

Description:

ACE:A Fast Multiscale Eigenvectors Computation for Drawing Huge Graphs Yehunda Koren Liran Carmel David Harel –

Number of Views:104
Avg rating:3.0/5.0
Slides: 45
Provided by: momd
Category:

less

Transcript and Presenter's Notes

Title: ACE:A Fast Multiscale Eigenvectors Computation for Drawing Huge Graphs


1
ACEA Fast Multiscale Eigenvectors Computation
for Drawing Huge Graphs
  • Yehunda Koren
  • Liran Carmel
  • David Harel

2
Fast Graph Drawing Algorithm
  • Very large graphs
  • Millions of nodes in less than a minute
  • Finds optimal drawing by minimizing a quadratic
    energy function
  • Minimization generalized eigenvalue problem

3
Need for Speed
  • Drawing of graph -gt capture its nature
  • Most graph drawing methods- lengthy computation
    times
  • Fastest alg- require 10 min for 105-node graph
  • ACE- 10-20 sec for 105-node graph

4
Minimization
  • For commonly encountered graphs, in which all
    weights are non-negative, the Laplacian is
    positive semi-definite. ? Key observation that
    this provides for the feasibility of nice drawings

5
Minimization
  • Drawing expect nodes connected by large positive
    weights to be close together, and those connected
    by large negative weights to be far away from
    each other

6
ACE algorithm
  • Technique common to algebraic multigrid (AMG)
    algorithm
  • Express as originally high-dimensional problem in
    lower and lower dimensions using coarsening
  • Problem solved exactly then starts a refinement
    process- the solution is progressively projected
    back into higher and higher dimensions until
    original problem is reproduced

7
ACE algorithm
8
Coarsening
  • Let G be a PSD graph with n nodes
  • Replaces G with another PSD graph Gc containing m
    lt n nodes approximating the same entity on
    different scales
  • Interpolation matrix- tool that linkes G
    and Gc

9
Coarsening
  • Given a fine n-node PSD graph, and an nxm
    interpolation matrix, take the coarse m-node PSD
    graph to be Gc (Lc,Mc)

10
Coarsening
  • Interpolation Matrix
  • Retain structure
  • Easy to compute in running time
  • The sparser the better
  • Types
  • Edge Contraction Interpolation
  • Weighted Interpolation

11
Interpolation Types
  • Edge Contracting Interpolation
  • Interpolate each node of the finer graph from
    exactly one coarse node
  • 1. Divide the nodes of fine graph into small
    disjoint connected subsets
  • 2. Associate the members of each subset with
    single coarse node
  • Sparse and easy to compute
  • Only takes into consideration the strongest
    connection of nodes

12
Interpolation Types
  • Weighted Interpolation
  • Interpolate each node of the fine graph from
    possibly several coarse nodes
  • 1. Choose a subset of m nodes out of n nodes of G
    representatives
  • 2. Fix the coordinates of the representatives by
    their values
  • 3. Determine the coordinates of the
    non-representatives as a weighted sum of
    neighboring representatives

13
Running Time
  • The Sparser the interpolation matrix, the faster
    a single iteration is
  • The more accurate the interpolation matrix, the
    less iterations are required
  • Edge Contracting fastest structure of
    homogeneous graphs
  • Weighted fastest structure of non-homogeneous
    graphs

14
Coarsening cont.
15
Coarsening cont.
16
Coarsening cont.
  • Given Gc, the generalized eigenprojection problem
    and the original problem in the same dimension
    is (substituted x Ay)

17
Refinement
  • Keep coarsening until couse PSD graph is
    sufficiently small lt 100 nodes
  • The drawing of G(L,M) is obtained from lowest
    positive generalized eigenvectors of (L,M)
  • Since M is diagonal, the problem can be an
    eigenvalue problem Bv µv and since B is small,
    this takes fraction of total running time
  • Use power iteration algorithm to find the largest
    eigenvalues of symmetric matrices but reverse
    since we need to find the lowest not the largest

18
Examples
  • Graphs 105 nodes - few sec
  • Graphs 106 nodes 10-20 sec
  • Graphs 7.5106 nodes 2 min

19
(No Transcript)
20
Examples
  • Electronic Noses chemical devices to quantify
    odors
  • Array of sensors that gives an odor-unique
    response pattern
  • N300 measurements
  • 30 types of odors
  • Odors in up-rt outliers
  • Diff clusteres of odors
  • separated

21
(No Transcript)
22
Multiscale Vizualization of Small World Networks
  • David Auber
  • Yves Chiricota
  • Fabien Jourdan
  • Guy Melancon

23
Problem
  • small world networks
  • Shown to be relevant models in study social
    networks, software reverse engineering, biology,
    etc.
  • Have a multiscale nature
  • viewed as a network of groups also small world
    networks

24
Small World Networks
  • First identified by Milgram studying the
    structure of social networks
  • Six degree of separation

25
Small World Networks
  • Defining Properties
  • Average path length
  • Same edges as random graph
  • Clustering index of nodes
  • Orders of magnitude larger than ave

26
Internet Movie Database
27
Interactive Visualization of Small World Networks
  • Common tasks Identify patterns in set of
    connections
  • Social groups collections of actors who are
    closely linked to one another
  • Social Positions sets of actors who are linked
    into the total social system in similar ways

28
Might not need
  • Use of small world properties of networks to
    support the visualization
  • Compute decomposition of small world network into
    highly connected components also small world

29
Technique
  • Compute clustering index for nodes and graph
  • Where c(v) is the clustering index of node v,
  • v is a node in the network,
  • r(N(v)) is the number of edges connecting
    neighbors of v ,
  • k is the size of vs neighborhood N(v)

30
Technique
  • Clustering index of graph G
  • Take the average
  • Where N is the number of nodes in G

31
Technique
  • Small world networks
  • High clustering index
  • Small average path length between nodes

32
Technique
  • Finding a clustering index of an edge
  • Denote
  • M(u) set of neighbors of u that are not
    neighbors of v
  • W(u,v) set of common neighbors to u and v
  • R(A,B) for the number of edges linking nodes in a
    set A to nodes in set B
  • S(A,B) r(A,B) / AB computes the proportion
    of existing edges among the set of all possible
    edges

33
Technique
  • Finding a clustering index of an edge
  • Given edge(u,v), strength is computed by dividing
    neighborhood of u or v into distinct subsets
  • Compute

34
Discard weak edges
  • Strength of edge measure of its contribution to
    the cohesion of the neighborhood
  • Filter out the weak
  • If strength of edge lt threshold
  • Maximal disconnected subgraphs corresponding to
    clustering of initial graph
  • Quotient graph subgraphs as vertices

35
Visual Coherence
  • Force-directed algorithm
  • naturally embeds neighbor nodes close to one
    another
  • Not distinguish between edges
  • Induce same attractive force for all edges of the
    graph avoid be assigning weights
  • Quotient graph laid out using Force-directed
    algorithm
  • Weaker edges replaces by single edge

36

37
Navigational Coherence
  • Selected component showed in same manner as
    overview panel
  • Left click display of overview
  • Middle click show same component unfolded as a
    flat graph
  • Wheel zoom in/out

38
Examples
  • Resyn Assistan API
  • Software designed for the study of chemical
    components developed in Montpellier (LIRMM)
  • IMDB subset
  • Central group renowned movie actors
  • Periphery movies

39
Threshold and Quality Measurements
  • Threshold choice determined by good partitioning
    from filtering
  • Quality of a partition comput how close a
    partition is to a partion with optimal cost

40
Threshold and Quality Measurements
  • Cost function
  • Where C clustering of graph G
  • First term mean value of edge density inside
    each cluster
  • Second term mean value of edge density between
    clusters

41
Quality
  • Optimal threshold approaches 1.6 for MQ close to
    0.80

42
MQ scores (really high)
43
Partitioning
  • Interactive approach to choosing threshold might
    be an alternative
  • Clustering techniques aim at finding partitions
    that have a minimal number of outer edges
  • Immediate visual feedback provided
  • Computing time still low
  • Method able to find partitions with a high MQ
    value

44
(No Transcript)
Write a Comment
User Comments (0)
About PowerShow.com