Title: Scaling, renormalization and self-similarity in complex networks
1Scaling, renormalization and self-similarity in
complex networks
Chaoming Song (CCNY) Lazaros Gallos (CCNY) Shlomo
Havlin (Bar-Ilan, Israel)
Protein interaction network
2Are scale-free networks really free-of-scale?
If you had asked me yesterday, I would have said
surely not - said Barabasi. (Science News,
February 2, 2005).
Small World effect shows that distance between
nodes grows logarithmically with N (the network
size) OR Self-similar fractal topology is
defined by a power-law relation
Small world contradicts self-similarity!!!
How the network behaves under a scale
transformation.
3WWW
nd.edu
300,000 web-pages
4Internet
Faloutsos et al., SIGCOMM 99
Internet connectivity, with selected backbone
ISPs (Internet Service Provider) colored
separately.
5Yeast Protein-Protein Interaction Map
Individual proteins Physical interactions from
the filtered yeast interactome database 2493
high-confidence interactions observed by at least
two methods (yeast two-hybrid). 1379 proteins,
ltkgt 3.6
J. Han et al., Nature (2004)
Modular structure according to function!
Colored according to protein function in the
cell Transcription, Translation, Transcription
control, Protein-fate, Genome maintenance,
Metabolism, Unknown, etc
from MIPS database, mips.gsf.de
6Metabolic network of biochemical reactions in
E.coli
Chemical substrates Biochemical interactions
enzyme-catalyzed reactions that transform one
metabolite into another.
J. Jeong, et al., Nature, 407 651 (2000)
Modular structure according to the
biochemical class of the metabolic products of
the organism.
Colored according to product class Lipids,
essential elements, protein, peptides and amino
acids, coenzymes and prosthetic groups,
carbohydrates, nucleotides and nucleic acids.
7How long is the coastline of Norway?
It depends on the length of your ruler.
Fractals look the same on all scales
scale-invariant.
Box length
Fractal Dimension dB- Box Covering Method
Total no. of boxes
8Boxing in Biology
Boxing in Biology
How to zoom out of a complex network?
- Generate boxes where all
- nodes are within a distance
- Calculate number of boxes, ,
- of size needed to cover the
- network
We need the minimum number of boxes NP-complete
optimization problem!
9Most efficient tiling of the network
8 node network Easy to solve
4 boxes
5 boxes
300,000 node network Mapping to graph colouring
problem. Greedy algorithm to find minimum boxes
10Larger distances need fewer boxes
1
2
-dB
fractal
log(NB)
3
non fractal
log(lB)
11Box covering in yeast protein interaction network
12Most complex networks are Fractal
Biological networks
Metabolic
Protein interaction
43 organisms - all scale
Three domains of life archaea, bacteria, eukaria
E. coli, H. sapiens, yeast
Song, Havlin, Makse, Nature (2005)
13Metabolic networks are fractals
14Technological and Social Networks TOO
WWW
Hollywood film actors
212,000 actors
Other bio networks Khang and Bremen groups
Internet is not fractal!
nd.edu domain
300,000 web-pages
15 Two ways to calculate fractal dimensions
Cluster growing method
Box covering method
In homogeneous systems (all nodes with similar k)
both definitions agree
percolation
16Box Covering flat average
Cluster Growing biased
exponential
power law
Different methods yield different results due to
heterogeneous topology
Box covering reveals the self similarity. Cluster
growth reveals the small world. NO
CONTRADICTION! SAME HUBS ARE USED MANY TIMES IN
CG.
17Is evolution of the yeast fractal?
present day
Other Fungi
Animals Plants
Archaea Bacteria
Yeast
300 million years ago
Ancestral yeast
Ancestral Fungus
Ancestral Eukaryote
1 billion years ago
Ancestral Prokaryote Cell
3.5 billion years ago
Following the phylogenetic tree of life
COG database von Mering, et al Nature (2002)
1.5 billion years ago
18Same fractal dimension and scale-free exponent
over 3.5 billion years
Suggests that present-day networks could have
been created following a self-similar, fractal
dynamics.
19Renormalization in Complex Networks
NOW, REGARD EACH BOX AS A SINGLE NODE AND ASK
WHAT IS THE DEGREE DISRIBUTION OF THE
NETWORK OF BOXES AT DIFFERENT SCALES ?
20Renormalization of WWW network with
21The degree distribution is invariant under
renormalization
Internet is not fractal dB--gt infinity But it is
renormalizable
22Turning back the time
Repeatedly BOXING the network is the same as
going back in time from a single
node to present day.
THE RENORMALIZATION SCHEME
renormalization
present day network
ancestral node
1
time evolution
Can we predict the past. ? if not the future.
23Evolution of complex networks
time evolution
24How does Modularity arise?
The boxes have a physical meaning self-similar
nested communities
How to identify communities in complex networks?
renormalization
present day network
ancestral node
1
time evolution
25Emergence of Modularity in PIN
Boxes are related to the biologically relevant
functional modules in the yeast protein
interactome
renormalization
time evolution
translation
transcription
protein-fate
cellular-fate organization
present day network
26Emergence of modularity in metabolic networks
Appearance of functional modules in E. coli
metabolic network. Most robust network than
non-fractals.
27Theoretical approach
How the communities/modules are linked?
renormalization
k2
k8
s1/4
k degree of the nodes
k degree of the communities
28Theoretical approach to modular networks Scaling
theory to the rescue
The larger the community the smaller their
connectivity
new exponent describing how families link
29Scaling relations
A theoretical prediction relating the different
exponents
distance
boxes
degree
30Scaling relations
The communities also follow a self-similar pattern
WWW
Metabolic
prediction
Scaling relation works
scale-free
fractals
communities/modules
31Why fractality?
Some real networks are not fractal
INTERNET
32What is the origin of self-similarity?
Can you see the difference?
FRACTAL
NON FRACTAL
Internet map
Yeast protein map
E.coli metabolic map
HINT the key to understand fractals is in the
degree correlations P(k1,k2) not in P(k)
33Quantifying correlations
P(k1,k2) Probability to find a node with k1
links connected with a node of k2 links
Internet map - non fractal
Metabolic map - fractal
high prob.
low prob.
log(k2)
log(k2)
P(k1,k2)
low prob.
high prob.
log(k1)
log(k1)
Hubs connected with hubs
Hubs connected with non-hubs
34Quantify anticorrelation between hubsat all
length scales
Hub-Hub Correlation function fraction of hub-hub
connections
hubs
Renormalize
hubs
Hubs connected directly
35Hub-hub connection organized in a self-similar way
non-fractal
The larger de implies more anticorrelation
fractal
(fractal) (non-fractal)
Anticorrelations are essential for fractal
structure
36What is the origin of self-similarity?
Non-fractal networks
Fractal networks
- very compact networks
- hubs connected with other hubs
- strong hub-hub attraction
- assortativity
- less compact networks
- hubs connected with non-hubs
- strong hub-hub repulsion
- dissasortativity
Internet All available models BA model,
hierarchical random scale free, JKK, etc
WWW, PIN, metabolic, genetic, neural networks,
some sociological networks
37How to model it? renormalization reverses
time evolution
Song, Havlin, Makse, Nature Physics, 2006
Both mass and degree increase exponentially with
time
time
offspring nodes attached to their parents
renormalize
(m2) in this case
Scale-free
Mode II
Mode I
38How does the length increase with time?
Mode I NONFRACTAL SMALL WORLD
Mode II FRACTAL
39Combine two modes together
Mode I with probability e Mode II with
probability 1-e
time
renormalize
e0.5
40Predictions
Model reproduces local small world, scale-free
and fractality
yeast
h.sapiens
- model with e0.2
- repulsion between hubs leads to fractal topology
- small world locally inside well defined
communities
- model with e1
- attraction between hubs
- non-fractal
- small world globally
41The model reproduces the main features of real
networks
Case 1 e 0.8 FRACTALS Case 2 e 1.0
NON-FRACTALS
42Model predicts all exponents in terms of growth
rates
Each step the total mass scales with a constant
n, all the degrees scale with a constant s. The
length scales with a constant a, we obtain
We predict the fractal exponents
43Time evolution in yeast network
44Multiplicative and exponential growth in yeast
PIN Length-scales, number of conserved proteins
and degree
45A new principle of network dynamics
less vulnerable to intentional attacks
46Summary
- In contrast to common belief, many real world
networks are self-similar. - FRACTALS WWW, Protein interactions, metabolic
networks, neural networks, collaboration
networks. - NON-FRACTALS Internet, all models.
- Communities/modules are self-similar, as well.
- Scaling theory describes the dynamical
evolution. - Boxes are related to the functional modules in
metabolic and protein networks. - Origin of self similarity anticorrelation
between hubs - Fractal networks are less vulnerable than
non-fractal networks
Positions available jamlab.org
47An finally, a model to put all this together
A multiplicative growth process of the number of
nodes and links
m 2
Analogous to duplication/divergence mechanism in
proteins??
Probability e hubs always connected strong hub
attraction should lead to non-fractal
Probability 1-e hubs never connected strong hub
repulsion should lead to fractal
48Different growth modes lead to different topologie
s
For the both models, each step the total number
of nodes scale as n 2m 1( N(t1) nN(t) ).
Now we investigate the transformation of the
lengths. They show quite different ways for this
two models as following
Mode I L(t1) L(t)2
Then we lead to two different scaling law of N L
smaller
smaller
Mode II L(t1) 2L(t)1
Mode III L(t1) 3L(t)
49Dynamical model
Suppose we have e probability to have mode I, 1-e
probability to have mode II and mode III. Then we
have
or
50Graph theoretical representation of a
metabolic network
(a) A pathway (catalyzed by Mg2-dependant
enzymes). (b) All interacting metabolites are
considered equally. (c) For many biological
applications it is useful to ignore co-factors,
such as the high energy-phosphate donor ATP,
which results in a second type of mapping that
connects only the main source metabolites to the
main products.
51Classes of genes in the yeast proteome
52Renormalization following the phylogenetic tree
P. Uetz, et al. Nature 403 (2000).