kavosh : a new algorithm for finding network motifs
DESCRIPTION
Kavosh : a new algorithm for finding network motifs. Jin Chen 2012 Fall Michigan State University. Motivation of this paper. It presents a new algorithm for finding size- k network motifs from a directed network with less memory and CPU time in comparison to other algorithms - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/1.jpg)
1
Kavosh: a new algorithm for finding network motifs
Jin Chen2012 Fall
Michigan State University
![Page 2: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/2.jpg)
![Page 3: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/3.jpg)
Motivation of this paper
• It presents a new algorithm for finding size-k network motifs from a directed network with less memory and CPU time in comparison to other algorithms
• Input : A large directed or undirected network
• Output: Network motifs which occur in the input network.
![Page 4: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/4.jpg)
Basic Terminologies
slides adapted from Shalev Itzkovitz’s talk given at IPAM UCLA on July 2005
![Page 5: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/5.jpg)
Basic Terminologies
Transcription regulation of gene by Protein
slides adapted from Shalev Itzkovitz’s talk given at IPAM UCLA on July 2005
![Page 6: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/6.jpg)
Basic Terminologies
slides adapted from Shalev Itzkovitz’s talk given at IPAM UCLA on July 2005
![Page 7: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/7.jpg)
Basic Terminologies
slides adapted from Shalev Itzkovitz’s talk given at IPAM UCLA on July 2005
![Page 8: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/8.jpg)
Basic Terminologies
Or Motifs
slides adapted from Shalev Itzkovitz’s talk given at IPAM UCLA on July 2005
![Page 9: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/9.jpg)
Basic Terminologies
slides adapted from Shalev Itzkovitz’s talk given at IPAM UCLA on July 2005
![Page 10: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/10.jpg)
Definition of Network Motifs• Patterns that occur in a real network significantly more than in
randomized networks are called NETWORK MOTIFS.
• Randomized Networks:Networks with same characteristics as the real network, but where the connections between nodes and edges are made at random.
![Page 11: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/11.jpg)
Definition of Network Motifs
R. Milo et al. Science 2002; vol 298:824-827
![Page 12: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/12.jpg)
Exist algorithms
• mFinder: size 3-4, directed and undirected
• PAJEK: size 3-4, directed and undirected, visible
• FANMOD: size 8, directed and undirected, sampling, visible
• NeMoFinder: size 13, undirected
![Page 13: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/13.jpg)
Kavosh consists of 4 steps• Enumeration: finding all subgraphs of a given size that occur in the
input graph
• Classification: classifying each found subgraph into isomorphic groups
• Random graph generation: generating random networks with respect to the input network (enumeration and classification are also performed on random networks)
• Motif identification: distinguishing motifs among all found subgraphs on basis of statistical parameters
![Page 14: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/14.jpg)
Enumeration
• All subgraphs that include a particular vertex are discovered
• Subsequently, this vertex is removed from thenetwork, and the process is repeated
![Page 15: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/15.jpg)
15
Enumeration
Example of enumeration: to find all size-3 induced subgraphs in G, the composition is (1,1),(2)
To find all size-4 induced subgraphs in G, the composition is (1,1,1),(1,2),(2,1),(3)
(1,1)
(2)G
![Page 16: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/16.jpg)
16
Enumeration(1,1)
(2)After removing node 1
After removing node 1 and 2
![Page 17: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/17.jpg)
Time Complexity of Enumeration
Typically, graph partition problems fall under the category of NP-hard problems. Solutions to these problems are generally derived using heuristics and approximation algorithms.
However, uniform graph partitioning or a balanced graph partition problem can be shown to be NP-complete to approximate within any finite factor. Even for special graph classes such as trees and grids, no reasonable approximation algorithms exist, unless P=NP. … When not only the number of edges between the components is approximated, but also the sizes of the components, it can be shown that no reasonable fully polynomial algorithms exist for these graphs.
from Wikipedia
![Page 18: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/18.jpg)
Classification
• NAUTY - algorithm for finding isomorphism subgraphs
• NAUTY uses canonical matrix as the unique identifier of a subgraph
• Two subgraphs are isomorphic if and only if their canonical matrices are same
![Page 19: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/19.jpg)
Canonical Matrix and Labeling
Adjacent-matrix
0011100100010000
Switch the node labels for obtaining new adjacent matrix.Turn matrix to string, representing a graph.Canonical Labeling: maximal or minimum string
Node order (2,1,3,4) 0101001100010000
Canonical Labeling
subgraph String
![Page 20: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/20.jpg)
NAUTY• The world's fastest isomorphism testing program is Nauty, by Brendan
D. McKay, Professor in the Research School of Computer Science, Australian National University.
• Nauty (No AUTomorphisms, Yes?) is a set of efficient C language procedures to produce a canonically-labeled isomorph of the graph, for isomorphism testing.
• It can test most graphs of less than 100 vertices in well under a second.
• Nauty has been successfully ported to a variety of operating systems and C compilers.
http://www.cs.sunysb.edu/~algorith/implement/nauty/implement.shtmlMcKay, B.D. Practical Graph Isomorphism, Congressus Numerantium, 30 (1981) 45-87
![Page 21: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/21.jpg)
Random graph generation• Switching operations are applied on the edges of the input
network repeatedly, until the network is well randomized.
This progress does not change the vertex degrees.
![Page 22: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/22.jpg)
Motif determination
• Two statistical measures– Z-score
where Np is the number which motif Gp occurred in the input network, is the mean which Gp occurred in random networks and σ is the standard deviation. The larger the Z-score, the more significant is the network motif
– P-valueIt indicates the number of random networks in which a motif GP occurred more often than in a biological network, divided by the total number of random
networks. P-value ranges from 0 to 1. The smaller the P-value, the more significant is the network motif.
![Page 23: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/23.jpg)
Parameters in KavoshThe following parameters are used to describe a network motif in Kovash paper
• The frequency(in real graph) is larger than 4
• By using 1000 randomized network, p-value < 0.01
• By using 1000 randomized network, Z-score > 1.0
![Page 24: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/24.jpg)
Performance
![Page 25: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/25.jpg)
Performance
E. coli gene regulatory network
Node number: 672Edge number: 1276
![Page 26: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/26.jpg)
Performance
![Page 27: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/27.jpg)
Performance
![Page 28: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/28.jpg)
Contribution
• Designed a new algorithm to find network motif for both directed and undirected network. size: > 8
• A new method to enumerate all the subgraphs
![Page 29: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/29.jpg)
![Page 30: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/30.jpg)
Discussions
• In terms of the algorithm for isomorphism testing, any better ones?
(LEDA Technical report)
![Page 31: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/31.jpg)
Discussions
• What is the bottleneck of this algorithm? Enumeration of subgraphs: Computing combination is exponential
Calculation of Canonical Labeling for all the subgraphs
![Page 32: Kavosh : a new algorithm for finding network motifs](https://reader036.vdocuments.net/reader036/viewer/2022062315/568165bc550346895dd8bbb7/html5/thumbnails/32.jpg)
Discussions
• Is it unbelievable?