leonid e. zhukovrandom walks on graph a random walk on graph on graph g is a sequence of vertices v...

22
Diffusion and random walks on graphs Leonid E. Zhukov School of Data Analysis and Artificial Intelligence Department of Computer Science National Research University Higher School of Economics Structural Analysis and Visualization of Networks Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 1 / 22

Upload: others

Post on 08-May-2020

7 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Diffusion and random walks on graphs

Leonid E. Zhukov

School of Data Analysis and Artificial IntelligenceDepartment of Computer Science

National Research University Higher School of Economics

Structural Analysis and Visualization of Networks

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 1 / 22

Page 2: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Module 4

1 Diffusions and random walks on graphs

2 Epidemics on networks

3 Information flow

4 Diffusion of innovations

5 Social influence

6 Trust propagation

7 Segregation model on networks

8 Link prediction

9 Node labeling

10 Time evolution of networks

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 2 / 22

Page 3: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Lecture outline

1 Random walks on graph

2 Diffusion on graphDiffusion equationLaplace operator

3 Spectral graph theoryNormalized laplacian

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 3 / 22

Page 4: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Random walks on graph

A random walk on graph on graph G is a sequence of verticesv0, v1, ...vt .., where each vt+1 is chosen to be a random neighbor ofvt , {vt , vt+1} ∈ E (G ) and probability of the transition is given by

Pij = P(xt+1 = vj |xt = vi ),

where∑

i Pij = 1, matrix P - row stochastic

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 4 / 22

Page 5: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Random walks on graph

2D grid (k=2 regular graph)

image from wikipedia.org

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 5 / 22

Page 6: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Random walks on graph

We will be considering undirected connected unweighted graphs

Transition matrix

Pij =

{1/d(i) , if ∃ e(i , j), i and j adjacent,0 , otherwise

Using adjacency matrix

Pij =Aij

di= D−1

ii Aij , where Dij = diδij

Let pi (t) - probability, that a walk is at node i at moment t(probability distribution vector, value per node)

Random walk

pj(t + 1) =∑i

Pijpi (t) =∑i

pi (t)

diAij

Matrix form~p(t + 1) = ~p(t)P = ~p(t)(D−1A)

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 6 / 22

Page 7: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Random walks on graph

Starting from initial distribution ~p(0) after t steps

~p(t) = ~p(0)Pt

Random walk on connected non-bipartite graphs converges to limitingdistribution

limt→∞

~p(t) = limt→∞

~p(0)Pt = ~π

Limiting distribution = stationary distribution

limt→∞

~p(t + 1) = limt→∞

~p(t)P

~π = ~πP

Left eigenvalue corresponding to λ = 1

λ~π = ~πP

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 7 / 22

Page 8: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Random walks on graph

Random walk is reversible if

πiPij = πjPji

On undirected graph:

πiAij

di= πj

Aji

dj

πidi

=πjdj

= const

and∑

i πi = 1

Stationary (stable) distribution

πi =di∑j dj

=di

2|E |

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 8 / 22

Page 9: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Random walks on graph

Lazy random walk

pj(t + 1) =1

2pj(t) +

1

2

∑i

pi (t)

diAij

Matrix form

~p(t + 1) =1

2~p(t)(I + D−1A)

Converges (always!) to the same stationary distribution

(2λ− 1)~π = ~π(D−1A)

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 9 / 22

Page 10: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Random walks on graph

Theorem

Let λ2 denote second largest eigenvalue of transition matrix P = D−1A,p(t) probability distribution vector and π stationary distribution. If walkstarts from the vertex i , pi (0) = 1, then after t steps for every vertex:

|pj(t)− πj | ≤

√djdiλt2

For P = D−1A, λ1 = 1, λ2 < 1

For P′ = 12 (I + D−1A), λ′2 = 1

2 (1 + λ2)

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 10 / 22

Page 11: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Physics of Diffusion

Let Φ(r , t) -concentration

Fik’s Law

J = −C ∂Φ

∂r= −C∇Φ

Continuity equation (conserved quantity)

∂Φ

∂t+∇J = 0

Diffusion equation ( heat equation)

∂Φ(r , t)

∂t= C∆Φ(r , t)

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 11 / 22

Page 12: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Diffusion

Laplacian 2D

∆f = ∇2f =∂2f

∂x2+∂2f

∂y2

Discretized Laplacian in 2D

∆f (x , y) =f (x + h, y) + f (x − h, y) + f (x , y + h) + f (x , y − h)− 4f (x , y)

h2

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 12 / 22

Page 13: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Diffusion on network

Some substance that occupy vertices, on each time step diffuses outφi (t) - quantity per node

φi (t + 1) = φi (t) +∑j

Aij(φj(t)− φi (t))Cδt

dφi (t)

dt= C

∑j

Aij(φj(t)− φi (t))

dφidt

= C (∑j

Aijφj−∑j

Aijφi ) = C (∑j

Aijφj−diφi ) = C∑j

(Aij−δijdj)φj

dφidt

= −C∑j

Lijφj

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 13 / 22

Page 14: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Graph Laplacian

Graph Laplacian

Lij = djδij − Aij = Dij − Aij , Dij = djδij

Lij =

d(i) , if i = j ,−1 , if ∃ e(i , j)− i and j adjacent,0 , otherwise

Matrix formL = D− A

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 14 / 22

Page 15: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Diffusion on Graph

Diffusion equationdφ

dt+ CLφ = 0

Eigenvector basisLvk = λvk

φ(t) =∑k

ak(t)vk , ak(t) = φ(t)Tvk

ODE ∑k

(dak(t)

dt+ Cλkak(t)

)vk = 0

dak(t)

dt+ Cλkak(t) = 0

ak(t) = ak(0)e−Cλk t

Solutionφ(t) =

∑k

ak(0)vke−Cλk t

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 15 / 22

Page 16: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Laplace matrix

L - symmetric positive semidefinite

φTLφ =∑ij

Lijφiφj =∑ij

(diδij − Aij)φiφj =1

2

∑ij

Aij(φi − φj)2

Spectral propertiesLvi = λvi

real non-negative eigenvalues λi ≥ 0 and orthogonal eigenvectors vi

smallest eigenvalue always λ1 = 0 for v1 = e = [1, 1, 1...1]T

Le = (D− A)e = 0

Number of zero eigenvalues = number of connected components

In connected graph λ2 6= 0 - algebraic connectivity of a graph(spectral gap), v2 - Fiedler vector

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 16 / 22

Page 17: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Diffusion on Graph

Solutionφ(t) =

∑k

ak(0)vke−Cλk t

all λi > 0 for i > 1, λ1 = 0:

limt→∞

φ(t) = a1(0)v1

Normalized solution v1 = 1√Ne

a1(0) = φ(0)Tv1 =1√N

∑j

φj(0)

Steady state

limt→∞

φ(t) = (1

N

∑j

φj(0))e = const

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 17 / 22

Page 18: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Diffusion on Graph

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 18 / 22

Page 19: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Smoothing operator

Smoothing operator

(Lφ)i =∑j

(Dij −Aij)φj =∑j

(diδijφj −Aijφj) = di (φi −1

di

∑j

Aijφj)

Laplace equation ∇φ = 0, (Lφ)i = 0, solution - harmonic function

φi =1

di

∑j

Aijφj

Regression on graphs

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 19 / 22

Page 20: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Normalized Laplacian

Normalized Laplacian

L = D−1/2LD−1/2

Lij =

1 , if i = j ,− 1√

didj, if ∃ e(i , j)− i and j adjacent,

0 , otherwise

Connection to random walks:

P = D−1A = D−1/2(I − L)D1/2

Similar matrices, share properties of represented linear operators, i.e.eigenvalues: λmax(P) = 1, λ1(L) = 0.

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 20 / 22

Page 21: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

Normalized Laplacian

Conductance of a vertex set S

φ(S) =cut(S ,V \S)

min(vol(S), vol(V \S))

where vol(S) =∑

i∈S ki - sum of all node degrees in the set

Cheeger’s inequality

λ2(L)/2 ≤ minSφ(S) ≤

√2λ2(L)

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 21 / 22

Page 22: Leonid E. ZhukovRandom walks on graph A random walk on graph on graph G is a sequence of vertices v 0;v 1;:::v t::, where each v t+1 is chosen to be a random neighbor of v t, fv t;v

References

Chung, Fan R.K. (1997). Spectral graph theory (2ed.). Providence,RI: American Math. Soc.

Daniel A. Spielman. Spectral Graph theory. Combinatorial ScientificComputing. Chapman and Hall/CRC Press. 2011

Lovasz, L. (1993). Random walks on graphs: a survey. InCombinatorics, Paul Erdos is eighty (pp. 353 – 397). Budapest:Janos Bolyai Math. Soc.

Leonid E. Zhukov (HSE) Lecture 11 31.03.2015 22 / 22