coupledlp: link prediction in coupled networks · 2020-03-15 · 1 coupledlp: link prediction in...

43
1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong # , Jing Zhang + , Jie Tang + , Nitesh V. Chawla # , Bai Wang* # University of Notre Dame + Tsinghua University * Beijing University of Posts and Telecommunications

Upload: others

Post on 21-Apr-2020

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

1

CoupledLP: Link Prediction in Coupled Networks

Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

#University of Notre Dame +Tsinghua University *Beijing University of Posts and Telecommunications

Page 2: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

2

2015.08.08 10:30

2015.08.08 10:48

2015.08.08 11:01

2015.08.08 11:29 ……

2016.01.01 00:00

……

Mobile Networks

Page 3: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

3

2015.08.08 10:30

2015.08.08 10:48

2015.08.08 11:01

2015.08.08 11:29 ……

2016.01.01 00:00

……

Mobile Networks

Page 4: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

4

Disease-Gene Networks

1.  K.I. Goh, M. E. Cusick, D. Valle, B. Childs, M. Vidal, and A.-L. Barabási. The human disease network. PNAS 2007. 2.  D. Davis, N. V. Chawla. Exploring and Exploiting Disease Interactions from Multi-Relational Gene and Phenotype Networks. PLoS One 2011. 3.  J. Menche, A. Sharma, M. Kitsak, S. D. Ghiassian, M. Vidal, J. Loscalzo, A.-L. Barabási. Uncovering disease-disease relationships through the incomplete interactome. Science 2015.

Disease network Cross network Gene network

Page 5: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

5

Coupled Networks Given a source network GS = (VS, ES) and a target network GT = (VT, ET), they compose coupled networks if there exists a cross link eij with one node vi ∈ VS and the other node vj ∈ VT. The cross network GC = (VC, EC) is a bipartite network containing all the cross links in the coupled networks.

Source network Target networkCross network Coupled networks

Page 6: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

6

Coupled Link Prediction

Input   Output  

Source network Cross network Target network

Given the source network GS and the cross network GC in coupled networks G = (GS, GT, GC), the task is to find a predictive function:

f : (GS, GC) → YT where YT is the set of labels for the potential links in the target network GT.

Page 7: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

7

Challenges

Input   Output  

Source network Cross network Target network

Incompleteness Heterogeneity

Asymmetry

Page 8: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

8

Related Work: Traditional Link Prediction

1. D. Liben-Nowell and J. Kleinberg. The link prediction problem for social networks. CIKM’03.

B

D C

A

E

F

G

t1

B

D C

A

E

F

G

Input Output

t2

Page 9: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

9

Related Work: Heterogeneous Link Prediction

Input

1. Y. Sun, J. Han, C. C. Aggarwal, N. V. Chawla. Will Will This Happen? Relationship Prediction in Heterogeneous Information Networks. WSDM’12.

Output

Page 10: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

10

Related Work: Transfer Link Prediction

Input Output

1.  Y. Dong, J. Tang, S. Wu, J. Tian, N. V. Chawla. J. Rao, H. Cao. Link Prediction and Recommendation across Heterogeneous Networks. ICDM’12 2.  J. Tang, T. Lou, J. Kleinberg, S. Wu. Transfer Link Prediction across Heterogeneous Networks. TOIS 2015.

Source network Target network

Page 11: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

11

Related Work: Cross-Domain Link Prediction

Input Output

1. J. Tang, S. Wu, J. Sun, H. Su. Cross-Domain Collaboration Recommendation. KDD’12.

Source network Cross networkTarget network

Page 12: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

12

Related Work: Anchor Link Prediction

Input Output

1.  X. Kong, J. Zhang, P. S. Yu. Inferring anchor links across multiple heterogeneous social networks. CIKM’13. 2.  Y. Zhang, J. Tang, Z. Yang, J. Pei, and P. S. Yu. COSNET: Connecting Heterogeneous Social Networks with Local and Global Consistency. KDD’15.

A network Self-linkage networkB network

Page 13: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

13

Challenges

Input   Output  

Source network Cross network Target network

Incompleteness Heterogeneity

Asymmetry

Page 14: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

14

CoupledLP Framework

1. Implicit Target Network Construction •  Solve Incompleteness

2. Coupled Factor Graph Model •  Solve Asymmetry

•  Solve Heterogeneity

Page 15: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

15

CoupledLP Framework

Input Output

Source network Cross network Target network

99%

80%

75%

87%

Implicit Target Network Construction

Incompleteness

1. V. Leroy, B. B. Cambazoglu, and F. Bonchi. Cold start link prediction. In KDD ’10.

Page 16: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

16

CoupledLP: Implicit Target Network

v3S

v1T

v4S

v2T

v3S

v1T

v4S

v2T

v3S

v1T

v4S

v2T

1. R. Guha, R. Kumar, P. Raghavan, and A. Tomkins. Propagation of trust and distrust. In WWW ’04

Atomic Propagations for constructing an implicit target network

Direct Coupling Co-citation

MM MMT MTM + + top z%

Page 17: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

17

CoupledLP Framework

Input Output

Source network Cross network Target network

99%

80%

75%

87%

Coupled Factor Graph

Asymmetry Heterogeneity

Page 18: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

18

Basic Idea

y12

y34 y23

v1 v2

v3 v4

v5

v6

1.  F. R. Kschischang, B. J. Frey, and H. andrea Loeliger. Factor graphs and the sum-product algorithm. In IEEE TOIT 2001. 2.  Jie Tang, Tiancheng Lou, and Jon Kleinberg. Inferring social ties across heterogeneous networks. In WSDM '12

Input Network Factor Graph

Page 19: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

19

CoupledLP: Coupled Factor Graph

y13

f (v1, v3, y13)

y34

y06

Observations

y13=?

v1,v3

v3, v4

v0, v6

y06=?

f (v3, v4, y34)

F (vs, v3, ys3)

y12

v1, v2

f (v1, v2, y12)

y12=1

v5, v6v2, v3

y56y23

y23=? y56=0

f (v2, v3, y23) f (v5, v6, y56)

g(y12, y13)

g(y13, y23) g(y06, y56)

v5

v2

v3v0

Coupled Networks

v6

v1

v4

... CoupledFG

g(y12, y23)

y34=?

Page 20: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

20

CoupledLP: Coupled Factor Graph

y13

f (v1, v3, y13)

y34

y06

Observations

y13=?

v1,v3

v3, v4

v0, v6

y06=?

f (v3, v4, y34)

F (vs, v3, ys3)

y12

v1, v2

f (v1, v2, y12)

y12=1

v5, v6v2, v3

y56y23

y23=? y56=0

f (v2, v3, y23) f (v5, v6, y56)

g(y12, y13)

g(y13, y23) g(y06, y56)

v5

v2

v3v0

Coupled Networks

v6

v1

v4

... CoupledFG

g(y12, y23)

y34=?

model source and target network separately meta-path

Asymmetry

Attribute Factor

1. Jie Tang, Tiancheng Lou, and Jon Kleinberg. Inferring social ties across heterogeneous networks. In WSDM '12.

Page 21: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

21

CoupledLP: Coupled Factor Graph

y13

f (v1, v3, y13)

y34

y06

Observations

y13=?

v1,v3

v3, v4

v0, v6

y06=?

f (v3, v4, y34)

F (vs, v3, ys3)

y12

v1, v2

f (v1, v2, y12)

y12=1

v5, v6v2, v3

y56y23

y23=? y56=0

f (v2, v3, y23) f (v5, v6, y56)

g(y12, y13)

g(y13, y23) g(y06, y56)

v5

v2

v3v0

Coupled Networks

v6

v1

v4

... CoupledFG

g(y12, y23)

y34=?

model source and target network separately meta-path Heterogeneity

Asymmetry

Meta-path Factor

1. Jie Tang, Tiancheng Lou, and Jon Kleinberg. Inferring social ties across heterogeneous networks. In WSDM '12.

Page 22: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

22

CoupledLP: Coupled Factor Graph

1.  Y. Sun, J. Han, C. C. Aggarwal, N. V. Chawla. Will Will This Happen? Relationship Prediction in Heterogeneous Information Networks. In WSDM’12.

meta-path

disease

express associate

gene gene

disease disease

gene gene

?

?

……

Page 23: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

23

v  Factor Initialization: exponential-linear

CoupledLP: Coupled Factor Graph

v  Objective Function: model source & target network separately

meta-path bridge source & target networks

Page 24: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

24

Learning: Gradient Decent method

1. Jie Tang, Tiancheng Lou, and Jon Kleinberg. Inferring social ties across heterogeneous networks. In WSDM '12.

CoupledLP: Coupled Factor Graph

Page 25: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

25

CoupledLP Framework

1. Implicit Target Network Construction •  Solve Incompleteness

2. Coupled Factor Graph Model •  Solve Asymmetry

•  Solve Heterogeneity

Page 26: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

26

Experiments: Data

1.  D. Davis, N. V. Chawla. Exploring and Exploiting Disease Interactions from Multi-Relational Gene and Phenotype Networks. PLoS One 2011. 2.  N. Du, C. Faloutsos, B. Wang, and L. Akoglu. Large human communication networks: patterns and a utility-driven generator. In KDD ’09. 3.  Y. Dong, Y. Yang, J. Tang, Y. Yang, and N. V. Chawla. Inferring user demographics and social strategies in mobile social networks. In KDD’14.

k: average degree; cc: clustering coefficient; ac: associative coefficient

Healthcare Networks Disease (D)---Gene (G)

Page 27: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

27

Experiments: Data

1.  D. Davis, N. V. Chawla. Exploring and Exploiting Disease Interactions from Multi-Relational Gene and Phenotype Networks. PLoS One 2011. 2.  N. Du, C. Faloutsos, B. Wang, and L. Akoglu. Large human communication networks: patterns and a utility-driven generator. In KDD ’09. 3.  Y. Dong, Y. Yang, J. Tang, Y. Yang, and N. V. Chawla. Inferring user demographics and social strategies in mobile social networks. In KDD’14.

k: average degree; cc: clustering coefficient; ac: associative coefficient

Healthcare Networks Disease (D)---Gene (G)

Mobile Phone Call Networks Two Operators: Aa---Ab

Page 28: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

28

Experiments: Data

1.  D. Davis, N. V. Chawla. Exploring and Exploiting Disease Interactions from Multi-Relational Gene and Phenotype Networks. PLoS One 2011. 2.  N. Du, C. Faloutsos, B. Wang, and L. Akoglu. Large human communication networks: patterns and a utility-driven generator. In KDD ’09. 3.  Y. Dong, Y. Yang, J. Tang, Y. Yang, and N. V. Chawla. Inferring user demographics and social strategies in mobile social networks. In KDD’14.

k: average degree; cc: clustering coefficient; ac: associative coefficient

Healthcare Networks Disease (D)---Gene (G)

Mobile Phone Call Networks Two Operators: Aa---Ab

Mobile Phone Call Networks Three Operators: Ea---Eb---Ec

Page 29: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

29

Experiments: Data

k: average degree; cc: clustering coefficient; ac: associative coefficient

Asymmetry Heterogeneity

Page 30: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

30

Experiments: Coupled Networks

1 2 3 4 5 6 7 8 9 10

10 Coupled Prediction Cases

Page 31: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

31

Experiments

AUPR or AUROC or Precision @ k

10 Coupled Prediction Cases

Page 32: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

32

Baselines

1.  R. N. Lichtenwalter, J. T. Lussier, and N. V. Chawla. New perspectives and methods in link prediction. In KDD ’10. 2.  L. Backstrom and J. Leskovec. Supervised random walks: predicting and recommending links in social networks. In WSDM’11.

AUPR or AUROC or Precision @ k

Unsupervised Methods: ü Common Neighbors (CN) ü Adamic Adar (AA) ü Jaccard Coefficient (JC) ü Preferential Attachment (PA) ü PropFlow (PF) ü Implicit Target Network (IT)

Supervised Methods: ü Logistic Regression (LRC)

o  LRC-IT ü Decision Tree (DT)

o  DT-IT ü CoupledLP

ü CoupledLP-IT

LRC-IT, DT-IT, CoupledLP-IT: NO Implicit Target network construction

features

Page 33: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

33

Experiments

AUPR or AUROC or Precision @ k

Training Links: ü source links between nodes with cross links ü 1% target links

ü Test Links: ü 99% target links

Page 34: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

34

Evaluation Metrics

1.  R. N. Lichtenwalter, J. T. Lussier, and N. V. Chawla. New perspectives and methods in link prediction. In KDD ’10. 2.  L. Backstrom and J. Leskovec. Supervised random walks: predicting and recommending links in social networks. In WSDM’11.

AUPR or AUROC or Precision @ k

ü  Area Under Precision Recall Curve (AUPR) ü  Area Under Receiver Operating Characteristic Curve (AUROC) ü  Precision at Top k

Page 35: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

35

AUPR Results

AUPR

Page 36: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

36

AUROC Results

AUROC

Page 37: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

37

Precision@k Results Precision @ k

Page 38: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

38

Effects of Implicit Target Network

AUPR

Page 39: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

39

Effects of Implicit Target Network

AUPR

Page 40: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

40

Future Work

1. Efficiency of CoupledLP

2. One-step prediction framework

3. User behavior in coupled networks

… …

Page 41: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

41

Conclusion

Output

Source network Cross network Target network

Input

v  Coupled Link Prediction Problem

v  CoupledLP Framework

Implicit Target Network Construction

•  Solve Incompleteness

Coupled Factor Graph Model •  Solve Asymmetry

•  Solve Heterogeneity

Page 42: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

42

Thank You!

Questions

Data & Code: https://aminer.org/coupledlp

Page 43: CoupledLP: Link Prediction in Coupled Networks · 2020-03-15 · 1 CoupledLP: Link Prediction in Coupled Networks Yuxiao Dong#, Jing Zhang+, Jie Tang+, Nitesh V. Chawla#, Bai Wang*

43

Effects of Implicit Target Network

x-axis: pruning threshold z y-axis: AUPR / AUROC

+5% AUPR

+8% AUROC

Unsupervised methods on implicit target network