graph mining techniques for social media analysis · 2017. 9. 1. · why graph mining? thanks to...

57
1- 1 McGlohon, Faloutsos ICWSM 2008 1 Graph Mining Techniques for Social Media Analysis Mary McGlohon Christos Faloutsos

Upload: others

Post on 30-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 1McGlohon, Faloutsos ICWSM 2008

1

Graph MiningTechniques for Social

Media AnalysisMary McGlohon

Christos Faloutsos

Page 2: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 2McGlohon, Faloutsos ICWSM 2008

What is graph mining?● Extracting useful knowledge (patterns, outliers,

etc.) from structured data that can berepresented as a graph.

● For our purposes, this is usually a social network.`

Facebook graph, via Touchgraph

Livejournal, via Lehman and Kottler

Page 3: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 3McGlohon, Faloutsos ICWSM 2008

What is graph mining?● Example: Social media host tries to look at

certain online groups and predict whether thegroup will flourish or disband.

● Example: Phone provider looks at cell phone callrecords to determine whether an account is aresult of identity theft.

Page 4: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 4McGlohon, Faloutsos ICWSM 2008

Why graph mining?● Thanks to the web and social media, for the first

time we have easily accessible network data on alarge-scale.

● Understand relationships (links) as well ascontent (text, images).

● Large amounts of data raise new questions.

Need fororganization

Massive amountof data

Page 5: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 5McGlohon, Faloutsos ICWSM 2008

Motivating questions● Q1: How do networks form, evolve, collapse?● Q2: What tools can we use to study networks?● Q3: Who are the most influential/central

members of a network?● Q4: How do ideas diffuse through a network?● Q5: How can we extract communities?● Q6: What sort of anomaly detection can we

perform on networks?

Page 6: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 6McGlohon, Faloutsos ICWSM 2008

Outline● Part 1: Q1: How do networks form, evolve,

collapse?– Introduction to networks– Patterns, Laws

● Part 2: Q2: What tools can we use to studynetworks?– Q3: Ranking: Who are the most important members of a network?

● Part 3: Case studies– Q4: Diffusion: How do ideas diffuse through a network?– Q5: How can we extract communities?– Q6: What sort of anomaly detection can we perform?

Page 7: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 7McGlohon, Faloutsos ICWSM 2008

Part 1 Outline● Introduction to networks and 6 definitions● Patterns

– Diameter– Degree distribution– Connected components– Evolution over time

Page 8: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 8McGlohon, Faloutsos ICWSM 2008

D1: Network● A network is defined as a graph G=(V,E)

– V : set of vertices, or nodes.– E : set of edges.

● Edges may have numerical weights.

Page 9: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 9McGlohon, Faloutsos ICWSM 2008

D2: Adjacency matrix● To represent graphs, use adjacency matrix● Unweighted graphs: all entries are 0 or 1● Undirected graphs: matrix is symmetric

B1 B2 B3 B4

B1 0 1 0 0B2 1 0 0 0B3 0 0 1 0B4 1 2 0 3

from

to

Page 10: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 10McGlohon, Faloutsos ICWSM 2008

D3: Bipartite graphs● In a bipartite graph,

– 2 sets of vertices– edges occur between different sets.

● If graph is undirected, we can represent as a non-square adjacency matrix.

n1

n2

n3

n4

m1

m2

m3

m1 m2 m3

n1 1 1 0n2 0 0 1n3 0 0 0n4 0 0 1

Page 11: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 11McGlohon, Faloutsos ICWSM 2008

D4: Components● Component: set of nodes with paths between

each.

n1

n2

n3

n4

m1

m2

m3

Page 12: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 12McGlohon, Faloutsos ICWSM 2008

D4: Components● Component: set of nodes with paths between

each.● We will see later that often real graphs form a

giant connected component.

n1

n2

n3

n4

m1

m2

m3

Page 13: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 13McGlohon, Faloutsos ICWSM 2008

D5: Diameter● Diameter of a graph is the “longest shortest path”.

n1

n2

n3

n4

m1

m2

m3

Page 14: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 14McGlohon, Faloutsos ICWSM 2008

D5: Diameter● Diameter of a graph is the “longest shortest path”.

n1

n2

n3

n4

m1

m2

m3

diameter=3

Page 15: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 15McGlohon, Faloutsos ICWSM 2008

D5: Diameter● Diameter of a graph is the “longest shortest path”.● We can estimate this by sampling.● Effective diameter is the distance at which 90%

of nodes can be reached.

n1

n2

n3

n4

m1

m2

m3

diameter=3

Page 16: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 16McGlohon, Faloutsos ICWSM 2008

D6: Degree distribution● We can find the degree of any node by summing

entries in the (unweighted) adjacency matrix.

B1 B2 B3 B4

B1 0 1 0 0 1B2 1 0 0 0 1B3 0 0 1 0 1B4 1 1 0 1 3 2 2 1 1

from

toout-degree

in-degree

Page 17: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 17McGlohon, Faloutsos ICWSM 2008

Some graphs

Internet Map [lumeta.com] Food Web [Martinez ’91]

Protein Interactions[genomebiology.com]Friendship Network [Moody ’01]

Research question:Are real graphs random?

(no)

Page 18: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 18McGlohon, Faloutsos ICWSM 2008

Part 1 Outline● Introduction to networks● Patterns

– Diameter– Degree distribution– Connected components– Evolution over time

Page 19: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 19McGlohon, Faloutsos ICWSM 2008

Small-world effect● Graphs usually display small diameter.● First demonstrated by Travers & Milgram in 1960.

– Most of the time, distance was around 6.

● Similarly, real graphs we see have smalldiameter…

Page 20: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 20McGlohon, Faloutsos ICWSM 2008

[Leskovec & Horvitz 07]Distribution ofshortest path lengths

Microsoft Messengernetwork 180 million people 1.3 billion edges Edge if two people

exchanged at leastone message in onemonth period Distance (Hops)

Num

ber o

f nod

es

Pick a randomnode, counthow many

nodes are atdistance

1,2,3... hops

7

Page 21: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 21McGlohon, Faloutsos ICWSM 2008

Part 1 Outline● Introduction to networks● Patterns

– Diameter: “small world effect”– Degree distribution– Connected components– Evolution over time

Page 22: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 22McGlohon, Faloutsos ICWSM 2008

Degree distribution● Suppose average

degree is 3.3● If we pick a node at

random, can weguess its degree?

Count vs. degree

avg: 3.3

● In real graph, “mode” is 1!

Page 23: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 23McGlohon, Faloutsos ICWSM 2008

Degree distribution● Suppose average

degree is 3.3● If we pick a node at

random, can weguess its degree?

Count vs. degree

avg: 3.3

● In real graph, “mode” is 1!● Therefore, mean is

“meaningless”.

What pattern does degree of real graphs follow?

Page 24: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 24McGlohon, Faloutsos ICWSM 2008

Power law degreedistribution

● Measure with rank exponent R● [SIGCOMM99]

internet domains

log(rank)

log(degree)

-0.82

att.com

ibm.com

Page 25: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 25McGlohon, Faloutsos ICWSM 2008

Power laws - discussion● Do they hold, over time?

● Do they hold on other graphs/domains?

Page 26: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 26McGlohon, Faloutsos ICWSM 2008

Power laws - discussion● Do they hold, over time?

– Yes! for multiple years [Siganos+]

● Do they hold on other graphs/domains?– Yes!– Web sites and links [Tomkins+], [Barabasi+]– Peer-to-peer graphs (gnutella-style)– Who-trusts-whom (epinions.com)

Page 27: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 27McGlohon, Faloutsos ICWSM 2008

Time Evolution: rank R

• The rank exponent has not changed![Siganos+]

Domainlevel

log(rank)

log(degree)

-0.82

att.com

ibm.com

Page 28: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 28McGlohon, Faloutsos ICWSM 2008

The Peer-to-Peer Topology

• Number of immediate peers (= degree), follows apower-law

[Jovanovic+]

degree

count

Page 29: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 29McGlohon, Faloutsos ICWSM 2008

epinions.com● who-trusts-whom

[Richardson +Domingos, KDD 2001]

(out) degree

count

Page 30: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 30McGlohon, Faloutsos ICWSM 2008

Part 1 Outline● Introduction to networks● Patterns

– Diameter: “small world effect”– Degree distribution: power law– Connected components– Evolution over time

Page 31: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 31McGlohon, Faloutsos ICWSM 2008

Components● Basic graph generator, Erdos-Renyi

– For n vertices, connect any two IID with probability p.

● Many provable properties, including emergenceof a giant connected component.

Real graphs do nothave E-R degreedistribution, but...

Page 32: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 32McGlohon, Faloutsos ICWSM 2008

Giant connected component● Nearly all real networks have a giant connected

component (GCC) emerge!● Often SM graphs have “middle region”

– See [Kumar, Novak, & Tomkins, KDD 2006]

(Example: NIPS citation graph,visualization with GUESS [Adar 06]

Page 33: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 33McGlohon, Faloutsos ICWSM 2008

Part 1 Outline● Introduction to networks● Patterns

– Diameter: “small world effect”– Degree distribution: power law– Connected components: giant CC– Evolution over time

Page 34: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 34McGlohon, Faloutsos ICWSM 2008

Motivating questions

• How do graphs evolve?• Degree-exponent seems constant - any other

consistent patterns?

Page 35: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 35McGlohon, Faloutsos ICWSM 2008

Evolution of diameter?

• Prior analysis, on power-law-like graphs, hintsdiameter slowly increasing with time.

diameter ~ O(log(N)) ordiameter ~ O( log(log(N)))

• Slowly increasing with network size• What is happening, in reality?

Diameter shrinks, toward a constant value!

X

Page 36: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 36McGlohon, Faloutsos ICWSM 2008

Shrinking diameter[Leskovec, Faloutsos,

Kleinberg KDD 2005]● Citations among physics

papers● 11yrs; @ 2003:

– 29,555 papers– 352,807 citations

● For each month M, create agraph of all citations up tomonth M

time

diameter

Page 37: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 37McGlohon, Faloutsos ICWSM 2008

Shrinking diameter• Authors &

publications• 1992– 318 nodes– 272 edges• 2002– 60,000 nodes• 20,000 authors• 38,000 papers

– 133,000 edges

Page 38: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 38McGlohon, Faloutsos ICWSM 2008

Shrinking diameter• Patents & citations• 1975– 334,000 nodes– 676,000 edges• 1999– 2.9 million nodes– 16.5 million edges• Each year is a

datapoint

Page 39: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 39McGlohon, Faloutsos ICWSM 2008

Shrinking diameter• Autonomous

systems• 1997– 3,000 nodes– 10,000 edges• 2000– 6,000 nodes– 26,000 edges• One graph per day

N

diameter

Page 40: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 40McGlohon, Faloutsos ICWSM 2008

Temporal evolution●N(t) nodes; E(t) edges at time t●Suppose that

N(t+1) = 2 * N(t)●What is your guess for

E(t+1) =? 2 * E(t)

Page 41: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 41McGlohon, Faloutsos ICWSM 2008

Temporal evolution●N(t) nodes; E(t) edges at time t●Suppose that

N(t+1) = 2 * N(t)●What is your guess for

E(t+1) =? 2 * E(t)●Edges over-double!

X

Page 42: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 42McGlohon, Faloutsos ICWSM 2008

Temporal evolution

●Growth of edges obeys power law with:E(t) ~ N(t)a for all t

where 1<a<2

Page 43: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 43McGlohon, Faloutsos ICWSM 2008

Densification Power LawArXiv: Physics papersand their citations

1.69

N(t)

E(t)

Page 44: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 44McGlohon, Faloutsos ICWSM 2008

Densification Power LawArXiv: Physics papersand their citations

1.69

N(t)

E(t)

‘tree’

1

Page 45: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 45McGlohon, Faloutsos ICWSM 2008

Densification Power LawArXiv: Physics papersand their citations

1.69

N(t)

E(t)‘clique’

2

Page 46: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 46McGlohon, Faloutsos ICWSM 2008

Densification Power LawU.S. Patents, citing each

other

1.66

N(t)

E(t)

Page 47: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 47McGlohon, Faloutsos ICWSM 2008

Densification Power LawAutonomous Systems

1.18

N(t)

E(t)

Page 48: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 48McGlohon, Faloutsos ICWSM 2008

Densification Power LawArXiv: authors & papers

1.15

N(t)

E(t)

Page 49: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 49McGlohon, Faloutsos ICWSM 2008

Part 1 Outline● Introduction to networks● Patterns

– Diameter: “small world effect”– Degree distribution: power law– Connected components: giant CC– Evolution over time: Shrinking diameter,

Densification power law

Page 50: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 50McGlohon, Faloutsos ICWSM 2008

Another big question● Q: How can we generate realistic networks?

Page 51: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 51McGlohon, Faloutsos ICWSM 2008

Another big question● Q: How can we generate realistic networks?● A: Answering this question fully would require

another tutorial. ● Some models are preferential attachment

(Barabasi et. al.), copying model (Kleinberg et.al.), “winners don’t take all” (Pennock et. al.),Kronecker multiplication (Leskovec et. al.).

Page 52: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 52McGlohon, Faloutsos ICWSM 2008

Bibliography: Part 1● Power Laws and patterns

– Albert, R. & Barabasi, A. (2002), 'Statistical mechanics ofcomplex networks', Reviews of Modern Physics 74, 47.

– Barabasi, A. L. & Albert, R. (1999), 'Emergence of scalingin random networks', Science 286(5439), 509--512.

– Domingos, P. & Richardson, M. (2001), Mining the networkvalue of customers, in 'KDD '01: Proceedings of theseventh ACM SIGKDD international conference onKnowledge discovery and data mining', ACM Press, NewYork, NY, USA, pp. 57--66.

– Faloutsos, M.; Faloutsos, P. & Faloutsos, C. (1999), 'OnPower-law Relationships of the Internet Topology',SIGCOMM, 251-262.

Page 53: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 53McGlohon, Faloutsos ICWSM 2008

Bibliography: Part 1– Jovanovic, M. A. (2001), 'Modeling Large-scale Peer-to-

Peer Networks and a Case Study of Gnutella ', Master'sthesis, University of Cincinatti.

– Kossinets, G. & Watts, D. J. (2006), 'Empirical Analysis ofan Evolving Social Network', Science 311(5757), 88--90.

– Kumar, R.; Novak, J.; Raghavan, P. & Tomkins, A. (2004),'Structure and evolution of blogspace', Commun. ACM47(12), 35--39.

– Kumar, R.; Novak, J.; Raghavan, P. & Tomkins, A. (2003),On the bursty evolution of blogspace, in 'WWW '03:Proceedings of the 12th international conference on WorldWide Web', ACM Press, New York, NY, USA, pp. 568--576.

– Kumar, R.; Novak, J. & Tomkins, A. (2006), Structure andevolution of online social networks, in 'KDD '06:Proceedings of the 12th ACM SIGKDD InternationalConference on Knowedge Discover and Data Mining', pp.611--617.

Page 54: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 54McGlohon, Faloutsos ICWSM 2008

Bibliography: Part 1– Leland, W.; Taqqu, M.; Willinger, W. & Wilson, D. (1994),

'On the Self-Similar Nature of Ethernet Traffic', IEEETransactions on Networking 2(1), 1-15.

– Leskovec, J.; Kleinberg, J. & Faloutsos, C. (2005), Graphsover time: densification laws, shrinking diameters andpossible explanations, in 'KDD '05.

– Milgram, S. (1967), 'The small-world problem', PsychologyToday 2, 60--67.

– Newman, M. E. J. (2005), 'Power laws, Pareto distributionsand Zipf's law', Contemporary Physics 46, 323.

– Siganos, G.; Faloutsos, M.; Faloutsos, P. & Faloutsos, C.(2003), 'Power laws and the AS-level internet topology.',IEEE/ACM Trans. Netw. 11(4), 514-524.

– Zipf, G. (1949), Human Behavior and Principle of LeastEffort: An Introduction to Human Ecology, Addison Wesley,Cambridge, Massachusetts.

Page 55: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 55McGlohon, Faloutsos ICWSM 2008

Bibliography: Part 1● Models

– Barabasi, A. (2005), 'The origin of bursts and heavy tails inhuman dynamics', Nature 435, 207.

– Erdos, P. & Renyi, A. (1960), 'On the evolution of randomgraphs', Publ. Math. Inst. Hungary. Acad. Sci. 5, 17-61.

– Granovetter, M. (1978), 'Threshold Models of CollectiveBehavior', Am. Journal of Sociology 83(6), 1420--1443.

– Leskovec, J.; Faloutsos, C. Scalable modeling of realgraphs using Kronecker Multiplication. ICML 2007.

– Vazquez, A.; Oliveira, J. G.; Dezso, Z.; Goh, K. I.; Kondor,I. & Barabasi, A. L. (2006), 'Modeling bursts and heavy tailsin human dynamics', Physical Review E 73, 036127.

– Watts, D. J. & Strogatz, S. H. (1998), 'Collective dynamicsof 'small-world' networks.', Nature 393(6684), 440--442.

Page 56: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 56McGlohon, Faloutsos ICWSM 2008

Bibliography: Part 1● Visualization example

– Adar, Eytan, "GUESS: A Language and Interface forGraph Exploration," CHI 2006

– Katharina Anna Lehmann, Stephan Kottler, MichaelKaufmann. Visualizing Large and ClusteredNetworks. Graph Drawing 2006

– Touchgraph:http://www.touchgraph.com/TGFacebookBrowser.html

Page 57: Graph Mining Techniques for Social Media Analysis · 2017. 9. 1. · Why graph mining? Thanks to the web and social media, for the first time we have easily accessible network data

1- 57McGlohon, Faloutsos ICWSM 2008