community clustering in distributed publish/subscribe system wei li 1,2,songlin hu 1, jintao li 1,...

21
Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2 ,Songlin Hu 1 , Jintao Li 1 , Hans- Arno Jacobsen 3 1 Institute of Computing Technology, Chinese Academy of Sciences 2 Graduate University of Chinese Academy of Sciences, Beijing, China 3 University of Toronto, Toronto, Canada IEEE Cluster 2012

Upload: kelly-robinson

Post on 18-Jan-2018

224 views

Category:

Documents


0 download

DESCRIPTION

Background Distributed publish/subscribe systems  Clients (publishers & subscribers)  Routers (a.k.a. brokers) … Distributed Router System … Advertisement

TRANSCRIPT

Page 1: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Community Clustering in Distributed Publish/Subscribe SystemWei Li1,2,Songlin Hu1, Jintao Li1, Hans-Arno Jacobsen3

1 Institute of Computing Technology, Chinese Academy of Sciences2 Graduate University of Chinese Academy of Sciences, Beijing, China3 University of Toronto, Toronto, Canada

IEEE Cluster 2012

Page 2: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Agenda

Background Algorithms Experiments Conclusions

Page 3: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Background

Distributed publish/subscribe systems Clients (publishers & subscribers) Routers (a.k.a. brokers)

Publisher

Publisher

Distributed Router SystemSubscriber

Subscriber

Broker Broker

BrokerBroker

BrokerAdvertisement

Page 4: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Background

Publisher

Publisher

Distributed Router SystemSubscriber

Subscriber

Broker Broker

BrokerBroker

Broker

Subscription

Advertisement

Page 5: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Background

Publisher

Publisher

Distributed Router SystemSubscriber

Subscriber

Broker Broker

BrokerBroker

BrokerAdvertisement

Subscription

Publication

Page 6: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Background

Distributed Publish/Subscribe Systems Loosely coupled communication abstraction Widely used in industry, for example

GooPS at Google PNUTS at Yahoo!

Page 7: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Client Placement Client placement affects performance of the

system Current solutions

Connecting to closest broker [Chen_05] Interest clustering of subscribers [Querzoni_08,

Riabov_02] Publisher dynamic placement [Cheung_10]

Limitations Complex communication relationships in interacting clients

are not considered The cost of client relocation is not considered

Page 8: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Algorithms

Problem definition Network of interacting clients

Distributed routers

Cluster1Cluster2 Cluster3 Cluster4 Cluster5

Page 9: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Algorithms

Problem definition cont’d. The allocation of clients to routers

Maximize the performance of the system Minimize the cost of client allocation

Page 10: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Agenda

Background Algorithms Experiments Conclusions

Page 11: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Algorithms

Overview

Page 12: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Algorithms

Steps Phase 1: Network construction among clients Phase 2: Community division of client network

Newman’s algorithm: modularity-based [Newman_04]

Page 13: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Algorithms

Steps Phase 3: Heuristic community clustering

Majority-place Mp:

Page 14: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Algorithms

Steps Phase 2 and Phase 3 are iterative: Re-divide

several communities into smaller ones Performance lose vs. deployment cost decrease Achieve trade off between performance and deployment

cost Phase 4: Load balancing

Page 15: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Agenda

Background Algorithms Experiments Conclusions

Page 16: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Experiments

Community clustering vs. interest clustering Experiment settings

Different relationship modes of clients Random Small-world Scale-free

Differently structured router overlays

Page 17: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Evaluation

Different relationship modes among clients Message distribution

Page 18: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Evaluation

Different relationship modes among clients Message latency & load reduction

Page 19: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Evaluation

Different cluster compositions

Page 20: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Agenda

Background Algorithms Experiments Conclusions

Page 21: Community Clustering in Distributed Publish/Subscribe System Wei Li 1,2,Songlin Hu 1, Jintao Li 1, Hans-Arno Jacobsen 3 1 Institute of Computing Technology,

Conclusions

A community clustering method is proposed for distributed publish/subscribe systems

Community clustering is effective to improve the performance under different experimental settings