privacy aware semantic dissemination

35
Privacy-Aware and Scalable Content Dissemination in Distributed Social Networks Pavan Kapanipathi 1,2 , Julia Anaya 1 , Amit Sheth 2 , Brett Slatkin 3 , Alexandre Passant 1 1 1 Digital Enterprise Research Institute, Galway – Ireland 2 Kno.e.sis, Dayton, OH- USA 3 Google, San Fransisco, CA - USA

Upload: pavan-kapanipathi

Post on 11-May-2015

341 views

Category:

Technology


1 download

DESCRIPTION

Centralized social networking websites raise scalability issues — due to the growing number of participants — and policy concerns — such as control, privacy and ownership of users’ data. Distributed Social Networks aim to solve those by enabling architectures where people own their data and share it whenever and to whomever they wish. However, the privacy and scalability challenges are still to be tackled. Here, we present a privacy-aware extension to Google’s PubSubHubbub protocol, using Semantic Web technologies, solving both the scalability and the privacy issues in Distributed Social Networks. We enhanced the tradi- tional features of PubSubHubbub in order to allow content publishers to decide whom they want to share their information with, using semantic and dynamic group-based definition. We also present the application of this extension to SMOB (our Semantic Microblogging framework). Yet, our proposal is application agnostic, and can be adopted by any system requiring scalable and privacy-aware content broadcasting.

TRANSCRIPT

Page 1: Privacy Aware Semantic Dissemination

Privacy-Aware and Scalable Content Dissemination in Distributed Social Networks

Pavan Kapanipathi 1,2, Julia Anaya 1, Amit Sheth 2, Brett Slatkin 3, Alexandre Passant 1

1

1 Digital Enterprise Research Institute, Galway – Ireland2 Kno.e.sis, Dayton, OH- USA

3 Google, San Fransisco, CA - USA

Page 2: Privacy Aware Semantic Dissemination

Agenda• Background• Motivation• Semantic Hub – Protocol• SMOB + Semantic Hub• Conclusion• Future Work

2

Page 3: Privacy Aware Semantic Dissemination

Social Networks

• Centralized Social Networks (CSN)s• Twitter and Facebook.

• What’s missing?• Users don’t own their data• Consistent privacy and performance concerns

• Distributed Social Networks• Diaspora, SMOB, StatusNet, OneSocialWeb

3

Page 4: Privacy Aware Semantic Dissemination

Distributed Social Networks• Users own their data• Own Social Space• Decentralized servers

• Better Performance

• Google’s PubSubHubbub protocol• Communication protocol used in most Distributed Social

Networks• Improves scalability • Near-instant notifications

4

Page 5: Privacy Aware Semantic Dissemination

PubSubHubbub

5

Publisher

Give me your latest content for

feed X, please

I have delegated distribution task

to a Hub

Subscriber

• PubSubHubbub – Communication protocol• Simple, Open, web-hook based pubsub protocol• Extension to RSS, Atom.

Page 6: Privacy Aware Semantic Dissemination

6

Publisher Hub

I want to subscribe to

feed X

Hey Did u send this??

Yup, its really me not a DoS

attacker.

Subscriber

• PubSubHubbub – Communication protocol• Simple, Open, web-hook based pubsub protocol• Extension to RSS, Atom.

PubSubHubbub

Page 7: Privacy Aware Semantic Dissemination

7

77

Publisher SubscriberHub

I have new content for

feed X

Give me the latest content

for feed X

Here it is

SubscriberSubscriber

SubscriberSubscriber

Here is the latest content

for feed X

• PubSubHubbub – Communication protocol• Simple, Open, web-hook based pubsub protocol• Extension to RSS, Atom.

PubSubHubbub

Page 8: Privacy Aware Semantic Dissemination

“Cisco just offered me a job! Now I have to weigh the utility of a fatty

paycheck against the daily commute to San Jose and hating the work.”

Motivation – Privacy• Twitter – Data is either

• Private – authorized followers• Public – Everyone (No authorization required to

follow)

8

http://www.msnbc.msn.com/id/29796962/ns/technology_and_science-tech_and_gadgets/t/twitter-gets-you-fired-characters-or-less/

Page 9: Privacy Aware Semantic Dissemination

Motivation• Micropost -- “good presentation about Linked Data: then, now

and next http://bit.ly/xyz”

• Decide on your audience • Almost certain about getting the message across to the right

audience

9

Page 10: Privacy Aware Semantic Dissemination

Protocol – Extension (Overview)• Publishers’ Preferences• Content also includes its preferences (recipients)

• Complexity at the Hub• Store Social Graph of the Publisher• Filter appropriate Subscribers to push the content

10

Page 11: Privacy Aware Semantic Dissemination

PubSubHubbub Protocol Extension

Pub

Follower - A

Follower - B

Follower - C

Follower - D

Hey I have new content for feed

X + my preference Y

Social Graph

Get the subscribers of Pub whose profile

matches preference Y

Here is the new content

of feed XGive me the new content

Here it is

Semantic Hub

11

Page 12: Privacy Aware Semantic Dissemination

Protocol In-Use• SMOB • Privacy-Aware Microblog dissemination

• Personalized Filtering• Twitter Stream (SPIM Workshop)

12

Page 13: Privacy Aware Semantic Dissemination

SMOB + Semantic Hub• Open and Distributed Semantic Microblogging Framework• http://smob.me

• PuSH implementation used to broadcast content• Privacy was a concern

• Our Privacy-Aware protocol implementation used• http://semantichub.appspot.com

13

Page 14: Privacy Aware Semantic Dissemination

SMOB – AInterest: Sematic Web

SMOB – BInterest: Knowledge Engg

SMOB – CInterest: Computer Networks

SMOB – DInterest: Sematic Search

RDF Semantic Hub

RDFPOST“Presentation at ISWC by me on

Sunday #semanticweb”

Get my privacy settings fordbPedia:Semantic_Web

Access Space: #semanticwebSELECT ?user WHERE {

?user foaf:topic_interest ?topic?topic dcterms:subject category:Semantic_Web

Get micropost+Access Space

There is a micropost

Get subscribers matching

Access Space

SMOB – BAnd

SMOB - D

Push Updates

14

Page 15: Privacy Aware Semantic Dissemination

SMOB – AInterest: Sematic Web

SMOB – BInterest: Knowledge Engg

SMOB – CInterest: Computer Networks

SMOB – DInterest: Sematic Search

RDF Semantic Hub

RDFPOST“Presentation at ISWC by me on

Sunday #semanticweb”

Get my privacy settings fordbPedia:Semantic_Web

Access Space: #semanticwebSELECT ?user WHERE {

?user foaf:topic_interest ?topic?topic dcterms:subject category:Semantic_Web

Get micropost+Access Space

There is a micropost

Get subscribers matching

Access Space

SMOB – BAnd

SMOB - D

Push Updates

15

Page 16: Privacy Aware Semantic Dissemination

Publisher – Privacy Preferences

• Feature based preferences at Publisher• Content generated for topic Linked Data• Micropost containing hashtag #knoesis

• Modeled using Privacy Preference Ontology (PPO)• Fine grained restrictions

• SPARQL Queries representing the subset of subscribers from the Social Graph in the hub

16Privacy Preference Ontology – Work by Owen Sacco, Alexandre Passant DERI

Page 17: Privacy Aware Semantic Dissemination

Privacy Preference Ontology (PPO)

• Publishing Data on Web as a part of Linked Open Data has privacy issues.• Web Access Control ontology (WAC)

• PPO -- Light weight vocabulary • On top of Web Access Control Ontology• Fine grained privacy preferences for RDF Data

• Example: Share email, phone number in my FOAF profile to Users with workplace as Kno.e.sis Center.

17

Page 18: Privacy Aware Semantic Dissemination

Privacy Preferences using PPO

18

Example: Restrict all the documents tagged with Semantic web to only those users who are interested in Semantic Web

<http://example.org/privacy/3> a ppo:PrivacyPreference; ppo : appliesToResource

<http://xmlns.com/foaf /0.1/Document>; ppo : hasCondition [

ppo : hasProperty tag:Tag;ppo : resourceAsObject dbpedia : Semantic Web

]; ppo: assignAccess acl :Read; ppo : hasAccessSpace [

ppo:hasAccessQuery ”SELECT ?user WHERE { ?user foaf : topic interest dbpedia:Semantic Web }”

] .

Page 19: Privacy Aware Semantic Dissemination

SMOB’s Simple Interface – Privacy Preference Generation

19

Page 20: Privacy Aware Semantic Dissemination

SMOB – AInterest: Sematic Web

SMOB – BInterest: Knowledge Engg

SMOB – CInterest: Computer Networks

SMOB – DInterest: Sematic Search

RDF Semantic Hub

RDFPOST“Presentation at ISWC by me on

Sunday #semanticweb”

Get my privacy settings fordbPedia:Semantic_Web

Access Space: #semanticwebSELECT ?user WHERE {

?user foaf:topic_interest ?topic?topic dcterms:subject category:Semantic_Web

Get micropost+Access Space

There is a micropost

Get subscribers matching

Access Space

SMOB – BAnd

SMOB - D

Push Updates

20

Page 21: Privacy Aware Semantic Dissemination

Semantic Hub – Distributed Social Semantic Graph• FOAF Profiles of Publishers/Subscibers

• Semantic Social Graph at the Semantic Hub• FOAF profiles linked using push vocabulary

• Distributed• Every user owns a Hub

21

Page 22: Privacy Aware Semantic Dissemination

FOAF -- Exchange

22

Pub/Sub

Hey, I want to Publish/Subscribe Content of feed X

Great, Since it is your first time. I

need your profile

OK, Let me confirm ur

identity

Here is my profile

Semantic Hub

Profile stored

Social Graph

Page 23: Privacy Aware Semantic Dissemination

Push Vocabulary • SIOC vocabulary • Models information from online communities• Communication protocols are not considered

• PubSubHubbub Vocabulary (push)• Models information of users communicating via Semantic Hub

23

Page 24: Privacy Aware Semantic Dissemination

24

Page 25: Privacy Aware Semantic Dissemination

SMOB – AInterest: Sematic Web

SMOB – BInterest: Knowledge Engg

SMOB – CInterest: Computer Networks

SMOB – DInterest: Sematic Search

RDF Semantic Hub

RDFPOST“Presentation at ISWC by me on

Sunday #semanticweb”

Get my privacy settings fordbPedia:Semantic_Web

Access Space: #semanticwebSELECT ?user WHERE {

?user foaf:topic_interest ?topic?topic dcterms:subject category:Semantic_Web

Get micropost+Access Space

There is a micropost

Get subscribers matching

Access Space

SMOB – BAnd

SMOB - D

Push Updates

25

Page 26: Privacy Aware Semantic Dissemination

Semantic Hub – Content Distribution

• RSS Extension• Element to include the privacy preference

• privacy, accessspace

• Each updated content is received by restricted subscribers• Privacy Preference for each content fetches the matched

subscribers• Preferences are deleted before distribution

26

Page 27: Privacy Aware Semantic Dissemination

RSS Extension

27

<item>

<title>Only Friends</title>

<description>

Send this to only people I know and interested in

Semantic Web

</description>

<link>http://example.org /rss</link>

<guid>123123123123</guid>

<pubDate>March 06 2001</pubDate>

<privacy>

<accessspace>

SELECT ?user WHERE {

foaf :me

foaf :knows ?user .

?user foaf : topic interest

dbpedia:Semantic Web . }

</accessspace>

</privacy>

</item>

Page 28: Privacy Aware Semantic Dissemination

Personalized Filtering of the Twitter Stream• Profile Generator

• Automatic generation of User Profiles

• Semantic Filter• Annotating Twitter Stream with concepts from Linked Open Data

• Semantic Hub• Delivering tweets to appropriate Interested Users (near real-time)

28

Page 29: Privacy Aware Semantic Dissemination

Architecture

Semantic Filter

Semantic Hub

Profile Generator

RDF

ANNOTATOR

RDF

RSS

Store and Query Topics

Notify Update

Fetch Updates

Get Interested Subscribers

Push

Upd

ates

Subs

crib

e

Create Profile

Store FOAF

The new iPhone has a 3.5-

inch screen, released today

Annotate: iPhone?user foaf:interest

dbPedia:iPhoneUnion

?user foaf:interestCategory:Apple

Get Subscribers

based on preference

Push Updates to Interested Users

Update RSS

29

Page 30: Privacy Aware Semantic Dissemination

Conclusion• Extension of Google’s PuSH protocol to cope with privacy

concerns.

• Implementation of the protocol

• Use in SMOB (Microblogging)

• Push vocabulary to model activities via PuSH protocol

30

Page 31: Privacy Aware Semantic Dissemination

Future Work• Enabling the architecture for mobile devices.

• Communicate to and from devices that are offline and still need to be notified.

31

Page 32: Privacy Aware Semantic Dissemination

Thanks

Acknowledgements:Fabrizio OrlandiOwen Sacco

Contact us:{pavan, amit}@knoesis.org{alexandre.passant, julia.anaya}@[email protected]

32This work is funded by (1) Science Foundation Ireland under grant number SFI/08/CE/I1380 (Lıon 2) and by an IRCSET scholarship supported by Cisco Systems (2) Social Media Enhanced Organizational Sensemaking in Emergency Response, National Science Foundation under award IIS-1111182, 09/01/2011 - 08/31/2014.

Page 33: Privacy Aware Semantic Dissemination

PubSubHubbub• PuSH is a scalable protocol, and Google provides a public hub

that people can use to broadcast their content. • This public hub delivers for approximately 40 million unique

active feeds• 117 million subscriptions. • In two years, approximately 5.5 billion unique feeds have been

delivered• fetching 200 to 400 feeds• delivering 400 to 600 of them per second. • Its largest subscribers get between 20 and 120 updates per

second from the hub. 33

Page 34: Privacy Aware Semantic Dissemination

34

Page 35: Privacy Aware Semantic Dissemination

35