#sdow2011 keynote: user modeling and personalization on twitter

31
Delft University of Technology User Modeling and Personalization on Twitter SDoW, ISWC, Bonn, Oct 23, 2011 Fabian Abel Web Information Systems, TU Delft

Upload: web-information-systems-tu-delft

Post on 28-Aug-2014

1.403 views

Category:

Technology


1 download

DESCRIPTION

Keynote at http://sdow.semanticweb.org/2011/

TRANSCRIPT

Page 1: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

DelftUniversity ofTechnology

User Modeling and Personalization on Twitter SDoW, ISWC, Bonn, Oct 23, 2011

Fabian AbelWeb Information Systems, TU Delft

Page 2: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

2User Modeling and Personalization on Twitter

#papers that use Twitter datasets

time2006 2007 2008 2009 2010 2011 2012

Page 3: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

3User Modeling and Personalization on Twitter

Perspectives on Twitter data

Grrrr…that is gr8http://bit.ly/47gt3

@bob

What are Bob’s personal interests? What are his current demands?...

Page 4: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

4User Modeling and Personalization on Twitter

PersonalizedRecommendations

Personalized Search Adaptive Systems

What we do: Science and Engineering for the Personal Web

Social Web

Analysis and User Modeling

user/usage data

Semantic Enrichment, Linkage and Alignment

domains: news social media cultural heritage public data e-learning

Page 5: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

5User Modeling and Personalization on Twitter

User Modeling Challenge

I want my personalized

news recommendatio

ns!Analysis and User Modeling

Semantic Enrichment, Linkage and Alignment

Personalized News Recommender

Profile

?

(How) can we infer a Twitter-based user profile that

supports a news recommender?

Page 6: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

6User Modeling and Personalization on Twitter

User Modeling FrameworkBuilding Blocks for generating valuable user profiles

Geert-Jan Houben Ke TaoQi GaoFabian, Qi, Geert-Jan, Ke: Analyzing User Modeling on Twitter for Personalized News Recommendations. UMAP 2011

Page 7: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

7User Modeling and Personalization on Twitter

User Modeling Building Blocks

Profile?concept weight

?

time

1. Which tweets of the user should be

analyzed?

Morning:Afternoon:Night:

1. Temporal

Constraints

June 27 July 4 July 11

(b) temporal patterns

weekendsstart end

(a) time period

Page 8: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

8User Modeling and Personalization on Twitter

User Modeling Building Blocks

Profile?concept weight

2. Profile Type

Francesca Schiavone won French Open #fo2010 ?

Francesca Schiavone

FrenchOpen

Francesca Schiavone French Open entity-

based

SportT

T topic-based

2. What type of concepts should represent

“interests”?

# fo2010

#fo2010# hashtag-

based

1. Temporal

Constraints

timeJune 27 July 4 July 11

Page 9: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

9User Modeling and Personalization on Twitter

User Modeling Building Blocks

Profile?concept weight

2. Profile Type

Francesca Schiavone won! http://bit.ly/2f4t7a

Francesca Schiavone

3. Further enrich the semantics of tweets?

1. Temporal

Constraints

3. Semantic

Enrichment

Francesca Schiavone

Francesca wins French Open

Thirty in women'stennis is primordially old, an age when agility and desire recedes as the …

French Open

Tennis

French OpenTennis

(b) further enrichment

(a) tweet-based

Page 10: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

10User Modeling and Personalization on Twitter

User Modeling Building Blocks

Profile? concept weight

2. Profile Type

4. How to weight the concepts?

1. Temporal

Constraints

3. Semantic

Enrichment

Francesca SchiavoneFrench OpenTennis

4. Weighting Scheme

timeJune 27 July 4 July 11

?weight(Francesca Schiavone)

Concept frequency (TF)

4

weight(French Open)weight(Tennis)

36

TFxIDFTime-sensitive

Page 11: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

11User Modeling and Personalization on Twitter

(a)hashtag-based(b)entity-based(c)topic-based

User Modeling Building Blocks

2. Profile Type

1. Temporal

Constraints

3. Semantic

Enrichment4.

Weighting Scheme

(a)time period(b)temporal patterns

(a)tweet-based(b)further enrichment

(a)concept frequency

Page 12: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

12User Modeling and Personalization on Twitter

AnalysisHow do the user modeling building blocks impact the (temporal) characteristics of Twitter-based user profiles?

(a)hashtag-based(b)entity-based(c)topic-based

2. Profile Type

1. Temporal

Constraints

3. Semantic

Enrichment4.

Weighting Scheme

(a)time period(b)temporal patterns

(a)tweet-based(b)further enrichment

(a)concept frequency

Page 13: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

13User Modeling and Personalization on Twitter

Dataset

timeNov 15 Dec 15 Jan 15

20,000 Twitter users

10,000,000 tweets

2 months

more than:

75,000 news articles

WikiLeaks founder, Julian Assange, under arrest in

London

Available online: http://wis.ewi.tudelft.nl/umap2011/

Page 14: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

14User Modeling and Personalization on Twitter

Size of user profiles

entity-based

topic-based hashtag-based

~5% of the users do not make use of hashtags hashtag-based profiles are empty

Entity-based user modeling succeeds for 100% of the users

Profile Type

Page 15: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

15User Modeling and Personalization on Twitter

Tweet-based

further enrichment(e.g. exploiting links)

topic-based user profiles

More distinct entities per profile

further enrichment(e.g. exploiting links)

Tweet-based

entity-based user profiles

Impact of Semantic Enrichment

Exploiting external resources allows for significantly richer user profiles (quantitatively)

More distinct topics per profile

Semantic Enrichment

Page 16: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

16User Modeling and Personalization on Twitter

?

User Profiles change over time

d1-distance:

difference between current profile and

past profile

d1(r p x,

r p current ) = | px,i − pcurrent ,i |i

Example:

d1(0.50.50

⎜ ⎜ ⎜

⎟ ⎟ ⎟,

0.50

0.5

⎜ ⎜ ⎜

⎟ ⎟ ⎟) =1

music

footballtennis

old new

Hashtag-based profiles change stronger than entity-based and topic-based profiles

#

TThe older the profile the more it differs from the current profile

Temporal Constraint

s

Page 17: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

17User Modeling and Personalization on Twitter

Temporal patterns of user profiles

topic-based user profiles

weekday vs. weekend profilesd1(pweekday, pweekend)

day vs. night profilesd1(pday, pnight)

1. Weekend profiles differ significantly from weekday profiles

2. the difference is stronger than between day and night profiles

2

Temporal Constraint

s

Page 18: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

18User Modeling and Personalization on Twitter

Observations• Semantic enrichment allows for richer user profiles• Profiles change over time: fresh profiles seem to

better reflect current user demands• Temporal patterns: weekend profiles differ

significantly form weekday profiles

Page 19: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

19User Modeling and Personalization on Twitter

EvaluationHow do the user modeling building blocks impact the quality of Twitter-based profiles for personalized news recommendations?

(a)hashtag-based(b)entity-based(c)topic-based

2. Profile Type

1. Temporal

Constraints

3. Semantic

Enrichment4.

Weighting Scheme

(a)time period(b)temporal patterns

(a)tweet-based(b)further enrichment

(a)concept frequency

And can we benefit from the findings of the analysis to improve recommendations?

Page 20: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

20User Modeling and Personalization on Twitter

Twitter-based Profiles for Personalization

• Task: Recommending news articles (= tweets with URLs pointing to news articles)

• Recommender algorithm: cosine similarity between user profile and tweets

• Ground truth: re-tweets of users• Candidate items: news article tweets posted

during evaluation period

time

P(u)= ?

1 week

Recommendations = ?

5.5 relevant tweets per user

5529 candidate news articles

Page 21: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

21User Modeling and Personalization on Twitter

Overview: Performance of User Modeling strategies

Entity-based strategy improves the recommendation quality significantly (MRR & S@10)

Topic-based strategy improves S@10 significantly

T

#

Profile Type

Page 22: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

22User Modeling and Personalization on Twitter

Impact of Semantic Enrichment

Tweet-based

Further enrichment

Further semantic enrichment (exploiting links) improves the quality of the Twitter-based profiles!

T

Semantic Enrichment

Page 23: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

23User Modeling and Personalization on Twitter

Impact of temporal characteristicsSelection of

temporal constraints depends on type of

user profile.

•Topic-based profiles: adapting to temporal context is beneficial• Entity-based profiles: long-term profiles perform better

Adapting to temporal context helps?

yes

no

yes

no

T

T

time

startcomplet

eend

complete: 2 months

Recommendations = ?

startfresh

fresh: 2 weeks

time

start end

Recommendations = ?

weekends

Temporal Constraint

s

Page 24: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

24User Modeling and Personalization on Twitter

Observations• Best user modeling strategy: Entity-based > topic-

based > hashtag-based • Semantic enrichment improves recommendation

quality • Adapting to temporal context helps for topic-based

strategy

Page 25: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

25User Modeling and Personalization on Twitter

3 Research QuestionsEngineering-UM-Personalization Perspective

Page 26: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

26User Modeling and Personalization on Twitter

Semantic Web Engineering Perspective on Twitter (and other social) data

Social Webdata

Applications…that understand and

leverage Social Web data

Social Web vocabulary, e.g. Twitter language

Model of the application, e.g. news

categories

Mining Semantics

# fo2011

sports -> tennis

translate &

integrate

What is the actual impact of mining and integrating social data on the application?Evaluate!

Page 27: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

27User Modeling and Personalization on Twitter

How can we find “information” in social (micro-)streams? How can personalization help?

Answer

Question

translate between query and Twitter vocabulary

compose answer

see also TREC Microblogging Task: http://trec.nist.gov/data/tweets/

1. Search on Twitter

Page 28: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

28User Modeling and Personalization on Twitter

What kind of knowledge can we learn from users’ micro-blogging activities and how can we (re-)use it for what types of applications?

Applications…that understand and

leverage Social Web data

2. Re-using Twitter data in other applications

translate & integrate between application and Twitter vocabulary

Page 29: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

29User Modeling and Personalization on Twitter

Example: improving product recommendations with Twitter data?

dbpedia:Dog

dbpedia:Food

dbpedia:Mark_Haddon

I would never eat dogs!

Page 30: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

30User Modeling and Personalization on Twitter

Narcissus

3. Personalization and Serendipity

How can we balance between personalization and serendipity?

Profile Cross-system UM: get complete picture about a personReasoning: what type of

things could surprise and interest a person?

Cross UM dataset: [email protected]

Page 31: #SDoW2011 Keynote: User Modeling and Personalization on Twitter

31User Modeling and Personalization on Twitter

Thank you!

Twitter: @fabianabel