using trust in recommender systems: an experimental analysis

Using Trust in Recommender Systems:

an experimental analysis

Paolo Massa

University of Trento

(joint work with Bobby Bhattacharjee, UMD)

Motivation:

1. Recommender Systems recommends items the user might like, based on past ratings.

2. Now, Decentralized publishing of info:– Ratings on Items

– Trust on Principals[Semantic Web]

3. New issues (sparseness, scalability, trust, attacks, ...)

... Trust-aware Decentralized RS

Summary

1. Recommender Systems (RSs)– Weaknesses

2. Solution: trust-awareness– Trust and trust metrics

3. Experiments on Epinions.com – Evidence trust solves RSs problems

– (~50.000 users!)

4. Future works

Collaborative Filtering (CF)

1. Input: ratings given by users to items● I like “ Titanic” as 4/5

2. I ask recommendation

3. RS computes the similarity of me against every other user

● Pearson correlation coefficient

4. RS find similar users and suggests to me items liked by them.

I

It does not consider the content of the items, only the ratings given by users.

It works independently of the domain (also jokes)

BUT

Overlapping of rated items required!

?

52 45

2 5 5User1

31

5 5 1

5User2

User3

User4

Item

1Ite

m2

Item

3

Item

4

5?2 5 55

52 4552 45

52 55

RSs weaknesses

1. Ratings Matrix sparseness (95-99%)– Low or No overlapping (users not comparable)

2. Cold start– New users have 0 ratings (->not comparable)

3. Easy Attacks by Malicious Users– Copy profile and become the most similar

4. Hard to understand and control– Black box (bad recs -> user gives up)

Solution? Trust of course!

Trust-awareness

1. Trust statement =Rating by human to human about her usefulness (ex: in providing good movie reviews)

2. Explicitly provided

3. Trust is subjective! T(A,Z)=1 & T(B,Z)=0

– No Global BAD principals!!!

4. Trust is asymmetric! I trust Bill Gates.

5. FOAF (Friend-Of-A-Friend) is an XML format to express relationships

– Some millions files out there...

Trust Networks

ME

6 degrees of separation “ theorem”

Trust metrics

1. Task: based on known trust edges, predict trustworthiness of principals

2. Trust propagation (A->B,B->C|A-?->C)

3. Global (pagerank, ebay, ...)

4. Local (personalized)

ME

Trust solves RS problems1. Trust solves CF sparseness problem

– trust propagation and “ 6 degrees” -> reach many

2. Trust solves Cold Start problem

– “ just add 1 friend”

3. Trust metrics resistant to copy-profile-attack.

– “ you can be similar but if no trust path to you ...”

4. Trust easier to understand and control

– trust nets supports Explanation (HCI tests needed)

EVIDENCE of 1 and 2 provided by analyzing a REAL, VAST community (Epinions.com)

Experiment: Epinions.com

1. Epinions.com' users can– Review and rate items (from 1 to 5)

– Keep web of trust (trust=1) and block list (trust=0).

– “ Reviewers whose reviews and ratings you have consistently found to be valuable” (Epinions FAQ)

2. Dataset (by crawling site):– ~50K users, ~140K items, ~660K ratings.

– ~500K trust statements. • No block list (not shown on site)

Epinions' recommendations

Taken one user “ ME” , we can

- use CF on ratings and compute “ similarity” of other users

- use Trust Metric and compute “ trustworthiness” of other users

Then we can suggest items liked by similar or trustable users.

On how many users are they computable?

Statistics (1)

#Ratings expressed by Users

(#rev<5) = 52.82%! [Cold start users]

#Trust statements expressed by Users

(#trust<5) = 70.18%!

Statistics (2)

User Similarity Computability

1. Ideally, every user should be comparable against every other user.

2. BUT ratings sparseness = 99.99135% -> tiny overlapping between 2 users

3. Pearson correlation coefficient meaningful only if overlapping(A,U)>1

4. Question: taken one user, how many users are comparable?

US computability (cont.)

1. Taken one user, we computed all the comparable users.

– On average an user has 161 comparable users (ideally ~50.000!)

2. We have averaged #comparable_users over users who expressed a certain number of reviews.

US computability (cont.)

Ex: users with 40 reviews have ~800 comparable users.

BUT users (y axis) are ~50.000!

And for Cold Start Users (>50%) this is 2.74

Cold Start Users

Trust computability

1. Trust metrics predict trust in unknown users based on known trust statements.

2. Distance from ME to U is a first measure of Trust computability

3. On average, – In 2 steps, reach 400 users

– In 3 steps, reach 4386 users

Mean # Reachable Users (in k steps) for users expressing X trust statements

In few steps, you can predict trust in every user!

Even for Cold Start Users!!!

Trust and US computability comparison

Mean number of Comparable users for All users

Propagating Trust Using PearsonDist 1 Dist 2 Dist 3 Dist 4

Mean number of Comparable users for Cold Start users

Propagating Trust Using PearsonDist 1 Dist 2 Dist 3 Dist 4

9.88 400 4386 16334 161 2.14 94.54 1675 9121 2.74

Contribution

Experimental evidence that– CF is ineffective in real world scenarios

• Especially for Cold Start users.

– Trust can solve CF problems• Sparseness

• Cold Start

• Attacks (self-evident)

Trust is computable on many more users than user similarityEspecially for cold start users (the majority!)

Future works

1. US and Trust correlate? Contradict?– US over trusted is higher than usual?

2. Distrust?– Propagation? Properties?

3. Design a Trust Metric (for RS)– Create and evaluate a Trust-aware RS

• Input data

Thanks for your attention!

Questions?

Paolo Massa

Email: [email protected]

Blog: http://moloko.itc.it/paoloblog/index.html

Collaborative FilteringSimilarity measure: Pearson Correlation Coefficient of user a and u

Prediction of rating given by user a to item i

wa , u=∑i=1

mr a , i−r a r u , i−r u

� ∑i=1

mr a , i−r u

2∑i=1

mr u , i−r u

2

pa , i=r a∑u=1

nr u , i−r u ∗wa , u

∑u=1

nwa , u

Hard Trust and Soft Trust

1. Vocabulary:– Hard Trust: about security, identity of

something (user, device, information)• Public key cryptography

– Soft Trust: appreciation of some principal (explicitly provided by another principal)• Social Networks and Trust Metrics

using trust in recommender systems: an experimental analysis

Technology

users users

web of trust trust

trustawareness trust

items trust

evidence trust

trust networksme

trust easier

trust path