recmax: exploiting recommender systems for fun …...benefits of recmax 7 targeted marketing in...

40
Amit Goyal Laks V. S. Lakshmanan RecMax: Exploiting Recommender Systems for Fun and Profit University of British Columbia http://cs.ubc.ca/~goyal

Upload: others

Post on 10-Aug-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Amit Goyal

Laks V. S. Lakshmanan

RecMax: Exploiting Recommender Systems for

Fun and Profit

University of British Columbia

http://cs.ubc.ca/~goyal

Page 2: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Recommender Systems2

Movies Products Music

Videos News Websites

Page 3: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

RecMax – Recommendation

Maximization3

Previous research mostly focused on improving

accuracy of recommendations.

In this paper, we propose a novel problem RecMax

(short for Recommendation Maximization).

Can we launch a targeted marketing

campaign over an existing operational

Recommender System?

Page 4: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Consider an item in a Recommender

System4

Some users rate the item

(seed users)

Because of these ratings,

the item may be

recommended to some

other users.

Flow of

information

RecMax: Can we strategically select the seed users?

Page 5: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

RecMax5

Seed Users

Flow of

information

Users to whom the

item is recommended

Select k seed users such that

if they provide high ratings to a new product,

then the number of other users to whom the product is

recommended (hit score) by the underlying

recommender system algorithm is maximum.

Page 6: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

RecMax – Problem Formulation6

Recommendations Expected Rating

Harry Potter 4.8

American Pie 4.3

….

The Dark Knight 3.2

Num

ber o

f reco

mm

end

atio

ns a

re l

Recommendation List for user v ratin

g th

resh

old

of u

se

r v

(de

no

ted

by θ

v )

For a new item i, if expected rating

R(v,i) > θv, then the new item is

recommended to v

f (S) = I(R(v, i) >qvvÎV-S

å )

The goal of RecMax is to find a seed set S such that hit

score f(S) is maximized.

Page 7: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Benefits of RecMax7

Targeted marketing in Recommender Systems

Marketers can effectively advertise new products on a Recommender System platform.

Business opportunity to Recommender System platform.

Similar to Influence Maximization problem in spirit.

Beneficial to seed users

They get free/discounted samples of a new product.

Helpful to other users

They receive recommendations of new products –solution to cold start problem.

Page 8: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

A key Challenge – Wide diversity of

Recommender Systems8

Recommender Systems

Content BasedCollaborative

Filtering

Model Based

Matrix Factorization

Memory Based

User-based Item-based

Similarity functions: Cosine, Pearson,

Adjusted Cosine etc

Due to this wide diversity, it is very difficult to study

RecMax

Page 9: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Outline9

What is RecMax?

Does Seeding Help? – Preliminary Experiments.

Theoretical Analysis of RecMax.

Experiments.

Conclusions and Future Work.

Page 10: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Does Seeding Help?

Dataset: Movielens

Recommender

System: User-based

Seeds are picked

randomly.

Recall that Hit Score

is the number of users

to whom the product

is recommended.

10

0

1000

2000

3000

4000

5000

6000

0 100 200 300 400 500 600 700 800 900 1000

Hit S

core

Seed Set Size

User-based

A budget of 500 can get a hit score of 5091 (10x)

(User-based)

Page 11: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Does Seeding Help?11

0

100

200

300

400

500

600

700

0 5 10 15 20 25 30

Hit S

core

Seed Set Size

Item-based Dataset: Movielens

Recommender

System: Item-based

Seeds are picked

randomly.

Recall that Hit Score

is the number of

users to whom the

product is

recommended.

A budget of 20 can get a hit score of 636 (30x)

(Item-based)

Page 12: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Outline12

What is RecMax?

Does Seeding Help? – Preliminary Experiments.

Theoretical Analysis of RecMax.

Experiments.

Conclusions and Future Work.

Page 13: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Key Theoretical Results13

RecMax is NP-hard to solve exactly.

RecMax is NP-hard to approximate within a factor

to 1/|V|(1-ε) for any ε> 0.

No reasonable approximation algorithm can be

developed.

RecMax is as hard as Maximum Independent Set Problem.

Under both User-based and under Item-based.

Page 14: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Why is RecMax so hard? (1/2)14

We introduce a helper problem – Maximum

Encirclement Problem

find a set S of size k such that it encircles maximum

number of nodes in the graph.

A

D

B

C

E

• Nodes {B,C} encircle node A.

• Nodes {B,C,E} encircle node D.

• Thus, {B,C,E} encircle A and D.

Page 15: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Why is RecMax so hard? (2/2)15

A

D

B

C

E

• Set {B,C,E} is a solution to Maximum Encirclement Problem

(for k=3).

• Nodes {A,D} form Maximum Independent Set.

• Reduction: Nodes {B,C,E} must

rate the new item highly for the

item to be recommended to A

and D.

• RecMax is as hard as Maximum

Independent Set, and hence

NP-hard to approximate within

a factor to 1/|V|(1-ε)

Page 16: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Discussion (1/2)16

We show hardness for User-based and Item-based

methods.

What about Matrix Factorization?

Most likely hardness would remain (future work).

Page 17: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Discussion (2/2)17

Since the problem is hard to approximate, does it

make sense to study?

YES, as we saw earlier, even a random heuristic fetches

impressive gains.

We explore several natural heuristics and compare

them.

What about sophisticated heuristics (future work).

Page 18: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Outline18

What is RecMax?

Does Seeding Help? – Preliminary Experiments.

Theoretical Analysis of RecMax.

Experiments.

Conclusions and Future Work.

Page 19: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Datasets19

Page 20: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Heuristics20

Random: Seed set is selected randomly. The

process is repeated several times and average

is taken.

Most-Active: Top-k users with most number of

ratings.

Most-Positive: Top-k users with most positive

average ratings.

Most-Critical: Top-k users with most critical

average ratings.

Page 21: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Heuristics21

Most-Central: Top-k central users.

agg(u) = sim(u,v)vÎV-u

å

Page 22: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

User-Based Recommender Systems22

Page 23: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Comparison – Hit Score achieved23

0

1000

2000

3000

4000

5000

6000

0 100 200 300 400 500 600 700 800 900 1000

Hit S

core

Seed Set Size

Most CentralMost Positive

RandomMost CriticalMost Active

Dataset: Movielens

Recommender

System: User-based

Most Central, Most Positive and Random perform good here.

Page 24: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Comparison – Hit Score achieved24

Dataset: Yahoo!

Music

Recommender

System: User-based

Most Positive, Most Central perform good here.

0

1000

2000

3000

4000

5000

6000

7000

8000

0 100 200 300 400 500 600 700 800 900 1000

Hit S

core

Seed Set Size

Most PositiveMost Central

RandomMost Active

Most Critical

Page 25: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Comparison – Hit Score achieved25

Dataset: Jester Joke

Recommender

System: User-based

Most Central out-performs all other heuristics.

0

5000

10000

15000

20000

25000

0 100 200 300 400 500 600 700 800 900 1000

Hit S

core

Seed Set Size

Most CentralMost Positive

RandomMost CriticalMost Active

Page 26: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Key Takeaways26

Even the simple heuristics perform well.

With a budget of 300, Most-Central heuristic

achieves hit score of 4.4K, 3.4K and 15.6K on

Movielens, Yahoo! and Jester respectively.

Depending on the data set, we may encounter a

“tipping point” – a minimum seeding is needed for

the results to be impressive.

Page 27: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Item-Based Recommender Systems27

Page 28: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Comparison – Hit Score achieved28

Dataset: Movielens

Recommender

System: Item-based

Most Central performs good here.

0

200

400

600

800

1000

1200

0 100 200 300 400 500 600 700 800 900 1000

Hit S

core

Seed Set Size

Most CentralUB Most Central

RandomMost Active

Most CriticalMost Positive

Page 29: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Comparison – Hit Score achieved29

Dataset: Yahoo!

Music

Recommender

System: Item-based

Most Central performs good here.

0

500

1000

1500

2000

0 100 200 300 400 500 600 700 800 900 1000

Hit S

core

Seed Set Size

Most CentralRandom

Most CriticalUB Most Central

Most PositiveMost Active

Page 30: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Comparison – Hit Score achieved30

Dataset: Jester Joke

Recommender

System: Item-based

Most Central, Random and Most-Active performs good here.

0

2000

4000

6000

8000

10000

12000

14000

0 100 200 300 400 500 600 700 800 900 1000

Hit S

core

Seed Set Size

Most CentralRandom

Most CriticalMost Active

UB Most CentralMost Positive

Page 31: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Key Takeaways31

Again, the simple heuristics perform well.

Hit score achieved in Item-based is much lower than

in User-based.

Thus, much less seeding is required to achieve

maximum possible hit score.

Overall, Most-Central performs well.

The difference of Most-Central with baseline

Random is not much.

We need better heuristics (future work).

Page 32: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

User-Based vs Item-Based32

Page 33: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

User-based vs Item-based33

Dataset: Yahoo!

Music

Initial rise of hit

score is steeper in

Item-based.

Hit score saturates

much earlier in Item-

based.

Eventual hit score that can be achieved is much

more in User-based.

0

500

1000

1500

2000

2500

3000

3500

0 50 100 150 200 250

Hit S

core

Seed Set Size

User-basedItem-based

Page 34: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

User-based vs Item-based34

Common Seeds

(out of 1000 seeds)

Movielens 103 (10.3%)

Yahoo! Music 219 (21.9 %)

Jester Joke 62 (0.62 %)

Seed Sets are different in both methods.

Page 35: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Outline35

What is RecMax?

Does Seeding Help? – Preliminary Experiments.

Theoretical Analysis of RecMax.

Experiments.

Conclusions and Future Work.

Page 36: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Our Contributions36

The main goal of the paper is to propose and study

a novel problem that we call RecMax

Select k seed users such that if they endorse a new

product by providing relatively high ratings, the number

of users to whom the product is recommended (hit score)

is maximum.

We focus on User-based and Item-based recommender

systems.

We offer empirical evidence that seeding does help

in boosting the number of recommendations

Page 37: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Our Contributions37

We perform a thorough theoretical analysis of RecMax.

RecMax is NP-hard to solve exactly.

RecMax is NP-hard to approximate within any reasonable factor.

Given this hardness, we explore several natural heuristics on 3 real world datasets and report our findings.

Even simple heuristics like Most-Central provide impressive gains

This makes RecMax an interesting problem for targeted marketing in recommender systems.

Page 38: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Future Work38

RecMax is a new problem and has real applications

– our work is just the first work.

Developing better heuristics.

Studying RecMax on more sophisticated

recommender systems algorithms

Matrix Factorization.

Page 39: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Thanks and Questions39

University of British Columbia

RecMax: Exploiting Recommender Systems for

Fun and Profit

Amit Goyal

Laks V. S. Lakshmanan

University of British Columbia

http://cs.ubc.ca/~goyal

Page 40: RecMax: Exploiting Recommender Systems for Fun …...Benefits of RecMax 7 Targeted marketing in Recommender Systems Marketers can effectively advertise new products on a Recommender

Heuristics40

Most-Central: Top-k central users.