iptv recommender systems

56
IPTV Recommender Systems Paolo Cremonesi

Upload: rockys11

Post on 06-May-2015

1.619 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: IPTV Recommender Systems

IPTV Recommender Systems

Paolo Cremonesi

Page 2: IPTV Recommender Systems

2

Paolo Cremonesi - Recommender Systems

Agenda

• IPTV architecture• Recommender algorithms• Evaluation of different algorithms• Multi-model systems

Page 3: IPTV Recommender Systems

3

Paolo Cremonesi - Recommender Systems

Valentino Rossi

Page 4: IPTV Recommender Systems

4

Paolo Cremonesi - Recommender Systems

CustomersService Provider Network ProviderContent Provider

IPTV architecture

Live TV

VOD

Set-top-box(decoder)

Page 5: IPTV Recommender Systems

5

Paolo Cremonesi - Recommender Systems

IPTV architecture

• IPTV is a video service supplied by a telecom service provider that owns the network infrastructure and controls content distribution over the broadband network for reliable delivery to the consumer (generally to the TV/IP STB).

• ServicesBroadcast TV (BTV) services which consist in the simultaneous reception by the users of a traditional TV channel, Free-to-air or Pay TV. BTV services are usually implemented using IP multicast protocols. Video On Demand (VOD) services, which consist in viewing multimedia contents made available by the Service Provider, uponrequest. VOD services are usually implemented using IP unicastprotocols.

Page 6: IPTV Recommender Systems

6

Paolo Cremonesi - Recommender Systems

CUSTOMER FRUSTRATION

CUSTOMER PURCHASES

CUSTOMERS FACEDIFFICULTIES FINDING THE “RIGHT” CONTENTHUNDREDS

LIVECHANNELS

THOUSANDSVOD

ITEMS

IPTV Platform: Now

Page 7: IPTV Recommender Systems

7

Paolo Cremonesi - Recommender Systems

Today recommendations,based on your personal taste, are:

From this….

To this.

IPTV Platform: with a recommender systems

Page 8: IPTV Recommender Systems

8

Paolo Cremonesi - Recommender Systems

IPTV recommender needs

• Improve user satisfaction

• Sell new content to usersVOD Pay-per-view channels

• Targeting advertisement

Page 9: IPTV Recommender Systems

9

Paolo Cremonesi - Recommender Systems

Agenda

• IPTV architecture• Recommender algorithms• Evaluation of different algorithms• Multi-model systems

Page 10: IPTV Recommender Systems

10

Paolo Cremonesi - Recommender Systems

Recommender System: how it works

USERDATA

USER’S TASTE

FRUTIONS ANDRATINGS

CONTENTMETADATA

RECOMMENDER

SYSTEM

CONTENTRECOMMENDATIONS

Page 11: IPTV Recommender Systems

11

Paolo Cremonesi - Recommender Systems

Problem formulation

RecommenderUsers ratings Items metadata

•Item1•Item2•Item3

•.•.•.

•ItemX

Ranked list

Top N

Page 12: IPTV Recommender Systems

12

Paolo Cremonesi - Recommender Systems

Similar ItemsUsers with

similar taste

Recommendation techniques

CollaborativeFiltering

Content-basedFiltering

Userbased

Itembased

Recommenderalgorithms

Page 13: IPTV Recommender Systems

13

Paolo Cremonesi - Recommender Systems

Memory vs. model based

XContent-based

XDimensional-reduction

XItem-based

XUser-based

Modelbased

Memorybased

Page 14: IPTV Recommender Systems

14

Paolo Cremonesi - Recommender Systems

Collaborative Filtering

4 5

2 2

?3

Neighborhood

User-basedsimilar users rate an item similarly

Item-basedsimilar items are rated by a user similarly

NB: similarity means correlation

User-basedsimilar users rate an item similarly

Page 15: IPTV Recommender Systems

15

Paolo Cremonesi - Recommender Systems

Collaborative filtering: User Rating Matrix

User

Item

Page 16: IPTV Recommender Systems

16

Paolo Cremonesi - Recommender Systems

I1 I2 I3 I4

U1 3 4 0 1

U2 2 2 1 0

U3 2 0 0 4

U4 1 5 0 1

U5 3 0 1 0

Explicit URM

User rating matrix URM

I1 I2 I3 I4

U1 0 1 0 1

U2 0 0 1 1

U3 1 1 0 0

U4 1 1 0 1

U5 0 1 1 1

Implicit URM

Page 17: IPTV Recommender Systems

17

Paolo Cremonesi - Recommender Systems

Dimensional-reduction collaborative model

• items and users can be described by a number (K) of unknown features

• auf : describes if feature f is important for user u• bif : describes if feature f is present in item i• rui : rating assigned by (or estimated) user u to item i

rui =kf=1 auf · bif

Page 18: IPTV Recommender Systems

18

Paolo Cremonesi - Recommender Systems

Singular Value Decomposition

A = U S VTVTR

A =

VkT

Rk Uk

Sk

m x n

m x n m x k k x k k x n

Page 19: IPTV Recommender Systems

19

Paolo Cremonesi - Recommender Systems

Singular Value Decomposition

UT ·U = I

V ·VT = I

R = U · S ·VT

Page 20: IPTV Recommender Systems

20

Paolo Cremonesi - Recommender Systems

Singular Value Decomposition

Rk : best rank-k approximation of R according to the Frobenious normnot according least square error!!

UTk ·Uk = I

Vk ·VTk = I

Rk = Uk · Sk ·VTk

Page 21: IPTV Recommender Systems

21

Paolo Cremonesi - Recommender Systems

Folding-in

• New rows/columns of A are projected (folded-in) in the existing latent space without computing a new SVD

• e.g., a new user u

u’ = u Vk Sk-1

Ak

u

Uk

u’

Sk Vk

Page 22: IPTV Recommender Systems

22

Paolo Cremonesi - Recommender Systems

Collaborative Filtering: pro & cons

• Pro: There is no need for content

• Cons:Cold Start: we needs to have enough users in the system to find a match.Sparsity: when the user/ratings matrix is sparse it is hard to find a neighbourhood.First Rater: cannot recommend an item that has not been previously rated anyone elsePopularity Bias: cannot recommend items to someone with unique tastes. Tends to recommend popular items (dataset coverage)

Page 23: IPTV Recommender Systems

23

Paolo Cremonesi - Recommender Systems

Content-based Filtering

...mettendo a punto una scoperta che potrebbe portare al primo uso terapeutico della controversa procedura. Se gli studi animali si riveleranno promettenti, i ricercatori potrebbero cominciare a mettere alla prova le nuove cellule su occhi umani da qui a due anni...

Term 1

Term2

Term

3

• Similar items contain the same terms• The more a term occurs in an item, the more representative it is• The more a term occurs in the collection, the less representative it is

(i.e. it is less important in order to distinguish a specific item)

Page 24: IPTV Recommender Systems

24

Paolo Cremonesi - Recommender Systems

Content-based filtering: Item-Content Matrix

Word

Item

Page 25: IPTV Recommender Systems

25

Paolo Cremonesi - Recommender Systems

Content-based Filtering: techniques

User-item similarity

Term 1

Term2

Term

3

Page 26: IPTV Recommender Systems

26

Paolo Cremonesi - Recommender Systems

Content-based Filtering: pro & cons

• Pro: No need for data on other usersNo cold-start or sparsity problems, neither first-rater

• Able to recommend to users with unique tastes• Able to recommend new and unpopular items

Can provide explanations about recommended itemsWell-known technology

• Cons:Requires a structured contentLower accuracyUsers tastes must be represented as a function of the contentUnable to exploit quality judgments of other users

Page 27: IPTV Recommender Systems

27

Paolo Cremonesi - Recommender Systems

Content-based Filtering: Latent Semantic Analysis

A Ak

VkT

Uk

Sk=

Uk * sqrt (Sk)Vk * sqrt (Sk)

pseudo terms pseudo items

cosine

Ak

svd

-Terms in rows-Items in columns

m x n

Page 28: IPTV Recommender Systems

28

Paolo Cremonesi - Recommender Systems

Recommender architecture

Items Storage Features extraction

Featuresrepresentation

Users Infer and learn profile

Interests/tastes representation

Compute user-item correlation

Items retrieval

Items recommendation

Resources management

Users management

Filter

feedback

Explicit vs implicit ratings

Page 29: IPTV Recommender Systems

29

Paolo Cremonesi - Recommender Systems

Datasets

• User-item rating matrix23942 users564 movies56686 ratings

• Movie Meta-data (textual information)TitleGenreDirectorCastDuration…

Real datasets composed by movies and user fruitions, plus some extra information

Page 30: IPTV Recommender Systems

30

Paolo Cremonesi - Recommender Systems

• Implicit vs Explicit• Come determinare il rating implicito

VODTV (EPG)

Page 31: IPTV Recommender Systems

31

Paolo Cremonesi - Recommender Systems

Some problems with IPTV recommender

• Cold start

• Multi-language content(e.g., Switzerland)

• New user problem(user-based algorithms)

• New item problem(all collaborative algorithms)

• Semantic problem(e.g., house and home)

Page 32: IPTV Recommender Systems

32

Paolo Cremonesi - Recommender Systems

Agenda

• IPTV architecture• Recommender algorithms• Evaluation of different algorithms• Multi-model systems

Page 33: IPTV Recommender Systems

33

Paolo Cremonesi - Recommender Systems

Problem

• Many works do not describe clearly the methods used for performance evaluation and model comparison

• Different dataset partition methodology and evaluation metrics lead to divergent results

• The Netflix prize has improperly focused the research attention onHold-outRMSE

Page 34: IPTV Recommender Systems

34

Paolo Cremonesi - Recommender Systems

Objective

• Design a new methodology to compare different algorithms according to

how often the user watches the TV(length of user profile)

if the user prefers “blockbuster” movies(user preference versus popular or unpopular movies and programs)

• Design a multi-model system

34

Page 35: IPTV Recommender Systems

35

Paolo Cremonesi - Recommender Systems

Metrics

• Error metricsMean Square Error(MSE)Root Mean Square Error (RMSE)Mean Absolute Error (MAE)

Only for explicit datasetsTop-N recommender systems

• Accuracy metricsRecallPrecisionFalloutF-measure

☺ Both implicit and explicit datasets

Page 36: IPTV Recommender Systems

36

Paolo Cremonesi - Recommender Systems

Accuracy metrics

Page 37: IPTV Recommender Systems

37

Paolo Cremonesi - Recommender Systems

Accuracy metrics

Page 38: IPTV Recommender Systems

38

Paolo Cremonesi - Recommender Systems

Accuracy metrics

Page 39: IPTV Recommender Systems

39

Paolo Cremonesi - Recommender Systems

Netflix dataset: test user profile

Page 40: IPTV Recommender Systems

40

Paolo Cremonesi - Recommender Systems

Netflix dataset: Global effects algorithm

RMSE: 0.95Recall: 1%F-measure: 0.01

RMSE: 0.95Recall: 1%F-measure: 0.01

Page 41: IPTV Recommender Systems

41

Paolo Cremonesi - Recommender Systems

Netflix dataset: Adjusted cosine algorithm

RMSE: 1.6Recall: 8%F-measure: 0.16

RMSE: 1.6Recall: 8%F-measure: 0.16

Page 42: IPTV Recommender Systems

42

Paolo Cremonesi - Recommender Systems

Netflix dataset: SVD algorithm

RMSE: 2.7Recall: 17%F-measure: 0.28

RMSE: 2.7Recall: 17%F-measure: 0.28

Page 43: IPTV Recommender Systems

43

Paolo Cremonesi - Recommender Systems

Quality evaluation

• Focus on future performance on new data• Proper partitioning of original data set into:

training settest set

• Test set must be different and independent from training set• Active user: should be left out of the model

43

Page 44: IPTV Recommender Systems

44

Paolo Cremonesi - Recommender Systems

Hold-out

Page 45: IPTV Recommender Systems

45

Paolo Cremonesi - Recommender Systems

Leave-one-out

Page 46: IPTV Recommender Systems

46

Paolo Cremonesi - Recommender Systems

K-fold

Page 47: IPTV Recommender Systems

47

Paolo Cremonesi - Recommender Systems

Agenda

• IPTV architecture• Recommender algorithms• Evaluation of different algorithms• Multi-model systems

Page 48: IPTV Recommender Systems

48

Paolo Cremonesi - Recommender Systems

Recommender system architecture

Web Services Items’ Content

(ICM)

Users’ Ratings (URM)

Batch Processing

Real-time Recommendation

Model Repository

Inputs Real time calls

STB

server

STB client

STB client

Business

Rules

Page 49: IPTV Recommender Systems

Paolo Cremonesi - Recommender Systems

Proposed approach

• Batch system

Statistical analysis of the datasetDefinition of a number of modelsAccuracy evaluation for different user profiles

• Run-time system

User profile analysisSelection of best candidate modelRecommendation

Page 50: IPTV Recommender Systems

Paolo Cremonesi - Recommender Systems

Multi-model recommender engine

Page 51: IPTV Recommender Systems

51

Paolo Cremonesi - Recommender Systems

Dataset statistical analysis (example)

Page 52: IPTV Recommender Systems

52

Paolo Cremonesi - Recommender Systems

Dataset statistical analysis (example)

100 101 102 103 1040

0.25

0.5

0.75

1

Position of the items in the top-rated

Per

cent

age

of ra

ted

item

s in

the

top-

rate

d

NMMLNF

Page 53: IPTV Recommender Systems

53

Paolo Cremonesi - Recommender Systems

Dataset statistical analysis (example)

20 or more

10...19 2...9

Popular

Non-Popular

User groups Item popularity

Page 54: IPTV Recommender Systems

54

Paolo Cremonesi - Recommender Systems

Popular vs. unpopular: SVD algorithm - NF

0 200 400 600 800 10000

0.05

0.1

0.15

0.2

0.25

0.3

0.35

Latent size

Rec

all

allpopularunpopular

Page 55: IPTV Recommender Systems

55

Paolo Cremonesi - Recommender Systems

Popular vs. unpopular: SVD algorithm - NM

50 100 200 300155

0.05

0.1

0.15

0.2

0.25

Latent size

Rec

all

allpopularunpopular

Page 56: IPTV Recommender Systems

56

Paolo Cremonesi - Recommender Systems

User profile length – NM recall

All

Group SVD Cos -like NBN_S NBN_I NBN_U2-9 11,21% 19,35% 19,65% 17,23% 21,65%

10 -19 11,23% 13,11% 12,09% 13,62% 12,60%20 -inf 9,91% 8,01% 6,45% 6,65% 6,52%

Group NBN_UGroup NBN_U

Popular

Group SVD Cos -like NBN_S NBN_I NBN_U2-9 22,21% 31,21% 31,54% 26,17% 33,93%

10 -19 24,12% 27,12% 24,61% 27,36% 25,59%20 -inf 25,72% 22,71% 20,71% 21,14% 20,93%

Group NBN_UGroup NBN_U

Unpopular

Group SVD Cos -like NBN_S NBN_I NBN_U2-9 9,92% 0,81% 0,13% 2,64% 1,48%

10 -19 1,23% 0,19% 0,56% 0,25%20 -inf 10,14% 0,70% 0,01% 0,10% 0,01%

Group SVD NBN_UGroup SVD NBN_U

10,01%

15,94%Best average algorithm (item-based)

20,92%Multi-model (overall)