ncdrec: a decomposability inspired framework for top-n...

17
Introduction & Motivation NCDREC Framework Experimental Evaluation Future Work & Conclusion NCDREC: A Decomposability Inspired Framework for Top-N Recommendation Athanasios N. Nikolakopoulos 1,2 John D. Garofalakis 1,2 1 Computer Engineering and Informatics Department, University of Patras, Greece 2 Computer Technology Institute & Press “Diophantus” IEEE/WIC/ACM International Conference on Web Intelligence Warsaw, August 2014 A. N. Nikolakopoulos and J. D. Garofalakis NCDREC

Upload: others

Post on 29-May-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: NCDREC: A Decomposability Inspired Framework for Top-N …nikolako.net/slides/NCDREC_WI2014_slides.pdf · Challenges of Modern CF Algorithms Recommender System Algorithms _1*PJJ1L.1_X

Introduction & MotivationNCDREC Framework

Experimental EvaluationFuture Work & Conclusion

NCDREC: A Decomposability Inspired Frameworkfor Top-N Recommendation

Athanasios N. Nikolakopoulos1,2 John D. Garofalakis1,2

1Computer Engineering and Informatics Department,University of Patras, Greece

2Computer Technology Institute & Press “Diophantus”

IEEE/WIC/ACM International Conference on Web IntelligenceWarsaw, August 2014

A. N. Nikolakopoulos and J. D. Garofalakis NCDREC

Page 2: NCDREC: A Decomposability Inspired Framework for Top-N …nikolako.net/slides/NCDREC_WI2014_slides.pdf · Challenges of Modern CF Algorithms Recommender System Algorithms _1*PJJ1L.1_X

Introduction & MotivationNCDREC Framework

Experimental EvaluationFuture Work & Conclusion

Outline

1 Introduction & MotivationRecommender Systems - Collaborative FilteringChallenges of Modern CF Algorithms

2 NCDREC FrameworkNCDREC ModelCriterion for ItemSpace CoverageNCDREC Algorithm: Storage and Computational Issues

3 Experimental EvaluationEvaluation MethodologyQuality of RecommendationsLong-Tail RecommendationsCold-Start Recommendations

4 Future Work & Conclusion

A. N. Nikolakopoulos and J. D. Garofalakis NCDREC

Page 3: NCDREC: A Decomposability Inspired Framework for Top-N …nikolako.net/slides/NCDREC_WI2014_slides.pdf · Challenges of Modern CF Algorithms Recommender System Algorithms _1*PJJ1L.1_X

Introduction & MotivationNCDREC Framework

Experimental EvaluationFuture Work & Conclusion

Recommender Systems - Collaborative FilteringChallenges of Modern CF Algorithms

Recommender System Algorithms

..RECOMMENDERSYSTEM

.

USERS

.ITEMS .

RATINGS

.

...

.

RecommendationList

.Rating

Predictions

.....

Collaborative Filtering Recommendation Algorithms

Wide deployment in Commercial EnvironmentsSignificant Research Efforts

A. N. Nikolakopoulos and J. D. Garofalakis NCDREC

Page 4: NCDREC: A Decomposability Inspired Framework for Top-N …nikolako.net/slides/NCDREC_WI2014_slides.pdf · Challenges of Modern CF Algorithms Recommender System Algorithms _1*PJJ1L.1_X

Introduction & MotivationNCDREC Framework

Experimental EvaluationFuture Work & Conclusion

Recommender Systems - Collaborative FilteringChallenges of Modern CF Algorithms

Challenges of Modern CF Algorithms

SparsityIt is an Intrinsic RS Characteristic related to serious problems:

Long-Tail RecommendationCold start ProblemLimited ItemSpace Coverage

Traditional CF techniques, such as neighborhood models, are verysusceptible to sparsity. Among the most promising approaches inalleviating sparsity related problems are:

Dimensionality-Reduction Models

Build a reduced latent space which is dense.

Graph-Based Models.

Exploit transitive relations in the data, while preserving some of the“locality”.

A. N. Nikolakopoulos and J. D. Garofalakis NCDREC

Page 5: NCDREC: A Decomposability Inspired Framework for Top-N …nikolako.net/slides/NCDREC_WI2014_slides.pdf · Challenges of Modern CF Algorithms Recommender System Algorithms _1*PJJ1L.1_X

Introduction & MotivationNCDREC Framework

Experimental EvaluationFuture Work & Conclusion

Recommender Systems - Collaborative FilteringChallenges of Modern CF Algorithms

Exploiting Decomposability

We attack the problem from a different perspective:

Sparsity ←→ Hierarchy ←→ Decomposability.

Nearly Completely Decomposable Systems

Pioneered by Herbert A. Simon.Many Applications in Diverse Disciplines such as economics,cognitive theory and social sciences, to computer systemsperformance evaluation, data mining and information retrieval

Main Idea: Exploit the innate Hierarchy of the Item Set, and viewit as a decomposable space.

Can this enrich the Collaborative Filtering Paradigm in an Efficientand Scalable Way?Does this approach offer any qualitative advantages in alleviatingsparsity related problems?

A. N. Nikolakopoulos and J. D. Garofalakis NCDREC

Page 6: NCDREC: A Decomposability Inspired Framework for Top-N …nikolako.net/slides/NCDREC_WI2014_slides.pdf · Challenges of Modern CF Algorithms Recommender System Algorithms _1*PJJ1L.1_X

Introduction & MotivationNCDREC Framework

Experimental EvaluationFuture Work & Conclusion

NCDREC ModelCriterion for ItemSpace CoverageNCDREC Algorithm: Storage and Computational Issues

NCDREC Model Overview

Definitions

We define a D-decomposition to be an indexed

family of sets D , D1, . . . ,DK, that spanthe ItemSpace V,

We define Dv ,⋃

v∈DkDk to be the proximal

set of items of v ∈ V,

We also define the associated block coupling

graph GD , (VD, ED); its vertices correspondto the D-blocks, and an edge between twovertices exists whenever the intersection ofthese blocks is a non-empty set. Finally, weintroduce an aggregation matrix AD ∈ Rm×K ,whose jk th element is 1, if vj ∈ Dk and zerootherwise.

NCDREC Components

Main Component: Recommendation vectorsproduced by projecting the NCD perturbeddata onto an f -dimensional space.

ColdStart Component:Recommendationvectors are the stationary distributions of aDiscrete Markov Chain Model.

W , εZXᵀ

X , diag(ADe)−1AD

[Z]ik , (nkui)−1[RAD ]ik , when nkui

> 0,

and zero otherwise.

G , R + εW

H , diag(Ce)−1C,

where [C]ij , rᵀi rj for i 6= j

D , XY, X , diag(ADe)−1AD,

Y , diag(AᵀDe)−1Aᵀ

D,

E , eωᵀ

S(ω) , (1 − α)E + α(βH + (1 − β)D)

A. N. Nikolakopoulos and J. D. Garofalakis NCDREC

Page 7: NCDREC: A Decomposability Inspired Framework for Top-N …nikolako.net/slides/NCDREC_WI2014_slides.pdf · Challenges of Modern CF Algorithms Recommender System Algorithms _1*PJJ1L.1_X

Introduction & MotivationNCDREC Framework

Experimental EvaluationFuture Work & Conclusion

NCDREC ModelCriterion for ItemSpace CoverageNCDREC Algorithm: Storage and Computational Issues

Criterion for ItemSpace Coverage

Theorem (ItemSpace Coverage)

If the block coupling graph GD is connected, there exists a unique steadystate distribution π of the Markov chain corresponding to matrix S thatdepends on the preference vector ω; however, irrespectively of anyparticular such vector, the support of this distribution includes every itemof the underlying space.

Proof Sketch: When GD is connected, the Markov chain induced by thestochastic matrix S consists of a single irreducible and aperiodic closedset of states, that includes all the items. The above is true for everystochastic vector ω, and for every positive real numbers α, β < 1. Takinginto account the fact that the state space is finite, the resulting Markovchain becomes ergodic. So πi > 0, for all i , and the support of thedistribution that defines the recommendation vector includes every itemof the underlying space.

A. N. Nikolakopoulos and J. D. Garofalakis NCDREC

Page 8: NCDREC: A Decomposability Inspired Framework for Top-N …nikolako.net/slides/NCDREC_WI2014_slides.pdf · Challenges of Modern CF Algorithms Recommender System Algorithms _1*PJJ1L.1_X

Introduction & MotivationNCDREC Framework

Experimental EvaluationFuture Work & Conclusion

NCDREC ModelCriterion for ItemSpace CoverageNCDREC Algorithm: Storage and Computational Issues

NCDREC Algorithm: Storage and Computational Issues

Input: Matrices R ∈ Rn×m, H ∈ Rm×m,X ∈ Rm×K ,Y ∈ RK×m,Z ∈ Rn×K . Parameters α, β, f , εOutput: The matrix with recommendation vectors for every user, Π ∈ Rn×m

Step 1: Find the newly added users and collect their preference vectors into matrix Ω.

Step 2: Compute Πsparse using the ColdStart Procedure .Step 3: Initialize vector p1 to be a random unit length vector.

Step 4: Compute the modified Lanczos procedure up to step M, using NCD PartialLBD withstarting vector p1.Step 5: Compute the SVD of the bidiagonal matrix B and use it to extract f < M approximatesingular triplets:

uj, σj , vj ← Qu(B)j , σ

(B)j ,Pv

(B)j

Step 6: Orthogonalize against the approximate singular vectors to get a new starting vector p1.Step 7: Continue the Lanczos procedure for M more steps using the new starting vector.Step 8: Check for convergence tolerance. If met compute matrix:

Πfull = UΣVᵀ

else go to Step 4Step 9: Update Πfull, replacing the rows that correspond to new users with Πsparse.Return Πfull

A. N. Nikolakopoulos and J. D. Garofalakis NCDREC

Page 9: NCDREC: A Decomposability Inspired Framework for Top-N …nikolako.net/slides/NCDREC_WI2014_slides.pdf · Challenges of Modern CF Algorithms Recommender System Algorithms _1*PJJ1L.1_X

Introduction & MotivationNCDREC Framework

Experimental EvaluationFuture Work & Conclusion

Evaluation MethodologyQuality of RecommendationsLong-Tail RecommendationsCold-Start Recommendations

Experimental Evaluation

DatasetsYahoo!R2Music

MovieLens

Competing MethodsCommute Time (CT)

Pseudo-Inverse of the user-item graph Laplacian (L†)Matrix Forest Algorithm (MFA)

First Passage Time (FP)

Katz Algorithm (Katz)

MetricsRecall

Precision

R-Score

Normalized Discounted Cumulative Gain (NDCG@k)

Mean Reciprocal Rank

A. N. Nikolakopoulos and J. D. Garofalakis NCDREC

Page 10: NCDREC: A Decomposability Inspired Framework for Top-N …nikolako.net/slides/NCDREC_WI2014_slides.pdf · Challenges of Modern CF Algorithms Recommender System Algorithms _1*PJJ1L.1_X

Introduction & MotivationNCDREC Framework

Experimental EvaluationFuture Work & Conclusion

Evaluation MethodologyQuality of RecommendationsLong-Tail RecommendationsCold-Start Recommendations

Full Dist. Recommendations

MethodologyRandomly sample 1.4% of the ratings ofthe dataset ⇒ probe set PUse each item vj , rated with 5 stars byuser ui in P ⇒ test set TRandomly select another 1000 unrateditems of the same user for each item in TForm ranked lists by ordering all the 1001items

TABLE IRECOMMENDATION QUALITY ON MOVIELENS1M AND YAHOO!R2MUSIC

DATASETS USING R-SCORE AND MRR METRICS

MovieLens1M Yahoo!R2Music

R(5) R(10) MRR R(5) R(10) MRR

NCDREC 0.3997 0.5098 0.3008 0.3539 0.4587 0.2647MFA 0.1217 0.1911 0.0887 0.2017 0.2875 0.1591L† 0.1216 0.1914 0.0892 0.1965 0.2814 0.1546FP 0.2054 0.2874 0.1524 0.1446 0.2241 0.0998

Katz 0.2187 0.3020 0.1642 0.1704 0.2529 0.1203CT 0.2070 0.2896 0.1535 0.1465 0.2293 0.1019

5 10 15 200

0.2

0.4

0.6

MovieLens1M

Recall@N

0 0.2 0.4 0.6 0.8 10

5 · 10−2

0.1

0.15

0.2

Precision/Recall

5 10 15 200

0.2

0.4

0.6

NDCG@N

5 10 15 200

0.2

0.4

0.6

Yahoo!R2Music

0 0.2 0.4 0.6 0.8 10

5 · 10−2

0.1

0.15

5 10 15 200

0.2

0.4

0.6

NCDREC Katz FP MFA L† CT

A. N. Nikolakopoulos and J. D. Garofalakis NCDREC

Page 11: NCDREC: A Decomposability Inspired Framework for Top-N …nikolako.net/slides/NCDREC_WI2014_slides.pdf · Challenges of Modern CF Algorithms Recommender System Algorithms _1*PJJ1L.1_X

Introduction & MotivationNCDREC Framework

Experimental EvaluationFuture Work & Conclusion

Evaluation MethodologyQuality of RecommendationsLong-Tail RecommendationsCold-Start Recommendations

Long-Tail Recommendations

Methodology (Long Tail)We order the items according to theirpopularity (measured in terms of numberof ratings)

We further partition the test set T intotwo subsets, Thead and TtailWe discard the popular items and weevaluate NCDREC and the otheralgorithms on the Ttail test set.

TABLE IILONG TAIL RECOMMENDATION QUALITY ON MOVIELENS1M ANDYAHOO!R2MUSIC DATASETS USING R-SCORE AND MRR METRICS

MovieLens1M Yahoo!R2Music

R(5) R(10) MRR R(5) R(10) MRR

NCDREC 0.3279 0.4376 0.2395 0.3520 0.4322 0.2834MFA 0.1660 0.2517 0.1188 0.2556 0.3530 0.1995L† 0.1654 0.2507 0.1193 0.2492 0.3461 0.1939FP 0.0183 0.0654 0.0221 0.0195 0.0684 0.0224

Katz 0.0275 0.0822 0.0267 0.0349 0.0939 0.0309CT 0.0192 0.0675 0.0227 0.0215 0.0747 0.0249

5 10 15 200

0.2

0.4

0.6

MovieLens1M

Recall@N

0 0.2 0.4 0.6 0.8 10

0.1

0.2

Precision/Recall

5 10 15 200

0.2

0.4

0.6

NDCG@N

5 10 15 200

0.2

0.4

0.6

Yahoo!R2Music

0 0.2 0.4 0.6 0.8 10

0.1

0.2

5 10 15 200

0.2

0.4

0.6

NCDREC Katz FP MFA L† CT

A. N. Nikolakopoulos and J. D. Garofalakis NCDREC

Page 12: NCDREC: A Decomposability Inspired Framework for Top-N …nikolako.net/slides/NCDREC_WI2014_slides.pdf · Challenges of Modern CF Algorithms Recommender System Algorithms _1*PJJ1L.1_X

Introduction & MotivationNCDREC Framework

Experimental EvaluationFuture Work & Conclusion

Evaluation MethodologyQuality of RecommendationsLong-Tail RecommendationsCold-Start Recommendations

New Users problem

MethodologyRandomly select 100 users having ratedat least 100 items and delete 96%, 94%,92% and 90% of each users’ ratings.

Compare the rankings induced on themodified data with the complete set ofratings.

MetricsSpearman’s ρ

Kendall’s τ

Degree of Agreement (DOA)

Normalized Distance-based PerformanceMeasure (NDPM)

4% 6% 8% 10%

0.3

0.32

Percentage of included ratings

Spearman’s ρ

4% 6% 8% 10%

0.24

0.25

0.26

Percentage of included ratings

Kendall’s τ

4% 6% 8% 10%

0.86

0.88

Percentage of included ratings

Degree Of Agreement

4% 6% 8% 10%0.1

0.12

0.14

Percentage of included ratings

NDPM (the smaller the better)

NCDREC Katz FP MFA L† CT

A. N. Nikolakopoulos and J. D. Garofalakis NCDREC

Page 13: NCDREC: A Decomposability Inspired Framework for Top-N …nikolako.net/slides/NCDREC_WI2014_slides.pdf · Challenges of Modern CF Algorithms Recommender System Algorithms _1*PJJ1L.1_X

Introduction & MotivationNCDREC Framework

Experimental EvaluationFuture Work & Conclusion

Future Research Directions & Conclusion

Future WorkDecomposition Granularity Effect

Coarse Grained ⇒ Sparseness InsensitivityFine Grained ⇒ Higher Quality of Recommendations

Multiple-Criteria DecompositionsHow it affects the theoretical properties of the ColdStartComponent?

A. N. Nikolakopoulos and J. D. Garofalakis NCDREC

Page 14: NCDREC: A Decomposability Inspired Framework for Top-N …nikolako.net/slides/NCDREC_WI2014_slides.pdf · Challenges of Modern CF Algorithms Recommender System Algorithms _1*PJJ1L.1_X

Introduction & MotivationNCDREC Framework

Experimental EvaluationFuture Work & Conclusion

Thanks!Q&A

A. N. Nikolakopoulos and J. D. Garofalakis NCDREC

Page 15: NCDREC: A Decomposability Inspired Framework for Top-N …nikolako.net/slides/NCDREC_WI2014_slides.pdf · Challenges of Modern CF Algorithms Recommender System Algorithms _1*PJJ1L.1_X

Example NCD Proximity MatrixNCDREC Basic SubComponents

Example NCD Proximity Matrix Back

D1 D2 D3 Nv Gvv1 X − − 1 v1, v2, v4v2 X − X 2 v1, v2, v4, v5, v6, v7, v8v3 − X − 1 v3, v4, v5, v8v4 X X − 2 v1, v2, v3, v4, v5, v8v5 − X X 2 v2, v3, v4, v5, v6, v7, v8v6 − − X 1 v2, v5, v6, v7, v8v7 − − X 1 v2, v5, v6, v7, v8v8 − X X 2 v2, v3, v4, v5, v6, v7, v8

v1 v2 v3 v4 v5 v6 v7 v8

D1 D2 D3

v1 v2 v3 v4 v5 v6 v7 v8

1 1/2

1/2

1 1/2 1/2 1/2 1/2 1

1

1/2

1/2

1/31/3 1/3

1/4

1/41/4

1/41/5

1/5

1/5 1/51/5

=⇒ D =

13

13 0 1

3 0 0 0 016

415 0 1

61

101

10110

110

0 0 14

14

14 0 0 1

416

16

18

724

18 0 0 1

8

0 110

18

18

940

110

110

940

0 15 0 0 1

515

15

15

0 15 0 0 1

515

15

15

0 110

18

18

940

110

110

940

A. N. Nikolakopoulos and J. D. Garofalakis NCDREC

Page 16: NCDREC: A Decomposability Inspired Framework for Top-N …nikolako.net/slides/NCDREC_WI2014_slides.pdf · Challenges of Modern CF Algorithms Recommender System Algorithms _1*PJJ1L.1_X

Example NCD Proximity MatrixNCDREC Basic SubComponents

NCD PartialLBD ProcedureColdStart Procedure

NCD PartialLBD Back

procedure NCD PartialLBD(R,X,Z,p1, ε)φ← Xᵀp1; q1 ← Rp1 + εZφ;b1,1 ← ‖q1‖2 ; u1 ← q1/b1,1;for j = 1 to M doφ← Zᵀqj;r← Rᵀqj + εXφ− bj,jpj;r← r − [p1 . . .pj] ([p1 . . .pj]

ᵀr);if j < M then

bj,j+1 ← ‖r‖; pj+1 ← r/bj,j+1;φ← Xᵀpj+1;qj+1 ← Rpj+1 + εZφ− bj,j+1qj;qj+1 ← qj+1 − [q1 . . .qj] ([q1 . . .qj]

ᵀqj+1);bj+1,j+1 ← ‖qj+1‖;qj+1 ← qj+1/bj+1,j+1;

end ifend for

end procedure

A. N. Nikolakopoulos and J. D. Garofalakis NCDREC

Page 17: NCDREC: A Decomposability Inspired Framework for Top-N …nikolako.net/slides/NCDREC_WI2014_slides.pdf · Challenges of Modern CF Algorithms Recommender System Algorithms _1*PJJ1L.1_X

Example NCD Proximity MatrixNCDREC Basic SubComponents

NCD PartialLBD ProcedureColdStart Procedure

ColdStart Procedure Back

procedure ColdStart(H,X,Y,Ω, α, β)Π← Ω; k ← 0; r ← 1;while r > tol and k ≤ maxit do

k ← k + 1;Π← αβΠH; Φ← ΠX;Π← Π + α(1− β)ΦY + (1− α)Ω;r ← ‖Π−Π‖; Π← Π;

end whilereturn Πsparse ← Πend procedure

A. N. Nikolakopoulos and J. D. Garofalakis NCDREC