analysis, design and implementation of a multi-criteria recommender system based on aspect...

Analysis, design and implementation of a Multi-

Criteria Recommender System based on Aspect

Extraction and Sentiment Analysis techniques

Instructors: Student:

Prof. Giovanni Semeraro Davide GIANNICO

Dott. Marco de Gemmis

Department of Computer ScienceMaster of Computer Science (Msc)

Outline

Multi-Criteria RecSys

Proposed approach

Experimentation

Conclusion

Future work

Need: necessity of the user to be supported during

complex decisional processes

Tendency: development of on-line platforms which provide

recommendations to the user and where the community

expresses the own opinion for a item class

Some stats 315M unique views per month & 200M reviews(TripAdvisor, 2014);

168M unique views per month & 35M reviews (Amazon, 2013);

139M unique views per month & 67M reviews (Yelp, 2014).

Scenery

Opportunity

Take advantage of the reviews informative power

incorporating such information in the recommendation

process.

Concept: make «value» from raw data.

Recommender Systems

Recommender Systems (RecSys) are decision support and

information filtering tools

The main goal is helping users which access to a data

source for discovering information or items that could be

interesting for them

Recently, RecSys area has focused on Multi-Criteria

RecSys[AMK11]

[AMK11] G. Adomavicius, N. Manouselis, Y. Kwon. ‘Multi-Criteria Recommender Systems’. In:

Recommender Systems Handbook, pp 769-803, Springer US, 2011

Multi-Criteria Recommender Systems

Techniques which provide recommendations to the user,

modeling the utility concept espressed by the user for an

item as a ratings vectore on several criteria

Focus: multi-criteria item-based collaborative filtering

Item-to-item similarity matrix

1.Item-to-item matrix calculus

2. Rating prediction

multi-static-criteria ratings

Input:

multi-criteria ratings

Output: recommendations

𝑠𝑖𝑚 Itemi, Itemj= …

IB Collaborative Filtering RecSys

Discussed Problem

State-of-the-art Approach: default criteria taxonomies on which users can rate

Limits and challenges:

Proposed criteria are not exhaustive respect to userpreferences need to «customize» the criteria

Vagueness of the criteria need of fine-grained criteria

Proposed Approach

Automatic identification of criteria from the reviews

using Aspect Extraction techniques

Sentiment Analysis for associating a preference level to

the extracted criteria (implicit rating)

Extension of multi-criteria item-based recommendation

algorithms exploiting the information extracted

Aspect Extraction e Sentiment Analysis

Step[V11]

Aspect

Extraction

Sentiment

Analysis

review<Main Aspect,Sub-Aspect,Rilevance,

Sentiment Score,holder>

<location,station, 0.8, 0.9, >

<staff, courtesy, 0.9, -0.6, >

<…, …, …, …, >

Opinion Retrieval

Engine (ORE) system

[V11] V. Giuliani. ‘Analisi, progettazione e sviluppo di un framework per l'opinion retrieval basato su tecniche

automatiche di aspect extraction’. In Tesi di Laurea in Informatica Magistrale, 2011

(multi-ORE-criteria ratings)

Algorithm A: #multi-ORE-criteria (1/2)

review dataset

multi-ORE-criteria ratings

Aspect

Extraction

Sentiment

Analysis

ORE System

Multi-Criteria Item-Based Collaborative Filtering RecSys which

uses the extracted ratings from the reviews for calculating the

item-to-item similarity matrix

Algorithm A: #multi-ORE-criteria (2/2)

multi-ORE-ratings Item-item similarity

matrix

(Multi-dimensional Pearson

coefficient)

𝑠𝑖𝑚 𝐼𝑡𝑒𝑚𝑖, Item𝑗 =

𝑢∈U,c=1..k 𝑅𝑢,𝑐,𝑖 − 𝑅 𝑖)(𝑅𝑢,𝑐,𝑖 − 𝑅 𝑗

𝑢∈U,c=1…k 𝑅𝑢,𝑐,𝑖 − 𝑅 𝑖

2𝑅𝑢,𝑐,𝑖 − 𝑅 𝑗

(Rating Prediction)

𝑃𝑢,𝑖 = 𝑎𝑙𝑙 𝑠𝑖𝑚𝑖𝑙𝑎𝑟 𝑖𝑡𝑒𝑚𝑠 𝑗(𝑠𝑗 ∗ 𝑅𝑢)

𝑛( , , 0.8 )

( , , 0.6 )

predicted ratings

Algorithm B: Multi-criteria Aspect-based Recommender

system based on sentimenT Analysis (#MARTA) (1/2)

Item #1Aspect

Extraction

Sentiment

Analysis

Position

Staff-courtesy

Breakfast-buffet

<position, *,0.9, ..>(review#1)

<staff,courtesy, -0.6, ..>(review#2)<position, *,0.7, ..>(review#2)

(Item#1 summary)

If <aspect> 𝜖 summary

add <aspect> to the running average

add <aspect> to summary

Item#1 review set

(Item#1 multi-ORE-ratings)

Multi-criteria CF RecSys in which the item-to-item similarity matrix

is calculated considering each item description (summary)

Position

Staff-courtesy

Breakfast-buffet

Room-clean

Item-item similarity matrix

Item #1 summary

Item #2 summary

(Multi-dimensional

similarity measure)

(Rating Prediction)

( , , 0.9 )

( , , 0.7 )

predicted ratings

Algorithm B: Multi-criteria Aspect-based Recommender

system based on sentimenT Analysis (#MARTA) (2/2)

Room-view

𝑠𝑖𝑚 Itemi, Itemj= …

Developed system: Btravel (1/2)

Developed system: Btravel (2/2)

EXPERIMENTATION

Design

Research questions

Q1) #multi-ORE-criteria algorithm overtakes the single-criteria(#single-criteria) and multi-criteria techniques (#multi-static-criteria), which use default criteria, results ?

Q2) Multi-criteria Aspect-based Recommender system based on sentimenT Analysis(#MARTA) algorithm overtakes the state-of-the-art approaches results?

10-Fold Cross Validation. For each run the training set isused for calculating the similarity matrix

Evaluation of the approaches on the respective test set

Evaluation metrics: MAE, RMSE, Precision, Recall and F-Measure

Statistical validation using the Wilcoxon test

Dataset

Amazon

TripAdvisor

#ratings #users #items #ratings/

#ratings/

sparsity

355.949 2.850 2.820 124,73 126,06 0,96

#ratings/

sparsity

229.905 45.981 11.537 5 19,9 0,99

#ratings/

sparsity

208.135 1.386 1.580 150,16 131,73 0,90

Q1 Results (#multi-ORE-criteria vs #single-

criteria and #multi-static-criteria)

Dataset TripAdvisor

SVD 0,87

#single-criteria 1,32

#multi-static-criteria 1,10

#Multi-ORE-criteria 0,96

F-Measure

SVD 0,59

#multi-static-criteria 0,53

#multi-ORE-criteria 0,71

0,52 0,53

Q2 Results (#MARTA)

DatasetYelp (sparsity: 0,99)

SVD 0,91

#MARTA 0,85

1,181,13

F-Measure

SVD 0,58

#MARTA 0,63

Q2 Results (#MARTA)

Dataset TripAdvisor (sparsity: 0,96)

SVD 0,87

#MARTA 0,87

0,960,87

F-Measure

SVD 0,59

#MARTA 0,58

Q2 Results (#MARTA)

Dataset Amazon (sparsity: 0,90)

SVD 0,71

#MARTA 0,71

0,710,65

F-Measure

#SVD 0,62

#MARTA 0,63

Analysis of the results

Q1) #multi-ORE-criteria algorithm overtakes the single-

criteria(#single-criteria) and multi-criteria techniques (#multi-

static-criteria), which use default criteria, results ?

#multi-ORE-criteria showed better results than #multi-

static-criteria and #single-criteria (MAE and F1)

Q2) Multi-criteria Aspect-based Recommender system based on

sentimenT Analysis(#MARTA) algorithm overtakes the state-of-

the-art approaches results?

#MARTA showed good results especially on sparse

dataset

Conclusions

Indipendence from the domain

Influenced by the review quality

Good performance especially on the item classification

(relevant or not)

Future work

Considering the extracted opinions relevance score

Testing more advanced Aspect Extraction and Sentiment

Analysis techniques

Learning weighing schemes for each user and considering

the context for better performance

analysis, design and implementation of a multi-criteria recommender system based on aspect...

multicriteria ratingsoutput

proposed criteria

criteria ratingsinput

criteria focus

criteria vagueness

default criteria taxonomies

item class

item similarity matrix1

Internet

sentiment analysis - stanford university · pdf file ·...

sentirueval: testing object-oriented sentiment analysis...

integrated topic modeling and sentiment analysis: a review...

practical sentiment analysis

introduction to sentiment analysis - eth z · sentiment !...

sentiment analysis and the consumer...

sentiment analysis in social...

a location-sentiment-aware recommender...

sentiment analysis & opinion mining€¦ · sentiment...

sentiment analysis tools

practical sentiment analysis tutorial -...

sentiment analysis introduction to -...

sentiment analysis 20140910

geo-spatial multimedia sentiment analysis in disasters ·...

product sentiment analysis

sentiment analysis and opinion...

a novel approach to recommender system based on aspect...

sentiment analysis & computational...

workshop sentiment analysis

sentiment analysis of twitter data for a tourism ... ·...