building personalized data products with dato

23
Building Personalized Data Products with Dato Trey Causey [email protected]

Upload: turi-inc

Post on 21-Mar-2017

502 views

Category:

Technology


2 download

TRANSCRIPT

Page 1: Building Personalized Data Products with Dato

Building Personalized Data Products with Dato

Trey [email protected]

Page 2: Building Personalized Data Products with Dato

Questions?

• Now: We are monitoring chat window

• Later: Email me at [email protected]

• dato.com

Page 3: Building Personalized Data Products with Dato

What are data products?• Products that produce and consume data.

• Products that improve as they produce and consume data.

• Products that use data to provide a personalized experience.

• Personalized experiences increase engagement and retention.

Page 4: Building Personalized Data Products with Dato

What data?

• You probably already have this data

• Usage logs, transaction data, etc.

• Need a way to turn this existing data into an intelligent application

Page 5: Building Personalized Data Products with Dato

Recommender systems

• Personalized experiences through recommendations

• Recommend products, social network connections, events, songs, and more

• Implicitly and explicitly drive many of experiences you’re familiar with

Page 6: Building Personalized Data Products with Dato

Recommender uses

• Netflix, Spotify, LinkedIn, Facebook with the most visible examples• “You May Also Like”

“People You May Know”“People to Follow”

• Also silently power many other experiences

• Product listings, up-sell options, add-ons,

• Netflix —> $1MM for 10% better

Page 7: Building Personalized Data Products with Dato

What data do you need?

• Required for implicit data• User identifier• Product identifier

• That’s it!

• Further customization• Ratings (explicit data), counts• Side data

Page 8: Building Personalized Data Products with Dato

Implicit data

• User x productinteractions

• Consumed / used /clicked / etc.

Page 9: Building Personalized Data Products with Dato

How do recommenders work?

• Most basic: item similarity

Page 10: Building Personalized Data Products with Dato

Matrix factorization

• Treat users and products as a giant matrix with (very) many missing values

• Users have latent factors that describe how much they like various genres

• Items have latent factors that describe how much like each genre they are

Page 11: Building Personalized Data Products with Dato

Matrix factorization

• Turn this into a fill-in-the-missing-value exercise by learning the latent factors

• Implicit or explicit data

• Part of the winning formula for the Netflix Prize

• Predict ratings or rankings

Page 12: Building Personalized Data Products with Dato

Matrix factorization

Page 13: Building Personalized Data Products with Dato
Page 14: Building Personalized Data Products with Dato
Page 15: Building Personalized Data Products with Dato
Page 16: Building Personalized Data Products with Dato
Page 17: Building Personalized Data Products with Dato

Fill in the blanks

• Learn the latent factors that minimize prediction error on the observed values

• Fill in the missing values

• Sort the list by predicted rating &recommend the unseen items

Page 18: Building Personalized Data Products with Dato

Rankings?

• Often less concerned with predicting precise scores

• Just want to get the first few items right

• Screen real estate is precious

• Ranking factorization recommender

Page 19: Building Personalized Data Products with Dato

Side features

• Include information about users• Geographic, demographic, time of day,

etc.

• Include information about products• Product subtypes, geographic

availability, etc.

• Help with the cold start problem

Page 20: Building Personalized Data Products with Dato

How to choose which model?

• Select the appropriate model for your data (implicit/explicit), if you want side features or not, select hyperparameters, tune them…

• … or let GraphLab Create do it for you and automatically tune hyperparameters

Page 21: Building Personalized Data Products with Dato

Evaluation

• Train on a portion of your data• Test on a held-out portion

• Ratings: RMSE• Ranking: Precision, recall• Business metrics

• Evaluate against popularity

Page 22: Building Personalized Data Products with Dato

Live demo

• Building and deploying a recommender system with GraphLab Create and Dato Predictive Services

Page 23: Building Personalized Data Products with Dato

Thank you!

• dato.com

• @datoinc

[email protected]