recommender hackathon @plista 2013/04
The Recommender Challenge Hackathon
Torben Brodt
plista GmbH
2013/04/24
What is plista
● recommendation
● advertising
● network
● many big publishers in DE, AT, CH, ..
● "other articles you might be interested in.."
● more than 8 billion impressions, clicks, engagements, .. per month
Architecture
Tracking Success
● each time a recommender is chosen, plista tracks its success, both per context and per combination of contexts
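As a rough illustration of per-context success tracking, the sketch below counts impressions and clicks for each (recommender, context) pair and derives a click-through rate. It is not plista's implementation: the class name `SuccessTracker`, the use of click-through rate as the success measure, and the example context values are all assumptions made for this example.

```python
from collections import defaultdict


class SuccessTracker:
    """Counts impressions and clicks per (recommender, context) pair."""

    def __init__(self):
        # key: (recommender name, context tuple) -> [impressions, clicks]
        self.counts = defaultdict(lambda: [0, 0])

    def record_impression(self, recommender, context):
        self.counts[(recommender, context)][0] += 1

    def record_click(self, recommender, context):
        self.counts[(recommender, context)][1] += 1

    def ctr(self, recommender, context):
        """Click-through rate; 0.0 when nothing has been shown yet."""
        impressions, clicks = self.counts.get((recommender, context), (0, 0))
        return clicks / impressions if impressions else 0.0


tracker = SuccessTracker()
ctx = ("publisher-a", "mobile")  # hypothetical context combination
for _ in range(4):
    tracker.record_impression("most-popular", ctx)
tracker.record_click("most-popular", ctx)
print(tracker.ctr("most-popular", ctx))  # 0.25
```

Using a tuple as the context key means a single context and a combination of contexts are tracked the same way.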
???
Tracking Success
● "online evaluation" technology
● better than classical offline evaluation known from papers?
● cooperation with TU Berlin, aided by state
???
The hackathon
● we open the data, you provide the knowledge
● develop a recommender that implements the HTTP + JSON API
● plista will track the success; if you are smart, you win with the best recommender
● best is live, best is scalable and best will work in industry
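To give a feeling for what "implements the HTTP + JSON API" could look like, here is a minimal sketch using only the Python standard library. The actual message format is defined by the contest and is not shown in these slides, so the field names `limit`, `candidates` and `items`, the port, and the function names are purely hypothetical placeholders.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


def handle_message(raw_body: bytes) -> bytes:
    """Turn one JSON request into one JSON response.

    The field names used here are hypothetical, not the contest format.
    """
    try:
        request = json.loads(raw_body or b"{}")
    except ValueError:
        return json.dumps({"error": "invalid json"}).encode("utf-8")
    if not isinstance(request, dict):
        return json.dumps({"error": "expected a json object"}).encode("utf-8")
    limit = int(request.get("limit", 3))
    candidates = request.get("candidates", [])
    # Placeholder strategy: return the first `limit` candidates unchanged.
    return json.dumps({"items": candidates[:limit]}).encode("utf-8")


class RecommenderHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        body = handle_message(self.rfile.read(length))
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def serve(port: int = 8080) -> None:
    """Start the server; blocks until interrupted."""
    HTTPServer(("0.0.0.0", port), RecommenderHandler).serve_forever()
```

Calling `serve()` starts the endpoint. Keeping the JSON logic in `handle_message` makes it possible to test the recommender without opening a socket.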
The hackathon
● many interesting people
● get to know developers using
  ○ PHP, Java, NodeJS, Python
  ○ Redis, Storm, Elastic Search
  ○ Apache Mahout, Lucene
  ○ ...
The hackathon
● http://contest.plista.com/bigdataweek2013
  ○ 4pm start
  ○ 6pm "hello world!"
  ○ 7pm pizza + mate
  ○ ... open end fun
How to start (1/3)
register at contest.plista.com
select challenge
* bdw13
* weekly
How to start (2/3)
● start implementation using examples
● http://contest.plista.com/wiki/example
● have a github account?
● "fork" one of the example projects
● work on your local "clone"
● upload to your server
● enter url in your contest account
How to start (3/3)
● need a virtual server? ask us
● need old data? start a replay from the web interface
● try sending debug events from the web interface
● wait for team activation
● plista starts sending you real data
● your responses are displayed on real publishers
Recommender ideas
● concentrate on implicit feedback
● think streaming / incremental
  ○ better to scale
  ○ faster results, new articles are better than old articles?
● think about cross domain
  ○ the contest does not allow mixing items from different domains/publishers
  ○ want knowledge of the full data, but candidate items of a slice
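One way to combine these ideas is a streaming popularity counter with exponential time decay, kept separately per domain. This is only a sketch of one possible baseline, not a recommended or official approach: the class `DecayedPopularity`, the one-hour half-life, and the domain and item names are assumptions chosen for illustration.

```python
import math
from collections import defaultdict


class DecayedPopularity:
    """Streaming popularity scores with exponential time decay, per domain."""

    def __init__(self, half_life_seconds=3600.0):
        self.decay_rate = math.log(2) / half_life_seconds
        # domain -> item -> (score, timestamp of last update)
        self.scores = defaultdict(dict)

    def _decayed(self, score, last_ts, now):
        return score * math.exp(-self.decay_rate * (now - last_ts))

    def observe(self, domain, item, now):
        """Register one implicit-feedback event (e.g. an impression or click)."""
        score, last_ts = self.scores[domain].get(item, (0.0, now))
        self.scores[domain][item] = (self._decayed(score, last_ts, now) + 1.0, now)

    def recommend(self, domain, now, limit=3):
        """Rank only items of the requested domain, so domains are never mixed."""
        ranked = sorted(
            self.scores[domain].items(),
            key=lambda kv: self._decayed(kv[1][0], kv[1][1], now),
            reverse=True,
        )
        return [item for item, _ in ranked[:limit]]


model = DecayedPopularity(half_life_seconds=3600)
model.observe("news-site", "old-article", now=0)
model.observe("news-site", "old-article", now=0)
model.observe("news-site", "new-article", now=7200)
model.observe("other-site", "foreign-article", now=7200)
print(model.recommend("news-site", now=7200))  # ['new-article', 'old-article']
```

Each event updates a single entry, so the model never needs a batch retraining step, and the decay lets a fresh article overtake an older one with more total events. The sketch does not yet use knowledge from other domains when scoring; it only enforces that candidates come from the requested domain.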
Summary
join us?
http://www.plista.com/career
stay in touch?
Torben Brodt, plista.com, google plus, twitter, ..
Discounts
30% discount code: PLISTA30
● NoSQL Infrastructure
● Killing pigs and saving Danish bacon with Riak
● Introduction to Graph Databases
● Yokozuna, combining Solr with Riak
● Why you should care about Big Data
● ...
And Lottery...
● 1 FREE TICKET
And more Torben
● Talk about Realtime Recommendations
Hint
● now watch out for teammates
and have fun!