TRANSCRIPT
Improving experimentation velocity via Multi-Armed Bandits
Dr Ilias Flaounas, Senior Data Scientist
Growth Hacking Meetup, Sydney, 20 June 2016
[Chart: conversion rate of each variation over time]
• In a classic A/B test we pick the cohort for the next user at random.
• In a MAB we actively choose the cohort.
Pick black to exploit
Pick green (or red) to explore
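The exploit/explore choice above can be sketched with a simple epsilon-greedy policy — a hypothetical simulation, not the talk's actual setup (the conversion rates and parameters below are invented): with probability epsilon we explore a random variation, otherwise we exploit the one with the best observed conversion rate.

```python
import random

def epsilon_greedy(conversion_rates, steps=10000, epsilon=0.1, seed=0):
    """Epsilon-greedy bandit over simulated Bernoulli 'conversion' arms."""
    rng = random.Random(seed)
    n = len(conversion_rates)
    pulls, wins = [0] * n, [0] * n
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n)          # explore: pick a random arm
        else:                               # exploit: best observed rate so far
            arm = max(range(n),
                      key=lambda a: wins[a] / pulls[a] if pulls[a] else 1.0)
        pulls[arm] += 1
        if rng.random() < conversion_rates[arm]:
            wins[arm] += 1
    return pulls

# Most of the traffic ends up assigned to the highest-converting arm.
pulls = epsilon_greedy([0.03, 0.05, 0.10])
print(pulls)
```

Note the asymmetry this creates: the winning arm accumulates samples quickly, while the losing arms are starved — which is exactly the advantages/disadvantages trade-off discussed later.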
Let’s run it for a bit longer… Again, win for variation “d”.
Classic A/B/C/D/E: ~2.5K samples
Multi-armed bandit: ~1K samples
60% fewer samples
No winner after 1K iterations
Classic A/B/C: ~5K samples
Multi-armed bandit: ~1K samples
80% fewer samples
No winner after 1K iterations
Classic A/B/C: ~2.8K samples
Multi-armed bandit: ~1K samples
64% fewer samples
Win for variation “a”.
Classic A/B/C: ~1.8K samples
Multi-armed bandit: ~1K samples
45% fewer samples
Disadvantages
• Reaching significance for non-winning arms takes longer
• Unclear stopping criteria; application-specific heuristics are needed
• Hard to rank the non-winning arms and to reliably assess their impact
Advantages
• Reaching significance for the winning arm is faster
• Best arm can change over time
• There are no false positives in the long term
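The "best arm can change over time" point can be illustrated with a constant-step-size variant of epsilon-greedy: a fixed learning rate keeps the estimates recency-weighted, so the bandit tracks non-stationary conversion rates. This is a hypothetical sketch; the rates, phases, and parameters are invented.

```python
import random

def adaptive_bandit(phases, epsilon=0.1, alpha=0.05, seed=1):
    """Epsilon-greedy with a constant step size, so the estimates
    track non-stationary (phase-by-phase) conversion rates."""
    rng = random.Random(seed)
    n = len(phases[0][1])
    est = [0.5] * n                  # optimistic initial estimates
    for steps, rates in phases:
        for _ in range(steps):
            if rng.random() < epsilon:
                arm = rng.randrange(n)
            else:
                arm = est.index(max(est))
            reward = 1.0 if rng.random() < rates[arm] else 0.0
            est[arm] += alpha * (reward - est[arm])   # recency-weighted update
    return est

# Arm 0 converts best at first, then arm 1 takes over; the estimates follow,
# so the bandit shifts traffic to the new winner.
est = adaptive_bandit([(3000, [0.10, 0.02]), (3000, [0.02, 0.10])])
print(est)
```

A sample-average estimate would converge to the arms' long-run means and react ever more slowly; the constant step size is what lets the policy follow a drifting best arm.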
• How can we locate the city of Bristol from tweets?
• 10K candidate locations organised in a 100x100 grid
• At every step we get tweets from one location and count the number of mentions of the word “Bristol”
• Challenge: find the target in sub-linear time complexity!
• Contextual bandits can tackle this problem
• We proposed KernelUCB, a non-linear, contextual flavour of MAB.
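The kernelised-UCB idea can be sketched as follows — a toy version under invented assumptions (a 20x20 grid instead of 100x100, a simulated "mentions" signal, hypothetical hyper-parameters), not the paper's exact algorithm. Contexts are grid coordinates, a kernel ridge regression estimates the mean reward of every cell, and at each step we query the cell maximising the mean plus an uncertainty bonus.

```python
import numpy as np

def kernel_ucb_grid(grid=20, steps=150, lam=0.1, beta=0.5, length=2.0, seed=0):
    """Kernelised UCB over a grid: query the cell maximising the
    kernel-ridge mean estimate plus an exploration bonus derived
    from the predictive variance."""
    rng = np.random.default_rng(seed)
    cells = np.array([(i, j) for i in range(grid) for j in range(grid)], float)
    target = np.array([13.0, 6.0])          # hidden location (invented)

    def rbf(A, B):                          # RBF kernel matrix between point sets
        d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
        return np.exp(-d2 / (2 * length ** 2))

    def mentions(x):                        # simulated noisy signal, peaked at the target
        return np.exp(-((x - target) ** 2).sum() / 30.0) + 0.05 * rng.standard_normal()

    X, y = [], []
    arm = cells[rng.integers(len(cells))]
    for _ in range(steps):
        X.append(arm)
        y.append(mentions(arm))
        K_inv = np.linalg.inv(rbf(np.array(X), np.array(X)) + lam * np.eye(len(X)))
        k = rbf(cells, np.array(X))
        mean = k @ K_inv @ np.array(y)                     # kernel-ridge mean per cell
        var = np.maximum(1.0 - (k @ K_inv * k).sum(1), 0.0)  # predictive variance
        arm = cells[np.argmax(mean + beta * np.sqrt(var))]   # UCB rule
    return arm
```

Because the RBF kernel generalises each observation to nearby cells, the search homes in on the peak without scanning every cell — mirroring the sub-linear behaviour the slides describe.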
• The last few steps of the algorithm before it locates Bristol.
Technical description: M. Valko, N. Korda, R. Munos, I. Flaounas, N. Cristianini, “Finite-Time Analysis of Kernelised Contextual Bandits”, UAI, 2013.
Target is the red dot.
KernelUCB Matlab code: http://www.complacs.org/pmwiki.php/CompLACS/KernelUCB
KernelUCB with an RBF kernel converges after ~300 iterations (instead of >>10K).