the value of knowing a demand curve: regret bounds for online posted-price auctions

52
The Value of Knowing a Demand Curve: Regret Bounds for Online Posted-Price Auctions Bobby Kleinberg and Tom Leighton

Upload: kioko

Post on 08-Feb-2016

23 views

Category:

Documents


0 download

DESCRIPTION

The Value of Knowing a Demand Curve: Regret Bounds for Online Posted-Price Auctions. Bobby Kleinberg and Tom Leighton. Introduction. How do we measure the value of knowing the demand curve for a good?. Introduction. How do we measure the value of knowing the demand curve for a good? - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

The Value of Knowing a Demand Curve: Regret Bounds for Online

Posted-Price Auctions

Bobby Kleinberg and Tom Leighton

Page 2: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Introduction

How do we measure the value of knowing the demand curve for a good?

Page 3: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Introduction

How do we measure the value of knowing the demand curve for a good?

Mathematical formulation: What is the difference in expected revenue between an informed seller who knows the

demand curve, and an uninformed seller using an adaptive

pricing strategy… assuming both pursue the optimal strategy.

Page 4: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Online Posted-Price Auctions 1 seller, n buyers, each

wants one item. Buyers interact with

seller one at a time. Transaction:

Seller posts price.

Page 5: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Online Posted-Price Auctions 1 seller, n buyers, each

wants one item. Buyers interact with

seller one at a time. Transaction:

Seller posts price. Buyer arrives.

Page 6: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Online Posted-Price Auctions 1 seller, n buyers, each

wants one item. Buyers interact with

seller one at a time. Transaction:

Seller posts price. Buyer arrives. Buyer gives

YES/NOresponse.

6¢YES

Page 7: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Online Posted-Price Auctions 1 seller, n buyers, each

wants one item. Buyers interact with

seller one at a time. Transaction:

Seller posts price. Buyer arrives. Buyer gives

YES/NOresponse.

Seller may update price

YES 10¢

after each transaction.

Page 8: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Online Posted-Price Auctions A natural transaction

model for many forms of commerce, including web commerce. (Our motivation came from ticketmaster.com.)

10¢

Page 9: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Online Posted-Price Auctions A natural transaction

model for many forms of commerce, including web commerce. (Our motivation came from ticketmaster.com.)

Clearly strategyproof, since agents’ strategic behavior is limited to their YES/NO responses.

10¢

Page 10: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Informed vs. Uninformed Sellers

Uninformed Informed

Page 11: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Informed vs. Uninformed Sellers

InformedUninformed

Value Ask Revenue Ask Revenue

.8

.8

.8

Page 12: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Informed vs. Uninformed Sellers

InformedUninformed

Value Ask Revenue Ask Revenue

.5 .8

.8

.8

Page 13: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Informed vs. Uninformed Sellers

Value Ask Revenue Ask Revenue

.9 .5 .5 .8 .8

.8

.8

InformedUninformed

Page 14: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Informed vs. Uninformed Sellers

Value Ask Revenue Ask Revenue

.9 .5 .5 .8 .8

.75 .8

.8

InformedUninformed

Page 15: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Informed vs. Uninformed Sellers

Value Ask Revenue Ask Revenue

.9 .5 .5 .8 .8

.7 .75 0 .8 0

.8

InformedUninformed

Page 16: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Informed vs. Uninformed Sellers

Value Ask Revenue Ask Revenue

.9 .5 .5 .8 .8

.7 .75 0 .8 0

.6 .8

InformedUninformed

Page 17: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Informed vs. Uninformed Sellers

Value Ask Revenue Ask Revenue

.9 .5 .5 .8 .8

.7 .75 0 .8 0

.8 .6 .6 .8 .8

InformedUninformed

Page 18: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Informed vs. Uninformed Sellers

Value Ask Revenue Ask Revenue

.9 .5 .5 .8 .8

.7 .75 0 .8 0

.8 .6 .6 .8 .8

1.1 1.6Ex ante regret = 0.5

InformedUninformed

Page 19: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Informed vs. Uninformed Sellers

Value Ask Revenue Ask Revenue

.9 .5 .5 .7 .7

.7 .75 0 .7 .7

.8 .6 .6 .7 .7

1.1 2.1Ex post regret = 1.0

InformedUninformed

Page 20: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Definition of Regret

Regret = difference in expected revenue between informed and uninformed seller.

Ex ante regret corresponds to asking, “What is the value of knowing the demand curve?”

Competitive ratio was already considered by Blum, Kumar, et al (SODA’03). They exhibited a (1+ε)-competitive pricing strategy under a mild hypothesis on the informed seller’s revenue.

Page 21: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

3 Problem Variants

Identical valuations: All buyers have same threshold price v, which is unknown to seller.

Random valuations: Buyers are independent samples from a fixed probability distribution (demand curve) which is unknown to seller.

Worst-case valuations: Make no assumptions about buyers’ valuations, they may be chosen by an oblivious adversary.

Always assume prices are between 0 and 1.

Page 22: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Regret Bounds for the Three Cases

Valuation Model Lower Bound Upper Bound

Identical Ω(log log n) O(log log n)

Random Ω(n1/2) O((n log n)1/2)

Worst-Case Ω(n2/3) O(n2/3(log n)1/3)

Ex postEx ante

Page 23: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Identical Valuations

Exponentially better than binary search!! Equivalent to a question considered by Karp,

Koutsoupias, Papadimitriou, Shenker in the context of congestion control. (KKPS, FOCS 2000).

Our lower bound settles two of their open questions.

Valuation Model Lower Bound Upper Bound

Identical Ω(log log n) O(log log n)

Page 24: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Random Valuations

x

D(x)

01

1

Demand curve:D(x) = Pr(accepting price x)

Page 25: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Best “Informed” Strategy

x

D(x)

01

1

Expected revenue at price x:f(x) = xD(x).

Page 26: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Best “Informed” Strategy

x

D(x)

01

1

If demand curve is known, best strategy is fixed price maximizing area of rectangle.

Page 27: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Best “Informed” Strategy

x

D(x)

01

1

If demand curve is known, best strategy is fixed price maximizing area of rectangle.

Best known uninformed strategy is based on the multi-armed bandit problem...

Page 28: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

The Multi-Armed Bandit Problem You are in a casino with K

slot machines. Each generates random payoffs by i.i.d. sampling from an unknown distribution.

Page 29: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

0.3

The Multi-Armed Bandit Problem You are in a casino with K

slot machines. Each generates random payoffs by i.i.d. sampling from an unknown distribution.

You choose a slot machine on each step and observe the payoff.

0.7 0.4

0.2 0.2 0.7

0.3 0.8 0.5

0.6 0.1 0.4

0.5 0.1 0.6

Page 30: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

The Multi-Armed Bandit Problem You are in a casino with K

slot machines. Each generates random payoffs by i.i.d. sampling from an unknown distribution.

You choose a slot machine on each step and observe the payoff.

Your expected payoff is compared with that of the best single slot machine.

0.3 0.7 0.4

0.2 0.2 0.7

0.3 0.8 0.5

0.6 0.1 0.4

0.5 0.1 0.6

Page 31: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

The Multi-Armed Bandit Problem Assuming best play:

Ex ante regret = θ(log n)[Lai-Robbins, 1986]

Ex post regret = θ(√n)[Auer et al, 1995]

Ex post bound applies even if the payoffs are adversarial rather than random.(Oblivious adversary.)

0.3 0.7 0.4

0.2 0.2 0.7

0.3 0.8 0.5

0.6 0.1 0.4

0.5 0.1 0.6

Page 32: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Application to Online Pricing

Our problem resembles a multi-armed bandit problem with a continuum of “slot machines”, one for each price in [0,1].

Divide [0,1] into K subintervals, treat them as a finite set of slot machines.

Page 33: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Application to Online Pricing

Our problem resembles a multi-armed bandit problem with a continuum of “slot machines”, one for each price in [0,1].

Divide [0,1] into K subintervals, treat them as a finite set of slot machines.

The existing bandit algorithms have regret O(K2 log n + K-2n), provided xD(x) is smooth and has a unique global max in [0,1].

Optimizing K yields regret O((n log n)½).

Page 34: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

The Continuum-Armed Bandit The continuum-armed

bandit problem has algorithms with regret O(n¾), when exp. payoff depends smoothly on the action chosen.

Finite-Armed

2 0א -Armed

Ex Ante θ(log n)

O(n¾)

Ex Post θ(√n)

Page 35: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

The Continuum-Armed Bandit The continuum-armed

bandit problem has algorithms with regret O(n¾), when exp. payoff depends smoothly on the action chosen.

But: Best-known lower bound on regret was Ω(log n) coming from the finite-armed case.

Finite-Armed

2 0א -Armed

Ex Ante θ(log n) Ω(log n)

O(n¾)

Ex Post θ(√n)

?

Page 36: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

The Continuum-Armed Bandit The continuum-armed

bandit problem has algorithms with regret O(n¾), when exp. payoff depends smoothly on the action chosen.

But: Best-known lower bound on regret was Ω(log n) coming from the finite-armed case.

We prove: Ω(√n).

Finite-Armed

2 0א -Armed

Ex Ante θ(log n)Ω(√n)O(n¾)

Ex Post θ(√n)

?

Page 37: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Lower Bound: Decision Tree Setup

x

D(x)

01

1

Page 38: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Lower Bound: Decision Tree Setup

½

¼ ¾

⅛ ⅜ ⅝ ⅞x

D(x)

01

1 0.3

Page 39: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Lower Bound: Decision Tree Setup

½

¼ ¾

⅛ ⅜ ⅝ ⅞x

D(x)

01

1 0.2

Page 40: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Lower Bound: Decision Tree Setup

½

¼ ¾

⅛ ⅜ ⅝ ⅞x

D(x)

01

1 0.4

Page 41: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Lower Bound: Decision Tree Setup

½

¼ ¾

⅛ ⅜ ⅝ ⅞

vi ALG OPT Reg.

0.3 0 0.3 0.3

0.2 0 0 0

0.4 0.125 0.3 0.175

0.125 0.6 0.475

Page 42: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

How not to prove a lower bound! Natural idea: Lower

bound on incremental regret at each level…

½

¼ ¾

⅛ ⅜ ⅝ ⅞

Page 43: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

How not to prove a lower bound! Natural idea: Lower

bound on incremental regret at each level…

If regret is Ω(j-½) at level j, then total regret after n steps would be Ω(√n).

½

¼ ¾

⅛ ⅜ ⅝ ⅞

1

√½

√⅓

1 + √½ + √⅓ + … = Ω(√n)

Page 44: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

How not to prove a lower bound! Natural idea: Lower

bound on incremental regret at each level…

If regret is Ω(j-½) at level j, then total regret after n steps would be Ω(√n).

This is how lower bounds were proved for the finite-armed bandit problem, for example.

½

¼ ¾

⅛ ⅜ ⅝ ⅞

1

√½

√⅓

1 + √½ + √⅓ + … = Ω(√n)

Page 45: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

How not to prove a lower bound! The problem: If you only

want to minimize incremental regret at level j, you can typically make it O(1/j).

Combining the lower bounds at each level gives only the very weak lower bound Regret = Ω(log n).

½

¼ ¾

⅛ ⅜ ⅝ ⅞

1

½

1 + ½ + ⅓ + … = Ω(log n)

Page 46: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

How to prove a lower bound So instead a subtler

approach is required. Must account for the cost

of experimentation. We define a measure of

knowledge, KD such that regret scales at least linearly with KD.

KD = ω(√n) → TOO COSTLY

KD = o(√n) → TOO RISKY

½

¼ ¾

⅛ ⅜ ⅝ ⅞

Page 47: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Discussion of lower bound

Our lower bound doesn’t rely on a contrived demand curve. In fact, we show that it holds for almost every demand curve satisfying some “generic” axioms. (e.g. smoothness)

Page 48: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Discussion of lower bound

Our lower bound doesn’t rely on a contrived demand curve. In fact, we show that it holds for almost every demand curve satisfying some “generic” axioms. (e.g. smoothness)

The definition of KD is quite subtle. This is the hard part of the proof.

Page 49: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Discussion of lower bound

Our lower bound doesn’t rely on a contrived demand curve. In fact, we show that it holds for almost every demand curve satisfying some “generic” axioms. (e.g. smoothness)

The definition of KD is quite subtle. This is the hard part of the proof.

An ex post lower bound of Ω(√n) is easy. The difficulty is solely in strengthening it to an ex ante lower bound.

Page 50: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Open Problems

Close the log-factor gaps in random and worst-case models.

Page 51: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Open Problems

Close the log-factor gaps in random and worst-case models.

What if buyers have some control over the timing of their arrival? Can a temporally strategyproof mechanism have o(n) regret? [Parkes]

Page 52: The Value of Knowing a Demand Curve: Regret Bounds for Online  Posted-Price Auctions

Open Problems

Close the log-factor gaps in random and worst-case models.

What if buyers have some control over the timing of their arrival? Can a temporally strategyproof mechanism have o(n) regret? [Parkes]

Investigate online posted-price combinatorial auctions, e.g. auctioning paths in a graph. [Hartline]