price of anarchy in games of incomplete information

18
Tim Roughgarden PRICE OF ANARCHY IN GAMES OF INCOMPLETE INFORMATION Alon Ardenboim

Upload: nanji

Post on 24-Feb-2016

44 views

Category:

Documents


0 download

DESCRIPTION

Price of Anarchy in Games of Incomplete Information. Tim Roughgarden. Alon Ardenboim. Full Information Games. The players payoffs are common knowledge. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Price of Anarchy in Games of Incomplete Information

Tim Roughgarden

PRICE OF ANARCHY IN GAMES OF INCOMPLETE

INFORMATION

Alon Ardenboim

Page 2: Price of Anarchy in Games of Incomplete Information

The players payoffs are common knowledge.Pure (Mixed) Nash equilibrium – each players

maximizes his utility (in expectation) when sticking with his current (probabilistic) strategy.

FULL INFORMATION GAMES

Page 3: Price of Anarchy in Games of Incomplete Information

Choose a goal function (e.g. welfare maximization).How bad can an equilibrium be w.r.t. the optimal

outcome (e.g. maximum welfare)?.

PRICE OF ANARCHY

Page 4: Price of Anarchy in Games of Incomplete Information

Players are uncertain about each other payoffs.For example, auctions (eBay), VCG mechanisms.Assume players’ private preferences are drawn

independently from prior distributions.Distributions ARE common knowledge.

INCOMPLETE INFORMATION GAMES

Page 5: Price of Anarchy in Games of Incomplete Information

Type space .Action space . sampled from . is common knowledge.A strategy is a function from type space to a

distribution over actions .A strategy profile is a Bayes-Nash equilibrium if for

every , type and action ,

BAYES-NASH EQUILIBRIUM

Page 6: Price of Anarchy in Games of Incomplete Information

The corresponding PoA of such a games measures how bad is the worst Bayes-Nash equilibrium w.r.t the optimal value.

That is,

When is a product dist., this is iPoA (independent).Otherwise, we talk about cPoA (correlated).

BAYES-NASH POA

Page 7: Price of Anarchy in Games of Incomplete Information

Def: A game is -smooth w.r.t outcome and a maximization objective function if for every ,

W is payoff-dominating if it bounds the sum of players’ payoffs from above (non-negative transfers).

Thm: if a game is -smooth w.r.t. an optimal outcome for a payoff-dominating then PoA.

Let be a Nash Eq., we have:

SMOOTH FULL INFORMATION GAMES

Page 8: Price of Anarchy in Games of Incomplete Information

Def: Let be a game structure and a maximization objective function. The structure is -smooth w.r.t. social choice function if for every and feasible to , we have

Thm: If a game structure is -smooth w.r.t. an optimal choice function for a payoff-dominating , then the iPoA of the game w.r.t. .

SMOOTH INCOMPLETE INFORMATION GAMES

Page 9: Price of Anarchy in Games of Incomplete Information

Let be an optimal choice function (that is, if every player plays we get ).

Let be a Bayes-Nash equilibrium. In strategy player samples and plays .

PROOF OF THEOREM

Page 10: Price of Anarchy in Games of Incomplete Information

We have:

PROOF CONT.

(Payoff dominant)

(Lin. of Exp.)

(Equilibrium)

(Def.)

(Smooth)

(Lin. of Exp.)

Bayes-Nash

OPT

Page 11: Price of Anarchy in Games of Incomplete Information

In the Generalized Second Prize (GSP) auction there are ad slots in a web page. Each with an associated click-through rate.

Each bidder has a private information – valuation per click .

No player overbids (feasible space of bids is ).Assume .

APPLICATION TO GSP

𝛼1

𝛼2

𝛼3

…𝛼𝑘

Page 12: Price of Anarchy in Games of Incomplete Information

Assume player gets bids the highest bid.Allocation: assign the slot with CTR .Payment: Charge player the highest bid.Payoff: if .

otherwise ( if bid is feasible).

GSP (CONT.)

Page 13: Price of Anarchy in Games of Incomplete Information

Thm: The GSP is a -smooth game (and therefore the iPoA is ) w.r.t. welfare maximization goal function.

Proof: Consider welfare maximization (payoff dominant). Let’s take the social choice function (). Easy to see it’s optimal. Fix a type vector of players valuations and an outcome

(arbitrary bids). Assume . Let denote the index of the highest bidder.

SMOOTHNESS OF GSP

Page 14: Price of Anarchy in Games of Incomplete Information

Claim:

for every . :

SMOOTHNESS PROOF (CONT.)

𝛼1

…𝛼 𝑗

…𝛼 𝑖…𝛼𝑘

Page 15: Price of Anarchy in Games of Incomplete Information

Claim:

for every . :

SMOOTHNESS PROOF (CONT.)

𝛼1

…𝛼 𝑗

…𝛼 𝑖…𝛼𝑘

𝛼 𝑗≥𝛼𝑖𝑏𝑖𝑑 ( 𝑗+1 )≤𝑏𝑖=𝑣𝑖 /2

𝑢𝑖¿¿𝛼 𝑗 ⋅ (𝑣𝑖−𝑏𝑖𝑑 ( 𝑗+1 ))≥12𝛼𝑖𝑣 𝑖

Page 16: Price of Anarchy in Games of Incomplete Information

Claim:

for every . :

SMOOTHNESS PROOF (CONT.)

𝛼1

…𝛼 𝑗

…𝛼 𝑖…𝛼𝑘

𝑏𝑖𝑑 (𝑖 )≥𝑣𝑖 /2

12𝛼𝑖 𝑣𝑖−𝛼𝑖𝑏𝑖𝑑 (𝑖 )≤ 0≤𝑢𝑖¿

Page 17: Price of Anarchy in Games of Incomplete Information

Summing over all players we get:

SMOOTHNESS PROOF (CONT.)

𝑊 (𝐭 ;𝐜∗ ( 𝐭 ) ) ≤𝑊 (𝐬=𝐯 ′ ;𝐚 ) ∀ 𝐯 ′≥𝐚

Page 18: Price of Anarchy in Games of Incomplete Information

Application to other games.Other smoothness variants.What to do with correlated type distributions? Is there a relation between cPoA and sPoA?

DIRECTIONS