commitment to correlated strategies

20
Commitment to Correlated Strategies Dima cent Conitzer, Dmytro Korzhyk. Commitment to Correl tegies. In AAAI-2011.]

Upload: neka

Post on 25-Feb-2016

29 views

Category:

Documents


0 download

DESCRIPTION

Commitment to Correlated Strategies. Dima. [Vincent Conitzer , Dmytro Korzhyk . Commitment to Correlated Strategies. In AAAI-2011.]. Games in Normal Form. Player 2. L. R. U. Player 1. D. A player’s strategy is a distribution over the player’s actions. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Commitment to Correlated Strategies

Commitment to Correlated Strategies

Dima

[Vincent Conitzer, Dmytro Korzhyk. Commitment to Correlated Strategies. In AAAI-2011.]

Page 2: Commitment to Correlated Strategies

Games in Normal Form

1, 1 3, 00, 0 2, 1

Player 1

Player 2

U

D

L R

A strategy profile is a pair of strategies (pure or randomized)

An outcome of the game is an entry in the matrix

A player’s strategy is a distribution over the player’s actions

Page 3: Commitment to Correlated Strategies

Nash equilibrium

1, 1 3, 0

0, 0 2, 1Player 1

Player 2

U

D

L R

An NE is a strategy profile in which no player has an incentive to deviate.

Page 4: Commitment to Correlated Strategies

Computing a Nash Equilibrium

Iterated dominance works in this case

1, 1 3, 00, 0 2, 1

Player 1

Player 2

U

D

L R

Generally, there is no known polytime algorithm [PPAD-completeness: Daskalakis, Goldberg & Papadimitriou ‘06; Chen & Deng ’06; NP-hardness of NE with certain properties: Gilboa & Zemel ’89; Conitzer & Sandholm ‘08]

Dominated

Page 5: Commitment to Correlated Strategies

Stackelberg model

• Suppose the row player (the leader) can commit to a strategy

1, 1 3, 0

0, 0 2, 1Leader

Follower

U

D

L R

The leader benefits from commitment!

Page 6: Commitment to Correlated Strategies

Commitment to a mixed strategy

• Suppose the leader commits to (2/3 Down, 1/3 Up)

1, 1 3, 00, 0 2, 1

Leader

Follower

UD

L R

• Commitment to a mixed strategy benefits the leader even more• The optimal strategy to commit to is (50%-eps, 50%+eps)• Can be computed in polytime [Conitzer & Sandholm ’06, von

Stengel & Zamir ‘10]

1/3

2/3

Page 7: Commitment to Correlated Strategies

Applications of the Stackelberg model

• Resource allocation for airport security[Pita et al., AI Magazine ‘09]

• Scheduling of federal air marshals[Tsai et al., AAMAS ‘09]

• GUARDS system for TSA resource allocation

[Pita et al., AAMAS ’11]Photo AP

Photo STL airport

Page 8: Commitment to Correlated Strategies

LP1: Computing a Stackelberg strategy

• Given the leader’s strategy , the follower maximizes • There is always a pure-strategy best response• Idea: write an LP for each best-response , choose the

max leader’s utility among the feasible LPs

𝑀𝑎𝑥𝑖𝑚𝑖𝑧𝑒∑𝑠 1𝑢1 (𝑠1 , 𝑠2∗ )𝑝 (𝑠1)

∑𝑠 1𝑢2 (𝑠1 ,𝑠2∗ )𝑝 (𝑠1)≥∑

𝑠 1𝑢2 (𝑠1 , 𝑠2❑)𝑝 (𝑠1)

∑𝑠 1𝑝(𝑠1)=1

Objective: leader’s utility

Subject to the follower’s rationality

∀𝑠2:

[Conitzer and Sandholm ’06, von Stengel and Zamir ‘10]

Page 9: Commitment to Correlated Strategies

New idea: Commitment to a correlated strategy

• The leader draws from a distribution over the outcomes

1, 1 3, 0

0, 0 2, 1

U

D

L R

20%40%

10% 30%

• The follower only gets to know the column• The follower should have no incentive to deviate• We will look for a correlated strategy that maximizes the

leader’s utility

Leader

Follower

Page 10: Commitment to Correlated Strategies

Equivalence to Stackelberg

Proposition 1. There exists an optimal correlated strategy to commit to in whichthe follower always gets the same recommendation.

Page 11: Commitment to Correlated Strategies

Proof of Proposition 1

P (𝑠1 ,𝑠2 )=P (𝑠2 ) P(𝑠1∨𝑠2)

UCD

L M R

Set , where maximizes

Follower’s rationality: each is a best-response to P(𝑠2)

P(𝑠1∨𝑠2)

Break the correlated strategy into two components:

The leader can rearrange without breaking the follower’s rationality condition

Page 12: Commitment to Correlated Strategies

LP2 for computing an optimal correlated strategy to commit to

• We can use this LP to compute an optimal Stackelberg strategy!

𝑀𝑎𝑥𝑖𝑚𝑖𝑧𝑒∑𝑠 1∑𝑠2𝑢1 (𝑠1 ,𝑠2 )𝑝 (𝑠1 ,𝑠2)

∑𝑠 1∑𝑠2𝑝 (𝑠1 ,𝑠2)=1

∑𝑠 1𝑢2 (𝑠1 , 𝑠2 )𝑝 (𝑠1 ,𝑠2)≥∑

𝑠1𝑢2 (𝑠1,𝑠2′ )𝑝 (𝑠1 ,𝑠2❑)

Objective – leader’s utility

Follower’s rationality

∀𝑠2 ,𝑠2′ :

Page 13: Commitment to Correlated Strategies

Experimental evaluation• The single LP actually runs faster than LP1, MIP for many game classes (on 50x50 games, using CPLEX, GAMUT games [Nudelman et al. ‘04]):

• Downside: the single LP uses more memory.

BidirectionalLEG CovariantGame DispersionGame GrabTheDollar

MinimumEffortMajorityVotingLocationGameGuessTwoThirds

PolymatrixGame RandomGame

TravelersDilemma UniformLEG WarOfAttrition

RandomLEGRandomGraphical

RandomZeroSum

Page 14: Commitment to Correlated Strategies

Correlated equilibrium

• A 3rd party proposes a distribution over the outcomes• Now, both players should have no incentive to deviate

∑𝑠 1∑𝑠2𝑝 (𝑠1 ,𝑠2)=1

∑𝑠 1𝑢2 (𝑠1, 𝑠2 )𝑝 (𝑠1 ,𝑠2)≥∑

𝑠1𝑢2 (𝑠1 ,𝑠2′ )𝑝 (𝑠1 ,𝑠2❑)Player 2’s

rationality∀𝑠2 ,𝑠2′ :

∑𝑠 2𝑢1 (𝑠1 , 𝑠2 )𝑝 (𝑠1 ,𝑠2)≥∑

𝑠2𝑢1 (𝑠1′ ,𝑠2❑ )𝑝 (𝑠1 ,𝑠2❑)Player 1’s

rationality∀𝑠1 ,𝑠1′ :

Page 15: Commitment to Correlated Strategies

Stackelberg vs Correlated Equilibrium

Corollary 1. The leader’s Stackelberg utility is at least as high as the leader’s utility in any correlated equilibrium of the game.

[von Stengel and Zamir ‘10]

Page 16: Commitment to Correlated Strategies

Commitment to correlated strategies for n>2 players

• A generalization of LP2• With n>2 players, the optimal correlated strategy

to commit to may not be a product distribution

P= productdistribution

D= degeneratedistribution

Page 17: Commitment to Correlated Strategies

Example: Commitment to a correlated strategy with 3 players

Utilities

2 1

50%

50%

Optimal correlated strategy to commit to:

Leader

Different from Stackelberg and CE!

Page 18: Commitment to Correlated Strategies

Advantages of commitment to a correlated equilibrium

• Same as Stackelberg for n=2 players

• Well defined for n>2 players (unlike the Stackelberg model)

• Easy to compute for any number of players

Page 19: Commitment to Correlated Strategies

Overview of contributions

• A single LP for Stackelberg (for 2 players)

• Easy proof of relationship between Stackelberg and correlated equilibrium (for 2 players)

• Model for commitment to a correlated strategy (for n≥2 players)

• LP for commitment to a correlated strategy (for n≥2 players)

Page 20: Commitment to Correlated Strategies

Thank you!