game theory lecture 7

Post on 10-Jan-2016

38 Views

Category:

Documents

2 Downloads

Preview:

Click to see full reader

DESCRIPTION

Game Theory Lecture 7. problem set 7. from Osborne’s Introd. To G.T. p.442 Ex. 442.1 p.443 Ex. 443.1. from Binmore’s Fun and Games. p.342 Ex. 38,40 (p.383 Ex. 8,9 p. 388 Ex. 24,29). payoffs. Correlated Equilibria. The equilibria of this game are:. [B,B]. (2 , 1). [X,X]. - PowerPoint PPT Presentation

TRANSCRIPT

1

22

problem set 7

from Osborne’sIntrod. To G.T.

p.442 Ex. 442.1p.443 Ex. 443.1

p.342 Ex. 38,40(p.383 Ex. 8,9p. 388 Ex. 24,29)

from Binmore’sFun and Games

3

Correlated EquilibriaB X

B 2 , 1 0 , 0

X 0 , 0 1 , 2

The equilibria of this game are:

[B,B]

[X,X]

and the mixed strategy equilibrium:

[(2/3 , 1/3),(1/3 , 2/3) ] (2/3 , 2/3)

(2 , 1)

(1 , 2)payoffs

4

Correlated EquilibriaB X

B 2 , 1 0 , 0

X 0 , 0 1 , 2

The equilibria of this game are:

[B,B]

[X,X]

and the mixed strategy equilibrium:

[(2/3 , 1/3),(1/3 , 2/3) ] (2/3 , 2/3)

(2 , 1)

(1 , 2)payoffs

5

Correlated EquilibriaB X

B 2 , 1 0 , 0

X 0 , 0 1 , 2

(2/3 , 2/3)(2 , 1) (1 , 2)

π1

π2

6

π1

π2

Correlated EquilibriaB X

B 2 , 1 0 , 0

X 0 , 0 1 , 2

A roulette wheelseen by both players

By varying the roulette one can obtain each

point in the convex hull

7

π1

π2

Correlated EquilibriaB X

B 2 , 1 0 , 0

X 0 , 0 1 , 2

If roulette stops on redred play BBif on greengreen play XXif on greygrey play (2/3,1/3)(2/3,1/3) or (1/3,2/3)

is a Nash equilibrium

The strategy pair

8

π1

π2

Correlated EquilibriaB X

B 2 , 1 0 , 0

X 0 , 0 1 , 2

Can one do more with this coordination device?

D H

D 2 , 2 0 , 3

H 3 , 0 -1 , -1

9

π1

π2

Correlated Equilibria

Can one do more with this coordination device?

D H

D 2 , 2 0 , 3

H 3 , 0 -1 , -1

10

D H

D 2 , 2 0 , 3

H 3 , 0 -1 , -1

Hawk Dove game‘Chicken’

D H

D 2 , 2 0 , 3

H 3 , 0 -1 , -1

D H

D 2 , 2 0 , 3

H 3 , 0 -1 , -1

Correlated Equilibria

Nash Equilibria:

[H,D]

[D,H]

A mixed strategy equilibrium[ (1/2 , 1/2) , (1/2 , 1/2) ]

11

D H

D 2 , 2 0 , 3

H 3 , 0 -1 , -1

Correlated Equilibria

Nash Equilibria:

[H,D]

[D,H]

A mixed strategy equilibrium[ (1/2 , 1/2) , (1/2 , 1/2) ]π1

π2

Hawk Dove game‘Chicken’

D H

D 2 , 2 0 , 3

H 3 , 0 -1 , -1Not a Nash Equilibrium

12

Correlated Equilibria

π1

π2

Hawk Dove game‘Chicken’

D H

D 2 , 2 0 , 3

H 3 , 0 -1 , -1

Nash Equilibria:

[H,D]

[D,H]

A mixed strategy equilibrium[ (1/2 , 1/2) , (1/2 , 1/2) ]

αβ

γ

A referee chooses one of the

three cells with the probabilities

α + β + γ = 1

The players agree on probabilities

α, β,γ

13

Correlated Equilibria

π1

π2

Hawk Dove game‘Chicken’

D H

D 2 , 2 0 , 3

H 3 , 0 -1 , -1

αβ

γ

A referee chooses one of the

three cells with the probabilities

α + β + γ = 1

The players agree on probabilities

α, β,γ

14

Correlated EquilibriaD H

D 2 , 2 0 , 3

H 3 , 0 -1 , -1

αβ

γWhen he chose a cell, the referee

tells player the row of the cell

and player the column of the cell.

1

2

This is interpreted as a recommendation

to play the row (column) strategy.

When is it an Equilibrium to follow the referees advice?

15

Correlated EquilibriaD H

D 2 , 2 0 , 3

H 3 , 0 -1 , -1

αβ

γ

When a player has heard ,

he prefers to play it H

!!!!

.

When player heard ,

he knows that player heard

with probability with probability

1 D

2

β αD , H

α + β α + β

For him, player 2 mixes

β α,

α + β α + β

16

Correlated EquilibriaD H

D 2 , 2 0 , 3

H 3 , 0 -1 , -1

αβ

γ

if player 2 mixes

β α,

α + β α + β

2β 0α

α + β α + β

3β α

α + β α + β

he prefers to play ifD α β

17

Correlated EquilibriaD H

D 2 , 2 0 , 3

H 3 , 0 -1 , -1

αβ

γ

β

β + γ

he prefers to play ifD

A similar argument for player 2 :

γ β

γ

β + γ

β + γ

3β - γ

β + γ

18

D H

D 2 , 2 0 , 3

H 3 , 0 -1 , -1

α

γ β

Correlated EquilibriaThe referee's recommendations are

self enforcing (Nash Equil.) if

α β

β

γ

α + β + γ = 1

What payoffs can be obtained ???

19

D H

D 2 , 2 0 , 3

H 3 , 0 -1 , -1

α

Correlated Equilibriaβ

γ

π1

π2

γ βα β

α + β + γ = 1

All payoffs in this area can be achieved by choosing α,β,γ.

20

D H

D 2 , 2 0 , 3

H 3 , 0 -1 , -1

α

Correlated Equilibriaβ

γ

π1

π2

γ βα β

α + β + γ = 1

5 5,

3 3

1α = β = γ =

3

With a simple Roulette

21

Correlated Equilibria

Robert Aumann

22

C D

C 2 , 2 0 , 3

D 3 , 0 1 , 1

The Prisoners’ Dilemma C D

C 2 , 2 0 , 3

D 3 , 0 1 , 1The unique Nash Equilibrium:

23

The Prisoners’ Dilemma C D

C 2 , 2 0 , 3

D 3 , 0 1 , 1The unique Nash Equilibrium:

24

C D

C 2 , 2 0 , 3

D 3 , 0 1 , 1

In a repeated game, there may be a possibility of coperation by

Playing D in every stage is a Nash equilibrium of the repeated game.

punishing

deviations from cooperation.

25

C D

C 2 , 2 0 , 3

D 3 , 0 1 , 1

The grim (trigger) strategy

1. Begin by playing C and do not initiate a deviation from C2. If the other played D, play D for ever after.

26

i.e. is the pair (grim , grim) a N.E. ??

C D

C 2 , 2 0 , 3

D 3 , 0 1 , 1

The grim (trigger) strategy

1. Begin by playing C and do not initiate a deviation from C2. If the other played D, play D for ever after.

Is the grim strategy a Nash equilibrium?

If both play grim, they never defect

Not in a finitely repeated Prisoners’

Dilemma.

27

C D

C 2 , 2 0 , 3

D 3 , 0 1 , 1

The grim (trigger) strategy

1. Begin by playing C and do not initiate a deviation from C2. If the other played D, play D for ever after.

The grim strategy is Not a Nash equilibrium

in a finitely repeated Prisoners’ Dilemma.

Given that the other plays grim it pays to deviate in the last period and play D

Indeed, to obtain cooperation in the repeated P.D. it is necessary* to have infinite repetitions.

* It will be shown later

28

An infinitely repeated game

1

2

D

C

2

1

22

CC D

1

22

1

22

1

22

D

1C D

1C D

history is:[D,D]history is:{ [C,D], [C,C] }

sub-games

29

An infinitely repeated game

where ai is a vector of actions taken at time iai is [C,C] or [DC] etc.

A history at time t is: { a1, a2, ….. at }

A strategy is a function that assigns an action for each history.

for all histories

1 2 t

1 2 t

S a ,a , ......a C,D

a ,a , ......a

30

An infinitely repeated game

The payoff of player 1 following a history { a1, a2, ….. at,...… }is a stream { G1(a1), G1(a2), ….. G1(at)...… }

one way of evaluating an infinite

stream of incomes

is as , where

0 1

t tt

t=0 t=0

w ,w , .....

c

δ c = δ w

t t

t=0 t=0

cδ c = c δ

1 - δ

tt

t=0

c = 1 - δ δ w

t0 1 t t

t=0

u w ,w , ...w , ...... = 1 - δ δ w

a discount factor 0 δ 1

top related