the computational complexity of finding nash equilibria

42
The Computational Complexity of Finding Nash Equilibria Edith Elkind Intelligence, Agents, Multimedia group (IAM) School of Electronics and CS U. of Southampton

Upload: thuong

Post on 12-Jan-2016

39 views

Category:

Documents


2 download

DESCRIPTION

The Computational Complexity of Finding Nash Equilibria. Edith Elkind Intelligence, Agents, Multimedia group (IAM) School of Electronics and CS U. of Southampton. Games and Strategies. Games: strategic interactions between rational entities Solution concepts: what’s going to happen? - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: The Computational Complexity of Finding Nash Equilibria

The Computational Complexityof Finding Nash Equilibria

Edith ElkindIntelligence, Agents, Multimedia group (IAM)

School of Electronics and CS

U. of Southampton

Page 2: The Computational Complexity of Finding Nash Equilibria

Games and Strategies

• Games: strategic interactions between rational entities

• Solution concepts: what’s going to happen?– dominant strategies– Nash equilibrium– ….

• Can it be computed?– if your computer cannot find it, the market

probably cannot either

Page 3: The Computational Complexity of Finding Nash Equilibria

Matrix (Normal Form) Games

2 0

0 1

1 0

0 3

Row player:

Column player:

0

1

0 1 0 1

0

1

• finite set of players {1, …, n}

• each player has k actions

(pure strategies): 1, …, k

• payoffs of the ith player: Pi: {1, …, k}n → R

Page 4: The Computational Complexity of Finding Nash Equilibria

Nash Equilibrium

2 0

0 1

1 0

0 3

Row player:

Column player:

0

1

0 1 0 1

0

1

• Nash equilibrium: a strategy profile such that

noone wants to deviate given other players’ strategies, i.e., each player’s strategy is a best response to others’ strategies:– (0, 0) and (1, 1) are both NE

Page 5: The Computational Complexity of Finding Nash Equilibria

Pure vs. Mixed Strategies

1 -1

-1 1

-1 1

1 -1

Row player:

Column player:

H

T

H T H T

H

T

• NE in pure strategies may not exist!– “matching pennies”

• Mixed strategy: a probability distribution over actions– 50% tail, 50% head

Page 6: The Computational Complexity of Finding Nash Equilibria

Existence of NE

• Theorem (Nash 1951): any n-player k-action game in normal form has an equilibrium in mixed strategies

can we find one in poly-time?

Page 7: The Computational Complexity of Finding Nash Equilibria

Plan of the Talk

2 players, k actions

n players, 2 actions

other cool stuff

Page 8: The Computational Complexity of Finding Nash Equilibria

2 (r=const) players, k actions

• Input representation:– 2 players: two k x k matrices– r players: r k x k x … x k matrices

• poly-size for constant r

• Output representation: – for 2 players all NE are in Q– but not for 3 and more players…

• Checking for pure NE: easy – at most k2 strategy profiles

Page 9: The Computational Complexity of Finding Nash Equilibria

2 players, k actions: mixed NE

• Naïve approaches: exp(k)

• Simplex-like approach (Lemke-Howson algorithm):– works well in practice– exp(k) in the worst case (2004)

• Is it time to give up?– maybe the problem is NP-hard?

Page 10: The Computational Complexity of Finding Nash Equilibria

Is Finding NE NP-hard?

• Reminder: a problem P is NP-hard if you can reduce 3-SAT to it:– “yes”-instance 3-SAT → “yes”-instance of P– “no”-instance 3-SAT → “no”-instance of P

• Problem: each instance of NASH is a “yes”-instance!– every game has a NE

• Formally: if NASH is NP-hard then NP = coNP• Need: complexity theory for

total search problems

Page 11: The Computational Complexity of Finding Nash Equilibria

Reducibility Among Search Problems

• S associates x in X with a solution set S(x)• Total search problem: for any x, S(x) is not empty

S: X Y

T: X’ Y’

If T is easy, so is S• S is reducible to T if:

– f, g easy to compute– g(T(f(x))) is in S(x)

f g

Page 12: The Computational Complexity of Finding Nash Equilibria

Completeness Results?

• Can we prove that any total search problem is reducible to r-NASH?

• Not really: the class T of all total search problems is a semantic class– not known how to find complete problems for these

• Want to pick a large subclass S of T s.t.– S includes some natural problems– there are problems that are complete for S– in particular, r-NASH is complete for S

Page 13: The Computational Complexity of Finding Nash Equilibria

• Input: Boolean circuits S (Successor), P (Predecessor):– n inputs, n outputs– S(0n) ≠ 0n, P(0n) = 0n

• Output: x ≠ 0n s.t. – S(P(x)) ≠ x or P(S(x)) ≠ x

Intuition: G=(V, E): – V = n; – E = {(x,y) | y=S(x), x=P(y)}

END OF THE LINE

00000

01011

11001

01011

Page 14: The Computational Complexity of Finding Nash Equilibria

PPAD

• PPAD: Polynomial Parity Argument, Directed version

• PPAD is the class of all search problems that are reducible to END OF THE LINE

search problem solution

circuits S, T “end of the line”f g

Page 15: The Computational Complexity of Finding Nash Equilibria

r-NASH is in PPAD

• Proof on Nash’s theorem:– existence of NE reduces to Brouwer’s fixpoint

theorem– Brouwer’s fixpoint theorem reduces to

Sperner’s lemma– Sperner’s lemma is proven by a parity

argument (similar to END OF THE LINE)

• Reduction of r-NASH to END OF THE LINE can be extracted from these proofs (Papadimitriou 94)

Page 16: The Computational Complexity of Finding Nash Equilibria

Brouwer’s Fixpoint Theorem

• Brouwer’s Theorem: Any continuous mapping from the simplex to itself has a fixpoint.

• Nash Brouwer proof sketch:– set of all strategy profiles → simplex– mapping: (s1, …, sn) → (s1+1, …, sn+n),

where i is a shift in the direction of best response to (s1, …, si-1, si+1, …, sn)

– NE is a point where noone wants to deviate, i.e., a fixpoint

Page 17: The Computational Complexity of Finding Nash Equilibria

• Proper coloring:– vertices on BC are not blue– vertices on AC are not green– vertices on AB are not yellow

• Sperner’s Lemma: there exists a trichromatic triangle

• Brouwer’s theorem Sperner’s Lemma:– x is blue if the grad(F) at x points away from A, etc.– trichromatic triangle “has no direction” – repeat at increased resolution…

Sperner’s Lemma

A

B

C

Page 18: The Computational Complexity of Finding Nash Equilibria

Reductions (Papadimitriou 1994)

END OF THE LINE is PPAD-complete

TRICHROMATIC TRIANGLE is PPAD-complete

3D-BROUWER is PPAD-complete

r-NASH is in PPAD

Page 19: The Computational Complexity of Finding Nash Equilibria

r-NASH vs 3D BROUWER

• Existence of NE follows from Brouwer’s fixpoint theorem

• NE are special cases of Brouwer’s fixpoints– just how special?

• Can any fixpoint be represented as a NE of a game?– Is there a reduction

from 3D BROUWER to r-NASH?

Page 20: The Computational Complexity of Finding Nash Equilibria

Hardness Reductions: the Timeline

• 3D-BROUWER is PPAD-complete (Papadimitriou, 1994)

• 4-NASH is PPAD-complete (Daskalakis,Goldberg, Papadimitriou, Sep 2005)

• 3-NASH is PPAD-complete (Daskalakis, Papadimitriou, Oct 2005, Chen, Deng, Nov 2005)

• 2-NASH is PPAD-complete !!! (Chen, Deng, Dec 2005)

Page 21: The Computational Complexity of Finding Nash Equilibria

n players, 2 actions

• representation: payoffs to each player for every action profile (vector in {0, 1}n): n2n numbers

• graphical games:– players are vertices of a graph– V’s payoff depends on actions of W in N(V) U V– n players, max degree d => n2d+1 numbers

TU

V

W t=0, u=0, v=0, w=0: 12t=1, u=0, v=0, w=0: 31 ….t=1, u=1, v=1, w=1: -6

W’s payoffs(16 cases):

Page 22: The Computational Complexity of Finding Nash Equilibria

Algorithms: What Was Known

• Bounded-degree trees:– Exp-time algorithm/poly-time approximation

algorithm to find all NE (Kearns, Littman, Singh, UAI 2001)

– ??? poly-time algorithm to find a single NE (Kearns, Littman, Singh, NIPS’2001)

• Heuristics for graphs with cycles

Page 23: The Computational Complexity of Finding Nash Equilibria

Our Results (E., Goldberg, Goldberg’06)

• Algorithm in NIPS’01 paper is incorrect (does not always output a NE)

• We fix the NIPS’01 algorithm, but…– our algorithm runs in poly-time on paths– with a trick, also on cycles

• There is a graph of pathwidth 2 on which our algorithm runs in exp time– true for all algorithms that use the basic

approach of the UAI’01 paper

Page 24: The Computational Complexity of Finding Nash Equilibria

Warm-up: 2-player 2-action games

2 0

0 1

1 0

0 3

Row player:

Column player:

0

1

0 1 0 1

0

1

Suppose R plays 1 w.p. r

EP(C) from playing 0: (1-r)*1

EP(C) from playing 1: r*3

1-r > 3r iff r < ¼

Suppose C plays 1 w.p. c

EP(R) from playing 0: (1-c)*2

EP(R) from playing 1: c*1

(1-c)*2 > c iff c < 2/3

1/4 1

r

BR(C)c1

2/3BR(R)

mixed NE: r=1/4, c=2/3

Page 25: The Computational Complexity of Finding Nash Equilibria

• Potential best response: v is a PBR to w iff when W plays w, there is a NE for TV in which V plays v.

• upstream pass: construct PBRV(w) from PBRU1(v), PBRU2(v) and PBRU3(v)

• downstream pass: root selects its strategy based on the children’s PBR’s; propagates to leaves

Algorithm for Trees (KLS’01)

TV

W

V

U1U2

U3

v

w

Page 26: The Computational Complexity of Finding Nash Equilibria

Computing PBR on a Path

• E0 = EP(V) from playing 0: (1-u)(1-w)*v000+(1-u)w*v001+u(1-w)*v100+uw*v101 = auw+bu+cw+d

• E1 = EP(V) from playing 1: (1-u)(1-w)*v010+(1-u)w*v011+u(1-w)*v110+uw*v111 = a’uw+b’u+c’w+d’

• E0 = E1 iff w = (Au+B)/(Cu+D) = f(u)

U V W

.5 1

1u

v

v

1

1

.5

.1 .9 w

(v, u) → (f(u), v)

PBRU(v) PBRV(w)

Page 27: The Computational Complexity of Finding Nash Equilibria

Trees: too many segments

v v w

ut v

v1 v2 v1 v2

v1

v2

KLS (NIPS’01): can “trim” PBR

Incorrect!

W

V

T U

(v,t), (v,u) → (f(u,t), v)

u2

u1

t2

t1

Page 28: The Computational Complexity of Finding Nash Equilibria

Solutions?

• Solution 1 (for paths): algorithm of UAI’01 paper, careful analysis– the number of segments/rectangles in each

PBR is O(n2)– running time O(n3)

• Solution 2 (for paths): can pick a subset of each PBR consisting of O(n) segments– O(n2) running time

Page 29: The Computational Complexity of Finding Nash Equilibria

Extension to trees? V0 V1 V2 Vn-1 Vn

U1

T1

Un-1

T2

U2

Tn-1 Tn

Un

Page 30: The Computational Complexity of Finding Nash Equilibria

Graphical games: hardness results

• NP-hard?– no: total search problem

• PPAD-hard?– yes!– in fact, this is how the hardness result for

4-player games was obtained (Goldberg, Papadimitriou, Aug 2005)

Page 31: The Computational Complexity of Finding Nash Equilibria

Equivalences: GP’05r-player game G NE of G

deg 3 graphical game G’ NE of G’

f g

d2-player game G’ NE of G’

deg d graphical game G NE of G

f g

Page 32: The Computational Complexity of Finding Nash Equilibria

Combining Reductions: GP’05

r-player game G NE of G

9-player game G’ NE of G’

f g

Finding NE in a 4-player game is as hard as

finding NE in a r-player game for any constant r

X4

Page 33: The Computational Complexity of Finding Nash Equilibria

PPAD-hardness: missing details

• 3D-Brouwer is PPAD-complete (Papadimitriou, 1994)

• 4-NASH is as hard as deg 3-GG (Goldberg, Papadimitriou, Aug 2005)

• deg 3-GG is PPAD-complete and hence 4-NASH is PPAD-complete

(Daskalakis,Goldberg, Papadimitriou, Sep 2005) • 3-NASH is PPAD-complete

(Daskalakis, Papadimitriou, Oct 2005, Chen, Deng, Nov 2005)

• 2-NASH is PPAD-complete !!! (Chen, Deng, Dec 2005)

Page 34: The Computational Complexity of Finding Nash Equilibria

NE with special properties

Pure NE:

• easy for constant number of players

• NP-hard for general graphical games – even if max degree = 3– NP vs. PPAD: pure NE may not exist!

• poly-time on trees (KLS algorithm)– also on graphs with bounded treewidth

Page 35: The Computational Complexity of Finding Nash Equilibria

Welfare-Maximizing NE

2 0

0 1

1 0

0 3

Row player:

Column player:

0

1

0 1 0 1

0

1

Nash equilibria: • (0, 0): total payoff is 3• (1, 1): total payoff is 4• (1/4, 2/3): total payoff is 17/12

not all NE are created equal…

Page 36: The Computational Complexity of Finding Nash Equilibria

Algorithms for Good NE

• 2-player games: checking for NE with total payoff > T is NP-hard (Gilboa Zemel 89, Conitzer, Sandholm 03)

• Graphical games: for any algebraic , deg() = n, there is a GG with int payoffs on a path of length O(n) in which in the best NE player 1 plays approximation algorithms for any (E., Goldberg, Goldberg 07)

Page 37: The Computational Complexity of Finding Nash Equilibria

Approximate NE

• -Nash equilibrium: a strategy profile such that noone can gain > by deviating

• Graphical games on trees: poly-time algorithms for any (KLS’01)

• 2-player games ( utilities in [0, 1] ): – PPAD-complete for =O(1/n)– Approximation for constant :

• 0.5 WINE’06 (Dec 2006)• 0.382 ( =1-1/) ACM EC’07 (June 2007)• 0.364 WINE’07 (Dec 2007)• 0.339 WINE’07 (Dec 2007)

Page 38: The Computational Complexity of Finding Nash Equilibria

Conclusions

• Computational aspects of game-theoretic questions are crucial

• Lots of cool open problems– computing NE in graphical games on trees– finding -Nash in 2-player games for small

• A rich set of techniques

• Talk to me if you want to know more…

Page 39: The Computational Complexity of Finding Nash Equilibria

Mixed strategies and payoffs

• Payoff matrices:

• the row player plays a = (a1, …, an)• the column player plays b = (b1, …, bn)• expected payoff of R when playing i: (Ri, *, b)• expected payoff of C when playing i: (C*, j, a)

R11 R12 … R1n

R21 R22 … R2n

…Rn1 Rn2 … Rnn

C11 C12 … C1n

C21 C22 … C2n

…Cn1 Cn2 … Cnn

R: C:

Page 40: The Computational Complexity of Finding Nash Equilibria

• if 1st player’s strategy a supported on I N ai ≠ 0 iff i I

2nd player’s strategy b supported on J N bj ≠ 0 iff j J

• then I BR(b): (b, Ri, *) ≥ (b, Rk, *) for all i I, k N

J BR(a): (a, C*, j) ≥ (a, C*, k) for all j J, k N– LP on variables a1, …, an, b1, …, bn

– solutions to LP ↔ Nash equilibria

• running time: 22kpoly(k)

2 players, k actions: support guessing

linear inequalities!

Page 41: The Computational Complexity of Finding Nash Equilibria

Reminder: 2-player 2-action games

2 0

0 1

1 0

0 3

Row player:

Column player:

0

1

0 1 0 1

0

1

Suppose R plays 1 w.p. r

EP(C) from playing 0: (1-r)*1

EP(C) from playing 1: r*3

1-r > 3r iff r < ¼

Suppose C plays 1 w.p. c

EP(R) from playing 0: (1-c)*2

EP(R) from playing 1: c*1

(1-c)*2 > c iff c < 2/3

1/4 1

r

BR(C)c1

2/3BR(R)

mixed NE: r=1/4, c=2/3

Page 42: The Computational Complexity of Finding Nash Equilibria

Computing PBR on a path

• f(u) = (au+b)/(cu+d)• a, b, c, d are determined by V’s payoffs

U V W

.5 1

1u

v

v

1

1

.5

.1 .9 w

(v, u) → (f(u), v) + “tails”

PBRU(v) PBRV(w)