Stackelberg Strategies Algorithmic Game Theory Course Co RE

Stackelberg Strategies Algorithmic Game Theory Course Co. RE. Lab. - N. T. U. A.

Stackelberg Routing n n In (classic) selfish routing all players act selfishly. In Stackelberg routing there exist players willing to cooperate for social welfare (a fraction of the total players). n Both Selfish and Cooperative players are present. n Leader determines the paths of the coordinated players. n Selfish players (followers) minimize their own cost. Nash Equilibriaare considered as the possible outcomes of the game. A Stackelberg Strategy is an algorithm that allocates paths to coordinated players so as to lead selfish players to a good Nash Equilibrium.

Example: Pigou’s Network c(x)=x One unit of flow is to be routed from s to t s c(x)=1 t

Example: Pigou’s Network c(x)=x One unit of flow is to be routed from s to t s Optimal flow c(x)=1 Flow = ½ t Flow = ½

Example: Pigou’s Network c(x)=x One unit of flow is to be routed from s to t s Optimal flow (Classic) Nash flow x s 1 Flow = 1 t c(x)=1 Flow = ½ t Flow = ½

Example: Pigou’s Network c(x)=x One unit of flow is to be routed from s to t s c(x)=1 Flow = ½ t Flow = ½ Optimal flow Nash flow when a fractionα of (coordinated) playersis sent through the lower edge (Classic) Nash flow x s 1 x Flow = 1 t s 1 Flow = 1 -α t Flow = α

Example: Braess’s Network One unit of flow is to be routed from s to t s x 0 1 1 x t

Example: Braess’s Network s One unit of flow is to be routed from s to t ½ x 1 Optimal flow ½ 0 ½ 1 ½ x t

Example: Braess’s Network s One unit of flow is to be routed from s to t 1 Optimal flow (Classic) Nash flow s x 1 1 0 1 1 ½ x x t ½ 0 ½ 1 ½ x t

Example: Braess’s Network s One unit of flow is to be routed from s to t ½ x 1 ½ 0 ½ 1 ½ x t Optimal flow Nash flow when a fractionα of coordinated playersis sent through the lower edge (Classic) Nash flow s x 1 1 0 1 1 x t s x 1 α/2 0 1 -α 1 x t

Slightly more formal n We will consider single commodity networks. n An instance in such networks: n n Assume that a fraction α of the players are cooperative. A Stackelberg strategy assigns cooperative players to paths. n n They induce a congestion A new game is “created”: n Where

In the “new” game n Selfish players choose paths (as usual), and Nash flows are considered as the possible outcomes of the game (as usual). n On Equilibrium, selfish players induce a congestion n The Price of Anarchy is

The Central Questions n Given a Stackelberg routing instance, we can ask: n n Among all Stackelberg strategies, can we characterize and/or compute the strategy inducing the Stackelberg equilibrium- i. e. , the eq. of minimum total latency? What is the worst-case ratio between the total latency of the Stackelberg eq. and that of the optimal assignment of users to paths?

Finding best strategy: NP-hard Reduction from problem: Given n positive integers is there an satisfying: Given an instance of create an instance of stackelberg routing: n A network G with n+1 parallel links n Demand: n ¼ of the players are followers n Cost functions:

LLF Strategy n Largest Latency First (LLF): n n Compute an optimal configuration Assign coordinated players to optimal paths of largest latency

LLF Strategy n Largest Latency First (LLF): n n Compute an optimal configuration Assign coordinated players to optimal paths of largest latency 4 6 units to be routed. s x 2 x t

LLF Strategy n Largest Latency First (LLF): n n Compute an optimal configuration Assign coordinated players to optimal paths of largest latency 6 units to be routed. 4 Opt routes: n 3 to upper edge n 2 to middle edge n 1 to lower edge x Flow =2 s 2 x Flow =3 t Flow = 1

LLF Strategy n Largest Latency First (LLF): n n Compute an optimal configuration Assign coordinated players to optimal paths of largest latency 6 units to be routed. 4 Opt routes: n 3 to upper edge n 2 to middle edge n 1 to lower edge x Flow =2 s In Nash Flow players are routed: n 4 to middle edge n 2 to lower edge Nash Flow s 2 x Flow =3 Flow = 1 4 x 2 x t Flow=4 Flow=2 t

LLF Strategy n Largest Latency First (LLF): n n Compute an optimal configuration Assign coordinated players to optimal paths of largest latency 6 units to be routed. 4 Opt routes: n 3 to upper edge n 2 to middle edge n 1 to lower edge x Flow =2 s 2 x Flow =3 t Flow = 1

LLF Strategy n Largest Latency First (LLF): n n Compute an optimal configuration Assign coordinated players to optimal paths of largest latency 6 units to be routed. 4 Opt routes: n 3 to upper edge n 2 to middle edge n 1 to lower edge x Flow =2 s LLF controlling ¼ players, e. g. 1½ units, routes: n 1½ to upper edge s Nash Flow 2 x 4 x 2 x Flow =3 t Flow = 1 Flow =1½ Flow=3 t Flow=1½

LLF Strategy n Largest Latency First (LLF): n n Compute an optimal configuration Assign coordinated players to optimal paths of largest latency 6 units to be routed. 4 Opt routes: n 3 to upper edge n 2 to middle edge n 1 to lower edge x Flow =2 s 2 x Flow =3 t Flow = 1

LLF Strategy n Largest Latency First (LLF): n n Compute an optimal configuration Assign coordinated players to optimal paths of largest latency 6 units to be routed. 4 Opt routes: n 3 to upper edge n 2 to middle edge n 1 to lower edge x Flow =2 s LLF controlling ½ players, e. g. 3 units, routes: n 3 to upper edge s Nash Flow 2 x 4 x 2 x Flow =3 t Flow = 1 Flow =3 Flow=2 Flow=1 t

$LLF in parallel links Let α be the fraction of the cooperative players. Theorem$

LLF in parallel links Let α be the fraction of the cooperative players. Theorem 1: In parallel links LLF induces an assignment of cost no more than 1/α times the OPT: Proof by induction: When LLF saturates a link we can restrict to the instance that has: n this link deleted and n fraction of players the “remainders” of the previous instance. Some problems: n n LLF may fail to saturate any link. No problem: Let m be the “heaviest” link. If L is the cost of selfish players and x* is the optimal assignment, it is When a link gets saturated selfish users could use it. No problem: There is an induced equilibrium that doesn’t use it.

Networks with Unbounded Po. A Theorem: Let and. There is an instance such that for any Stackelberg strategy inducing s, it is: Proof: The network is the following The demands are: Cost functions: B=1, C=0 and A is (total flow=1)

LLF in parallel links Let oe denote the optimal congestion Lemma: The proof follows from the variational inequality, similar to the “classic” result.

LLF in parallel links Let oe denote the optimal congestion Lemma: The proof follows from the variational inequality, similar to the “classic” result. Theorem 2: Proof: and It is This is maximized for with maximum value .