Game Theory: Lecture of March 26, 2019
Chiara Mocenni
MSc in Game Theory
In game theory we consider a set of n alternatives available to take a given decision. Formally, we use a vector x of n components (x ∈ R
n). In particular, the i -th component x
iwill be the only equal 1 if we decided to adopt the strategy labeled by the number i . This vector identifies univocally the decision we have taken.
Specifically:
x
i=
1 if the adopted strategy is the i -th
0 if the adopted strategy is not the i -th (1)
We can think to a profit, usually named payoff, as a function
π(x ) : R
n−→ R that associates to each choice vector x a real
number that quantifies the reward that the individual is obtaining
from that choice.
In game theory the choices of the various participants are independent one from the other and are driven by the willing of each subject to maximize his own payoff. The individuals are called players.
In the simplest case the game is played by two players.
Let be x ∈ R
nthe vector indicating the choice of the first player in a set of n strategies, while y ∈ R
mwill indicate the choice of the second participant, who has m options available. The payoff functions are the following:
π
1(x , y ) : R
n× R
m−→ R payoff of player 1
π
2(x , y ) : R
n× R
m−→ R payoff of player 2
Each player is aimed at maximizing his own payoff taking into
account the choices of the opponent. The decisions of the two
players are independent, communicated at the same time, then
there is no time for hesitation or collusion. The mechanism is like
the public auctions. Sealed-bid auctions in which bidders place
their bid in a sealed envelope and simultaneously hand them to the
auctioneer. The envelopes are opened and the individual with the
highest bid wins, getting the bigger payoff (paying the amount bid
for the real first-price auctions).
Game theory is a tool supporting people to take decisions. It is robust and take advantage by many theorems and results. All these results are based on the assumption that the payoff functions are all known. Actually, to identify correctly a payoff function is a hard job: it is like to quantify the happyness that someone has when doing something!
In the decision theory we can define the payoff functions by means
of estimates of the information cost or by assessing the utility
functions during appropriate interviews to the decision maker.
The payoff of players 1 and 2 are determined by the couple of strategies (i , j ), where i is the strategy of player 1 and j the one adopted by player 2. We indicate with a
ijthe payoff that player 1 earns if he uses strategy i while his opponent is using strategy j . b
ijindicates the payoff of player 2: again, b
ijis the payoff the he gets when he is choosing j and his opponent (player 1) chooses i . It follows that we have introduced two payoff matrices
A, B ∈ R
n×m, such that A = {a
ij} and B = {b
ij}.
We can use the above matrices to introduce the payoff functions:
π
1(x , y ) = x
TAy π
2(x , y ) = x
TBy
Indeed, when x and y are vectors defined as above, then x
TAy and
x
TBy identify the elements a
ijof matrix A and b
ijof matrix B,
assuming the x
iand y
jare the only components of x and y equal 1.
In game theory we can introduce the best reply . These are functions that support the decision mechanism. The best reply of player 1 associates for each strategy of player 2 the set of strategies that player 1 must adopt to maximize his payoff. We indicate with β
1(j ) and β
2(i ) the best reply of players. For example:
A =
1 3 0 2 0 4 5 9 3
=⇒
β
1(1) = {3}
β
1(2) = {3}
β
1(3) = {2}
Actually, considering that columns indicate the strategies of player
2, we fix the column relative to the strategy j ; then we look for the
maximum element along this column. The best reply will be the
strategy i (row) where is placed the maximum element.
One more exemple:
A =
3 6 2
3 6 1
5 1 0
=⇒
β
1(1) = {3}
β
1(2) = {1, 2}
β
1(3) = {1}
In this case, if player 2 adopts strategy j = 2, then player 1 is
indifferent when using strategies i = 1 and i = 2, since they
guarantee the same payoff. The best reply function of player 2 is
similar to the one above, except that we have to exchange rows
(strategies of player 1) with columns (strategies of player 2).
Let’s consider the following game:
A =
3 6 20
3 6 1
5 1 0
=⇒
β
1(1) = {3}
β
1(2) = {1, 2}
β
1(3) = {1}
B =
4 8 2
1 1 10
3 5 5
=⇒
β
2(1) = {2}
β
2(2) = {3}
β
2(3) = {2, 3}
Selfishness of players could bring them to take a decision that is non optimal for the game. The idea to overcome this problem is to look for a compromise, that is take a decision that is good for both players. We must reach some equilibrium between the two parts.
Mathematically, we can define an equilibrium in the following way:
(i
∗, j
∗) is an equilibrium ⇐⇒
i
∗∈ β
1(j
∗)
j
∗∈ β
2(i
∗) (2)
Thsi idea has been formalized by the Nobel prize Nobel J.F.Nash
who gave the name to the Nash equilibrium.
Studying the best reply functions introduced above, we obtain that (i
∗, j
∗) = (1, 2). Indeed
i
∗= 1 ∈ β
1(j
∗= 2) = {1, 2}
and
j
∗= 2 ∈ β
2(i
∗= 1) = {2}.
If player 2 could be sure that his opponent will choose i = 1, then
he will choose j = 2, thus earning the maximum payoff available
reachable when i = 1. Similarly, player 1, knowing that player 2 will
choose j = 2, will be able to choose i = 1 and i = 2, because both
strategies will ensure a maximum payoff. If players will be allowed
to change strategy unilaterally from i = 1 and j = 2, they will not
have reason for doing so. For this reason we talk of equilibrium.
The Nash equilibrium we have found previously is non strict,
indeed player 1 has two options to maximize his payoff with
respect to the choice j = 2. When there is no ambiguity we can
talk of strict equilibrium. Anyway, there is no guarantee that,
given a game, only one Nash equilibrium exists. They may be more
than one, or even no one, although we will see that one Nash
equilibrium always exists in the set of mixed strategies (not yet
introduced here).
There is also another method to find Nash equilibrium, namely the bi-matrix method:
player 2
strat. 1 strat. 2
strat. 1 (5, 2) ← (1, 1)
player 1 ↑ ↓
strat. 2 (1, 7) ← (2, 3)
When we find a strategy with only pointing in arrows, then this is
a strict Nash equilibrium. In this case we have that the number of
arrows corresponds to the maximum number of available arrows (2
for games with 2 participants). In this particular case strategy
(1, 1) is a strict Nash equilibrium.
Another example:
player 2
strat. 1 strat. 2
strat. 1 (4, 20) ← (3, 15)
player 1 l ↑
strat. 2 (4, 7) → (2, 13) The double sided arrow indicates that player 1 is indifferent
between choosing his strategy when the opponent chooses strategy
1 (he will earn the same payoff in any case). We see that the
couple of strategies (1, 1) is a Nash equilibrium because it has 2
pointing in arrows, but in this case it is non strict, indeed there is
also one pointing out arrow.
Let’s define I = {1, 2, ...n} the set of players, where n is a positive integer number and S
i= {1, 2, ...m
i} the set of pure strategies for each player i ∈ I , where m
i≥ 2. Player i is characterized by a vector x
i∈ R
misuch that:
x
ij=
1 if strategy adopted by player i is j
0 if strategy adopted by player i is not j . (3)
The set of vectors {x
1, x
2, . . . , x
n} is named strategy profile.
For each strategy profile and each player i ∈ I , let’s
π
i(x
1, x
2, . . . , x
n) ∈ R the payoff associated to player i. In game theory, the payoff of one player is a function that incorporates the value of its outcome for the choices of all the other players involved in the game.
In the particular case of a game played by two players, the payoff
functions , π
1(x
1, x
2) and π
2(x
1, x
2), can be represented by two
matrices A and B like R
m1×m2; notice that in this case we used x
1instead of x , and x
2instead of y .
A mixed strategy for player i is a probability distribution in the set of its m
ipure strategies. Formally:
x
ij= Probability for which player i will use strategy j (4) Following this definition we can interpret a pure strategy as a particular case of mixed strategy. Saying that player i uses the pure strategy j ” is equivalent to say that “player i uses strategy j with 100% of probability”, that “player i uses strategy j 100% of the times”.
An example of mixed strategy is the following:
x
i=
0.4, 0.1, 0, 0.5 , (5)
where player i has m
i= 4 strategies available.
Since x
ij( j = 1, 2, ..., m
i) represent probabilities, then vector x
i∈ R
mibelongs to the simplex ∆
iin the m
i-space:
∆
i=
x
i∈ R
m+i:
mi
X
j =1
x
ij= 1
. (6)
Figura: Il simplesso ∆
icon m = 3.
Figura: Geometric representation of the simplex for m = 3.
Simplex ∆
iof player i has dimension m
i− 1.
The vertices of ∆
iare the versors of the m
i-space, indicated by e
i1= (1, 0, ..., 0), e
i2= (0, 1, ..., 0), up to e
imi= (0, 0, ..., 1).
Each vertex e
ijrepresents the mixed strategy i who assigns probability 1 to the j -th pure strategy. If player i uses pure strategy j , then x
i= e
ij.
The simplex of mixed strategies ∆
iis a convex hull of the vertices, and each x
i∈ ∆
iis a convex combination of the versors, said pure strategies e
ij:
x
i=
mi
X
j =1
x
ije
ij. (7)
The interior of ∆
iis defined as:
int(∆
i) = {x
i∈ ∆
i: x
ij> 0 ∀j} (8) The mixed strategies of this subset are called internal or fully mixed and they associate positive probabilities to all pure strategies of the players.
The set of not internal strategies of ∆
iis called boundary of
∆
i, defined as:
bd (∆
i) = {x
i∈ ∆
i: x
i∈ int(∆ /
i)} (9) The boundary of ∆
iis the set of strategies x
ifor which at least one strategy j is played with null probability, i.e.
∃j ∈ {1, . . . , m
i} : x
ij= 0
A mixed strategy profile X = {x
1, x
2, ..., x
n} is a point in the space of mixed strategies of the game:
Θ = ×
i ∈I∆
i. (10)
Θ is the product of n simple units ∆
i, where each of them is a set (m
i− 1)-dimensional for each player i ∈ I , the set is a
(m − n)-dimensional polyhedron in R
m, where
m = m
1+ m
2+ ... + m
nis the total number of pure strategies of the game. A strategy profile X is said internal if each component x
iis internal. The subset of such profiles is defined as:
int(Θ) = ×
i ∈Iint(∆
i). (11)
Similarly, the boundary of Θ, bd (Θ), is the set of non internal
profiles x ∈ Θ.
Introducing the mixed strategies implies a revision of the concept of payoff. Indeed, since mixed strategies are probability
distributions, then the payoff is the expected value of the payoffs that can be earned by means of pure strategies. This is true for each game with n players.
Let’s analyze the simple case of 2 players. Let A ∈ R
m1×m2be the payoff matrix of player 1. Suppose that player 1 decides to use the pure strategy i , while player 2 plays the mixed strategy x
2. In average, player 1 will get the following quantity:
u
1i=
m2
X
j =1
a
ijx
2j,
which is the sum of payoffs a
ijweighted by the probabilities x
2j.
If player 1 opts for the mixed strategy x
1, he will earn in average:
m1
X
i =1
x
1iu
1i=
m1
X
i =1 m2
X
j =1
x
1ia
ijx
2j,
which represents the sum of the average payoffs u
1iweighted by the probabilities x
1i. The double sum allows to rewrite the payoff in the matricial form:
m1
X
i =1 m2
X
j =1
x
1ia
ijx
2j= x
1TAx
2,
making the previous equation in line with the definition of payoff given for the pure strategies:
π
1(x
1, x
2) = x
1TAx
2. (12) Similarly, the payoff of the second player is
π
2(x
1, x
2) = x
1TBx
2.
Let’s now consider the games with two strategis. The payoffs are represented by the following matrices:
A =
a
1b
1c
1d
1, B =
a
2c
2b
2d
2. and the corresponding bi-matrix is:
G 2
s1 s2
s1 (a
1, a
2) (b
1, c
2) G 1
s2 (c
1, b
2) (d
1, d
2)
Consider the payoff functions obtained by matrices A and B, and using as vectors x = [x
11 − x
1]
Tand y = [y
11 − y
1]
T:
π
1(x , y ) = x
1[(a
1− c
1+ d
1− b
1)y
1+ b
1− d
1] + (c
1− d
1)y
1+ d
1π
2(x , y ) = y
1[(a
2− c
2+ d
2− b
2)x
1+ b
2− d
2] + (c
2− d
2)x
1+ d
2(13) Calculating the derivatives
∂π∂x11
and
∂π∂y21