• Non ci sono risultati.

Game Theory: Lecture of March 26, 2019

N/A
N/A
Protected

Academic year: 2021

Condividi "Game Theory: Lecture of March 26, 2019"

Copied!
28
0
0

Testo completo

(1)

Game Theory: Lecture of March 26, 2019

Chiara Mocenni

MSc in Game Theory

(2)

In game theory we consider a set of n alternatives available to take a given decision. Formally, we use a vector x of n components (x ∈ R

n

). In particular, the i -th component x

i

will be the only equal 1 if we decided to adopt the strategy labeled by the number i . This vector identifies univocally the decision we have taken.

Specifically:

x

i

=

 1 if the adopted strategy is the i -th

0 if the adopted strategy is not the i -th (1)

We can think to a profit, usually named payoff, as a function

π(x ) : R

n

−→ R that associates to each choice vector x a real

number that quantifies the reward that the individual is obtaining

from that choice.

(3)

In game theory the choices of the various participants are independent one from the other and are driven by the willing of each subject to maximize his own payoff. The individuals are called players.

In the simplest case the game is played by two players.

Let be x ∈ R

n

the vector indicating the choice of the first player in a set of n strategies, while y ∈ R

m

will indicate the choice of the second participant, who has m options available. The payoff functions are the following:

π

1

(x , y ) : R

n

× R

m

−→ R payoff of player 1

π

2

(x , y ) : R

n

× R

m

−→ R payoff of player 2

(4)

Each player is aimed at maximizing his own payoff taking into

account the choices of the opponent. The decisions of the two

players are independent, communicated at the same time, then

there is no time for hesitation or collusion. The mechanism is like

the public auctions. Sealed-bid auctions in which bidders place

their bid in a sealed envelope and simultaneously hand them to the

auctioneer. The envelopes are opened and the individual with the

highest bid wins, getting the bigger payoff (paying the amount bid

for the real first-price auctions).

(5)

Game theory is a tool supporting people to take decisions. It is robust and take advantage by many theorems and results. All these results are based on the assumption that the payoff functions are all known. Actually, to identify correctly a payoff function is a hard job: it is like to quantify the happyness that someone has when doing something!

In the decision theory we can define the payoff functions by means

of estimates of the information cost or by assessing the utility

functions during appropriate interviews to the decision maker.

(6)

The payoff of players 1 and 2 are determined by the couple of strategies (i , j ), where i is the strategy of player 1 and j the one adopted by player 2. We indicate with a

ij

the payoff that player 1 earns if he uses strategy i while his opponent is using strategy j . b

ij

indicates the payoff of player 2: again, b

ij

is the payoff the he gets when he is choosing j and his opponent (player 1) chooses i . It follows that we have introduced two payoff matrices

A, B ∈ R

n×m

, such that A = {a

ij

} and B = {b

ij

}.

(7)

We can use the above matrices to introduce the payoff functions:

π

1

(x , y ) = x

T

Ay π

2

(x , y ) = x

T

By

Indeed, when x and y are vectors defined as above, then x

T

Ay and

x

T

By identify the elements a

ij

of matrix A and b

ij

of matrix B,

assuming the x

i

and y

j

are the only components of x and y equal 1.

(8)

In game theory we can introduce the best reply . These are functions that support the decision mechanism. The best reply of player 1 associates for each strategy of player 2 the set of strategies that player 1 must adopt to maximize his payoff. We indicate with β

1

(j ) and β

2

(i ) the best reply of players. For example:

A =

1 3 0 2 0 4 5 9 3

 =⇒

β

1

(1) = {3}

β

1

(2) = {3}

β

1

(3) = {2}

(9)

Actually, considering that columns indicate the strategies of player

2, we fix the column relative to the strategy j ; then we look for the

maximum element along this column. The best reply will be the

strategy i (row) where is placed the maximum element.

(10)

One more exemple:

A =

3 6 2

3 6 1

5 1 0

 =⇒

β

1

(1) = {3}

β

1

(2) = {1, 2}

β

1

(3) = {1}

In this case, if player 2 adopts strategy j = 2, then player 1 is

indifferent when using strategies i = 1 and i = 2, since they

guarantee the same payoff. The best reply function of player 2 is

similar to the one above, except that we have to exchange rows

(strategies of player 1) with columns (strategies of player 2).

(11)

Let’s consider the following game:

A =

3 6 20

3 6 1

5 1 0

 =⇒

β

1

(1) = {3}

β

1

(2) = {1, 2}

β

1

(3) = {1}

B =

4 8 2

1 1 10

3 5 5

 =⇒

β

2

(1) = {2}

β

2

(2) = {3}

β

2

(3) = {2, 3}

(12)

Selfishness of players could bring them to take a decision that is non optimal for the game. The idea to overcome this problem is to look for a compromise, that is take a decision that is good for both players. We must reach some equilibrium between the two parts.

Mathematically, we can define an equilibrium in the following way:

(i

, j

) is an equilibrium ⇐⇒

 i

∈ β

1

(j

)

j

∈ β

2

(i

) (2)

Thsi idea has been formalized by the Nobel prize Nobel J.F.Nash

who gave the name to the Nash equilibrium.

(13)

Studying the best reply functions introduced above, we obtain that (i

, j

) = (1, 2). Indeed

i

= 1 ∈ β

1

(j

= 2) = {1, 2}

and

j

= 2 ∈ β

2

(i

= 1) = {2}.

If player 2 could be sure that his opponent will choose i = 1, then

he will choose j = 2, thus earning the maximum payoff available

reachable when i = 1. Similarly, player 1, knowing that player 2 will

choose j = 2, will be able to choose i = 1 and i = 2, because both

strategies will ensure a maximum payoff. If players will be allowed

to change strategy unilaterally from i = 1 and j = 2, they will not

have reason for doing so. For this reason we talk of equilibrium.

(14)

The Nash equilibrium we have found previously is non strict,

indeed player 1 has two options to maximize his payoff with

respect to the choice j = 2. When there is no ambiguity we can

talk of strict equilibrium. Anyway, there is no guarantee that,

given a game, only one Nash equilibrium exists. They may be more

than one, or even no one, although we will see that one Nash

equilibrium always exists in the set of mixed strategies (not yet

introduced here).

(15)

There is also another method to find Nash equilibrium, namely the bi-matrix method:

player 2

strat. 1 strat. 2

strat. 1 (5, 2) ← (1, 1)

player 1 ↑ ↓

strat. 2 (1, 7) ← (2, 3)

When we find a strategy with only pointing in arrows, then this is

a strict Nash equilibrium. In this case we have that the number of

arrows corresponds to the maximum number of available arrows (2

for games with 2 participants). In this particular case strategy

(1, 1) is a strict Nash equilibrium.

(16)

Another example:

player 2

strat. 1 strat. 2

strat. 1 (4, 20) ← (3, 15)

player 1 l ↑

strat. 2 (4, 7) → (2, 13) The double sided arrow indicates that player 1 is indifferent

between choosing his strategy when the opponent chooses strategy

1 (he will earn the same payoff in any case). We see that the

couple of strategies (1, 1) is a Nash equilibrium because it has 2

pointing in arrows, but in this case it is non strict, indeed there is

also one pointing out arrow.

(17)

Let’s define I = {1, 2, ...n} the set of players, where n is a positive integer number and S

i

= {1, 2, ...m

i

} the set of pure strategies for each player i ∈ I , where m

i

≥ 2. Player i is characterized by a vector x

i

∈ R

mi

such that:

x

ij

=

 1 if strategy adopted by player i is j

0 if strategy adopted by player i is not j . (3)

The set of vectors {x

1

, x

2

, . . . , x

n

} is named strategy profile.

(18)

For each strategy profile and each player i ∈ I , let’s

π

i

(x

1

, x

2

, . . . , x

n

) ∈ R the payoff associated to player i. In game theory, the payoff of one player is a function that incorporates the value of its outcome for the choices of all the other players involved in the game.

In the particular case of a game played by two players, the payoff

functions , π

1

(x

1

, x

2

) and π

2

(x

1

, x

2

), can be represented by two

matrices A and B like R

m1×m2

; notice that in this case we used x

1

instead of x , and x

2

instead of y .

(19)

A mixed strategy for player i is a probability distribution in the set of its m

i

pure strategies. Formally:

x

ij

= Probability for which player i will use strategy j (4) Following this definition we can interpret a pure strategy as a particular case of mixed strategy. Saying that player i uses the pure strategy j ” is equivalent to say that “player i uses strategy j with 100% of probability”, that “player i uses strategy j 100% of the times”.

An example of mixed strategy is the following:

x

i

= 

0.4, 0.1, 0, 0.5  , (5)

where player i has m

i

= 4 strategies available.

(20)

Since x

ij

( j = 1, 2, ..., m

i

) represent probabilities, then vector x

i

∈ R

mi

belongs to the simplex ∆

i

in the m

i

-space:

i

=

x

i

∈ R

m+i

:

mi

X

j =1

x

ij

= 1

. (6)

Figura: Il simplesso ∆

i

con m = 3.

Figura: Geometric representation of the simplex for m = 3.

(21)

Simplex ∆

i

of player i has dimension m

i

− 1.

The vertices of ∆

i

are the versors of the m

i

-space, indicated by e

i1

= (1, 0, ..., 0), e

i2

= (0, 1, ..., 0), up to e

imi

= (0, 0, ..., 1).

Each vertex e

ij

represents the mixed strategy i who assigns probability 1 to the j -th pure strategy. If player i uses pure strategy j , then x

i

= e

ij

.

The simplex of mixed strategies ∆

i

is a convex hull of the vertices, and each x

i

∈ ∆

i

is a convex combination of the versors, said pure strategies e

ij

:

x

i

=

mi

X

j =1

x

ij

e

ij

. (7)

(22)

The interior of ∆

i

is defined as:

int(∆

i

) = {x

i

∈ ∆

i

: x

ij

> 0 ∀j} (8) The mixed strategies of this subset are called internal or fully mixed and they associate positive probabilities to all pure strategies of the players.

The set of not internal strategies of ∆

i

is called boundary of

i

, defined as:

bd (∆

i

) = {x

i

∈ ∆

i

: x

i

∈ int(∆ /

i

)} (9) The boundary of ∆

i

is the set of strategies x

i

for which at least one strategy j is played with null probability, i.e.

∃j ∈ {1, . . . , m

i

} : x

ij

= 0

(23)

A mixed strategy profile X = {x

1

, x

2

, ..., x

n

} is a point in the space of mixed strategies of the game:

Θ = ×

i ∈I

i

. (10)

Θ is the product of n simple units ∆

i

, where each of them is a set (m

i

− 1)-dimensional for each player i ∈ I , the set is a

(m − n)-dimensional polyhedron in R

m

, where

m = m

1

+ m

2

+ ... + m

n

is the total number of pure strategies of the game. A strategy profile X is said internal if each component x

i

is internal. The subset of such profiles is defined as:

int(Θ) = ×

i ∈I

int(∆

i

). (11)

Similarly, the boundary of Θ, bd (Θ), is the set of non internal

profiles x ∈ Θ.

(24)

Introducing the mixed strategies implies a revision of the concept of payoff. Indeed, since mixed strategies are probability

distributions, then the payoff is the expected value of the payoffs that can be earned by means of pure strategies. This is true for each game with n players.

Let’s analyze the simple case of 2 players. Let A ∈ R

m1×m2

be the payoff matrix of player 1. Suppose that player 1 decides to use the pure strategy i , while player 2 plays the mixed strategy x

2

. In average, player 1 will get the following quantity:

u

1i

=

m2

X

j =1

a

ij

x

2j

,

which is the sum of payoffs a

ij

weighted by the probabilities x

2j

.

(25)

If player 1 opts for the mixed strategy x

1

, he will earn in average:

m1

X

i =1

x

1i

u

1i

=

m1

X

i =1 m2

X

j =1

x

1i

a

ij

x

2j

,

which represents the sum of the average payoffs u

1i

weighted by the probabilities x

1i

. The double sum allows to rewrite the payoff in the matricial form:

m1

X

i =1 m2

X

j =1

x

1i

a

ij

x

2j

= x

1T

Ax

2

,

making the previous equation in line with the definition of payoff given for the pure strategies:

π

1

(x

1

, x

2

) = x

1T

Ax

2

. (12) Similarly, the payoff of the second player is

π

2

(x

1

, x

2

) = x

1T

Bx

2

.

(26)

Let’s now consider the games with two strategis. The payoffs are represented by the following matrices:

A =

 a

1

b

1

c

1

d

1



, B =

 a

2

c

2

b

2

d

2

 . and the corresponding bi-matrix is:

G 2

s1 s2

s1 (a

1

, a

2

) (b

1

, c

2

) G 1

s2 (c

1

, b

2

) (d

1

, d

2

)

(27)

Consider the payoff functions obtained by matrices A and B, and using as vectors x = [x

1

1 − x

1

]

T

and y = [y

1

1 − y

1

]

T

:

π

1

(x , y ) = x

1

[(a

1

− c

1

+ d

1

− b

1

)y

1

+ b

1

− d

1

] + (c

1

− d

1

)y

1

+ d

1

π

2

(x , y ) = y

1

[(a

2

− c

2

+ d

2

− b

2

)x

1

+ b

2

− d

2

] + (c

2

− d

2

)x

1

+ d

2

(13) Calculating the derivatives

∂π∂x1

1

and

∂π∂y2

1

and setting them equal to 0, we obtain the following solutions:

x

1

= (d

2

− b

2

)

(d

2

− b

2

) + (a

2

− c

2

) , x

2

= (a

2

− c

2

) (d

2

− b

2

) + (a

2

− c

2

) y

1

= (d

1

− b

1

)

(d

1

− b

1

) + (a

1

− c

1

) , y

2

= (a

1

− c

1

) (d

1

− b

1

) + (a

1

− c

1

)

(14)

(28)

These solutions are valid if they belong to the simplex ∆ of dimension 2 and possibly represent the mixed equilibrium of the game.

The pure equilibria can be obtained by means of the bi-matrix

method.

Riferimenti

Documenti correlati

Atopy patch tests are useful to predict oral tolerance in children with gastrointes- tinal symptoms related to non-IgE-mediated cow’s milk allergy.. Mansueto P, Montalto G, Pacor ML,

The intensity and distribution of HA staining in the myenteric plexus of sham-operated animals was not significantly different with respect to control preparations and was only

Una risposta sempre più positiva può derivare dal controllo dei notevoli consumi energetici per la climatizzazione degli edifici, che rappresentano attualmente ancora

In a round-robin tournament with n players, each player plays every other player exactly once, and each match results in a win for one player and a loss for the other.. At the end

(American Mathematical Monthly, Vol.118, April 2011) Proposed by Kirk Bresniker and Stan Wagon (USA).. Alice and Bob play a

 when “real” status (the correction) of the past is received from server.  compare it with corresponding

Assuming the equivalence of the payoff functions of the pure strategies is equivalent to assume that the payoff bi-matrix of the second player is the transpose of the payoff matrix

Infinite population of equal individuals (players or replicators) Each individual exhibits a certain phenotype (= it is.. preprogrammed to play a