be the corresponding strategy vector. Let $Q^N$ be the normalized occupation measure associated with $u^N$. More precisely, $Q^N$ is the $\mathcal{P}_2(\mathcal{Z})$-valued random variable determined by setting, for $B \in \mathcal{B}(\mathcal{X})$, $R \in \mathcal{B}(\mathcal{R}_2)$, $D \in \mathcal{B}(\mathcal{W})$,
$$Q^N_\omega(B \times R \times D) \doteq \frac{1}{N}\sum_{i=1}^{N} \delta_{X^N_i(\cdot,\omega)}(B)\cdot \delta_{\rho^{N,i}_\omega}(R)\cdot \delta_{W^N_i(\cdot,\omega)}(D), \qquad \omega \in \Omega_N, \tag{5.1}$$
where $(X^N_1, \ldots, X^N_N)$ is the solution of the system of equations (3.1) under strategy vector $u^N$, and $\rho^{N,i}$ is the relaxed control associated with individual strategy $u^N_i$, $i \in \{1, \ldots, N\}$.
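As a concrete, purely illustrative picture of (5.1): the occupation measure is the empirical measure that puts mass $1/N$ on each triple of one player's trajectory, relaxed control and driving noise. A minimal numerical sketch, where the function name and the discretized toy data are assumptions of this illustration, not objects from the paper:

```python
import numpy as np

def occupation_measure(paths, controls, noises):
    # Empirical-measure view of (5.1): one atom of mass 1/N per triple
    # (X_i, rho^{N,i}, W_i).  Atoms are returned with uniform weights.
    N = len(paths)
    atoms = list(zip(paths, controls, noises))
    weights = np.full(N, 1.0 / N)
    return atoms, weights

# Toy data: N = 3 "players", each with a time-discretized path,
# control path and noise path of 5 steps.
rng = np.random.default_rng(0)
N, steps = 3, 5
paths = [rng.standard_normal(steps) for _ in range(N)]
controls = [rng.standard_normal(steps) for _ in range(N)]
noises = [rng.standard_normal(steps) for _ in range(N)]

atoms, weights = occupation_measure(paths, controls, noises)
print(len(atoms), float(weights.sum()))  # 3 atoms, total mass 1.0
```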
Convergence results will be obtained under the hypothesis that
$$(\mathrm{T})\qquad \exists\,\delta_0 > 0:\quad \sup_{N\in\mathbb{N}} \mathbb{E}_N\!\left[\frac{1}{N}\sum_{i=1}^{N}\left(\big|\xi^N_i\big|^{2+\delta_0} + \int_0^T \big|u^N_i(t)\big|^{2+\delta_0}\,dt\right)\right] < \infty.$$
Whenever (T) holds, we will, as we may, suppose that $\delta_0 \in (0, 1 \wedge T]$.
REMARK 5.1. Condition (T) is automatically satisfied if the action space is compact and the initial states, that is, the random variables $\xi^N_i$, $N\in\mathbb{N}$, $i\in\{1,\ldots,N\}$, are uniformly bounded.
LEMMA 5.1. If condition (T) holds, then the family $(\mathbb{P}_N\circ(Q^N)^{-1})_{N\in\mathbb{N}}$ is pre-compact in $\mathcal{P}(\mathcal{P}_2(\mathcal{Z}))$.
PROOF. We verify that condition (T) implies the pre-compactness of the family $(\mathbb{P}_N \circ (Q^N)^{-1})_{N\in\mathbb{N}}$ by using a suitable tightness function on $\mathcal{P}_2(\mathcal{Z})$. For a function $\psi$ on $[0,T]$ with values in $\mathbb{R}^d$ or $\mathbb{R}^{d_1}$, let $w_\psi(\cdot, T)$ denote the modulus of continuity of $\psi$ on $[0,T]$, that is, the function
$$[0,\infty) \ni h \mapsto w_\psi(h, T) \doteq \sup_{t,s\in[0,T]:\,|t-s|\le h} \big|\psi(t) - \psi(s)\big| \in [0,\infty].$$
If $\psi$ is continuous, then the modulus of continuity of $\psi$ takes values in $[0,\infty)$.
Then $g$ is a tightness function on $\mathcal{P}_2(\mathcal{Z})$; see Appendix C.2. It is therefore enough to check that condition (T) entails $\sup_{N\in\mathbb{N}} \mathbb{E}_N[g(Q^N)] < \infty$. This follows from the definition of $Q^N$, Lemma 3.2 and condition (T),
combined with the monotonicity of $h \mapsto h^{-\alpha}$ and Markov's inequality (as well as Jensen's inequality). To obtain an upper bound for the above sums that does not depend on $N$, we employ estimates on the moments of the modulus of continuity of Itô processes; cf. Fischer and Nappo (2010) and the references therein. Since $W^N_1, \ldots, W^N_N$ are standard $d_1$-dimensional Wiener processes, we have by Lemma 3 of that paper and Hölder's inequality that there exists a finite constant $\bar{C}_{p,d_1}$ depending only on $p$ and $d_1$ such that, for every $i\in\{1,\ldots,N\}$, every $k\in\mathbb{N}$ with $k\ge 1/T$,
Thanks to assumption (A3), Lemma 3.2 and condition (T), we obtain a corresponding uniform bound. On the other hand, by Hölder's inequality and, thanks to assumption (A3), Lemma 3.1 and condition (T), the remaining terms are bounded uniformly in $N$ as well. Recall that $\alpha = \frac{\delta_0}{2(8+\delta_0)}$ and $p = 2 + \delta_0/2$. It follows that the expression above is bounded by a finite constant independent of $N$, where the infinite sum on the right-hand side has a finite limit since $p/2 - \alpha p = (8+2\delta_0)/(8+\delta_0) > 1$.
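Since the exponents enter only through this last inequality, it may help to record the arithmetic explicitly; with $\alpha = \delta_0/(2(8+\delta_0))$ and $p = 2 + \delta_0/2$, a direct computation gives:

```latex
\begin{align*}
\frac{p}{2} - \alpha p
  &= p\Big(\frac{1}{2} - \frac{\delta_0}{2(8+\delta_0)}\Big)
   = \Big(2 + \frac{\delta_0}{2}\Big)\cdot\frac{(8+\delta_0)-\delta_0}{2(8+\delta_0)} \\
  &= \frac{4+\delta_0}{2}\cdot\frac{8}{2(8+\delta_0)}
   = \frac{2(4+\delta_0)}{8+\delta_0}
   = \frac{8+2\delta_0}{8+\delta_0} \;>\; 1
  \qquad\text{since } \delta_0 > 0.
\end{align*}
```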
Below, we will use the symbol $\mathcal{I}$ to indicate the index set of a (convergent) subsequence; thus $\mathcal{I}$ is a subset of $\mathbb{N}$ with the natural ordering and $\#\mathcal{I} = \infty$.
LEMMA 5.2. Suppose that $(\mathbb{P}_n \circ (\xi^n_{i^n_*})^{-1})_{n\in\mathcal{I}}$ converges in $\mathcal{P}_2(\mathbb{R}^d)$ to some $\bar\nu \in \mathcal{P}_2(\mathbb{R}^d)$, where, for each $n\in\mathcal{I}$, $i^n_* \in \{1, \ldots, n\}$. Then there exists a sequence $(\bar\xi_n)_{n\in\mathcal{I}}$ of $\mathbb{R}^d$-valued random variables such that the following hold:

(i) for every $n\in\mathcal{I}$, $\bar\xi_n$ is defined on $(\Omega_n, \mathcal{F}_n)$, measurable with respect to $\sigma(\xi^n_{i^n_*}, \vartheta^n_{i^n_*})$;
(ii) $\mathbb{P}_n \circ (\bar\xi_n)^{-1} = \bar\nu$ for every $n\in\mathcal{I}$;
(iii) $\mathbb{E}_n[|\xi^n_{i^n_*} - \bar\xi_n|^2] \to 0$ as $n\to\infty$.
PROOF. Let $n\in\mathcal{I}$, and set $\nu_n \doteq \mathbb{P}_n \circ (\xi^n_{i^n_*})^{-1}$. By definition of the square Wasserstein metric,
$$d_2(\nu_n, \bar\nu)^2 = \inf_{\alpha\in\mathcal{P}(\mathbb{R}^d\times\mathbb{R}^d):\,[\alpha]_1=\nu_n,\,[\alpha]_2=\bar\nu}\ \int_{\mathbb{R}^d\times\mathbb{R}^d} |x - \tilde{x}|^2\,\alpha(dx, d\tilde{x}).$$
The infimum in the above equation is attained; see, for instance, Theorem 1.3 (Kantorovich's theorem) in Villani (2003), pages 19–20. Thus, there exists $\alpha^*_n \in \mathcal{P}(\mathbb{R}^d\times\mathbb{R}^d)$ such that $[\alpha^*_n]_1 = \nu_n$, $[\alpha^*_n]_2 = \bar\nu$ and
$$d_2(\nu_n, \bar\nu)^2 = \int_{\mathbb{R}^d\times\mathbb{R}^d} |x - \tilde{x}|^2\,\alpha^*_n(dx, d\tilde{x}).$$
Recall that $\vartheta^n_1, \ldots, \vartheta^n_n$ are independent $\mathcal{F}^n_0$-measurable random variables which are uniformly distributed on $[0,1]$ and independent of the $\sigma$-algebra generated by $\xi^n_1, \ldots, \xi^n_n, W^n_1, \ldots, W^n_n$. By Theorem 6.10 in Kallenberg (2001), page 112, on measurable transfers, there exists a measurable function $\varphi_n: \mathbb{R}^d\times[0,1]\to\mathbb{R}^d$ such that
$$\mathbb{P}_n \circ \big(\xi^n_{i^n_*}, \varphi_n\big(\xi^n_{i^n_*}, \vartheta^n_{i^n_*}\big)\big)^{-1} = \alpha^*_n.$$
Set $\bar\xi_n \doteq \varphi_n(\xi^n_{i^n_*}, \vartheta^n_{i^n_*})$. Then $\bar\xi_n$ is $\sigma(\xi^n_{i^n_*}, \vartheta^n_{i^n_*})$-measurable, $\mathbb{P}_n \circ (\bar\xi_n)^{-1} = \bar\nu$, and $\mathbb{E}_n[|\xi^n_{i^n_*} - \bar\xi_n|^2] = d_2(\nu_n, \bar\nu)^2$, which tends to zero as $n\to\infty$.
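In one dimension the optimal coupling appearing in this proof has an explicit form: the monotone (quantile) coupling attains the squared Wasserstein distance. A small numerical sketch for empirical measures with equally many atoms; the function name and the toy data are illustrative assumptions, not from the paper:

```python
import numpy as np

def w2_squared_1d(xs, ys):
    # For two empirical measures on R with the same number of atoms,
    # sorting and pairing realizes the infimum over couplings alpha
    # with the prescribed marginals (monotone coupling, optimal in 1D).
    xs = np.sort(np.asarray(xs, dtype=float))
    ys = np.sort(np.asarray(ys, dtype=float))
    return float(np.mean((xs - ys) ** 2))

# Atoms of nu_n and nu_bar; the optimal pairing is 1<->1.5, 2<->2.5, 3<->3.5.
print(w2_squared_1d([3.0, 1.0, 2.0], [1.5, 2.5, 3.5]))  # 0.25
```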
LEMMA 5.3. Grant condition (T). Let $(Q^n)_{n\in\mathcal{I}}$ be a subsequence that converges in distribution to some $\mathcal{P}_2(\mathcal{Z})$-valued random variable $Q$ defined on some probability space $(\Omega, \mathcal{F}, \mathbb{P})$. Set
$$\mu_\omega(t) \doteq Q_\omega \circ \hat{X}(t)^{-1}, \qquad t\in[0,T],\ \omega\in\Omega.$$
Then for $\mathbb{P}$-almost every $\omega\in\Omega$, $\mu_\omega \in \mathcal{M}_2$ and $Q_\omega$ is a solution of equation (4.2) with flow of measures $\mu_\omega$. Moreover,
$$\liminf_{\mathcal{I}\ni n\to\infty}\ \frac{1}{n}\sum_{i=1}^{n} J^n_i\big(u^n\big)\ \ge\ \int \hat{J}\big(\mu_\omega(0), Q_\omega, \mu_\omega\big)\,\mathbb{P}(d\omega).$$
PROOF. By Lemma 5.1, $(\mathbb{P}_N \circ (Q^N)^{-1})_{N\in\mathbb{N}}$ is pre-compact in $\mathcal{P}(\mathcal{P}_2(\mathcal{Z}))$. Let $(Q^n)_{n\in\mathcal{I}}$ be a subsequence that converges in distribution to some $\mathcal{P}_2(\mathcal{Z})$-valued random variable $Q$, defined on some probability space $(\Omega, \mathcal{F}, \mathbb{P})$. Set $\mu_\omega(t) \doteq Q_\omega \circ \hat{X}(t)^{-1}$, $t\in[0,T]$, $\omega\in\Omega$. Since $Q_\omega \in \mathcal{P}_2(\mathcal{Z})$ for every $\omega\in\Omega$, we have $\mu_\omega\in\mathcal{M}_2$ for every $\omega\in\Omega$; cf. Remark 4.2 above. By construction, $\hat{W}(0) = 0$ $Q^n_\omega$-almost surely for $\mathbb{P}_n$-almost every $\omega\in\Omega_n$. Convergence in distribution implies $\hat{W}(0) = 0$ $Q_\omega$-almost surely for $\mathbb{P}$-almost every $\omega\in\Omega$.
In order to verify that $Q_\omega$ is a solution of equation (4.2) with flow of measures $\mu_\omega$ for $\mathbb{P}$-almost every $\omega\in\Omega$, it suffices to check that condition (iii) of Definition 4.1 holds. The proof of this fact is analogous to the proof of Lemma 5.2 in Budhiraja, Dupuis and Fischer (2012). Since the situation here is somewhat different, we give details in Appendix D below.
The asymptotic lower bound for the average costs is a consequence of a version of Fatou's lemma [cf. Theorem A.3.12 in Dupuis and Ellis (1997), page 307] since, for every $n\in\mathcal{I}$,
$$\frac{1}{n}\sum_{i=1}^{n} J^n_i\big(u^n\big) = \int_{\Omega_n}\int_{\mathcal{Z}}\bigg(\int_{\Gamma\times[0,T]} f\big(t, \varphi(t), Q^n_\omega\circ\hat{X}(t)^{-1}, \gamma\big)\,r(d\gamma, dt) + F\big(\varphi(T), Q^n_\omega\circ\hat{X}(T)^{-1}\big)\bigg)\,Q^n_\omega(d\varphi, dr, dw)\,\mathbb{P}_n(d\omega)$$
and $Q^n_\omega\circ\hat{X}(t)^{-1}\to\mu(t)$ in distribution as $n\to\infty$.
REMARK 5.2. Lemma 5.3 shows that, under condition (T), all limit points of the normalized occupation measures $(Q^N)_{N\in\mathbb{N}}$ are concentrated on those random variables that, with probability one, take values in the set of McKean–Vlasov solutions of equation (4.2). The mean field condition of Definition 4.3 is therefore always satisfied.
In addition to (T), we will need the following weak symmetry condition on the costs:
$$(\mathrm{S})\qquad \exists\ \text{a sequence of indices } \big(i^N_*\big)_{N\in\mathbb{N}} \text{ with } i^N_*\in\{1,\ldots,N\} \text{ such that}$$
$$\sup_{N\in\mathbb{N}}\ J^N_{i^N_*}\big(u^N\big) < \infty \quad\text{and}\quad \limsup_{N\to\infty}\ \frac{1}{N}\sum_{i=1}^{N} J^N_i\big(u^N\big)\ \le\ \limsup_{N\to\infty}\ J^N_{i^N_*}\big(u^N\big).$$
REMARK 5.3. Condition (S) is automatically satisfied if the cost coefficients $f$, $F$ are bounded functions. If $f$, $F$ are unbounded and the costs associated with $u^N$ are symmetric in the sense that, for every $N$, every $i\in\{2,\ldots,N\}$, $J^N_1(u^N) = J^N_i(u^N)$, then thanks to assumption (A5) and Lemma 3.1, condition (S) follows from condition (T).
THEOREM 5.1. Let $(\varepsilon_N)_{N\in\mathbb{N}}\subset[0,\infty)$ be a sequence converging to zero. Suppose that $(\xi^N)_{N\in\mathbb{N}}$ and $(u^N)_{N\in\mathbb{N}}$ are such that (T) and (S) hold and, for each $N\in\mathbb{N}$, $\xi^N = (\xi^N_1,\ldots,\xi^N_N)$ is exchangeable and $u^N$ is a local $\varepsilon_N$-Nash equilibrium for the $N$-player game. Let $(Q^n)_{n\in\mathcal{I}}$ be a subsequence that converges in distribution to some $\mathcal{P}_2(\mathcal{Z})$-valued random variable $Q$ defined on some probability space $(\Omega,\mathcal{F},\mathbb{P})$. If there is $m\in\mathcal{M}_2$ such that, for $\mathbb{P}$-almost every $\omega\in\Omega$,
$$Q_\omega\circ\hat{X}(t)^{-1} = m(t), \qquad t\in[0,T],$$
then $(Q_\omega, m)$ is a solution of the mean field game for $\mathbb{P}$-almost every $\omega\in\Omega$.
We postpone the proof of Theorem 5.1 to the end of this section. The crucial hypothesis in Theorem 5.1 is the almost sure nonrandomness of the flow of measures induced by a limit random variable $Q$. Thus, under the rather general conditions (T) and (S), we prove convergence to solutions of a mean field game for subsequences with limit random variable $Q$ such that $\mathbb{P}\circ(Q\circ(\hat{X}(t)^{-1})_{t\in[0,T]})^{-1} = \delta_m$ for some $m\in\mathcal{M}_2$. This condition is reminiscent of the characterization of propagation of chaos in the Tanaka–Sznitman theorem. The nonrandomness of the induced flow of measures is implied by the nonrandomness of the joint law of initial condition, relaxed control and noise process, that is, by the condition $\mathbb{P}\circ(Q\circ(\hat{X}(0), \hat\rho, \hat{W})^{-1})^{-1} = \delta_\nu$ for some $\nu\in\mathcal{P}(\mathbb{R}^d\times\mathcal{R}_2\times\mathcal{W})$. This condition, in turn, is satisfied if the initial states and individual strategies of each $N$-player game are independent and identically distributed, where the marginal distributions are allowed to vary with $N$.
COROLLARY 5.2. Let $(\varepsilon_N)_{N\in\mathbb{N}}\subset[0,\infty)$ be a sequence converging to zero. Suppose that $(\xi^N)_{N\in\mathbb{N}}$ and $(u^N)_{N\in\mathbb{N}}$ are such that (T) holds and, for each $N\in\mathbb{N}$, $u^N$ is a local $\varepsilon_N$-Nash equilibrium for the $N$-player game and the random variables $(\xi^N_1, u^N_1, W^N_1), \ldots, (\xi^N_N, u^N_N, W^N_N)$ are independent and identically distributed. Let $(Q^n)_{n\in\mathcal{I}}$ be a subsequence that converges in distribution to some $\mathcal{P}_2(\mathcal{Z})$-valued random variable $Q$ defined on some probability space $(\Omega, \mathcal{F}, \mathbb{P})$. Then $Q_\omega$ is a solution of the mean field game for $\mathbb{P}$-almost every $\omega\in\Omega$.
PROOF. By distributional symmetry of the vectors of initial states and individual strategies, the costs are symmetric and condition (T) entails condition (S); cf. Remark 5.3 above.
Let $\mathcal{T}\subset C_b(\mathbb{R}^d\times\mathcal{R}_2\times\mathcal{W})$ be a countable and measure determining set of functions. Let $(Q^n)_{n\in\mathcal{I}}$ be a convergent subsequence with limit random variable $Q$ on $(\Omega, \mathcal{F}, \mathbb{P})$. Let $\Phi\in\mathcal{T}$, and set
$$m_\Phi \doteq \mathbb{E}_{\mathbb{P}}\Big[\mathbb{E}_{Q}\big[\Phi\big(\hat{X}(0), \hat\rho, \hat{W}\big)\big]\Big],\qquad v_\Phi \doteq \mathbb{E}_{\mathbb{P}}\Big[\big(\mathbb{E}_{Q}\big[\Phi\big(\hat{X}(0), \hat\rho, \hat{W}\big)\big] - m_\Phi\big)^2\Big],$$
$$m^n_\Phi \doteq \mathbb{E}_n\Big[\mathbb{E}_{Q^n}\big[\Phi\big(\hat{X}(0), \hat\rho, \hat{W}\big)\big]\Big],\qquad n\in\mathcal{I}.$$
The mapping $\mathcal{P}_2(\mathcal{Z})\ni\Theta\mapsto\mathbb{E}_\Theta[\Phi(\hat{X}(0), \hat\rho, \hat{W})]$ is continuous. By convergence of $(Q^n)$ to $Q$ and the continuous mapping theorem,
$$v_\Phi = \lim_{n\to\infty}\mathbb{E}_n\Big[\big(\mathbb{E}_{Q^n}\big[\Phi\big(\hat{X}(0), \hat\rho, \hat{W}\big)\big] - m^n_\Phi\big)^2\Big] = \lim_{n\to\infty}\mathbb{E}_n\bigg[\bigg(\frac{1}{n}\sum_{i=1}^{n}\Phi\big(\xi^n_i, \rho^{n,i}, W^n_i\big) - m^n_\Phi\bigg)^2\bigg],$$
where $\rho^{n,i}$ is the relaxed control random variable induced by $u^n_i$. As a consequence of the i.i.d. hypothesis, the random variables $\Phi(\xi^n_i, \rho^{n,i}, W^n_i)$, $i\in\{1,\ldots,n\}$, are independent and identically distributed with common mean equal to $m^n_\Phi$. Since $\Phi$ is bounded, it follows that $v_\Phi = 0$. This implies
$$\mathbb{E}_{Q}\big[\Phi\big(\hat{X}(0), \hat\rho, \hat{W}\big)\big] = m_\Phi\quad \mathbb{P}\text{-almost surely.}$$
Since $\mathcal{T}$ is countable, we have with $\mathbb{P}$-probability one $\mathbb{E}_{Q}[\Phi(\hat{X}(0), \hat\rho, \hat{W})] = m_\Phi$ for all $\Phi\in\mathcal{T}$. Since $\mathcal{T}$ is also measure determining, it follows that there exists a measure $\nu\in\mathcal{P}(\mathbb{R}^d\times\mathcal{R}_2\times\mathcal{W})$ such that, for $\mathbb{P}$-almost every $\omega\in\Omega$,
$$Q_\omega\circ\big(\hat{X}(0), \hat\rho, \hat{W}\big)^{-1} = \nu.$$
On the other hand, we know by Lemma 5.3 that $Q_\omega\in\mathcal{P}_2(\mathcal{Z})$ is a McKean–Vlasov solution of equation (4.2) for $\mathbb{P}$-almost every $\omega\in\Omega$. Uniqueness of such solutions according to Lemma 4.2 yields the existence of a measure $\Theta\in\mathcal{P}_2(\mathcal{Z})$ such that $Q_\omega = \Theta$ for $\mathbb{P}$-almost every $\omega\in\Omega$. Let $m\in\mathcal{M}_2$ be the flow of measures induced by $\Theta$. Then, for $\mathbb{P}$-almost every $\omega\in\Omega$,
$$Q_\omega\circ\hat{X}(t)^{-1} = m(t),\qquad t\in[0,T].$$
The assertion is now a consequence of Theorem 5.1.
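The heart of the argument above is elementary: for bounded i.i.d. random variables, the variance of the sample mean is $\mathrm{Var}/n$ and thus vanishes as $n\to\infty$, which is what forces $v_\Phi = 0$. A quick numerical sketch of this effect, where the choice of bounded test function and all names are assumptions of the illustration:

```python
import numpy as np

rng = np.random.default_rng(42)

def sample_mean_variance(n, reps=2000):
    # Variance of the sample mean of n i.i.d. bounded variables,
    # estimated over `reps` independent replications.
    samples = np.sin(rng.standard_normal((reps, n)))  # bounded test function
    return float(samples.mean(axis=1).var())

v10, v1000 = sample_mean_variance(10), sample_mean_variance(1000)
print(v1000 < v10)  # variance of the mean shrinks roughly like 1/n
```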
Existence of local approximate Nash equilibria as required in Corollary 5.2 is guaranteed, in particular, under the hypotheses of Proposition 3.1 above (compact action space, bounded coefficients). Suppose that $(\xi^N)$ is such that, for each $N\in\mathbb{N}$, $\xi^N$ is a vector of independent and identically distributed random variables with common marginal $m^N_0\in\mathcal{P}_2(\mathbb{R}^d)$ and that, for some $\delta_0>0$, $\sup_{N\in\mathbb{N}}\int|x|^{2+\delta_0}\,m^N_0(dx)<\infty$. Then, by Proposition 3.1, there exists a corresponding sequence $(u^N)$ of local approximate Nash equilibria such that the hypotheses of Corollary 5.2 are satisfied. In addition to the desired limit relation, we thus obtain a proof of existence of solutions for the mean field game. Note that existence of solutions is just a by-product of our analysis; analogous existence results can in fact be obtained by directly working with the mean field game; see Lacker (2015). The proof there is based, as in Proposition 3.1 here, on relaxed controls and a version of Fan's fixed-point theorem.
PROOF OF THEOREM 5.1. By hypothesis, $Q\circ\hat{X}(\cdot)^{-1} = m(\cdot)$ $\mathbb{P}$-almost surely for some deterministic $m\in\mathcal{M}_2$. In view of Lemma 5.3, it is enough to show that the pair $(Q_\omega, m)$ satisfies the optimality condition of Definition 4.3 with $\mathbb{P}$-probability one. This is equivalent to showing that $\hat{J}(m(0), Q_\omega; m) = \hat{V}(m(0); m)$ for $\mathbb{P}$-almost all $\omega\in\Omega$.
Let $\varepsilon > 0$. Choose a function $\psi^m_\varepsilon: [0,T]\times\mathbb{R}^d\times\mathcal{W}\to\Gamma$ and a probability measure $m_\varepsilon\in\mathcal{P}_2(\mathcal{Z})$ according to Lemma 4.3. Choose a sequence of indices $(i^n_*)_{n\in\mathcal{I}}$ according to condition (S). We will, as we may, assume that $i^n_* = 1$ for every $n\in\mathcal{I}$; otherwise, renumber the components of the $n$-player games.
The proof proceeds in five steps. First, we construct a coupling for the initial conditions. In the second step, based on that coupling and the feedback function $\psi^m_\varepsilon$, we define a competitor strategy $\tilde{u}^n$ that differs from $u^n$ only in component one ($= i^n_*$). As verified in step three, the associated normalized occupation measures have the same limit $Q$ as the sequence $(Q^n)$. This is used in the fourth step to show that $\limsup_{n\to\infty} J^n_1(\tilde{u}^n)\le \hat{V}(m(0); m) + \varepsilon$. Thanks to this upper limit, the local approximate Nash equilibrium property of $u^n$ together with condition (S), and the asymptotic lower bound on the average costs from Lemma 5.3, we establish optimality in the fifth and last step.
First step. By hypothesis, the sequence $(\mathbb{P}_n\circ(Q^n)^{-1})_{n\in\mathcal{I}}$ converges to $\mathbb{P}\circ Q^{-1}$ in $\mathcal{P}(\mathcal{P}_2(\mathcal{Z}))$. By the choice of the metric on $\mathcal{Z}$, the continuity of the map $\mathcal{Z}\ni(\varphi, r, w)\mapsto\varphi(0)\in\mathbb{R}^d$, and the mapping theorem [for instance, Theorem 5.1 in Billingsley (1968), page 30], we have that
$$\mathcal{P}_2(\mathcal{Z})\ni\Theta\ \mapsto\ \Theta\circ\hat{X}(0)^{-1}\in\mathcal{P}_2\big(\mathbb{R}^d\big)$$
is continuous. This implies, again by the continuous mapping theorem, that
$$\mathbb{P}_n\circ\big(Q^n\circ\hat{X}(0)^{-1}\big)^{-1}\ \xrightarrow{n\to\infty}\ \mathbb{P}\circ\big(Q\circ\hat{X}(0)^{-1}\big)^{-1}\quad\text{in } \mathcal{P}\big(\mathcal{P}_2\big(\mathbb{R}^d\big)\big).$$
By construction and hypothesis, respectively,
$$Q^n\circ\hat{X}(0)^{-1} = \frac{1}{n}\sum_{i=1}^{n}\delta_{\xi^n_i}\quad\text{while}\quad \mathbb{P}\circ\big(Q\circ\hat{X}(0)^{-1}\big)^{-1} = \delta_{m(0)}.$$
It follows that $(\frac{1}{n}\sum_{i=1}^{n}\delta_{\xi^n_i})_{n\in\mathcal{I}}$ converges to $m(0)$ in distribution as $\mathcal{P}_2(\mathbb{R}^d)$-valued random variables, where $m(0)$ is deterministic. This convergence implies, in particular, that
$$\mathbb{E}_n\bigg[\frac{1}{n}\sum_{i=1}^{n}\big|\xi^n_i\big|^2\bigg]\ \xrightarrow{n\to\infty}\ \int_{\mathbb{R}^d}|x|^2\,m(0)(dx).$$
By hypothesis, $\xi^n = (\xi^n_1, \ldots, \xi^n_n)$ is exchangeable for every $n\in\mathcal{I}$. Convergence of the associated empirical measures, by the Tanaka–Sznitman theorem [for instance, Theorem 3.2 in Gottlieb (1998), page 27], implies that
$$\mathbb{P}_n\circ\big(\xi^n_1\big)^{-1}\ \xrightarrow{n\to\infty}\ m(0)\quad\text{in } \mathcal{P}\big(\mathbb{R}^d\big).$$
Actually, we have convergence in $\mathcal{P}_2(\mathbb{R}^d)$ since, by exchangeability,
$$\mathbb{E}_n\big[\big|\xi^n_1\big|^2\big] = \mathbb{E}_n\bigg[\frac{1}{n}\sum_{i=1}^{n}\big|\xi^n_i\big|^2\bigg]\quad\text{for every } n\in\mathcal{I},$$
and the expectations on the right-hand side above converge to the second moment of $m(0)$. We are therefore in the situation of Lemma 5.2, and we apply that result with the choice $i^n_* = 1$ to obtain a sequence $(\bar\xi_n)_{n\in\mathcal{I}}$ of $\mathbb{R}^d$-valued random variables such that $\bar\xi_n$ is $\sigma(\xi^n_1, \vartheta^n_1)$-measurable, $\mathbb{P}_n\circ(\bar\xi_n)^{-1} = m(0)$ and $\mathbb{E}_n[|\xi^n_1 - \bar\xi_n|^2]\to 0$ as $n\to\infty$.
Second step. Define a strategy vector $\tilde{u}^n = (\tilde{u}^n_1, \ldots, \tilde{u}^n_n)$ by setting, for $(t,\omega)\in[0,T]\times\Omega_n$,
$$\tilde{u}^n_i(t,\omega)\doteq
\begin{cases}
\psi^m_\varepsilon\big(t, \bar\xi_n(\omega), W^n_1(\cdot,\omega)\big) & \text{if } i = 1,\\
u^n_i(t,\omega) & \text{if } i\in\{2,\ldots,n\}.
\end{cases}$$
Notice that $\tilde{u}^n$ is indeed a strategy vector for the game with $n$ players. Moreover, $\tilde{u}^n_i = u^n_i$ for $i\in\{2,\ldots,n\}$, while $\tilde{u}^n_1\in\mathcal{H}_2((\mathcal{F}^{n,1}_t), \mathbb{P}_n; \Gamma)$. Let $\tilde\rho^{n,i}$ be the relaxed control induced by $\tilde{u}^n_i$, $i\in\{1,\ldots,n\}$. Clearly, $\tilde\rho^{n,i} = \rho^{n,i}$ for $i\ge 2$. On the other hand, by construction and since $\bar\xi_n$ and $W^n_1$ are independent,
$$\mathbb{P}_n\circ\big(\bar\xi_n, \tilde\rho^{n,1}, W^n_1\big)^{-1} = m_\varepsilon\circ\big(\hat{X}(0), \hat\rho, \hat{W}\big)^{-1}\quad\text{for every } n\in\mathcal{I}.$$
The law of $\tilde{u}^n_1$, in particular, does not change with $n$. It follows that
$$\sup_{n\in\mathcal{I}}\ \mathbb{E}_n\bigg[\int_0^T\big|\tilde{u}^n_1(t)\big|^2\,dt\bigg] < \infty.$$
The coercivity assumption (A6) implies that there exists $C>0$ such that for every $n\in\mathcal{I}$,
$$\mathbb{E}_n\bigg[\int_0^T\big|u^n_1(t)\big|^2\,dt\bigg]\le C\big(1 + J^n_1\big(u^n\big)\big).$$
By choice of the index $i^n_* = 1$ according to (S), we have $\sup_{n\in\mathbb{N}} J^n_1(u^n) < \infty$. Since $\mathbb{E}_n[|\xi^n_1|^2] = \frac{1}{n}\sum_{i=1}^{n}\mathbb{E}_n[|\xi^n_i|^2]$ by exchangeability, it follows that
$$(5.3)\qquad \sup_{n\in\mathcal{I}}\ \mathbb{E}_n\bigg[\big|\xi^n_1\big|^2 + \int_0^T\big(\big|u^n_1(t)\big|^2 + \big|\tilde{u}^n_1(t)\big|^2\big)\,dt\bigg] < \infty.$$
Third step. Let $(\tilde{X}^n_1, \ldots, \tilde{X}^n_n)$ be the solution of the system of equations (3.1) under strategy vector $\tilde{u}^n$, and let $\tilde\mu^n$ denote the empirical measure process associated with $(\tilde{X}^n_1, \ldots, \tilde{X}^n_n)$. Let $\tilde{Q}^n$ be the normalized occupation measure associated with $\tilde{u}^n$, that is, the $\mathcal{P}_2(\mathcal{Z})$-valued random variable determined by
$$\tilde{Q}^n_\omega(B\times R\times D)\doteq \frac{1}{n}\sum_{i=1}^{n}\delta_{\tilde{X}^n_i(\cdot,\omega)}(B)\cdot\delta_{\tilde\rho^{n,i}_\omega}(R)\cdot\delta_{W^n_i(\cdot,\omega)}(D),\qquad\omega\in\Omega_n,$$
$B\in\mathcal{B}(\mathcal{X})$, $R\in\mathcal{B}(\mathcal{R}_2)$, $D\in\mathcal{B}(\mathcal{W})$. We are going to show that
$$(5.4)\qquad \tilde{Q}^n\ \xrightarrow{n\to\infty}\ Q\quad\text{in distribution as } \mathcal{P}_2(\mathcal{Z})\text{-valued random variables}.$$
Since $Q^n\to Q$ in distribution, it suffices to show that
$$d_{\mathcal{P}(\mathcal{P}_2(\mathcal{Z}))}\big(\mathbb{P}_n\circ\big(\tilde{Q}^n\big)^{-1}, \mathbb{P}_n\circ\big(Q^n\big)^{-1}\big)\ \xrightarrow{n\to\infty}\ 0.$$
Let $n\in\mathcal{I}$. By construction, the definition of the bounded Lipschitz metric, inequality (2.1) and Hölder's inequality,
$$\begin{aligned}
d_{\mathcal{P}(\mathcal{P}_2(\mathcal{Z}))}\big(\mathbb{P}_n\circ\big(\tilde{Q}^n\big)^{-1}, \mathbb{P}_n\circ\big(Q^n\big)^{-1}\big)
&= \sup_{G\in C(\mathcal{P}_2(\mathcal{Z})):\,\|G\|_{\mathrm{bLip}}\le 1}\Big|\mathbb{E}_n\big[G\big(Q^n\big) - G\big(\tilde{Q}^n\big)\big]\Big|\\
&\le \mathbb{E}_n\big[d_{\mathcal{P}_2(\mathcal{Z})}\big(Q^n, \tilde{Q}^n\big)\big]\\
&\le \mathbb{E}_n\bigg[\frac{1}{n}\sum_{i=1}^{n} d_{\mathcal{Z}}\big(\big(X^n_i, \rho^{n,i}, W^n_i\big), \big(\tilde{X}^n_i, \tilde\rho^{n,i}, W^n_i\big)\big)^2\bigg]^{1/2}\\
&\le \frac{1}{\sqrt{n}} + \mathbb{E}_n\bigg[\frac{1}{n}\sum_{i=1}^{n}\sup_{t\in[0,T]}\big|X^n_i(t) - \tilde{X}^n_i(t)\big|^2\bigg]^{1/2},
\end{aligned}$$
where the last inequality follows by definition of $d_{\mathcal{Z}}$ and from the fact that $\rho^{n,i} = \tilde\rho^{n,i}$ for $i\in\{2,\ldots,n\}$. Using assumption (A2), Hölder's inequality, Doob's maximal inequality, Itô's isometry, inequality (2.1) and Fubini's theorem, we find an estimate on the expectation on the right-hand side; similarly, but also using assumption (A3), this expectation is seen to vanish as $n\to\infty$ as a consequence of (5.3), condition (T), and Lemma 3.1.
Fourth step. We are going to show that
$$(5.5)\qquad \limsup_{n\to\infty}\ J^n_1\big(\tilde{u}^n\big)\ \le\ \hat{J}\big(m(0), m_\varepsilon; m\big).$$
Let $\bar{X}^n_1$ be the unique solution to
$$\bar{X}^n_1(t) = \bar\xi_n + \int_0^t b\big(s, \bar{X}^n_1(s), m(s), \tilde{u}^n_1(s)\big)\,ds + \int_0^t \sigma\big(s, \bar{X}^n_1(s), m(s)\big)\,dW^n_1(s),\qquad t\in[0,T].$$
Then, by uniqueness in law and construction, for every $n\in\mathcal{I}$,
$$\hat{J}\big(m(0), m_\varepsilon; m\big) = \mathbb{E}_n\bigg[\int_0^T f\big(t, \bar{X}^n_1(t), m(t), \tilde{u}^n_1(t)\big)\,dt + F\big(\bar{X}^n_1(T), m(T)\big)\bigg].$$
Using assumption (A2), Hölder's inequality, Itô's isometry and Fubini's theorem, we find that for every $t\in[0,T]$,
$$\mathbb{E}_n\big[\big|\tilde{X}^n_1(t) - \bar{X}^n_1(t)\big|^2\big]\ \le\ 3\,\mathbb{E}_n\big[\big|\xi^n_1 - \bar\xi_n\big|^2\big] + 6(T+1)L^2\,\mathbb{E}_n\bigg[\int_0^T d_2\big(\tilde\mu^n(s), m(s)\big)^2\,ds\bigg] + 6(T+1)L^2\int_0^t\mathbb{E}_n\big[\big|\tilde{X}^n_1(s) - \bar{X}^n_1(s)\big|^2\big]\,ds.$$
The limit relation (5.4) implies that $(\tilde\mu^n(0))_{n\in\mathcal{I}}$ converges to $m(0)$ in distribution as $\mathcal{P}_2(\mathbb{R}^d)$-valued random variables and that, by uniform integrability thanks to Lemma 3.2 and condition (T),
$$\sup_{t\in[0,T]}\ \mathbb{E}_n\big[d_2\big(\tilde\mu^n(t), m(t)\big)^2\big]\ \xrightarrow{n\to\infty}\ 0.$$
By choice of the random variables $\bar\xi_n$ according to Lemma 5.2,
$$\mathbb{E}_n\big[\big|\xi^n_1 - \bar\xi_n\big|^2\big]\ \xrightarrow{n\to\infty}\ 0.$$
Therefore, by Gronwall's lemma,
$$\sup_{t\in[0,T]}\ \mathbb{E}_n\big[\big|\tilde{X}^n_1(t) - \bar{X}^n_1(t)\big|^2\big]\ \xrightarrow{n\to\infty}\ 0.$$
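Gronwall's lemma is invoked here (and again below) in a form that, in discrete time, reads: if $e_{k+1}\le e_k(1+c\,\Delta t)+f_k\,\Delta t$, then $e_n\le (e_0+\sum_k f_k\,\Delta t)\,e^{c\,n\,\Delta t}$. A minimal numerical check of this bound; the function and parameter names are illustrative assumptions, not from the paper:

```python
import numpy as np

def gronwall_check(e0, c, f, dt):
    # Iterate the recursive inequality with equality (worst case) and
    # compare the result with the exponential Gronwall bound.
    e, total_f = e0, 0.0
    for fk in f:
        e = e * (1.0 + c * dt) + fk * dt
        total_f += fk * dt
    bound = (e0 + total_f) * np.exp(c * len(f) * dt)
    return e, bound

e, bound = gronwall_check(e0=0.1, c=2.0, f=[0.05] * 100, dt=0.01)
print(e <= bound)  # the iterated inequality stays below the bound
```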
Thanks to assumption (A4) and Hölder's inequality,
$$\begin{aligned}
\big|J^n_1\big(\tilde{u}^n\big) - \hat{J}\big(m(0), m_\varepsilon; m\big)\big|
&\le \mathbb{E}_n\bigg[\int_0^T\big|f\big(t, \tilde{X}^n_1(t), \tilde\mu^n(t), \tilde{u}^n_1(t)\big) - f\big(t, \bar{X}^n_1(t), m(t), \tilde{u}^n_1(t)\big)\big|\,dt\bigg]\\
&\quad + \mathbb{E}_n\big[\big|F\big(\tilde{X}^n_1(T), \tilde\mu^n(T)\big) - F\big(\bar{X}^n_1(T), m(T)\big)\big|\big]\\
&\le \sqrt{10}\,L(1+\sqrt{T})\,\sup_{t\in[0,T]}\mathbb{E}_n\big[\big|\tilde{X}^n_1(t) - \bar{X}^n_1(t)\big|^2 + d_2\big(\tilde\mu^n(t), m(t)\big)^2\big]^{1/2}\\
&\quad\times\sup_{t\in[0,T]}\mathbb{E}_n\big[1 + \big|\tilde{X}^n_1(t)\big|^2 + \big|\bar{X}^n_1(t)\big|^2 + d_2\big(\tilde\mu^n(t), \delta_0\big)^2 + d_2\big(m(t), \delta_0\big)^2\big]^{1/2}.
\end{aligned}$$
By (5.3) together with Lemma 3.1 and an analogous estimate applied to $\bar{X}^n_1$, and since $\sup_{t\in[0,T]} d_2(m(t), \delta_0)^2 < \infty$ by continuity, we have
$$\sup_{n\in\mathcal{I}}\ \sup_{t\in[0,T]}\ \mathbb{E}_n\big[\big|\tilde{X}^n_1(t)\big|^2 + \big|\bar{X}^n_1(t)\big|^2 + d_2\big(\tilde\mu^n(t), \delta_0\big)^2 + d_2\big(m(t), \delta_0\big)^2\big] < \infty.$$
It follows that $J^n_1(\tilde{u}^n)\to\hat{J}(m(0), m_\varepsilon; m)$ as $n\to\infty$, which establishes (5.5).
Fifth step. The limit relation (5.5) and the choice of $m_\varepsilon$ imply that
$$\limsup_{n\to\infty}\ J^n_1\big(\tilde{u}^n\big)\ \le\ \hat{V}\big(m(0); m\big) + \varepsilon.$$
By hypothesis, $u^n$ is a local $\varepsilon_n$-Nash equilibrium. By construction, $\tilde{u}^n$ differs from $u^n$ only in component number one ($= i^n_*$), and $\tilde{u}^n_1$ is $(\mathcal{F}^{n,1}_t)$-adapted. Therefore,
$$J^n_1\big(u^n\big)\le J^n_1\big(\tilde{u}^n\big) + \varepsilon_n.$$
By choice of the index $1 = i^n_*$ according to (S) and since $\varepsilon_n\to 0$ by hypothesis,
$$\limsup_{n\to\infty}\ \frac{1}{n}\sum_{i=1}^{n} J^n_i\big(u^n\big)\ \le\ \limsup_{n\to\infty}\ J^n_1\big(u^n\big)\ \le\ \limsup_{n\to\infty}\ J^n_1\big(\tilde{u}^n\big).$$
It follows that
$$\limsup_{n\to\infty}\ \frac{1}{n}\sum_{i=1}^{n} J^n_i\big(u^n\big)\ \le\ \hat{V}\big(m(0); m\big) + \varepsilon.$$
On the other hand, thanks to the second part of Lemma 5.3,
$$\liminf_{n\to\infty}\ \frac{1}{n}\sum_{i=1}^{n} J^n_i\big(u^n\big)\ \ge\ \int \hat{J}\big(m(0), Q_\omega, m\big)\,\mathbb{P}(d\omega).$$
It follows that
$$\int \hat{J}\big(m(0), Q_\omega, m\big)\,\mathbb{P}(d\omega)\ \le\ \hat{V}\big(m(0); m\big) + \varepsilon.$$
Since $\varepsilon > 0$ was arbitrary and $\hat{J}(m(0), Q_\omega, m)\ge\hat{V}(m(0); m)$ for every $\omega\in\Omega$ by definition of $\hat{V}$, we conclude that
$$\hat{J}\big(m(0), Q_\omega, m\big) = \hat{V}\big(m(0); m\big)\quad\text{for } \mathbb{P}\text{-almost all } \omega\in\Omega.$$

REMARK 5.4. The proof of Theorem 5.1 gives some insight into why the assumption that the limit flow of measures $m$ is deterministic cannot simply be dropped. In the second step of the proof, we define a competitor strategy $\tilde{u}^n_1$ for the deviating player (player one after relabeling) in terms of the noise feedback function $\psi^m_\varepsilon$. In general, for any $t\in[0,T]$, $\psi^m_\varepsilon(t,\cdot,\cdot)$ depends on $m$ through its values for all times, not only through its values up to time $t$. Therefore, if $m$ were random, even taking for granted the measurable dependence of $\psi^m_\varepsilon$ on $m$, we might end up with a nonadapted competitor strategy. Indeed, the natural choice for $\tilde{u}^n_1$, namely $\tilde{u}^n_1(t,\omega)\doteq \psi^{\mu^n_\omega(\cdot)}_\varepsilon(t, \bar\xi_n(\omega), W^n_1(\cdot,\omega))$, would in general yield a $\Gamma$-valued process that would not be an admissible strategy for player one in the $n$-player game.
APPENDIX A: PROOF OF LEMMA 4.1, SECOND PART
Let $\Theta\in\mathcal{P}(\mathcal{Z})$ be a solution of equation (4.2) with flow of measures $m$ in the sense of Definition 4.1. Using the local martingale property of $M^m_f$ for $f$ a monomial of first or second order as in the proof of Proposition 5.4.6 in Karatzas and Shreve (1991), pages 315–316, we find that, under $\Theta$ and with respect to the filtration $(\mathcal{G}_t)$:

• $\hat{W}$ is a $d_1$-dimensional vector of continuous local martingales with $\hat{W}(0) = 0$ and quadratic covariations
$$\langle\hat{W}_l, \hat{W}_{\tilde{l}}\rangle(t) = t\cdot\delta_{l,\tilde{l}},\qquad l, \tilde{l}\in\{1,\ldots,d_1\};$$
• $\bar{X}\doteq\hat{X} - \hat{X}(0) - \int_{\Gamma\times[0,\cdot]} b\big(s, \hat{X}(s), m(s), \gamma\big)\,\hat\rho(d\gamma, ds)$ is a $d$-dimensional vector of continuous local martingales with quadratic covariations
$$\langle\bar{X}_j, \bar{X}_k\rangle(t) = \int_0^t\big(\sigma\sigma^{\mathrm{T}}\big)_{jk}\big(s, \hat{X}(s), m(s)\big)\,ds,\qquad j, k\in\{1,\ldots,d\};$$
• $\hat{W}$, $\bar{X}$ have quadratic covariations
$$\langle\bar{X}_k, \hat{W}_l\rangle(t) = \int_0^t\sigma_{kl}\big(s, \hat{X}(s), m(s)\big)\,ds,$$
where $k\in\{1,\ldots,d\}$, $l\in\{1,\ldots,d_1\}$.
The local martingale property also holds with respect to the filtration $(\mathcal{G}_{t+})$; see the solution to Problem 5.4.13 in Karatzas and Shreve (1991), pages 318–319, 392, and Remark 4.2 in Budhiraja, Dupuis and Fischer (2012). By Lévy's characterization of Brownian motion [for instance, Theorem 3.3.16 in Karatzas and Shreve (1991), page 157], we see that $\hat{W}$ is a standard Wiener process with respect to $(\mathcal{G}_{t+})$. As a consequence, the process
$$Y(t)\doteq\int_0^t\sigma\big(s, \hat{X}(s), m(s)\big)\,d\hat{W}(s),\qquad t\in[0,T],$$
is well defined and a $d$-dimensional vector of continuous local martingales [under $\Theta$ with respect to $(\mathcal{G}_{t+})$] with quadratic covariations
$$\langle Y_j, Y_k\rangle(t) = \int_0^t\big(\sigma\sigma^{\mathrm{T}}\big)_{jk}\big(s, \hat{X}(s), m(s)\big)\,ds,\qquad j, k\in\{1,\ldots,d\},$$
$$\langle Y_j, \hat{W}_l\rangle(t) = \int_0^t\sigma_{jl}\big(s, \hat{X}(s), m(s)\big)\,ds,\qquad j\in\{1,\ldots,d\},\ l\in\{1,\ldots,d_1\}.$$
The quadratic covariations between the components of the vectors of continuous local martingales $\bar{X}$, $Y$ are given by [cf. Proposition 3.2.24 in Karatzas and Shreve (1991), page 147]
$$\langle Y_j, \bar{X}_k\rangle(t) = \sum_{l=1}^{d_1}\int_0^t\sigma_{jl}\big(s, \hat{X}(s), m(s)\big)\,d\langle\bar{X}_k, \hat{W}_l\rangle(s) = \int_0^t\big(\sigma\sigma^{\mathrm{T}}\big)_{jk}\big(s, \hat{X}(s), m(s)\big)\,ds,\qquad j, k\in\{1,\ldots,d\}.$$
It follows that $\bar{X} - Y$ is a $d$-dimensional vector of continuous local martingales with $\bar{X}(0) = 0 = Y(0)$ and quadratic covariations
$$\langle\bar{X}_j - Y_j, \bar{X}_k - Y_k\rangle = \langle\bar{X}_j, \bar{X}_k\rangle - \langle Y_j, \bar{X}_k\rangle - \langle\bar{X}_j, Y_k\rangle + \langle Y_j, Y_k\rangle \equiv 0.$$
This implies [cf. Problem 1.5.12 in Karatzas and Shreve (1991), page 35] that $\bar{X} = Y$ $\Theta$-almost surely, which establishes the solution property.
APPENDIX B: PROOF OF LEMMA 4.3

Fix $m\in\mathcal{M}_2$, and set, for $(t, x, \gamma)\in[0,T]\times\mathbb{R}^d\times\Gamma$,
$$b_m(t, x, \gamma)\doteq b\big(t, x, m(t), \gamma\big),\qquad \sigma_m(t, x)\doteq\sigma\big(t, x, m(t)\big),$$
$$f_m(t, x, \gamma)\doteq f\big(t, x, m(t), \gamma\big),\qquad F_m(x)\doteq F\big(x, m(T)\big).$$
Thanks to assumptions (A1), (A2), (A4) and the continuity of $m$, we have that $b_m$, $\sigma_m$, $f_m$ are continuous in the time and control variables, uniformly over compact subsets of $\mathbb{R}^d$; $b_m$, $\sigma_m$ are globally Lipschitz continuous in the state variable, uniformly in the other variables; and $f_m$, $F_m$ are locally Lipschitz continuous in the state variable, uniformly in the other variables, with local Lipschitz constants that grow sublinearly in the state variable.
The function $\psi^m_\varepsilon$ will be constructed based on the principle of dynamic programming applied in discrete time. To this end, we first introduce an original control problem corresponding to the minimal costs $\hat{V}(\cdot; m)$, then we build a sequence of approximating optimal control problems by successively restricting the set of admissible strategies. The proof proceeds in six steps.
First step. Let $\mathcal{U}$ be the set of all quadruples $((\Omega, \mathcal{F}, \mathbb{P}), (\mathcal{F}_t), \rho, W)$ such that the pair $((\Omega, \mathcal{F}, \mathbb{P}), (\mathcal{F}_t))$ forms a stochastic basis satisfying the usual hypotheses, $W$ is a $d_1$-dimensional $(\mathcal{F}_t)$-Wiener process, and $\rho$ is an $(\mathcal{F}_t)$-adapted $\mathcal{R}_2$-valued random variable such that $\mathbb{E}[\int_{\Gamma\times[0,T]}|\gamma|^2\,\rho(d\gamma, ds)]<\infty$. For simplicity, we may write $\rho\in\mathcal{U}$ instead of $((\Omega, \mathcal{F}, \mathbb{P}), (\mathcal{F}_t), \rho, W)\in\mathcal{U}$. Given any $\rho\in\mathcal{U}$, $(t_0, x)\in[0,T]\times\mathbb{R}^d$, the stochastic integral equation
$$(\mathrm{B.1})\qquad X(t) = x + \int_{\Gamma\times[0,t]} b_m\big(t_0+s, X(s), \gamma\big)\,\rho(d\gamma, ds) + \int_0^t\sigma_m\big(t_0+s, X(s)\big)\,dW(s),\qquad t\in[0, T-t_0],$$
has a unique solution $X = X^{t_0,x,\rho}$, that is, $X$ is the unique (up to indistinguishability with respect to $\mathbb{P}$) $\mathbb{R}^d$-valued $(\mathcal{F}_t)$-adapted continuous process that satisfies (B.1) with $\mathbb{P}$-probability one. Although the solution $X$ of equation (B.1) starts in $x$ at time zero, it corresponds to the solution of equation (4.2) starting in $x$ at time $t_0$. Define the costs associated with strategy $\rho$ and initial condition $(t_0, x)\in[0,T]\times\mathbb{R}^d$ by
$$J_m(t_0, x, \rho)\doteq\mathbb{E}\bigg[\int_{\Gamma\times[0,T-t_0]} f_m\big(t_0+s, X(s), \gamma\big)\,\rho(d\gamma, ds) + F_m\big(X(T-t_0)\big)\bigg],$$
where $X = X^{t_0,x,\rho}$. The corresponding value function $V_m$ is given by
$$V_m(t, x)\doteq\inf_{\rho\in\mathcal{U}} J_m(t, x, \rho),$$
which is well defined as a measurable function $[0,T]\times\mathbb{R}^d\to[0,\infty)$. Actually, $V_m$ is continuous. For $x\in\mathbb{R}^d$, $\rho\in\mathcal{U}$, set
$$\Theta_{x,\rho}\doteq\mathbb{P}\circ\big(X^{0,x,\rho}, \rho, W\big)^{-1}.$$
Then $\Theta_{x,\rho}$ is a solution of equation (4.2) with flow of measures $m$ and
$$J_m(0, x, \rho) = \hat{J}\big(\delta_x, \Theta_{x,\rho}; m\big).$$
Conversely, in view of Lemma 4.1 and thanks to assumption (A6), any $\Theta\in\mathcal{P}(\mathcal{Z})$ with $\hat{J}(\delta_x, \Theta; m) < \infty$ induces a strategy $\rho\in\mathcal{U}$ such that $\Theta_{x,\rho} = \Theta$. It follows that $V_m(0, x) = \hat{V}(\delta_x; m)$ for every $x\in\mathbb{R}^d$ and, by conditioning on the initial state at time zero,
$$\int_{\mathbb{R}^d} V_m(0, x)\,m(0)(dx) = \hat{V}\big(m(0); m\big).$$
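To make the role of the time shift in (B.1) concrete, here is a minimal Euler–Maruyama sketch for that equation with a strict (non-relaxed) control; the function names, the choice of coefficients and the sanity check are assumptions of this illustration, not objects from the paper:

```python
import numpy as np

def euler_b1(x0, t0, T, b_m, sigma_m, control, n_steps, rng):
    # Euler-Maruyama scheme for the time-shifted equation (B.1) on [0, T - t0]:
    # X(t) = x + int b_m(t0+s, X(s), u(s)) ds + int sigma_m(t0+s, X(s)) dW(s).
    dt = (T - t0) / n_steps
    x, t = float(x0), 0.0
    for _ in range(n_steps):
        dw = rng.standard_normal() * np.sqrt(dt)
        x += b_m(t0 + t, x, control(t)) * dt + sigma_m(t0 + t, x) * dw
        t += dt
    return x

# Sanity check with sigma_m = 0 and b_m(t, x, g) = -x: the scheme should
# approximate x0 * exp(-(T - t0)).
rng = np.random.default_rng(1)
x_T = euler_b1(1.0, t0=0.5, T=1.5, b_m=lambda t, x, g: -x,
               sigma_m=lambda t, x: 0.0, control=lambda t: 0.0,
               n_steps=20000, rng=rng)
print(abs(x_T - np.exp(-1.0)) < 1e-3)
```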
Second step. The function $V_m(0,\cdot)$ is locally Lipschitz continuous. To be more precise, choose $c_0>0$, $\Gamma_0\subset\Gamma$ according to (A6), and let $r_0>0$ be such that $\Gamma_0\subset\{\gamma\in\mathbb{R}^{d_2}: |\gamma|\le r_0\}$. We are going to show that there exists a constant $C_1\in(0,\infty)$ depending only on $K$, $L$, $T$, $m$, $r_0$ and $c_0$ such that
$$(\mathrm{B.2})\qquad \big|V_m(0, x) - V_m(0, \tilde{x})\big|\le C_1(1+R)|x-\tilde{x}|\quad\text{whenever } |x|\vee|\tilde{x}|\le R.$$
To establish (B.2), set, for $\varepsilon>0$, $R>0$,
$$\mathcal{U}_{\varepsilon,R}\doteq\big\{\rho\in\mathcal{U}: J_m(0, x; \rho)\le V_m(0, x)+\varepsilon\ \text{for some } x \text{ with } |x|\le R\big\}.$$
Then for all $x, \tilde{x}\in\mathbb{R}^d$ with $|x|\vee|\tilde{x}|\le R$,
$$\big|V_m(0, x) - V_m(0, \tilde{x})\big|\le\inf_{\varepsilon>0}\ \sup_{\rho\in\mathcal{U}_{\varepsilon,R}}\big|J_m(0, x; \rho) - J_m(0, \tilde{x}; \rho)\big|.$$
Let $x, \tilde{x}\in\mathbb{R}^d$, $\rho\in\mathcal{U}$ and let $X$, $\tilde{X}$ be the solutions of (B.1) under $\rho$ with initial state $x$ and $\tilde{x}$, respectively. Using Hölder's inequality, Jensen's inequality, Itô's isometry, Fubini's theorem, assumption (A2) and Gronwall's lemma, we find that there exists a constant $C_{L,T}$ depending only on $L$, $T$ such that
$$\sup_{t\in[0,T]}\ \mathbb{E}\big[\big|X(t)-\tilde{X}(t)\big|^2\big]\le C_{L,T}|x-\tilde{x}|^2.$$
Reusing the same tools but with assumption (A3) in place of (A2) (also cf. Lemma 3.1), we find that there exists a constant $C_{K,T,m}$ depending only on $K$, $T$, and on $m$ [through $\sup_{t\in[0,T]}\int|y|^2\,m(t)(dy)$, which is finite since $m$ is continuous] that bounds the corresponding second moments. Thanks to the above estimates and assumption (A4), there exist a constant $C_{L,T,m}$ depending only on $L$, $T$, and $m$, and a constant $C_{K,L,T,m}$ depending only on $K$, $L$, $T$, and $m$, that control the difference of the costs. By the same estimates as above, but using (A5) instead of (A4), we find that there exists a constant $\tilde{C}_{K,T,m}$ depending only on $K$, $T$, $m$ bounding the costs for all $x\in\mathbb{R}^d$. This implies that there exists a constant $C_{K,T,m,\Gamma}$ depending only on $K$, $T$, $m$, and on $\Gamma$ (through $\min_{\gamma\in\Gamma}|\gamma|^2$) such that, for all $x\in\mathbb{R}^d$,
$$V_m(0, x)\le C_{K,T,m,\Gamma}\big(1+|x|^2\big).$$
Let $\rho\in\mathcal{U}_{\varepsilon,R}$ for some $\varepsilon>0$. Choose $x\in\mathbb{R}^d$ with $|x|\le R$ such that $J_m(0, x; \rho)\le V_m(0, x)+\varepsilon$ (possible by definition of $\mathcal{U}_{\varepsilon,R}$). By the coercivity assumption (A6),
$$J_m(0, x; \rho)\ge c_0\,\mathbb{E}\bigg[\int_{(\Gamma\setminus\Gamma_0)\times[0,T]}|\gamma|^2\,\rho(d\gamma, dt)\bigg],$$
hence
$$c_0\,\mathbb{E}\bigg[\int_{(\Gamma\setminus\Gamma_0)\times[0,T]}|\gamma|^2\,\rho(d\gamma, dt)\bigg]\le C_{K,T,m,\Gamma}\big(1+R^2\big)+\varepsilon.$$
By construction,
$$\mathbb{E}\bigg[\int_{\Gamma\times[0,T]}|\gamma|^2\,\rho(d\gamma, dt)\bigg]\le T\cdot r_0^2 + \mathbb{E}\bigg[\int_{(\Gamma\setminus\Gamma_0)\times[0,T]}|\gamma|^2\,\rho(d\gamma, dt)\bigg].$$
It follows that there exists a constant $C_{K,T,m,c_0,r_0}$ depending only on $K$, $T$, $m$, $c_0$ and on $r_0$ (clearly, $\min_{\gamma\in\Gamma}|\gamma|^2\le r_0^2$) such that
$$\sup_{\rho\in\mathcal{U}_{\varepsilon,R}}\ \mathbb{E}\bigg[\int_{\Gamma\times[0,T]}|\gamma|^2\,\rho(d\gamma, dt)\bigg]\le C_{K,T,m,c_0,r_0}\big(1+R+\sqrt{\varepsilon}\big).$$
This establishes (B.2).
Third step. For $M\in\mathbb{N}$, set $\Gamma_M\doteq\{\gamma\in\Gamma: |\gamma|\le M\}$. For $M$ big enough, say $M\ge M_0$, $\Gamma_M$ is nonempty. Choose $\gamma_0\in\Gamma_{M_0}$, and set $\Gamma_M\doteq\{\gamma_0\}$ if $M<M_0$. Then, for every $M\in\mathbb{N}$, $\Gamma_M$ is compact (and nonempty) and $\Gamma_M\subset\Gamma_{M+1}$. Set
$$\mathcal{U}_M\doteq\big\{\rho\in\mathcal{U}: \rho\big(\Gamma_M\times[0,T]\big)=T\ \mathbb{P}\text{-almost surely}\big\},$$
and let $V_{m,M}$ be the value function defined with respect to $\mathcal{U}_M$ instead of $\mathcal{U}$. We claim that
$$(\mathrm{B.3})\qquad V_{m,M}(0,\cdot)\ \xrightarrow{M\to\infty}\ V_m(0,\cdot)\quad\text{uniformly over compact subsets of } \mathbb{R}^d.$$
Notice that, by construction, $V_{m,M}(0,\cdot)\ge V_{m,M+1}(0,\cdot)\ge V_m(0,\cdot)$ for every $M\in\mathbb{N}$. By Step 2, we know that $V_m(0,\cdot)$ is locally Lipschitz. Repeating the arguments of Step 2 (notice that $\mathcal{U}_M\subset\mathcal{U}$ by definition), we find that inequality (B.2) also holds for $V_{m,M}(0,\cdot)$ in place of $V_m(0,\cdot)$ and that the constant $C_1$ can be chosen independently of $M\in\mathbb{N}$. To establish (B.3), it is therefore enough to check that pointwise convergence holds. Fix $x\in\mathbb{R}^d$. It suffices to show that given $\rho\in\mathcal{U}$ there exists a sequence $(\rho^{(M)})\subset\mathcal{U}$ such that $\rho^{(M)}\in\mathcal{U}_M$ for every $M$ and $J_m(0, x; \rho^{(M)})\to J_m(0, x; \rho)$ as $M\to\infty$.

Let $\rho\in\mathcal{U}$. For $M\in\mathbb{N}$, let $\rho^{(M)}\in\mathcal{U}_M$ be such that for every $B\in\mathcal{B}(\Gamma)$, every $I\in\mathcal{B}([0,T])$,
$$\rho^{(M)}(B\times I) = \rho\big((B\cap\Gamma_M)\times I\big) + \rho\big((\Gamma\setminus\Gamma_M)\times I\big)\cdot\delta_{\gamma_0}(B).$$
This determines a unique strategy $\rho^{(M)}\in\mathcal{U}_M$. Clearly, $\rho^{(M)}$ comes with the same stochastic basis as $\rho$. If $(\dot\rho_t)$ is a version of the time derivative process associated with $\rho$ [thus, $\rho(d\gamma, dt) = \dot\rho_t(d\gamma)\,dt$], then a version of the time derivative process of $\rho^{(M)}$ is given by
$$\dot\rho^{(M)}_t(d\gamma) = \mathbf{1}_{\Gamma_M}(\gamma)\cdot\dot\rho_t(d\gamma) + \dot\rho_t(\Gamma\setminus\Gamma_M)\cdot\delta_{\gamma_0}(d\gamma).$$
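The reweighting that defines $\rho^{(M)}$ can be illustrated on a control with finitely many atoms: mass sitting on actions outside the truncated action set is moved onto the fixed action $\gamma_0$, so the total mass is preserved. A sketch, where the atoms, weights and the normalization to total mass 1 are assumptions of this illustration (in the text the total mass is $T$):

```python
import numpy as np

def truncate_control(atoms, weights, M, gamma0):
    # rho^(M): keep atoms with |gamma| <= M, move the remaining mass to gamma0.
    atoms = np.asarray(atoms, dtype=float)
    weights = np.asarray(weights, dtype=float)
    inside = np.abs(atoms) <= M
    new_atoms = np.append(atoms[inside], gamma0)
    new_weights = np.append(weights[inside], weights[~inside].sum())
    return new_atoms, new_weights

new_atoms, new_weights = truncate_control(
    atoms=[0.5, 2.0, -3.0], weights=[0.2, 0.3, 0.5], M=1.0, gamma0=0.0)
print(float(new_weights.sum()), bool(np.all(np.abs(new_atoms) <= 1.0)))
```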
Let $X$, $X^{(M)}$ be the solutions of (B.1) under $\rho$ and $\rho^{(M)}$, respectively. Thanks to Hölder's inequality, Jensen's inequality, Itô's isometry, Fubini's theorem and assumption (A2), there exists a constant $C_{L,T}$ depending only on $L$, $T$ such that, for every $t\in[0,T]$,
$$\mathbb{E}\big[\big|X(t)-X^{(M)}(t)\big|^2\big]\le C_{L,T}\int_0^t\mathbb{E}\big[\big|X(s)-X^{(M)}(s)\big|^2\big]\,ds + C_{L,T}\,\mathbb{E}\bigg[\bigg|\int_{\Gamma\times[0,t]} b_m\big(s, X(s), \gamma\big)\,\big(\rho^{(M)}-\rho\big)(d\gamma, ds)\bigg|^2\bigg].$$
Using the definition of $\rho^{(M)}$, Hölder's inequality and assumption (A3), we find that, for some constant $C_{K,T,m}$ depending only on $K$, $T$ and $m$,
$$\begin{aligned}
\mathbb{E}\bigg[\bigg|\int_{\Gamma\times[0,t]} b_m\big(s, X(s), \gamma\big)\,\big(\rho^{(M)}-\rho\big)(d\gamma, ds)\bigg|^2\bigg]
&\le 2T\,\mathbb{E}\bigg[\int_0^T\int_{\Gamma\setminus\Gamma_M}\big|b_m\big(s, X(s), \gamma\big)\big|^2\,\dot\rho_s(d\gamma)\,ds\bigg]\\
&\quad + 2\,\mathbb{E}\bigg[\rho\big((\Gamma\setminus\Gamma_M)\times[0,T]\big)\cdot\int_0^T\big|b_m\big(s, X(s), \gamma_0\big)\big|^2\,ds\bigg]\\
&\le C_{K,T,m}\,\mathbb{E}\Big[\rho\big((\Gamma\setminus\Gamma_M)\times[0,T]\big)\cdot\Big(1+\sup_{r\in[0,T]}\big|X(r)\big|^2\Big)\Big]\\
&\quad + C_{K,T,m}\,\mathbb{E}\bigg[\int_{\Gamma\times[0,T]}\mathbf{1}_{\Gamma\setminus\Gamma_M}(\gamma)\cdot|\gamma|^2\,\rho(d\gamma, ds)\bigg].
\end{aligned}$$
By (A3) and the usual estimates, including Gronwall's lemma, we have $\mathbb{E}[\sup_{r\in[0,T]}|X(r)|^2]<\infty$. Since $\rho_\omega$ is a measure with total mass $T$ for every $\omega\in\Omega$, we have $\rho((\Gamma\setminus\Gamma_M)\times[0,T])\to 0$ as $M\to\infty$ $\mathbb{P}$-almost surely. This implies, by dominated convergence,
$$\mathbb{E}\Big[\rho\big((\Gamma\setminus\Gamma_M)\times[0,T]\big)\cdot\Big(1+\sup_{r\in[0,T]}\big|X(r)\big|^2\Big)\Big]\ \xrightarrow{M\to\infty}\ 0.$$
On the other hand, $\mathbb{E}[\int_{\Gamma\times[0,T]}|\gamma|^2\,\rho(d\gamma, ds)]<\infty$ by definition of $\mathcal{U}$. This means that
$$\mathbb{E}\bigg[\int_{\Gamma\times[0,T]}\mathbf{1}_{\Gamma\setminus\Gamma_M}(\gamma)\cdot|\gamma|^2\,\rho(d\gamma, ds)\bigg]\ \xrightarrow{M\to\infty}\ 0.$$
An application of Gronwall's lemma now yields
$$\mathbb{E}\big[\big|X(t)-X^{(M)}(t)\big|^2\big]\ \xrightarrow{M\to\infty}\ 0.$$
This convergence together with assumption (A5) (and an estimate completely analogous to the one above) implies that $J_m(0, x; \rho^{(M)})\to J_m(0, x; \rho)$ as $M\to\infty$.
Fourth step. Choose a family $(\Gamma_{M,k})_{M,k\in\mathbb{N}}$ of finite subsets of $\Gamma$ such that $\Gamma_{M,k}\subset\Gamma_{M,k+1}\subset\Gamma_M$, $\Gamma_{M,k}\subset\Gamma_{M+1,k}$, and $\min_{\tilde\gamma\in\Gamma_{M,k}}|\gamma-\tilde\gamma|\le 1/k$ for any $\gamma\in\Gamma_M$. Let $\mathcal{U}_{M,k}$ be the set of all $\rho\in\mathcal{U}$ such that $\rho$ is the $\mathcal{R}_2$-valued random variable induced by a $\Gamma_{M,k}$-valued adapted process that is piecewise constant in time with respect to the equidistant grid of step size $T\cdot 2^{-k}$. Thus,