
Electronic Journal of Probability

Electron. J. Probab. (), no. , 1–36. ISSN: 1083-6489. DOI:

BSDE Representation and Randomized Dynamic Programming Principle for Stochastic Control Problems of Infinite-Dimensional Jump-Diffusions

Elena Bandini*   Fulvia Confortola   Andrea Cosso

Abstract

We consider a general class of stochastic optimal control problems, where the state process lives in a real separable Hilbert space and is driven by a cylindrical Brownian motion and a Poisson random measure; no special structure is imposed on the coefficients, which are also allowed to be path-dependent; in addition, the diffusion coefficient can be degenerate. For such a class of stochastic control problems, we prove, by means of purely probabilistic techniques based on the so-called randomization method, that the value of the control problem admits a probabilistic representation formula (known as a non-linear Feynman-Kac formula) in terms of a suitable backward stochastic differential equation. This probabilistic representation considerably extends the current results in the literature on the infinite-dimensional case, and it is also relevant in finite dimension. Such a representation allows us to show, in the non-path-dependent (or Markovian) case, that the value function satisfies the so-called randomized dynamic programming principle. As a consequence, we are able to prove that the value function is a viscosity solution of the corresponding Hamilton-Jacobi-Bellman equation, which turns out to be a second-order fully non-linear integro-differential equation in Hilbert space.

Keywords: backward stochastic differential equations; infinite-dimensional path-dependent controlled SDEs; randomization method; viscosity solutions.

AMS MSC 2010: 60H10; 60H15; 93E20; 49L25.

Submitted to EJP on October 3, 2018, final version accepted on June 9, 2019.

1 Introduction

In the present paper we study a general class of stochastic optimal control problems, where the infinite-dimensional state process, taking values in a real separable Hilbert

*Department of Mathematics, University of Milano-Bicocca, Via Roberto Cozzi 55, 20125 Milano, Italy.

E-mail: [email protected]

Department of Mathematics, Politecnico di Milano, Via Bonardi 9, 20133 Milano, Italy.

E-mail: [email protected]

Department of Mathematics, University of Bologna, Piazza di Porta S. Donato 5, 40126 Bologna, Italy.


space $H$, has dynamics driven by a cylindrical Brownian motion $W$ and a Poisson random measure $\pi$. Moreover, the coefficients are assumed to be path-dependent, in the sense that they depend on the past trajectory of the state process. In addition, the space of control actions $\Lambda$ can be any Borel space (i.e., any topological space homeomorphic to a Borel subset of a Polish space). More precisely, the controlled state process is a so-called mild solution to the following equation on $[0,T]$:
$$dX_t = AX_t\,dt + b_t(X,\alpha_t)\,dt + \sigma_t(X,\alpha_t)\,dW_t + \int_{U\setminus\{0\}}\gamma_t(X,\alpha_t,z)\,\big(\pi(dt\,dz)-\lambda_\pi(dz)\,dt\big), \qquad X_0 = x_0,$$
where $A$ is a linear operator generating a strongly continuous semigroup $\{e^{tA},\ t\ge 0\}$, $\lambda_\pi(dz)\,dt$ is the compensator of $\pi$, while $\alpha$ is an admissible control process, that is, a predictable stochastic process taking values in $\Lambda$. Given an admissible control $\alpha$, the corresponding gain functional is given by
$$J(\alpha) = \mathbb E\Big[\int_0^T f_t(X^{x_0,\alpha},\alpha_t)\,dt + g(X^{x_0,\alpha})\Big],$$
where the running and terminal reward functionals $f$ and $g$ may also depend on the past trajectory of the state process. The value of the stochastic control problem, starting at $t=0$ from $x_0$, is defined as
$$V_0 = \sup_\alpha J(\alpha). \tag{1.1}$$
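As a toy illustration of the objects just introduced, the following self-contained sketch simulates a one-dimensional analogue of the controlled state equation with an Euler-type scheme and estimates $J(\alpha)$ by Monte Carlo for constant controls. Every coefficient below ($A$, $b$, $\sigma$, $\gamma$, $f$, $g$, the jump intensity) is our own illustrative choice, not taken from the paper.

```python
import numpy as np

# Finite-dimensional sketch: one-dimensional analogue of the controlled
# state equation, with a compound-Poisson jump term and a constant control a.
# All coefficients are illustrative choices of ours, not from the paper.
def simulate_state(a, T=1.0, n_steps=200, lam=2.0, rng=None):
    rng = rng or np.random.default_rng(0)
    dt = T / n_steps
    A = -1.0                                   # "generator" (a scalar here)
    b = lambda x, a: a - x                     # drift b_t(x, a)
    sig = lambda x, a: 0.3 * (1.0 + abs(a))    # diffusion sigma_t(x, a)
    gam = lambda x, a, z: 0.1 * a * z          # jump coefficient gamma_t(x, a, z)
    x = np.empty(n_steps + 1)
    x[0] = 1.0                                 # deterministic x0
    for k in range(n_steps):
        dW = rng.normal(0.0, np.sqrt(dt))
        n_jumps = rng.poisson(lam * dt)        # atoms of pi on (t_k, t_{k+1}]
        jump = sum(gam(x[k], a, rng.normal()) for _ in range(n_jumps))
        # marks z ~ N(0,1) are centred, so the compensator contribution
        # lam * E[gamma] * dt vanishes for this choice of gamma
        x[k + 1] = x[k] + (A * x[k] + b(x[k], a)) * dt \
                   + sig(x[k], a) * dW + jump
    return x

def gain(a, n_paths=300):
    # Monte Carlo estimate of J(a) with illustrative rewards
    # f_t(x, a) = -x_t**2 and g(x) = -|x_T|.
    rng = np.random.default_rng(1)
    dt = 1.0 / 200
    total = 0.0
    for _ in range(n_paths):
        x = simulate_state(a, rng=rng)
        total += -np.sum(x[:-1] ** 2) * dt - abs(x[-1])
    return total / n_paths

# crude approximation of V0 = sup_a J(a) over a small grid of constant controls
V0_grid = max(gain(a) for a in (-1.0, 0.0, 1.0))
```

Of course, maximizing over constant controls on a grid only gives a crude lower bound for the supremum over all predictable control processes.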

Stochastic optimal control problems of infinite-dimensional processes have been extensively studied using the theory of Backward Stochastic Differential Equations (BSDEs); we mention in particular the seminal papers [11], [12] and the last chapter of the recent book [9], where a detailed discussion of the literature can be found. Notice, however, that the current results require a special structure of the controlled state equation, namely that the diffusion coefficient $\sigma = \sigma(t,x)$ is uncontrolled and the drift has the specific form $b = b_1(t,x) + \sigma(t,x)\,b_2(t,x,a)$. To the best of our knowledge, only the recent paper [6], which is devoted to the study of ergodic control problems, applies BSDE techniques to a more general class of infinite-dimensional controlled state processes; in [6] the drift has the general form $b = b(x,a)$, but the diffusion coefficient is still uncontrolled and indeed constant; moreover, the space of control actions $\Lambda$ is assumed to be a real separable Hilbert space (or, more generally, according to Remark 2.2 in [6], $\Lambda$ has to be the image of a continuous surjection $\varphi$ defined on some real separable Hilbert space). Finally, [6] only addresses the non-path-dependent (or Markovian) case, and does not treat the Hamilton-Jacobi-Bellman (HJB) equation related to the stochastic control problem.

The stochastic optimal control problem (1.1) is studied by means of the so-called randomization method. The latter is a purely probabilistic methodology which allows one to prove directly, starting from the definition of $V_0$, that the value itself admits a representation formula (also known as a non-linear Feynman-Kac formula) in terms of a suitable backward stochastic differential equation, completely avoiding analytical tools such as the Hamilton-Jacobi-Bellman equation or viscosity solutions techniques.

This procedure was previously applied in [10] and [1], where a stochastic control problem in finite dimension for diffusive processes (without jumps) was addressed. We also mention [15], which inspired [10] and [1], and where a non-linear Feynman-Kac formula for the value function of a jump-diffusive finite-dimensional stochastic control problem is provided. Notice, however, that the methodology implemented in [15] (and adapted to various different frameworks, see e.g. [2], [3], [7]) is quite different and


requires more restrictive assumptions; as a matter of fact, there the authors find the BSDE representation by passing through the Hamilton-Jacobi-Bellman equation, and in particular by using viscosity solution techniques; moreover, in order to apply the techniques of [15], one already needs to know that the value function is the unique viscosity solution to the HJB equation.

The randomization method developed in the present paper considerably improves the methodology used in [15] and allows us to extend the results in [10] and [1] to the infinite-dimensional jump-diffusive framework, addressing, in addition, the path-dependent case. We notice that it would be possible to consider a path-dependence, or delay, in the control variable as well; however, in order to make the presentation more understandable and effective, we assume a path-dependence only in the state variable. We underline that our results are also relevant in the finite-dimensional case, as it is the first time the randomization method (understood as a purely probabilistic methodology, as explained above) is implemented when a jump component appears in the state process dynamics. Roughly speaking, the key idea of the randomization method consists in randomizing the control process $\alpha$, by replacing it with an uncontrolled pure jump process $I$ associated with a Poisson random measure $\theta$, independent of $W$ and $\pi$; for the pair of processes $(X,I)$, a new randomized intensity-control problem is then introduced in such a way that the corresponding value coincides with the original one. The idea of this control randomization procedure comes from the well-known methodology implemented in [16] to prove the dynamic programming principle, which is based on the use of piece-wise constant policies. More specifically, in [16] it is shown (under quite general assumptions; the only unusual assumption is the continuity of all coefficients with respect to the control variable) that the supremum over all admissible controls $\alpha$ can be replaced by the supremum over a suitable class of piece-wise constant policies. This allows one to prove, in a relatively easy but rigorous manner, the dynamic programming principle, see Theorem III.1.6 in [16]. Similarly, in the randomization method we prove (Theorem 4.7), under quite general assumptions (the only unusual assumption is still the continuity of all coefficients with respect to the control variable), that we can optimize over a suitable class of piece-wise constant policies, whose dynamics is now described by the Poisson random measure $\theta$. This particular class of policies allows us to prove the BSDE representation (Theorem 5.6), as well as the randomized dynamic programming principle. Notice that in the present paper we have made an effort to simplify various arguments in the proof of Theorem 4.7 and streamline the exposition.

In the Markovian case (Section 6), namely when the coefficients are non-path-dependent, we consider a family of stochastic control problems, one for each $(t,x)\in[0,T]\times H$, and define the corresponding value function. Then, exploiting the BSDE representation derived in Section 5, we are able to prove the so-called randomized dynamic programming principle (Theorem 6.6), which is as powerful as the classical dynamic programming principle, in the sense that it allows us to prove (Proposition 6.12) that the value function is a viscosity solution to the Hamilton-Jacobi-Bellman equation, which turns out to be the following second-order fully non-linear integro-differential equation on $[0,T]\times H$:
$$\begin{cases} v_t + \langle Ax, D_x v\rangle + \sup_{a\in\Lambda}\Big\{\tfrac12\,\mathrm{Tr}\big(\sigma(t,x,a)\sigma^*(t,x,a)D^2_x v\big) + \langle b(t,x,a), D_x v\rangle + f(t,x,a) \\ \qquad\qquad\qquad +\ \int_{U\setminus\{0\}}\big(v(t,x+\gamma(t,x,a,z)) - v(t,x) - \langle D_x v(t,x), \gamma(t,x,a,z)\rangle\big)\,\lambda_\pi(dz)\Big\} = 0, \\ v(T,x) = g(x). \end{cases} \tag{1.2}$$

Notice that in the non-diffusive case, namely when $\sigma \equiv 0$, the control problem corresponding to equation (1.2) has already been studied in [20]. There the authors rigorously prove the (classical) dynamic programming principle (Theorem 4.2 in [20]) and show that the value function solves equation (1.2) (with $\sigma\equiv 0$) in the viscosity sense (Theorem 5.4 in [20]). Then, Theorem 6.6 below, which provides the randomized dynamic programming principle, can be seen as a generalization of Theorem 4.2 in [20]; similarly, Proposition 6.12 extends Theorem 5.4 in [20] to the case with $\sigma$ not necessarily equal to zero. Finally, we recall [19], which is devoted to the proof of a comparison principle for viscosity solutions to equation (1.2) (again with $\sigma$ not necessarily equal to zero), to which we refer in Remark 6.13.

The paper is organized as follows. In Section 2 we introduce the notation used in the paper and state the assumptions imposed on the coefficients (notice, however, that in the last section, namely Section 6, concerning the Markovian case, we introduce a different set of assumptions and some additional notation). Section 3 is devoted to the formulation of the stochastic optimal control problem, while in Section 4 we introduce the so-called randomized control problem, which allows us to prove one of our main results, namely Theorem 4.7. In Section 5 we prove the BSDE representation of the value $V_0$ (Theorem 5.6). Finally, Section 6 is devoted to the study of the non-path-dependent (or Markovian) case, where we prove that the value function satisfies the randomized dynamic programming principle (Theorem 6.6) and we show that it is a viscosity solution to the corresponding Hamilton-Jacobi-Bellman equation (Proposition 6.12).

2 Notations and assumptions

Let $H$, $U$ and $\Xi$ be three real separable Hilbert spaces, equipped with their respective Borel $\sigma$-algebrae. We denote by $|\cdot|$ and $\langle\cdot,\cdot\rangle$ (resp. $|\cdot|_U$, $|\cdot|_\Xi$ and $\langle\cdot,\cdot\rangle_U$, $\langle\cdot,\cdot\rangle_\Xi$) the norm and scalar product in $H$ (resp. in $U$ and $\Xi$). Let $(\Omega,\mathcal F,\mathbb P)$ be a complete probability space on which are defined a random variable $x_0\colon\Omega\to H$, a cylindrical Brownian motion $W=(W_t)_{t\ge0}$ with values in $\Xi$, and a Poisson random measure $\pi(dt\,dz)$ on $[0,\infty)\times U$ with compensator $\lambda_\pi(dz)\,dt$. We assume that $x_0$, $W$, $\pi$ are independent. We denote by $\mu_0$ the law of $x_0$, which is a probability measure on the Borel subsets of $H$. We also denote by $\mathbb F^{x_0,W,\pi}=(\mathcal F^{x_0,W,\pi}_t)_{t\ge0}$ the $\mathbb P$-completion of the filtration generated by $x_0$, $W$, $\pi$, which turns out to be also right-continuous, as follows for instance from Theorem 1 in [13]. So, in particular, $\mathbb F^{x_0,W,\pi}$ satisfies the usual conditions. When $x_0$ is deterministic (that is, when $\mu_0$ is the Dirac measure $\delta_{x_0}$) we denote $\mathbb F^{x_0,W,\pi}$ simply by $\mathbb F^{W,\pi}$.

Let $L(\Xi;H)$ be the Banach space of bounded linear operators $P\colon\Xi\to H$, and let $L_2(\Xi;H)$ be the Hilbert space of Hilbert-Schmidt operators $P\colon\Xi\to H$.

Let $T>0$ be a finite time horizon. For every $t\in[0,T]$, we consider the Banach space $D([0,t];H)$ of càdlàg maps $x\colon[0,t]\to H$, endowed with the supremum norm $x^*_t := \sup_{s\in[0,t]}|x(s)|$; when $t=T$ we also use the notation $\|x\|_\infty := \sup_{s\in[0,T]}|x(s)|$. On $D([0,T];H)$ we define the canonical filtration $(\mathcal D^0_t)_{t\in[0,T]}$, with $\mathcal D^0_t$ generated by the coordinate maps
$$\Pi_s\colon D([0,T];H)\to H, \qquad x(\cdot)\mapsto x(s),$$
for all $s\in[0,t]$. We also define its right-continuous version $(\mathcal D_t)_{t\in[0,T]}$, that is $\mathcal D_t=\cap_{s>t}\mathcal D^0_s$ for every $t\in[0,T)$ and $\mathcal D_T=\mathcal D^0_T$. Then, we denote by $Pred(D([0,T];H))$ the predictable $\sigma$-algebra on $[0,T]\times D([0,T];H)$ associated with the filtration $(\mathcal D_t)_{t\in[0,T]}$.

Let $\Lambda$ be a Borel space, namely a topological space homeomorphic to a Borel subset of a Polish space. We denote by $\mathcal B(\Lambda)$ the Borel $\sigma$-algebra of $\Lambda$. We also denote by $d_\Lambda$ a bounded distance on $\Lambda$.

Let $A\colon D(A)\subset H\to H$ be a linear operator and consider the maps $b\colon[0,T]\times D([0,T];H)\times\Lambda\to H$, $\sigma\colon[0,T]\times D([0,T];H)\times\Lambda\to L(\Xi;H)$, $\gamma\colon[0,T]\times D([0,T];H)\times\Lambda\times U\to H$, $f\colon[0,T]\times D([0,T];H)\times\Lambda\to\mathbb R$, $g\colon D([0,T];H)\to\mathbb R$, on which we impose the following assumptions.

(A)

(i) $A$ generates a strongly continuous semigroup $\{e^{tA},\ t\ge0\}$ in $H$.

(ii) $\mu_0$, the law of $x_0$, satisfies $\int_H|x|^{p_0}\,\mu_0(dx)<\infty$ for some $p_0\ge\max(2,2\bar p)$, with the same $\bar p\ge0$ as in (2.1) below.

(iii) There exists a Borel measurable function $\rho\colon U\to\mathbb R$, bounded on bounded subsets of $U$, such that
$$\inf_{|z|_U>R}\rho(z) > 0 \quad\text{for every } R>0, \qquad\text{and}\qquad \int_U|\rho(z)|^2\,\lambda_\pi(dz) < \infty.$$

(iv) The maps $b$ and $f$ are $Pred(D([0,T];H))\otimes\mathcal B(\Lambda)$-measurable. For every $v\in\Xi$, the map $\sigma(\cdot,\cdot,\cdot)v\colon[0,T]\times D([0,T];H)\times\Lambda\to H$ is $Pred(D([0,T];H))\otimes\mathcal B(\Lambda)$-measurable. The map $\gamma$ is $Pred(D([0,T];H))\otimes\mathcal B(\Lambda)\otimes\mathcal B(U)$-measurable. The map $g$ is $\mathcal D_T$-measurable.

(v) The map $g$ is continuous on $D([0,T];H)$ with respect to the supremum norm. For every $t\in[0,T]$, the maps $b_t(\cdot,\cdot)$ and $f_t(\cdot,\cdot)$ are continuous on $D([0,T];H)\times\Lambda$. For every $(t,z)\in[0,T]\times U$, the map $\gamma_t(\cdot,\cdot,z)$ is continuous on $D([0,T];H)\times\Lambda$. For every $t\in[0,T]$ and any $s\in(0,T]$, we have $e^{sA}\sigma_t(x,a)\in L_2(\Xi;H)$ for all $(x,a)\in D([0,T];H)\times\Lambda$, and the map $e^{sA}\sigma_t(\cdot,\cdot)\colon D([0,T];H)\times\Lambda\to L_2(\Xi;H)$ is continuous.

(vi) For all $t\in[0,T]$, $s\in(0,T]$, $x,x'\in D([0,T];H)$, $a\in\Lambda$, $z\in U$,
$$\begin{aligned} |b_t(x,a)-b_t(x',a)| + |e^{sA}\sigma_t(x,a)-e^{sA}\sigma_t(x',a)|_{L_2(\Xi;H)} &\le L\,(x-x')^*_t, \\ |\gamma_t(x,a,z)-\gamma_t(x',a,z)| &\le L\,\rho(z)\,(x-x')^*_t, \\ |b_t(0,a)| + |\sigma_t(0,a)|_{L_2(\Xi;H)} \le L, \qquad |\gamma_t(0,a,z)| &\le L\,\rho(z), \\ |f_t(x,a)| + |g(x)| &\le L\,\big(1+\|x\|_\infty^{\bar p}\big), \end{aligned} \tag{2.1}$$
for some constants $L\ge0$ and $\bar p\ge0$.

3 Stochastic optimal control problem

In the present section we formulate the original stochastic optimal control problem in two different probabilistic settings. More precisely, we begin by formulating (see subsection 3.1 below) the control problem in a standard way, using the probabilistic setting previously introduced. Afterwards, in subsection 3.2 we formulate it in the so-called randomized probabilistic setting (which will be used for the rest of the paper and, in particular, for the formulation of the randomized control problem in Section 4). Finally, we prove that the two formulations have the same value.

3.1 Formulation of the control problem

We formulate the stochastic optimal control problem in the probabilistic setting introduced in Section 2. An admissible control process will be any $\mathbb F^{x_0,W,\pi}$-predictable process $\alpha$ with values in $\Lambda$. The set of all admissible control processes is denoted by $\mathcal A$. The controlled state process satisfies the following equation on $[0,T]$:
$$dX_t = AX_t\,dt + b_t(X,\alpha_t)\,dt + \sigma_t(X,\alpha_t)\,dW_t + \int_{U\setminus\{0\}}\gamma_t(X,\alpha_t,z)\,\big(\pi(dt\,dz)-\lambda_\pi(dz)\,dt\big), \qquad X_0 = x_0. \tag{3.1}$$


We look for a mild solution to the above equation in the sense of the following definition.

Definition 3.1. Let $\alpha\in\mathcal A$. We say that a càdlàg $\mathbb F^{x_0,W,\pi}$-adapted stochastic process $X=(X_t)_{t\in[0,T]}$, taking values in $H$, is a mild solution to equation (3.1) if, $\mathbb P$-a.s.,
$$X_t = e^{tA}x_0 + \int_0^t e^{(t-s)A}b_s(X,\alpha_s)\,ds + \int_0^t e^{(t-s)A}\sigma_s(X,\alpha_s)\,dW_s + \int_0^t\!\!\int_{U\setminus\{0\}} e^{(t-s)A}\gamma_s(X,\alpha_s,z)\,\big(\pi(ds\,dz)-\lambda_\pi(dz)\,ds\big), \quad \text{for all } 0\le t\le T.$$

Proposition 3.2. Under assumption (A), for every $\alpha\in\mathcal A$, there exists a unique mild solution $X^{x_0,\alpha}=(X^{x_0,\alpha}_t)_{t\in[0,T]}$ to equation (3.1). Moreover, for every $1\le p\le p_0$,
$$\mathbb E\Big[\sup_{t\in[0,T]}|X^{x_0,\alpha}_t|^p\Big] \le C_p\,\big(1+\mathbb E[|x_0|^p]\big), \tag{3.2}$$
for some positive constant $C_p$, independent of $x_0$ and $\alpha$.

Proof. Under assumption (A), the existence of a unique mild solution $X^{x_0,\alpha}=(X^{x_0,\alpha}_t)_{t\in[0,T]}$ to equation (3.1), for every $\alpha\in\mathcal A$, can be obtained by a fixed point argument, proceeding as in Theorem 3.4 in [19] and taking into account the fact that the coefficients of equation (3.1) are path-dependent.

We now prove estimate (3.2). In the sequel, we denote by $C$ a positive constant depending only on $T$ and $p$, independent of $x_0$ and $\alpha$, that may vary from line to line. For brevity we will denote $X^{x_0,\alpha}$ simply by $X$. We start by noticing that

$$\begin{aligned} \mathbb E\Big[\sup_{t\in[0,T]}|X_t|^p\Big]^{1/p} &\le \mathbb E\Big[\sup_{t\in[0,T]}|e^{tA}x_0|^p\Big]^{1/p} + \mathbb E\Big[\sup_{t\in[0,T]}\Big|\int_0^t e^{(t-s)A}b_s(X,\alpha_s)\,ds\Big|^p\Big]^{1/p} + \mathbb E\Big[\sup_{t\in[0,T]}\Big|\int_0^t e^{(t-s)A}\sigma_s(X,\alpha_s)\,dW_s\Big|^p\Big]^{1/p} \\ &\quad + \mathbb E\Big[\sup_{t\in[0,T]}\Big|\int_0^t\!\!\int_{U\setminus\{0\}} e^{(t-s)A}\gamma_s(X,\alpha_s,z)\,\big(\pi(ds\,dz)-\lambda_\pi(dz)\,ds\big)\Big|^p\Big]^{1/p}. \end{aligned} \tag{3.3}$$

On the other hand, by the Burkholder-Davis-Gundy inequalities, we have
$$\mathbb E\Big[\sup_{t\in[0,T]}\Big|\int_0^t e^{(t-s)A}\sigma_s(X,\alpha_s)\,dW_s\Big|^p\Big]^{1/p} \le C\,\mathbb E\Big[\Big(\int_0^T \big|e^{(t-s)A}\sigma_s(X,\alpha_s)\big|^2_{L_2(\Xi;H)}\,ds\Big)^{p/2}\Big]^{1/p} \tag{3.4}$$
$$= C\,\Big\|\int_0^T \big|e^{(t-s)A}\sigma_s(X,\alpha_s)\big|^2_{L_2(\Xi;H)}\,ds\Big\|^{1/2}_{L^{p/2}(\Omega,\mathcal F,\mathbb P)} \le C\Big(\int_0^T \mathbb E\big[\big|e^{(t-s)A}\sigma_s(X,\alpha_s)\big|^p_{L_2(\Xi;H)}\big]^{2/p}\,ds\Big)^{1/2},$$
and
$$\begin{aligned} \mathbb E\Big[\sup_{t\in[0,T]}\Big|\int_0^t\!\!\int_{U\setminus\{0\}} e^{(t-s)A}\gamma_s(X,\alpha_s,z)\,\big(\pi(ds\,dz)-\lambda_\pi(dz)\,ds\big)\Big|^p\Big]^{1/p} &\le C\,\mathbb E\Big[\Big(\int_0^T \|\phi_s\|^2_{L^2(U,\lambda_\pi;H)}\,ds\Big)^{p/2}\Big]^{1/p} \\ &= C\,\Big\|\int_0^T \|\phi_s\|^2_{L^2(U,\lambda_\pi;H)}\,ds\Big\|^{1/2}_{L^{p/2}(\Omega,\mathcal F,\mathbb P)} \le C\Big(\int_0^T \mathbb E\big[\|\phi_s\|^p_{L^2(U,\lambda_\pi;H)}\big]^{2/p}\,ds\Big)^{1/2}, \end{aligned} \tag{3.5}$$
where we have set $\phi_s(z) = e^{(t-s)A}\gamma_s(X,\alpha_s,z)$ and $\|\phi_s\|_{L^2(U,\lambda_\pi;H)} = \big(\int_{U\setminus\{0\}}|\phi_s(z)|^2\,\lambda_\pi(dz)\big)^{1/2}$.

By (3.4), (3.5), together with assumption (A), we get
$$\mathbb E\Big[\sup_{t\in[0,T]}\Big|\int_0^t e^{(t-s)A}\sigma_s(X,\alpha_s)\,dW_s\Big|^p\Big]^{1/p} \le C\Big(\int_0^T \mathbb E\Big[\Big(1+\sup_{r\in[0,s]}|X_r|\Big)^p\Big]^{2/p}\,ds\Big)^{1/2} \le C\Big(1+\Big(\int_0^T \mathbb E\Big[\sup_{r\in[0,s]}|X_r|^p\Big]^{2/p}\,ds\Big)^{1/2}\Big) \tag{3.6}$$

and
$$\begin{aligned} \mathbb E\Big[\sup_{t\in[0,T]}\Big|\int_0^t\!\!\int_{U\setminus\{0\}} e^{(t-s)A}\gamma_s(X,\alpha_s,z)\,\big(\pi(ds\,dz)-\lambda_\pi(dz)\,ds\big)\Big|^p\Big]^{1/p} &\le C\Big(\int_0^T \mathbb E\Big[\Big(1+\sup_{r\in[0,s]}|X_r|\Big)^p\Big(\int_{U\setminus\{0\}}|\rho(z)|^2\,\lambda_\pi(dz)\Big)^{p/2}\Big]^{2/p}\,ds\Big)^{1/2} \\ &\le C\Big(\int_0^T \mathbb E\Big[\Big(1+\sup_{r\in[0,s]}|X_r|\Big)^p\Big]^{2/p}\,ds\Big)^{1/2} \le C\Big(1+\Big(\int_0^T \mathbb E\Big[\sup_{r\in[0,s]}|X_r|^p\Big]^{2/p}\,ds\Big)^{1/2}\Big). \end{aligned} \tag{3.7}$$

Moreover, using again assumption (A),
$$\mathbb E\Big[\sup_{t\in[0,T]}\Big|\int_0^t e^{(t-s)A}b_s(X,\alpha_s)\,ds\Big|^p\Big]^{1/p} \le \int_0^T \mathbb E\big[|e^{(t-s)A}b_s(X,\alpha_s)|^p\big]^{1/p}\,ds \le C\int_0^T \mathbb E\Big[\Big(1+\sup_{r\in[0,s]}|X_r|\Big)^p\Big]^{1/p}\,ds \le C\Big(1+\int_0^T \mathbb E\Big[\sup_{r\in[0,s]}|X_r|^p\Big]^{1/p}\,ds\Big). \tag{3.8}$$

Therefore, plugging (3.6), (3.7) and (3.8) into (3.3), we get
$$\mathbb E\Big[\sup_{t\in[0,T]}|X_t|^p\Big]^{1/p} \le C\,\mathbb E\big[|x_0|^p\big]^{1/p} + C\Big(1+\int_0^T \mathbb E\Big[\sup_{r\in[0,s]}|X_r|^p\Big]^{1/p}\,ds\Big) + C\Big(\int_0^T \mathbb E\Big[\sup_{r\in[0,s]}|X_r|^p\Big]^{2/p}\,ds\Big)^{1/2}.$$

Taking the square of both sides and using the Cauchy-Schwarz inequality, we find (setting $\psi_s = \mathbb E[\sup_{r\in[0,s]}|X_r|^p]^{2/p}$)
$$\psi_T \le C\,\mathbb E\big[|x_0|^p\big]^{2/p} + C\Big(1+\int_0^T\psi_s\,ds\Big),$$
and we conclude by the Gronwall inequality.
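For the reader's convenience, the version of Gronwall's inequality used in the last step can be stated as follows (a standard integral form; the formulation is ours):

```latex
% Gronwall's inequality, integral form (standard statement).
% Applied above with $u = \psi$ and $c = C\bigl(1 + \mathbb{E}[|x_0|^p]^{2/p}\bigr)$,
% noting that the same inequality holds with $T$ replaced by any $t \in [0,T]$.
If $u \colon [0,T] \to [0,\infty)$ is bounded and measurable and
\[
  u(t) \;\le\; c + C \int_0^t u(s)\,ds, \qquad t \in [0,T],
\]
for some constants $c, C \ge 0$, then $u(t) \le c\, e^{Ct}$ for all $t \in [0,T]$,
which yields estimate (3.2).
```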

The controller aims at maximizing over all $\alpha\in\mathcal A$ the gain functional
$$J(\alpha) = \mathbb E\Big[\int_0^T f_t(X^{x_0,\alpha},\alpha_t)\,dt + g(X^{x_0,\alpha})\Big].$$
By assumption (2.1) and estimate (3.2), we notice that $J(\alpha)$ is always finite. Finally, the value of the stochastic control problem is given by
$$V_0 = \sup_{\alpha\in\mathcal A} J(\alpha).$$

3.2 Formulation of the control problem in the randomized setting

We formulate the stochastic optimal control problem in a new probabilistic setting that we now introduce, to which we refer as the randomized probabilistic setting. Such a setting will be used for the rest of the paper and, in particular, in Section 4 for the formulation of the randomized stochastic optimal control problem.

We consider a new complete probability space $(\hat\Omega,\hat{\mathcal F},\hat{\mathbb P})$ on which are defined a random variable $\hat x_0\colon\hat\Omega\to H$, a cylindrical Brownian motion $\hat W=(\hat W_t)_{t\ge0}$ with values in $\Xi$, a Poisson random measure $\hat\pi(dt\,dz)$ on $[0,\infty)\times U$ with compensator $\lambda_\pi(dz)\,dt$ (with $\lambda_\pi$ as in Section 2), and also a Poisson random measure $\hat\theta(dt\,da)$ on $[0,\infty)\times\Lambda$ with compensator $\lambda_0(da)\,dt$. We assume that $\hat x_0$, $\hat W$, $\hat\pi$, $\hat\theta$ are independent. We denote by $\mu_0$ the law of $\hat x_0$ (with $\mu_0$ as in Section 2). We also denote by $\hat{\mathbb F}^{\hat x_0,\hat W,\hat\pi,\hat\theta}=(\hat{\mathcal F}^{\hat x_0,\hat W,\hat\pi,\hat\theta}_t)_{t\ge0}$ (resp. $\hat{\mathbb F}^{\hat\theta}=(\hat{\mathcal F}^{\hat\theta}_t)_{t\ge0}$) the $\hat{\mathbb P}$-completion of the filtration generated by $\hat x_0$, $\hat W$, $\hat\pi$, $\hat\theta$ (resp. $\hat\theta$), which satisfies the usual conditions. Moreover, we define $\mathcal P(\hat{\mathbb F}^{\hat x_0,\hat W,\hat\pi,\hat\theta})$ as the predictable $\sigma$-algebra on $[0,T]\times\hat\Omega$ associated with $\hat{\mathbb F}^{\hat x_0,\hat W,\hat\pi,\hat\theta}$. Finally, we denote by $\hat{\mathcal A}$ the family of all admissible control processes, that is the set of all $\mathcal P(\hat{\mathbb F}^{\hat x_0,\hat W,\hat\pi,\hat\theta})$-measurable maps $\hat\alpha\colon[0,T]\times\hat\Omega\to\Lambda$.

We impose the following additional assumptions.

(AR)

(i) $\lambda_0$ is a finite positive measure on $\mathcal B(\Lambda)$, the Borel subsets of $\Lambda$, with full topological support.

(ii) $a_0$ is a fixed point in $\Lambda$.

Similarly to Proposition 3.2, for every admissible control $\hat\alpha\in\hat{\mathcal A}$, we can prove the following result.

Proposition 3.3. Under assumptions (A)-(AR), for every $\hat\alpha\in\hat{\mathcal A}$, there exists a unique mild solution $\hat X^{\hat x_0,\hat\alpha}=(\hat X^{\hat x_0,\hat\alpha}_t)_{t\in[0,T]}$ to equation (3.1) with $x_0$, $W$, $\pi$, $\alpha$ replaced respectively by $\hat x_0$, $\hat W$, $\hat\pi$, $\hat\alpha$. Moreover, for every $1\le p\le p_0$,
$$\hat{\mathbb E}\Big[\sup_{t\in[0,T]}|\hat X^{\hat x_0,\hat\alpha}_t|^p\Big] \le C_p\,\big(1+\hat{\mathbb E}[|\hat x_0|^p]\big),$$
with the same constant $C_p$ as in Proposition 3.2, where $\hat{\mathbb E}$ denotes the expectation under $\hat{\mathbb P}$.

In the present randomized probabilistic setting the formulation of the control problem reads as follows: the controller aims at maximizing over all $\hat\alpha\in\hat{\mathcal A}$ the gain functional
$$\hat J(\hat\alpha) = \hat{\mathbb E}\Big[\int_0^T f_t(\hat X^{\hat x_0,\hat\alpha},\hat\alpha_t)\,dt + g(\hat X^{\hat x_0,\hat\alpha})\Big]. \tag{3.9}$$
The corresponding value is defined as
$$\hat V_0 = \sup_{\hat\alpha\in\hat{\mathcal A}}\hat J(\hat\alpha). \tag{3.10}$$

Proposition 3.4. Under assumptions (A)-(AR), the following equality holds:
$$V_0 = \hat V_0.$$

Proof. The proof is organized as follows:

1) firstly, we introduce a new probabilistic setting in product form, on which we formulate the control problem (3.10), and denote by $\bar V_0$ the corresponding value; then, we show that $\hat V_0 = \bar V_0$;

2) secondly, we prove that $V_0 = \bar V_0$.

Step 1. Let $(\Omega',\mathcal F',\mathbb P')$ be another complete probability space on which a Poisson random measure $\theta$ on $[0,\infty)\times\Lambda$, with intensity $\lambda_0(da)\,dt$, is defined. Denote by $\bar\Omega = \Omega\times\Omega'$, by $\bar{\mathcal F}$ the completion of $\mathcal F\otimes\mathcal F'$ with respect to $\mathbb P\otimes\mathbb P'$, and by $\bar{\mathbb P}$ the extension of $\mathbb P\otimes\mathbb P'$ to $\bar{\mathcal F}$. Notice that $x_0$, $W$, $\pi$, which are defined on $\Omega$, as well as $\theta$, which is defined on $\Omega'$, admit obvious extensions to $\bar\Omega$. We denote these extensions by $\bar x_0$, $\bar W$, $\bar\pi$, $\bar\theta$. Let $\bar{\mathbb F}^{\bar x_0,\bar W,\bar\pi}=(\bar{\mathcal F}^{\bar x_0,\bar W,\bar\pi}_t)_{t\ge0}$ (resp. $\bar{\mathbb F}^{\bar x_0,\bar W,\bar\pi,\bar\theta}=(\bar{\mathcal F}^{\bar x_0,\bar W,\bar\pi,\bar\theta}_t)_{t\ge0}$) be the $\bar{\mathbb P}$-completion of the filtration generated by $\bar x_0$, $\bar W$, $\bar\pi$ (resp. $\bar x_0$, $\bar W$, $\bar\pi$, $\bar\theta$). Finally, let $\bar{\mathcal A}$ (resp. $\bar{\mathcal A}^{\bar\theta}$) be the set of $\Lambda$-valued $\bar{\mathbb F}^{\bar x_0,\bar W,\bar\pi}$-predictable (resp. $\bar{\mathbb F}^{\bar x_0,\bar W,\bar\pi,\bar\theta}$-predictable) stochastic processes. Notice that $\bar{\mathcal A}\subset\bar{\mathcal A}^{\bar\theta}$.

For any $\bar\alpha\in\bar{\mathcal A}^{\bar\theta}$ define (with $\bar{\mathbb E}$ denoting the expectation under $\bar{\mathbb P}$)
$$\bar J(\bar\alpha) = \bar{\mathbb E}\Big[\int_0^T f_t(\bar X^{\bar x_0,\bar\alpha},\bar\alpha_t)\,dt + g(\bar X^{\bar x_0,\bar\alpha})\Big],$$
where $\bar X^{\bar x_0,\bar\alpha}=(\bar X^{\bar x_0,\bar\alpha}_t)_{t\ge0}$ denotes the stochastic process on $\bar\Omega$, mild solution to equation (3.1), with $\alpha$, $x_0$, $W$, $\pi$ replaced respectively by $\bar\alpha$, $\bar x_0$, $\bar W$, $\bar\pi$. We define the value
$$\bar V_0 = \sup_{\bar\alpha\in\bar{\mathcal A}^{\bar\theta}}\bar J(\bar\alpha).$$

Finally, we notice that $\hat V_0 = \bar V_0$. As a matter of fact, the only difference between the control problems with values $\hat V_0$ and $\bar V_0$ is that they are formulated in two different probabilistic settings. Given any $\hat\alpha\in\hat{\mathcal A}$, it is easy to see (by a monotone class argument) that there exists $\bar\alpha\in\bar{\mathcal A}^{\bar\theta}$ such that $(\hat\alpha,\hat x_0,\hat W,\hat\pi,\hat\theta)$ has the same law as $(\bar\alpha,\bar x_0,\bar W,\bar\pi,\bar\theta)$, so that $\hat J(\hat\alpha)=\bar J(\bar\alpha)$, which implies $\hat V_0\le\bar V_0$. In an analogous way we get the other inequality $\hat V_0\ge\bar V_0$, from which we deduce that $\hat V_0=\bar V_0$.

Step 2. Let us prove that $V_0=\bar V_0$. We begin by noting that, given any $\alpha\in\mathcal A$ and denoting by $\bar\alpha$ the canonical extension of $\alpha$ to $\bar\Omega$, we have $\bar\alpha\in\bar{\mathcal A}$; moreover, $(\alpha,x_0,W,\pi)$ has the same law as $(\bar\alpha,\bar x_0,\bar W,\bar\pi)$, so that $J(\alpha)=\bar J(\bar\alpha)$. Since $\bar\alpha\in\bar{\mathcal A}$ and $\bar{\mathcal A}\subset\bar{\mathcal A}^{\bar\theta}$, $\bar\alpha$ belongs to $\bar{\mathcal A}^{\bar\theta}$, hence $J(\alpha)=\bar J(\bar\alpha)\le\bar V_0$. Taking the supremum over $\alpha\in\mathcal A$, we conclude that $V_0\le\bar V_0$.

It remains to prove the other inequality $V_0\ge\bar V_0$. To this end, we begin by denoting $\bar{\mathbb F}^{\bar\theta}=(\bar{\mathcal F}^{\bar\theta}_t)_{t\ge0}$ the $\bar{\mathbb P}$-completion of the filtration generated by $\bar\theta$. Notice that $\bar{\mathcal F}^{\bar x_0,\bar W,\bar\pi,\bar\theta}_t = \bar{\mathcal F}^{\bar x_0,\bar W,\bar\pi}_t\vee\bar{\mathcal F}^{\bar\theta}_t$, for every $t\ge0$. Now, fix $\bar\alpha\in\bar{\mathcal A}^{\bar\theta}$ and observe that, for every $\omega'\in\Omega'$, the stochastic process $\alpha^{\omega'}\colon\Omega\times[0,T]\to\Lambda$, defined by
$$\alpha^{\omega'}_t(\omega) = \bar\alpha_t(\omega,\omega'), \qquad \text{for all } (\omega,\omega')\in\bar\Omega=\Omega\times\Omega',\ t\ge0,$$
is $\mathbb F^{x_0,W,\pi}$-progressively measurable, since $\bar\alpha$ is $\bar{\mathbb F}^{\bar x_0,\bar W,\bar\pi,\bar\theta}$-predictable and so, in particular, $\bar{\mathbb F}^{\bar x_0,\bar W,\bar\pi,\bar\theta}$-progressively measurable. It is well-known (see for instance Theorem 3.7 in [4]) that, for every $\omega'\in\Omega'$, there exists an $\mathbb F^{x_0,W,\pi}$-predictable process $\hat\alpha^{\omega'}\colon\Omega\times[0,T]\to\Lambda$ such that $\alpha^{\omega'}=\hat\alpha^{\omega'}$, $d\mathbb P\otimes dt$-a.e.

Now, recall that $\bar X^{\bar x_0,\bar\alpha}=(\bar X^{\bar x_0,\bar\alpha}_t)_{t\ge0}$ denotes the mild solution to equation (3.1) on $\bar\Omega$, with $\alpha$, $x_0$, $W$, $\pi$ replaced respectively by $\bar\alpha$, $\bar x_0$, $\bar W$, $\bar\pi$. Similarly, for every fixed $\omega'\in\Omega'$, let $X^{x_0,\hat\alpha^{\omega'}}=(X^{x_0,\hat\alpha^{\omega'}}_t)_{t\ge0}$ denote the mild solution to equation (3.1) on $\Omega$, with $\alpha$ replaced by $\hat\alpha^{\omega'}$. It is easy to see that there exists a $\mathbb P'$-null set $N'\subset\Omega'$ such that, for every $\omega'\notin N'$, the stochastic processes $\bar X^{\bar x_0,\bar\alpha}(\cdot,\omega')$ and $X^{x_0,\hat\alpha^{\omega'}}(\cdot)$ solve the same equation on $\Omega$. Therefore, by pathwise uniqueness, for every $\omega'\notin N'$ the processes $\bar X^{\bar x_0,\bar\alpha}(\cdot,\omega')$ and $X^{x_0,\hat\alpha^{\omega'}}(\cdot)$ are $\mathbb P$-indistinguishable. Then, by Fubini's theorem we obtain
$$\bar J(\bar\alpha) = \int_{\Omega'}\mathbb E\Big[\int_0^T f_t\big(X^{x_0,\hat\alpha^{\omega'}},\hat\alpha^{\omega'}_t\big)\,dt + g\big(X^{x_0,\hat\alpha^{\omega'}}\big)\Big]\,\mathbb P'(d\omega') = \mathbb E'\big[J(\hat\alpha^{\omega'})\big] \le V_0.$$

The claim follows taking the supremum over all $\bar\alpha\in\bar{\mathcal A}^{\bar\theta}$.

We end this section by stating a result slightly stronger than Proposition 3.4. More precisely, we fix a $\sigma$-algebra $\hat{\mathcal G}$ independent of $(\hat x_0,\hat W,\hat\pi)$ and such that $\hat{\mathcal F}^{\hat\theta}_\infty\subset\hat{\mathcal G}$. We denote by $\hat{\mathbb F}^{\hat x_0,\hat W,\hat\pi,\hat{\mathcal G}}=(\hat{\mathcal F}^{\hat x_0,\hat W,\hat\pi,\hat{\mathcal G}}_t)_{t\ge0}$ the $\hat{\mathbb P}$-completion of the filtration generated by $\hat x_0$, $\hat W$, $\hat\pi$, $\hat{\mathcal G}$, satisfying $\hat{\mathcal G}\subset\hat{\mathcal F}^{\hat x_0,\hat W,\hat\pi,\hat{\mathcal G}}_0$. Then, we define $\hat{\mathcal A}^{\hat{\mathcal G}}$ as the family of all $\mathcal P(\hat{\mathbb F}^{\hat x_0,\hat W,\hat\pi,\hat{\mathcal G}})$-measurable maps $\hat\alpha\colon[0,T]\times\hat\Omega\to\Lambda$.

Proposition 3.5. Under assumptions (A)-(AR), the following equality holds:
$$V_0 = \sup_{\hat\alpha\in\hat{\mathcal A}^{\hat{\mathcal G}}}\hat J(\hat\alpha).$$

Proof. We begin by observing that there exist a measurable space $(M,\mathcal M)$ and a random variable $\hat\Gamma\colon(\hat\Omega,\hat{\mathcal F})\to(M,\mathcal M)$ such that $\hat{\mathcal G}=\sigma(\hat\Gamma)$ (for instance, take $(M,\mathcal M)=(\hat\Omega,\hat{\mathcal G})$ and $\hat\Gamma$ the identity map). Then, the proof proceeds along the same lines as the proof of Proposition 3.4, simply noting that the role played there by $\hat\theta$ is now played by $\hat\Gamma$.

4 Formulation of the randomized control problem

We now formulate the randomized stochastic optimal control problem in the probabilistic setting introduced in subsection 3.2. Our aim is then to prove that the value of such a control problem coincides with $V_0$ or, equivalently (by Proposition 3.4), with $\hat V_0$. Here we simply observe that the randomized problem may depend on $\lambda_0$ and $a_0$, but its value will be independent of these two objects, as it will coincide with the value $V_0$ of the original stochastic control problem (which is independent of $\lambda_0$ and $a_0$).

We begin by introducing some additional notation. Firstly, we notice that there exists a double sequence $(\hat T_n,\hat\eta_n)_{n\ge1}$ of $(0,\infty)\times\Lambda$-valued pairs of random variables, with $(\hat T_n)_{n\ge1}$ strictly increasing, such that the random measure $\hat\theta$ can be represented as $\hat\theta(dt\,da)=\sum_{n\ge1}\delta_{(\hat T_n,\hat\eta_n)}(dt\,da)$. Moreover, for every Borel set $B\in\mathcal B(\Lambda)$, the stochastic process $(\hat\theta((0,t]\times B)-t\,\lambda_0(B))_{t\ge0}$ is a martingale under $\hat{\mathbb P}$. Now, we introduce the pure jump stochastic process taking values in $\Lambda$ defined as
$$\hat I_t = \sum_{n\ge0}\hat\eta_n\,1_{[\hat T_n,\hat T_{n+1})}(t), \qquad \text{for all } t\ge0, \tag{4.1}$$
where we set $\hat T_0:=0$ and $\hat\eta_0:=a_0$ (notice that, when $\Lambda$ is a subset of a vector space, we can write (4.1) simply as $\hat I_t = a_0 + \int_0^t\!\int_\Lambda(a-\hat I_{s-})\,\hat\theta(ds\,da)$).
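The pure jump process of (4.1) is straightforward to sample: on $[0,T]$, the number of atoms of $\hat\theta$ is Poisson with mean $\lambda_0(\Lambda)\,T$, the jump times are ordered uniforms, and the marks are i.i.d. with law $\lambda_0/\lambda_0(\Lambda)$. The following sketch makes the illustrative choices (ours, not the paper's) $\Lambda=[0,1]$, $\lambda_0 = \mathrm{rate}\cdot\mathrm{Uniform}([0,1])$ and $a_0=0.5$:

```python
import numpy as np

# Sample the randomizing pure jump process I of (4.1) on [0, T].
# Illustrative setting (ours): Lambda = [0,1], lambda0 = rate * Uniform([0,1]),
# a0 = 0.5.
def simulate_I(T=1.0, rate=3.0, a0=0.5, rng=None):
    """Return the jump times (T_n), the marks (eta_n), and a function t -> I_t."""
    rng = rng or np.random.default_rng(0)
    n = rng.poisson(rate * T)               # number of atoms of theta on [0, T]
    T_n = np.sort(rng.uniform(0.0, T, n))   # jump times, a.s. strictly increasing
    eta = rng.uniform(0.0, 1.0, n)          # marks, law lambda0 / lambda0(Lambda)
    times = np.concatenate(([0.0], T_n))    # T_0 = 0
    marks = np.concatenate(([a0], eta))     # eta_0 = a0
    def I(t):
        # I_t = eta_n on [T_n, T_{n+1}): pick the last jump time <= t
        k = np.searchsorted(times, t, side="right") - 1
        return marks[k]
    return times, marks, I
```

In particular, before the first atom of $\hat\theta$ the trajectory stays at the fixed point $a_0$, which is exactly the role of assumption (AR)(ii).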

We use $\hat I$ to randomize the control in equation (3.1), which then becomes:
$$d\hat X_t = A\hat X_t\,dt + b_t(\hat X,\hat I_t)\,dt + \sigma_t(\hat X,\hat I_t)\,d\hat W_t + \int_{U\setminus\{0\}}\gamma_t(\hat X,\hat I_{t-},z)\,\big(\hat\pi(dt\,dz)-\lambda_\pi(dz)\,dt\big), \qquad \hat X_0 = \hat x_0. \tag{4.2}$$

As for equation (3.1), we look for a mild solution to (4.2), namely an $H$-valued càdlàg $\hat{\mathbb F}^{\hat x_0,\hat W,\hat\pi,\hat\theta}$-adapted stochastic process $\hat X=(\hat X_t)_{t\in[0,T]}$ such that, $\hat{\mathbb P}$-a.s.,
$$\hat X_t = e^{tA}\hat x_0 + \int_0^t e^{(t-s)A}b_s(\hat X,\hat I_s)\,ds + \int_0^t e^{(t-s)A}\sigma_s(\hat X,\hat I_s)\,d\hat W_s + \int_0^t\!\!\int_{U\setminus\{0\}} e^{(t-s)A}\gamma_s(\hat X,\hat I_{s-},z)\,\big(\hat\pi(ds\,dz)-\lambda_\pi(dz)\,ds\big), \quad \text{for all } 0\le t\le T. \tag{4.3}$$

Under assumptions (A)-(AR), proceeding as in Proposition 3.2, we can prove the following result.

Proposition 4.1. Under assumptions (A)-(AR), there exists a unique mild solution $\hat X=(\hat X_t)_{t\in[0,T]}$ to equation (4.2), such that, for every $1\le p\le p_0$,
$$\hat{\mathbb E}\Big[\sup_{t\in[0,T]}|\hat X_t|^p\Big] \le C_p\,\big(1+\hat{\mathbb E}[|\hat x_0|^p]\big), \tag{4.4}$$
with the same constant $C_p$ as in Proposition 3.2. In addition, for every $t\in[0,T]$ and any $1\le p\le p_0$, we have
$$\hat{\mathbb E}\Big[\sup_{s\in[t,T]}|\hat X_s|^p\,\Big|\,\hat{\mathcal F}^{\hat x_0,\hat W,\hat\pi,\hat\theta}_t\Big] \le C_p\Big(1+\sup_{s\in[0,t]}|\hat X_s|^p\Big), \qquad \hat{\mathbb P}\text{-a.s.}, \tag{4.5}$$
with the same constant $C_p$ as in Proposition 3.2.

Proof. Concerning estimate (4.4), the proof proceeds along the same lines as the proof of Proposition 3.2. Regarding estimate (4.5), we begin by noting that, given any two integrable $\hat{\mathcal F}^{\hat x_0,\hat W,\hat\pi,\hat\theta}_t$-measurable random variables $\eta$ and $\xi$, the following property holds: $\eta\le\xi$, $\hat{\mathbb P}$-a.s., if and only if $\hat{\mathbb E}[\eta\,1_E]\le\hat{\mathbb E}[\xi\,1_E]$ for every $E\in\hat{\mathcal F}^{\hat x_0,\hat W,\hat\pi,\hat\theta}_t$. So, in particular, estimate (4.5) holds if and only if the following estimate holds:
$$\hat{\mathbb E}\Big[\sup_{s\in[t,T]}|\hat X_s|^p\,1_E\Big] \le C_p\Big(\hat{\mathbb E}[1_E] + \hat{\mathbb E}\Big[\sup_{s\in[0,t]}|\hat X_s|^p\,1_E\Big]\Big), \qquad \text{for every } E\in\hat{\mathcal F}^{\hat x_0,\hat W,\hat\pi,\hat\theta}_t. \tag{4.6}$$
The proof of estimate (4.6) can be done proceeding along the same lines as in the proof of Proposition 3.2, after first multiplying equation (4.3) by $1_E$.

We can now formulate the randomized control problem. The family of all admissible control maps, denoted by $\hat{\mathcal V}$, is the set of all $\mathcal P(\hat{\mathbb F}^{\hat x_0,\hat W,\hat\pi,\hat\theta})\otimes\mathcal B(\Lambda)$-measurable functions $\hat\nu\colon[0,T]\times\hat\Omega\times\Lambda\to(0,\infty)$ which are bounded from above and bounded away from zero, namely $0<\inf_{[0,T]\times\hat\Omega\times\Lambda}\hat\nu\le\sup_{[0,T]\times\hat\Omega\times\Lambda}\hat\nu<+\infty$. Given $\hat\nu\in\hat{\mathcal V}$, we consider the probability measure $\hat{\mathbb P}^{\hat\nu}$ on $(\hat\Omega,\hat{\mathcal F}^{\hat x_0,\hat W,\hat\pi,\hat\theta}_T)$ given by $d\hat{\mathbb P}^{\hat\nu}=\hat\kappa^{\hat\nu}_T\,d\hat{\mathbb P}$, where $(\hat\kappa^{\hat\nu}_t)_{t\in[0,T]}$ denotes the Doléans-Dade exponential
$$\hat\kappa^{\hat\nu}_t = \mathcal E_t\Big(\int_0^\cdot\!\!\int_\Lambda\big(\hat\nu_s(a)-1\big)\,\big(\hat\theta(ds\,da)-\lambda_0(da)\,ds\big)\Big). \tag{4.7}$$

By Girsanov's theorem (see e.g. Theorem 15.2.6 in [5]), under $\hat{\mathbb P}^{\hat\nu}$ the $\hat{\mathbb F}^{\hat x_0,\hat W,\hat\pi,\hat\theta}$-compensator of $\hat\theta$ on $[0,T]\times\Lambda$ is $\hat\nu_s(a)\,\lambda_0(da)\,ds$.

Notice that, under $\hat{\mathbb P}^{\hat\nu}$, $\hat W$ remains a Brownian motion and the $\hat{\mathbb F}^{\hat x_0,\hat W,\hat\pi,\hat\theta}$-compensator of $\hat\pi$ on $[0,T]\times U$ is $\lambda_\pi(dz)\,ds$ (see e.g. Theorem 15.3.10 in [5] or Theorem 12.31 in [14]). As a consequence, the following generalization of estimate (4.4) holds: for every $1\le p\le p_0$,
$$\sup_{\hat\nu\in\hat{\mathcal V}}\hat{\mathbb E}^{\hat\nu}\Big[\sup_{t\in[0,T]}|\hat X_t|^p\Big] \le C_p\,\big(1+\hat{\mathbb E}^{\hat\nu}[|\hat x_0|^p]\big), \tag{4.8}$$
with the same constant $C_p$ as in (4.4), where $\hat{\mathbb E}^{\hat\nu}$ denotes the expectation with respect to $\hat{\mathbb P}^{\hat\nu}$.
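For a marked point process such as $\hat\theta$, the Doléans-Dade exponential in (4.7) has the explicit product form $\hat\kappa^{\hat\nu}_t = \exp\big(-\int_0^t\!\int_\Lambda(\hat\nu_s(a)-1)\,\lambda_0(da)\,ds\big)\,\prod_{n:\hat T_n\le t}\hat\nu_{\hat T_n}(\hat\eta_n)$. The following numerical sketch evaluates this product form in an illustrative setting of our own choosing ($\Lambda=[0,1]$, $\lambda_0 = \mathrm{rate}\cdot\mathrm{Uniform}([0,1])$, deterministic kernel $\nu$):

```python
import numpy as np

# Girsanov density for a marked point process: product form of the
# Doleans-Dade exponential kappa_t of (4.7).  Illustrative setting (ours):
# Lambda = [0, 1], lambda0 = rate * Uniform([0, 1]), nu deterministic.
def kappa(t, jump_times, marks, nu, rate=3.0, n_quad=400):
    """kappa_t = exp(-int_0^t int_Lambda (nu_s(a) - 1) lambda0(da) ds)
                 * prod_{T_n <= t} nu(T_n, eta_n)."""
    # continuous part: grid average approximating the compensator integral
    s = np.linspace(0.0, t, n_quad)
    a = np.linspace(0.0, 1.0, n_quad)
    S, A = np.meshgrid(s, a, indexing="ij")
    integral = rate * t * np.mean(nu(S, A) - 1.0)
    # jump part: product of nu over the atoms of theta up to time t
    prod = 1.0
    for Tn, eta in zip(jump_times, marks):
        if Tn <= t:
            prod *= float(nu(Tn, eta))
    return np.exp(-integral) * prod
```

A quick sanity check of the formula: for $\nu\equiv1$ there is no change of measure and $\hat\kappa^{\hat\nu}_t\equiv1$, while for a constant kernel $\nu\equiv c$ one gets $\hat\kappa^{\hat\nu}_t = e^{-(c-1)\,\mathrm{rate}\cdot t}\,c^{N_t}$, with $N_t$ the number of jumps up to $t$.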

The controller aims at maximizing over all $\hat\nu\in\hat{\mathcal V}$ the gain functional
$$\hat J^R(\hat\nu) = \hat{\mathbb E}^{\hat\nu}\Big[\int_0^T f_t(\hat X,\hat I_t)\,dt + g(\hat X)\Big].$$
By assumption (2.1) and estimate (4.8), it follows that $\hat J^R(\hat\nu)$ is always finite. Finally, the value of the randomized control problem is given by
$$\hat V^R_0 = \sup_{\hat\nu\in\hat{\mathcal V}}\hat J^R(\hat\nu).$$

In the sequel, we briefly denote the probabilistic setting adopted for the randomized control problem by the tuple $(\hat\Omega,\hat{\mathcal F},\hat{\mathbb P};\hat x_0,\hat W,\hat\pi,\hat\theta;\hat I,\hat X;\hat{\mathcal V})$.

Our aim is now to prove that $\hat V^R_0$ coincides with the value $V_0$ of the original control problem. To this end, we proceed through the following preliminary results:

1) the first result (Lemma 4.2) shows that the value $\hat V^R_0$ of the randomized control problem is independent of the probabilistic setting in which the problem is formulated;

2) in Lemma 4.3 we prove that there exists a probabilistic setting for the randomized control problem where $\hat J^R$ can be expressed in terms of the gain functional $\hat J$ in (3.9); as noticed in Remark 4.5, this result allows us to formulate the randomized control problem in "strong" form, rather than as a supremum over a family of probability measures;

3) finally, in Lemma 4.6 we prove, roughly speaking, that given any $\alpha\in\mathcal A$ and $\varepsilon>0$ there exist a probabilistic setting for the randomized control problem and a suitable $\hat\nu$ such that the "distance" under $\hat{\mathbb P}^{\hat\nu}$ between the pure jump process $\hat I$ and $\alpha$ is less than $\varepsilon$. In order to do this, we need to introduce the following distance on $\hat{\mathcal A}$ (see Definition 3.2.3 in [16]), for every fixed $\hat\nu\in\hat{\mathcal V}$:
$$\hat d^{\hat\nu}_{Kr}(\hat\alpha,\hat\beta) := \hat{\mathbb E}^{\hat\nu}\Big[\int_0^T d_\Lambda(\hat\alpha_t,\hat\beta_t)\,dt\Big], \qquad \text{for all } \hat\alpha,\hat\beta\in\hat{\mathcal A}.$$

Lemma 4.2. Suppose that assumptions (A)-(AR) hold. Consider a new probabilistic setting for the randomized control problem, characterized by the tuple $(\bar\Omega, \bar{\mathcal F}, \bar{\mathbb P}; \bar x_0, \bar W, \bar\pi, \bar\theta; \bar I, \bar X; \bar{\mathcal V})$. Then
$$\hat V^R_0 \;=\; \bar V^R_0.$$

Proof. The proof can be done proceeding along the same lines as in the proof of Proposition 3.1 in [1]. Here we just recall the main steps. Firstly, we take $\hat\nu \in \hat{\mathcal V}$ which admits an explicit functional dependence on $(\hat x_0, \hat W, \hat\pi, \hat\theta)$. For such a $\hat\nu$ it is easy to find $\bar\nu \in \bar{\mathcal V}$ such that $(\hat\nu, \hat x_0, \hat W, \hat\pi, \hat\theta)$ has the same law as $(\bar\nu, \bar x_0, \bar W, \bar\pi, \bar\theta)$ (simply replacing $\hat x_0, \hat W, \hat\pi, \hat\theta$ by $\bar x_0, \bar W, \bar\pi, \bar\theta$ in the expression of $\hat\nu$). So, in particular, $\hat J^R(\hat\nu) = \bar J^R(\bar\nu)$. By a monotone class argument, we deduce that the same equality holds true for every $\hat\nu \in \hat{\mathcal V}$, which implies $\hat V^R_0 \le \bar V^R_0$. Interchanging the roles of $(\hat\Omega, \hat{\mathcal F}, \hat{\mathbb P}; \hat x_0, \hat W, \hat\pi, \hat\theta; \hat I, \hat X; \hat{\mathcal V})$ and $(\bar\Omega, \bar{\mathcal F}, \bar{\mathbb P}; \bar x_0, \bar W, \bar\pi, \bar\theta; \bar I, \bar X; \bar{\mathcal V})$, we obtain the other inequality, from which the claim follows.

Lemma 4.3. Suppose that assumptions (A)-(AR) hold. Then, there exist a probabilistic setting for the randomized control problem $(\bar\Omega, \bar{\mathcal F}, \bar{\mathbb P}; \bar x_0, \bar W, \bar\pi, \bar\theta; \bar I, \bar X; \bar{\mathcal V})$ and a $\sigma$-algebra $\bar{\mathcal G} \subset \bar{\mathcal F}$, independent of $\bar x_0$, $\bar W$, $\bar\pi$, with $\bar{\mathcal F}^{\bar\theta}_\infty \subset \bar{\mathcal G}$, such that: given any $\bar\nu \in \bar{\mathcal V}$ there exists $\bar\alpha^{\bar\nu} \in \bar{\mathcal A}$ satisfying
$$\text{Law of } \big( \bar x_0, (\bar W_t)_{0\le t\le T}, \bar\pi|_{[0,T]\times U}, (\bar I_t)_{0\le t\le T} \big) \text{ under } \bar{\mathbb P}^{\bar\nu} \;=\; \text{Law of } \big( \bar x_0, (\bar W_t)_{0\le t\le T}, \bar\pi|_{[0,T]\times U}, \bar\alpha^{\bar\nu} \big) \text{ under } \bar{\mathbb P}. \tag{4.9}$$
So, in particular,
$$\bar J^R(\bar\nu) \;=\; \bar J(\bar\alpha^{\bar\nu}).$$

Remark 4.4. Recall that $\bar{\mathcal A}^{\bar{\mathcal G}}$ was defined just before Proposition 3.5, even though it was denoted $\hat{\mathcal A}^{\hat{\mathcal G}}$ there, since it was defined in the probabilistic setting $(\hat\Omega, \hat{\mathcal F}, \hat{\mathbb P}; \hat x_0, \hat W, \hat\pi, \hat\theta; \hat I, \hat X; \hat{\mathcal V})$ instead of $(\bar\Omega, \bar{\mathcal F}, \bar{\mathbb P}; \bar x_0, \bar W, \bar\pi, \bar\theta; \bar I, \bar X; \bar{\mathcal V})$.

Proof of Lemma 4.3. Let $(\Omega, \mathcal F, \mathbb P; x_0, W, \pi; X; \mathcal A)$ be the setting of the original stochastic control problem in Section 3.1.

Proceeding along the same lines as at the beginning of Section 4.1 in [1], we construct an atomless finite measure $\lambda_0'$ on $(\mathbb R, \mathcal B(\mathbb R))$ and a surjective Borel-measurable map $\pi \colon \mathbb R \to \Lambda$ with $\lambda_0 = \lambda_0' \circ \pi^{-1}$; moreover, on a probability space $(\Omega', \mathcal F', \mathbb P')$ we consider a Poisson random measure $\theta' = \sum_{n\ge1} \delta_{(T_n', \rho_n')}$ on $[0,\infty) \times \mathbb R$ with intensity measure $\lambda_0'(dr)\,dt$, where $(T_n', \rho_n')_{n\ge1}$ is the marked point process associated with $\theta'$. Then, $\theta = \sum_{n\ge1} \delta_{(T_n', \pi(\rho_n'))}$ is a Poisson random measure on $[0,\infty) \times \Lambda$ with intensity measure $\lambda_0(da)\,dt$.

Let $\bar\Omega = \Omega \times \Omega'$, let $\bar{\mathcal F}$ be the $\mathbb P \otimes \mathbb P'$-completion of $\mathcal F \otimes \mathcal F'$, and let $\bar{\mathbb P}$ be the extension of $\mathbb P \otimes \mathbb P'$ to $\bar{\mathcal F}$. Then, we consider the corresponding probabilistic setting for the randomized control problem $(\bar\Omega, \bar{\mathcal F}, \bar{\mathbb P}; \bar x_0, \bar W, \bar\pi, \bar\theta; \bar I, \bar X; \bar{\mathcal V})$, where $\bar x_0$, $\bar W$, $\bar\pi$, $\bar\theta$ denote the canonical extensions of $x_0$, $W$, $\pi$, $\theta$ to $\bar\Omega$. We also denote by $\bar\theta'$ the canonical extension of $\theta'$ to $\bar\Omega$. Let $\bar{\mathbb F}^{\bar\theta'} = (\bar{\mathcal F}^{\bar\theta'}_t)_{t\ge0}$ (resp. $\bar{\mathbb F}^{\bar\theta} = (\bar{\mathcal F}^{\bar\theta}_t)_{t\ge0}$) be the filtration generated by $\bar\theta'$ (resp. $\bar\theta$). We define $\bar{\mathcal G} := \bar{\mathcal F}^{\bar\theta'}_\infty$. Notice that $\bar{\mathcal F}^{\bar\theta}_\infty \subset \bar{\mathcal G}$ and that $\bar{\mathcal G}$ is independent of $\bar x_0$, $\bar W$, $\bar\pi$. Finally, we denote by $\bar{\mathbb F}^{\bar x_0, \bar W, \bar\pi, \bar{\mathcal G}} = (\bar{\mathcal F}^{\bar x_0, \bar W, \bar\pi, \bar{\mathcal G}}_t)_{t\ge0}$ the $\bar{\mathbb P}$-completion of the filtration generated by $\bar x_0$, $\bar W$, $\bar\pi$, $\bar{\mathcal G}$, which satisfies $\bar{\mathcal G} \subset \bar{\mathcal F}^{\bar x_0, \bar W, \bar\pi, \bar{\mathcal G}}_0$.

Now, fix $\bar\nu \in \bar{\mathcal V}$. By an abuse of notation, we still denote by $\mathcal F$ the canonical extension of the $\sigma$-algebra $\mathcal F$ to $\bar\Omega$. Then, we notice that in the probabilistic setting $(\bar\Omega, \bar{\mathcal F}, \bar{\mathbb P}; \bar x_0, \bar W, \bar\pi, \bar\theta; \bar I, \bar X; \bar{\mathcal V})$ just introduced, (4.9) follows if we prove the following: there exists $\bar\alpha^{\bar\nu} \in \bar{\mathcal A}$ satisfying
$$\text{Conditional law of } (\bar I_t)_{0\le t\le T} \text{ under } \bar{\mathbb P}^{\bar\nu} \text{ given } \mathcal F \;=\; \text{Conditional law of } \bar\alpha^{\bar\nu} \text{ under } \bar{\mathbb P} \text{ given } \mathcal F. \tag{4.10}$$

It only remains to prove (4.10). To this end, we recall that the process $\bar I$ is defined as
$$\bar I_t \;=\; \sum_{n\ge0} \bar\eta_n \,\mathbf 1_{[\bar T_n, \bar T_{n+1})}(t), \qquad \text{for all } t \ge 0,$$
where $(\bar T_0, \bar\eta_0) := (0, a_0)$, while $(\bar T_n, \bar\eta_n)$, $n \ge 1$, denotes the canonical extension of $(T_n', \pi(\rho_n'))$ to $\bar\Omega$. Then, (4.10) follows if we prove the following: there exists a sequence $(\bar T^{\bar\nu}_n, \bar\eta^{\bar\nu}_n)_{n\ge1}$ on $(\bar\Omega, \bar{\mathcal F}, \bar{\mathbb P})$ such that:

(i) $(\bar T^{\bar\nu}_n, \bar\eta^{\bar\nu}_n) \colon \bar\Omega \to (0,\infty) \times \Lambda$ and $\bar T^{\bar\nu}_n < \bar T^{\bar\nu}_{n+1}$;

(ii) $\bar T^{\bar\nu}_n$ is an $\bar{\mathbb F}^{\bar x_0, \bar W, \bar\pi, \bar{\mathcal G}}$-stopping time and $\bar\eta^{\bar\nu}_n$ is $\bar{\mathcal F}^{\bar x_0, \bar W, \bar\pi, \bar{\mathcal G}}_{\bar T^{\bar\nu}_n}$-measurable;

(iii) $\lim_{n\to\infty} \bar T^{\bar\nu}_n = \infty$;

(iv) the conditional law of the sequence $(\bar T_1, \bar\eta_1)\mathbf 1_{\{\bar T_1 \le T\}}, \ldots, (\bar T_n, \bar\eta_n)\mathbf 1_{\{\bar T_n \le T\}}, \ldots$ under $\bar{\mathbb P}^{\bar\nu}$ given $\mathcal F$ is equal to the conditional law of the sequence $(\bar T^{\bar\nu}_1, \bar\eta^{\bar\nu}_1)\mathbf 1_{\{\bar T^{\bar\nu}_1 \le T\}}, \ldots, (\bar T^{\bar\nu}_n, \bar\eta^{\bar\nu}_n)\mathbf 1_{\{\bar T^{\bar\nu}_n \le T\}}, \ldots$ under $\bar{\mathbb P}$ given $\mathcal F$.

As a matter of fact, if there exists $(\bar T^{\bar\nu}_n, \bar\eta^{\bar\nu}_n)_{n\ge1}$ satisfying (i)-(ii)-(iii)-(iv), then the process $\bar\alpha^{\bar\nu}$, defined as
$$\bar\alpha^{\bar\nu}_t \;:=\; \sum_{n\ge0} \bar\eta^{\bar\nu}_n \,\mathbf 1_{[\bar T^{\bar\nu}_n, \bar T^{\bar\nu}_{n+1})}(t), \qquad \text{for all } 0 \le t \le T, \text{ with } (\bar T^{\bar\nu}_0, \bar\eta^{\bar\nu}_0) := (0, a_0),$$
belongs to $\bar{\mathcal A}^{\bar{\mathcal G}}$ and (4.10) holds.

Finally, concerning the existence of a sequence $(\bar T^{\bar\nu}_n, \bar\eta^{\bar\nu}_n)_{n\ge1}$ satisfying (i)-(ii)-(iii)-(iv), we do not report the proof of this result, as it can be done proceeding along the same lines as in the proof of Lemma 4.3 in [1], the only difference being that the filtration $\mathbb F^W$ in [1] (notice that in [1] $W$ denotes a finite-dimensional Brownian motion) is now replaced by $\mathbb F^{x_0, W, \pi}$: this does not affect the proof of Lemma 4.3 in [1].

Remark 4.5. Let $(\bar\Omega, \bar{\mathcal F}, \bar{\mathbb P}; \bar x_0, \bar W, \bar\pi, \bar\theta; \bar I, \bar X; \bar{\mathcal V})$ and $\bar{\mathcal G}$ be respectively the probabilistic setting for the randomized control problem and the $\sigma$-algebra mentioned in Lemma 4.3. We denote by $\bar{\mathcal A}^{\bar{\mathcal V}}$ the family of all controls $\bar\alpha \in \bar{\mathcal A}^{\bar{\mathcal G}}$ for which there exists $\bar\nu \in \bar{\mathcal V}$ such that $\bar J(\bar\alpha) = \bar J^R(\bar\nu)$. Then, by definition, $\bar{\mathcal A}^{\bar{\mathcal V}} \subset \bar{\mathcal A}^{\bar{\mathcal G}}$. Moreover, by Lemma 4.3 we have the following "strong" formulation of the randomized control problem:
$$\bar V^R_0 \;=\; \sup_{\bar\alpha\in\bar{\mathcal A}^{\bar{\mathcal V}}} \bar J(\bar\alpha).$$

Lemma 4.6. Suppose that assumptions (A)-(AR) hold. For any $\alpha \in \mathcal A$ and $\varepsilon > 0$ there exist:

1) a probabilistic setting for the randomized control problem $(\bar\Omega, \bar{\mathcal F}, \bar{\mathbb P}^{\alpha,\varepsilon}; \bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,\varepsilon}; \bar I^{\alpha,\varepsilon}, \bar X^{\alpha,\varepsilon}; \bar{\mathcal V}^{\alpha,\varepsilon})$ (notice that $\bar\Omega$, $\bar{\mathcal F}$, $\bar x_0$, $\bar W$, $\bar\pi$ do not depend on $\alpha$, $\varepsilon$);

2) a probability measure $\bar{\mathbb Q}$ on $(\bar\Omega, \bar{\mathcal F})$ equivalent to $\bar{\mathbb P}^{\alpha,\varepsilon}$, which does not depend on $\alpha$, $\varepsilon$;

3) a stochastic process $\bar\alpha \colon [0,T] \times \bar\Omega \to \Lambda$, depending only on $\alpha$ but not on $\varepsilon$, which is predictable with respect to the $\bar{\mathbb P}^{\alpha,\varepsilon}$-completion (or, equivalently, the $\bar{\mathbb Q}$-completion) of the filtration generated by $\bar x_0$, $\bar W$, $\bar\pi$;

4) $\bar\nu^{\alpha,\varepsilon} \in \bar{\mathcal V}^{\alpha,\varepsilon}$,

such that, denoting by $\bar{\mathbb P}^{\bar\nu^{\alpha,\varepsilon}}$ the probability measure¹ on $(\bar\Omega, \bar{\mathcal F}^{\bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,\varepsilon}}_T)$ defined as $d\bar{\mathbb P}^{\bar\nu^{\alpha,\varepsilon}} = \bar\kappa^{\bar\nu^{\alpha,\varepsilon}}_T \, d\bar{\mathbb P}^{\alpha,\varepsilon}$, the following properties hold:

(i) the restriction of $\bar{\mathbb Q}$ to $\bar{\mathcal F}^{\bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,\varepsilon}}_T$ coincides with $\bar{\mathbb P}^{\bar\nu^{\alpha,\varepsilon}}$;

(ii) the following inequality holds:
$$\bar{\mathbb E}^{\bar{\mathbb Q}}\bigg[ \int_0^T d_\Lambda(\bar I^{\alpha,\varepsilon}_t, \bar\alpha_t)\,dt \bigg] \;\le\; \varepsilon;$$

(iii) the quadruple $(x_0, W, \pi, \alpha)$ under $\mathbb P$ has the same law as $(\bar x_0, \bar W, \bar\pi, \bar\alpha)$ under $\bar{\mathbb P}^{\alpha,\varepsilon}$.

Proof. Fix $\alpha \in \mathcal A$ and $\varepsilon > 0$. In order to construct the probabilistic setting of item 1), we apply Proposition A.1 in [1] (with filtration $\mathbb G = \mathbb F^{x_0, W, \pi}$ and $\delta = \varepsilon$), from which we deduce the existence of a probability space $(\bar\Omega, \tilde{\mathcal F}, \tilde{\mathbb Q})$ independent of $\alpha$, $\varepsilon$ (corresponding to $(\hat\Omega, \hat{\mathcal F}, \mathbb Q)$ in the notation of Proposition A.1) and a marked point process $(\bar T^{\alpha,\varepsilon}_n, \bar\eta^{\alpha,\varepsilon}_n)_{n\ge1}$ with corresponding random measure $\bar\theta^{\alpha,\varepsilon} = \sum_{n\ge1} \delta_{(\bar T^{\alpha,\varepsilon}_n, \bar\eta^{\alpha,\varepsilon}_n)}$ on $\bar\Omega$ (corresponding respectively to $(\hat S_n, \hat\eta_n)_{n\ge1}$ and $\hat\mu$ in Proposition A.1), with the following properties:

(a) there exists a probability space $(\Omega', \mathcal F', \mathbb P')$ such that $\bar\Omega = \Omega \times \Omega'$, $\tilde{\mathcal F} = \mathcal F \otimes \mathcal F'$, $\tilde{\mathbb Q} = \mathbb P \otimes \mathbb P'$; we denote by $\bar x_0$, $\bar W$, $\bar\pi$ the natural extensions of $x_0$, $W$, $\pi$ to $\bar\Omega$ (which obviously do not depend on $\alpha$, $\varepsilon$); we also denote by $\tilde{\mathbb F}^{\bar x_0, \bar W, \bar\pi}$ the extension of $\mathbb F^{x_0, W, \pi}$ to $\bar\Omega$;

(b) denoting by $\tilde{\mathbb E}^{\tilde{\mathbb Q}}$ the expectation with respect to $\tilde{\mathbb Q}$, we have
$$\tilde{\mathbb E}^{\tilde{\mathbb Q}}\bigg[ \int_0^T d_\Lambda(\bar I^{\alpha,\varepsilon}_t, \bar\alpha_t)\,dt \bigg] \;\le\; \varepsilon,$$
where $\bar\alpha$ is the natural extension of $\alpha$ to $\bar\Omega = \Omega \times \Omega'$ (which clearly depends only on $\alpha$, not on $\varepsilon$), while $\bar I^{\alpha,\varepsilon}$ is given by
$$\bar I^{\alpha,\varepsilon}_t \;=\; \sum_{n\ge0} \bar\eta^{\alpha,\varepsilon}_n \,\mathbf 1_{[\bar T^{\alpha,\varepsilon}_n, \bar T^{\alpha,\varepsilon}_{n+1})}(t), \qquad \text{for all } t \ge 0,$$
with $\bar T^{\alpha,\varepsilon}_0 = 0$ and $\bar\eta^{\alpha,\varepsilon}_0 = a_0$;

(c) let $\tilde{\mathbb F}^{\bar\theta^{\alpha,\varepsilon}} = (\tilde{\mathcal F}^{\bar\theta^{\alpha,\varepsilon}}_t)_{t\ge0}$ denote the filtration generated by $\bar\theta^{\alpha,\varepsilon}$; let also $\mathcal P(\tilde{\mathcal F}^{\bar x_0, \bar W, \bar\pi}_t \vee \tilde{\mathcal F}^{\bar\theta^{\alpha,\varepsilon}}_t)$ be the predictable $\sigma$-algebra on $[0,T] \times \bar\Omega$ associated with the filtration $(\tilde{\mathcal F}^{\bar x_0, \bar W, \bar\pi}_t \vee \tilde{\mathcal F}^{\bar\theta^{\alpha,\varepsilon}}_t)_{t\ge0}$; then, there exists a $\mathcal P(\tilde{\mathcal F}^{\bar x_0, \bar W, \bar\pi}_t \vee \tilde{\mathcal F}^{\bar\theta^{\alpha,\varepsilon}}_t) \otimes \mathcal B(\Lambda)$-measurable map $\bar\nu^{\alpha,\varepsilon} \colon [0,T] \times \bar\Omega \times \Lambda \to (0,\infty)$, with
$$0 \;<\; \inf_{[0,T]\times\bar\Omega\times\Lambda} \bar\nu^{\alpha,\varepsilon} \;\le\; \sup_{[0,T]\times\bar\Omega\times\Lambda} \bar\nu^{\alpha,\varepsilon} \;<\; +\infty,$$
such that under $\tilde{\mathbb Q}$ the random measure $\bar\theta^{\alpha,\varepsilon}$ has $(\tilde{\mathcal F}^{\bar x_0, \bar W, \bar\pi}_t \vee \tilde{\mathcal F}^{\bar\theta^{\alpha,\varepsilon}}_t)$-compensator on $[0,T] \times \Lambda$ given by $\bar\nu^{\alpha,\varepsilon}_t(a)\lambda_0(da)\,dt$.

¹Here $\bar{\mathbb F}^{\bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,\varepsilon}} = (\bar{\mathcal F}^{\bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,\varepsilon}}_t)_{t\ge0}$ denotes the $\bar{\mathbb P}^{\alpha,\varepsilon}$-completion of the filtration generated by $\bar x_0$, $\bar W$, $\bar\pi$, $\bar\theta^{\alpha,\varepsilon}$, while $\bar\kappa^{\bar\nu^{\alpha,\varepsilon}}$ is the Doléans-Dade exponential given by (4.7) with $\hat\nu$, $\hat\theta$ replaced respectively by $\bar\nu^{\alpha,\varepsilon}$, $\bar\theta^{\alpha,\varepsilon}$.

Now, proceeding as in Section 4.2 of [1], we consider the completion $(\bar\Omega, \bar{\mathcal F}, \bar{\mathbb Q})$ of $(\bar\Omega, \tilde{\mathcal F}, \tilde{\mathbb Q})$. Then, from item (b) above we immediately deduce item (ii).

Let $\bar{\mathbb F}^{\bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,\varepsilon}} = (\bar{\mathcal F}^{\bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,\varepsilon}}_t)_{t\ge0}$ be the $\bar{\mathbb Q}$-completion of the filtration $(\tilde{\mathcal F}^{\bar x_0, \bar W, \bar\pi}_t \vee \tilde{\mathcal F}^{\bar\theta^{\alpha,\varepsilon}}_t)_{t\ge0}$. It is easy to see that under $\bar{\mathbb Q}$ the $\bar{\mathbb F}^{\bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,\varepsilon}}$-compensator of $\bar\theta^{\alpha,\varepsilon}$ on $[0,T] \times \Lambda$ is still given by $\bar\nu^{\alpha,\varepsilon}_t(a)\lambda_0(da)\,dt$. Denote by $\mathcal P(\bar{\mathbb F}^{\bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,\varepsilon}})$ the predictable $\sigma$-algebra on $[0,T] \times \bar\Omega$ associated with $\bar{\mathbb F}^{\bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,\varepsilon}}$. Then, we define $\bar{\mathcal V}^{\alpha,\varepsilon}$ as the set of all $\mathcal P(\bar{\mathbb F}^{\bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,\varepsilon}}) \otimes \mathcal B(\Lambda)$-measurable functions $\bar\nu \colon [0,T] \times \bar\Omega \times \Lambda \to (0,\infty)$ which are bounded from above and bounded away from zero. Notice that $\bar\nu^{\alpha,\varepsilon} \in \bar{\mathcal V}^{\alpha,\varepsilon}$. Let $\bar\kappa^{\bar\nu^{\alpha,\varepsilon}}$ be the Doléans-Dade exponential given by (4.7) with $\hat\nu$, $\hat\theta$ replaced respectively by $\bar\nu^{\alpha,\varepsilon}$, $\bar\theta^{\alpha,\varepsilon}$. Since $\inf_{[0,T]\times\bar\Omega\times\Lambda} \bar\nu^{\alpha,\varepsilon} > 0$, it follows that $\bar\nu^{\alpha,\varepsilon}$ has bounded inverse, so that we can define the probability measure $\bar{\mathbb P}^{\alpha,\varepsilon}$ on $(\bar\Omega, \bar{\mathcal F})$, equivalent to $\bar{\mathbb Q}$, by $d\bar{\mathbb P}^{\alpha,\varepsilon} = (\bar\kappa^{\bar\nu^{\alpha,\varepsilon}}_T)^{-1} d\bar{\mathbb Q}$. Notice that the restriction of $\bar{\mathbb Q}$ to $\bar{\mathcal F}^{\bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,\varepsilon}}_T$ coincides with $\bar{\mathbb P}^{\bar\nu^{\alpha,\varepsilon}}$, which is the probability measure on $(\bar\Omega, \bar{\mathcal F}^{\bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,\varepsilon}}_T)$ defined as $d\bar{\mathbb P}^{\bar\nu^{\alpha,\varepsilon}} = \bar\kappa^{\bar\nu^{\alpha,\varepsilon}}_T \, d\bar{\mathbb P}^{\alpha,\varepsilon}$. This proves item (i).

By Girsanov's theorem, under $\bar{\mathbb P}^{\alpha,\varepsilon}$ the random measure $\bar\theta^{\alpha,\varepsilon}$ has $\bar{\mathbb F}^{\bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,\varepsilon}}$-compensator on $[0,T] \times \Lambda$ given by $\lambda_0(da)\,dt$, so in particular it is a Poisson random measure. Moreover, under $\bar{\mathbb P}^{\alpha,\varepsilon}$ the random variable $\bar x_0$ still has the same law, the process $\bar W$ is still a Brownian motion, and the random measure $\bar\pi$ is still a Poisson random measure with $\bar{\mathbb F}^{\bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,\varepsilon}}$-compensator on $[0,T] \times U$ given by $\lambda_\pi(dz)\,dt$. In addition, $\bar x_0$, $\bar W$, $\bar\pi$, $\bar\theta^{\alpha,\varepsilon}$ are independent under $\bar{\mathbb P}^{\alpha,\varepsilon}$. This shows the validity of item (iii) and concludes the proof.

Theorem 4.7. Under assumptions (A)-(AR), the following equality holds:
$$V_0 \;=\; \hat V^R_0.$$

Proof. Proof of the inequality $V_0 \ge \hat V^R_0$. Let $(\bar\Omega, \bar{\mathcal F}, \bar{\mathbb P}; \bar x_0, \bar W, \bar\pi, \bar\theta; \bar I, \bar X; \bar{\mathcal V})$ and $\bar{\mathcal G}$ be respectively the probabilistic setting for the randomized control problem and the $\sigma$-algebra mentioned in Lemma 4.3. Recall from Proposition 3.5 that
$$V_0 \;=\; \sup_{\bar\alpha\in\bar{\mathcal A}^{\bar{\mathcal G}}} \bar J(\bar\alpha).$$
Then, the inequality $V_0 \ge \hat V^R_0$ follows directly from Lemma 4.2 and Remark 4.5, from which we have
$$\hat V^R_0 \;=\; \bar V^R_0 \;=\; \sup_{\bar\alpha\in\bar{\mathcal A}^{\bar{\mathcal V}}} \bar J(\bar\alpha) \;\le\; \sup_{\bar\alpha\in\bar{\mathcal A}^{\bar{\mathcal G}}} \bar J(\bar\alpha) \;=\; V_0.$$

Proof of the inequality $V_0 \le \hat V^R_0$. Fix $\alpha \in \mathcal A$. Then, for every positive integer $k$, it follows from Lemma 4.6 with $\varepsilon = 1/k$ that there exist a probabilistic setting for the randomized control problem $(\bar\Omega, \bar{\mathcal F}, \bar{\mathbb P}^{\alpha,k}; \bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,k}; \bar I^{\alpha,k}, \bar X^{\alpha,k}; \bar{\mathcal V}^{\alpha,k})$, a probability measure $\bar{\mathbb Q}$ on $(\bar\Omega, \bar{\mathcal F})$ equivalent to $\bar{\mathbb P}^{\alpha,k}$, a process $\bar\alpha \colon [0,T] \times \bar\Omega \to \Lambda$, and $\bar\nu^{\alpha,k} \in \bar{\mathcal V}^{\alpha,k}$ such that:

(i) $\bar{\mathbb Q}|_{\bar{\mathcal F}^{\bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,k}}_T}$ coincides with $\bar{\mathbb P}^{\bar\nu^{\alpha,k}}$;

(ii) $\bar{\mathbb E}^{\bar{\mathbb Q}}\big[ \int_0^T d_\Lambda(\bar I^{\alpha,k}_t, \bar\alpha_t)\,dt \big] \le 1/k$, so, in particular,
$$\bar{\mathbb E}^{\bar{\mathbb Q}}\bigg[ \int_0^T d_\Lambda(\bar I^{\alpha,k}_t, \bar\alpha_t)\,dt \bigg] \;\underset{k\to+\infty}{\longrightarrow}\; 0; \tag{4.11}$$

(iii) $(x_0, W, \pi, \alpha)$ under $\mathbb P$ has the same law as $(\bar x_0, \bar W, \bar\pi, \bar\alpha)$ under $\bar{\mathbb P}^{\alpha,k}$.

The claim follows if we prove that
$$\lim_{k\to+\infty} \bar J^{R,\alpha,k}(\bar\nu^{\alpha,k}) \;=\; J(\alpha), \tag{4.12}$$
where $\bar J^{R,\alpha,k}$ denotes the gain functional for the randomized control problem $(\bar\Omega, \bar{\mathcal F}, \bar{\mathbb P}^{\alpha,k}; \bar x_0, \bar W, \bar\pi, \bar\theta^{\alpha,k}; \bar I^{\alpha,k}, \bar X^{\alpha,k}; \bar{\mathcal V}^{\alpha,k})$, which is given by
$$\bar J^{R,\alpha,k}(\bar\nu^{\alpha,k}) \;=\; \bar{\mathbb E}^{\bar\nu^{\alpha,k}}\bigg[ \int_0^T f_t(\bar X^{\alpha,k}, \bar I^{\alpha,k}_t)\,dt + g(\bar X^{\alpha,k}) \bigg],$$
with
$$\begin{cases} d\bar X^{\alpha,k}_t = A\bar X^{\alpha,k}_t\,dt + b_t(\bar X^{\alpha,k}, \bar I^{\alpha,k}_t)\,dt + \sigma_t(\bar X^{\alpha,k}, \bar I^{\alpha,k}_t)\,d\bar W_t + \displaystyle\int_{U\setminus\{0\}} \gamma_t(\bar X^{\alpha,k}, \bar I^{\alpha,k}_{t-}, z)\,\big(\bar\pi(dt\,dz) - \lambda_\pi(dz)\,dt\big), \\ \bar X^{\alpha,k}_0 = \bar x_0. \end{cases}$$

As a matter of fact, if (4.12) holds true, then for every $\varepsilon > 0$ there exists $k_\varepsilon$ such that $J(\alpha) \le \bar J^{R,\alpha,k}(\bar\nu^{\alpha,k}) + \varepsilon \le \bar V^{R,\alpha,k}_0 + \varepsilon$, for all $k \ge k_\varepsilon$. By Lemma 4.2 we know that $\bar V^{R,\alpha,k}_0 = \hat V^R_0$, so the claim follows.

It remains to prove (4.12). By item (i) above we notice that $\bar J^{R,\alpha,k}(\bar\nu^{\alpha,k})$ can be equivalently written in terms of $\bar{\mathbb E}^{\bar{\mathbb Q}}$:
$$\bar J^{R,\alpha,k}(\bar\nu^{\alpha,k}) \;=\; \bar{\mathbb E}^{\bar{\mathbb Q}}\bigg[ \int_0^T f_t(\bar X^{\alpha,k}, \bar I^{\alpha,k}_t)\,dt + g(\bar X^{\alpha,k}) \bigg].$$
On the other hand, by item (iii) above, $J(\alpha)$ is also given by
$$J(\alpha) \;=\; \bar{\mathbb E}^{\bar{\mathbb Q}}\bigg[ \int_0^T f_t(\bar X^{\bar\alpha}, \bar\alpha_t)\,dt + g(\bar X^{\bar\alpha}) \bigg],$$
with
$$\begin{cases} d\bar X^{\bar\alpha}_t = A\bar X^{\bar\alpha}_t\,dt + b_t(\bar X^{\bar\alpha}, \bar\alpha_t)\,dt + \sigma_t(\bar X^{\bar\alpha}, \bar\alpha_t)\,d\bar W_t + \displaystyle\int_{U\setminus\{0\}} \gamma_t(\bar X^{\bar\alpha}, \bar\alpha_t, z)\,\big(\bar\pi(dt\,dz) - \lambda_\pi(dz)\,dt\big), \\ \bar X^{\bar\alpha}_0 = \bar x_0. \end{cases}$$

Hence, (4.12) can be equivalently rewritten as follows:
$$\bar{\mathbb E}^{\bar{\mathbb Q}}\bigg[ \int_0^T f_t(\bar X^{\alpha,k}, \bar I^{\alpha,k}_t)\,dt + g(\bar X^{\alpha,k}) \bigg] \;\underset{k\to+\infty}{\longrightarrow}\; \bar{\mathbb E}^{\bar{\mathbb Q}}\bigg[ \int_0^T f_t(\bar X^{\bar\alpha}, \bar\alpha_t)\,dt + g(\bar X^{\bar\alpha}) \bigg]. \tag{4.13}$$

Now, we notice that, under assumptions (A)-(AR), proceeding along the same lines as in the proof of Proposition 3.2, we can prove the following result: for every $1 \le p \le p_0$,
$$\bar{\mathbb E}^{\bar{\mathbb Q}}\Big[ \sup_{t\in[0,T]} \big| \bar X^{\alpha,k}_t - \bar X^{\bar\alpha}_t \big|^p \Big] \;\underset{k\to+\infty}{\longrightarrow}\; 0. \tag{4.14}$$
It is then easy to see that, by the continuity and polynomial growth assumptions on $f$ and $g$ in (A)-(v) and (A)-(vi), convergence (4.13) follows directly from (4.11) and (4.14). This concludes the proof of the inequality $V_0 \le \hat V^R_0$.


5 BSDE with non-positive jumps

Let $(\hat\Omega, \hat{\mathcal F}, \hat{\mathbb P})$ be the complete probability space on which $\hat x_0$, $\hat W$, $\hat\pi$, $\hat\theta$ are defined, as in Section 3.2. $\hat{\mathbb F}^{\hat x_0, \hat W, \hat\pi, \hat\theta} = (\hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t)_{t\ge0}$ still denotes the $\hat{\mathbb P}$-completion of the filtration generated by $\hat x_0$, $\hat W$, $\hat\pi$, $\hat\theta$; we also recall that $\mathcal P(\hat{\mathbb F}^{\hat x_0, \hat W, \hat\pi, \hat\theta})$ is the predictable $\sigma$-algebra on $[0,T] \times \hat\Omega$ corresponding to $\hat{\mathbb F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}$. We begin by introducing the following notations.

• $\mathbf S^2$ denotes the set of càdlàg $\hat{\mathbb F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}$-adapted processes $Y \colon [0,T] \times \hat\Omega \to \mathbb R$ satisfying
$$\|Y\|^2_{\mathbf S^2} \;:=\; \hat{\mathbb E}\Big[ \sup_{0\le t\le T} |Y_t|^2 \Big] \;<\; \infty.$$

• $\mathbf L^p(0,T)$, $p \ge 1$, denotes the set of $\hat{\mathbb F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}$-adapted processes $\phi \colon [0,T] \times \hat\Omega \to \mathbb R$ satisfying
$$\|\phi\|^p_{\mathbf L^p(0,T)} \;:=\; \hat{\mathbb E}\bigg[ \int_0^T |\phi_t|^p\,dt \bigg] \;<\; \infty.$$

• $\mathbf L^p(\hat W)$, $p \ge 1$, denotes the set of $\mathcal P(\hat{\mathbb F}^{\hat x_0, \hat W, \hat\pi, \hat\theta})$-measurable processes $Z \colon [0,T] \times \hat\Omega \to \Xi$ satisfying
$$\|Z\|^p_{\mathbf L^p(\hat W)} \;:=\; \hat{\mathbb E}\bigg[ \bigg( \int_0^T |Z_t|^2_\Xi\,dt \bigg)^{\!\frac p2} \bigg] \;<\; \infty.$$
We shall identify $\Xi$ with its dual $\Xi^*$. Notice also that $\Xi^* = L_2(\Xi, \mathbb R)$, the space of Hilbert-Schmidt operators from $\Xi$ into $\mathbb R$, endowed with the usual scalar product.

• $\mathbf L^p(\hat\pi)$, $p \ge 1$, denotes the set of $\mathcal P(\hat{\mathbb F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}) \otimes \mathcal B(U)$-measurable maps $L \colon [0,T] \times \hat\Omega \times U \to \mathbb R$ satisfying
$$\|L\|^p_{\mathbf L^p(\hat\pi)} \;:=\; \hat{\mathbb E}\bigg[ \bigg( \int_0^T\!\!\int_U |L_t(z)|^2\,\lambda_\pi(dz)\,dt \bigg)^{\!\frac p2} \bigg] \;<\; \infty.$$

• $\mathbf L^p(\hat\theta)$, $p \ge 1$, denotes the set of $\mathcal P(\hat{\mathbb F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}) \otimes \mathcal B(\Lambda)$-measurable maps $R \colon [0,T] \times \hat\Omega \times \Lambda \to \mathbb R$ satisfying
$$\|R\|^p_{\mathbf L^p(\hat\theta)} \;:=\; \hat{\mathbb E}\bigg[ \bigg( \int_0^T\!\!\int_\Lambda |R_t(b)|^2\,\lambda_0(db)\,dt \bigg)^{\!\frac p2} \bigg] \;<\; \infty.$$

• $\mathbf L^p(\lambda_0)$, $p \ge 1$, denotes the set of $\mathcal B(\Lambda)$-measurable maps $r \colon \Lambda \to \mathbb R$ satisfying
$$\|r\|^p_{\mathbf L^p(\lambda_0)} \;:=\; \int_\Lambda |r(b)|^p\,\lambda_0(db) \;<\; \infty.$$

• $\mathbf K^2$ denotes the set of non-decreasing $\mathcal P(\hat{\mathbb F}^{\hat x_0, \hat W, \hat\pi, \hat\theta})$-measurable processes $K \in \mathbf S^2$ satisfying $K_0 = 0$, so that
$$\|K\|^2_{\mathbf S^2} \;=\; \hat{\mathbb E}\big[ |K_T|^2 \big].$$

Consider the following backward stochastic differential equation with non-positive jumps:
$$Y_t = g(\hat X) + \int_t^T f_s(\hat X, \hat I_s)\,ds + K_T - K_t - \int_t^T\!\!\int_\Lambda R_s(b)\,\hat\theta(ds\,db) \tag{5.1}$$
$$\qquad\quad - \int_t^T Z_s\,d\hat W_s - \int_t^T\!\!\int_U L_s(z)\,\big(\hat\pi(ds\,dz) - \lambda_\pi(dz)\,ds\big), \qquad 0 \le t \le T,\ \hat{\mathbb P}\text{-a.s.},$$
$$R_t(b) \le 0, \qquad dt \otimes d\hat{\mathbb P} \otimes \lambda_0(db)\text{-a.e. on } \hat\Omega \times [0,T] \times \Lambda. \tag{5.2}$$


Definition 5.1. A minimal solution to equation (5.1)-(5.2) is a quintuple $(Y, Z, L, R, K) \in \mathbf S^2 \times \mathbf L^2(\hat W) \times \mathbf L^2(\hat\pi) \times \mathbf L^2(\hat\theta) \times \mathbf K^2$ satisfying (5.1)-(5.2) such that, for any other quintuple $(\tilde Y, \tilde Z, \tilde L, \tilde R, \tilde K) \in \mathbf S^2 \times \mathbf L^2(\hat W) \times \mathbf L^2(\hat\pi) \times \mathbf L^2(\hat\theta) \times \mathbf K^2$ satisfying (5.1)-(5.2), we have
$$Y_t \;\le\; \tilde Y_t, \qquad 0 \le t \le T,\ \hat{\mathbb P}\text{-a.s.}$$

Lemma 5.2. Under assumptions (A)-(AR), there exists at most one minimal solution to equation (5.1)-(5.2).

Proof. The uniqueness of $Y$ follows from the definition of minimal solution. Now, let $(Y, Z, L, R, K), (Y, \tilde Z, \tilde L, \tilde R, \tilde K) \in \mathbf S^2 \times \mathbf L^2(\hat W) \times \mathbf L^2(\hat\pi) \times \mathbf L^2(\hat\theta) \times \mathbf K^2$ be two minimal solutions. Then
$$K_t - \tilde K_t - \int_0^t \big( Z_s - \tilde Z_s \big)\,d\hat W_s + \int_0^t\!\!\int_U \big( L_s(z) - \tilde L_s(z) \big)\,\lambda_\pi(dz)\,ds$$
$$=\; \int_0^t\!\!\int_U \big( L_s(z) - \tilde L_s(z) \big)\,\hat\pi(ds\,dz) + \int_0^t\!\!\int_\Lambda \big( R_s(b) - \tilde R_s(b) \big)\,\hat\theta(ds\,db), \tag{5.3}$$
for all $0 \le t \le T$, $\hat{\mathbb P}$-a.s. Observe that the left-hand side of (5.3) is a predictable process, which therefore has no totally inaccessible jumps, while the right-hand side of (5.3) is a pure jump process with only totally inaccessible jumps. We deduce that both sides must be equal to zero. Therefore, we obtain the two following equalities: for all $0 \le t \le T$, $\hat{\mathbb P}$-a.s.,
$$K_t - \tilde K_t + \int_0^t\!\!\int_U \big( L_s(z) - \tilde L_s(z) \big)\,\lambda_\pi(dz)\,ds \;=\; \int_0^t \big( Z_s - \tilde Z_s \big)\,d\hat W_s,$$
$$\int_0^t\!\!\int_U \big( L_s(z) - \tilde L_s(z) \big)\,\hat\pi(ds\,dz) \;=\; \int_0^t\!\!\int_\Lambda \big( R_s(b) - \tilde R_s(b) \big)\,\hat\theta(ds\,db).$$
Concerning the first equation, the left-hand side is a finite variation process, while the process on the right-hand side does not have finite variation, unless $Z = \tilde Z$ and $K - \tilde K + \int_0^{\cdot}\!\int_U (L_s(z) - \tilde L_s(z))\,\lambda_\pi(dz)\,ds = 0$. On the other hand, since $\hat\pi$ and $\hat\theta$ are independent, they have disjoint jump times; therefore, from the second equation above we find $L = \tilde L$ and $R = \tilde R$, from which we also obtain $K = \tilde K$.

We now focus on the existence of a minimal solution to (5.1)-(5.2). To this end, we introduce, for every integer $n \ge 1$, the following penalized backward stochastic differential equation:
$$Y^n_t = g(\hat X) + \int_t^T f_s(\hat X, \hat I_s)\,ds + K^n_T - K^n_t - \int_t^T\!\!\int_\Lambda R^n_s(b)\,\hat\theta(ds\,db) \tag{5.4}$$
$$\qquad\quad - \int_t^T Z^n_s\,d\hat W_s - \int_t^T\!\!\int_U L^n_s(z)\,\big(\hat\pi(ds\,dz) - \lambda_\pi(dz)\,ds\big), \qquad 0 \le t \le T,\ \hat{\mathbb P}\text{-a.s.},$$
where
$$K^n_t \;=\; n \int_0^t\!\!\int_\Lambda \big( R^n_s(b) \big)^+\,\lambda_0(db)\,ds, \qquad 0 \le t \le T,\ \hat{\mathbb P}\text{-a.s.},$$
with $(\cdot)^+ = \max(\cdot, 0)$ denoting the positive part.
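The penalization mechanism in (5.4) — charging a rate $n$ per unit of $(R^n)^+$ so that the constraint $R \le 0$ in (5.2) is enforced in the limit $n \to \infty$ — parallels the classical penalty approximation of reflection problems. The deterministic toy sketch below (an analogy only, not the BSDE itself; all parameter values are illustrative) penalizes the constraint $x \ge 0$ for a scalar ODE and compares the outcome with the explicit minimal reflected (Skorokhod) solution.

```python
import numpy as np

# Toy analogue of penalization: keep x(t) >= 0 for dx = drift*dt by adding the
# penalty drift n*max(-x, 0), mimicking the role of K^n_t = n*int (R^n)^+ in (5.4).
T, dt = 1.0, 1e-4
n_pen = 1000.0                 # penalization parameter (plays the role of n)
x0, drift = 0.5, -1.0          # the unconstrained path 0.5 - t would become negative

steps = int(T / dt)
x = x0
path = np.empty(steps + 1)
path[0] = x0
for i in range(steps):
    x = x + (drift + n_pen * max(-x, 0.0)) * dt   # explicit Euler; stable since n_pen*dt < 1
    path[i + 1] = x

# Explicit minimal solution reflected at 0 (Skorokhod construction): here it
# reduces to x(t) = max(x0 + drift*t, 0).
t_grid = np.linspace(0.0, T, steps + 1)
reflected = np.maximum(x0 + drift * t_grid, 0.0)

err = np.max(np.abs(path - reflected))
print(err)   # small for large n_pen: penalized paths approximate the reflected one
```

As in the BSDE setting, the penalized paths are monotone in the penalty parameter and violate the constraint only by $O(1/n)$, which is the heuristic behind letting $n \to \infty$ in (5.4).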

Lemma 5.3 (Martingale representation). Suppose that assumptions (A)-(iii) and (AR)-(i) hold. Given any $\xi \in L^2(\hat\Omega, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_T, \hat{\mathbb P})$, there exist $Z \in \mathbf L^2(\hat W)$, $L \in \mathbf L^2(\hat\pi)$, $R \in \mathbf L^2(\hat\theta)$ such that
$$\xi \;=\; \hat{\mathbb E}[\xi\,|\,\hat x_0] + \int_0^T Z_t\,d\hat W_t + \int_0^T\!\!\int_U L_t(z)\,\hat\pi(dt\,dz) + \int_0^T\!\!\int_\Lambda R_t(b)\,\hat\theta(dt\,db), \qquad \hat{\mathbb P}\text{-a.s.} \tag{5.5}$$

Proof. We begin by noting that, when $\hat W$ is a finite-dimensional Brownian motion, representation (5.5) for $\xi$ can be easily proved using for instance Lemma 2.3 in [21]. As a matter of fact, let $\hat{\mathbb F}^{\hat x_0} = (\hat{\mathcal F}^{\hat x_0}_t)_{t\ge0}$, $\hat{\mathbb F}^{\hat W,0} = (\hat{\mathcal F}^{\hat W,0}_t)_{t\ge0}$, $\hat{\mathbb F}^{\hat\pi,0} = (\hat{\mathcal F}^{\hat\pi,0}_t)_{t\ge0}$, $\hat{\mathbb F}^{\hat\theta,0} = (\hat{\mathcal F}^{\hat\theta,0}_t)_{t\ge0}$ be the $\hat{\mathbb P}$-completions of the filtrations generated respectively by $\hat x_0$, $\hat W$, $\hat\pi$, $\hat\theta$. When $\xi = \mathbf 1_{E_{\hat x_0}} \mathbf 1_{E_{\hat W,0}} \mathbf 1_{E_{\hat\pi,0}} \mathbf 1_{E_{\hat\theta,0}}$, with $E_{\hat x_0} \in \hat{\mathcal F}^{\hat x_0}_T$, $E_{\hat W,0} \in \hat{\mathcal F}^{\hat W,0}_T$, $E_{\hat\pi,0} \in \hat{\mathcal F}^{\hat\pi,0}_T$, $E_{\hat\theta,0} \in \hat{\mathcal F}^{\hat\theta,0}_T$, then representation (5.5) for $\xi$ follows easily from Lemma 2.3 in [21]. Since the linear span of the random variables of the form $\mathbf 1_{E_{\hat x_0}} \mathbf 1_{E_{\hat W,0}} \mathbf 1_{E_{\hat\pi,0}} \mathbf 1_{E_{\hat\theta,0}}$ is dense in $L^2(\hat\Omega, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_T, \hat{\mathbb P})$, we deduce the validity of (5.5) for a general $\xi \in L^2(\hat\Omega, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_T, \hat{\mathbb P})$.

In the infinite-dimensional case, let $(e_k)_{k\ge1}$ be an orthonormal basis of $\Xi$ and define $\hat W^{(k)}_t = \langle \hat W_t, e_k \rangle_\Xi$, for $t \ge 0$. The processes $\hat W^{(k)}$ are independent standard real Brownian motions. For any positive integer $n$, let $\hat{\mathbb F}^{(n)} = (\hat{\mathcal F}^{(n)}_t)_{t\ge0}$ denote the $\hat{\mathbb P}$-completion of the filtration generated by $\hat x_0$, $\hat W^{(1)}, \ldots, \hat W^{(n)}$, $\hat\pi$, $\hat\theta$. Notice that $\hat{\mathbb F}^{(n)}$ satisfies the usual conditions. Denote $\xi^{(n)} = \hat{\mathbb E}[\xi\,|\,\hat{\mathcal F}^{(n)}_T]$. By the previously mentioned finite-dimensional version of representation (5.5), we have a martingale representation for $\xi^{(n)}$. It is then easy to see that, letting $n \to +\infty$ in such a martingale representation, (5.5) follows.

Proposition 5.4. Under assumptions (A)-(AR), for every integer $n \ge 1$ there exists a unique solution $(Y^n, Z^n, L^n, R^n) \in \mathbf S^2 \times \mathbf L^2(\hat W) \times \mathbf L^2(\hat\pi) \times \mathbf L^2(\hat\theta)$ to equation (5.4). In addition, the following estimate holds:
$$\|Z^n\|^2_{\mathbf L^2(\hat W)} + \|L^n\|^2_{\mathbf L^2(\hat\pi)} + \|R^n\|^2_{\mathbf L^2(\hat\theta)} + \|K^n\|^2_{\mathbf S^2} \;\le\; \hat C\bigg( \|Y^n\|^2_{\mathbf S^2} + \hat{\mathbb E}\bigg[ \int_0^T |f_t(\hat X, \hat I_t)|^2\,dt \bigg] \bigg), \tag{5.6}$$
for some constant $\hat C \ge 0$, depending only on $T$ and on the constant $L$ in assumption (A)-(vi), independent of $n$.

Proof. The existence and uniqueness result can be proved as in the finite-dimensional case $\dim \Xi < \infty$, see Lemma 2.4 in [21]. We simply recall that, as usual, it is based on a fixed point argument and on the martingale representation (concerning this latter result, since we did not find a reference for it suitable for our setting, we proved it in Lemma 5.3).

Similarly, estimate (5.6) can be proved proceeding along the same lines as in the finite-dimensional case $\dim \Xi < \infty$, for which we refer to Lemma 2.3 in [15]; we just recall that its proof is based on the application of Itô's formula to $|Y^n|^2$, as well as on Gronwall's lemma and the Burkholder-Davis-Gundy inequality.

For every integer $n \ge 1$, we provide the following representation of $Y^n$ in terms of a suitable penalized randomized control problem. To this end, we define $\hat{\mathcal V}^n$ as the subset of $\hat{\mathcal V}$ of all maps $\hat\nu$ bounded from above by $n$.

We recall that, for every $\hat\nu \in \hat{\mathcal V}$, $\hat{\mathbb E}^{\hat\nu}$ denotes the expectation with respect to the probability measure on $(\hat\Omega, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_T)$ given by $d\hat{\mathbb P}^{\hat\nu} = \hat\kappa^{\hat\nu}_T\,d\hat{\mathbb P}$, where $(\hat\kappa^{\hat\nu}_t)_{t\in[0,T]}$ denotes the Doléans-Dade exponential defined in (4.7).

Lemma 5.5. Under assumptions (A)-(AR), for every integer $n \ge 1$ the following equalities hold:
$$Y^n_t \;=\; \operatorname*{ess\,sup}_{\hat\nu\in\hat{\mathcal V}^n} \hat{\mathbb E}^{\hat\nu}\bigg[ \int_t^T f_s(\hat X, \hat I_s)\,ds + g(\hat X) \,\bigg|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \bigg], \qquad \hat{\mathbb P}\text{-a.s.},\ 0 \le t \le T, \tag{5.7}$$
and
$$\hat{\mathbb E}[Y^n_0] \;=\; \sup_{\hat\nu\in\hat{\mathcal V}^n} \hat{\mathbb E}^{\hat\nu}\bigg[ \int_0^T f_s(\hat X, \hat I_s)\,ds + g(\hat X) \bigg]. \tag{5.8}$$
Moreover:

• for every $0 \le t \le T$, the sequence $(Y^n_t)_n$ is non-decreasing;

• there exists a constant $\bar C \ge 0$, depending only on $T$, $\bar p$, and on the constant $L$ in assumption (A)-(vi), independent of $n$, such that
$$\sup_{s\in[0,T]} |Y^n_s| \;\le\; \bar C\Big( 1 + \sup_{s\in[0,T]} |\hat X_s|^{\bar p} \Big), \qquad \hat{\mathbb P}\text{-a.s.} \tag{5.9}$$

Proof. Proof of formulae (5.7) and (5.8). We report the proof of formula (5.7), as (5.8) can be proved proceeding along the same lines (simply replacing all the $\hat{\mathbb P}^{\hat\nu}$-conditional expectations with ordinary $\hat{\mathbb P}^{\hat\nu}$-expectations, and also noting that $\hat{\mathbb P}^{\hat\nu}$ coincides with $\hat{\mathbb P}$ on $\hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_0$, which is the $\hat{\mathbb P}$-completion of the $\sigma$-algebra generated by $\hat x_0$). Fix an integer $n \ge 1$ and let $(Y^n, Z^n, L^n, R^n)$ be the solution to (5.4), whose existence follows from Proposition 5.4. As a consequence of Girsanov's theorem, the two following processes
$$\int_0^t Z^n_s\,d\hat W_s, \qquad \int_0^t\!\!\int_U L^n_s(z)\,\big(\hat\pi(ds\,dz) - \lambda_\pi(dz)\,ds\big),$$
are $\hat{\mathbb P}^{\hat\nu}$-martingales (see e.g. Theorem 15.3.10 in [5] or Theorem 12.31 in [14]). Moreover,
$$\hat{\mathbb E}^{\hat\nu}\bigg[ \int_t^T\!\!\int_\Lambda R^n_s(b)\,\hat\theta(ds\,db) \,\bigg|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \bigg] \;=\; \hat{\mathbb E}^{\hat\nu}\bigg[ \int_t^T\!\!\int_\Lambda R^n_s(b)\,\hat\nu_s(b)\,\lambda_0(db)\,ds \,\bigg|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \bigg].$$
Therefore, taking the $\hat{\mathbb P}^{\hat\nu}$-conditional expectation given $\hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t$ in (5.4), we obtain
$$Y^n_t \;=\; \hat{\mathbb E}^{\hat\nu}\bigg[ g(\hat X) + \int_t^T f_s(\hat X, \hat I_s)\,ds \,\bigg|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \bigg] \tag{5.10}$$
$$\qquad\quad + \hat{\mathbb E}^{\hat\nu}\bigg[ \int_t^T\!\!\int_\Lambda \big[ n\big(R^n_s(b)\big)^+ - \hat\nu_s(b)\,R^n_s(b) \big]\,\lambda_0(db)\,ds \,\bigg|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \bigg], \qquad \hat{\mathbb P}\text{-a.s.},\ 0 \le t \le T.$$

Firstly, we notice that $n u^+ - \nu u \ge 0$ for all $u \in \mathbb R$, $\nu \in (0, n]$ (indeed $n u^+ - \nu u = (n - \nu)u^+ + \nu u^-$), so that (5.10) gives
$$Y^n_t \;\ge\; \operatorname*{ess\,sup}_{\hat\nu\in\hat{\mathcal V}^n} \hat{\mathbb E}^{\hat\nu}\bigg[ g(\hat X) + \int_t^T f_s(\hat X, \hat I_s)\,ds \,\bigg|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \bigg], \qquad \hat{\mathbb P}\text{-a.s.},\ 0 \le t \le T. \tag{5.11}$$
On the other hand, since $R^n \in \mathbf L^2(\hat\theta)$, by Lebesgue's dominated convergence theorem for conditional expectations, we obtain
$$\lim_{N\to\infty} \hat{\mathbb E}\bigg[ \int_t^T\!\!\int_\Lambda |R^n_s(b)|^2\,\mathbf 1_{\{R^n_s(b)\le -N\}}\,\lambda_0(db)\,ds \,\bigg|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \bigg] \;=\; 0.$$
So, in particular, for every $n \ge 1$ there exists a positive integer $N_n$ such that
$$\hat{\mathbb E}\bigg[ \int_t^T\!\!\int_\Lambda |R^n_s(b)|^2\,\mathbf 1_{\{R^n_s(b)\le -N_n\}}\,\lambda_0(db)\,ds \bigg] \;\le\; e^{-(n-1)^2\lambda_0(\Lambda)(T-t)}. \tag{5.12}$$
Now, let us define
$$\hat\nu^{n,\varepsilon}_s(b) \;:=\; n\,\mathbf 1_{\{R^n_s(b)\ge0\}} + \varepsilon\,\mathbf 1_{\{-1<R^n_s(b)<0\}} - \varepsilon\,\big(R^n_s(b)\big)^{-1}\,\mathbf 1_{\{-N_n<R^n_s(b)\le-1\}} + \varepsilon\,\mathbf 1_{\{R^n_s(b)\le-N_n\}}.$$
It is easy to see that $\hat\nu^{n,\varepsilon} \in \hat{\mathcal V}^n$. Moreover, we have
$$\hat{\mathbb E}^{\hat\nu^{n,\varepsilon}}\bigg[ \int_t^T\!\!\int_\Lambda \big[ n\big(R^n_s(b)\big)^+ - \hat\nu^{n,\varepsilon}_s(b)\,R^n_s(b) \big]\,\lambda_0(db)\,ds \,\bigg|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \bigg] \;\le\; \varepsilon\sqrt{(T-t)\lambda_0(\Lambda)}\,\Bigg\{ \sqrt{(T-t)\lambda_0(\Lambda)} \tag{5.13}$$
$$\qquad + \sqrt{ \hat{\mathbb E}\bigg[ \Big( \frac{\hat\kappa^{\hat\nu^{n,\varepsilon}}_T}{\hat\kappa^{\hat\nu^{n,\varepsilon}}_t} \Big)^{\!2} \,\bigg|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \bigg] }\; \sqrt{ \hat{\mathbb E}\bigg[ \int_t^T\!\!\int_\Lambda |R^n_s(b)|^2\,\mathbf 1_{\{R^n_s(b)\le-N_n\}}\,\lambda_0(db)\,ds \,\bigg|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \bigg] } \Bigg\}.$$

Recalling that, for every $\hat\nu \in \hat{\mathcal V}$, it holds that $|\hat\kappa^{\hat\nu}_s|^2 = \hat\kappa^{|\hat\nu|^2}_s\, e^{\int_0^s\!\int_\Lambda (\hat\nu_r(b)-1)^2\,\lambda_0(db)\,dr}$, $s \in [0,T]$ (see e.g. the proof of Lemma 4.1 in [15]), we obtain
$$\hat{\mathbb E}\bigg[ \Big( \frac{\hat\kappa^{\hat\nu^{n,\varepsilon}}_T}{\hat\kappa^{\hat\nu^{n,\varepsilon}}_t} \Big)^{\!2} \,\bigg|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \bigg] \;=\; \hat{\mathbb E}\bigg[ \frac{\hat\kappa^{|\hat\nu^{n,\varepsilon}|^2}_T}{\hat\kappa^{|\hat\nu^{n,\varepsilon}|^2}_t}\, e^{\int_t^T\!\int_\Lambda (\hat\nu^{n,\varepsilon}_r(b)-1)^2\,\lambda_0(db)\,dr} \,\bigg|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \bigg] \tag{5.14}$$
$$\le\; \hat{\mathbb E}\bigg[ \frac{\hat\kappa^{|\hat\nu^{n,\varepsilon}|^2}_T}{\hat\kappa^{|\hat\nu^{n,\varepsilon}|^2}_t} \,\bigg|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \bigg]\, e^{(n-1)^2\lambda_0(\Lambda)(T-t)} \;=\; e^{(n-1)^2\lambda_0(\Lambda)(T-t)},$$
where the last equality follows from the fact that, for every $\hat\nu \in \hat{\mathcal V}$, we have $\hat\nu^2 \in \hat{\mathcal V}$, so that $\hat\kappa^{\hat\nu^2}$ is a martingale. Plugging (5.12) and (5.14) into (5.13), we end up with
$$\hat{\mathbb E}^{\hat\nu^{n,\varepsilon}}\bigg[ \int_t^T\!\!\int_\Lambda \big[ n\big(R^n_s(b)\big)^+ - \hat\nu^{n,\varepsilon}_s(b)\,R^n_s(b) \big]\,\lambda_0(db)\,ds \,\bigg|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \bigg] \;\le\; \varepsilon\sqrt{(T-t)\lambda_0(\Lambda)}\,\Big\{ \sqrt{(T-t)\lambda_0(\Lambda)} + 1 \Big\} \;=\; \varepsilon\tilde C, \tag{5.15}$$
with $\tilde C := \sqrt{(T-t)\lambda_0(\Lambda)}\,\{\sqrt{(T-t)\lambda_0(\Lambda)} + 1\}$. Plugging (5.15) into (5.10), we get
$$Y^n_t \;\le\; \hat{\mathbb E}^{\hat\nu^{n,\varepsilon}}\bigg[ g(\hat X) + \int_t^T f_s(\hat X, \hat I_s)\,ds \,\bigg|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \bigg] + \varepsilon\tilde C \;\le\; \operatorname*{ess\,sup}_{\hat\nu\in\hat{\mathcal V}^n} \hat{\mathbb E}^{\hat\nu}\bigg[ g(\hat X) + \int_t^T f_s(\hat X, \hat I_s)\,ds \,\bigg|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \bigg] + \varepsilon\tilde C, \qquad \hat{\mathbb P}\text{-a.s.},\ 0 \le t \le T.$$
From the arbitrariness of $\varepsilon$, we find the reverse inequality of (5.11), from which (5.7) follows.

Proof of the monotonicity of $(Y^n)_n$. By definition, $\hat{\mathcal V}^n \subset \hat{\mathcal V}^{n+1}$. Then the inequality $Y^n_t \le Y^{n+1}_t$, $\hat{\mathbb P}$-a.s. for all $t \in [0,T]$, follows directly from (5.7).

Proof of formula (5.9). In the sequel we denote by $\bar C$ a non-negative constant, depending only on $T$, $\bar p$, and on the constant $L$ in assumption (A)-(vi), independent of $n$, which may change from line to line. Recalling the polynomial growth condition (2.1) on $f$ and $g$ in assumption (A)-(vi), it follows from formula (5.7) that
$$|Y^n_t| \;\le\; \bar C\, \operatorname*{ess\,sup}_{\hat\nu\in\hat{\mathcal V}^n} \hat{\mathbb E}^{\hat\nu}\Big[ 1 + \sup_{s\in[0,T]} |\hat X_s|^{\bar p} \,\Big|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \Big], \qquad \hat{\mathbb P}\text{-a.s.},\ 0 \le t \le T.$$
Finally, by estimate (4.5), together with the fact that $Y^n$ is a càdlàg process, we see that (5.9) follows.

We can now prove the main result of this section.

Theorem 5.6. Under assumptions (A)-(AR), there exists a unique minimal solution $(Y, Z, L, R, K) \in \mathbf S^2 \times \mathbf L^2(\hat W) \times \mathbf L^2(\hat\pi) \times \mathbf L^2(\hat\theta) \times \mathbf K^2$ to (5.1)-(5.2), satisfying
$$Y_t \;=\; \operatorname*{ess\,sup}_{\hat\nu\in\hat{\mathcal V}} \hat{\mathbb E}^{\hat\nu}\bigg[ \int_t^T f_s(\hat X, \hat I_s)\,ds + g(\hat X) \,\bigg|\, \hat{\mathcal F}^{\hat x_0, \hat W, \hat\pi, \hat\theta}_t \bigg], \qquad \hat{\mathbb P}\text{-a.s.},\ 0 \le t \le T, \tag{5.16}$$
and
$$\hat{\mathbb E}[Y_0] \;=\; \sup_{\hat\nu\in\hat{\mathcal V}} \hat{\mathbb E}^{\hat\nu}\bigg[ \int_0^T f_s(\hat X, \hat I_s)\,ds + g(\hat X) \bigg] \;=\; \hat V^R_0. \tag{5.17}$$
