• Non ci sono risultati.

Elliptic Hamilton-Jacobi-Bellman equations degenerating at the boundary and applications to stochastic control

N/A
N/A
Protected

Academic year: 2021

Condividi "Elliptic Hamilton-Jacobi-Bellman equations degenerating at the boundary and applications to stochastic control"

Copied!
25
0
0

Testo completo

(1)

Elliptic Hamilton-Jacobi-Bellman equations degenerating at the boundary

and applications to stochastic control

Martino Bardi

with Annalisa Cesaroni and Luca Rossi

Department of Mathematics University of Padua, Italy

"Hamilton-Jacobi Equations: new trends and applications"

University Rennes 1, France

28th May - 3rd June, 2016

(2)

H-J-Bellman operator

F [u] := sup

α∈A

n

− tr 

a(x , α)D 2 u(x ) 

− b(x, α) · Du(x) o , in Ω ⊆ R n bounded connected open set with C 2 boundary, A ⊆ R m closed, a ≡ σσ T ≥ 0

b : Ω × A → R n , σ : Ω × A → M n×r , r ≥ 1 Associated controlled diffusion process

 

 

dX t α

·

= b(X t α

·

, α t )dt + √

2σ(X t α

·

, α t )dW t X 0 α

·

= x ∈ Ω,

W t r −dimensional Brownian motion.

(3)

Outline

Solutions of F [u] ≤ 0 or F [u] ≥ 0 in Ω with an "infinite boundary condition" are constant.

Application 1 - Stochastic control with exit time:

v (x ) := inf

α

·

∈A E [φ(X τ α

α··

x

)] = explicit constant, where τ x α

·

is the first exit time of the process X t α

·

from Ω.

Application 2 - Ergodic HJB equation:

F [χ] − l(x ) = c in Ω, + "infinite boundary condition" on ∂Ω

in the unknowns (c, χ) has a unique solution (χ up to additive

constants).

(4)

Standing assumptions

(H) Hölder continuity of data

 

 

|b(x, α) − b(y , α)| ≤ C|x − y | η , 0 < η ≤ 1

|σ(x, α) − σ(y , α)| ≤ B|x − y | β , 1 2 < β ≤ 1 (SMP) Strong Maximum Principle:

u ∈ USC(Ω) solving F [u] ≤ 0 in Ω and ∃ x o such that u(x o ) = max u =⇒ u is constant.

Sufficient conditions for (SMP) [M.B. - F. Da Lio 2001-3]:

∀ x ∃ α x ∈ A such that

either a(x , α x ) > 0 (strict ellipticity for some control) or the operator tr a(x , α x )D 2 · 

is Hörmander hypoelliptic.

(5)

Standing assumptions continued

Define

d (x ) := dist(x , R n \Ω) − dist(x, Ω)

(B1) ∃ δ, k > 0, γ < 2β − 1 such that ∀x ∈ ∂Ω, ∃ α ∈ A :

 

 

σ T (x , α)Dd (x ) = 0, (a)

b(x , α) · Dd (x ) + tr(a(x , α)D 2 d (x )) ≥ k d γ (x ), ∀x ∈ Ω ∩ B δ (x ) (b) Meaning: (a): diffusion with control α degenerates in the direction n(x ) = Dd (x ), but

(b) b(x , α) points inward Ω in a neighborhood of x . Sufficient condition for (b): (a) +

b(x , α) · Dd (x ) + tr(a(x , α)D 2 d (x)) ≥ k > 0.

N.B.: (B1) =⇒ Ω viable, or weakly invariant.

(6)

A Liouville-type property 1: subsolutions

Theorem (1)

(H), (SMP), (B1), u ∈ USC(Ω) viscosity subsolution F [u] ≤ 0 in Ω, and

(BC1) lim sup

x →∂Ω

u(x )

− log d (x) ≤ 0,

=⇒ u is constant.

Lemma (Lyapunov-like function)

∀ M ≥ 0, ∃ δ > 0 such that

F [− log(d (x))] > M, ∀ x ∈ Ω δ ,

where Ω δ := {x ∈ Ω | d (x ) < δ}.

(7)

Proof of the Lyapunov Lemma

Choose x ∈ ∂Ω : d (x ) = |x − x | and Dd (x ) = Dd (x ), and α ∈ A such that σ T (x , α)Dd (x ) = 0 by (B1).

For w (x ) := − log(d (x )) ,

Dw = −Dd /d , D 2 w = −D 2 d /d + Dd ⊗ Dd /d 2 , =⇒

F [w ](x ) = sup

α∈A



b(x , α) · Dd (x ) d (x ) + tr



a(x , α) D 2 d (x ) d (x )



− 1

d 2 (x ) |σ T (x , α)Dd (x )| 2



≥ 1

d (x )



b(x , α) · Dd (x ) + tr(a(x , α)D 2 d (x ) 

− 1

d 2 (x ) |σ T (x , α)Dd (x )| 2

≥ kd γ−1 (x ) − B 2 d 2β−2 (x ) ≥ M

by (H) and γ < 2β − 1 , for d small enough.

(8)

Proof of the Lyapunov Lemma

Choose x ∈ ∂Ω : d (x ) = |x − x | and Dd (x ) = Dd (x ), and α ∈ A such that σ T (x , α)Dd (x ) = 0 by (B1).

For w (x ) := − log(d (x )) ,

Dw = −Dd /d , D 2 w = −D 2 d /d + Dd ⊗ Dd /d 2 , =⇒

F [w ](x ) = sup

α∈A



b(x , α) · Dd (x ) d (x ) + tr



a(x , α) D 2 d (x ) d (x )



− 1

d 2 (x ) |σ T (x , α)Dd (x )| 2



≥ 1 d (x )



b(x , α) · Dd (x ) + tr(a(x , α)D 2 d (x ) 

− 1

d 2 (x ) |σ T (x , α)Dd (x )| 2

≥ kd γ−1 (x ) − B 2 d 2β−2 (x ) ≥ M

by (H) and γ < 2β − 1 , for d small enough.

(9)

Proof of the Theorem

Set

U ε (x ) := u(x ) − max

Ω\Ω

δ

u + ε log d (x ) Then

U ε (x ) = ε log d (x ) < 0 if d (x ) = δ < 1 lim x →∂Ω U ε (x ) = −∞ by (BC1).

=⇒ U ε < 0 in Ω δ for δ small, because, if not, u − εw has a max at x o ∈ Ω δ , but

F [εw ](x o ) = εF [w ] > 0 by the Lemma, contradiction with F [u] ≤ 0.

=⇒ u(x ) ≤ max Ω\Ω

δ

u − ε log d (x ) in Ω δ ∀ ε > 0

=⇒ max u ≤ max Ω\Ω

δ

u.

(SMP) =⇒ u ≡ constant.

(10)

A Liouville-type property 2: supersolutions

(B2) ∃ δ, k > 0, γ < 2β − 1 such that ∀x ∈ ∂Ω, ∀α ∈ A ( σ T (x , α)Dd (x ) = 0,

b(x , α) · Dd (x ) + tr(a(x , α)D 2 d (x )) ≥ k d γ (x ), ∀x ∈ Ω δ Strong Minimum Principle:

u ∈ LSC(Ω) solving F [u] ≥ 0 in Ω and ∃ x o such that u(x o ) = min u =⇒ u is constant.

Sufficient conditions [M.B. - F. Da Lio]: a(x , α) > 0 ∀ x ∈ Ω, α ∈ A.

N.B.: (B2) =⇒ Ω invariant.

Theorem (1bis)

(H), (SMinP), (B2), u ∈ LSC(Ω) visco. supersol. F [u] ≥ 0 in Ω, and

(BC2) lim sup

x →∂Ω

u(x )

− log d (x) ≥ 0,

=⇒ u is constant.

(11)

Application 1: control with exit time

Assume

(L) η = β = 1 in (H), i.e., b, σ Lipschitz uniformly in α.

=⇒ ∀α . ∈ A admissible control, exists a unique trajectory of the control diffusion process

( dX t α

·

= b(X t α

·

, α t )dt + √

2σ(X t α

·

, α t )dW t

X 0 α

·

= x ∈ Ω, Define

τ x α

·

:= inf{t ≥ 0 | X t α

·

6∈ Ω} ∈ [0, +∞]

G(x , α) :=

( φ(X τ α

α··

x

) τ x α

·

< +∞

0 τ x α

·

= +∞, v (x ) := inf

α

·

∈A E [G(x , α)]

By a Dynamic Programming Principle we expect F [v ] = 0 in Ω and

then v ≡ constant under the assumptions of Thm. 1.

(12)

Application 1: control with exit time

Assume

(L) η = β = 1 in (H), i.e., b, σ Lipschitz uniformly in α.

=⇒ ∀α . ∈ A admissible control, exists a unique trajectory of the control diffusion process

( dX t α

·

= b(X t α

·

, α t )dt + √

2σ(X t α

·

, α t )dW t

X 0 α

·

= x ∈ Ω, Define

τ x α

·

:= inf{t ≥ 0 | X t α

·

6∈ Ω} ∈ [0, +∞]

G(x , α) :=

( φ(X τ α

α··

x

) τ x α

·

< +∞

0 τ x α

·

= +∞, v (x ) := inf

α

·

∈A E [G(x , α)]

By a Dynamic Programming Principle we expect F [v ] = 0 in Ω and

then v ≡ constant under the assumptions of Thm. 1.

(13)

Theorem (2)

Assume (L), (SMP), (B1), and

A compact and {(b(x , α), a(x , α)) | α ∈ A} convex ∀ x ∈ Ω

∃ x ∈ ∂Ω, α ∈ A such that φ(x) = min φ and

(ND)

( either σ T (x , α)Dd (x )6=0, (a)

or b(x , α) · Dd (x ) + tr(a(x , α)D 2 d (x ))< 0. (b) Then

v (x ) = min{min φ, 0} ∀x ∈ Ω.

Rmk.: (ND) (a) is a non-degeneracy condition of the diffusion at x ∈ ∂Ω,

(b) says that b(x , a) points outward Ω at x :

the control α "does the opposite" than α in condition (B1).

(14)

Theorem (2)

Assume (L), (SMP), (B1), and

A compact and {(b(x , α), a(x , α)) | α ∈ A} convex ∀ x ∈ Ω

∃ x ∈ ∂Ω, α ∈ A such that φ(x) = min φ and

(ND)

( either σ T (x , α)Dd (x )6=0, (a)

or b(x , α) · Dd (x ) + tr(a(x , α)D 2 d (x ))< 0. (b) Then

v (x ) = min{min φ, 0} ∀x ∈ Ω.

Rmk.: (ND) (a) is a non-degeneracy condition of the diffusion at x ∈ ∂Ω,

(b) says that b(x , a) points outward Ω at x :

the control α "does the opposite" than α in condition (B1).

(15)

Application 2: ergodic HJB equation

Further data: running cost : R n × A → R satisfying (H’) |l(x, α) − l(y , α)| ≤ B|x − y | γ .

Ergodic H-J-Bellman equation:

(EBE) sup

α∈A

n

− tr 

a(x , α)D 2 u(x )



− b(x, α) · Du(x) − l(x , α) o

= c in Ω + boundary conditions, in the unknowns (c, u).

Motivations: ergodic stochastic control (with state constraints), homogenization, singular perturbations,

weak KAM theory, Mean Field Games,....

Notation: H(x , Du, D 2 u) := left-hand side of (EBE).

(16)

Theorem (3)

Assume (H), (H’), (B2) (i.e., Ω invariant), and a(x , α) > 0 ∀ x ∈ Ω, α ∈ A. Then

∃ unique c ∈ R such that (EBE) has a visco. sol. χ :

(BC) lim

x →∂Ω

χ(x )

− log d (x) = 0,

χ ∈ C 2 (Ω) and is unique up to addition of constants.

The proof is constructive: appoximate (EBE) with (λBE) λu λ + H(x , Du λ , D 2 u λ ) = 0, x ∈ Ω,

for λ > 0, the HJB equation of discounted, infinite horizon stochastic control with state constrained in Ω.

Idea: let λ → 0+ .

(17)

Proposition

Under the assumptions of Thm. 3

1

∃ unique bounded solution u λ ∈ C 2 (Ω) of (λBE) and ku λ k ≤ klk ∞ /λ;

2

∀ K ⊂⊂ Ω, ∀ ˜ x ∈ Ω, the family (u λ − u λ (˜ x )) λ∈(0,1] is bounded in C 1,θ (K ), for some θ ∈ (0, 1).

3

for all h > 0, there exists δ h > 0 such that h log(d (x )) + min Ω\Ω

δh

u λ ≤ u λ (x ) ≤ −h log(d (x)) + max Ω\Ω

δh

u λ .

∀λ ∈ (0, 1], x ∈ Ω δ

h

.

Main ingredient of the proof: Krylov-Safonov estimates for uniformly

elliptic operators.

(18)

Proposition

Under the assumptions of Thm. 3

1

∃ unique bounded solution u λ ∈ C 2 (Ω) of (λBE) and ku λ k ≤ klk ∞ /λ;

2

∀ K ⊂⊂ Ω, ∀ ˜ x ∈ Ω, the family (u λ − u λ (˜ x )) λ∈(0,1] is bounded in C 1,θ (K ), for some θ ∈ (0, 1).

3

for all h > 0, there exists δ h > 0 such that h log(d (x )) + min Ω\Ω

δh

u λ ≤ u λ (x ) ≤ −h log(d (x)) + max Ω\Ω

δh

u λ .

∀λ ∈ (0, 1], x ∈ Ω δ

h

.

Main ingredient of the proof: Krylov-Safonov estimates for uniformly

elliptic operators.

(19)

Proposition

Under the assumptions of Thm. 3

1

∃ unique bounded solution u λ ∈ C 2 (Ω) of (λBE) and ku λ k ≤ klk ∞ /λ;

2

∀ K ⊂⊂ Ω, ∀ ˜ x ∈ Ω, the family (u λ − u λ (˜ x )) λ∈(0,1] is bounded in C 1,θ (K ), for some θ ∈ (0, 1).

3

for all h > 0, there exists δ h > 0 such that h log(d (x )) + min Ω\Ω

δh

u λ ≤ u λ (x ) ≤ −h log(d (x)) + max Ω\Ω

δh

u λ .

∀λ ∈ (0, 1], x ∈ Ω δ

h

.

Main ingredient of the proof: Krylov-Safonov estimates for uniformly

elliptic operators.

(20)

Steps of the proof of Thm. 3

λu λ (˜ x ) → −c as λ → 0+

u λ − u λ (˜ x ) → χ solution of (EBE), locally uniformly on a subsequence;

χ satisfies (BC) by point 3 of the Proposition.

Uniqueness: (c 1 , χ 1 ), (c 2 , χ 2 ) solutions of (EBE), w.l.o.g. c 1 ≥ c 2 . F [χ 1 − χ 2 ] = sup

α∈A

n

− tr(a(x, α)D 2 (χ 1 − χ 2 )) − b(x , α) · D(χ 1 − χ 2 ) o

≥ sup

α∈A

n

− tr(a(x, α)D 2 χ 1 ) − b(x , α) · Dχ 1 − l(x, α) o + inf

α∈A

n

tr(a(x , α)D 2 χ 2 )) + b(x , α) · Dχ 2 + l(x , α) o

= c 1 − c 2 ≥ 0.

By Theorem 1bis χ 1 − χ 2 ≡ constant, and then c 1 = c 2 .

(21)

Stochastic control formula for c

Corollary

Assume in addition (L). Then c = lim

λ→0+ inf

α

·

∈A E

 λ

Z ∞ 0

e −λt l(X t α

·

, α t )dt

 , uniformly w.r.t. x = X 0 α

·

,

i.e., c = vanishing discount limit of minimum long-time average cost, for all initial positions x .

Remark. Uniqueness holds also in the larger class:

for some κ > 0

lim

x →∂Ω χ(x )d (x ) κ = 0.

(22)

Stochastic control formula for c

Corollary

Assume in addition (L). Then c = lim

λ→0+ inf

α

·

∈A E

 λ

Z ∞ 0

e −λt l(X t α

·

, α t )dt

 , uniformly w.r.t. x = X 0 α

·

,

i.e., c = vanishing discount limit of minimum long-time average cost, for all initial positions x .

Remark. Uniqueness holds also in the larger class:

for some κ > 0

lim

x →∂Ω χ(x )d (x ) κ = 0.

(23)

Developments

D. Castorina, A. Cesaroni and L. Rossi studied the evolutive problem u t + H(x , Du, D 2 u) = 0

with initial conditions and (infinite) boundary conditions, and showed u(x , t) − ct → χ(x ) + k , as t → +∞,

where (c, χ) solves (EBE) and k ∈ R depends on u(x, 0).

Using that χ(x ) − ct is a solution of the parabolic equation ∀ t ∈ R,

they also show that χ is bounded, which improves the existence part

of Thm. 3.

(24)

Some references

M.B., A. Cesaroni, L. Rossi: Nonexistence of nonconstant solutions of some degenerate Bellman equations....

ESAIM-COCV in print

D. Castorina, A. Cesaroni and L. Rossi: A parabolic HJB equation degenerating at the boundary, Comm. Pure Appl. Anal. 2016 J.M. Lasry, P.-L. Lions: Nonlinear elliptic equations with singular boundary conditions and stochastic control with state

constraints, I. Math. Ann. 1989

T. Leonori, A. Porretta: Gradient bounds for elliptic problems

singular at the boundary. Arch. Ration. Mech. Anal. 2011

A. Arapostathis, V.S. Borkar, M.K. Ghosh: Ergodic control of

diffusion processes. Cambridge University Press 2012.

(25)

Thanks for your attention!

Riferimenti

Documenti correlati

[r]

At the origins of SLA research, it was assumed that the L2 learner transferred as much as possible from the L1, and that transfer was largely, if not solely,

Partendo dalla superficie originaria del deposito, egli scavò una grande buca ovale, che gli permise di portare alla luce la camera, oggi a lui intitolata, dove realizzò

Linee cellulari utilizzate

The last point, about the impact of the establishment and development of dip- lomatic relations between China and France on the relations between China and Europe: the establishment

Abbreviations: IBTR, ipsilateral breast tumor recurrences; PBI, partial breast irradiation; 3D-CRT, three-dimensional conformal radiotherapy; IMRT, intensity modulated

the analysis of the selected masonry, by means of the study of the tech- niques, the archaeological survey, the measure and the material of the natural stone elements, bricks

Exosomes purified from HEK cell line and the EV depleted supernatant (SN, as negative control) were analyzed with NTA (Supporting Information, Figure SI1) to determine the