This thesis has been submitted in fulfilment of the requirements for a postgraduate degree (e.g. PhD, MPhil, DClinPsychol) at the University of Edinburgh. Please note the following terms and conditions of use: This work is protected by copyright and other intellectual property rights, which are retained by the thesis author, unless otherwise stated. A copy can be downloaded for personal non-commercial research or study, without prior permission or charge. This thesis cannot be reproduced or quoted extensively from without first obtaining permission in writing from the author. The content must not be changed in any way or sold commercially in any format or medium without the formal permission of the author. When referring to this work, full bibliographic details including the author, title, awarding institution and date of the thesis must be given. On Parabolic Stochastic Integro-Differential Equations Existence, Regularity, and Numerics James-Michael Leahy Doctor of Philosophy University of Edinburgh June 2015 Declaration I hereby declare that this thesis was composed by myself and that the work contained herein is my own, except where explicitly stated otherwise. Further, I declare that this work has not been submitted for any other degree or professional qualification. James-Michael Leahy June 2015 iii Abstract In this thesis, we study the existence, uniqueness, and regularity of systems of degener- ate linear stochastic integro-differential equations (SIDEs) of parabolic type with adapted coefficients in the whole space. We also investigate explicit and implicit finite difference schemes for SIDEs with non-degenerate diffusion. The class of equations we consider arise in non-linear filtering of semimartingales with jumps. In Chapter 2, we derive moment estimates and a strong limit theorem for space inverses of stochastic flows generated by Lévy driven stochastic differential equations (SDEs) with adapted coefficients in weighted Hölder norms using the Sobolev embedding theorem and the change of variable formula. As an application of some basic properties of flows of Weiner driven SDEs, we prove the existence and uniqueness of classical solutions of lin- ear parabolic second order stochastic partial differential equations (SPDEs) by partition- ing the time interval and passing to the limit. The methods we use allow us to improve on previously known results in the continuous case and to derive new ones in the jump case. Chapter 3 is dedicated to the proof of existence and uniqueness of classical solutions of degenerate SIDEs using the method of stochastic characteristics. More precisely, we use Feynman-Kac transformations, conditioning, and the interlacing of space inverses of stochastic flows generated by SDEs with jumps to construct solutions. In Chapter 4, we prove the existence and uniqueness of solutions of degenerate linear stochastic evolution equations driven by jump processes in a Hilbert scale using the varia- tional framework of stochastic evolution equations and the method of vanishing viscosity. As an application, we establish the existence and uniqueness of solutions of degenerate linear stochastic integro-differential equations in the L2-Sobolev scale. Finite difference schemes for non-degenerate SIDEs are considered in Chapter 5. Specifically, we study the rate of convergence of an explicit and an implicit-explicit finite difference scheme for linear SIDEs and show that the rate is of order one in space and order one-half in time. Key words: Stochastic flows, stochastic differential equations (SDEs), Lévy processes, strong-limit the- orem, stochastic partial differential equations (SPDEs), degenerate parabolic type, parabolic stochastic integro-differential equations (SIDEs), partial integro-differential equations (PIDEs), non-local operators, method of stochastic characteristics, Ito-Wentzell formula, stochastic evolution equations, vanishing viscos- ity, finite difference schemes AMS subject classifications (MSC2010). 60H10, 60J75, 60F15, 60H15, 34F05, 35F40, 35R60, 35K15, 35K65, 60H20, 45K05, 65M06, 65M12 v Lay Summary In this thesis, we investigate a class of equations governing random processes that depend on time and space. These equations are a generalization of deterministic parabolic partial differential equations (PDEs). The prototypical example of a parabolic PDE is the heat equation, which is an equation that governs the evolution of heat in some medium in time and space. Randomness can be introduced to the heat equation through a random source and sink of heat at certain points of time and space, a random initial heat profile, and a random conductivity coefficient of the medium. The heat equation is an example of a diffusion equation, which is an equation that describes the density of particles as they diffuse according to some law. The physical law that governs the heat equation is called Fick’s law and it describes only a special type of diffusion. The equations that we study, even if we take them to be deterministic, allow for a much wider range of diffusions than the prototypical heat equation. When investigating an equation, a natural first question to ask is whether there exists a solution to the equation in some well-defined sense. Moreover, if there exists a solution, then one ought to ask whether it is unique, and if so, what properties it has. The equations we consider have range of possible inputs, and thus we want to know how the properties of the solution depend on the inputs. There are many properties of the equations to explore, but we consider only a property that is referred to as regularity. Regularity is characterized by a class of spaces to which the solution and inputs belong that are ordered by inclusion; smaller spaces correspond to higher regularity. It is natural to expect that if you take more regular inputs, then the solution should be more regular. This is the case for the equations we consider. It is rare for the solution of an equation we study to have a closed form expression of time, space, and randomness. Nevertheless, it is still possible to prove that there exists a unique solution and to study the regularity of that solution. In fact, there are different approaches to accomplish this objective and each way has its own advantage. We will dis- cuss two such approaches in this thesis. Since we can not expect a closed form expression for the solutions, it also practical and interesting to develop some method to approximate the solutions and to prove that the approximate solutions are close in some well-defined sense to the true solution that we know exists. In the final chapter, we describe a simple approximation scheme for a special subclass of the equations considered. The approxi- mate solutions are defined on some countable set of points in time and space and satisfy an equation that can be solved by simple algebraic manipulations for some realization of the random variables in the equation. vii I dedicate this thesis to my family. ix Acknowledgements First, I would like to thank my supervisor, István Gyöngy, for the support and guidance he has provided me during my time at the University of Edinburgh. I am indebted to Remigijus Mikulevicˇius for the opportunity to work with him at the University of Southern California in the Fall of 2013 and to collaborate with him on numerous projects. I have also benefited from having Kontantinos Dareiotis as a friend, flatmate, and collaborator. The research contained in this thesis would not have been possible without funding from the Principal’s Career Development Scholarship and the Edinburgh Global Research Scholarship and support from Gill Law, Graduate School Administrator. I have been fortunate enough to participate in numerous discussions with other students that have directly contributed to my academic development. In particular, I would like to mention: Eric Hall, Máté Gerencsèr, Brian Hamilton, Marc Ryser, David Cottrell, Andrea Meireles Rodrigues, Sebastian Vollmer, Chaman Kumar, and José Rodríguez Villarreal. In fact, it was Marc Ryser who introduced me to Stochastic Partial Differential Equations and suggested I apply to the University of Edinburgh to work with István Gyöngy. I would like to thank my friends and family, especially my mother, grandparents, and Aunt Mary Ann and Uncle Greg, for supporting all my endeavours. Last, I am grateful for the love, patience, and encouragement Bihotz Barrenetxea Dominguez has shown me. xi Contents Declaration iii Abstract v Lay Summary vii Acknowledgements xi 1 Introduction 1 2 Properties of space inverses of stochastic flows 7 2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 2.2 Statement of main results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 2.3 Properties of stochastic flows . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 2.3.1 Homeomorphism property of flows . . . . . . . . . . . . . . . . . . . 14 2.3.2 Moment estimates of inverse flows: Proof of Theorem 2.2.1 . . . . . 18 2.3.3 Strong limit of a sequence of flows: Proof of Theorem 2.2.3 . . . . . 23 2.4 Classical solutions of degenerate SPDEs: Proof of Theorem 2.2.4 . . . . . . 27 2.5 Appendix. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34 3 The method of stochastic characteristics for parabolic SIDEs 39 3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 3.2 Statement of main results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42 3.3 Proof of main theorems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48 3.3.1 Proof of uniqueness for Theorem 3.2.2 . . . . . . . . . . . . . . . . . 48 3.3.2 Small jump case . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53 3.3.3 Adding free and zero-order terms. . . . . . . . . . . . . . . . . . . . . 57 3.3.4 Adding uncorrelated part (Proof of Theorem 3.2.2) . . . . . . . . . . 63 3.3.5 Proof of Theorem 3.2.5. . . . . . . . . . . . . . . . . . . . . . . . . . . 65 3.4 Appendix. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68 3.4.1 Martingale and point measure moment estimates . . . . . . . . . . . 69 3.4.2 Optional projection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70 3.4.3 Estimates of Hölder continuous functions . . . . . . . . . . . . . . . . 71 3.4.4 Stochastic Fubini theorem . . . . . . . . . . . . . . . . . . . . . . . . . 82 3.4.5 Itô-Wentzell formula . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84 xiii 4 The L2-Sobolev theory for parabolic SIDEs 89 4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89 4.2 Degenerate linear stochastic evolution equations . . . . . . . . . . . . . . . . 91 4.2.1 Basic notation and definitions . . . . . . . . . . . . . . . . . . . . . . . 91 4.2.2 Statement of main results . . . . . . . . . . . . . . . . . . . . . . . . . 95 4.2.3 Proof of Theorem 4.2.4. . . . . . . . . . . . . . . . . . . . . . . . . . . 97 4.3 The L2-Sobolev theory for degenerate SIDEs . . . . . . . . . . . . . . . . . . 106 4.3.1 Statement of main results . . . . . . . . . . . . . . . . . . . . . . . . . 106 4.3.2 Proof of Theorem 4.3.2. . . . . . . . . . . . . . . . . . . . . . . . . . . 111 4.3.3 Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125 5 A finite difference scheme for non-degenerate parabolic SIDEs 131 5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 131 5.2 Statement of main results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 134 5.3 Simulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141 5.4 Auxiliary results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 144 5.5 Proof of the main theorems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 158 6 Conclusions and future work 165 References 167 xiv Chapter 1 Introduction The subject of this thesis is parabolic linear stochastic integro-differential equations (SIDEs). These equations arise in the study of non-linear filtering of semimartingales, scaled lim- its of particle systems, flows of stochastic diffeomorphisms, and mathematical biology, physics, and finance. For the author, the strongest impetus to the advancement of the theory of SIDEs is the problem of non-linear filtering of jump diffusions. The first derivation of the filtering equations for semimartingales with jumps is due to B. Grigelionis in [Gri72]. The reduced form filtering equations, which are linear SIDEs, were derived in [Gri76]. A proof of existence and uniqueness of solutions to the reduced form filtering equations was given in [Tin77b] by E. Tinfavicˇius under the assumption of non-degenerate stochastic parabolicity. While in [Tin77b] essentially the variational ap- proach of SPDEs was used, one important aspect of this approach was missing, namely Itô’s formula for the square of the norm for jump processes. Such a result is used to conclude that the variational solution of a stochastic evolution equation is càdlàg with values in the pivot space and to get an energy estimate of the supremum in time of the norm of the solution in the pivot space. In [GK81], I. Gyöngy and N.V. Krylov derived Itô’s formula for the square of the norm in the jump case, and the corresponding exis- tence and uniqueness result for monotone stochastic evolution equations with jumps was studied in [Gyö82]. In [Gri82], B. Grigelionis applied the result in [GK81] to complete the variational existence and uniqueness result for the reduced form equations in the non- degenerate setting that began in [Tin77b]. The reader that is interested in the derivation of the reduced form filtering equations for semimartingales with jumps is advised to consult the article [Gri82] and the review article [GM11]. It is also worth mentioning that the stochastic evolution equations driven by addi- tive càdlàg martingale noise were studied by G. Pistone and M. Métivier in Section 5 of [MP76] using the semigroup approach. One important aspect of the reduced form filter- ing equation is that, in general, the principal part of the operators in the drift of these equations depend on space and time and are random. The standard semigroup approach to stochastic equations in infinite dimensions cannot treat these type of equations, since if the semigroup generated by the principal part is random, the stochastic convolution does not make sense as an ordinary Itô integral. In a recent work, M. Pronk and M. Veraar [PV14] showed that it is possible to extend the semigroup approach to treat equations where the 1 2 Chapter 1. Introduction principal part of stochastic evolution equation is random. However, at the time of writing, there still seems to be some limitations in this approach; for example, some regularity in time is needed for the diffusion coefficient in the stochastic heat equation when applying their theory. We emphasize that this is not needed in the variational approach. This thesis is dedicated to the the proof of existence, uniqueness, and regularity of fully degenerate linear SIDEs. Specifically, we derive a theory for these equations in Hölder spaces using the method of stochastic characteristics and a theory in L2-Sobolev spaces using the variational approach of stochastic evolution equations and the method of vanishing viscosity. We also investigate finite difference approximations of SIDEs with non-degenerate diffusion. Let us state the general form of the equation that we will investigate in this thesis. Let (Ω,F ,F = (Ft)t≥0,P) be a complete filtered probability space satisfying the usual condi- tions of right-continuity and completeness. For each real number T > 0, we let RT and PT be the F-progressive and F-predictable sigma-algebra on Ω × [0,T ], respectively. For our driving processes, we take a sequence wϱt , t ≥ 0, ϱ ∈ N, of independent one-dimensional F- adapted Wiener processes and a F-adapted Poisson random measure p(dt, dz) on (R+×Z1, B(R+)⊗Z1) with intensity measure π1(dz)dt, where (Z1,Z1, π1) is a sigma-finite measure space. Denote by q(dt, dz) = p(dt, dz) − π1(dz)dt the compensated Poisson random mea- sure. Let D1, E1,V1 ∈ Z be disjoint Z1-measurable subsets such that D1 ∪ E1 ∪ V1 = Z1 and π(V1) < ∞. Let (Z2,Z2, π2) be a sigma-finite measure space and D2, E2 ∈ Z2 be dis- jointZ2-measurable subsets such that D2 ∪ E2 = Z2. Fix an arbitrary positive real number T > 0 and integers d1, d2 ≥ 1. Let α ∈ (0, 2] and let φ : Ω × Rd1 → Rd2 be F0 ⊗ B(Rd1)-measurable. We consider the system of SIDEs on [0,T ] × Rd1 given by dult = ( (L1;lt +L 2;l t )ut + 1[1,2](α)b i;ll¯ t ∂iu l¯ t + c ll¯ t u l¯ t + f l t ) dt + ( N lϱt ut + g lϱ t ) dwϱt + ∫ Z1 ( I1;lt,z ut− + h l t(z) ) [1D1(z)q(dt, dz) + 1E1∪V1(z)p(dt, dz)], t ≤ T, ul0 = φ l, l ∈ {1, . . . , d2}, SIDE where for ϕ ∈ C∞c (R d1; Rd2), k ∈ {1, 2}, and l ∈ {1, . . . , d2}, Lk;lt ϕ(x) : = 1{2}(α)a k;i j t (x)∂i jϕ l(x) + ∫ Dk ρ k;ll¯ t (x, z) ( ϕl¯(x + Hkt (x, z)) − ϕ l¯(x) ) πk(dz) + ∫ Dk ( ϕl(x + Hkt (x, z)) − ϕ l(x) − 1(1,2](α)H k;i t (x, z)∂iϕ l(x) ) πk(dz) + 1{2}(k) ∫ E2 ( (Ill¯d2 + ρ 2;ll¯ t (x, z))ϕ l¯(x + H2t (x, z)) − ϕ l(x) ) π2(dz), N lϱt ϕ(x) : = 1{2}(α)σ iϱ t (x)∂iϕ l(x) + υll¯ϱt (x)ϕ l¯(x), ϱ ∈ N, 3Ilt,zϕ(x) : = (I ll¯ d2 + ρ 1;ll¯ t (x, z))ϕ l¯(x + H1t (x, z)) − ϕ l(x), and ∫ Dk ( |Hkt (x, z)| α + |ρkt (x, z)| 2 ) πk(dz) + ∫ Ek ( |Hkt (x, z)| 1∧α + |ρkt (x, z)| ) πk(dz) < ∞. The summation convention with respect to repeated indices i, j ∈ {1, . . . , d1}, l¯ ∈ {1, . . . , d2}, and ϱ ∈ N is used here and below. The d2 × d2 dimensional identity matrix is denoted by Id2 . For a subset A of a larger set X, 1A denotes the {0, 1}-valued function taking the value 1 on the set A and 0 on the complement of A. We assume that for each k ∈ {1, 2}, σkt (x) = (σ k;iϱ t (ω, x))1≤i≤d1, ϱ∈N, bt(x) = (b i t(ω, x))1≤i≤d1 , ct(x) = (c ll¯ t (ω, x))1≤l,l¯≤d2 , υkt (x) = (υ k;ll¯ϱ t (ω, x))1≤l,l¯≤d2, ϱ∈N, ft(x) = ( f i t (ω, x))1≤i≤d2 , gt(x) = (g iϱ t (ω, x))1≤i≤d2, ϱ∈N, are random fields on Ω× [0,T ]×Rd1 that are RT ⊗B(Rd1)-measurable. For each k ∈ {1, 2}, we assume that Hkt (x, z) = (H k;i t (ω, x, z))1≤i≤d1 , ρ k t (x, z) = (ρ k;ll¯ t (ω, x, z))1≤l,l¯≤d2 , are random fields on Ω×[0,T ]×Rd1×Zk that are PT ⊗B(Rd1)⊗Zk-measurable. Moreover, we assume that ht(x, z) = (hit(ω, x, z))1≤i≤d2 is a random field on Ω × [0,T ] × R d1 × Z1 that is PT ⊗ B(Rd1) ⊗Z1-measurable. This thesis is organized as follows. Chapter 2 is dedicated to establishing some prop- erties of space inverses of stochastic flows generated by an SDEs with jumps. These properties play an important role in Chapter 3 when we construct classical solutions of (SIDE). Specifically, in Chapter 2, we derive moment estimates and a strong limit theorem for space inverses of stochastic flows generated by jump SDEs with adapted coefficients in weighted Hölder norms. We also give a relatively simple, but novel, proof of existence and uniqueness of classical solutions of a degenerate SPDE (i.e. (SIDE) with a1;i j = σiϱσ jϱ, a2 ≡ H1 ≡ H2 ≡ ρ1 ≡ ρ2 ≡ b ≡ c ≡ f ≡ υ ≡ g ≡ h ≡ 0, and φ(x) = x). Our method of proof allows us to prove existence and uniqueness when σ can degenerate on set of pos- itive probability under less regularity than done previously. If σ is non-degenerate, then much more can be done, and we refer the reader to the seminal work [FGP10]. The focus of Chapter 3 is to give a complete proof of existence and uniqueness of clas- sical solutions of (SIDE) with ai j = σiϱσ jϱ + σ2;iϱσ2; jϱ and with Hölder assumptions on the coefficients and data using Feynman-Kac transformations, conditioning, and the inter- lacing of space-inverses of stochastic flows associated with the equations. We emphasize that σ,σ2,H, and H2 can vanish on a set of positive probability. In Chapter 4, we prove the existence and uniqueness of solutions of degenerate linear 4 Chapter 1. Introduction stochastic evolution equations driven by jump processes in a Hilbert scale using the varia- tional framework of stochastic evolution equations and the method of vanishing viscosity. This result extends the work of B. Rozovskii in [Roz90] from the continuous case to the jump case. As an application of our result, we derive unique solutions of (SIDE) with a1;i j ≡ a2;i j ≡ σ ≡ g ≡ 0, E2 = E1 = V1 = ∅, and α ∈ (1, 2) in the integer L2-Sobolev scale. We confine ourself to an equation without continuous noise and diffusion because the the- ory of Sobolev solutions for equations in this case are well-studied (see, e.g. [KR82], [Roz90], and [GGK14]). The results derived in Chapters 1, 2, and 3 are the fruit of a collaborative effort with Remigijus Mikulevicˇius at the University of Southern California; both R. Mikulevicˇius and the author have contributed substantially to the development of the ideas contained in these chapters. Finally, in Chapter 5, we consider finite difference schemes for (SIDE) with d2 = 1, H1t (x, z) = z, ρ 1 = 1, E1 = V1 = ∅, α = 2, h ≡ H2 ≡ ρ2 ≡ 0, and ai j − σiϱσ jϱ > 0. We show that the L2(Ω)-pointwise rate of convergence is of order one in space and order one-half in time. The results given in this chapter are the fruit of a collaborative effort with Konstantinos Dareiotis at the University of Edinburgh; both K. Dareiotis and the author have contributed substantially to the development of the ideas contained in this chapter. Basic Notation Let N be the set of natural numbers and N0 = {0, 1, . . .} be the set of natural numbers including zero. Let Z be the set of integers. For a topological space (X,X), we denote the Borel sigma-field on X by B(X). For each integer n ≥ 1, let Rn be the n-dimensional Euclidean space and for each x ∈ Rn, denote by |x| the Euclidean norm of x = (x1, . . . , xn). We set R = R1 and for two elements of x, y ∈ R, we denote by x ∨ y the maximum of x and y and by x ∧ y, the minimum of x and y. Let R+ denote the set of non-negative elements of R1. For an integer n ≥ 1 and for each i ∈ {1, . . . , n}, we let ∂i = ∂∂xi be the spatial derivative operator with respect to the direction xi and write ∂i j = ∂i∂ j for each i, j ∈ {1, . . . , n}. We also denote the identity operator by ∂0. For a once differentiable function f = ( f 1 . . . , f n) : Rn → Rn, we denote the gradient of f by ∇ f = (∂ j f i)1≤i, j≤d1 . For a multi-index γ = (γ1, . . . , γd) ∈ {0, 1, 2, . . . , }d1 of length |γ| := γ1 + · · · + γd, denote by ∂γ the operator ∂γ = ∂ γ1 1 · · · ∂ γd d , where ∂ 0 i is the identity operator for all i ∈ {1, . . . , d1}. For i ∈ {1, . . . , d}, let ∂−i = −∂i. For integers d1, d2 ≥ 1, we denote by C∞c (R d1; Rd2) the space of Rd2-valued infinitely differentiable functions with compact support in Rd1 . For each integer n ≥ 1, the norm of an element x of ℓ2(Rn), the space of square- summable Rn-valued sequences, is denoted by |x|. We set ℓ2 = ℓ2(R). For integers n ≥ 1 5and a once differentiable function f = ( f 1ϱ, . . . , f nϱ)ϱ∈N : Rn → ℓ2(Rn), we denote the gradient of f by ∇ f = (∂ j f iϱ)1≤i, j≤n,ϱ∈N and understand it as a function from Rd1 to ℓ2(Rn 2 ). For a Fréchet space χ and fixed time T > 0, we denote by D([0,T ]; χ) the space of χ-valued càdlàg (continuous from the right and limits from the left) functions on [0,T ] and by C([0,T ]2; χ) the space of χ-valued continuous functions on [0,T ] × [0,T ]. The spaces D([0,T ]; χ) and C([0,T ]2; χ) are Fréchet spaces when endowed with the set of the supremum (in time) seminorms, which we assume, unless otherwise specified. The notation N = N(·, · · · , ·) is used to denote a positive constant depending only on the quantities appearing in the parentheses. In a given context, the same letter is often used to denote different constants depending on the same parameter. Moreover, for any function f defined on a set X and taking valued in a linear space Y with zero element 0Y , the notation f ≡ 0 indicates that f (x) = 0Y for all x ∈ X. Chapter 2 Properties of space inverses of stochastic flows 2.1 Introduction Let (Ω,F ,F = (Ft)t≥0,P) be a complete filtered probability space satisfying the usual con- ditions of right-continuity and completeness. Let wϱt , t ≥ 0, ϱ ∈ N, be a sequence of inde- pendent one-dimensional F-adapted Wiener processes. For a sigma-finite measure space (Z,Z, π), we let p(dt, dz) be an F-adapted Poisson random measure on (R+×Z,B(R+)⊗Z) with intensity measure π(dz)dt and denote by q(dt, dz) = p(dt, dz) − π(dz)dt the compen- sated Poisson random measure. For each real number T > 0, we let RT and PT be the F-progressive and F-predictable sigma-algebra on Ω × [0,T ], respectively. Fix a real number T > 0 and an integer d ≥ 1. For each stopping time τ ≤ T , consider the flow Xt = Xt(τ, x), (t, x) ∈ [0,T ] × Rd, generated by the SDE dXt = bt(Xt)dt + σ ϱ t (Xt)dw ϱ t + ∫ Z Ht(Xt−, z)q(dt, dz), τ < t ≤ T, Xt = x, t ≤ τ, (2.1.1) where bt(x) = (bit(ω, x))1≤i≤d and σt(x) = (σ iϱ t (ω, x)1≤i≤d,ϱ∈N are R d-valued RT ⊗ B(Rd)- measurable functions defined on Ω × [0,T ] × Rd and Ht(x, z) = (Hit(ω, x, z))1≤i≤d is an Rd-valued PT ⊗ B(Rd) ⊗ Z-measurable function defined on Ω × [0,T ] × Rd × Z. The summation convention with respect to the repeated index ϱ is used here and below. In this chapter, under Hölder regularity conditions on the coefficients b, σ, and H, we provide a simple and direct derivation of moment estimates of the space inverse of the flow, denoted X−1t (τ, x), in weighted Hölder norms. This is done by applying the Sobolev embedding theorem and the change of variable formula. Using a similar method, we es- tablish a strong limit theorem in weighted Hölder norms for a sequence of flows X(n)t (τ, x) and their inverses X(n);−1t (τ, x) corresponding to a sequence of coefficients (b (n), σ(n),H(n)) converging in weighted Hölder norms. Furthermore, as an application of the diffeomor- phism property of flow, we give a direct derivation of the linear second order degenerate SPDE governing the inverse flow X−1t (τ, x) when H ≡ 0. Specifically, for each τ ≤ T , 7 8 Chapter 2. Properties of space inverses of stochastic flows consider the stochastic flow Yt = Yt(τ, x), (t, x) ∈ [0,T ] × Rd, generated by the SDE dYt = bt(Yt)dt + σ ϱ t (Yt)dw ϱ t , τ < t ≤ T, Yt = x, t ≤ τ. Assume that b and σ have linear growth, bounded first and second derivatives, and that the second derivatives of b and σ are α-Hölder for some α > 0. By partitioning the time interval and using Taylor’s theorem, the Sobolev embedding theorem, and some basic properties of the flow and its inverse, we show that ut(x) = ut(τ, x) := Y−1t (τ, x), (t, x) ∈ [0,T ] × Rd is the unique classical solution of the SPDE given by dut(x) = ( 1 2 σ iϱ t (x)σ jϱ t (x)∂i jut(x) − bˆ i t(x)∂iut(x) ) dt − σiϱt (x)∂iut(x)dw ϱ t , τ < t ≤ T, ut(x) = x, t ≤ τ, (2.1.2) where bˆit(x) = b i t(x) − σ jϱ t (x)∂ jσ iϱ t (x). In Chapter 3, we will use all of the properties of the flow Xt(τ, x) that are established in this first chapter in order to derive the existence and uniqueness of classical solutions of linear parabolic SIDEs. One of the earliest works to investigate the homeomorphism property of flows of SDEs with jumps is by P. Meyer in [Mey81]. In [Mik83], R. Mikulevicˇius extended the properties found in [Mey81] to SDEs driven by arbitrary continuous martingales and random measures. Many other authors have since expanded upon the work in [Mey81], see for example [FK85, Kun04, MB07, QZ08, Zha13, Pri14] and references therein. In [Kun04, Kun86b], H. Kunita studied the diffeomorphism property of the flow Xt(s, x), (s, t, x) ∈ [0,T ]2 × Rd, and in the setting of deterministic coefficients, he showed that for each fixed t, the inverse flow X−1t (s, x), (s, x) ∈ [t,T ] × R d, solves a backward SDE. By estimating the associated backward SDE, one can obtain moment estimates and a strong limit theorem for the inverse flow in essentially the same way that moment estimates are obtained for the direct flow (see, e.g. [Kun86b]). However, this method of deriving mo- ment estimates and a strong limit theorem for the inverse flow uses a time reversal, and thus ,requires that the coefficients are deterministic. In the case H ≡ 0, numerous authors have investigated properties of the inverse flow with random coefficients. In Chapter 2 of [Bis81], Lemma 2.1 and 2.2 of [OP89], and Section 6.1 and 6.2 of [Kun96], properties of Y−1t (τ, x) (i.e. moment estimates, strong limit theorem, and that it solves (2.1.2)) are established by first showing that the inverse flow solves the Stratonovich form SDE for 2.1. Introduction 9 Zt = Zt(τ, x), (t, x) ∈ [0,T ] × Rd, given by dZt(x) = −Ut(Zt(x))bt(x)dt − Ut(Zt(x))σ ϱ t (x) ◦ dw ϱ t , τ < t ≤ T, (2.1.3) Z0(x) = x, τ < t, where Ut(x) = Ut(τ, x) = ∇Yt(τ, x)−1. In order to obtain a strong solution of (2.1.3), condi- tions are imposed to ensure that ∇Ut(x) is locally-Lipschitz in x. In the degenerate setting, the third derivative of bt and σt need to be α-Hölder for some α > 0 to ensure that ∇Ut(x) is locally-Lipschitz in x. In direct contrast to this approach, we first derive properties of the inverse flow under the very assumptions that guarantee Yt(τ, x) is a diffeomorphism (i.e. when the first derivative of the coefficients are α-Hölder for some α > 0), and then we derive the existence of the equation without resorting to the SDE interpretation of the SPDE. Classical solutions of (2.1.2) have been constructed in [Bis81, Kun96] by directly showing that Y−1t (τ, x) solves (2.1.3). As we have mentioned above, this approach requires the third derivatives of bt and σt to be α-Hölder for some α > 0. Yet another approach to deriving existence of classical solutions of (2.1.2) is using the method of time reversal (see, e.g. [Kun96, DPT98]). While this method only requires that the second derivatives of bt and σt are α-Hölder for some α > 0, it does impose that the coefficients are deter- ministic. In [KR82], N.V. Krylov and B.L. Rozvskii derived the existence and uniqueness of generalized solutions of degenerate second order linear parabolic SPDEs in Sobolev spaces using variational approach of SPDEs and the method of vanishing viscosity (see, also, [GGK14] and Chapter 4, Section 2, Theorem 1 in [Roz90]). Thus, by appealing to the Sobolev embedding theorem, this theory can be used to obtain classical solutions of degenerate linear SPDEs. Proposition 1 of Chapter 5, Section 2, in [Roz90] shows that if σ is uniformly bounded and four-times continuously differentiable in x with uniformly bounded derivatives and b is uniformly bounded and three-times continuously differen- tiable with uniformly bounded derivatives, then there exists a classical solution of (2.1.2) and ut(x) = Y−1t (x). This is more regularity than we require and we are also able to obtain solutions in the entire Hölder scale. This chapter is organized as follows. In Section 2.2, we state our notation and main results. Section 2.3 is devoted to deriving moment estimates and a strong limit theorem for the space inverse of a stochastic flow generated by a Lévy driven SDE. In Section 2.4, we show that Y−1t (τ, x) is the unique classical solution of (2.1.2). In Section 2.5, the appendix, auxiliary facts that are used throughout the chapter are discussed. 10 Chapter 2. Properties of space inverses of stochastic flows 2.2 Statement of main results Let us describe some notation that will be used in this chapter. Elements of Rd are un- derstood as column vectors and elements of Rd 2 are understood as matrices of dimension d × d. We denote the transpose of an element x ∈ Rd by x∗. For a Banach space V with norm | · |V , domain (i.e. open connected set) Q of Rd, and continuous function f : Q→ V , we define | f |0;Q;V = sup x∈Q | f (x)|V and [ f ]β;Q;V = sup x,y∈Q,x,y | f (x) − f (y)|V |x − y|βV , β ∈ (0, 1]. For any real number β ∈ R, we write β = [β]− + {β}+, where [β]− is an integer and {β}+ ∈ (0, 1]. For a Banach space V with norm | · |V , real number β > 0, and domain Q of Rd, we denote by Cβ(Q; V) the Banach space of all bounded continuous functions f : Q→ V having finite norm | f |β;Q;V := ∑ |γ|≤[β]− |∂γ f |0;Q;V + ∑ |γ|=[β]− [∂γ f ]{β}+;Q;V . When Q = Rd and V = Rn or V = ℓ2(Rn) for any integer n ≥ 1, we drop the subscripts Q and V from the norm | · |β;Q;V and write | · |β. For a Banach space V and for each β > 0, denote by Cβloc(R d; V) the Fréchet space of continuous functions f : Rd → V satisfying f ∈ Cβ(Q; V) for all bounded domains Q ⊂ Rd. We call a function f : Rd → Rd a Cβloc(R d; Rd)-diffeomorphism if f is a homeomorphism and both f and its inverse f −1 are in Cβloc(R d; Rd). If we do not specify to which space the parameters ω, t, x, y, z and n belong, then we mean ω ∈ Ω, t ∈ [0,T ], x, y ∈ Rd, z ∈ Z, and n ∈ N. Let r1(x) = √ 1 + |x|2, x ∈ Rd. For each real number β > 1, we introduce the following regularity condition on the coefficients b, σ, and H. Assumption 2.2.1 (β). (1) There is a constant N0 > 0 such that for all (ω, t, z) ∈ Ω × [0,T ] × Z, |r−11 bt|0 + |∇bt|β−1 + |r −1 1 σt|0 + |∇σt|β−1 ≤ N0 and |r −1 1 Ht(z)|0 + |∇Ht(z)|β−1 ≤ Kt(z), 2.2. Statement of main results 11 where K : Ω × [0,T ] × Z → R+ is a PT ⊗Z-measurable function satisfying Kt(z) + ∫ Z Kt(z) 2π(dz) ≤ N0, for all (ω, t, z) ∈ Ω × [0,T ] × Z. (2) There are constants η ∈ (0, 1) and Nκ > 0 such that for all (ω, t, x, z) ∈ {(ω, t, x, z) ∈ Ω × [0,T ] × Rd × Z : |∇Ht(ω, x, z)| > η}, | (Id + ∇Ht(x, z))−1 | ≤ Nκ. The following theorem shows that if Assumption 2.2.1 (β) holds for some β > 1, then for any β′ ∈ [1, β], the solution Xt(τ, x) of (2.1.1) has a modification that is a C β′ loc(R dRd) -diffeomorphism and the p-th moments of the weighted β′-Hölder norms of the inverse flow are bounded. This theorem will be proved in the next section. Theorem 2.2.1. Let Assumption 2.2.1(β) hold for some β > 1. (1) For any stopping time τ ≤ T and β′ ∈ [1, β), there exists a modification of the strong solution Xt(τ, x) of (2.1.1), also denoted by Xt(τ, x), such that P-a.s. the mapping Xt(τ, ·) : Rd → Rd is a C β′ loc(R d; Rd)-diffeomorphism, X·(τ, ·), X−1· (τ, ·) ∈ D([0,T ]; Cβ ′ loc(R d; Rd)), and X−1t− (τ, ·) coincides with the inverse of Xt−(τ, ·). Moreover, for all ϵ > 0 and p ≥ 2, there is a constant N = N(d, p,N0,T, β′, ϵ) such that E [ sup t≤T |r−(1+ϵ)1 Xt(τ)| p 0 ] + E [ sup t≤T |r−ϵ1 ∇Xt(τ)| p β′−1 ] ≤ N and a constant N = N(d, p,N0,T, β′, η,Nκ, ϵ) such that E [ sup t≤T |r−(1+ϵ)1 X −1 t (τ)| p 0 ] + E [ sup t≤T |r−ϵ1 ∇X −1 t (τ)| p β′−1 ] ≤ N. (2.2.1) (2) If H ≡ 0, then for all β′ ∈ (1, β), P-a.s. X·(·, ·), X−1· (·, ·) ∈ C([0,T ] 2;Cβ ′ loc(R d; Rd)) and for all ϵ > 0 and p ≥ 2, there is a constant N = N(d, p,N0,T, β′, ϵ) such that E [ sup s,t≤T |r−(1+ϵ)1 Xt(s)| p 0 ] + E [ sup s,t≤T |r−ϵ1 ∇Xt(s)| p β′−1 ] ≤ N and E [ sup s,t≤T |r−(1+ϵ)1 X −1 t (s)| p 0 ] + E [ sup s,t≤T |r−ϵ1 ∇X −1 t (s)| p β′−1 ] ≤ N. Remark 2.2.2. The estimate (2.2.1) is used in Chapter 3 to take the optional projection of a linear transformation of the inverse flow of a jump SDE driven by two independent 12 Chapter 2. Properties of space inverses of stochastic flows Weiner processes and two independent Poisson random measures relative to the filtration generated by one of the Weiner processes and Poisson random measures. Now, let us state our strong limit theorem for a sequence of flows, which will also be proved in the next section. We will use this strong limit theorem in [LM14a] to show that the inverse flow of a jump SDE solves a parabolic SIDE. For each n, consider the stochastic flow X(n)t = X (n) t (τ, x), (t, x) ∈ [0,T ] × R d, generated by the SDE dX(n)t = b (n) t (X (n) t )dt + σ (n)lϱ t (X (n) t )dw ϱ t + ∫ Z H(n)t (X (n) t− , z)q(dt, dz), τ ≤ t ≤ T, X(n)t = x, t ≤ τ. Here we assume that for all n, b(n), σ(n), and H(n) satisfy the same measurability conditions as b, σ, and H, respectively. Theorem 2.2.3. Let Assumption 2.2.1(β) hold for some β > 1 and assume that b(n), σ(n), and H(n) satisfy Assumption 2.2.1 (β) uniformly in n ∈ N. Moreover, assume that dPdt − lim n→∞ ( |r−11 b (n) t − r −1 1 bt|0 + |∇b (n) t − ∇bt|β−1 ) = 0, dPdt − lim n→∞ ( |r−11 σ (n) t − r −1 1 σt|β−1 + |∇σ (n) t − ∇σt|0 ) = 0, and for all (ω, t, z) ∈ Ω × [0,T ] × Z and n ∈ N, |r−11 H (n) t (z) − r −1 1 Ht(z)|0 + |∇H (n) t (z) − ∇Ht(z)|β−1 ≤ K (n)(t, z), where (K(n)t (z))n∈N is a sequence of R+-valued PT ⊗ Z measurable functions defined on Ω × [0,T ] × Z satisfying for all (ω, t, z) ∈ Ω × [0,T ] × Z and n ∈ N, K(n)t (z) + ∫ Z K(n)t (z) 2π(dz) ≤ N0 and dPdt − lim n→∞ ∫ Z K(n)t (z) 2π(dz) = 0. Then for any stopping time τ ≤ T, β′ ∈ [1, β), and all ϵ > 0, and p ≥ 2, we have lim n→∞ ( E [ sup t≤T |r−(1+ϵ)1 X (n) t (τ) − r −(1+ϵ) 1 Xt(τ)| p 0 ] + E [ sup t≤T |r−ϵ1 ∇X (n) t (τ) − r −ϵ 1 ∇Xt(τ)| p β′−1 ]) = 0, lim n→∞ E [ sup t≤T |r−(1+ϵ)1 X (n);−1 t (τ) − r −(1+ϵ) 1 X −1 t (τ)| p 0 ] = 0, and lim n→∞ E [ sup t≤T |r−ϵ1 ∇X (n);−1 t (τ) − r −ϵ 1 ∇X −1 t (τ)| p β′−1 ] = 0. 2.2. Statement of main results 13 Let us introduce our class of solutions for the equation (2.1.1). LetOT be the F-optional sigma-algebra on Ω× [0,T ]. For a each number β′ > 2, let Cβ ′ cts(R d; Rd) be the linear space of all random fields v : Ω × [0,T ] × Rd → Rd such that v is OT ⊗ B(Rd)-measurable and P-a.s. r−λ1 (·)v·(·) is a C([0,T ];C β′(Rd; Rd)) for a real number λ > 0. We introduce the following assumption for a real number β > 2. Assumption 2.2.2 (β). There is a constant N0 such that for all (ω, t) ∈ Ω × [0,T ], |r−11 bt|0 + |r −1 1 σt|0 + |∇bt|β−1 + |∇σt|β−1 ≤ N0. Theorem 2.2.4. Let Assumption 2.2.2(β) hold for some β > 2. Then for any stopping time τ ≤ T and β′ ∈ [1, β), there exists a unique process u(τ) in Cβ ′ cts(R d; Rd) that solves (2.1.2). Moreover, P-a.s. ut(τ, x) = Y−1t (τ, x) for all (t, x) ∈ [0,T ]×R d and for all ϵ > 0 and p ≥ 2, there is a constant N = N(d, p,N0,T, β′, ϵ) such that E [ sup s,t≤T |r−(1+ϵ)1 ut(s)| p 0 ] + E [ sup s,t≤T |r−ϵ1 ∇ut(s)| p β′−1 ] ≤ N. Remark 2.2.5. It is clear by the proof of this theorem that if σ ≡ 0, then we only need to assume that Assumption 2.2.2 (β) holds for some β > 1. Now let us consider the SPDE given by du¯t(x) = ( 1 2 σ iϱ t (x)σ jϱ t (x)∂i ju¯t(x) + b i t(x)∂iu¯t(x) ) dt + σiϱt (x)∂iu¯t(x)dw ϱ t , τ < t ≤ T, u¯t(x) = x, t ≤ τ. (2.2.2) This SPDE differs from the one given in (2.1.2) by the first-order coefficient in the drift. In order to obtain an existence and uniqueness theorem for this equation, we have to impose additional assumptions on σ. We introduce the following assumption for a real number β > 2. Assumption 2.2.3 (β). There is a constant N0 > 0 such that for all (ω, t) ∈ Ω × [0,T ], |r−11 bt|0 + |∇bt|β−1 + |σt|β+1 ≤ N0. For each τ ≤ T , consider the stochastic flow Yˆt = Yˆt(τ, x), (t, x) ∈ [0,T ]×Rd, generated by the SDE dY¯t = −bˆt(Y¯t)dt − σ ϱ t (Y¯t)dw ϱ t , τ < t ≤ T, Yt = x, t ≤ τ. 14 Chapter 2. Properties of space inverses of stochastic flows If Assumption 2.2.3(β) holds for some β > 2, then for all (ω, t, x) ∈ Ω × [0,T ] × Rd, |bˆt(x)| ≤ |bt(x)| + |σt(x)|∇σt(x)| ≤ N0(N0 + 1) + N0|x| and |∇bˆt|β−1 ≤ |∇bt|β−1 + |σt|β−1|∇2σt|β−1 + |∇σt|2β−1 ≤ N0 + 2N 2 0 , which immediately implies the following corollary of Theorem 2.2.4. Corollary 2.2.6. If Assumption 2.2.3(β) holds for some β > 2, then for any stopping time τ ≤ T and β′ ∈ [1, β), there exists a unique process u¯(τ) in Cβ ′ cts(R d; Rd) that solves (2.2.2). Moreover, P-a.s. u¯t(τ, x) = Y¯−1t (τ, x) for all (t, x) ∈ [0,T ]×R d and for all ϵ > 0 and p ≥ 2, there is a constant N = N(d, p,N0,T, β′, ϵ) such that E [ sup s,t≤T |r−(1+ϵ)1 u¯t(s)| p 0 ] + E [ sup s,t≤T |r−ϵ1 ∇u¯t(s)| p β′−1 ] ≤ N. 2.3 Properties of stochastic flows 2.3.1 Homeomorphism property of flows In this subsection, we collect some results about flows of jump SDEs that we will need. In particular, we present sufficient conditions that guarantee the homeomorphism property of flows of jump SDEs. First, let us introduce the following assumption, which is the usual linear growth and Lipschitz condition on the coefficients b, σ, and H of the SDE (2.1.1). Assumption 2.3.1. There is a constant N0 > 0 such that for all (ω, t, x, y) ∈ Ω×[0,T ]×R2d, |bt(x)| + |σt(x)| ≤ N0(1 + |x|), |bt(x) − bt(y)| + |σt(x) − σt(y)| ≤ N0|x − y|. Moreover, for all (ω, t, x, y, z) ∈ Ω × [0,T ] × R2d × Z, |Ht(x, z)| ≤ K1(t, z)(1 + |x|), |Ht(x, z) − Ht(y, z)| ≤ K2(t, z)|x − y|, where K1,K2 : Ω × [0,T ] × Z → R+ are PT ⊗Z-measurable functions satisfying K1(t, z) + K2(t, z) + ∫ Z ( K1(t, z) 2 + K2(t, z) 2 ) π(dz) ≤ N0, for all (ω, t, z) ∈ Ω × [0,T ] × Z. 2.3. Properties of stochastic flows 15 It is well-known that under this assumption, there exists a unique strong solution Xt(s, x) of (2.1.1) (see e.g. Theorem 3.1 in [Kun04]). We will also make use of the follow- ing assumption. Assumption 2.3.2. For all (ω, t, x, z) ∈ Ω × [0,T ] × Rd × Z, Ht(x, z) is differentiable in x, and there are constants η ∈ (0, 1) and Nκ > 0 such that for all (ω, t, x, z) ∈ {(ω, t, x, z) ∈ Ω × [0,T ] × Rd × Z : |∇Ht (ω, x, z)| > η}, ∣∣∣(Id + ∇Ht(x, z))−1 ∣∣∣ ≤ Nκ. The coming lemma shows that under Assumptions 2.3.1 and 2.3.2, the mapping x + Ht(x, z) from Rd to Rd is a diffeomorphism and the gradient of inverse map is bounded. Lemma 2.3.1. Let Assumptions 2.3.1 and 2.3.2 hold. For all (ω, t, z) ∈ Ω× [0,T ]× Z, the mapping H˜t(·, z) : Rd → Rd defined by H˜t(x, z) := x + Ht(x, z) is a diffeomorphism and |H˜−1t (x, z)| ≤ N¯N0 + N¯ |x| and |∇H˜ −1 t (x, z)| ≤ N¯, where N¯ := (1 − η)−1 ∨ N0. Proof. (1) On the set (ω, t, x, z) ∈ {(ω, t, x, z) ∈ Ω × [0,T ] × Rd × Z : |∇Ht(ω, x, z)| ≤ η}, we have |κt(ω, x, z)| ≤ ∣∣∣∣∣∣∣ Id + ∞∑ n=1 (−1)n[∇Ht(ω, x, z)]n ∣∣∣∣∣∣∣ ≤ 1 1 − η . It follows from Assumption 2.3.2 that for all ω, t, x, and z, the mapping ∇H˜t(x, z) has a bounded inverse. Therefore, by Theorem 0.2 in [DMGZ94], the mapping H˜t(·, z) : Rd → Rd is a global diffeomorphism. Moreover, for all ω, t, x and z, |H˜−1t (x, z) − H˜ −1 t (y, z)| ≤ N¯ |x − y|, which yields |H˜t(x, z) − H˜t(y, z)| ≥ N¯−1|x − y| =⇒ |H˜t(x, z)| + K1(t, z) ≥ N¯−1|x|, and hence |H˜−1t (x, z)| ≤ N¯K1(t, z) + N¯|x| ≤ N¯N0 + N¯ |x|. □ The following estimates are essential in the proof of the homeomorphic property of the flow and the derivation of moment estimates of the inverse flow. We refer the reader to Theorem 3.2 and Lemmas 3.7 and 3.9 in [Kun04] and Lemma 4.5.6 in [Kun97] (H ≡ 0 case) for the proof of the following lemma. 16 Chapter 2. Properties of space inverses of stochastic flows Lemma 2.3.2. Let Assumption 2.3.1 hold. (1) For all p ≥ 2, there is a constant N = N(p,N0,T ) such that for all s, s¯ ∈ [0,T ] and x, y ∈ Rd, E [ sup t≤T r1(Xt(s, x)) p ] ≤ Nr1(x)p, (2.3.1) E [ sup t≤T |Xt(s, x) − Xt(s, y)|p ] ≤ N|x − y|p. (2.3.2) (2) If Assumption 2.3.2 holds, then for any p ∈ R, there is a constant N = N(p,N0,T, η,Nκ) such that for all s ∈ [0,T ] and x, y ∈ Rd, E [ sup t≤T r1(Xt(s, x)) p ] ≤ Nr1(x)p, (2.3.3) and E [ sup t≤T |Xt(s, x) − Xt(s,Y)|p ] ≤ N|x − y|p. (2.3.4) In the next proposition, we collect some facts about the homeomorphic property of the flow. Let us mention that the homeomorphism property has been shown in [QZ08] to hold under a log-Lipschitz condition on the coefficients. The key idea is to use Bihari’s inequality instead of Gronwall’s inequality, but we do not pursue this here. Proposition 2.3.3. Let Assumptions 2.3.1 and 2.3.2 hold. (1) There exists a modification of the strong solution Xt(s, x), (s, t, x) ∈ [0,T ]2 × Rd, of (2.1.1), also denoted by Xt(s, x), that is càdlàg in s and t and continuous in x. Moreover, for any stopping time τ ≤ T, P-a.s. for all t ∈ [0,T ], the mappings Xt(τ, ·), Xt−(τ, ·) : Rd → Rd are homeomorphisms and the inverse of Xt(τ, ·), denoted by X−1t (τ, ·), is càdlàg in t and continuous in x, and X −1 t− (τ, ·) coincides with the inverse of Xt−(τ, ·). In particular, if (xn)n≥1 is a sequence in Rd such that limn→∞ xn = x for some x ∈ Rd, then P-a.s. lim n→∞ sup t≤T |X−1t (τ, xn) − X −1 t (τ, x)| = 0. Furthermore, for all β′ ∈ [0, 1), P-a.s. X(τ, ·) ∈ D([0,T ];Cβ ′ loc(R d; Rd)) and for all ϵ > 0 and p ≥ 2, there is a constant N = N(d, p,N0,T, β′, ϵ) such that E [ sup t≤T |r−(1+ϵ)1 Xt(τ)| p β′ ] ≤ N. (2.3.5) (2) If H ≡ 0, then P-a.s. for all s, t ∈ [0,T ], the Xt(s, x) and X−1t (s, x) are continuous in s, t, and x. Moreover, for all β′ ∈ [0, 1], P-a.s. X(·, ·) ∈ C([0,T ]2;Cβ ′ loc(R d; Rd)) and for 2.3. Properties of stochastic flows 17 all ϵ > 0 and p ≥ 2, there is a constant N = N(d, p,N0,T, β′, ϵ) such that E [ sup s,t≤T |r−(1+ϵ)1 Xt(s)| p β′ ] ≤ N. (2.3.6) Proof. (1) Owing to Assumptions 2.3.1 and 2.3.2, by Lemma 2.3.1, for all ω, t and z, the process H˜t(x, z) := x + Ht(x, z) is a homeomorphism (in fact, it is a diffeomorphism) in x and H˜−1t (x, z) has linear growth and is Lipschitz. This implies that assumptions of Theorem 3.5 in [Kun04] hold and hence there is modification of Xt(s, x), denoted Xt(s, x), such that for all s ∈ [0,T ], P-a.s. for all t ∈ [0,T ], Xt(s, ·) is a homeomorphism. Following [Kun04], for each (s, t, x) ∈ [0,T ]2 × Rd, we set X¯t(s, x) =    x t ≤ s Xt(0, X−1s (0, x)) t ≥ s, (2.3.7) and remark that P-a.s. X¯t(s, x) is càdlàg in s and t and continuous in x, and P-a.s. for all (s, t) ∈ [0,T ]2, X¯t(s, ·) is a homeomorphism, and X¯t(s, x) is a version of Xt(s, x) (the equation started at s). Fix a stopping time τ ≤ T . We will now show that X¯t(τ, x) = X¯t(s, x)|s=τ (i.e. X¯t(s, x) evaluated at s = τ) is a version of Xt(τ, x). Define the sequence of stopping times (τn)n≥1 by τn = n−1∑ k=1 kT n 1{ (k−1)T n ≤τ< kT n } + T1{ τ≥ (n−1)Tn }. For each n and x, let X(n)t = X (n) t (x) = X¯t(τn, x), t ∈ [0,T ]. It follows that for all n, t, and x, P-a.s. for all k ∈ {1, . . . , n}, X(n)t (x)1{τn= kTn } = Xt ( kT n , x ) 1{τn= kTn }, and hence X(n)t (x)1{τn= kTn } = 1{τn= kTn }x + 1{τn= kTn } ∫ ] kTn , kT n ∨t] br(X (n) r (x))dr + 1{τn= kTn } ∫ ] kTn , kT n ∨t] σϱr (X (n) r (x))dw ϱ r + 1{τn= kTn } ∫ ] kTn , kT n ∨t] ∫ Z Hr(X (n) r (x), z)q(dr, dz). Since Ω is the disjoint union of the sets { τn = kT n } , k ∈ {1, . . . , n}, it follows that X(n)t (x) 18 Chapter 2. Properties of space inverses of stochastic flows solves X(n)t (x) = x + ∫ ]τn,τn∨t] br(X (n) r (x))dr + ∫ ]τn,τn∨t] σϱr (X (n) r (x))dw ϱ r + ∫ ]τn,τn∨t] ∫ Z Hr(X (n) r (x), z)q(dr, dz). Thus, by uniqueness, we find that for all t and x, P-a.s. X¯t(τn, x) = X (n) t (x) = Xt(τn, x). It is easy to check that for all t and x, P-a.s. Xt(τn, x) converges to Xt(τ, x) as n tends to infinity. Since X¯t(s, x) is càdlàg in s, X¯t(τn, x) converges to X¯t(τ, x) as n tends to infinity. Therefore, X¯t(τ, x) is a version of Xt(τ, x) for all t and x. We identify Xt(s, x) and X¯t(s, x) for all (s, t, x) ∈ [0,T ]2 × Rd. Using Lemma 2.3.2(1) and Corollary 2.5.3, we obtain that P-a.s X·(τ, ·) ∈ D([0,T ];C β′ loc(R d; Rd)) and that the estimate (2.3.5) holds. Note here that for all β ≥ 0, the Fréchet spaces D([0,T ];Cβloc(R d; Rd)) and Cβloc(R d; D([0,T ]; Rd)) are equivalent. It follows from the proof of Theorem 3.5 in [Kun04] that for every stopping time τ¯ ≤ T , P-a.s. lim |x|→∞ inf t≤T |Xt(τ¯, x)| = ∞. (2.3.8) Let (tn) ⊆ [0,T ] and (xn) ⊆ Rd be convergent sequences with limits t and x, respectively. First, assume tn < t for all n. By (2.3.8), for every stopping time τ¯ ≤ T , P-a.s. the sequence( X−1tn (τ¯, xn) ) is uniformly bounded. Since P-a.s. X·(τ, ·) ∈ D([0,T ];Cβ(Rd; Rd)), β′ ∈ (0, 1), we have lim n→∞ ( Xt−(τ¯, X −1 tn (τ¯, xn)) − Xt−(τ¯, X −1 t− (τ¯, x) ) = lim n→∞ ( Xt−(τ¯, X −1 tn (τ¯, xn)) − x ) = lim n→∞ ( Xtn(τ¯, X −1 tn (τ¯, xn)) − x ) = lim n→∞ (xn − x) = 0, which implies lim n→∞ X−1tn (τ¯, xn) = X −1 t− (τ¯, x). A similar argument is used for tn > t. (2) It follows from the definition (2.3.7) that X¯t(s, x) and X¯−1t (s, x) are continuous in s, t, and x. Moreover, applying Lemma 2.3.2(1) and Corol- lary 2.5.3, we conclude that P-a.s. X·(·, ·) ∈ C([0,T ]2;C β′ loc(R d; Rd)) and that the estimate (2.3.6) holds. The continuity of Xs(τ, x) with respect to s plays an important role in the proof of Theorem 2.2.4. □ 2.3.2 Moment estimates of inverse flows: Proof of Theorem 2.2.1 In this subsection, under Assumption 2.2.1(β), β ≥ 1, we derive moment estimates for the flow Xt(τ, x) and its inverse X−1t (τ, x) in weighted Hölder norms and complete the proof of Theorem 2.2.1. In particular, we will apply Corollaries 2.5.2 and 2.5.3 with the Banach 2.3. Properties of stochastic flows 19 spaces V = D([0,T ]; Rd) and V = C([0,T ]2; Rd). Proposition 2.3.4. Let Assumption 2.2.1(β) hold for some β > 1 (1) For all stopping time sτ ≤ T and β′ ∈ [1, β), P-a.s. ∇X·(τ, ·) ∈ D([0,T ];C β′−1 loc (R d; Rd)) and for all ϵ > 0 and p ≥ 2, there is a constant N = N(d, p,N0,T, β′, ϵ) such that E [ sup t≤T |r−ϵ1 ∇Xt(τ)| p β′−1 ] ≤ N. (2.3.9) Moreover, for all p ≥ 2, there is a constant N = N(d, p,N0, β,T ) such that for all multi-indices γ with 1 ≤ |γ| ≤ [ β ] and all x ∈ Rd, E [ sup t≤T |∂γXt(τ, x)|p ] ≤ N (2.3.10) and for all multi-indices γ with |γ| = [β]− and all x, y ∈ Rd, E [ sup t≤T |∂γXt(τ, x) − ∂γXt(τ, y)|p ] ≤ N |x − y|{β} + p. (2.3.11) (2) If H ≡ 0, then for all β′ ∈ [1, β), P-a.s. ∇X·(·, ·) ∈ C([0,T ]2;C β′−1 loc (R d; Rd)) and for all ϵ > 0 and p ≥ 2, there is a constant N = N(d, p,N0,T, β′, ϵ) such that E [ sup s,t≤T |r−(1+ϵ)1 ∇Xt(s)| p β′−1 ] ≤ N. Moreover, for all p ≥ 2, there is a constant N = N(d, p,N0,T, β) such that for all multi-indices γ with |γ| = [β]− and all s, s¯ ∈ [0,T ] and x ∈ Rd, E [ sup t≤T |∂γXt(s, x) − ∂γXt(s, x)|p ] ≤ N |s − s|p/2. (2.3.12) Proof. (1) Fix a stopping time τ ≤ T and write Xt(τ, x) = Xt(x). First, let us assume that [β]− = 1. It follows from Theorem 3.4 in [Kun04] that P-a.s. for all t, Xt(τ, ·) is continuously differentiable and Ut = ∇Xt(τ, x) satisfies dUt = ∇bt(Xt)Utdt + ∇σ ϱ t (Xt−)Utdw ϱ t + ∫ Z ∇Ht(Xt−, z)Ut−q(dt, dz), τ < t ≤ T, ∇Xt = Id, t ≤ τ, (2.3.13) where Id is the d × d-dimensional identity matrix. Taking λ = 0 in the estimates (3.10) and (3.11) in Theorem 3.3 in [Kun04], we obtain (2.3.10) and (2.3.11). Then applying Corollary 2.5.3 with V = D([0,T ]; Rd), we have that X·(·) ∈ D([0,T ];C β′ loc(R d; Rd)) and 20 Chapter 2. Properties of space inverses of stochastic flows that (2.3.9) holds. The proof for [β]− > 1 follows by induction (see, e.g. the proof of Theorem 6.4 in [Kun97]). (2) The estimate (2.3.12) is given in Theorem 4.6.4 in [Kun97] in equation (19). The remaining items of part (2) then follow in exactly the same way as part (1) with the only exception being that we apply Corollary 2.5.3 with V = C([0,T ]2; Rd). □ Lemma 2.3.5. Let Assumption 2.2.1(β) hold for some β > 1. (1) For any stopping time τ ≤ T and β′ ∈ [1, β), P-a.s. ∇X·(τ, ·)−1 ∈ D([0,T ];C β′−1 loc (Rd; Rd)) and for all p ≥ 2, there is a constant N = N(d, p,N0,T, η,Nκ) such that for all x, y ∈ Rd E [ sup t≤T |∇Xt(τ, x)−1|p ] ≤ N (2.3.14) and E [ sup t≤T |∇Xt(τ, x)−1 − ∇Xt(τ, y)−1|p ] ≤ N|x − y|((β−1)∧1)p. (2.3.15) (2) If H ≡ 0, then for all β′ ∈ [1, β), P-a.s. ∇X·(·, ·)−1 ∈ C([0,T ]2;C β′−1 loc (R d; Rd)) and for all p ≥ 2, there is a constant N = N(d, p,N0,T ) such that for all s, s¯ ∈ [0,T ] and x ∈ Rd, E [ sup t≤T |∇Xt(s, x)−1 − ∇Xt(s, x)−1|p ] ≤ N|s − s¯|p/2. Proof. (1) Let τ ≤ T be a fixed stopping time and write Xt(τ, x) = Xt(x). Using Itô’s formula (see also Lemma 3.12 in [Kun04]), we deduce that U¯t = ∇Xt(x)−1 satisfies dU¯t = U¯t ( ∇σϱt (Xt−)∇σ ϱ t (Xt−(τ)) − ∇bt(Xt) ) dt − U¯t∇σ ϱ t (Xt)dw ϱ t − ∫ Z U¯t−∇Ht(Xt−, z)(Id + ∇Ht(Xt−, z))−1q(dt, dz) + ∫ Z U¯t∇Ht(Xt−, z)2(Id + ∇Ht(Xt−, z))−1π(dz)dt, τ < t ≤ T, U¯t = Id, t ≤ τ. (2.3.16) Since matrix inversion is a smooth mapping, the coefficients of the linear equation (2.3.16) satisfy the same assumptions as the coefficients of the linear equation (2.3.13), and hence the derivation of the estimates (2.3.14) and (2.3.15) proceed in the same way as the anal- ogous estimates for (2.3.13). To see that P-a.s. X·(·)−1 ∈ D([0,T ];C β′−1 loc (R d; Rd)), we only need to note that P-a.s. X·(·) ∈ D([0,T ];C β′−1 loc (R d; Rd)) and that matrix inversion is a smooth mapping. Part (2) follows with the obvious changes. □ As an immediate corollary, we obtain the diffeomorphism property of the flow Xt(τ, x) under Assumption 2.2.1(β), β > 1. 2.3. Properties of stochastic flows 21 Corollary 2.3.6. Let Assumption 2.2.1(β) hold. (1) For all stopping times τ ≤ T and β′ ∈ [1, β) the mapping Xt(τ, ·) : Rd → Rd is a Cβ ′ loc(R d; Rd)-diffeomorphism, P-a.s. X·(τ, ·), X−1· (τ, ·) ∈ D([0,T ];C β′ loc(R d; Rd)) and for all t ∈ [0,T ], X−1t− (τ) coincides with the inverse of Xt−(τ). (2) If H ≡ 0, then for all β′ ∈ [1, β), P-a.s. X·(·, ·), X−1· (·, ·) ∈ C([0,T ] 2,Cβ ′ loc(R d; Rd)). Proof. (1) Fix a stopping time τ ≤ T and write Xt(τ, x) = Xt(x). It follows from Propo- sitions 2.3.3 and 2.3.4 that P-a.s. for all t, the mappings Xt(·), Xt−(·) : Rd → Rd are homeomorphisms and X·(·) ∈ D([0,T ];C β′ loc(R d; Rd)). Moreover, by Lemma 2.3.5, P-a.s. for all t and x, the matrix ∇Xt (τ, x) has an inverse. Therefore, by Hadamard’s Theorem (see, e.g., Theorem 0.2 in [DMGZ94]), P-a.s. for all t, Xt(·) is a diffeomorphism. Using the chain rule, P-a.s. for all t and x, ∇X−1t (x) = ∇Xt(X −1 t (x)) −1. (2.3.17) Since, by Lemma 2.3.5, P-a.s. [∇X·(·)]−1 ∈ D([0,T ];C β′−1 loc (R d; Rd)) and we know that P-a.s. for all t, X−1t (·) is differentiable, it follows from (2.3.17) that P-a.s. ∇X·(X−1· (·)) −1 ∈ D([0,T ];C(β ′−1)∧1 loc (R d; Rd)). We proceed inductively to complete the proof. Making the obvious changes in the proof of part (1), we obtain part (2). □ We conclude with a derivation of Hölder moment estimates of the inverse flow X−1t (τ, x), which will complete the proof of Theorem 2.2.1. Proof of Theorem 2.2.1. (1) Fix a stopping time τ ≤ T and write Xt(τ, x) = Xt(x). Fix ϵ > 0. First, let us assume that [ β ]− = 1. Set Jt(x) = | det∇Xt(x)|. It is clear from (2.3.10) that for all p ≥ 2 and x, there is a constant N = N(d, p,N0,T ) such that E[sup t≤T |Jt(x)|p] ≤ N. (2.3.18) By the mean value theorem, for all x and y and p¯ ∈ R, we have |r1(x) p¯ − r1(y) p¯| ≤ |p¯|(r1(x) p¯−1 + r1(y) p¯−1)|x − y|. (2.3.19) Using the change of variable (x¯, y¯) = (X−1t (x), X −1 t (y)), Fatou’s lemma, Fubini’s theorem, Hölder’s inequality, and the inequalities (2.3.3), (2.3.18), (2.3.19), (2.3.2), and (2.3.4), for all δ ∈ (0, 1] and p > d ϵ , we conclude that there is a constant N = N(d, p,N0,T, δ, η,Nκ, ϵ) 22 Chapter 2. Properties of space inverses of stochastic flows such that E sup t≤T ∫ Rd |r1(x)−(1+ϵ)X−1t (x)| pdx ≤ ∫ Rd |x¯|pE sup t≤T [r1(Xt(x¯)) −p(1+ϵ)Jt(x¯)]dx¯ ≤ NE ∫ Rd r1(x¯) −pϵdx¯ ≤ N and E sup t≤T ∫ |x−y|<1 |r−(1+ϵ)1 (x)X −1 t (x) − r −(1+ϵ) 1 (y)X −1 t (y)| p |x − y|2d+δp dxdy ≤ ∫ |x¯−y¯|<1 E sup t≤T   r−p(1+ϵ)1 (Xt(x¯))|x¯ − y¯| pJt(x¯)Jt(y¯) |Xt(x¯) − Xt(y¯)|2d+δp   dx¯dy¯ + ∫ |x¯−y¯|<1 E sup t≤T   |y¯|p|r−(1+ϵ)1 (Xt(x¯)) − r −(1+ϵ) 1 (Xt(y¯))| pJt(x¯)Jt(y¯) |Xt(x¯) − Xt(y¯)|2d+δp   dx¯dy¯ ≤ N ∫ |x¯−y¯|<1 r1(x¯)−p(1+ϵ) |x¯ − y¯|2d−(1−δ)p dx¯dy¯ + N ∫ |x¯−y¯|<1 r1(x¯)−p(1+ϵ) + r1(y¯)−p(1+ϵ) |x¯ − y¯|2d−(1−δ)p dx¯dy¯ ≤ N. Similarly, making use of the inequalities (2.3.3), (2.3.18), (2.3.19), (2.3.2), (2.3.4), (2.3.14), and (2.3.15), for all p > d ϵ ∨ d β−β′ ∨ d 2−β′ , we get E sup t≤T ∫ Rd |r−ϵ(x)∇X−1t (x)| pdx ≤ ∫ Rd E sup t≤T [r1(Xt(x¯)) −pϵ |[∇Xt(x¯)]−1|pJt(x¯)]dx¯ ≤ NE ∫ Rd r1(x¯) −pϵdx¯ ≤ N and E sup t≤T ∫ |x−y|<1 |r−ϵ1 (x)∇X −1 t (x) − r −ϵ 1 (y)∇X −1 t (y)| p |x − y|2d+(β′−1)p dxdy ≤ ∫ |x¯−y¯|<1 E sup t≤T [ |r−ϵ1 (Xt(x¯))[∇Xt(x¯)] −1 − r−ϵ1 (Xt(y¯))[∇Xt(y¯)] −1|pJt(x¯)Jt(y¯) |Xt(x¯) − Xt(y¯)|2d+(β ′−1)p ] dx¯dy¯ ≤ N ∫ |x¯−y¯|<1 r1(x¯)−pϵ |x¯ − y¯|2d−(β−β′)p dx¯dy¯ + N ∫ |x¯−y¯|<1 r1(x¯)−pϵ + r1(y¯)−pϵ |x¯ − y¯|2d−(2−β′)p dx¯dy¯ ≤ N, where N = N(d, p,N0,T, β′, η,Nκ, ϵ) is a positive constant. Therefore, combining the above estimates and applying Corollary 2.5.2, we find that for all p ≥ 2, there is f a constant N = N(d, p,N0,T, β′, η,Nκ, ϵ), such that E [ sup t≤T |r−(1+ϵ)1 X −1 t (τ)| p 0 ] + E [ sup t≤T |r−ϵ1 ∇X −1 t (τ)| p β′−1 ] ≤ N. It is well-known that the the inverse map I on the set of invertible d × d-dimensional 2.3. Properties of stochastic flows 23 matrices is infinitely differentiable and for all n, there is a constant N = N(n, d) such that for all invertible matrices M, the nth derivative of I evaluated at M, denoted I(n)(M), satisfies ∣∣∣I(n)(M) ∣∣∣ ≤ N |M−n−1| ≤ N ∣∣∣M−1 ∣∣∣ n+1 . We claim that for all n and every multi-index γ with |γ| = n, the components of ∂γX−1t (x) are a polynomial in terms of the entries of [∇Xt(X−1t (x))] −1 and ∂γ ′ ∇Xt(X−1t (x)) for all multi-indices γ′ with 1 ≤ |γ′| ≤ n − 1. Assume that statement holds for some n. By the chain rule, for all ω, t, and x, we have ∇(∇Xt(X−1t (x)) −1) = I(1)(∇Xt(X−1t (x)))∇ 2Xt(X −1 t (x))∇Xt(X −1 t (x)) −1 and for all multi-indices γ with 1 ≤ |γ′| ≤ n − 1, we have ∇(∂γ ′ ∇Xt(X−1t (x))) = ∂ γ′∇2Xt(X−1t (x))∇Xt(X −1 t (x)) −1, where ∇2Xt(X−1t (x)) is the tensor of second-order derivatives of Xt(·) evaluated at X −1 t (x). This implies that for every multi-index γ with |γ| = n + 1, the components of ∂γX−1t (x) are a polynomial in terms of the entries of ∇Xt(X−1t (x)) −1 and ∂γ ′ ∇Xt(X−1t (x)) for all multi- indices γ′ with 1 ≤ |γ′| ≤ n. By induction, the claim is true. Therefore, for [β]− ≥ 2, using (2.3.10) and (2.3.11), we obtain the moment estimates for the inverse flow in the almost exact same way we did for [β]− = 1. Making the obvious changes in the proof of part (1), we obtain part (2). This completes the proof of Theorem 2.2.1. □ 2.3.3 Strong limit of a sequence of flows: Proof of Theorem 2.2.3 Proof of Theorem 2.2.3. Let τ ≤ T be a fixed stopping time and write Xt(x) = Xt(τ, x). For each n, let Z(n)t (x) = X (n) t (x) − Xt(x), (t, x) ∈ [0,T ] × R d. Throughout the proof we denote by (δn)n≥1 a deterministic sequence with δn → 0 as n→ ∞ that may change from line to line. Let N = N(p,N0,T ) be a positive constant, which may change from line to line. By virtue of Theorem 2.1 in [Kun04] and (2.3.1), for all p ≥ 2 and t, x and n, we have E [ sup s≤t |Z(n)s (x)| p ] ≤ NE ∫ ]0,t] |Z(n)s (x)| pds + Nδnr1(x) p. 24 Chapter 2. Properties of space inverses of stochastic flows Since the right-hand-side is finite by (2.3.1), applying Gronwall’s lemma we find that for all x and n, E [ sup t≤T |Z(n)t (x)| p ] ≤ Nδnr1(x)p. (2.3.20) Similarly, by (2.3.10), for all x and n, we have E [ sup t≤T |∇Z(n)t (x)| p ] ≤ Nδn. Using (2.3.10), for all x, y, and n, we obtain E [ sup t≤T |Z(n)t (x) − Z (n)(y)|p ] ≤ |x − y|pE sup t≤T ∫ 1 0 |∇Z(n)t (y + θ(x − y))| pdθ ≤ N|x − y|p. It follows immediately from (2.3.11) that for all x, y, and n, E [ sup t≤T ]|∇Z(n)t (x) − ∇Z (n) t (y)| p ] ≤ N|x − y|(β−1)∨1. Thus, by Corollary 2.5.4, we have lim n→∞ ( E [ sup t≤T |r−(1+ϵ)1 X (n) t − r −(1+ϵ) 1 Xt| p 0 ] + E [ sup t≤T |r−ϵ1 ∇X (n) t − r −ϵ 1 ∇Xt| p 0 ]) = 0. (2.3.21) Owing to a standard interpolation inequality for Hölder spaces (see, e.g. Lemma 6.32 in [GT01]), for all δ ∈ (0, 1) and β¯ ∈ (β′, β), there is a constant N(δ) such that E [ sup t≤T |r−ϵ1 ∇X (n) t − r −ϵ 1 ∇Xt| p β′−1 ] ≤ δE [ sup t≤T |r−ϵ1 ∇X (n) t − ∇Xt| p β¯−1 ] + CδE [ sup t≤T |r−ϵ1 ∇X (n) t − ∇Xt| p 0 ] , and hence since sup n∈N E [ sup t≤T |rε1∇X (n)|p β¯−1 ] + E [ sup t≤T |rε1∇Xt| p β¯−1 ] < ∞, we have lim n→∞ E[sup t≤T |r−ϵ1 ∇X (n) t − r −ϵ 1 ∇Xt| p β′−1] = 0. By Theorem 2.2.1, Corollary 2.5.4, and the interpolation inequality for Hölder spaces used above, in order to show lim n→∞ E [ sup t≤T |r−(1+ϵ)1 X (n);−1 t (τ) − r −(1+ϵ) 1 X −1 t (τ)| p 0 ] = 0 2.3. Properties of stochastic flows 25 lim n→∞ E [ sup t≤T |r−ϵ1 ∇X (n);−1 t (τ) − r −ϵ 1 ∇X −1 t (τ)| p β′−1 ] = 0, it suffices to show that for all x, dP − lim n→∞ sup t≤T |X(n);−1t (x) − X −1 t (x)| = 0 (2.3.22) and dP − lim n→∞ sup t≤T |∇X(n);−1t (x) − ∇X −1 t (x)| = 0. (2.3.23) For each n, define Θ (n) t (x) = r1(X (n) t (x)) −1 − r1(Xt(x))−1, (t, x) ∈ [0,T ] × Rd. For all ω, t, x, and n, we have |Θ(n)t (x)| ≤ r1(X (n) t (x)) −1r1(Xt(x)) −1|Z(n)t (x)|, and hence using Hölder’s inequality, (2.3.4), and (2.3.20), we obtain that for all p ≥ 2, x, there is a constant N = N(p,N0,T, η,Nκ) such that for all n, E [ sup t≤T |Θ(n)t (x)| p ] ≤ Nr1(x)−pδn, where N = N(p,N0,T, η,Nκ) is a constant. Furthermore, since |∇Θ(n)t (x)| ≤ r1(X (n) t (x)) −2|∇X(n)t (x)| + r1(Xt(x)) −2|∇X(n)t (x)|, for all ω, t, x, and n, applying (2.3.4) and (2.3.10), for all p ≥ 2, x, and n, we get E [ sup t≤T |r1(x)Θ (n) t (x) − r1(y)Θ (n) t (y)| p ] ≤ N|x − y|p. Then owing to Corollary 2.5.4, for all p ≥ 2, lim n→∞ E [ sup t≤T |Θ(n)t | p 0 ] = 0. (2.3.24) We claim that for all R > 0, dP − lim n→∞ E(n,R) := dP − lim n→∞ sup t≤T |X(n);−1t − X −1 t |0;{|x|≤R} = 0. (2.3.25) Fix R > 0. It is enough to show that every subsequence of E(n) = E(n,R) has a sub- subsequence converging to 0, P-a.s.. Owing to (2.3.21) and (2.3.24), for a given subse- 26 Chapter 2. Properties of space inverses of stochastic flows quence (E(nk)), we can always find sub-subsequence (still denoted (E(nk)) to avoid double indices) such that P-a.s., lim k→∞ sup t≤T |X(nk)t − Xt|β′;{|x|≤R¯} = 0, ∀ R¯ > 0, (2.3.26) and lim k→∞ sup t≤T |r1(X (nk) t (x)) −1 − r1(Xt(x))−1|0 = 0. Fix an ω for which both limits are zero. We will prove that lim k→∞ sup t≤T |X(nk);−1t (ω) − X −1 t (ω)|0;{|x|≤R} = 0. (2.3.27) Suppose, by contradiction, that (2.3.27) is not true. Then there exists an ε > 0 and a subsequence of (nk) (still denoted (nk)) such that tnk → t− (or tnk → t+) and xnk → x as k → ∞ with ∣∣∣xnk ∣∣∣ ≤ R such that (dropping ω), |X(nk);−1tnk (xnk) − X −1 tnk (xnk)| ≥ ε. (2.3.28) Arguing by contradiction and using (2.3.3), we have sup k∈N |X(nk);−1tnk (xnk)| < ∞. (2.3.29) Applying (2.3.29), (2.3.26), and the fact that X·(·), X−1· (·) ∈ D([0,T ];C β′ loc(R d; Rd)) , we obtain lim k→∞ ( Xt−(X (nk);−1 tnk (xnk)) − Xt−(X −1 tnk (xnk)) ) = lim k→∞ ( Xt−(X (nk);−1 tnk (xnk)) − xnk ) = lim k→∞ ( Xt−(X (nk);−1 tnk (xnk)) − X (nk) tnk (X(nk);−1tnk (xnk)) ) = lim k→∞ ( Xt−(X (nk);−1 tnk (xnk)) − Xtnk (X (nk);−1 tnk (xnk)) ) + lim k→∞ ( Xtnk (X (nk);−1 tnk (xnk)) − X (nk) tnk (X(nk);−1tnk (xnk)) ) = 0, which contradicts (2.3.28), and hence proves (2.3.27), (2.3.25), and (2.3.22). For each n, define U¯ (n)t = U¯ (n)(t, x) = ∇X(n)t (x) −1 and U¯(t) = U¯(t, x) = ∇Xt(x)−1, (t, x) ∈ [0,T ] × Rd. 2.4. Classical solutions of degenerate SPDEs: Proof of Theorem 2.2.4 27 Using (2.3.14) and (2.3.15) and repeating the arguments given above, for all p ≥ 2, we get lim n→∞ E [ sup t≤T |r−ϵ1 U¯ (n) t − r −ϵ 1 U¯t| p β′−1 ] = 0. Then (2.3.3) and (2.3.25) imply that for all R > 0, dP − lim n→∞ sup t≤T |∇X(n);−1t (x) − ∇X −1 t (x)|0;{|x|≤R} = dP − lim n→∞ sup t≤T |∇X(n)t (X (n);−1 t (x)) −1 − ∇Xt(X−1t (x)) −1|0;{|x|≤R} = 0, which yields (2.3.23) and completes the proof. □ 2.4 Classical solutions of degenerate SPDEs: Proof of The- orem 2.2.4 Proof of Theorem 2.2.4. Fix a stopping time τ ≤ T . By virtue of Theorem 2.2.1, we only need to show that Y−1(τ) = Y−1t (τ, x) solves (2.1.2) and is the unique solution. Suppose we have shown that Y−1(s, x), s ∈ [0,T ], solves (2.1.2) (i.e. where τ = s is deterministic). It is then a routine argument to conclude that Y−1(τ′) solves (2.1.2) for finite-valued stopping times τ′ ≤ T . We can then use an approximation argument as in the proof of Proposition 2.3.3 to show that Y−1(τ) = Y−1t (τ, x) solves (2.1.2). Thus, it suffices to take τ deterministic. Let ut(x) = ut(s, x) = Y−1t (s, x), (s, t, x) ∈ [0,T ] 2 × Rd. Fix (s, t, x) ∈ [0,T ]2 × Rd with s < t and write Yt(x) = Yt(s, x). Let ((tMn )0≤n≤M)1≤M≤∞ be a sequence of partitions of the interval [s, t] such that for all M > 0, (tMn )0≤n≤M has mesh size (t − s)/M. Fix M and set (tn)0≤n≤M = (tMn )0≤n≤M. Immediately, we obtain ut(x) − x = M−1∑ n=0 (utn+1(x) − utn(x)). (2.4.1) We will use Taylor’s theorem to expand each term in the sum on the right-hand-side of (2.4.1). By Taylor’s theorem, for all n and y, we have utn+1(Ytn+1(y)) − utn(Ytn+1(y)) = y − utn(Ytn+1(y)) = utn(Ytn(y)) − utn(Ytn+1(y)) = ∇utn(Ytn(y))(Ytn(y) − Ytn+1(y)) − (Ytn(y) − Ytn+1(y)) ∗Θn(Ytn(y))(Ytn(y) − Ytn+1(y)), (2.4.2) where Θi jn (z) = ∫ 1 0 (1 − θ)∂i jutn ( z + θ(Ytn+1(Y −1 tn (z)) − z) ) dθ. 28 Chapter 2. Properties of space inverses of stochastic flows Since for all n, Ytn+1(s, x) = Ytn+1(tn,Ytn(s, x)), we have Ytn+1(Y −1 tn (x)) = Ytn+1(tn, x) and hence substituting y = Y−1tn (x) into (2.4.2), for all n, we get utn+1(x) − utn(x) = An + Bn, (2.4.3) where An := ∇utn(x)(x − Ytn+1(tn, x)) − (x − Ytn+1(tn, x)) ∗Θi jn (x)(x − Ytn+1(tn, x)) and Bn := (utn+1(x) − utn(x)) − (utn+1(Ytn+1(tn, x)) − utn(Ytn+1(tn, x))). Applying Taylor’s theorem once more, for all n, we obtain Bn = Cn + Dn, (2.4.4) where Cn := (∇utn+1(x) − ∇utn(x))(x − Ytn+1(tn, x)), Dn := −(x − Ytn+1(tn, x)) ∗Θ˜n(x)(x − Yti+1(ti, x))), and Θ˜n(x) i j := ∫ 1 0 (1 − θ)∂i j(utn+1 − utn)(x + θ(Ytn+1(tn, x) − x))dθ. Thus, combining (2.4.1), (2.4.3), and (2.4.4), P-a.s. we have ut(x) − x = M−1∑ n=0 (An + Cn + Dn). (2.4.5) Now, we will derive the limit of the right-hand-side of (2.4.5). Claim 2.4.1. (1) dP − lim M→∞ M−1∑ n=0 An = − ∫ ]s,t] [ 1 2 σiϱr (x)σ jϱ r (x)∂i jur(x) + b i r(x)∂iur(x)]dr − ∫ ]s,t] σiϱr (x)∂iur(x)dw ϱ r ; (2) dP − limM→∞ ∑M−1 n=0 Dn = 0; (3) dP − limM→∞ ∑M−1 n=0 Cn = ∫ ]s,t] σ jϱ r (x)∂ jσ iϱ r (x)∂iur(x)dr + ∫ ]s,t] σ iϱ r (x)σ jϱ r (x)∂i jur(x)dr. 2.4. Classical solutions of degenerate SPDEs: Proof of Theorem 2.2.4 29 Proof of Claim 2.4.1. (1) For all n, we have ∇utn(x) ( x − Ytn+1(tn, x) ) = − ∫ ]tn,tn+1] bir(x)∂iutn(x)dr − ∫ ]tn,tn+1] σiϱr (x)∂iutn(x)dw ϱ r + R(1)n + R (2) n , where R(1)n := ∫ ]tn,tn+1] ( bir(x) − b i r(Yr(tn, x)) ) ∂iutn(x)dr and R(2)n := ∫ ]tn,tn+1] [σiϱr (x) − σ iϱ r (Yr(tn, x))]∂iutn(x)dw ϱ r . Since b and σ are Lipschitz, there is a constant N = N(N0,T ) such that M−1∑ n=0 ∣∣∣R1n ∣∣∣ ≤ N sup s≤r≤t |∇ur(x)| sup |r1−r2 |≤ tM |x − Yr1(r2, x)| and ∫ ]s,t] ∣∣∣∣∣∣∣ M−1∑ n=0 1]tn,tn+1](r) ( σi·r (x) − σ i· r (Yr(tn, x)) ) ∂iutn(x) ∣∣∣∣∣∣∣ 2 ds ≤ N sup s≤r≤t |∇ur(x)|2 sup |r1−r2 |≤ tM |x − Yr1(r2, x)| 2. Owing to the joint continuity of Yt(s, x) in s and t and the dominated convergence theorem for stochastic integrals, we obtain dP − lim M→∞ M−1∑ n=0 (R(1)n + R (2) n ) = 0. (2.4.6) In a similar way, this time using the continuity of ∇ut(x) in t and the linear growth of b and σ, we find dP − lim M→∞ M−1∑ n=0 ( − ∫ ]tn,tn+1] bir(x)∂iutn(x)dr − ∫ ]tn,tn+1] σiϱr (x)∂iutn(x)dw ϱ r ) = − ∫ ]s,t] br(x)∂iur(x)dr − ∫ ]s,t] σϱr (x)∂iur(x)dw ϱ r . For all n, we have −(x − Ytn+1(tn, x)) ∗Θn(x)(x − Ytn+1(tn, x)) = S (1) n + S (2) n , 30 Chapter 2. Properties of space inverses of stochastic flows where S (1)n (t, x) has only drdr and drdw ϱ r terms and where S (2)n : = − 1 2 (∫ ]tn,tn+1] σiϱr (Yr(tn, x))dw ϱ r ) ∂i jutn(x) (∫ ]tn,tn+1] σ jϱr (Yr(tn, x))dw ϱ r ) − (∫ ]tn,tn+1] σiϱr (Yr(tn, x))dw ϱ r ) ( Θi jn (x) − 1 2 ∂i jutn(x) ) (∫ ]tn,tn+1] σ jϱr (Yr(tn, x))dw ϱ r ) . Since ∣∣∣∣∣Θ i j n (x) − 1 2 ∂i jutn(x) ∣∣∣∣∣ = ∣∣∣∣∣∣ ∫ 1 0 (1 − θ)(∂i jutn(x + θ(Ytn+1(tn, x) − x)) − ∂i jutn(x))dθ ∣∣∣∣∣∣ ≤ N sup |r1−r2 |≤ tM ,θ∈(0,1) |∂i jur1(x + θ(Yr2(r1, x) − x)) − ∂i jur1(x))|, proceeding as in the derivation of (2.4.6) and using the joint continuity of ∂i jut(x) in t and x, the continuity of Yt(s, x) in s and t, the explicit form of the quadratic variation of the stochastic integral (i.e. part (5) of Chapter 2, Section 2, Theorem 2 in [LS89]), and the stochastic dominated convergence theorem, we obtain dP − lim M→∞ M−1∑ n=0 S (2)n = − 1 2 ∫ ]0,t] σiϱr (x)σ jϱ r (x)∂i jur(x)dr. Similarly, making use of the properties stated in Theorem 2.2.1(2), we have dP − lim M→∞ M−1∑ n=0 S (1)n = 0, which completes the proof of part (1). The proof of part (2) is similar to the proof of part (1), so we proceed to the proof of part (3). We know that for all n, Ytn+1(x) = Ytn+1(tn,Ytn(x)). Thus, for all n, we have utn+1(x) = utn(Y −1 tn+1(tn, x)), and hence by the chain rule, ∇utn+1(x) = ∇utn(Y −1 tn+1(tn, x))∇Y −1 tn+1(tn, x). (2.4.7) By (2.4.7) and Taylor’s theorem, for all n, we have Cn = (∇utn+1(x) − ∇utn(x))(x − Ytn+1(tn, x)) = ∇utn(Y −1 tn+1(tn, x))(∇Y −1 tn+1(tn, x) − Id)(x − Ytn+1(tn, x)) +(Y−1tn+1(tn, x) − x) ∗Θ˜n(x)(x − Ytn+1(tn, x)) =: En + Fn, where Θ˜i jn (x) := ∫ 1 0 ∂i jutn(x + θ(Y −1 tn+1(tn, x) − x))dθ. 2.4. Classical solutions of degenerate SPDEs: Proof of Theorem 2.2.4 31 Using Itô’s formula, for all n, we get (see, also, Lemma 3.12 in [Kun04]), ∇Ytn+1(tn, x) −1 = Id − ∫ ]tn,tn+1] ∇Yr(tn, x)−1∇σϱr (Yr(tn, x))dw ϱ r + ∫ ]tn,tn+1] ∇Yr(tn, x)−1 ( ∇σϱr (Yr(tn, y))∇σ ϱ r (Yr(tn, x)) − ∇br(Yr(tn, x)) ) dr, and hence ∇Y−1tn+1(tn) − Id = ∇Y −1 tn+1(tn,Y −1 tn+1(tn, x)) − Id =: G (1) tn,tn+1(Y −1 tn+1(tn, x)) + G (2) tn,tn+1(Y −1 tn+1(tn, x)), where for y ∈ Rd, G(1)tn,tn+1(y) := ∫ ]tn,tn+1] ∇Yr(tn, z)−1 ( ∇σϱr (Yr(tn, y))∇σ ϱ r (Yr(tn, y)) − ∇br(Yz(tn, y)) ) dr and G(2)tn,tn+1(z) := − ∫ ]tn,tn+1] ∇Yr(tn, y)−1∇σϱr (Yr(tn, y))dw ϱ r . By the Burkholder-Davis-Gundy inequality, Hölder’s inequality, and the inequalities (2.3.2), (2.3.14), and (2.3.15), for all p ≥ 2, there is a constant N = N(p, d,N0,T ) such that for all x1 and x2, E [ |G(2)tn,tn+1(x1)| p ] ≤ NM−p/2+1 ∫ ]tn,tn+1] E [ |∇Yr(tn, x1)−1|p|∇σr(Yr(tn, x1)|p ] dr ≤ NM−p/2 and E [ |G(2)tn,tn+1(x1) −G (2) tn,tn+1(x2)| p ] ≤ NM−p/2+1 ∫ ]tn,tn+1] E [ |∇Yr(tn, x1)−1 − ∇Yr(tn, x2)−1|p ] dr +NM−p/2+1 ∫ ]tn,tn+1] ( E [ |∇Yr(tn, x1)−1|2p ])1/2 ( E [ |Yr(tn, x1) − Yr(tn, x2)|2p ])1/2 dr ≤ NM−p/2|x − y|p. Thus, by Corollary 2.5.3, we obtain that for all p ≥ 2, ϵ > 0, and δ < 1, there is a constant N = N(p, d, δ,N0,T ) such that E [ |r−ϵG(2)tn,tn+1 | p δ ] ≤ NM−p/2. (2.4.8) For all n, we have En = ∇utn(Y −1 tn+1(tn, x))G (1) tn,tn+1(Y −1 tn+1(tn, x))(x − Ytn+1(tn, x)) + ∇utn(x)G (2) tn,tn+1(x)(x − Ytn+1(tn, x)) 32 Chapter 2. Properties of space inverses of stochastic flows + ∇utn(Y −1 tn+1(tn, x))(G (2) tn,tn+1(Y −1 tn+1(tn, x)) −G (2) tn,tn+1(x))(x − Ytn+1(tn, x)) + (∇utn(Y −1 tn+1(tn, x) − ∇utn(x))G (2) tn,tn+1(x)(x − Ytn+1(tn, x)) One can easily check that dP − lim M→∞ M−1∑ n=0 ∇utn(Y −1 tn+1(tn, x))G (1) tn,tn+1(Y −1 tn+1(tn, x))(x − Ytn+1(tn, x)) = 0. (2.4.9) Since ∇ut(x) is jointly continuous in t and x and Y−1t (s, x) is jointly in s and t, we have dP − lim M→∞ sup n |∇utMn (Y −1 tMn+1 (tMn , x)) − ∇utMn (x)| = 0. Moreover, using Hölder’s inequality, (2.4.8), and (2.3.1), we get sup M E M−1∑ n=0 |G(2)tn,tn+1(x)|x − Ytn+1(tn, x)| < ∞, and hence dP − lim M→∞ M−1∑ n=0 (∇utn(Y −1 tn+1(tn, x)) − ∇utn(x))G (2) tn,tn+1(x)(x − Ytn+1(tn, x)) = 0. (2.4.10) We claim that dP − lim M→∞ M−1∑ n=0 ∇utn(Y −1 tn+1(tn, x)) ( G(2)tn,tn+1(Y −1 tn+1(tn, x)) −G (2) tn,tn+1(x) ) (x − Ytn+1(tn, x)) = 0. (2.4.11) Set JM = M−1∑ n=0 |G(2)tn,tn+1(Y −1 tn+1(tn, x)) −G (2) tn,tn+1(x)|x − Ytn+1(tn, x)|. For all δ¯, ϵ ∈ (0, 1), we have P(JM > δ¯) ≤ P ( JM > δ¯, max n |Y−1tn+1(tn, x) − x| ≤ ϵ ) + P ( max n |Y−1tn+1(tn, x) − x| > ϵ ) . By virtue of (2.4.8), there is a deterministic constant N = N(x) independent of M such that for all ω ∈ V M := {maxn |Y−1tn+1(tn, x) − x| ≤ ϵ}, JM ≤ Nϵδ M−1∑ n=0 [r−ϵ1 G (2) tn,tn+1]δ|x − Ytn+1(tn, x)|, 2.4. Classical solutions of degenerate SPDEs: Proof of Theorem 2.2.4 33 which implies that E1V M J M ≤ NϵδE M−1∑ n=0 ( [r−ϵ1 G (2) tn,tn+1] 2 δ + |x − Ytn+1(tn, x)| 2 ) ≤ Nϵδ M−1∑ n=0 M−1 ≤ Nϵδ. Applying Markov’s inequality, we derive P(JM | > δ¯, max n |Y−1tn+1(tn, x) − x| ≤ ϵ) ≤ N ϵδ δ¯ , and hence for all δ¯ > 0, lim M→∞ P(JM > δ¯) = 0, which yields (2.4.11). Owing to (2.4.9), (2.4.10), and (2.4.11) we have dP − lim M→∞ M−1∑ n=0 En = lim M→∞ M−1∑ n=0 ∇utn(x)G (2) tn,tn+1(x)(x − Ytn+1(tn, x)). Proceeding as in the proof of part (1) of the claim, we obtain lim M→∞ M−1∑ n=0 Kn = lim M→∞ M−1∑ n=0 ∇utn(x) ∫ ]tn,tn+1] (∇Yr(tn, x)−1 − Id)∇σϱr (x)dW ϱ r ∫ ]tn,tn+1] σϱr (x)dW ϱ r + lim M→∞ M−1∑ n=0 ∇utn(x) ∫ ]tn,tn+1] ∇σϱr (x)dw ϱ r ∫ ]tn,tn+1] σϱr (x)dw ϱ r = ∫ ]s,t] σ jϱr (x)∂ jσ iϱ r (x)∂iur(x)dr. (2.4.12) It is easy to check that for all n, Fn = (Y −1 tn+1(tn, x) − x) ∗Θ˜n(x)(x − Ytn+1(tn, x)) =: (G(3)tn,tn+1(Y −1 tn+1(tn, x)) + G (4) tn,tn+1(Y −1 tn+1(tn, x))) ∗Θ˜n(x)(x − Ytn+1(tn, x)), where for y ∈ Rd, G(3)tn,tn+1(y) := − ∫ ]tn,tn+1] br(Yr(tn, y))dr, G (4) tn,tn+1(y) := − ∫ ]tn,tn+1] σϱr (Yr(tn, y))dw ϱ r . 34 Chapter 2. Properties of space inverses of stochastic flows Arguing as in the proof of (2.4.12), we get dP − lim M→∞ M−1∑ n=0 Fn = ∫ ]s,t] σiϱr (x)σ jϱ r (x)∂i jur(x)dt, which completes the proof of the claim. □ By virtue of (2.4.5) and Claim 2.4.1, for all s and t with s ≤ t and x, P-a.s. ut(x) = x + ∫ ]s,t] ( 1 2 σiϱr (x)σ jϱ(x)∂i jur(x) − bˆit(x)∂iur(x) ) dr − ∫ ]s,t] σiϱr (x)∂iur(x)dw ϱ r . (2.4.13) Owing to Theorem 2.2.1, u = ut(x) has a modification that is jointly continuous in s and t and twice continuously differentiable in x. It is easy to check that the Lebesgue integral on the right-hand-side of (2.4.13) has a modification that is continuous in s, t, and x. Thus, the stochastic integral on the right-hand-side of (2.4.13) has a modification that is continuous in s, t, and x, and hence the equality in (2.4.13) holds P-a.s. for all s and t with s ≤ t and x. This proves that Y−1(τ) = Y−1t (τ, x) solves (2.1.2). However, if u 1(τ), u2(τ) ∈ Cβ ′ cts(R d; Rd) are solutions of (2.1.2), then applying the Itô-Wentzell formula (see, e.g. Theorem 9 in Chapter 1, Section 4.8 in [Roz90]), we get that P-a.s. for all t and x, u1t (τ,Yt(τ, x)) = x = u 2 t (τ,Yt(τ, x)), which implies that P-a.s. for all t and x, u1(τ) = Y−1t (τ, x) = u 2(τ). Thus, Y−1(τ) = Y−1t (τ, x) is the unique solution of (2.1.2) in Cβ ′ cts(R d; Rd). □ 2.5 Appendix Let V be an arbitrary Banach space. The following lemma and its corollaries are indis- pensable in this chapter. Lemma 2.5.1. Let Q ⊆ Rd be an open bounded cube, p ≥ 1, δ ∈ (0, 1], and f be a V-valued integrable function on Q such that [ f ] δ;p;Q;V := (∫ Q ∫ Q | f (x) − f (y)|pV |x − y|2d+δp dxdy )1/p < ∞. Then f has a Cδ(Q; V)-modification and there is a constant N = N(d, δ, p) independent of f and Q such that [ f ]δ;Q;V ≤ N [ f ] δ,p;Q;V 2.5. Appendix 35 and sup x∈Q | f (x)|V ≤ N|Q|δ/d[ f ]δ;p;Q;V + |Q|−1/p (∫ Q | f (x)|pVdx )1/p , where |Q| is the volume of the cube. Proof. If V = R, then the existence of a continuous modification of f and the estimate of [ f ] δ;Q follows from Lemma 2 and Exercise 5 in Chapter 10, Section 1, in [Kry08]. The proof for a general Banach space is the same. For all x ∈ Q, we have | f (x)|V ≤ 1 |Q| ∫ Q | f (x) − f (y)|Vdy + 1 |Q| ∫ Q | f (y)|Vdy ≤ N 1 |Q| [ f ] δ,p;Q ∫ Q |x − y|δdy + 1 |Q| ∫ Q | f (y)|Vdy ≤ N |Q|δ/d[ f ]δ,p;Q + |Q|−1/p (∫ Q | f (y)|pVdy )1/p , which proves the second estimate. □ The following is a direct consequence of Lemma 2.5.1. Corollary 2.5.2. Let p ≥ 1, δ ∈ (0, 1], and f be a V-valued function on Rd such that | f |δ;p;V := (∫ Rd | f (x)|pVdx + ∫ |x−y|<1 | f (x) − f (y)|pV |x − y|2d+δp dxdy )1/p < ∞. Then f has a Cδ(Rd; V)-modification and there is a constant N = N(d, δ, p) independent of f such that | f |δ;V ≤ N | f |δ;p;V . Corollary 2.5.3. Let X be a V-valued random field defined on Rd. Assume that for some p ≥ 1, l ≥ 0, and β ∈ (0, 1] with βp > d there is a constant N¯ > 0 such that for all x, y ∈ Rd, E [ |X(x)|pV ] ≤ N¯r1(x)lp (2.5.1) and E [ |X(x) − X(y)|pV ] ≤ N¯[r1(x)lp + r1(y)lp]|x − y|βp. (2.5.2) Then for all δ ∈ (0, β − dp ) and ϵ > d p , there exists a C δ(Rd; V)-modification of r−(l+ϵ)1 X and a constant N = N(d, p, δ, ϵ) such that E [ |r−(l+ϵ)1 X| p δ ] ≤ NN¯. Proof. Fix δ ∈ (0, β − dp ) and ϵ > d p . Owing to (2.5.1), there is a constant N = N(d, p, N¯ 36 Chapter 2. Properties of space inverses of stochastic flows , δ, ϵ) such that ∫ Rd E [ |r1(x)−(l+ϵ)X(x)| p V ] dx ≤ N¯ ∫ Rd r1(x) −pϵdx ≤ NN¯. Appealing to (2.5.2) and (2.3.19), we find that there is a constant N = N(d, p, δ, ϵ) such that ∫ |x−y|<1 E [ |r1(x)−(l+ϵ)X(x) − r1(y)−(l+ϵ)X(y)| p V ] |x − y|2d+δp dxdy ≤ N¯ ∫ |x−y|<1 r1(x)−pϵ + r1(y)−pϵ |x − y|2d−(β−δ)p dxdy + N¯ ∫ |x−y|<1 r1(y)pl|r1(x)−(l+ϵ) − r1(y)−(l+ϵ)|p |x − y|2d+δp dxdy ≤ NN¯ + NN¯ ∫ |x−y|<1 r1(x)−p(1+ϵ) + r1(y)−p(1+ϵ) |x − y|2d−(1−δ)p dxdy ≤ NN¯. Therefore, E[r−(l+ϵ)1 X] p δ,p ≤ NN¯, and hence, by Corollary 2.5.3, r −(l+ϵ) 1 X has a C δ(Rd; V)- modification and the estimate follows immediately. □ Corollary 2.5.4. Let (X(n))n∈N be a sequence of V-valued random fields defined on Rd. Assume that for some p ≥ 1, l ≥ 0 and β ∈ (0, 1], with βp > d there is a constant N¯ > 0 such that for all x, y ∈ Rd and n ∈ N, E [ |X(n)(x)|pV ] ≤ N¯r1(x)lp and E [ |X(n)(x) − X(n)(y)|pV ] ≤ N¯(r1(x)lp + r1(y)lp)|x − y|βp. Moreover, assume that for all x ∈ Rd, limn→∞ E [ |X(n)(x)|p ] = 0. Then for all δ ∈ (0, β − dp ) and ϵ > dp , lim n→∞ E [ |r−(l+ϵ)1 X (n)|p δ ] = 0. Proof. Fix δ ∈ (0, β− dp ) and ϵ > d p . Using the Lebesgue dominated convergence theorem, we attain lim n→∞ ∫ Rd E [ |r1(x)−(l+ϵ)X(n)(x)| p V ] dx = 0, and therefore for all ζ ∈ (0, 1), lim n ∫ ζ<|x−y|<1 E [ |r1(x)−(l+ϵ)Xn(x) − r1(y)−(l+ϵ)Xn(y)| p V ] |x − y|2d+δp dxdy = 0. 2.5. Appendix 37 By repeating the proof of Corollary 2.5.3, we obtain that there is a constant N such that ∫ |x−y|≤ζ E [ |r1(x)−(l+ϵ)X(n)(x) − r1(y)−(l+ϵ)X(n)(y)| p V ] |x − y|2d+δp dxdy ≤ N¯ ∫ |x−y|≤ζ r1(x)−pϵ + r1(y)−pϵ |x − y|2d+(δ−β)p dxdy + N¯ ∫ |x−y|≤ζ r1(x)−p(1+ϵ) + r1(y)−p(1+ϵ) |x − y|2d+(δ−1)p dxdy ≤ N¯ζβp−δp−d. Therefore, limn→∞ E [ [r−(l+ϵ)1 X] p δ,p ] = 0, and the statement is confirmed. □ Chapter 3 The method of stochastic characteristics for parabolic SIDEs 3.1 Introduction Let (Ω,F ,P) be a complete probability space and F˜0 be a sub-sigma-algebra of F . We assume that this probability space supports a sequence w1;ϱt , t ≥ 0, ϱ ∈ N, of independent one-dimensional Wiener processes and a Poisson random measure p1(dt, dz) on (R+ × Z1, B(R+)⊗Z1) with intensity measure π1(dz)dt, where (Z1,Z1, π1) is a sigma-finite measure space. We also assume that (w1;ϱt )ϱ∈N and p 1(dt, dz) are independent of F0. Let F = (Ft)t≥0 be the standard augmentation of the filtration (F¯t)t≥0, where for each t ≥ 0, F¯t = σ ( F˜0, (w1;ϱs )ϱ∈N, p 1([0, s],Γ) : s ≤ t, Γ ∈ Z1 ) . For each real number T > 0, we let RT , OT , and PT be the F-progressive, F-optional, and F-predictable sigma-algebra on Ω×[0,T ], respectively. Denote by q1(dt, dz) = p1(dt, dz)− π1(dz)dt the compensated Poisson random measure. Let D1, E1,V1 ∈ Z be disjoint Z1- measurable subsets such that D1 ∪ E1 ∪ V1 = Z1 and π(V1) < ∞. Let (Z2,Z2, π2) be a sigma-finite measure space and D2, E2 ∈ Z2 be disjoint Z2-measurable subsets such that D2 ∪ E2 = Z2. Fix an arbitrary positive real number T > 0 and integers d1, d2 ≥ 1. Let α ∈ (0, 2] and τ ≤ T be a stopping time. Let Fτ be the stopping time sigma-algebra associated with τ and let φ : Ω × Rd1 → Rd2 be Fτ ⊗ B(Rd1)-measurable. We consider the system of SIDEs on [0,T ] × Rd1 given by dult = ( (L1;lt +L 2;l t )ut + 1[1,2](α)b i t∂iu l t + c ll¯ t u l¯ t + f l t ) dt + ( N1;lϱt ut + g lϱ t ) dw1;ϱt + ∫ Z1 ( I1;lt,z ut− + h l t(z) ) [1D1(z)q 1(dt, dz) + 1E1∪V1(z)p 1(dt, dz)], τ ≤ t ≤ T, ult = φ l, t ≤ τ, l ∈ {1, . . . , d2}, (3.1.1) where for ϕ ∈ C∞c (R d1; Rd2), k ∈ {1, 2}, and l ∈ {1, . . . , d2}, Lk;lt ϕ(x) : = 1{2}(α) 1 2 σ k;iϱ t (x)σ k; jϱ t (x)∂i jϕ l(x) + 1{2}(α)σ k;iϱ t (x)υ k;ll¯ϱ t (x)∂iϕ l¯(x) 39 40 Chapter 3. The method of stochastic characteristics for parabolic SIDEs + ∫ Dk ρ k;ll¯ t (x, z) ( ϕl¯(x + Hkt (x, z)) − ϕ l¯(x) ) πk(dz) + ∫ Dk ( ϕl(x + Hkt (x, z)) − ϕ l(x) − 1(1,2](α)H k;i t (x, z)∂iϕ l(x) ) πk(dz) + 1{2}(k) ∫ E2 ( (Ill¯d2 + ρ 2;ll¯ t (x, z))ϕ l¯(x + H2t (x, z)) − ϕ l(x) ) π2(dz), N1;lϱt ϕ(x) : = 1{2}(α)σ 1;iϱ t (x)∂iϕ l(x) + υ1;ll¯ϱt (x)ϕ l¯(x), ϱ ∈ N, I1;lt,zϕ(x) : = (I ll¯ d2 + ρ 1;ll¯ t (x, z))ϕ l¯(x + H1t (x, z)) − ϕ l(x), and ∫ Dk ( |Hkt (x, z)| α + |ρkt (x, z)| 2 ) πk(dz) + ∫ Ek ( |Hkt (x, z)| 1∧α + |ρkt (x, z)| ) πk(dz) < ∞. The summation convention with respect to repeated indices i, j ∈ {1, . . . , d1},l¯ ∈ {1, . . . , d2}, and ϱ ∈ N is used here and below. The d2 × d2 dimensional identity matrix is denoted by Id2 . For a subset A of a larger set X, 1A denotes the {0, 1}-valued function taking the value 1 on the set A and 0 on the complement of A. We assume that for each k ∈ {1, 2}, σkt (x) = (σ k;iϱ t (ω, x))1≤i≤d1, ϱ∈N, bt(x) = (b i t(ω, x))1≤i≤d1 , ct(x) = (c ll¯ t (ω, x))1≤l,l¯≤d2 , υkt (x) = (υ k;ll¯ϱ t (ω, x))1≤l,l¯≤d2, ϱ∈N, ft(x) = ( f i t (ω, x))1≤i≤d2 , gt(x) = (g iϱ t (ω, x))1≤i≤d2, ϱ∈N, are random fields on Ω× [0,T ]×Rd1 that are RT ⊗B(Rd1)-measurable. For each k ∈ {1, 2}, we assume that Hkt (x, z) = (H k;i t (ω, x, z))1≤i≤d1 , ρ k t (x, z) = (ρ k;ll¯ t (ω, x, z))1≤l,l¯≤d2 , are random fields on Ω × [0,T ] × Rd1 × Zk that are PT ⊗ B(Rd1) ⊗Zk-measurable. More- over, we assume that ht(x, z) = (hit(ω, x, z))1≤i≤d2 is a random field on Ω × [0,T ] × R d1 that is PT ⊗ B(Rd1)-measurable. Systems of linear SIDEs appear in many contexts. They may be considered as extensions of both first-order symmetric hyperbolic systems and lin- ear fractional advection-diffusion equations. The equation (3.1.1) also arises in non-linear filtering of semimartingales as the equation for the unormalized filter of the signal (see, e.g., [Gri76] and [GM11]). Moreover, (3.1.1) is intimately related to linear transforma- tions of inverse flows of jump SDEs and it is precisely this connection that we will exploit to obtain solutions. Systems of linear SIDEs appear in many contexts. They may be considered as ex- tensions of both first-order symmetric hyperbolic systems and linear fractional advec- tion-diffusion equations. The equation (3.1.1) also arises in non-linear filtering of semi- martingales as the equation for the unormalized filter of the signal (see, e.g., [Gri76] and 3.1. Introduction 41 [GM11]). Moreover, (3.1.1) is intimately related to linear transformations of inverse flows of jump SDEs and it is precisely this connection that we will exploit to obtain solutions. There are various techniques available to derive the existence and uniqueness of clas- sical solutions of linear parabolic SPDEs and SIDEs. One approach is to develop a theory of weak solutions for the equations (e.g. variational, mild solution, or etc...) and then study further regularity in classical function spaces via an embedding theorem. We refer the reader to [Par72, Par75, MP76, KR77, Tin77a, Gyö82, Wal86, DPZ92, Kry99, CK10, PZ07, Hau05, RZ07, BvNVW08, HØUZ10, LM14b] for more information about weak solutions of SPDEs driven by continuous and discontinuous martingales and martingale measures. This approach is especially important in the non-degenerate setting where some smoothing occurs and has the obvious advantage that it is broader in scope. Another ap- proach is to regard the solution as a function with values in a probability space and use the method of deterministic PDEs (i.e. Schauder estimates, see, e.g. [Mik00, MP09]). A third approach is a direct one that uses solutions of stochastic differential equations. The direct method allows to obtain classical solutions in the entire Hölder scale while not restricting to integer derivative assumptions for the coefficients and data. In this chapter, we derive the existence of a classical solutions of (3.1.1) with regular coefficients using a Feynman-Kac-type transformation and the interlacing of the space-inverse (first integrals [KR81]) of a stochastic flow associated with the equation. The construction of the solution gives an insight into the structure of the solution as well. We prove that the solution of (3.1.1) is unique in the class of classical solutions with polynomial growth (i.e. weighted Hölder spaces). As an immediate corollary of our main result, we obtain the ex- istence and uniqueness of classical solutions of linear partial integro-differential equations (PIDEs) with random coefficients, since the coefficients σ1, H1, a1, ρ1, and free terms g and h can be zero. Our work here directly extends the method of characteristics for de- terministic first-order PDEs and the well-known Feynman-Kac formula for deterministic second-order PDEs. In the continuous case (i.e. H1 ≡ 0,H2 ≡ 0, h ≡ 0), the classical solution of (3.1.1) was constructed in [KR81, Kun81, Kun86a, Roz90] (see references therein as well) us- ing the first integrals of the associated backward SDE. This method was also used to obtain classical solutions of (3.1.1) in [DPMT07]. In the references above, the forward Liouville equation for the first integrals of associated stochastic flow was derived directly. However, since the backward equation involves a time reversal, the coefficients and in- put functions are assumed to be non-random. The generalized solutions of (3.1.1) with d2 = 1, non-random coefficients, non-degenerate diffusion, and finite measures π1 = π2 were discussed in [MB07]. In this chapter, we give a direct derivation of (3.1.1) and all the equations considered are forward, possibly degenerate, and the coefficients and input functions are adapted. For other interesting and related developments, we refer the reader 42 Chapter 3. The method of stochastic characteristics for parabolic SIDEs to [Pri12, Zha13, Pri14], which all concern the fascinating regularizing property of noise in the case of non-degenerate noise. This chapter is organized as follows. In Section 3.2, our notation is set forth and the main results are stated. In Section 3.3, the main theorems are proved. In Section 3.4, the appendix, auxiliary facts used throughout the chapter are discussed. 3.2 Statement of main results In this chapter, elements of Rd1 and Rd2 are understood as column vectors and elements of Rd 2 1 and Rd 2 2 are understood as matrices of dimension d1 × d1 and d2 × d2, respectively. We also adopt the notation of Chapter 2. If we do not specify to which space the parameters ω, t, x, y, z and n belong, then we mean ω ∈ Ω, t ∈ [0,T ], x, y ∈ Rd1 , z ∈ Zk, and n ∈ N. Let us introduce some regularity conditions on the coefficients and free terms. We consider these assumptions for β¯ > 1 ∨ α and β˜ > α. Assumption 3.2.1 (β¯). (1) There is a constant N0 > 0 such that for each k ∈ {1, 2} and all ω, t ∈ Ω × [0,T ], |r−11 bt|0 + |∇bt|β¯−1 + |r −1 1 σ k t |0 + |∇σ k t |β¯−1 ≤ N0. Moreover, for each k ∈ {1, 2} and all (ω, t, z) ∈ Ω × [0,T ] × (Dk ∪ Ek), |r−11 H k t (z)|0 ≤ K k t (z) and |∇H k t (z)|β¯−1 ≤ K¯ k t (z) where Kk, K¯k : Ω × [0,T ] × (Dk ∪ Ek) → R+ are PT ⊗ Zk-measurable functions satisfying Kkt (z) + K¯ k t (z) + ∫ Dk ( Kkt (z) α + K¯kt (z) 2 ) πk(dz) + ∫ Ek ( Kkt (z) 1∧α + K¯kt (z) ) πk(dz) ≤ N0, for all (ω, t, z) ∈ Ω × [0,T ] × (Dk ∪ Ek). (2) There is a constant η ∈ (0, 1) such that for all (ω, t, x, z) ∈ {(ω, t, x, z) ∈ Ω × [0,T ] × Rd1 × (Dk ∪ Ek) : |∇Hkt (ω, x, z)| > η}, ∣∣∣∣ ( Id1 + ∇H k t (x, z) )−1∣∣∣∣ ≤ N0. Assumption 3.2.2 (β˜). There is a constant N0 > 0 such that for each k ∈ {1, 2} and all (ω, t) ∈ Ω × [0,T ], |ct|β˜ + |υ k t |β˜ + |r −θ 1 ft|β˜ + |r −θ 1 gt|β˜ ≤ N0. 3.2. Statement of main results 43 Moreover, for each k ∈ {1, 2} and all (ω, t, z) ∈ Ω × [0,T ] × (Dk ∪ Ek), |ρkt (z)|β˜ ≤ l k t (z), |r −θ 1 ht(z)|β˜ ≤ l k t (z), where lk : Ω × [0,T ] × Zk → R+ are PT ⊗Zk-measurable function satisfying lkt (z) + ∫ Dk lkt (z) 2πk(dz) + ∫ Ek lkt (z)π k(dz) ≤ N0, for all (ω, t, z) ∈ Ω × [0,T ] × (Dk ∪ Ek). Remark 3.2.1. It follows from Lemma 3.4.10 and Remark 3.4.11 that if Assumption 3.2.1(β¯) holds for some β¯ > 1 ∨ α, then for all ω, t, and z ∈ Dk ∪ Ek, x 7→ H˜kt (x, z) := x + Hkt (x, z) is a diffeomorphism. Let Assumptions 3.2.1(β¯) and 3.2.2(β˜) hold for some β¯ > 1 ∨ α and β˜ > α. In our derivation of a solution of (3.1.1), we first obtain a solution of an equation of a special form. Specifically, consider the system of SIDEs on [0,T ] × Rd1 given by duˆlt = ( (L1;lt +L 2;l t )uˆt + bˆ i t∂iu l t + cˆ ll¯ t u l¯ t + fˆ l t ) dt + ( N1;lϱt uˆt + g lϱ t ) dw1;ϱt + ∫ Z1 ( I1;lt,z uˆt− + h l t(z) ) [1D1(z)q 1(dt, dz) + 1E1(z)p 1(dt, dz)], τ < t ≤ T, uˆlt = φ l, t ≤ τ, l ∈ {1, . . . , d2}, (3.2.1) where bˆit(x) : = 1[1,2](α)b i t(x) + 2∑ k=1 1{2}(α)σ k; jϱ t (x)∂ jσ k;iϱ t (x) + 2∑ k=1 1(1,2](α) ∫ Dk ( Hk;it (x, z) − H k;i t (H˜ k;−1 t (x, z), z) ) πk(dz), cˆll¯t (x) : = c ll¯ t (x) + 2∑ k=1 1{2}(α)σ k; jϱ t (x)∂ jυ k;ll¯ϱ t (x) + 2∑ k=1 ∫ Dk (ρk;ll¯t (x, z) − ρ k;ll¯ t (H˜ k;−1 t (x, z), z))π k(dz), fˆ lt (x) : = f l t (x) + σ 1; jϱ t (x)∂ jg lϱ t (x) + ∫ D1 ( hlt(x, z) − h l t(H˜ 1;−1 t (x, z), z) ) π1(dz). Let (w2;ϱt )ϱ∈N, t ≥ 0, ϱ ∈ N, be a sequence of independent one-dimensional Wiener processes. Let p2(dt, dz) be a Poisson random measure on ([0,∞) × Z2,B([0,∞) ⊗ Z2) with intensity measure π2(dz)dt. Extending the probability space if necessary, we take w2 44 Chapter 3. The method of stochastic characteristics for parabolic SIDEs and p2(dt, dz) to be independent of w1 and p1(dt, dz). Let Fˆt = σ ( (w2;ϱs )ϱ∈N, p 2([0, s],Γ) : s ≤ t, Γ ∈ Z2 ) and F˜ = ( F˜t ) t≤T be the standard augmentation of ( Ft ∨ Fˆt ) t≤T . Denote by q2(dt, dz) = p2(dt, dz) − π2(dz)dt the compensated Poisson random measure. We associate with the SIDE (3.2.1), the F˜-adapted stochastic flow Xt = Xt(x) = Xt(τ, x), (t, x) ∈ [0,T ] × Rd1 , generated by the SDE dXt = −1[1,2](α)bt(Xt)dt + 2∑ k=1 1{2}(α)σ k;ϱ t (Xt)dw k;ϱ t − 2∑ k=1 ∫ Dk Hkt (H˜ k;−1 t (Xt−, z), z)[p k(dt, dz) − 1(1,2](α)πk(dz)dt] − 2∑ k=1 ∫ Ek Hkt (H˜ k;−1 t (Xt−, z), z)p k(dt, dz), τ < t ≤ T, Xt = x, t ≤ τ, (3.2.2) and the F˜-adapted random field Φt(x) = Φt(τ, x), (t, x) ∈ [0,T ] × Rd1 , solving the linear SDE given by dΦt(x) = (ct(Xt(x))Φt(x) + ft(Xt(x))) dt + 2∑ k=1 υ k;ϱ t (Xt(x))Φt(x)dw k;ϱ t + g ϱ t (Xt(x))dw 1;ϱ t + 2∑ k=1 ∫ Zk ρkt (H˜ k;−1 t (Xt−(x), z), z)Φt−(x)[1Dk(z)q k(dt, dz) + 1Ek(z)p k(dt, dz)] + ∫ Z1 ht(H˜ 1;−1 t (Xt−(x), z), z)[1D1(z)q 1(dt, dz) + 1E1(z)p 1(dt, dz)], τ < t ≤ T, Φt(x) = φ(x), t ≤ τ. The coming theorem is our existence, uniqueness, and representation theorem for (3.2.1). Let us describe our solution class. For each β′ ∈ (0,∞), denote by Cβ ′ (Rd1; Rd2) the linear space of all F-adapted random fields v = vt(x) such that P-a.s. 1[τn,τn+1)r −λn 1 v ∈ D([0,T ];C β′(Rd1 ,Rd2)), where (τn)n≥0 is an increasing sequence of F-stopping times with τ0 = 0 and τn = T for sufficiently large n, and where for each n, λn is a positive Fτn-measurable random variable. Theorem 3.2.2. Let Assumptions 3.2.1(β¯) and 3.2.2(β˜) hold for some β¯ > 1∨α and β˜ > α. For any stopping time τ ≤ T and Fτ ⊗ B(Rd1)-measurable random field φ such that for 3.2. Statement of main results 45 some β′ ∈ (α, β¯ ∧ β˜) and θ′ ≥ 0, P-a.s. r−θ ′ 1 φ ∈ C β′(Rd1; Rd2), there exists a unique solution uˆ = uˆ(τ) of (3.2.1) in Cβ ′ (Rd1; Rd2) and for all (t, x) ∈ [0,T ] × Rd1 , P-a.s. uˆt(τ, x) = E [ Φt(τ, X −1 t (τ, x))|Ft ] . (3.2.3) Moreover, for all ϵ > 0 and p ≥ 2, there is a constant N = N(d1, d2, p,N0,T, β′, η1, η2, ϵ, θ, θ′) such that E [ sup t≤T |r−θ∨θ ′−ϵ 1 uˆt(τ)| p β′ ∣∣∣Fτ ] ≤ N(|r−θ ′ 1 φ| p β′ + 1). (3.2.4) Using Itô’s formula, it is easy to verify that if m = 1 and gt(x) = 0, ht(x) = 0, and ρ k t (x, z) ≥ −1, for all (ω, t, x, z) ∈ Ω × [0,T ] × Rd1 × (Dk ∪ Ek), k ∈ {1, 2}, then Φt(x) = Ψt(x)ϕ(x) + Ψt(x) ∫ ]τ,τ∨t] Ψ−1s (x) fs(Xs(x))ds, (3.2.5) where P-a.s. for all t and x, Ψt(x) := eBt(x), and Bt(x) := ∫ [τ,τ∨t]  cs(Xs(x)) − 2∑ k=1 1 2 υk;ϱs (Xs(x))υ k;ϱ s (Xs(x))   ds + 2∑ k=1 ∫ ]τ,τ∨t] υk;ϱs (Xs(x))dw k;ϱ s − 2∑ k=1 ∫ ]τ,τ∨t] ∫ Dk ( ln ( 1 + ρks(H˜ k;−1 s (Xs−(x), z), z) ) − ρks(H˜ k;−1 s (Xs−(x), z), z) ) πk(dz)ds 2∑ k=1 ∫ ]τ,τ∨t] ∫ Zk ln ( 1 + ρks(H˜ k;−1 s (Xs−(x), z), z) ) [1Dk(z)q k(ds, dz) + 1Ek(z)p k(ds, dz)]. The following corollary then follows directly from (3.2.3) and (3.2.5). Corollary 3.2.3. Let m = 1 and assume that for each k ∈ {1, 2} and all (ω, t, x, z) ∈ Ω × [0,T ] × Rd1 × (Dk ∪ Ek), gt(x) = 0, ht(x, z) = 0, ρ k t (x, z) ≥ −1. Moreover, let Assumptions 3.2.1(β¯) and 3.2.2(β˜) hold for some β¯ > 1 ∨ α and β˜ > α. Let τ ≤ T be a stopping time and φ be a Fτ ⊗ B(Rd1)-measurable random field such that for some β′ ∈ (α, β¯ ∧ β˜) and θ′ ≥ 0, P-a.s. r−θ ′ 1 φ ∈ C β′(Rd1; Rd2). (1) If for all (ω, t, x) ∈ Ω × [0,T ] × Rd1 , ft(x) ≥ 0 and φ(x) ≥ 0, then the solution uˆ of (3.1.1) satisfies uˆt(x) ≥ 0, P-a.s. for all (t, x) ∈ [0,T ] × Rd1 . 46 Chapter 3. The method of stochastic characteristics for parabolic SIDEs (2) If for all (ω, t, x, z) ∈ Ω × [0,T ] × Rd1 × (Dk ∪ Ek), k ∈ {1, 2}, υkt (x) = 0, ft(x) ≤ 0, ct(x) ≤ 0, φ(x) ≤ 1, and ρkt (x, z) ≤ 0, then the solution uˆ of (3.1.1) satisfies uˆt(x) ≤ 1, P-a.s. for all (t, x) ∈ [0,T ] × Rd1 . Remark 3.2.4. Since L2 can be the zero operator, both Theorem 3.2.2 and Corollary 3.2.3 apply to fully degenerate SIDEs and PIDEs with random coefficients. Now, let us discuss our existence and uniqueness theorem for (3.1.1). We construct the solution of u = u(τ) of (3.1.1) by interlacing the solutions of (3.2.1) along a sequence of large jump moments (see Section 3.3.5). By using an interlacing procedure we are also able to drop the condition of boundedness of (I + ∇H1t (x, z)) −1 on the set (ω, t, x, z) ∈ {(ω, t, x, z) ∈ Ω × [0,T ] × Rd1 × (D1 ∪ E1) : |∇H1t (ω, x, z)| > η k}. Also, in order to remove the terms in bˆ, cˆ, and fˆ that appear in (3.2.1), but not in (3.1.1), we subtract terms from the relevant coefficients in the flow and the transformation. However, in order to do this, we need to impose stronger regularity assumptions on some of the coefficients and free terms. We will introduce parameters µ1, µ2, δ1, δ2 ∈ [0, α2 ] to the regularity assumptions. These parameters allows for a trade-off of integrability in z and regularity in x of the coefficients Hkt (x, z), ρ k t (x, z), h k t (x, z). It is worth mentioning that the removal of terms and the interlacing procedure are independent of each other and that it is due only to the weak assumptions on H1 and ρ1 on the set V1 that we do not have moment estimates and a simple representation property like (3.2.4) for the solution of (3.1.1). However, there is a representation of the solution and we refer the reader to the proof of the coming theorem for the description of the representation. We introduce the following assumptions for β¯ > 1∨α and β˜ > α. For each k ∈ {1, 2}, let D¯k be the trace sigma-algebra of Dk ∪Ek relative toZk andV1 be the trace sigma-algebra of V1 relative toZ1. Assumption 3.2.3 (β¯). (1) There is a constant N0 > 0 such that for each k ∈ {1, 2} and all (ω, t) ∈ Ω × [0,T ], |r−11 bt|0 + |∇bt|β¯−1 + |σ k t |β¯+1 ≤ N0. (2) For each k ∈ {1, 2} , there are real-numbers δk, µk ∈ [0, α2 ] such that for all (ω, t) ∈ Ω × [0,T ], |υkt |β¯+1 ≤ N0, if σ k t , 0, |gt|β¯+1 ≤ N0, if σ 1 t , 0, |Hkt (z)|0 ≤ K k t (z), |∇H k t (z)|β¯+δk−1 ≤ K¯ k t (z), ∀z ∈ D k, ∑ |γ|=[β¯+δk]− [∂γHkt (z)]{β¯+δk}+ ≤ K˜ k t (z), ∀z ∈ D k, |r−11 H k t (z)|0 ≤ K k t (z), |∇H k t (z)|β¯−1 ≤ K¯ k t (z), ∀z ∈ E k, 3.2. Statement of main results 47 |ρkt (z)|β¯+µk ≤ l k t (z), ∑ |γ|=[β¯+µk]− [∂γρkt (z)]{β¯+µk}+ ≤ l˜ k t (z), ∀z ∈ D k, |r−θ1 ht(z)|β¯+µ1 ≤ l 1 t (z), ∑ |γ|=[β¯+µ1]− [∂γht(z)]{β¯+µ1}+ , ∀z ∈ D 1, where Kk, K¯k, K˜k, lk, l˜k : Ω×[0,T ]×(Dk∪Ek)→ R+ arePT⊗D¯k-measurable functions satisfying for all (ω, t, z) ∈ Ω × [0,T ], Kkt (z) + K¯ k t (z) + l˜ k t (z) + l k t (z) + l˜ k t (z) ≤ N0, ∀z ∈ D k ∪ Ek, ∫ Dk ( Kkt (z) α + K¯kt (z) 2 + K˜kt (z) α α−δk + K˜kt (z) 2 + lkt (z) 2 + l˜kt (z) α α−µk + l˜kt (z) 2 ) πk ≤ N0(dz), ∫ Ek ( Kkt (z) 1∧α + K¯kt (z) ) πk(dz) ≤ N0. (3) There is a constant η ∈ (0, 1) such that for all (ω, t, x, z) ∈ {(ω, t, x, z) ∈ Ω × [0,T ] × Rd1 × Z2 : |∇H2t (ω, x, z)| > η}, ∣∣∣∣ ( Id1 + ∇H 2 t (x, z) )−1∣∣∣∣ ≤ N0. Assumption 3.2.4 (β˜). (1) There is a constant N0 > 0 such that for each k ∈ {1, 2} and all (ω, t) ∈ Ω × [0,T ], |ct|β˜ + |r −θ 1 ft|β˜ ≤ N0, |υkt |β˜ ≤ N0, if σ k t = 0, |gt|β˜ ≤ N0, if σ 1 t = 0, |ρk(t, z)|β˜ ≤ l k t (z), ∀z ∈ E k, |r−θ1 ht(z)|β˜ ≤ l 1 t (z), ∀z ∈ E 1, where for all (ω, t) ∈ Ω × [0,T ], ∫ Ek lkt (z)π k(dz) ≤ N0. (2) There exist processes ξ, ζ : Ω × [0,T ] × V1 → R+ that are PT ⊗ V1measurable and satisfy |r−ξt(z)1 H 1 t (z)|β˜∨1 + |r −ξt(z) 1 ρ 1 t (z)|β˜ + |r −ξt(z) 1 ht(z)|β˜ ≤ ζt(z), for all (ω, t, z) ∈ Ω × [0,T ] × V1. We now state our existence and uniqueness theorem for (3.1.1). Theorem 3.2.5. Let Assumptions 3.2.3(β¯) and 3.2.4(β˜) hold for some β¯ > 1∨α and β˜ > α. For any stopping time τ ≤ T and Fτ ⊗ B(Rd1)-measurable random field φ such that for some β′ ∈ (α, β¯ ∧ β˜) and θ′ ≥ 0, P-a.s. r−θ ′ 1 φ ∈ C β′(Rd1; Rd2), there exists a unique solution u = u(τ) of (3.1.1) in Cβ ′ (Rd1; Rd2). 48 Chapter 3. The method of stochastic characteristics for parabolic SIDEs 3.3 Proof of main theorems We will first prove uniqueness of the solution of (3.2.1) in the class Cβ ′ (Rd1; Rd2). The existence part of the proof of Theorem 3.2.2 is divided into a series of steps. In the first step, by appealing to the representation theorem we derived for solutions of continuous SPDEs shown in Theorem 2.4 in [LM14c], we use an interlacing procedure and the strong limit theorem given in Theorem 2.3 in [LM14c] to show that the space inverse of the flow generated by a jump SDE (i.e. the SDE (3.2.2) without the uncorrelated noise) solves a degenerate linear SIDE. Then we linearly transform the inverse flow of a jump SDE to obtain solutions of degenerate linear SIDEs with free and zero-order terms and an initial condition. In the last step of the proof of Theorem 3.2.2, we introduce an independent Wiener process and Poisson random measure as explained above, apply the results we know for fully degenerate equations, and then take the optional projection. In the last section, Subsection 3.3.4, we prove Theorem 3.2.5 using an interlacing procedure and removing the extra terms in bˆ, cˆ and fˆ . The uniqueness of the solution u of (3.1.1) follows directly from our construction. 3.3.1 Proof of uniqueness for Theorem 3.2.2 Proof of Uniqueness for Theorem 3.2.2. Fix a stopping time τ ≤ T and Fτ ⊗ B(Rd1)- measurable random field φ such that for some β′ ∈ (α, β¯ ∧ β˜) and θ′ ≥ 0, P-a.s. r−θ ′ 1 φ ∈ Cβ ′ (Rd1; Rd2). In this section, we will drop the dependence of processes on t, x, and z when we feel it will not obscure the argument. Let uˆ1(τ) and uˆ2(τ) be solutions of (3.2.1) in Cβ ′ . It follows that v := uˆ1(τ) − uˆ2(τ) solves dvlt = [(L 1;l t +L 2;l t )vt + bˆ i t∂iv l t + cˆ ll¯ t v l¯ t]dt +N 1;ϱ t v l tdw 1;ϱ t + ∫ Z1 I1;lt,z vt−[1D1(z)q 1(dt, dz) + 1E1(z)p 1(dt, dz)], τ < t ≤ T, vlt = 0, t ≤ τ, l ∈ {1, . . . , d2}, and P-a.s. 1[τn,τn+1)(·)r −λn 1 v ∈ D([0,T ];C β′(Rd1 ,Rd2)), where (τn)n≥0 is an increasing sequence of F-stopping times with τ0 = 0 and τn = T for sufficiently large n, and where for each n, λn is a positive Fτn-measurable random variable. Clearly, it suffices to take τ1 = τ and λ0 = 0. Thus, vt(x) = 0 for all (ω, t) ∈ [[τ0, τ1)). Assume that for some n, P-a.s. for all t and x, vt∧τn(x) = 0. We will show that P-a.s. for all 3.3. Proof of main theorems 49 t and x, v˜t(x) := v(τn∨t)∧τn+1(x) = 0. Applying Itô’s formula, for all x, P-a.s. for all t, we find d|v˜t|2 = ( 2v˜ltL 1;l t v˜t + |N 1 t v˜t| 2 + 2v˜ltb i t∂iv˜ l t + 2v˜ l tc ll¯ t v˜ l¯ t ) dt + ( 2v˜ltI 1;l t,z v˜t + ∫ D1∪E1 |I1;lt,z v˜t| 2π1(dz) ) dt + ( 2vltL 2;l t v˜t + 2v˜ l tI 2;l t,z v˜t ) dt + 2vltN 1;ϱ t v˜ l tdw 1;ϱ t + ∫ Z1 ( 2v˜lt−I 1;l t,z v˜t− + |I 1;l t,z v˜t−| 2 ) q1(dt, dz), τn < t ≤ τn+1, |v˜t|2 = 0, t ≤ τn, l ∈ {1, . . . , d2}, (3.3.1) where for ϕ ∈ C∞c (R d1; Rd2), k ∈ {1, 2}, and l ∈ {1, . . . , d2}, Lk;lϕ := 1 2 σk;iϱσk; jϱ∂i jϕ l + σk; jϱ∂ jσ k;iϱ∂iϕ l + σk;iϱυk;ll¯ϱ∂iϕ l¯ + σk; jϱ∂ ja k;ll¯ϱϕl¯ and Ik;lϕ : = ∫ Dk ( ρk;ll¯ϕl¯(H˜k) − ρk;ll¯(H˜k;−1)ϕl¯ ) πk(dz) + ∫ Dk ( ϕl(H˜k) − ϕl + 1(1,2](α)Fk;i∂iϕl ) πk(dz) + ∫ Ek ( (Ill¯d2 + ρ k;ll¯)ϕl¯(H˜k) − ϕl ) πk(dz). For each ω and t, let Qt = ∫ Rd1 |v˜t(x)|2r−λ1 (x)dx, where λ = λn + (d′ + 2)/2 and d′ > d1. Note that EQt ≤ ∫ Rd1 r−d ′ 1 (x)dxE|r −λn 1 v˜t|0 < ∞. It suffices to show that supt≤T EQt = 0. To this end, we will multiply the equation (3.3.1) by the weight r−2λ1 = r −2λn+1 1 r −d′ 1 , integrate in x, and change the order of the integrals in time and space. Thus, we must verify the assumptions of stochastic Fubini theorem hold (see Corollary 3.4.13 and Remark 3.4.14 as well) with the finite measure µ(dx) = r−d ′ 1 (x)dx on Rd1 . Since b and σk have linear growth an υk and c are bounded, owing to Lemma 3.4.6, we easily obtain that there is a constant N = N(d1, d2,N0, λn) such that P-a.s for all t, ∫ Rd1   2∑ k=1 2|r−λn1 v˜||r −λn−2 1 L kv˜| + |rλn−11 N 1v˜|2   r −d′ 1 dx ≤ N sup t≤T |r−λn1 v˜| 2 β′ , 50 Chapter 3. The method of stochastic characteristics for parabolic SIDEs ∫ Rd1 4|r−λn1 v˜| 2|r−λn−11 N 1v˜|2r−d ′ 1 dx ≤ N sup t≤T |r−λn1 v˜t| 4 β′ , and ∫ Rd1 ( 2|r−λn1 v˜|r −λn−1 1 b∂iv˜| + 2|r −λn 1 v˜||r −λn 1 cv˜| ) r−d ′ 1 dx ≤ N sup t≤T |r−λn1 v˜t| 2 β′ . For all ϕ ∈ Cαloc(R d1; Rd2) and all k, ω, t, x, p, and z, r−p1 (ϕ(H˜ k) − ϕ + 1(1,2](α)Fk;i∂iϕ) = ϕ¯(H˜k) − ϕ¯ − 1(1,2](α)Hk;i∂iϕ¯ + 1(1,2](α)(Hk;i + Fk;i)∂iϕ¯ +p1(1,2](α)(Hk;i + Fk;i)r−21 x iϕ¯ +   rp1 (H˜ k) rp1 − 1   (ϕ¯(H˜k) − 1(1,2](α)ϕ¯) +1(1,2](α)   rp1 (H˜ k) rp1 − 1 + pHk;ir−21 x i   ϕ¯, (3.3.2) where ϕ¯ := r−pϕ. By Taylor’s formula, for all ϕ ∈ Cα(Rd1; Rd2) and all k, ω, t, x, and z, we have |ϕ(H˜k) − ϕ − 1(1,2](α)Hk;i∂iϕ| ≤ rα1 |ϕ|α|r −1 1 H| α 0 . (3.3.3) Combining (3.3.2), (3.3.3), and the estimates given in Lemma 3.4.10 (1), for all k, ω, t, x and z, we obtain r−α1 |ρ k(H˜k;−1) − ρk| ≤ N |ρ|α∧1|r−11 H k|α∧10 and r−λn−α1 |v˜(H˜ k) − v˜ + 1(1,2](α)Fk;i∂iv˜| ≤ N |r−λn1 v˜|α ( |r−11 H k|α0 + |r −1 1 H|0[H k]1 + |r−11 H| [α]−+1 0 + [H] [α]−+1 1 ) , (3.3.4) for some constant N = N(d1, λn,N0, η1, η2). Therefore, P-a.s for all t, ∫ Rd1   2∑ k=1 2|r−λn1 v˜||r −λn−2 1 I kv˜| + ∫ D1∪E1 |r−λ−11 Izv˜| 2π1(dz)   r −d′ 1 dx ≤ N sup t≤T |r−λn1 v˜| 2 β′ , and ∫ Rd1 ( 2|r−λn1 v˜||r −λn−2 1 I k z v˜| + |r −λn−1 1 Izv˜| 2 )2 r−d ′ 1 dx ≤ N sup t≤T |r−λn1 v˜t| 4 β′ , for some constant N = N(d1, d2, λn,N0, η1, η2). Let L2(Rd1; Rd2) be the space of square-integrable functions f : Rd1 → Rd2 with norm ∥ · ∥0 and inner product (·, ·)0. Moreover, let L2(Rd1; ℓ2(Rd2)) be the space of square- integrable functions f : Rd1 → ℓ2(Rd2) with norm ∥ · ∥0. With the help of the above 3.3. Proof of main theorems 51 estimates and Corollary 3.4.13, denoting v¯ = r−λv˜, P-a.s. for all t, we have d∥v¯t∥20 = ( 2(v¯lt, L¯ 1 t v¯t)0 + ∥N¯ 1 t v¯t∥ 2 0 + 2(v¯t, I¯ 1 t,zv¯t)0 + ∫ D1∪E1 ∥I¯1t,zv¯t∥ 2 0π 1(dz) ) dt + ( 2(v˜t, b i t∂iv˜t + c¯ l¯ tv˜ l¯ t)0 + 2(v˜t, L¯ 2 t v˜t)0 + 2(v˜t, I¯ 2 t,zv˜t)0 ) dt + 2(vt, N¯ 1;ϱ t v˜t)0dw 1;ϱ t + ∫ Z1 ( 2(v˜t−, I¯1t,zv˜t−)0 + ∥I¯ 1 t,zv˜t−∥ 2 0 ) q1(dt, dz), τn < t ≤ τn+1, ∥v¯t∥20 = 0, t ≤ τn, l ∈ {1, . . . , d2}, (3.3.5) where all coefficients and operators are defined as in (3.2.1) with the following changes: (1) for each k ∈ {1, 2}, υk is replaced with υ¯k;ll¯ := υk;ll¯ + 1{2}(α)λσk;iϱr−21 x iδll¯; (2) for each k ∈ {1, 2}, ρk replaced with ρ¯k;ll¯ := ρk;ll¯ +   rλ1(H˜ k) rλ1 − 1   (Ill¯d2 + ρ k;ll¯); (3) c is replaced with c¯ll¯ = cll¯ + λbir−2xiδll¯ + 2∑ k=1 λ2σk;iϱσk; jϱr−41 x ix j + 2∑ k=1 ∫ Dk     rλ1 rλ1(H˜ k;−1) − 1   (Ill¯m + ρ k(H˜k;−1)) − 1(1,2](α)λr−21 xiH k;i(H˜k;−1)   π k(dz). Since for all k, ω and t, |r−11 σ k|0 + |r−11 ∇σ k|β¯−1 + |υk|β˜ ≤ N0, for β¯ > 1 ∨ α and β˜ > α, it is clear that |υ¯k|α ≤ N. Moreover, since for all k, ω and t, |r−11 H k|0 + |Hk|β¯ ≤ Kk and |ρ|β˜′ ≤ lk, applying the estimates in Lemma (3.4.10) (1), we get |ρ¯k|α ≤ lk + Kk(1 + lk) and |c|α ≤ N0. We will now estimate the drift terms of (3.3.5) in terms of ∥v¯t∥20. We write f ∼ g if∫ Rd1 | f (x)|dx = ∫ Rd1 |g(x)|dx and f ≪ g if ∫ Rd1 | f (x)|dx ≤ ∫ Rd1 |g(x)|dx. Using the divergence theorem, for any v : Rd1 → Rd2 , σ : Rd1 → Rd1 and υ : Rd1 → R2d2 and all x, we get σiσ jvlvli j ∼ 1 2 (σiσ j)i jv − σiσ jvliv l j = (σ i i jσ j + σijσ j i )|v| 2 − σiσ jvliv l j, 2σijσ jvlvli ∼ −(σ i jσ j)i|v|2 = (σii jσ j + σijσ j i )|v| 2, 52 Chapter 3. The method of stochastic characteristics for parabolic SIDEs and σivlυll¯vl¯i + σ ivl¯υll¯vli = σ ivlυll¯symv l¯ i ∼ −(σ iυll¯sym)i|v| 2 = −(σiiυ ll¯ sym + σ iυll¯sym)|v| 2, where υll¯sym = (υ ll¯ + υl¯l)/2. Consequently, for all ω, t, and x, we have 2v¯lL¯1;lv¯ + |N¯1v¯|2 ∼ 1 2 ( | divσ1|2 − ∂iσ1; jϱ∂ jσ1;iϱ ) |v¯|2 − υ¯1;ll¯ϱsym v¯ lv¯l¯ divσ1;ϱ + |υ¯1v¯|2 ≪ N |v¯|2 and 2v¯lL¯(2);lv¯ ≪ −(1 + ϵ)|σ2;i∂iv¯|2 + N|v¯|2, for any ϵ > 0, where in the last estimate we have also used Young’s inequality. By Lemma 3.4.10 (2) and basic properties of the determinant, there is a constant N = N(d,N0, η1, η2) such that for all k, ω, t, x, and z, det H˜k;−1 − 1 = det(Id1 + F k) − 1 ≤ |∇Fk| ≤ N|∇Hk| and det H˜k;−1 − 1 − div Fk ≤ |∇Fk|2 ≤ N |∇Hk|2. Thus, integrating by parts, for all ω, t, and x, we get 2v¯lI¯1;lv¯ + ∫ D1∪E1 |I¯1v¯|2π1(dz) ∼ 2 ∫ D1 ρ¯1;ll¯sym(H˜ 1;−1)(det∇H˜1;−1 − 1)π1(dz)v¯l¯v¯l + ∫ D1∪E1 ( det∇H˜1;−1 − 1 + 1(1,2](α)1D1 div F 1 ) π1(dz)|v¯|2 + ∫ D1∪E1 ( 1E12ρ¯ 1;ll¯ sym(H˜ 1;−1)v¯l¯v¯l + |ρ¯1(H˜1;−1)v¯|2 ) det∇H˜1;−1π1(dz) ≪ N (∫ D1 ( K1(z)2 + l1(z)K1(z) + l1(z)2 ) π1(dz) + ∫ E1 ( Kk(z) + lk(z) ) π1(dz) ) |v¯|2. Analogously, for all ω, t, and x, we attain 2v¯lI¯2;lv¯ ≤ −(1 + ϵ) ∫ D2∪E2 |v¯(H˜2) − v¯|2π2(dz) + N|v¯|2. Therefore, combining the above estimates, P-a.s. for all t, Qt ≤ N ∫ t 0 Qsds + Mt, (3.3.6) where (Mt)t≤T is a càdlàg square-integrable martingale. Taking the expectation of (3.3.6) 3.3. Proof of main theorems 53 and applying Gronwall’s lemma, we get supt≤T EQt = 0, which implies that P-a.s. for all t and x, v˜t(x) = 0. This completes the proof. □ 3.3.2 Small jump case Set (wϱ)ϱ∈N = (w1;ϱ)ϱ∈N, (Z,Z, π) = (Z1,Z1, π1), p(dt, dz) = p1(dt, dz), and q(dt, dz) = q1(dt, dz). Let σt(x) = (σ iϱ t (x))1≤i≤d1,ϱ≥1 be a ℓ2(R d1)-valued RT ⊗B(Rd1)-measurable func- tion defined on Ω × [0,T ] × Rd1 and Ht(x, z) = (Hit(x, z))1≤i≤d1 be a PT ⊗ B(R d1) ⊗ Z- measurable function defined on Ω × [0,T ] × Rd1 × Z. We introduce the following assumption for β > 1 ∨ α. Assumption 3.3.1 (β). (1) There is a constant N0 > 0 such that for all (ω, t) ∈ Ω× [0,T ], |r−11 bt|0 + |r −1 1 σt|0 + |∇bt|β−1 + |∇σt|β−1 ≤ N0. Moreover, for all (ω, t, z) ∈ Ω × [0,T ] × Z, |r−11 Ht(z)|0 ≤ Kt(z) and |∇Ht(z)|β−1 ≤ K¯t(z), where K : Ω × [0,T ] × Z → R+ is a PT ⊗Z-measurable function satisfying Kt(z) + K¯t(z) + ∫ Z ( Kt(z) α + K¯t(z) 2 ) π(dz) ≤ N0, for all (ω, t, z) ∈ Ω × [0,T ] × Z. (2) There is a constant η ∈ (0, 1) such that for all (ω, t, x, z) ∈ {(ω, t, x, z) ∈ Ω × [0,T ] × Rd1 × Z : |∇Ht(ω, x, z)| > η}, | ( Id1 + ∇Ht(x, z) )−1 | ≤ N0. Let Assumption 3.3.1(β) hold for some β > 1 ∨ α. Let τ ≤ T be a stopping time. Consider the system of SIDEs on [0,T ] × Rd1 given by dvt(x) = ( 1{2}(α) 1 2 σ iϱ t (x)σ jϱ t (x)∂i jvt(x) + b i t(x)∂ivt(x) ) dt + 1{2}(α)σ iϱ t (x)∂ivt(x)dw ϱ t + ∫ Z (vt−(x + Ht(x, z)) − vt−(x)) [1(1,2](α)q(dt, dz) + 1[0,1](α)p(dt, dz)] + 1(1,2](α) ∫ Z (vt(x + Ht(x, z)) − vt(x) + Ft(x, z)∂ivt(x)) π(dz)dt, τ < t ≤ T, vt(x) = x, t ≤ τ, (3.3.7) 54 Chapter 3. The method of stochastic characteristics for parabolic SIDEs where bit(x) := 1[1,2](α)b i t(x) + 1{2}(α)σ jϱ t (x)∂ jσ iϱ t (x) and Ft(x, z) := −Ht(H˜−1t (x, z), z). We associate with (3.3.7) the stochastic flow Yt = Yt(τ, x), (t, x) ∈ [0,T ] × Rd1 , generated by the SDE dYt = −1[1,2](α)bt(Yt)dt − 1{2}(α)σ ϱ t (Yt)dw ϱ t + ∫ Z Ft(Yt−, z)[1(1,2](z)q(dt, dz) + 1[0,1](z)p(dt, dz)], τ < t ≤ T, (3.3.8) Yt = x, t ≤ τ. Owing to parts (1) and (2) of Lemma 3.4.10, for all ω, t, and z, the inverse of the mapping F˜t(x, z) := x+Ft(x, z) = x−Ht(H˜−1t (x, z), z) is H˜t(x, z) := x+Ht(x, z) and there is a constant N = N(d1,N0, β, η) such that for all ω, t, x, y, and z, |r−11 Ft(z)|0 ≤ NKt(z), |∇Ft(z)|β−1 ≤ Kt(z), |(Id1 + ∇Ft(x, z)) −1| ≤ N. Thus, by Theorem 2.1 in [LM14c], there is a modification of the solution of (3.3.8), which we still denote by Yt = Yt(τ, x), that is a C β′ loc-diffeomorphism for any β ′ ∈ [1, β). Moreover, P-a.s. Y·(τ, ·),Y−1· (τ, ·) ∈ D([0,T ];C β′ loc(R d1; Rd1)), and Y−1t− (τ, ·) coincides with the inverse of Yt−(τ, ·) for all t. In the proof of the following proposition, we will show that the inverse flow Y−1t (τ) solves (3.3.7). Proposition 3.3.1. Let Assumption 3.3.1(β) hold for some β > 1 ∨ α. For any stopping time τ ≤ T and β′ ∈ [1 ∨ α, β), vt(x) = vt(τ, x) = Y−1t (τ, x) solves (3.3.7) and for all ϵ > 0 and p ≥ 2, there is a constant N = N(d1, p,N0,T, β′, η, ϵ) such that E [ sup t≤T |r−(1+ϵ)1 vt(τ)| p 0 ] + E [ sup t≤T |r−ϵ1 ∇vt(τ)| p β′−1 ] ≤ N. (3.3.9) Proof. The estimate (3.3.9) is given in Theorem 2.1 in [LM14c], so we only need to show that Y−1t (τ, x) solves (3.3.7). Let (δn)n≥1 be a sequence such that δn ∈ (0, η) for all n and δn → 0 as n→ ∞. It is clear that there is a constant N = N(N0) such that for all ω and t, π({z : Kt(z) > δn}) ≤ N δαn . (3.3.10) 3.3. Proof of main theorems 55 For each n, consider the system of SIDEs on [0,T ] × Rd1 given by dv(n)t (x) = ( 1{2}(α) 1 2 σ iϱ t (x)σ jϱ t (x)∂i jv (n) t (x) + b i t(x)∂iv (n) t (x) ) dt +1(1,2](α) ∫ Z 1{Kt>δn}(z) ( v(n)t (x + Ht(x, z)) − v (n) t (x) + F i t(x, z)∂iv (n) t (x) ) π(dz)dt + ∫ Z 1{Kt>δn}(z) ( v(n)t− (x + Ht(x, z)) − v (n) t− (x) ) [1(1,2](α)q(dt, dz) + 1[0,1](α)p(dt, dz)], +1{2}(α)σ iϱ t (x)∂iv (n) t (x)dw ϱ t , τ < t ≤ T, v (n) t (x) = x, t ≤ τ, (3.3.11) and the stochastic flow Y (n)t = Y (n) t (τ, x), (t, x) ∈ [0,T ] × R d1 , generated by the SDE dY (n)t = −1[1,2](α)bt(Y (n) t )dt − 1{2}(α)σ ϱ t (Y (n) t )dw ϱ t + ∫ Z 1{Kt>δn}(z)Ft(Y (n) t− , z)[1(1,2](α)q(dt, dz) + 1[0,1](α)p(dt, dz)], τ < t ≤ T, Y (n)t (x) = x, t ≤ τ. (3.3.12) Since (3.3.10) holds, we can rewrite equation (3.3.12) as dY (n)t = − ( 1[1,2](α)bt(Y (n) t ) + 1(1,2](α) ∫ Z 1{Kt>δn}(z)Ft(Y (n) t , z)π(dz) ) dt (3.3.13) − 1{2}(α)σ ϱ t (Yn(t))dw ϱ t + ∫ Z 1{Kt>δn}(z)Ft(Y (n) t− , z)p(dt, dz), τ < t ≤ T, and (3.3.11) as dv(n)t (x) = ( 1{2}(α) 1 2 σ iϱ t (x)σ jϱ t (x)∂i jv (n) t (x) + b i t(x)∂ jσ iϱ t (x) ) dt + 1{2}(α)σ iϱ t (x)∂iv (n) t (x)dw ϱ t + 1(1,2](α) ∫ Z 1{Kt>δn}(z)F i t(x, z)π(dz)∂iv (n) t (x)dt + ∫ Z 1{Kt>δn}(z) ( v(n)t− (x + Ht(x, z)) − v (n) t− (x) ) p(dt, dz), τ < t ≤ T. (3.3.14) It is well-known that the solution Y (n)t = Y (n) t (x) of (3.3.13) can be written as the solution of continuous SDEs with a finite number of jumps interlaced. Indeed, for each n and stopping time τ′ ≤ T , consider the stochastic flow Y˜ (n)t = Y˜ (n) t (τ ′, x), (t, x) ∈ [0,T ] × Rd1 , generated by the SDE dY˜ (n)t = − ( 1[1,2](α)bt(Y˜ (n) t ) + 1(1,2](α) ∫ Z 1{K>δn}(t, z)Ft(Y˜ (n) t , z)π(dz) ) dt − 1{2}(α)σ ϱ t (Y˜ (n) t )dw ϱ t , τ ′ < t ≤ T, Y˜ (n)t = x, t ≤ τ ′. 56 Chapter 3. The method of stochastic characteristics for parabolic SIDEs By Theorems 2.1 and 2.4 and Remark 2.5, there is a modification of Y˜ (n)t = Y˜ (n) t (τ ′, x), still denoted Y˜ (n)t (τ ′, x), that is a Cβ ′ loc-diffeomorphism. Furthermore, P-a.s. we have that Y˜ (n)· (τ ′, ·), Y˜ (n);−1· (τ ′, ·) ∈ C([0,T ];Cβ ′ loc) and v˜(n)t = v˜ (n) t (τ ′, x) = Y˜ (n);−1t (τ ′, x) solves the SPDE given by dv˜(n)t (x) = ( 1{2}(α) 1 2 σ iϱ t (x)σ jϱ t (x)∂i jv (n) t (x) + b i t(x)∂iv (n) t (x) ) dt + 1{2}(α)σ iϱ t (x)∂iv (n) t (x)dw ϱ t + 1(1,2](α) ∫ Z 1{K>δn}(t, z)F i(t, z)π(dz)∂iv (n) t (x)dt, τ ′ < t ≤ T, v˜(n)t (x) = x, t ≤ τ ′. For each n, let A(n)t = ∫ ]0,t] ∫ Z 1{Ks>δn}(z)p(ds, dz), t ≥ 0, and define the sequence of stopping times (τ(n)l ) ∞ l=1 recursively by τ (n) 0 = τ and τ (n) l+1 = inf { t > τ(n)l : ∆A (n) t , 0 } ∧ T. Fix some n ≥ 1. It is clear that P-a.s. for all x and t ∈ [0, τ(n)1 ), Y (n);−1t (τ, x) = Y˜ (n);−1 t (τ, x) = v˜ (n) t (τ, x) satisfies (3.3.14) up to, but not including time τ(n)1 . Moreover, P-a.s. for all x, Y (n) τ (n) 1 (τ, x) = Y˜ (n) τn1− (τ, x) + ∫ Z F τ (n) 1 (Y˜ (n) τ (n) 1 − (τ, x), z)p({τ(n)1 }, dz), and hence Y (n);−1 τ (n) 1 (τ, x) = ∫ Z v˜(n) τ (n) 1 − (τ, x + H τ (n) 1 (x, z))p({τ(n)1 }, dz). Consequently, v(n)t (τ, x) = Y (n);−1 t (τ, x) solves (3.3.14) up to and including time τ (n) 1 . Assume that for some l ≥ 1, v(n)t (τ, x) = Y (n);−1 t (τ, x) solves (3.3.14) up to and including time τ (n) l . Clearly, P-a.s. for all x and t ∈ [τ(n)l , τ (n) l+1), Y (n) t (x) = Y˜ (n) t (τ (n) l ,Y (n) τ (n) l − (x)), and thus P-a.s. for all x and t ∈ [τ(n)l , τ (n) l+1), Y (n);−1t (x) = Y˜ (n) t (τ (n) l ,Y (n) τ (n) l − (x)) = v˜(n)t (τ (n) l ,Y (n) τ (n) l − (x)). 3.3. Proof of main theorems 57 Moreover, P-a.s. for all x, Y−1n (τ n l+1, x) = ∫ U v˜n(τ n l , τ n l+1−, x + H(τ n l+1, x, z))p({τ n l+1}, dz), which implies that v(n)t (τ, x) = Y (n);−1 t (τ, x) solves (3.3.14) up to and including time τ n l+1. Therefore, by induction, for each n, v(n)t (τ, x) = Y (n);−1 t (τ, x) solves (3.3.14). It is easy to see that for all ω, t, and z, |r−11 1{Kt>δn}(z)Ft(z) − r −1 1 Ft(z)|0 + |1{Kt>δn}(z)∇Ft(z) − ∇Ft(z)|β−1 ≤ 1{Kt≤δn}(z)Kt(z) and thus dPdt − lim n→∞ ∫ D 1{K≤δn}(t, z)Kt(z) 2π(dz) + dPdt − lim n→∞ ∫ E 1{K≤δn}(t, z)Kt(z)π(dz) = 0. By virtue of Theorem 2.3 in [LM14c], for all ϵ > 0 and p ≥ 2 we have lim n→∞ ( E [ sup t≤T |r−(1+ϵ)1 (Y (n) t (τ) − r −(1+ϵ) 1 Yt(τ)| p 0 ] + E [ sup t≤T |r−ϵ1 ∇Y (n) t (τ) − r −ϵ 1 ∇Yt(τ)| p β′−1 ]) = 0, lim n→∞ E [ sup t≤T |r−(1+ϵ)1 Y (n);−1 t (τ) − r −(1+ϵ) 1 Y −1 t (τ)| p 0 ] = 0 and lim n→∞ E [ sup t≤T |r−ϵ1 ∇Y (n);−1 t (τ) − r −ϵ 1 ∇Y −1 t (τ)| p β′−1 ] = 0. Then passing to the limit in both sides of (3.3.11) and making use of Assumption 3.3.1(β), the estimate (3.3.4), and basic convergence properties of stochastic integrals, we discover that vt(τ, x) = X−1t (τ, x) solves (3.3.7) . □ 3.3.3 Adding free and zero-order terms Set (wϱ)ϱ∈N = (w1;ϱ)ϱ∈N, (Z,Z, π) = (Z1,Z1, π1), p(dt, dz) = p1(dt, dz), and q(dt, dz) = p1(dt, dz) − π1(dz)dt. Also, set D = D1, E = E1, and assume Z = D ∪ E. Let υt(x) = (υll¯ϱt (ω, x))1≤l,l¯≤d2, ϱ∈N be a ℓ2(R 2d2)-valued RT ⊗ B(Rd1)-measurable function defined on Ω×[0,T ]×Rd1 and ρt(x, z) = (ρll¯t (ω, x, z))1≤l,l¯≤d2 be a PT ⊗B(R d1)⊗Z-measurable function defined on Ω × [0,T ] × Rd1 × Z. We introduce the following assumptions for β > 1 ∨ α and β˜ > α. Assumption 3.3.2 (β). (1) There is a constant N0 > 0 such that for all (ω, t) ∈ Ω× [0,T ], |r−11 bt|0 + |r −1 1 σt|0 + |∇bt|β−1 + |∇σt|β−1 ≤ N0. 58 Chapter 3. The method of stochastic characteristics for parabolic SIDEs Moreover, for all (ω, t, z) ∈ Ω × [0,T ] × Z, |r−11 Ht(z)|0 ≤ Kt(z) and |∇Ht(z)|β−1 ≤ K¯t(z), where K : Ω × [0,T ] × Z → R+ is a PT ⊗Z-measurable function satisfying Kt(z) + K¯t(z) + ∫ D ( Kt(z) α + K¯t(z) 2 ) π(dz) + ∫ E ( Kt(z) α∧1 + K¯t(z) ) π(dz) ≤ N0, for all (ω, t, z) ∈ Ω × [0,T ] × Z. (2) There is a constant η ∈ (0, 1) such that for all (ω, t, x, z) ∈ {(ω, t, x, z) ∈ Ω × [0,T ] × Rd1 × Z : |∇Ht(ω, x, z)| > η}, | ( Id1 + ∇Ht(x, z) )−1 | ≤ N0. Assumption 3.3.3 (β˜). There is a constant N0 > 0 such that for all (ω, t) ∈ Ω × [0,T ], |ct|β˜ + |υt|β˜ + |r −θ 1 ft|β˜ + |r −θ 1 gt|β˜ ≤ N0. Moreover, for all (ω, t, z) ∈ Ω × [0,T ] × Z, |ρt(z)|β˜ + |r −θ 1 ht(z)|β˜ ≤ lt(z), where l : Ω × [0,T ] × Z → R+ is a PT ⊗Z-measurable function satisfying lt(z) + ∫ D lt(z) 2π(dz) + ∫ E lt(z)π(dz) ≤ N0. (ω, t, z) ∈ Ω × [0,T ] × Z. Let Assumptions 3.3.2(β¯) and 3.3.3(β˜) hold for some β¯ > 1 ∨ α and β˜ > α. Let τ ≤ T be a stopping time and φ : Ω × Rd1 → Rd2 be a Fτ ⊗ B(Rd1)-measurable random field. Consider the system of SIDEs on [0,T ] × Rd1 given by dvlt = ( Lltvt + bˆ i t∂iϕ l + cˆll¯t ϕ l¯ + fˆlt ) dt + ( N lϱt vt + g lϱ t ) dwϱt + ∫ Z ( Ilt,zvt− + h l t(z) ) [1D(z)q(dt, dz) + 1E(z)p(dt, dz)], τ < t ≤ T, vlt = φ l, t ≤ τ, l ∈ {1, . . . , d2}, (3.3.15) where for ϕ ∈ C∞c (R d1; Rd2) and l ∈ {1, . . . , d2}, Lltϕ(x) := 1{2}(α) 1 2 σ iϱ t (x)σ jϱ t (x)∂i jϕ l(x) + 1{2}(α)σ iϱ t (x)a ll¯ϱ t (x)∂iϕ l¯(x) 3.3. Proof of main theorems 59 + ∫ Dk ρll¯t (x, z) ( ϕl¯(x + Ht(x, z)) − ϕl¯(x) ) π(dz) + ∫ Dk ( ϕl(x + Ht(x, z)) − ϕl(x) − 1(1,2](α)∂iϕl(x)Hit(x, z) ) π(dz) N lϱt ϕ l(x) := 1{2}(α)σ iϱ t (x)∂iϕ l(x) + υll¯ϱt (x)ϕ l¯(x), Ilt,zϕ l(x) := (Id2 + ρ ll¯ t (x, z))ϕ l¯(x + Ht(x, z)) − ϕl(x), and where bˆit(x) : = 1[1,2](α)b i t(x) + 1{2}(α)σ jϱ t (x)∂ jσ iϱ t (x) + ∫ D ( 1(1,2](α)Hit(x, z) − H i t(H˜ −1 t (x, z), z) ) π(dz), cˆll¯t (x) : = c ll¯ t (x) + 1{2}(α)σ jϱ t (x)∂ jυ ll¯ϱ t (x) + ∫ D ( ρll¯t (x, z) − ρ ll¯ t (H˜ −1 t (x, z), z) ) π(dz), fˆlt(x) : = f l t (x) + 1{2}(α)σ jϱ t (x)∂ jg l t(x) + ∫ D ( hlt(x, z) − h l t(H˜ −1 t (x, z), z) ) π(dz). We associate with (3.3.15) the stochastic flow Xt = Xt(x) = Xt (τ, x) , (t, x) ∈ [0,T ] × Rd1 , given by (3.3.8). Let Γt(x) = Γt(τ, x), (t, x) ∈ [0,T ]×Rd1 , be the solution of the linear SDE given by dΓt(x) = (ct(Xt(x))Γt(x) + ft(Xt(x))) dt + ( υ ϱ t (Xt(x))Γt(x) + g ϱ t (Xt(x)) ) dwϱt + ∫ Z ρt(H˜ −1 t (Xt−(x), z), z)Γt−(x)[1D(z)q(dt, dz) + 1E(z)p(dt, dz)] + ∫ Z ht(H˜ −1 t (Xt−(x), z), z)[1D(z)q(dt, dz) + 1E(z)p(dt, dz)], τ < t ≤ T, Γt(x) = 0, t ≤ τ. (3.3.16) Let Ψt(x) = Ψt(τ, x), (t, x) ∈ [0,T ] × Rd1 , be the unique solution of the linear SDE given by dΨt(x) = ct(Xt(x))Ψt(x)dt + υ ϱ t (Xt(x))Ψt(x)dw ϱ t + ∫ Z ρt(H˜ −1 t (Xt−(x), z), z)Ψt−(x)[1D(z)q(dt, dz) + 1E(z)p(dt, dz)], τ < t ≤ T, Ψt(x) = Id2 , t ≤ τ. In the following lemma, we obtain p-th moment estimates of the weighted Hölder norms of Γ and Ψ. Lemma 3.3.2. Let Assumptions 3.3.2(β¯) and 3.3.3(β˜) hold for some β¯ > 1 ∨ α and β˜ > α. For any stopping time τ ≤ T and β′ ∈ [0, β¯ ∧ β˜), there exists a D([0,T ],Cβ ′ loc(R d1; Rd2))- 60 Chapter 3. The method of stochastic characteristics for parabolic SIDEs modification of Γ(τ, ) and Ψ(τ), also denoted by Γ¯(τ) and Ψ(τ), respectively. Moreover, for all ϵ > 0 and p ≥ 2, there is a constant N = N(d1, d2, p,N0,T, β′, η, ϵ, θ) such that E [ sup t≤T |r−(θ+ϵ)1 Γt(τ)| p β′ ] + E [ sup t≤T |r−ϵ1 Ψt(τ)| p β′ ] ≤ N. (3.3.17) Proof. Let τ ≤ T be a fixed stopping time and β := β¯∧ β˜. Estimating (3.3.16) directly and using the Burkholder-Davis-Gundy inequality and Lemma 3.4.1, we get E [ sup t≤T |Γt(x)|p ] ≤ NE ∫ ( |Γt(x)|p + r1(Xt(x))pθ|r−θ1 ft| p 0 ) dt + E (∫ r1(Xt(x)) 2θ|r−θ1 gt| 2 0dt )p/2 +E (∫ ∫ D ∫ r1(H˜ −1 t (Xt(x))) 2θ|r−θ1 ht| 2 ∞π(dz)dt )p/2 +E ∫ ∫ D∪E r1(H˜ −1 t (Xt(x))) pθ|r−θ1 ht| p ∞π(dz)dt +E (∫ ∫ E r1(H˜ −1 t (Xt(x))) θ|r−θ1 ht|∞π(dz)dt )p . Then using multiplicative decomposition ht(x, H˜ −1 t (Xt−(x), z), z) = r1(Xt−(x)) θ r1(H˜ −1 t (Xt−(x), z)) θ r1(Xt(x))θ ht(H˜−1t (Xt−(x), z), z) r1(H˜−1t (Xt−(x), z))θ , Hölder’s inequality, Lemma 3.4.10 (1), Lemma 3.2 in [LM14c], and Gronwall’s inequality, we get that for all x and y, E [ sup t≤T |Γt(x)|p ] ≤ Nr−θp1 (x), where N = N(d1, p,N0,T, η, θ) is a positive constant. In a similar way, grouping terms in the obvious manner and using Lemma 3.4.3 and Lemma 3.4.10 (3), we obtain E [ sup t≤T |Γt(x) − Γt(y)|p ] ≤ N(r−pθ1 (x) ∨ r −pθ 1 (y))|x − y| (β′∧1)p. Now, assume that [β]− ≥ 1. As in the proof of Theorem 3.4 in [Kun04], it follows that Ut = ∇Γt(τ, x) solves dUt = (ct(Xt)Ut + ∇ct(Xt)∇XtΓt + ∇ ft(Xt)∇Xt) dt + ∫ Z ρt(H˜ −1 t (Xt−, z), z)Ut−[1D(z)q(dt, dz) + 1E(z)p(dt, dz)] + ∫ Z ∇ρt(H˜−1t (Xt−, z), z)∇[H˜ −1 t (Xt−)]Γt−[1D(z)q(dt, dz) + 1E(z)p(dt, dz)] + ∫ Z ∇ht(x, H˜−1t (Xt, z), z)∇[H˜ −1 t (Xt−)]][1D(z)q(dt, dz) + 1E(z)p(dt, dz)] 3.3. Proof of main theorems 61 + ( υ ϱ t (Xt)Ut + ∇υ ϱ t (Xt)∇XtΓt + ∇g ϱ t (Xt)∇Xt ) dwϱt τ < t ≤ T, Ut = 0, t ≤ τ. Recall that by Lemma 3.4.6, a function ϕ : Rd1 → Rn, n ≥ 1 satisfies |r−θϕ|β < ∞ if and only if |r−θϕ|0, . . . , |r−θ∂γϕ|0, |γ| ≤ [β]−, and [r−θ∂γϕ]|{β}+ are finite. Estimating as above and using Proposition 3.4 in [LM14c] and Lemma 3.4.10 we obtain that for all p ≥ 2, there is a constant N = N(d1, d2, p,N0,T, θ) such that for all x and y, E [ sup t≤T |∇Γt(x)|p ] ≤ r−pθ1 (x)N and E [ sup t≤T |∇Γt(x) − ∇Γt(y)|p ] ≤ N(r−pθ1 (x) ∨ r −pθ 1 (y))|x − y| ((β−1)∧1)p. Using induction we get that for all p ≥ 2 and all multi-indices γ with 0 ≤ |γ| ≤ [β]− and all x, E sup t≤T [|∂γΓt(x)|p] ≤ r −pθ 1 (x)N, and for all multi-indices γ with |γ| = [β]− and all x, y, E [ sup t≤T |∂γΓt(x) − ∂γΓt(y)|p ] ≤ N(r−pθ1 (x) ∨ r −pθ 1 (y))|x − y| (β−[β]−)p, for a constant N = N(d1, d2, p,N0,T, β, η, θ). It is also clear that for all p ≥ 2 and all multi-indices γ with 0 ≤ |γ| ≤ [β]− and all x, E [ sup t≤T |∂γΨt(x)|p ] ≤ N, and for all multi-indices γ with |γ| = [β]− and all x, y, E [ sup t≤T |∂γΨt(x) − ∂γΨt(y)|p ] ≤ N|x − y|(β−[β] −)p. We obtain the existence of a D([0,T ],Cβ ′ loc(R d1; Rd2))-modification of Γ(τ) and Ψ(τ) using estimate (3.3.17) and Corollary 5.3 in [LM14c]. This completes the proof. □ Let Φ˜t(x) = Φ˜t(τ, x), (t, x) ∈ [0,T ] × Rd1 , be the solution of the linear SDE given by dΦ˜t(x) = ( ct(Xt(x))Φ˜t(x) + ft(Xt(x)) ) dt + ( υ ϱ t (Xt(x))Φ˜t(x) + g ϱ t (Xt(x)) ) dwϱt + ∫ Z ρt(H˜ −1 t (Xt−(x), z), z)Φ˜t−(x, y)[1D(z)q(dt, dz) + 1E(z)p(dt, dz)] 62 Chapter 3. The method of stochastic characteristics for parabolic SIDEs + ∫ Z ht(H˜ −1 t (Xt−(x), z), z)[1D(z)q(dt, dz) + 1E(z)p(dt, dz)], τ < t ≤ T, Φ˜t(x) = φ(x), t ≤ τ. The following is a simple corollary of Lemma 3.3.2. Corollary 3.3.3. Let Assumptions 3.3.2(β¯) and 3.3.3(β˜) hold for some β¯ > 1∨α and β˜ > α. For any stopping time τ ≤ T andFτ⊗B(Rd1)-measurable random field φ such that for some β′ ∈ [0, β¯∧ β˜), P-a.s. φ ∈ Cβ ′ loc(R d1; Rd2), there is a D([0,T ];Cβ ′ loc(R d1 ,Rd2))-modification of Φ˜(τ), also denoted by Φ˜(τ), and P-a.s. for all (t, x) ∈ [0,T ] × Rd1 , Φ˜t(τ, x) = Ψt(x)φ(x) + Γt(x). Moreover, if for some θ′ ≥ 0 and β′ ∈ [0, β¯ ∧ β˜), P-a.s. r−θ ′ 1 φ ∈ C β′(Rd1; Rd2), then for all ϵ > 0 and p ≥ 2, there is a constant N = N(d1, d2, p,N0,T, θ, θ′, β′, ϵ) such that E [ sup t≤T |r−(θ∨θ ′)−ϵ 1 Φ˜t(τ)| p β′ ∣∣∣Fτ ] ≤ N(|r−θ ′ 1 φ| p β′ + 1). (3.3.18) Let us now state our main result concerning degenerate SIDEs and their connection with linear transformations of inverse flows of jump SDEs. Proposition 3.3.4. Let Assumptions 3.3.2(β¯) and 3.3.3(β˜) hold for some β¯ > 1 ∨ α and β˜ > α. For any stopping time τ ≤ T and Fτ ⊗ B(Rd1)-measurable random field φ such that for some β′ ∈ (α, β¯ ∧ β˜) and θ′ ≥ 0, P-a.s. r−θ ′ 1 φ ∈ C β′(Rd1; Rd2), we have that P- a.s. Φ˜(τ, X−1(τ)) ∈ D([0,T ];Cβ ′ loc(R d1; Rd2)) and vt(x) = vt(τ, x) = Φ˜t(τ, X−1t (τ, x)) solves (3.3.15). Moreover, for all ϵ > 0 and p ≥ 2, E [ sup t≤T |r−(θ∨θ ′)−ϵ 1 vt(τ)| p β′ ∣∣∣Fτ ] ≤ N(|r−θ ′ 1 φ| p β′ + 1), (3.3.19) for a constant N = N(d1, d2, p,N0,T, β′, η, ϵ, θ, θ′). Proof. Fix a stopping time τ ≤ T and random field φ such that for some β′ ∈ (α, β¯ ∧ β˜) and θ′ ≥ 0, P-a.s. r−θ ′ 1 φ ∈ C β′(Rd1 ; Rd2). By virtue of Corollary 3.3.3 and Theorem 2.1 in [LM14c], P-a.s. Φ˜(τ, X−1(τ)) ∈ D([0,T ];Cβ ′ loc(R d1 ,Rd2)). Then using the Ito-Wenzell formula (Proposition 3.4.16) and following a simple calcula- tion, we obtain that vt(τ, x) := Φ˜t(τ, X−1t (τ, x)) solves (3.3.15). By Theorem 2.1 in [LM14c] and Corollary 3.3.3, for all ϵ > 0 and p ≥ 2, there exists a constant N = N(d1, p,N0,T, 3.3. Proof of main theorems 63 β′, η, ϵ) such that E [ sup t≤T |r−(1+ϵ)1 X −1 t (τ)| p β′ ] + E [ sup t≤T |r−ϵ1 ∇X −1 t (τ)| p β′−1 ] ≤ N. (3.3.20) Therefore applying Lemma 3.4.9 and Hölder’s inequalty and using the estimates (3.3.20) and (3.3.18), we procure (3.3.19), which completes the proof. □ 3.3.4 Adding uncorrelated part (Proof of Theorem 3.2.2) Proof of Theorem 3.2.2 . Fix a stopping time τ ≤ T and random field φ such that for some β′ ∈ (α, β¯∧ β˜) and θ′ ≥ 0, P-a.s. r−θ ′ 1 φ ∈ C β′(Rd1; Rd2). Consider the system of SIDEs given by dv˜lt = ( (L1;lt +L 2;l t )v˜t + 1[1,2](α)bˆ i t∂iu l t + cˆ ll¯ t u l¯ t(x) + fˆ l t ) dt + ( N1;lϱt v˜t + g lϱ t ) dw1;ϱt +N2;lϱt v˜tdw 2;ϱ t + ∫ Z1 ( I1;lt,z v˜t− + h l t(z) ) [1D1(z)q 1(dt, dz) + 1E1 p 1(dt, dz)] + ∫ Z2 I2;lt,z v˜t−[1D2(z)q 2(dt, dz) + 1E2(z)p 2(dt, dz)] τ < t ≤ T, v˜lt = φ l, t ≤ τ, l ∈ {1, . . . , d2}, where for ϕ ∈ C∞c (R d1; Rd2) and l ∈ {1, . . . , d2}, N2;lϱt ϕ(x) := 1{2}(α)σ 2;iϱ t (x)∂iϕ l(x) + υ2;ll¯ϱt (x)ϕ l¯(x), ϱ ∈ N, I2;lt,zϕ(x) := (I ll¯ d2 + ρ 2;ll¯ t (x, z))ϕ l¯(x + H2t (x, z)) − ϕ l(x). By Proposition 3.3.4, P-a.s. Φ(τ, X−1(τ)) ∈ D([0,T ];Cβ ′ loc(R d1; Rd2)) and v˜t(τ, x) = Φt(τ, X−1t (τ, x)) solves (3.3.15). We write vt(x) = vt(τ, x). Moreover, for all ϵ > 0 and p ≥ 2, E [ sup t≤T |r−(θ∨θ ′)−ϵ 1 v˜t(τ)| p β′ ∣∣∣Fτ ] ≤ N(|r−θ ′ 1 φ| p β′ + 1), (3.3.21) where N = N(d1, d2, p,N0,T, β′, η1, η2, ϵ, θ, θ′) is a positive constant. Without loss of gen- erality we will assume that for all ω and t, |r−θ ′ 1 φ|β′ ≤ N, since we can always multiply the equation by indicator function. For each n ∈ N0, let Cnloc(R d1; Rd2) be the separable Fréchet space of n-times continuously differentiable functions f : Rd1 → Rd2 endowed with the countable set of semi-norms given by | f |n,k = ∑ 0≤|γ|≤n sup |x|≤k |∂γ f (x)|, k ∈ N. 64 Chapter 3. The method of stochastic characteristics for parabolic SIDEs Owing to Lemma 3.4.2, there is a the family of measures Etω(dU), (ω, t) ∈ Ω × [0,T ] on D([0,T ]; C[β] − loc (R d1; Rd2)), corresponding to A = v˜ such that for all bounded G : Ω × [0,T ]×[0,T ]×D([0,T ]; C[β] − loc (R d1; Rd2))→ Rd2 that are OT ×B ([0,T ])×B(D([0,T ]; C [β]− loc (Rd1; Rd2))) measurable, P-a.s. for all t, we have Et[Gt(t, v˜)] = ∫ D([0,T ];C[β ′]− loc (R d1 ;Rd2 )) Gt(t,U)E t(dU) = E [Gt(t, v˜)|Ft] , where the right-hand-side is the càdlàg modification of the conditional expectation. Set uˆt(x) = uˆt(τ, x) = E t[v˜t(τ, x)] = ∫ D([0,T ];C[β ′]− loc (R d1 ;Rd2 )) Ut(x)E t(dU). Let λ = (θ ∨ θ′) + ϵ. We claim that for all multi-indices γ with |γ| ≤ [β]−, P-a.s. for all t and x, ∂γ[r−λ1 (x)uˆt(x)] = ∫ D([0,T ];C[β ′]− loc (R d1 ;Rd2 )) ∂γ[r−λ1 (x)Ut(x)]E t(dU) = Et[∂γ[r−λ1 (x)v˜t(x)]]. Indeed, since Mt = E t [ sup s≤T |∂γ[r−λ1 v˜s]|0 ] , t ∈ [0,T ], is a (F,P) martingale, we have E [ sup t≤T |Mt|2 ] ≤ 4E [ |MT |2 ] ≤ 4E [ sup t≤T |∂γ[r−λ1 v˜t]| 2 0| ] < ∞, (3.3.22) which implies that P-a.s. for all t, ∫ D([0,T ];C[β ′]− loc (R d1 ;Rd2 )) sup s≤T,x∈Rd1 |∂γ[r−λ1 (x)Us(x)]|E t(dU) = Et [ sup t≤T |∂γ[r−λ1 v˜t]|0 ] < ∞. Similarly, since E [ supt≤T |r −λ 1 v˜t| 2 β′ ] < ∞, P-a.s. for all x and y, |∂γ[r−λ1 (x)uˆt(x)] − ∂ γ[r−λ1 (y)uˆt(y)]| |x − y|{β′}+ ≤ Et [ |∂γ[r−λ1 (x)v˜t(x)] − ∂ γ[r−λ1 (y)v˜t(y)]| |x − y|{β′}+ ] ≤ Et[|r−λ1 v˜t|β′], and thus, P-a.s. sup t≤T |r−λ1 uˆt|β′ ≤ sup t≤T Et [ sup t≤T |r−λ1 v˜t|β′ ] < ∞. 3.3. Proof of main theorems 65 Thus, P-a.s. r−λ1 (·)uˆ(τ) ∈ D([0,T ];C β′(Rd1; Rd2)) and (3.2.4) follows from (3.3.21) (see the argument (3.3.22)). For each l ∈ {1, . . . , d2}, letAlt(x) = A l t(z) be defined by Alt = φ l + ∫ ]τ,τ∨t] ( (L1;ls +L 2;l s )uˆs + 1[1,2](α)bˆ i s∂iuˆ l s + cˆ ll¯ s uˆ l¯ s + fˆ l s ) ds + ∫ ]τ,τ∨t] ( N1;lϱs uˆs + g lϱ s ) dw1;ϱs + ∫ ]τ,τ∨t] ∫ Z1 ( I1;ls,zuˆs− + h l s(z) ) [1D1(z)q 1(ds, dz) + 1E1(z)p 1(ds, dz)]. By Theorem 12.21 in [Jac79], the representation property holds for (F,P), and hence every bounded (F,P)- martingale issuing from zero can be represented as Mt = ∫ ]0,t] oϱsdw 1;ϱ s + ∫ ]0,t] ∫ Z1 es(z)q 1(ds, dz), t ∈ [0,T ], where E ∫ ]0,T ] |os|2ds + E ∫ ]0,T ] ∫ Z1 |es(z)|2π1(dz)ds < ∞. Then for an arbitrary F-stopping time τ¯ ≤ T and bounded (F,P)- martingale, applying Itô’s product rule and taking the expectation, we obtain Ev˜τ¯(τ, x)M¯τ¯ = EAτ¯(x)M¯τ¯. Since the optional projection is unique (see Theorem 13 in Chapter 1, Section 8 in [LS89]), P-a.s. for all t and x, uˆt(x) = At(x). This completes the proof. □ 3.3.5 Proof of Theorem 3.2.5 Proof of Theorem 3.2.5. Fix a stopping time τ ≤ T and random field φ such that for some β′ ∈ (α, β¯ ∧ β˜) and θ′ ≥ 0, P-a.s. r−θ ′ 1 φ ∈ C β′(Rd1; Rd2). For any δ > 0, we can rewrite (3.1.1) as dult = ( (L¯1;lt +L 2;l t )ut + 1[1,2](α)b¯ i t∂iu l t + c¯ ll¯ t u l¯ t + f l t ) dt + ( N1;lϱt ut + g lϱ t ) dw1;ϱt + ∫ Z1 ( I¯1;lt,z ut− + h¯ l t(z) ) [1D1(z)q 1(dt, dz) + 1E1(z)p 1(dt, dz)] + ∫ Z1 ( 1(D1∪E1)∩{K1t >δ}(z) + 1V1(z) ) ( I1;lt,z ut− + h l t(z) ) p1(dt, dz), τ < t ≤ T, ult = φ l, t ≤ τ, l ∈ {1, . . . , d2}, (3.3.23) 66 Chapter 3. The method of stochastic characteristics for parabolic SIDEs where for ϕ ∈ C∞c (R d1; Rd2) and l ∈ {1, . . . , d2}, L¯1;lt ϕ(x) := 1{2}(α) 1 2 σ 1;iϱ t (x)σ 1; jϱ t (x)∂i jϕ l(x) + 1{2}(α)σ k;iϱ t (x)υ 1;ll¯ϱ t (x)∂iϕ l¯(x) + ∫ D1 ρ¯ 1;ll¯ t (x, z) ( ϕl¯(x + H¯1t (x, z)) − ϕ l¯(x) ) π1(dz) + ∫ D1 ( ϕl(x + H¯1t (x, z)) − ϕ l(x) − 1(1,2](α)H¯ 1;i t (x, z)∂iϕ l(x) ) π1(dz), I¯1t,zϕ l(x) = (Ill¯d2 + 1{K1t ≤δ}(z)ρ 1;ll¯ t (x, z))ϕ l¯(x + 1{K1t ≤δ}(z)H 1 t (x, z)) − ϕ l(x), H¯1 := 1{K1t ≤δ}H 1, ρ¯1 := 1{K1t ≤δ}ρ 1, h¯ := 1{K1t ≤δ}h, b¯it(x) := b i t(x) − ∫ D1∩{K1t >δ} 1(1,2](α)H 1;i t (x, z)π 1(dz), c¯ll¯t (x) := c ll¯ t (x) − ∫ D1∩{K1t >δ} ρ 1;ll¯ t (x, z)π 1(dz). For an arbitrary stopping time τ′ ≤ T and Fτ′ ⊗ B(Rd1)-measurable random field φτ ′ : Ω × Rd1 → Rd2 satisfying for some θ(τ′) > 0, P-a.s. r−θ(τ ′) 1 φ τ′ ∈ Cβ ′ (Rd1; Rd2), consider the system of SIDEs on [0,T ] × Rd1 given by dvlt = ( (L¯t 1;l +L2;lt )vt + 1[1,2](α)b¯ i t∂iv l t + c¯ ll¯ t v l¯ t + f l t ) dt + ( N1;lϱt vt + g lϱ t ) dw1;ϱt + ∫ Z1 ( I¯1;lt,z ut− + h¯ l t(z) ) [1D1(z)q 1(dt, dz) + 1E1(z)p 1(dt, dz)], τ′ < t ≤ T, vlt = φ τ′;l, t ≤ τ′, l ∈ {1, . . . , d2}. (3.3.24) Set H¯2 = H2 and ρ¯2 = ρ2. In order to invoke Theorem 3.2.2 and obtain a unique solution vt = vt(τ′, x) = vt(τ′, φτ ′ , x) of (3.3.24), we will show that for all ω and t, |r−11 b˜t|0 + |∇b˜t|β¯−1 + |c˜t|β˜ + |r −θ f˜ |β˜ ≤ N0, (3.3.25) where b˜it(x) : = 1[1,2](α)b¯ i t(x) − 2∑ k=1 1{2}(α)σ k; jϱ t (x)∂ jσ k;iϱ t (x) − 2∑ k=1 ∫ Dk ( 1(1,2](α)H¯ k;i t (x, z) − H¯ k;i t ( ˜¯Hk;−1t (x, z), z) ) πk(dz), c˜ll¯t (x) : = c¯ ll¯ t (x) − 2∑ k=1 1{2}(α)σ k;iϱ t (x)∂iυ k;ll¯ϱ t (x) 3.3. Proof of main theorems 67 − 2∑ k=1 ∫ Dk ( ρ¯ k;ll¯ t (x, z) − ρ¯ k;ll¯ t ( ˜¯Hk;−1t (x, z), z) ) πk(dz), f˜ lt (x) : = f l t (x) − σ 1; jϱ t (x)∂ jg l t(x) − ∫ D1 ( h¯lt(x, z) − h¯ l t( ˜¯H1;−1t (x, z), z) ) π1(dz). Owing to Assumption 3.2.3(β¯, δ1), we easily deduce that there is a constant N = N(d1, N0, β¯) such that for each k ∈ {1, 2} and all ω and t, |σk; jϱt ∂ jσ k;ϱ t |β¯ + |σ k; jϱ t ∂ ja k;ϱ t (x)|β¯ + |σ 1; jϱ t ∂ jg ϱ t |β¯ ≤ N, if α = 2. Since |∇H¯1t |0 ≤ δ, for any fixed η 1 < 1, for all (ω, t, x, z) ∈ {(ω, t, x, z) ∈ Ω × [0,T ] × Rd1 × (D1 ∪ E1) : |∇H¯1t (ω, x, z)| > η 1}, ∣∣∣∣ ( Id1 + ∇H 1 t (ω, x, z) )−1∣∣∣∣ ≤ 1 1 − δ . Appealing to Assumption 3.2.3(β¯, δ1) and applying Lemma 3.4.10, we obtain that there is a constant N = N(d1, d2,N0) such that for each k ∈ {1, 2} and all ω, t, and z, |H¯k;it (z) − H¯ k;i t ( ˜¯Hk;−1t (z), z)|β¯ ≤ N(K k t (z) + K¯ k t (z)) 2 + N1(0,1]({β¯}+ + δk)K˜kt (z)K k t (z) δk + N1(1,2]({β¯}+ + δk) ( K˜kt (z)K k t (z) δk + K¯kt (z) 2 ) , |ρ¯kt (z) − ρ¯ k t ( ˜¯Hk;−1t (z), z)|β¯ ≤ Nl k t (z)(K k t (z) + K¯ k t (z)) + N1(0,1]({β¯} + + µk)l˜kt (z)K k t (z) µk + N1(1,2]({β¯}+ + µk) ( l˜kt (z)K k t (z) µk + lkt (z)K¯ k t (z) ) , and |r−θ1 h¯t(z) − r −θ 1 h¯t( ˜¯H1;−1t (z), z)|β¯ ≤ Nl 1 t (z)(K 1 t (z) + K¯ k t (z)) + N1(0,1]({β¯} + + µ1)l˜kt (z)K 1 t (z) µ1 + N1(1,2]({β¯}+ + µ1) ( l˜kt (z)K 1 t (z) µ1 + lkt (z)K¯ 1 t (z) ) . Moreover, using Lemma 3.4.10, we find that there is a constant N = N(d1, d2,N0) such that for each k ∈ {1, 2} and all ω, t, and z, |r−11 H¯ k t ( ˜¯Hk;−1t (z), z)|0 ≤ |r −1 1 H k|0, |∇[H¯ k;i t ( ˜¯Hk;−1t (z), z)]|β¯ ≤ |∇H k|β¯−1. Combining the above estimates and using Hölder’s inequality and the integrability prop- erties of lkt (z) and K k t (z), we obtain (3.3.25). Therefore, by Theorem 3.2.2, for each stop- ping time τ′ ≤ T and and Fτ′ ⊗ B(Rd1)-measurable random field φτ ′ satisfying for some θ(τ′) > 0, P-a.s. r−θ(τ ′) 1 φ τ′ ∈ Cβ ′ (Rd1; Rd2), there exists a unique solution vt(x) = vt(τ′, φτ ′ , x) 68 Chapter 3. The method of stochastic characteristics for parabolic SIDEs of (3.3.24) such that E [ sup t≤T |r−θ(τ ′)∨θ−ϵ 1 vt(τ ′)|p β′ ∣∣∣Fτ′ ] ≤ N(|r−θ(τ ′) 1 φ τ′ |p β′ + 1), (3.3.26) where N = N(d1, d2, p,N0,T, β′, η1, η2, ϵ, θ, θ(τ′)) is a positive constant. Let At = ∫ ]0,t] ∫ Z1 ( 1(D1∪E1)∩{K1s>η1}(z) + 1V1(z) ) p1(ds, dz), t ≤ T. Define a sequence of stopping times (τn)n≥0 recursively by τ1 = τ and τn+1 = inf(t > τn : ∆At , 0) ∧ T. We construct the unique solution u = u(τ) of (3.3.23) in Cβ ′ (Rd1; Rd2) by interlacing solu- tions of (3.3.24) along the sequence of stopping times (τn). For (ω, t) ∈ [[0, τ1)), we set ut(τ, x) = vt(τ, φ, x) and note that E [ sup t≤τ1 |r−θ ′∨θ−ϵ 1 ut(τ)| p β′ ∣∣∣Fτ ] ≤ N(|r−θ ′ 1 φ| p β′ + 1). For each ω and x, we set uτ1(x) = uτ1−(x) + ∫ Z1 ( 1(D1∪E1)∩{K1>η1}(τ1, z) + 1V1(z) ) ( I1τ1,zuτ1−(x) + h l τ1 (x, z) ) p1({τ1}, dz). By virtue of Lemma 3.4.9, there is a constant N = N(d1, d2, θ, θ′, ζτ1(z), β ′) |uτ1− ◦ H˜ 1 τ1 (z) · r −ξτ1 (z)(θ∨θ ′+ϵ+β′) 1 |β′ ≤ N|r −θ∨θ′−ϵ 1 u l τ1−|β′ , and hence |r−λ11 uτ1(x)|β′ ≤ N |r −θ∨θ′−ϵ 1 u l τ1−|β′ + ζτ1(z), where λ1 = (ξτ1(z)(θ ∨ θ ′ + 1 + ϵ + β′)) ∨ θ ∨ (θ ∨ θ′ + ϵ). We then proceed inductively, each time making use of the estimate (3.3.26), to obtain a unique solution u = u(τ) of (3.3.23), and hence (3.1.1), in Cβ ′ (Rd1; Rd2). This completes the proof of Theorem 3.2.5. □ 3.4 Appendix 3.4. Appendix 69 3.4.1 Martingale and point measure moment estimates Set (Z,Z, π) = (Z1,Z1, π1), p(dt, dz) = p1(dt, dz), and q(dt, dz) = q1(dt, dz). The follow- ing moment estimates are used to derive the estimates of Γt and Ψt in Lemma 3.3.2. The notation a ∼ p,T b is used to indicate that the quantity a is bounded above and below by a constant depending only on p and T times b. Lemma 3.4.1. Let h : Ω × [0,T ] × Z → Rd1 be PT ⊗Z-measurable (1) For any stopping time τ ≤ T and p ≥ 2, E [ sup t≤τ ∣∣∣∣∣∣ ∫ ]0,τ] ∫ Z hs(z)q(ds, dz) ∣∣∣∣∣∣ p] ∼ p,T E [∫ ]0,τ] ∫ Z |hs(z)|p π(dz)ds ] + E   (∫ ]0,τ] ∫ Z |hs(z)|2 π(dz)ds )p/2 .x (2) For any stopping time τ ≤ T and p¯ ≥ 1, E  sup t≤τ (∫ ]0,τ] ∫ Z |hs(z)|p(ds, dz) )p¯ ∼ p,T E [∫ ]0,τ] ∫ Z |hs(z)| p¯ π(dz)ds ] + E   (∫ ]0,τ] ∫ Z |hs(z)|π(dz)ds ) p¯ , Proof. We will only prove part (2), since part (1) is well-known (see, e.g., [Nov75] or [Kun04]). Assume that ht(ω, z) > 0 for all ω, t and z. Let At = ∫ ]0,t] ∫ Z hs(z)p(ds, dz) and Lt = ∫ ]0,t] ∫ Z hs(z)π(dz)ds, t ≤ T. It suffices to prove (2) for p > 1, since the case p = 1 is obvious. Fix an arbitrary stopping time τ ≤ T and p > 1. For all t, we have Apt = ∑ s≤t [ (As− + ∆As) p − Aps− ] . Thus, by the inequality bp ≤ (a + b)p − ap ≤ p(a + b)p−1b ≤ p2p−1[ap−1b + bp], a, b ≥ 0, we get Apt ≥ ∫ ]0,t] ∫ Z hs(z) p p(ds, dz) and Apt ≤ p2 p−2 [∫ ]0,t] ∫ h hs(z) p p(ds, dz) + ∫ t 0 ∫ Z Ap−1s− hs(z)p(ds, dz) ] . 70 Chapter 3. The method of stochastic characteristics for parabolic SIDEs Since At is increasing, we obtain E ∫ ]0,τ] ∫ Z hs(z) p p(ds, dz) ≤ EApτ ≤ p2 p−2E [∫ ]0,τ] ∫ Z hs(z) p p(ds, dz) + Ap−1τ Lτ ] . It is easy to see that E[Lpτ ] = pE ∫ ]0,τ] Lp−1s dLs = pE ∫ ]0,τ] Lp−1s dAs ≤ pE[L p−1 τ Aτ]. Applying Young’s inequality, for any ε > 0, we get Ap−1τ Lτ ≤ εA p τ + (p − 1)p−1 εp−1 pp Lpτ and L p−1 τ Aτ ≤ εL p τ + (p − 1)p−1 εp−1 pp Apτ . Combining the estimates for any ε1 ∈ (0, 1p ), we have   ε p−1 1 p p(1 − pε1) p(p − 1)p−1 ELpτ   ∨ E ∫ ]0,τ] ∫ Z hs(z) p p(ds, dz) ≤ E[Apτ ]. and for any ε2 ∈ (0, 1p2p−2 ) E[Apτ ] ≤ p2p−2 (1 − p2p−2ε2) E   ∫ ]0,τ] ∫ Z hs(z) p p(ds, dz) + (p − 1)p−1 ε p−1 2 p p Lpτ   , which completes the proof. □ 3.4.2 Optional projection The following lemma concerning the optional projection plays an integral role in Sec- tion 3.3.4 and the proof of Theorem 3.2.2. For more information on the Skorokhod J1- topology, we refer the reader to Chapter 6, Section 1 of [LS89]. Also, we refer the reader to Theorem 5.3 [Kal97] for the construction of regular conditional probability measures on Borel spaces. Lemma 3.4.2. (cf. Theorem 1 in [Mey76]) Let X be a Polish space and D ([0,T ];X) be the space of X-valued càdlàg trajectories with the Skorokhod J1-topology. If A is a random variable taking values in D ([0,T ];X), then there exists a family of B([0,T ])×F - measurable non-negative measures Et(dU), (ω, t) ∈ Ω × [0,T ], on D ([0,T ];X) and a random-variable ζ satisfying P (ζ < T ) = 0 such that Et(D ([0,T ];X)) = 1 for t < ζ and Et(D([0,T ];X)) = 0 for t ≥ ζ. In addition, Et is càdlàg in the topology of weak convergence, Et = Et+ for all t ∈ [0,T ], and for each continuous and bounded functional F on D ([0,T ];X) , the process Et (F) is the càdlàg version of E[F (A) |Ft]. If G : Ω × 3.4. Appendix 71 [0,T ] × [0,T ] × D ([0,T ];X) → Rd2 is bounded and O × B ([0,T ]) × B (D ([0,T ];X))- measurable, then ∫ D([0,T ];X) Gt(ω, t,U)E t(dU) = Et(Gt) is the optional projection of Gt(A) = Gt(ω, t,A). Furthermore, if G = Gt(ω, t,U) is bounded and P × B([0,T ]) × B(D([0,T ];X))-measurable, then Et−(Gt) is the predictable projection of Gt(A) = Gt(ω, t,A). Proof. We follow the proof of Theorem 1 in [Mey76]. Since D([0,T ];X) is a Polish space, for each t ∈ [0,T ], there is family of probability measures E˜tω(dw), ω ∈ Ω, on D([0,T ];X) such that for each A ∈ B(D([0,T ];X)), E˜t(A) is Ft-measurable and P-a.s. P (A ∈ A|Ft) = E˜t (A) . For each ω ∈ Ω, let I (ω) be the set of all t ∈ (0,T ] such that for any bounded continuous function F on D(([0,T ];X), the function r 7→ E˜rω(F) = ∫ D([0,T ];X) F(w)E˜r(dw) has a right-hand limit on [0, s) ∩ Q and a left-hand limit on (0, s] ∩ Q for every rational s ∈ [0,T ] ∩ Q. Let ζ (ω) = sup (t : t ∈ I(ω)) ∧ T. It is easy to see that P (ξ < T ) = 0. We set E˜tω = 0 if ξ(ω) < t ≤ T . The function E˜ t ω has left-hand and right-hand limits for all t ∈ Q ∩ [0,T ]. We define Etω = E˜ t+ ω for each t ∈ [0,T ) (the limit is taken along the rationals), and ETω is the left-hand limit at T along the rationals. The statement follows by repeating the proof of Theorem 1 in [Mey76] in an obvious way. □ 3.4.3 Estimates of Hölder continuous functions In the coming lemmas, we establish some properties of weighted Hölder spaces that are used Section 3.3.5 and the proof of Theorem 3.2.5. Lemma 3.4.3. Let β ∈ (0, 1] and θ1, θ2 ∈ R with θ1 − θ2 ≤ β. (1) There is a constant c1 = c1 (θ2, β) such that for all ϕ : Rd1 → R with |r −θ1 1 ϕ|0 + [r−θ21 ϕ]β =: N1 < ∞, |ϕ(x) − ϕ(y)| ≤ c1N1(r1(x)θ2 ∨ r1(y)θ2)|x − y|β, for all x, y ∈ Rd1 . 72 Chapter 3. The method of stochastic characteristics for parabolic SIDEs (2) Conversely, if ϕ : Rd1 → R satisfies |r−θ11 ϕ|0 < ∞ and there is a constant N2 such that for all x, y ∈ Rd1 , |ϕ(x) − ϕ(y)| ≤ N2(r1(x)θ2 ∨ r1(y)θ2)|x − y|β, then [r−θ21 ϕ]β ≤ c1|r −θ1 1 ϕ|0 + N2. Proof. (1) For all x, y with r1 (x) θ2 ≥ r1 (y)θ2 , we have |ϕ(x) − ϕ(y)| ≤ r1(x)θ2[r −θ2 1 ϕ]β|x − y| β + r1(y) θ1−θ2 |r−θ11 ϕ|0|r θ2 1 (x) − r1(y) θ2 | ≤ ([r−θ21 ϕ]β + c1|r −θ 1 ϕ|0)r1(x) θ2 |x − y|β, where c1 := 1 + supt∈(0,1) 1−tθ2 (1−t)β if θ2 ≥ 0 and c1 := 1 + supt∈(1,∞) (tθ2−1)tβ (t−1)β if θ2 < 0, which proves the first claim. (2) For all x and y with r1(x)θ2 > r1(y)θ2 , we have |r1(x)−θ2ϕ(x) − r1(y)−θ2ϕ(y)| ≤ r1(x)−θ2 |ϕ(x) − ϕ(y)| + r1(y)θ1−θ2 |r −θ1 1 (y)ϕ(y)||r1(y) θ2r1(x) −θ2 − 1| ≤ (c1|r−θ1ϕ|0 + N2)|x − y|β, which proves the second claim. □ Lemma 3.4.4. Let β, µ ∈ (0, 1] and θ1, θ2, θ3, θ4 ∈ R with θ1 − θ2 ≤ β, θ3 − θ4 ≤ µ, and θ3 ≥ 0. If ϕ : Rd1 → R and H : Rd1 → Rd1 are such that |r−θ11 ϕ|0 + [r −θ2 1 ϕ]β =: N1 < ∞ and |r −θ3 1 H|0 + [r −θ4 1 H]µ =: N2 < ∞, then |ϕ ◦ H · r−θ1θ31 |0 ≤ |r −θ1 1 ϕ|0(1 + |r −θ3 1 H|0) ≤ N1 (1 + N2) θ1 and there is a constant N = N(β, µ, θ1, θ2) such that [ϕ ◦ H · r−θ2θ3−βθ41 ]βµ ≤ NN1(1 + N2) θ2+β. Proof. For all x, we have r1(H(x)) ≤ (1 + |r −θ3 1 H|0)r1(x) θ3 ≤ (1 + N2)r1(x)θ3 , and hence |ϕ ◦ H · r−θ1θ31 |0 ≤ |r −θ1 1 ϕ|0|r θ1 1 ◦ H · r −θ1θ3 1 |0 ≤ N1(1 + N2) θ1 . 3.4. Appendix 73 Using Lemma 3.4.3, for all x and y, we get |ϕ(H(x)) − ϕ(H(y))| ≤ NN1(r1(H(x)) ∨ r1(H(y)))θ2 |H(x) − H(y)|β ≤ NN1(1 + N2)θ2(r1(x) ∨ r1(y))θ2θ3 N β 2 (r1(x) ∨ r1(y)) βθ4 |x − y|βµ ≤ NN1(1 + N2)θ2+β(r1(x) ∨ r1(y))θ2θ3+βθ4 |x − y|βµ, for some constant N = N(β, µ, θ1, θ2). Noting that θ1θ3 − θ2θ3 − βθ4 = (θ1 − θ2)θ3 − βθ4 ≤ β(θ3 − θ4) ≤ βµ, we apply Lemma 3.4.3 to complete the proof. □ Remark 3.4.5. Let β ∈ (0, 1] and θ1, θ2 ∈ R. Then there is a constant N = N(β, θ1, θ2) such that for all ϕ : Rd1 → R with |r−θ11 ϕ|0 + [r −θ2 1 ϕ]β =: N1 < ∞, we have |r −θϕ|β ≤ NN1, where θ = max {θ1, θ2} . In particular, if in Lemma 3.4.4, θ1 = θ2 and θ4 ≥ 0, then |ϕ ◦ H · r−θ1θ3−βθ4 |βµ ≤ NN1(1 + N2)θ1+β. Proof. If θ2 ≥ θ1, then the claim is obvious and if θ1 > θ2, for all x and y, we find |r1(x)−θ1ϕ(x) − r1(y)−θ1ϕ(y)| ≤ r1(x)θ2−θ1 |r1(x)−θ2ϕ(x) − r1(y)−θ2ϕ(y)| + ∣∣∣∣∣∣ r(y)θ1−θ2 r(x)θ1−θ2 − 1 ∣∣∣∣∣∣ |r−θ11 ϕ|0 ≤ N1(1 + c1)|x − y| β, where c1 := supt∈(0,1) 1−tθ1−θ2 (1−t)β . □ Lemma 3.4.6. For any θ ≥ 0 and β > 1 , there are constants N1 = N1(d1, θ, β) and N2(d1, θ, β) such that for all ϕ : Rd1 → R, N1|r−θ1 ϕ|β ≤ ∑ |γ|≤[β]− |r−θ1 ∂ γϕ|0 + ∑ |γ|=[β]− |r−θ1 ∂ γϕ|{β}+ ≤ N2|r−θ1 ϕ|β. (3.4.1) Proof. For any multi-index γ with |γ| ≤ [β]− and x, we have ∂γ(r−θ1 ϕ)(x) = ∑ γ1+γ2+=γ |γ1 |≥1 r1(x) θ∂γ1(r−θ1 )(x)r1(x) −θ∂γ2ϕ(x) + r1(x) −θ∂γϕ(x). It is easy to show by induction that for all multi-indices γ, |rθ1∂ γ(r−θ1 )|1 < ∞. Moreover, for all multi-indices γ with |γ| < [β]−, |r−θ1 ∂ γϕ|1 ≤ |∇(r−θ1 ∂ γϕ)| ≤ |r−θ1 ∇(r −θ 1 )|0|r −θ 1 ∂ γ∇ϕ|0. 74 Chapter 3. The method of stochastic characteristics for parabolic SIDEs Thus, for any multi-index γ with |γ| ≤ [β]−, |∂γ(r−θ1 ϕ)|0 ≤ ∑ γ1+γ2+=γ |γ1 |≥1 |rθ1∂ γ1(r−θ1 )|0|r −θ 1 ∂ γ2ϕ|0 + |r−θ1 ∂ γϕ|0 and for any multi-index γ with |γ| = [β]−, |∂γ(r−θ1 ϕ)|{β}+ ≤ ∑ γ1+γ2+=γ |γ1 |≥1 |rθ1∂ γ1(r−θ1 )|1|r −θ 1 ∇(r −θ 1 )|0|r −θ 1 ∂ γ2∇ϕ|0 + |r−θ1 ∂ γϕ|0. This proves the leftmost inequality in (3.4.1). For all i ∈ {1, . . . , d} and x, r−θ1 ∂iϕ(x) = ∂i(r −θ 1 ϕ)(x) − r1(x) −θϕ(x)r1(x) θ∂i(r −θ 1 )(x). It follows by induction that for all multi-indices γ with |γ| ≤ [β]− and x, r−θ1 ∂ γϕ(x) is a sum of ∂γ(r−θ1 ϕ)(x), a finite sum of terms, each of which is a product of one term of the form ∂γ˜(r−θ1 ϕ)(x), |γ˜| < |γ|, and a finite number of terms of the form ∂ γ1(rθ1)∂ γ2(r−θ1 ), |γ1|, |γ2| ≤ |γ|. Since for all multi-indices γ1 and γ2, we have |∂γ1(rθ1)∂ γ2(r−θ1 )|1 < ∞, the rightmost inequality in (3.4.1) follows. □ Corollary 3.4.7. For any θ ≥ 0 and β > 1 , there are constants N1 = N1(d1, θ, β) and N2(d1, θ, β) such that for all ϕ : Rd1 → R, N1|r−θ1 ϕ|β ≤ |r −θ 1 ϕ|0 + ∑ |γ|=[β]− |r−θ1 ∂ γϕ|{β}+ ≤ N2|r−θ1 ϕ|β. Proof. It is well known that for an arbitrary unit ball B ⊂ Rd1 and any 1 ≤ k < [β]−, there is a constant N such that for any ε > 0, sup x∈B,|γ|=k |∂γϕ| ≤ N(ε sup x∈B,|γ|=[β]− |∂γϕ(x)| + ε−k sup x∈B |ϕ(x)|). Let U0 = {x ∈ Rd1 : |x| ≤ 1} and U j = {x ∈ Rd1 : 2 j−1 ≤ |x| ≤ 2 j}, j ≥ 1. For all j, we have sup x∈U j,|γ|=k |∂γϕ(x)| = sup B⊆U j sup x∈B,|γ|=k |∂γϕ(x)| ≤ N(ε sup B⊆U j sup x∈B,|γ|=[β]− |∂γϕ(x)| + ε−k sup B⊆U j sup x∈B |ϕ(x)|) ≤ N(ε sup x∈U j,|γ|=[β]− |∂γϕ(x)| + ε−k sup x∈U j |ϕ(x)|). 3.4. Appendix 75 Since for every j, 2−θ/22− jθ sup x∈U j,|γ|=k |∂γϕ(x)| ≤ sup x∈U j,|γ|=k |r−θ∂γϕ(x)| ≤ 2 θ 2−( j−1)θ sup x∈U j,|γ|=k |∂γϕ(x)|, we see that 2−θ/2 sup j 2− jθ sup x∈U j,|γ|=k |∂γϕ(x)| ≤ sup j sup x∈U j,|γ|=k |r−θ∂γϕ(x)| = |r−θ∂γϕ|0 ≤ 2θ sup j 2− jθ sup x∈U j,|γ|=k |∂γϕ(x)|, and the statement follows. □ Remark 3.4.8. If ϕ : Rd1 → R is such that |r−θ1ϕ|0 + |r−θ2∇ϕ|0 < ∞ for θ1, θ2 ∈ R with θ1 − θ2 ≤ 1, then [r−θ2ϕ]1 ≤ N(|r−θ1ϕ|0 + |r−θ2∇ϕ|0) Proof. Indeed, for all x and y, we have |ϕ(x) − ϕ(y)| ≤ |r−θ2∇ϕ|0 ∫ 1 0 rθ2(x + s(y − x))ds|y − x| ≤ |r−θ2∇ϕ|0(r(y)θ2 ∨ r(x)θ2)|y − x|, and hence the claim follows from Lemma 3.4.3. □ Lemma 3.4.9. Let n ∈ N, β, µ ∈ (0, 1], θ3, θ4 ≥ 0 be such that θ3 − θ4 ≤ 1. There is a constant N = N(d1, θ1, θ3, θ4, n, β) such that for all ϕ : Rd1 → R with r −θ1 1 ϕ ∈ C n+β(Rd1 ,Rd1) and H : Rd1 → Rd1 with |r−θ31 H|0 + |r −θ4 1 ∇H|n−1+µ =: N2 < ∞, we have |ϕ ◦ H · r−θ1θ3 |0 ≤ |r −θ1 1 ϕ|0(1 + |r −θ3 1 H|0) θ1 and |r−θ1θ3−θ4(n+µ∧β)1 ∇(ϕ ◦ H)|n−1+µ∧β ≤ N|r −θ1 1 ϕ|n+β(1 + N2) θ1+µ∧β+n. Proof. It follows immediately from Lemma 3.4.4 and Remark 3.4.8 that |ϕ ◦ H · r−θ1θ3 |0 ≤ |r −θ1 1 ϕ|0(1 + |r −θ3 1 H|0) θ1 . Using induction, we get that for all x and |γ| = n, ∂γ(ϕ(H(x))) = Iγ1(x) + I γ 2(x) + I γ 3 (x), 76 Chapter 3. The method of stochastic characteristics for parabolic SIDEs where Iγ1(x) = d1∑ i=1 ∂iϕ(H(x))∂ γHi(x) Iγ2(x) is a finite sum of terms of the form ∂i1 · · · ∂i|γ|ϕ(H(x))∂ γ˜1 Hi1 · · · ∂γ˜|γ|Hi|γ| with i1, . . . , i|γ| ∈ {1, 2, . . . , d}, |γ˜1| = · · · = |γ˜|γ|| = 1, and ∑|γ| k=1 γ˜k = γ, if n ≥ 2 and zero otherwise, and where Iγ3(x) is a finite sum of terms of the form ∂i1 · · · ∂ikϕ(H(x))∂ γ˜1 Hi1(x) · · · ∂γ˜k Hik(x) with 2 ≤ k < n, i1, i2, . . . , ik ∈ {1, . . . , d}, and ∑k j=1 γ˜ j = γ, 1 ≤ |γ˜ j| < |γ|, if n ≥ 3, and zero otherwise. Thus, owing to Lemmas 3.4.4 and 3.4.6, for any multi-index γ with |γ| = n, we have |r−θ3θ1−θ41 I γ 1 |0 ≤ N |r −θ1 1 ∇ϕ|0(1 + |r −θ3 1 H|0) θ1 |r−θ41 ∂ γH|0, |r−θ3θ1−nθ41 I γ 2 |0 ≤ N|r −θ1 1 ∂ γϕ|0(1 + |r −θ3 1 H|0) θ1 |r−θ41 ∇H| n 0, and |r−θ3θ1−(n−1)θ41 I γ 3 |0 ≤ N |r −θ1 1 ϕ|n−1(1 + |r −θ3 1 H|0 + |r −θ4 1 ∇H|) θ1+n−1, and hence |r−θ1θ3−nθ4∂γ(ϕ ◦ H)|0 ≤ N |r −θ1 1 ϕ|n(1 + |r −θ3 1 H|0 + |r −θ4 1 ∇H|) θ1+n. Appealing again to Lemmas 3.4.4 and 3.4.6, for all multi-indices γ with |γ| = n, we get |r−θ1θ3−(1+µ∧β)θ41 I γ 1 |µ∧β ≤ N|r −θ1 1 ϕ|1+µ∧β (1 + N2) θ1+µ∧β+1 , |r−θ1θ3−(n+µ∧β)θ41 I γ 2 |µ∧β + |r −θ1θ3−(n−1+µ∧β)θ4 1 I γ 3 |µ∧β ≤ N|r −θ1 1 ϕ|n+µ∧β (1 + N2) θ1+n+µ∧β . Then applying Lemmas 3.4.4 and 3.4.6, we complete the proof. □ We shall now provide some useful estimates of composite functions of diffeomor- phisms. Lemma 3.4.10. Let H : Rd1 → Rd1 be continuously differentiable and assume that for all x ∈ Rd1 , |H(x)| ≤ L0 + L1|x| and |∇H(x)| ≤ L2. Assume that for all x ∈ Rd1 , κ(x) = (Id1 + ∇H(x)) −1 exists and |κ(x)| ≤ Nκ. (1) Then the mapping H˜(x) := x + H(x) is a diffeomorphism with H˜−1(x) = x−H(H˜−1(x)) 3.4. Appendix 77 =: x + F(x) and for all x ∈ Rd1 , |F(x)| ≤ L0 + L1L0Nκ + L1Nκ|x|, |∇F(x)| ≤ NκL2, | ( Id1 + ∇F(x) )−1 | ≤ 1 + L2. For all p ∈ R, there is a constant N = N(L0, L1,Nκ, p) such that for all x ∈ Rd1 , rp1 (H˜(x)) rp1 (x) + rp1 (H˜ −1(x)) rp1 (x) ≤ N, r−11 (x)|H i(x) + Fk;i(x)| ≤ N[H]1|r−11 H|0. Moreover, there is a constant N = N(L0, L1,Nκ, p) such that ∣∣∣∣∣∣ rp1 (H˜) rp1 − 1 + 1(1,2](α)pHir−21 x i ∣∣∣∣∣∣ α + ∣∣∣∣∣∣ rp1 (H˜ −1) rp1 − 1 − 1(1,2](α)pF ir−21 x i ∣∣∣∣∣∣ α ≤ N(|r−11 H| [α]−+1 0 + [H] [α]−+1 1 ). (2) If for some β > 1, |∇H|β−1 ≤ L3, then there is a constant N = N(d1, β,Nκ, L3) such that |∇F|β−1 ≤ N |∇H|β−1. (3.4.2) (3) If for some β ≥ 1, |∇H|β−1 ≤ L3, then for all θ ≥ 0, there is a constant N = N(d1, β,Nκ, L1, L3, θ) such that ∣∣∣∣∣∣ rθ1 ◦ H˜ −1 rθ1 − 1 ∣∣∣∣∣∣ β ≤ N(|r−11 H|0 + |∇H|β−1). (4) If |H|0 ≤ L4, and for some β > 0, |∇H|β∨1−1 ≤ L5 and ϕ : Rd1 → R is such that for some µ ∈ (0, 1] and θ ≥ 0, r−θ1 ϕ ∈ C β+µ(Rd1; R), then there is a constant N = N(d1, β, µ,Nκ, L4, L5, θ) such that |r−θ1 (ϕ ◦ H˜ −1 − ϕ)|β ≤ N |r−θ1 ϕ|β(|H|0 + |∇H|β∨1−1) +N1(0,1]({β}+ + µ) ∑ |γ|=[β]− [∂γ(r−θ1 ϕ)]{β}++µL µ 4 +N1(1,2]({β}+ + µ) ∑ |γ|=[β]− ( [∇∂γ(r−θ1 ϕ)]{β}++µ−1L µ 4 + |∇∂ γ(r−θ1 ϕ)|0|∇H|0 ) . Proof. (1) Since (Id1 +∇H(x)) −1 exists for all x, it follows from Theorem 0.2 in [DMGZ94] that the mapping H˜ is a global diffeomorphism. For all x, we easily verify H˜−1(x) = x − H(H˜−1(x)) by substituting H˜(x) into the expression. Simple computations show that for all x, we have |∇H˜(x)| ≤ 1 + L2, |∇H˜−1(x)| = |κ(H˜−1(x))| ≤ Nκ, 78 Chapter 3. The method of stochastic characteristics for parabolic SIDEs |∇F(x)| = |∇H(H˜−1(x))∇H˜−1(x)| ≤ NκL2, |(Id1 + ∇F(x)) −1| = |∇H˜−1(x)−1| = |κ(H˜−1(x))−1| = |Id1 + ∇H(H˜ −1(x))| ≤ 1 + L2. For all x and y, we easily obtain |H˜(x) − H˜(y)| ≤ (1 + L2)|x − y|, |H˜−1(x) − H˜−1(y)| ≤ Nκ|x − y|, and hence N−1κ |x − y| ≤ |H˜(x) − H˜(y)|, (1 + L2) −1|x − y| ≤ |H˜−1(x) − H˜−1(y)|. (3.4.3) Making use of (3.4.3), for all x, we get N−1κ |x| ≤ L0 + |H˜(x)|, |H˜ −1(x)| ≤ NκL0 + Nκ|x|, |x| ≤ L0 + L1|H˜−1(x)|, and thus |F(x)| ≤ L0 + L1NκL0 + L1Nκ|x|. The rest of the estimates then follow easily from the above estimates and Taylor’s theorem. (2) Using the chain rule, for all x, we obtain ∇F(x) = −∇H(H˜−1(x))∇H˜−1(x) = −∇H(H˜−1(x))κ(H˜−1(x)), (3.4.4) and hence |∇F|0 ≤ Nκ|∇H|0. For all x and y, we have κ(H˜−1(y)) − κ(H˜−1(x)) = κ(y)[∇H(H˜−1(x)) − ∇H(H˜−1(y))]κ(x), and thus since [H˜−1]1 ≤ (1 + NκL3) by part (1), we have for all δ ∈ (0, 1 ∧ β], [κ(H˜−1)]δ ≤ N2κ (1 + NκL3) δ[∇H]δ. It follows that there is a constant N = N(Nκ, L3) such that for all δ ∈ (0, 1 ∧ β], |∇F|δ ≤ N|H|δ. It is well-known that the inverse map I on the set of invertible d1×d1 matrices is infinitely differentiable and for each n, there exists a constant N = N(n, d1) such that for all invertible matrices A, the n-th derivative of I evaluated at A, denoted I(n)(A), satisfies |I(n)(A)| ≤ N |A−n−1| ≤ N |A−1|n+1. 3.4. Appendix 79 Using induction we find that for all multi-indices γ with |γ| ≤ [β]− and for all x, ∂γF(x) is a finite sum of terms, each of which is a finite product of ∂γ¯H(H˜−1(x)), κ(H˜−1(x))n¯, I(n¯−1)(I + ∇H(H˜−1(x))), |γ¯| ≤ |γ|, n¯ ∈ {1, . . . , |γ|}. Therefore, differentiating (3.4.4) and estimating directly we easily obtain (3.4.2). (3) For each x, we have r1(H˜−1(x))θ r1(x)θ − 1 = r1(x)−θ ∫ 1 0 r1(Gs(x)) θ−2Gs(x) ∗F(x)ds = ∫ 1 0 rθ−11 (Gs(x)) r1(x)θ−1 K(Gs(x)) ∗dsr1(x) −1F(x), where Gs(x) := x + sF(x), s ∈ [0, 1], and J(x) := r1(x)−1x. According to part (1) and (2), we have |r−11 F|0 ≤ N|r −1 1 H|0 and |∇F|β−1 ≤ N|∇H|β−1, and hence |r−11 Gs|0 ≤ N(1 + |r −1 1 H|0), |∇Gs(x)|β−1 ≤ N(1 + |∇H|β−1). and |J ◦Gs|β ≤ N(1 + |r−11 H|0 + |∇H|β−1), for some constant N independent of s. Moreover, using Lemma 3.4.9 we find |rθ−11 ◦Gs · r 1−θ 1 |β ≤ N ( 1 + |r−11 H|0 + |∇H|β−1 )θ+β . The statement then follows. (4) First, we will consider the case θ = 0. By part (1), we have that for all µ¯ ∈ (0, (β+µ)∧1], |ϕ ◦ H˜−1 − ϕ|0 ≤ [ϕ]µ¯|H ◦ H˜−1| µ 0 ≤ [ϕ]µ¯|H| µ¯ 0. First, let us consider the case β ≤ 1. For each x, let J(x) = ϕ(H˜−1(x))−ϕ(x). For all x and y, it is clear that |J(x) − J(y)| ≤ A(x, y) + B(x, y) + C(x, y), where A(x, y) := |J(x)|1[L4,∞)(|x − y|), B(x, y) := |J(y)|1[L4,∞)(|x − y|), and C(x, y) := |J(x) − J(y)|1[0,L4)(|x − y|). Moreover, owing to part (1), if β + µ ≤ 1, then for all x and y, we have A(x, y) ≤ [ϕ]β+µL β+µ 4 1[L4,∞)(|x − y|) ≤ [ϕ]β+µL µ 4 |x − y| {β}+ , 80 Chapter 3. The method of stochastic characteristics for parabolic SIDEs B(x, y) ≤ [ϕ]β+µL µ 4 |x − y| β, and C(x, y) ≤ [ϕ]β+µ|[H˜−1] β+µ 1 |x − y| β+µ1[0,L4)(|x − y|) + [ϕ]β+µ|x − y| β+µ1[0,L4)(|x − y|) ≤ N[ϕ]β+µL µ 4 |x − y| β for some constant N = N(µ,Nκ, L4). Using the identity J(x) − J(y) = − ∫ 1 0 ( ∇ϕ ( x − θH(H˜−1(x)) ) − ∇ϕ ( y − θH(H˜−1(y)) )) H(H˜−1(x))dθ − ∫ 1 0 ∇ϕ ( y − θH(H˜−1(y)) ) (H(H˜−1(y)) − H(H˜−1(x))), and part (1), if β + µ > 1, then there is a constant N = N(µ,Nκ, L4) such that for all x and y, |J(x) − J(y)|1[L4,∞)(|x − y|) ≤ N([∇ϕ]β+µ−1|x − y| β+µ−1L4 + |∇ϕ|0|x − y|[H]1)1[L4,∞)(|x − y|) ≤ N[∇ϕ]β+µ−1L µ 4 |x − y| β + N|∇ϕ|0|∇H|0|x − y|. Moreover, since J(x) − J(y) = ∫ 1 0 ∇ϕ ( H˜−1(x + θ(y − x)) ) ( ∇H˜−1(x + θ(y − x)) − Id1 ) (x − y)dθ + ∫ 1 0 ( ∇ϕ ( H˜−1 (x + θ(y − x)) ) − ∇ϕ (x + θ(y − x)) ) (x − y)dθ, by part (1) and (3.4.2), if β + µ > 1, we attain that there is a constant N = N(µ,Nκ, L4) such that for all x and y, |J(x) − J(y)|1[0,L4)(|x − y|) ≤ (|∇ϕ|0|∇H|0 + [∇ϕ]β+µ−1L β+µ−1 4 )|x − y|1[0,L4)(|x − y|) ≤ |∇ϕ|0|∇H|0|x − y| + [∇ϕ]β+µ−1L µ 4 |x − y| β. Combining the above estimates, we get that for all β ≤ 1 and µ ∈ (0, 1], there is a constant N = N(µ,Nκ, L4) such that [ϕ ◦ H˜−1 − ϕ]β ≤ N1[0,1](β+ µ)[ϕ]β+µL µ 4 + N1(1,2](β+ µ) ( [∇ϕ]β+µ−1 + |∇ϕ|0|∇H|0 ) . (3.4.5) 3.4. Appendix 81 This proves the desired estimate for β ≤ 1 and θ = 0. We now consider the case β > 1. For β > 1, it is straightforward to prove by induction that for all multi-indices γ with 1 ≤ |γ| ≤ [β]− and for all x, ∂γ(ϕ(H˜−1))(x) = Jγ1 (x) +J γ 2 (x) +J γ 3 (x) +J γ 4 (x), where Jγ1 (x) := ∂ γϕ(H˜−1(x)), Jγ2 (x) = ∂ γϕ(H˜−1)(∂1H˜ −1;1)γ1 · · · (∂dH˜−1;d)γd − 1, Jγ3 (x) is a finite sum of terms of the form ∂ j1 · · · ∂ jkϕ(H˜ −1(x))∂γ˜1 H˜−1; j1(x) · · · ∂γ˜k H˜−1; jk(x) with 1 ≤ k < [β]−, j1, . . . , jk ∈ {1, . . . , d}, and ∑k j=1 γ˜ j = γ, and J4(x) is a finite sum of terms of the form ∂ j1 . . . ∂ j[β]−ϕ(H˜ −1(x))∂i1 H˜ −1; j1(x) · · · ∂i[β]− H˜ −1; j[β]− (x) with i1, j1, . . . , i[β]− , j[β]− ∈ {1, . . . , d} and at least one pair ik , jk. Since for all x, ∇H˜−1(x) = I + ∇F(x) and (3.4.2) holds, there is a constant N = N(d1, β) such that ∑ 1≤|γ|≤β 4∑ i=2 |Jγi |0 + ∑ |γ|=β 4∑ i=2 |Jγi |{β}+ ≤ N |∇ϕ|β−1|∇F|β−1 ≤ N |∇ϕ|β−1|∇H|β−1. If β > 2, then for all multi-indices γ with 1 ≤ |γ| < [β]−, we get |Jγ1 − ∂ γϕ|0 = |∂γϕ ◦ H˜−1 − ∂γϕ|0 ≤ [∂γϕ]1|H|0. It is easy to see that there is a constant N = N(L4,Nκ) such that for all γ with |γ| = [β]− and all µ¯ ∈ (0, ({β}+ + µ) ∧ 1], |Jγ1 − ∂ γϕ|0 = |∂γϕ ◦ H˜−1 − ∂γϕ|0 ≤ [∂γϕ]µ¯|H| µ¯ 0. Moreover, appealing to the estimate (3.4.5) we obtain [Jγ1 − ∂ γϕ]{β}+ ≤ N1[0,1]({β}+ + µ)[∂γϕ]{β}++µL µ 4 + N1(1,2]({β} + + µ) ( [∇∂γϕ]{β}++µ−1 + |∇∂γϕ|0|∇H|0 ) . 82 Chapter 3. The method of stochastic characteristics for parabolic SIDEs Let us now consider the case θ > 0. The following decomposition obviously holds for all x: r1(x) −θϕ(H˜−1(x)) − r1(x)−θϕ(x) = ϕˆ(H˜−1) − ϕˆ(x) + ( r1(H˜−1(x))θ r1(x)θ − 1 ) ϕˆ(H˜−1(x)), where ϕˆ = r−θ1 ϕ ∈ C β(Rd1; Rd1). Thus, to complete the proof we require |ϕˆ ◦ H˜−1|β ≤ N |ϕˆ|β and ∣∣∣∣∣∣ rθ1 ◦ H˜ −1 rθ1 − 1 ∣∣∣∣∣∣ β ≤ N(|H|0 + |∇H|β∨1−1). The latter inequality was proved in part (3) and the first inequality follows from part (2) and Lemma 3.4.9. □ Remark 3.4.11. Let H : Rd1 → Rd1 be continuously differentiable and assume that for all x, |∇H(x)| ≤ η < 1. Then for all x ∈ Rd1 , |(Id1 + ∇H(x)) −1| ≤ |Id1 + ∞∑ k=1 (−1)k∇H(x)k| ≤ 1 1 − η . 3.4.4 Stochastic Fubini theorem Let m = (mϱ)t≤T , ϱ ∈ N, be a sequence of F-adapted locally square integrable continuous martingales issuing from zero such that P-a.s. for all t ∈ [0,T ], ⟨mϱ1 ,mϱ2⟩t = 0 for ϱ1 , ϱ2 and ⟨mϱ⟩t = Nt for ϱ ∈ N, where Nt is a PT -measurable continuous increasing processes issuing from zero. Let η(dt, dz) be a F-adapted integer-valued random measure on ([0,T ]× E,B([0,T ])⊗E), where (U,U) is a Blackwell space. We assume that η(dt, dz) is optional, P˜T -sigma-finite, and quasi-left continuous. Thus, there exists a unique (up to a P-null set) dual predictable projection (or compensator) ηp(dt, dz) of η(dt, dz) such that µ(ω, {t}×U) = 0 for all ω and t. We refer the reader to Chapter II, Section 1, in [JS03] for any unexplained concepts relating to random measures. Let (X,Σ, µ) be a sigma-finite measure space; that is, there is an increasing sequence of Σ-measurable sets Xn, n ∈ N, such that X = ∪∞n=1Xn and µ(Xn) < ∞ for each n. Let f : Ω × [0,T ] × X → Rd2 be RT ⊗ Σ-measurable, g : Ω × [0,T ] × X → ℓ2(Rd2) be RT ⊗ Σ/B(ℓ2(Rd2))-measurable, and h : Ω × [0,T ] × X × U → Rd2 be PT ⊗ Σ ⊗ U- measurable. Moreover, assume that for all t ∈ [0,T ] and x ∈ X, P-a.s. ∫ ]0,T ] |gt(x)|2dNt + ∫ ]0,T ] ∫ U |ht(x, z)|2ηp(dt, dz) < ∞. 3.4. Appendix 83 Let F = Ft(x) : Ω × [0,T ] × X → Rd2 be OT ⊗ B(X)-measurable and assume that for dPµ-almost all (t, x) ∈ [0,T ] × X, Ft(x) = ∫ ]0,t] gϱs(x)dm ϱ s + ∫ ]0,t] ∫ U hs(x, z)η˜(dt, dz), where η˜(dt, dz) = η(dt, dz) − ηp(dt, dz). The following version of the stochastic Fubini theorem is a straightforward extension of Lemma 2.6 [Kry11] and Corollary 1 in [Mik83]. See also Proposition 3.1 in [Zho13], Theorem 2.2 in [Ver12], and Theorem 1.4.8 in [Roz90]. Indeed, to prove it for a bounded measure we can use a monotone class argument as in Theorem 64 in [Pro05]. To handle the general setting with possibly infinite µ, we use assumptions (ii) and (iii) below and take limits on the sets Xn using the Lenglart domination lemma (Theorem 1.4.5 in [LS89]) and the following well-known inequalities: E sup t≤T ∣∣∣∣∣∣ ∫ ]0,t] gϱsdm ϱ s ∣∣∣∣∣∣ ≤ NE (∫ ]0,T ] |gt(x)|2dm ϱ t )1/2 E sup t≤T ∣∣∣∣∣∣ ∫ ]0,t] ∫ U ht(x, z)η˜(dt, dz) ∣∣∣∣∣∣ ≤ NE (∫ ]0,T ] ∫ U |ht(x, z)|2ηp(dt, dz) )1/2 , where τ ≤ T is an arbitrary stopping time and N = N(T ) is a constant independent of g and h. Proposition 3.4.12 (c.f. Corollary 1 in [Mik83] and Lemma 2.6 in [Kry11]). Assume that (1) P-a.s. for all n ≥ 1, ∫ Xn (∫ ]0,T ] |gt(x)|2dNt )1/2 µ(dx) + ∫ Xn (∫ ]0,T ] ∫ U1 |ht(x, z)|2ηp(dt, dz) )1/2 µ(dx) < ∞; (2) P-a.s. ∫ ]0,T ] (∫ X |gt(x)|µ(dx) )2 dt + ∫ ]0,T ] ∫ U (∫ X |ht(x, z)|µ(dx) )2 ηp(dt, dz); (3) P-a.s. for al t ∈ [0,T ], ∫ X |Ft(x)|µ(dx) < ∞. Then P-a.s. for all t ∈ [0,T ], ∫ X Ft(x)µ(dx) = ∫ ]0,t] ∫ X gϱs(x)µ(dx)dm ϱ s + ∫ ]0,t] ∫ U ∫ X hs(x, z)µ(dx)η˜(dr, dz). We obtain the following corollary by applying Minkowski’s integral inequaility. 84 Chapter 3. The method of stochastic characteristics for parabolic SIDEs Corollary 3.4.13. Assume that P-a.s. ∫ X (∫ ]0,T ] |gt(x)|2dNt )1/2 µ(dx) + ∫ X (∫ ]0,T ] ∫ U1 |ht(x, z)|2ηp(dt, dz) )1/2 µ(dx) < ∞. (3.4.6) Then P-a.s. for all t ∈ [0,T ], ∫ X Ft(x)µ(dx) = ∫ ]0,t] ∫ X gϱs(x)µ(dx)dm ϱ s + ∫ ]0,t] ∫ U ∫ X hs(x, z)µ(dx)η˜(dr, dz). Remark 3.4.14. If µ is a finite-measure and P-a.s. ∫ X ∫ ]0,T ] |gt(x)|2dNtµ(dx) + ∫ X ∫ ]0,T ] ∫ U1 |ht(x, z)|2ηp(dt, dz)µ(dx) < ∞, then (3.4.6) holds by Hölder’s inequality. 3.4.5 Itô-Wentzell formula Definition 3.4.15. We say that an Rd1-valued F-adapted quasi-left continuous semimartin- gale Lt = (Lkt )1≤k≤d1 , t ≥ 0, is of α-order for α ∈ (0, 2], if P-a.s. for all t ≥ 0, ∑ s≤t |∆Ls|α < ∞ and Lt = L0 + ∫ ]0,t] ∫ R d1 0 zpL(ds, dz), if α ∈ (0, 1), Lt = L0 + At + ∫ ]0,t] ∫ |z|≤1 zqL(ds, dz) + ∫ ]0,t] ∫ |z|>1 zpL(ds, dz), if α ∈ [1, 2), Lt = L0 + At + L c t + ∫ ]0,t] ∫ |z|≤1 zqL(ds, dz) + ∫ ]0,t] ∫ |z|>1 zpL(ds, dz), if α = 2, where pL(dt, dz) is the jump measure of L with dual predictable projection πL(dt, dz), qL (dt, dz) = pL(dt, dz) − πL(dt, dz) is a martingale measure, At = (Ait)1≤i≤d1 is a continuous process of finite variation with A0 = 0, and Lct = (L c;i t )1≤i≤d1 is a continuous local martingale issuing from zero. Set (wϱ)ϱ∈N = (w1;ϱ)ϱ∈N, (Z,Z, π) = (Z1,Z1, π1), p(dt, dz) = p1(dt, dz), and q(dt, dz) = q1(dt, dz). Also, set D = D1, E = E1, and assume Z = D ∪ E. Let f : Ω × [0,T ] × Rd1 → Rd2 be RT ⊗ B(Rd1)-measurable, g : Ω × [0,T ] × Rd1 → ℓ2(Rd2) be RT ⊗ B(Rd1)/B(ℓ2(Rd2))-measurable, and h : Ω × [0,T ] × Rd1 × Z → Rd2 be 3.4. Appendix 85 PT ⊗ B(Rd1) ⊗Z-measurable. Moreover, assume that, P-a.s. for all x ∈ Rd1 , ∫ ]0,T ] | ft(x)|dt + ∫ ]0,T ] |gt(x)|2dt < ∞ + ∫ ]0,T ] ∫ D |ht(x, z)|2π(dz)dt + ∫ ]0,T ] ∫ E |ht(x, z)|π(dz)dt < ∞. Let F = Ft(x) : Ω × [0,T ] × Rd1 → Rd2 be OT ⊗ B(Rd1)-measurable and assume that for all x, P-a.s. for all t, Ft(x) = F0(x) + ∫ ]0,t] fs(x)ds + ∫ ]0,t] gϱs(x)dw ϱ s + ∫ ]0,t] ∫ Z hs(x, z)[1D(z)q(ds, dz) + 1E(z)p(ds, dz)]. For each n ∈ {1, 2}, let Cnloc(R d1; Rd2) be space of n-times continuously differentiable func- tions f : Rd1 → Rd2 . We now state our version of the Itô-Wentzell formula. For each ω, t and x, we denote ∆F(x) = Ft(x) − Ft−(x). Proposition 3.4.16 (cf. Proposition 1 in [Mik83] ). Let (Lt)t≥0 be an Rd1-valued quasi-left continuous semimartingale of order α ∈ (0, 2]. Assume that: (1) (a) P-a.s. F ∈ D([0,T ];Cαloc(R d; Rm) if α is fractional and F ∈ D([0,T ]; Cαloc(R d; Rm) if α = 1, 2 ; (b) for dPdt-almost-all (ω, t) ∈ Ω × [0,T ], ft(x) and gt(x) = (g iϱ t (x))ϱ∈N ∈ ℓ2(R d2) are continuous in x and dPdt − lim y→x [∫ D |ht(y, z) − ht(x, z)|2π(dz) + ∫ E |ht(y, z) − ht(x, z)|π(dz) ] = 0; (c) for all ϱ ∈ N and i ∈ {1, . . . , d1} and for dPd|⟨Lc;i,wϱ⟩|t-almost-all (ω, t) ∈ Ω × [0,T ], giϱt ∈ C 1 loc(R d; R), if α = 2, ; (2) for all compact subsets K of Rd1 , P-a.s. ∫ ]0,T ] sup x∈K ( | ft(x)| + |gt(x)|2 + ∫ D |ht(x, z)|2π(dz) + ∫ E |ht(x, z)|π(dz) ) dt < ∞, ∑ ϱ∈N ∫ ]0,T ] sup x∈K |∇giϱt (x)|d|⟨L c;i,wϱ⟩|t + ∑ t≤T |∆Ft|α∧1;K |∆Lt|α∧1 < ∞. Then P-a.s for all t ∈ [0,T ], Ft(Lt) = F0(L0) + ∫ ]0,t] fs(Ls)ds + ∫ ]0,t] gϱs(Ls)dw ϱ s 86 Chapter 3. The method of stochastic characteristics for parabolic SIDEs + ∫ ]0,t] ∫ Z hs(Ls−, z)[1D(z)q(dr, dz) + 1E(z)p(dr, dz)] + ∫ ]0,t] ∂iFs−(Ls−)[1[1,2](α)dAis + 1{2}(α)dL c;i s ] + ∑ s≤t ( Fs−(Ls) − Fs−(Ls−) − 1[1,2](α)∇Fs−(Ls−)∆Ls ) + 1{2}(α) 1 2 ∫ ]0,t] ∂i jFs(Ls)d⟨Lc;i, Lc; j⟩s (3.4.7) + 1{2}(α) ∫ ]0,t] ∂ig ϱ s(Ls)d⟨w ϱ, Lc;i⟩s + ∑ s≤t (∆Fs(Ls) − ∆Fs(Ls−)) . Proof. Since both sides have identical jumps and we can always interlace a finite set of jumps, we may assume that |∆Lt| ≤ 1 for all t ∈ [0,T ]; that is, it is enough to prove the statement for L˜t = Lt − ∑ s≤t 1[1,∞)(|∆Ls|)∆Ls, t ∈ [0,T ]. It suffices to assume that for some K and all ω, |L0| ≤ K. For each R > K, let τR = inf  t ∈ [0,T ] : |A|t + |⟨L c⟩|t + ∑ s≤t |∆Ls|α + |Lt| > R   ∧ T and note that P-a.s. τR ↑ T as R tends to infinity. If instead of L, f , g, h, and F, we take L·∧τR , f 1(0,τR], g ϱ1(0,τR], h1(0,τR], F1(0,τR], then the assumptions of the proposition hold for this new set of processes. Moreover, if we can prove (3.4.7) for this new set of processes, then by taking the limit as R tends to infinity, we obtain (3.4.7). Therefore, we may assume that for some R > 0, P-a.s. for all t ∈ [0,T ], |A|t + |⟨Lc⟩|t + ∑ s≤t |∆Ls|α + |Lt| ≤ R. (3.4.8) Let ϕ ∈ C∞c (R d1 ,R) have support in the unit ball in Rd1 and satisfy ∫ Rd1 ϕ(x)dx = 1, ϕ(x) = ϕ(−x), and ϕ(x) ≥ 0, for all x ∈ Rd1 . For each ε ∈ (0, 1), let ϕε(x) = ε−dϕ (x/ε) , x ∈ Rd1 . By Itô’s formula, for all x ∈ Rd1 , P-a.s. for all t ∈ [0,T ], Ft(x)ϕε(x − Lt) = F0(x)ϕε(x − L0) − ∫ ]0,t] Fs−(x)∂iϕε(x − Ls−)dLis + ∫ ]0,t] ϕε (x − Ls) fs(x)ds + ∫ ]0,t] ϕε (x − Ls) gϱs(x)dw ϱ s + 1{2}(α) 1 2 ∫ ]0,t] Fs(x)∂i jϕε(x − Ls)d⟨Lc;i, Lc; j⟩s + 1{2}(α) ∫ ]0,t] gϱs(x)∂iϕε(x − Ls)d⟨w ϱ, Lc;i⟩s + ∫ ]0,t] ∫ Z ϕε (x − Ls−) hs(x, z)[1D(z)q(dr, dz) + 1E(z)p(dr, dz)] 3.4. Appendix 87 + ∑ s≤t ∆Fs(x) (ϕε(x − Ls) − ϕε (x − Ls−)) + ∑ s≤t Fs−(x) (ϕε(x − Ls) − ϕε(x − Ls−) + ∂iϕε (x − Ls−) ∆Ls) . Appealing to assumption (2) and (3.4.8) (i.e. for the integrals against F), we integrate both sides of the above in x, apply Corollary 3.4.13 (see, also, Remark 3.4.14) and the deterministic Fubini theorem, and then integrate by parts to get that P-a.s. for all t ∈ [0,T ], F(ε)t (Lt) = F (ε) 0 (L0) + ∫ ]0,t] ∇F(ε)s− (Ls−)[1[1,2](α)dA i s + 1{2}(α)dL c;i s ] + ∫ ]0,t] f (ε)s (Ls)dr + ∫ ]0,t] g(ε)s (Ls)dw ϱ s + ∫ ]0,t] ∫ Z h(ε)s (Ls−, z)[1D(z)q(dr, dz) + 1E(z)p(dr, dz)] + 1{2}(α) 1 2 ∫ ]0,t] ∂i jF (ε) s (Ls)d⟨L c;i, Lc; j⟩s + 1{2}(α) ∫ ]0,t] ∂ig (ε);ϱ s (Ls)d⟨w ϱ, Lc;i⟩s + ∑ s≤t ( ∆F(ε)s (Ls) − ∆F (ε) s (Ls−) ) + ∑ s≤t ( F(ε)s− (Ls) − F (ε) s− (Ls−) − 1[1,2](α)∇F (ε) s− (Ls−)∆Ls ) (3.4.9) where for all ω, t, x, and z, F(ε)t (x) := ϕε ∗ Ft(x), f (ε) t = ϕε ∗ ft(x), g (ε);ϱ t (x) = ϕε ∗ g ϱ t (x), h (ε) t (x, z) = ϕε ∗ ht(x, z), and ∗ denotes the convolution operator on Rd1 . Let BR+1 = {x ∈ Rd1 : |x| ≤ R+1}. Owing to assumption (1)(a) and standard properties of mollifiers, for any multi-index γ with |γ| ≤ α, P-a.s. for all t, |∂γF(ε)t (Lt)| ≤ sup t≤T sup x∈BR+1 |∂γFt(x)| < ∞ and for all x, dPdt − lim ε↓0 |∂γF(ε)t (x) − ∂ γF(ε)t (x)| = 0. Similarly, by assumption 1(b), dPdt-almost-all (ω, t) ∈ Ω × [0,T ], | f (ε)t (Lt)| ≤ sup x∈BR+1 | ft(x)| < ∞, |g (ε) t (Lt)| ≤ sup x∈BR+1 |gt(x)| < ∞, ∫ D |hεt (Lt, z)| 2π(dz) ≤ sup x∈BR+1 ∫ D |ht(x, z)|2π(dz), ∫ E |hεt (Lt, z)|π(dz) ≤ sup x∈BR+1 ∫ E |ht(x, z)|π(dz) 88 Chapter 3. The method of stochastic characteristics for parabolic SIDEs and for all x, dPdt − lim ε↓0 | f (ε)t (x) − ft(x)| = 0, dPdt − lim ε→0 |g(ε)t (x) − gt(x)| = 0 and dPdt − lim ε↓0 ∫ Z [1D(z)|h (ε) t (x, z) − ht(x, z)| 2 + 1E(z)|h (ε) t (x, z) − ht(x, z)|]π(dz) = 0, where in the last-line we have also used Minkowski’s integral inequality and a standard mollifying convergence argument. Using assumption 1(d), for all ϱ ∈ N and i ∈ {1, . . . , d1} and for dPd|⟨Lc;i,wϱ⟩|t-almost-all (ω, t) ∈ Ω × [0,T ] |∇g(ε);iϱt (Lt)| ≤ sup x∈BR+1 |∇giϱt (x)| and for all x, dPd|⟨Lc;i,wϱ⟩|t − lim ε→0 |∇g(ε);iϱt (x) − ∇g iϱ t (x)| = 0, if α = 2. Owing to assumption 1(a) and (3.4.8), P-a.s. ∑ s≤t |F(ε)s− (Ls) − F (ε) s− (Ls−) − 1[1,2](α)∇F (ε) s− (Ls−)∆Ls| ≤ sup t≤T |Ft|α;BR+1 ∑ s≤t |∆Ls|α ≤ R sup t≤T |Ft|α;BR+1 . Since P-a.s. F ∈ D([0,T ];Cα(Rd; Rm), it follows that for all x, P-a.s. for all t, lim ε↓0 |∆Fεt (x) − ∆Ft(x)| = 0. By assumption (2), P-a.s for all t, we have ∑ s≤t ( ∆F(ε)s (Ls− + ∆Ls) − ∆F (ε) s (Ls−) ) ≤ ∑ s≤t |∆Ft|α∧1;BR+1 |∆Ls| α∧1. Combining the above and using assumptions (1)(a) and (2) and the bounds given in (3.4.8) and the deterministic and stochastic dominated convergence theorem, we obtain conver- gence of all the terms in (3.4.9), which complete the proof. □ Chapter 4 The L2-Sobolev theory for parabolic SIDEs 4.1 Introduction Let (Ω,F ,P) be a probability space with the filtration F = (Ft)0≤t≤T of sigma-algebras satisfying usual conditions. In a triple of Hilbert spaces (Hα+µ,Hα,Hα−µ) with parameters µ ∈ (0, 1] and α ≥ µ, we consider a linear stochastic evolution equation given by dut = (Ltut + ft) dVt + (Mtut− + gt) dMt, t ≤ T, (4.1.1) u0 = φ, where Vt is a continuous non-decreasing process, Mt is a cylindrical square integrable mar- tingale, L andM are linear adapted operators, and ϕ, f , and g are adapted input functions. By virtue of Theorems 2.9 and 2.10 in [Gyö82], under some suitable conditions on the data φ, f and g, if L satisfies a growth assumption and L andM satisfy a coercivity condition in the triple (Hα+µ,Hα,Hα−µ), then there exists a unique solution (ut)t≤T of (4.1.1) that is strongly càdlàg in Hα and belongs to L2(Ω × [0,T ],OT , dVtdP; Hα+µ), where OT is the optional sigma-algebra on Ω × [0,T ]. In this chapter, under a weaker assumption than coercivity (see Assumption 4.2.1(α, µ)) and using the method of vanishing viscosity, we prove that there exists a unique solution (ut)t≤T of (4.1.1) that is strongly càdlàg in Hα ′ for all α′ < α and belongs to L2(Ω × [0,T ], dVtdP; Hα). Furthermore, under some additional assumptions on the operators L andM we can show that the solution u is weakly càdlàg in Hα. The variational theory of deterministic degenerate linear elliptic and parabolic PDEs was established by O.A. Oleinik and E.V. Radkevich in [Ole65] and [OR71]. In [Par75], É. Pardoux developed the variational theory of monotone stochastic evolution equations, which was extended in [KR77, KR79, GK81, Gyö82] by N.V. Krylov, B.L Rozovskiı˘, and I. Gyöngy. Degenerate parabolic stochastic partial differential equations (SPDEs) driven by continuous noise were first investigated by N.V. Krylov and B.L. Rozovskiı˘ in [KR82]. These types of equations arise in the theory of non-linear filtering of continuous diffusion processes as the Zakai equation and as equations governing the inverse flow of continuous 89 90 Chapter 4. The L2-Sobolev theory for parabolic SIDEs diffusions. In [GGK14], the solvability of systems of linear SPDEs in Sobolev spaces was proved by M. Gerencsér, I. Gyöngy, and N.V. Krylov, and a small gap in the proof of the main result of [KR82] was fixed. In Chapters 2, 3, and 4 of [Roz90], B.L. Rozovskiı˘ offers a unified presentation and extension of earlier results on the variational framework of linear stochastic evolution systems and SPDEs driven by continuous martingales (e.g. [Par75, KR77, KR79, KR82]). Our existence and uniqueness result on degenerate linear stochastic evolution equations driven by jump processes (Theorem 4.3.2) extends Theorem 2 in Chapter-3-Section 2.2 of [Roz90] to include the important case of equations driven by jump processes. It is also worth mentioning that the semigroup approach for non- degenerate SPDEs driven by Lévy processes is well-studied (see, e.g. [PZ07, PZ13]). As a special case of (4.1.1), we will consider a system of SIDEs. Before introducing the equation, let us describe our driving processes. Let η(dt, dz) be an integer-valued random measure on (R+ × Z,B(R+) ⊗ Z) with predictable compensator πt(dz)dVt. Let η˜(dt, dz) = η(dt, dz) − πt(dz)dt be the martingale measure corresponding to η(dt, dz). Let (Z2,Z2) be a measurable space with RT -measurable family π2t (dz) of sigma-finite random measures on Z. Let wϱt , t ≥ 0, ϱ ∈ N, be a sequence of continuous local uncorrelated martingales such that d⟨wϱ⟩t = dVt, for all ρ ∈ N. Let d1, d2 ∈ N. For convenience, we set (Z1,Z1) = (Z,Z) and π1t = πt. We consider the d2-dimensional system of SIDEs on [0,T ] × Rd1 given by dult = ( (L1;lt +L 2;l t )ut + b i t∂iu l t + c ll¯ t u l¯ t(x) + f l t ) dVt + (N lϱ t ut + g lϱ t )dw ϱ t (4.1.2) + ∫ Z1 ( Ilt,zu l¯ t− + h l t(z) ) η˜(dt, dz), ul0 = φ l, l ∈ {1, . . . , d2}, where for k ∈ {1, 2}, l ∈ {1, . . . , d2}, and ϕ ∈ C∞c (R d1; Rd2), Lk;lt ϕ(x) := 1 2 σ k;iϱ t (x)σ k; jϱ t (x)∂i jϕ l(x) + σk;iϱt (x)υ k;ll¯ϱ t (x)∂iϕ l¯(x) ∫ Zk (( δll¯ + ρ k;ll¯ t (x, z) ) ( ϕl¯(x + ζk(x, z)) − ϕl¯(x) ) − ζk;it (x, z)∂iϕ l(x) ) πkt (dz) N lϱt ϕ(x) := σ 1;iϱ t (x)∂iϕ l(x) + υ1;ll¯ϱt (x)ϕ l¯(x), ϱ ∈ N, Ilt,zϕ(x) := (δll¯ + ρ 1;ll¯ t (x, z))ϕ l¯(x + ζ1t (x, z)) − ϕ l(x), and where δll¯ is the Kronecker delta (i.e. δll¯ = 1 if l = l¯ and δll¯ = 0 otherwise). The summation convention with respect to repeated indices is used here and below; summation over i is performed over the set {1, . . . , d1} and the summation over l, l¯ is performed over the set {1, . . . , d2}. Without the noise term η˜(dt, dz) and integro-differential operators in L1 and L2, equation (4.1.2) has been well-studied (see, e.g. [KR82], [Roz90] (Chapter 3), 4.2. Degenerate linear stochastic evolution equations 91 and the recent paper [GGK14]). The choice to use the notation ζk(x, z) rather than the traditional Hk(x, z) (as is used above) in this chapter due to the conflict of notation with our spaces. Let (Hα(Rd1; Rd2))α∈R be the L2-Sobolev-scale. For each m ∈ N, using our theorem on degenerate stochastic evolution equations discussed above, under suitable measurability and regularity conditions on the coefficients, initial condition, and free terms, we derive the existence of a unique solution (ut)t≤T of (4.1.2) that is weakly càdlàg in Hm(Rd1; Rd2), strongly càdlàg in Hα(Rd1; Rd2) for all α < m, and belongs to L2(Ω × [0,T ],OT , dVtdP; Hm(Rd1; Rd2)). Degenerate SIDEs of type (4.1.2) arise in the theory of non-linear filtering of semi- martingales as the Zakai equation and as the equations governing the inverse flow of jump diffusion processes. We constructed solutions of the above equation (with πt(dz) determin- istic and independent of time) using the method This chapter is organized as follows. We derive our existence and uniqueness result for (4.1.1) in Section 4.2 and for (4.1.2) in Section 4.3. 4.2 Degenerate linear stochastic evolution equations 4.2.1 Basic notation and definitions All vector spaces considered in this paper are assumed to have base field R. We also assume that all Hilbert spaces are separable. For a Hilbert space H, we denote by H∗ the dual of H and by B(H) the Borel sigma-algebra of H. Unless otherwise stated, the norm and inner product of a Hilbert space H are denoted by | · |H and (·, ·)H, respectively. For Hilbert spaces H and U and a bounded linear map L : H → U, we denote by L∗ the Hilbert adjoint of L. Whenever we say that a map F from a sigma-finite measure space (S ,S, µ) to a Hilbert space H is S-measurable without specifying the sigma-algebra on H, we always mean that F is S/B(H)-measurable. For any Hilbert space H and sigma-finite measure space (S ,S, ν), we denote by L2(S ,S, µ; H) the linear space of all S-measurable functions F : S → H such that |F|L2(S ,S,ν;H) = ∫ S |F(s)|2Hν(ds) < ∞, where we identify functions F,G : S → H that are equal µ-almost-everywhere (ν-a.e.). The linear space L2(S ,S, ν; H) is a Hilbert space when endowed with the inner product (F,G)L2(S ,S,ν;H) := ∫ S (F(s),G(s))Hν(ds). We use the notation N = N(·, · · · , ·) below to denote a positive constant depending only on the quantities appearing in the parentheses. In a given context, the same letter is often 92 Chapter 4. The L2-Sobolev theory for parabolic SIDEs used to denote different constants depending on the same parameter. All the stochastic processes considered below are (at least) F-adapted unless explicitly stated otherwise. Furthermore, we will often drop the dependence on ω ∈ Ω for random quantities. In this section, we consider a scale of Hilbert spaces (Hα)α∈R and a family of operators (Λα)α∈R satisfying the following properties: • for all α, β ∈ R with β > α, Hβ is densely embedded in Hα; • for all α, β, µ ∈ R with α < β < µ and all ε > 0, there is a constant N = N(α, β, µ, ε) such that |v|β ≤ ε|v|µ + N|v|α, ∀v ∈ Hµ; (4.2.1) • Λ0 = I; for all α, µ ∈ R, Λα : Hµ → Hµ−α is an isomorphism; for all α, β ∈ R, Λα+β = ΛαΛβ; • for all α ∈ R, the inner product in Hα is given by (·, ·)α = (Λα·,Λα·)0; • for all α > 0, the dual (Hα)∗ can be identified with H−α through the duality product given by ⟨u, v⟩α = ⟨u, v⟩Hα,H−α = ( Λαu,Λ−αv ) 0 , u ∈ H α, v ∈ H−α; • We assume that for every α ≥ 0, Λα is selfadjoint as an unbounded operator in H0 with domain Hα ⊆ H0: i.e. (Λαu, v)0 = (u,Λαv)0 for all u, v ∈ Hα. Remark 4.2.1. It follows from the above properties that for all α ∈ R, the Hα norm is given by |v|α = |Λαv|0, Λα is defined and linear on ∪β∈RHβ, Λ−α = (Λα) −1, and ΛαΛβ = ΛβΛα, for all β ∈ R. Moreover, for each α ≥ 0, if u ∈ Hα and v ∈ H0, then ⟨u, v⟩α = (u, v)0. We will now describe our driving cylindrical martingale (Mt)t≥0 in (4.1.1) ) and the associated stochastic integral. For a more thorough exposition, we refer to [MR99]. Let E be a locally convex quasi-complete topological vector space; all bounded closed subsets of E are complete. Let E∗ be its topological dual. Denote by ⟨·, ·⟩E∗,E the canonical bilinear form (duality product) on E∗ × E. Assume that E∗ is weakly separable. Denote by L+(E) the space of symmetric non-negative definite forms Q from E∗ to E; that is, for all Q ∈ L+(E), we have ⟨x,Qy⟩E∗,E = ⟨y,Qx⟩E∗,E, and ⟨x,Qx⟩E∗,E ≥ 0, ∀x, y ∈ E∗. Recall that PT is the predictable sigma-algebra on Ω × [0,T ]. We say that a process Q : Ω×[0,T ]→ L+(E) isPT -measurable if ⟨y,Qtx⟩E∗,E isPT -measurable for all x, y ∈ E∗. 4.2. Degenerate linear stochastic evolution equations 93 Assume that we are given a family of real-valued locally square integrable martingales M = (Mt(y))y∈E∗)t≥0 indexed by E∗ and an increasing PT -measurable process Q : Ω × [0,T ]→ L+(E) such that for all x, y ∈ E∗, Mt(x)Mt(y) − ∫ t 0 ⟨x,Qsy⟩E∗,EdVs, t ≥ 0, is a local martingale. For each (ω, t) ∈ Ω × [0,T ], letHt = Hω,t be the Hilbert subspace of E defined as the completion of Qω,tE∗ with respect to the inner product ( Qω,tx,Qω,ty ) Hω,t := ⟨x,Qω,ty⟩, x, y ∈ E ∗. It can be shown that for all (ω, t) ∈ Ω × [0,T ], E∗ is densely embedded into H∗t , the map Qt : E∗ → E can be extended to the Riesz isometry Qt : H∗t → Ht (still denoted Qt), and the bilinear form ⟨x,Qty⟩E∗,E, x, y ∈ E∗, can be extended to ⟨x,Qty⟩H∗t ,Ht , x, y ∈ H ∗ t . Note that for all x, y ∈ H∗t , we have (x, y)H∗t = ⟨x,Qty⟩H∗t ,Ht . Let Lˆ2loc (Q) the space of all processes f such that ft ∈ H ∗ t , dVtdP-a.e., ⟨ ft,Qty⟩H∗t ,Ht is PT -measurable for all y ∈ E∗, and P-a.s. ∫ T 0 | ft|2H∗t dVt = ∫ T 0 ⟨ ft,Qt ft⟩H∗t ,HtdVt < ∞. In [MR99], the stochastic integral of f ∈ Lˆ2loc(Q) against M, denoted It( f ) = ∫ t 0 fsdMs, t ≥ 0, was constructed and has the following properties: (It( f ))t≥0 is a locally square integrable martingale and P-a.s. for all t ∈ [0,T ]: • for all y ∈ E∗, It(y) = ∫ t 0 ydMs = Mt(y) (recall that E∗ is embedded into allH∗s ); • for all g ∈ Lˆ2loc(Q). ⟨I( f ),I(g)⟩t = ∫ t 0 ( fs, gs)H∗s dVs = ∫ t 0 ⟨ fs,Qsgs⟩H∗s ,HsdVs; • for all bounded PT -measurable processes ϕ : Ω × [0,T ]→ R, ∫ t 0 ϕsdIs( f ) = It(ϕ f ) = ∫ t 0 ϕs fsdMs. For a Hilbert space H and (ω, t) ∈ Ω × [0,T ], denote by L2(H,H∗t ) the space of all 94 Chapter 4. The L2-Sobolev theory for parabolic SIDEs Hilbert-Schmidt operators Ψ : H → H∗t with norm and inner product given by |Ψ|2L2(H,H∗t ) := ∞∑ n=1 |Ψhn|2H∗t , (Ψ, Ψ˜)L2(H,H∗t ) = ∞∑ n=1 (Ψhn, Ψ˜hn)H∗t , Ψ˜ ∈ L2(H,H ∗ t ), where (hn)n∈N is a complete orthogonal system in H. Denote by L 2 loc(H,Q) the space of all processes Ψ such that Ψt ∈ L2(H,H∗t ), dVtdP-a.e., Ψth ∈ Lˆ 2(Q), for each h ∈ H,, and P-a.s. ∫ T 0 |Ψt|2L2(H,H∗t )dVt < ∞. For each Ψ ∈ L2loc (H,Q), we define the stochastic integral It(Ψ) = ∫ t 0 ΨsdMs as the unique H-valued càdlàg locally square integrable martingale such that P-a.s. for all t ∈ [0,T ] and h ∈ H, (It(Ψ), h)H = ∫ t 0 ΨshdMs. For all Ψ, Ψ˜ ∈ L2loc (H,Q) , we have that |I· (Ψ) |2H − ∫ · 0 |Ψs|2L2(H,H∗s )dVs and (I·(Ψ),I·(Ψ˜))H − ∫ · 0 (Ψs, Ψ˜s)L2(H,H∗s )dVs are real-valued local martingales. Moreover, for all bounded PT -measurable H-valued processes u : Ω × [0,T ]→ H, P-a.s. for all t ∈ [0,T ], ∫ t 0 usdIs(Ψ) = ∫ t 0 {usΨs}HdMs, where for a complete orthogonal system ( e˜ns ) n∈N inH ∗ s , {usΨs}H := ∞∑ n=1 ( Ψsus, e˜ n s ) H∗s e˜ns . If H and Y are Hilbert spaces and L : H → Y is a bounded linear operator and Ψ ∈ L2loc(H,Q), then it follows that LIt(Ψ) = It(ΨL ∗); indeed, for all y ∈ Y , we have (LIt(Ψ), y)Y = (It(Ψ), L ∗y)H = ∫ t 0 ΨsL ∗ydMs and ΨL∗ ∈ L2loc(Y,Q). 4.2. Degenerate linear stochastic evolution equations 95 4.2.2 Statement of main results In this section, for µ ∈ (0, 1], we consider the linear stochastic evolution equation in the triple (H−µ,H0,Hµ) given by dut = (Ltut + ft) dVt + (Mtut− + gt) dMt, t ≤ T, (4.2.2) u0 = φ, where φ is an F0-measurable H0-valued random variable and Vt is a continuous non- decreasing process such that Vt ≤ C for all (ω, t) ∈ Ω × [0,T ], for some positive constant C. Let α ≥ µ be given. We assume that: (1) the mapping L : Ω × [0,T ] × Hµ → H−µ is linear in Hµ, and for all v ∈ Hµ, Lv is RT/B(H−µ)-measurable; in addition, dVtdP-a.e., Ltv ∈ Hα−µ for all v ∈ Hα+µ; (2) for dVtdP-almost-all (ω, t) ∈ Ω × [0,T ],Mω,t : Hµ → L2(H0,H∗ω,t) is linear, and for all v ∈ Hµ, ϕ ∈ H0, y′ ∈ E∗, ⟨(Mv)ϕ,Qty′⟩H∗t ,Ht is PT -measurable for all y ′ ∈ E∗; in addition, dVtdP-a.e,Mtv ∈ L2(Hα,H∗t ) for all v ∈ H α+µ; (3) the process f : Ω × [0,T ] → Hα−µ is RT/B(Hα−µ)-measurable and g ∈ L2loc (H α,Q) ∩ L2loc(H 0,Q), Let us introduce the following assumption for λ ∈ {0, α}. Recall that (u, v)λ = ( Λλu,Λλv ) 0 . Assumption 4.2.1 (λ, µ). There are positive constants L and K and an RT -measurable function f¯ : Ω × [0,T ]→ R such that the following conditions hold dVtdP-a.e.: (1) for all v ∈ Hλ+µ, 2(Λµv,Λ−µLtv)λ + |Mtv|2L2(Hλ,H∗t ) ≤ L|v| 2 λ; 2(Λµv,Λ−µt Ltv + ft)λ + |Mtv + gt| 2 L2(Hλ,H∗t ) ≤ L|v|2λ + f¯t; (2) for all v ∈ Hλ+µ, |Ltv|λ−µ ≤ K|v|λ+µ, |Mtv|L2(Hλ,H∗t ) ≤ K|v|λ+µ; (3) | ft|2λ−µ + |gt| 2 L2(Hλ,H∗t ) ≤ f¯t, E ∫ T 0 f¯tdVt < ∞. Let OT be the optional sigma-algebra on Ω × [0,T ]. For µ ∈ (0, 1]) and λ ∈ R+ with λ ∈ {0, α}, we denote byWλ,µ the space of all Hλ-valued strongly càdlàg processes v : Ω × [0,T ] → Hλ that belong to L2(Ω × [0,T ],OT , dVtdP; Hλ+µ). The following is our definition of the solution of (4.2.2) and is standard in the variational theory or L2-theory of stochastic evolution equations. 96 Chapter 4. The L2-Sobolev theory for parabolic SIDEs Definition 4.2.2. A process u ∈ W0,µ is said to be a solution of the stochastic evolution equation (4.2.2) if P-a.s. for all t ∈ [0,T ] ut H−µ = u0 + ∫ t 0 (Lsus + fs)dVs + ∫ t 0 (Msus− + gs)dMs, where H−µ = indicates that the equality holds in the H−µ. That is, P-a.s. for all t ∈ [0,T ] and v ∈ Hµ, (v, ut)0 = (v, u0) + ∫ t 0 ⟨v,Lsus + fs⟩µdVs + ∫ t 0 {v(Msus− + gs)}H0dMs. Remark 4.2.3. In Definition 4.2.2, it is implicitly assumed that the integrals in (4.2.2) are well-defined. Moreover, it is easy to check that if Assumption 4.2.1(0, µ) holds, then the integrals in Definition 4.2.2 are well-defined. In order to obtain estimates of the second moments of the supremum in t of the solution of (4.2.2), in the Hα norm, we will need to impose the upcoming assumption. Before introducing this assumption, we describe a few notational conventions. For two real- valued semimartingales Xt and Yt, we write P-a.s. dXt ≤ dYt if with probability 1, Xt−Xs ≤ Yt − Ys for any 0 ≤ s ≤ t ≤ T . For v ∈ Wλ,µ, we define Mt (v) := ∫ t 0 MsvsdMs, t ∈ [0,T ], and denote by [M(v)]λ;t the quadratic variation process of Mt (v) in H λ. Assumption 4.2.2 (λ, µ). There is a positive constant L, a PT -measurable function g¯ : Ω× [0,T ]→ R, and increasing adapted processes A, B : Ω×[0,T ]→ R with dAtdP ≤LdVtdP, dBtdP ≤g¯tdVtdP on PT such that the following conditions hold P-a.s.: (1) for all v ∈ Wλ,µ, (Λµvt,Λ −µLtvt)λdVt + d [M(v)]λ;t + 2{vt−Mtvt−}HλdMt ≤ |vt−| 2 λdAt + Gt(v)dMt, where G(v) ∈ Lˆ2loc (Q) satisfies |Gt(v)|H∗t dVt ≤ L|vt−| 2 λ dVt; (2) for all v ∈ Wλ,µ, 2d [ M (v) ,I (g) ] λ,t + 2{vt−gt}HλdMt ≤ |vt−|λdBt + G˜t (v) dMt, where G¯(v) ∈ Lˆ2loc (Q) satisfies |G¯t(v)|H∗t dVt ≤ L|vt−|λg¯tdVt, and E ∫ T 0 g¯2t dVt < ∞. 4.2. Degenerate linear stochastic evolution equations 97 Although Assumption 4.2.2(λ, µ) looks rather technical, it is satisfied for a large class of parabolic stochastic integro-differential equations (see Section 4.3) under what we con- sider to be reasonable assumptions. Let T be the set of all stopping times τ ≤ T and T p be the set of all predictable stopping times τ ≤ T . Theorem 4.2.4. Let µ ∈ (0, 1] and α ≥ µ. Let Assumption 4.2.1(λ, µ) hold for λ ∈ {0, α} and assume that E [ |φ|2α ] < ∞. (1) Then there exists a unique solution u = (ut)t≤T of (4.2.2) such that for any α′ < α, u is an Hα ′ -valued strongly càdlàg process and there is a constant N = N(L,K,C) such that E [ sup t≤T |ut|2α−µ ] + sup τ∈T E [ |uτ|2α ] + E ∫ T 0 |us|2α dVs ≤ N ( E [ |φ|2α ] + E ∫ T 0 f¯tdVt ) . Moreover, for all p ∈ (0, 2) and α′ < α, there is a constant N = N(L,K,C, p, α′) such that E [ sup t≤T |ut| p α′ ] ≤ NE   ( |φ|2α + ∫ T 0 f¯tdVt ) p 2   . (2) If, in addition, Assumption 4.2.2(λ, µ) holds for λ ∈ {0, α}, then u is an Hα-valued weakly càdlàg process and there is a constant N = N(L,K,C) such that E [ sup t≤T |ut|2α ] ≤ NE [ |φ|2α + ∫ T 0 ( f¯t + g¯ 2 t )dVt ] . Remark 4.2.5. If V is an arbitrary continuous increasing adapted process, then Theo- rem 4.2.4 can be applied locally by considering VCt = Vt∧τC , t ∈ [0,T ], with τC = inf (t ∈ [0,T ] : Vt ≥ C) ∧ T. 4.2.3 Proof of Theorem 4.2.4 We will construct a sequence of approximations in Wα,µT of the solution of (4.2.2) by solving in the triple (H−µ,H0,Hµ) the equation dut = ( Lnt ut + ft ) dVt + (Mtut− + gt) dMt, t ≤ T, (4.2.3) u0 = φ, where Lnt = Lt − 1 n (Λ µ)2. In order to apply the foundational theorems on stochastic evolu- tion equations with jumps established in [GK81] and [Gyö82], it is convenient for us first 98 Chapter 4. The L2-Sobolev theory for parabolic SIDEs to consider the following equation in the triple (H−µ,H0,Hµ) : dvt = ( ΛαLtΛ−αvt − 1 n (Λµ)2vt + Λ α ft ) dVt + ( MtΛ−αvt−(Λα)∗ + gt(Λα)∗ ) dMt, t ≤ T,(4.2.4) v0 = Λ αφ. The solutions of (4.2.3) and (4.2.4) are to be understood following Definition 4.2.2. Lemma 4.2.6. Let µ ∈ (0, 1] and α ≥ µ. Let Assumption 4.2.1(α, µ) hold and assume that E [ |φ|2α ] < ∞. (1) For each n ∈ N, there is a unique solution vn = (vnt )t≤T of (4.2.3), and there is a constant N = N(L,K,C) independent of n such that sup τ∈T E [ |vnτ| 2 0 ] + E ∫ T 0 |vnt | 2 0dVt + 1 n E ∫ T 0 |vnt | 2 µdVt ≤ NE [ |φ|2α + ∫ T 0 f¯tdVt ] . (4.2.5) Moreover, for all p ∈ (0, 2), there is a constant N = N(L,K,T, p) E [ sup t≤T |vnt | p 0 ] ≤ NE   ( |φ|2α + ∫ T 0 f¯tdVt ) p 2   . (4.2.6) (2) If, in addition, Assumption 4.2.2(α, µ) holds, then there is a constant N = N(L,K,C) such that E [ sup t≤T |vnt | 2 0 ] ≤ NE [ |φ|2α + ∫ T 0 ( f¯t + g¯ 2 t )dVt ] . (4.2.7) Proof. (1) For each (ω, t) ∈ Ω × [0,T ] and n ∈ N, let Lαt v = Λ αLtΛ−αv, L α,n t v = L α t v − 1 n (Λµ)2 v, Mαt v =MtΛ −αv(Λα)∗. Using basic properties of the spaces (Hα)α∈R and the operators (Λα)α∈R, dVtdP-a.e. for all v ∈ Hµ, we have 2⟨v,Lαt v⟩µ = 2(Λ µv,Λ−µΛαLtΛ−αv)0 = 2(ΛµΛ−αv,Λ−µLtΛ−αv)α, 2⟨v, (Λµ)2v⟩µ = 2(Λµv,Λµv)0 = |v|2µ, and ∣∣∣Mαt v ∣∣∣ 2 L2(H0,H∗t ) = ∞∑ k=1 |Λα(MtΛ−αv)∗e˜nt | 2 H0 = ∞∑ k=1 |(MtΛ−αv)∗e˜nt | 2 Hα = ∞∑ k=1 |MtΛ−αvh¯k|2H∗t = ∣∣∣Mαt v ∣∣∣ 2 L2(Hα,H∗t ) , 4.2. Degenerate linear stochastic evolution equations 99 where (e˜kt )k∈N,and (h¯ k)k∈N are orthonormal basis ofHt and Hα, respectively. It follows from Assumption 4.2.1(α, µ) that dVtdP-a.e. for all v ∈ Hµ, we have |Lα,nt v|−µ ≤ ( K + 1 n ) |v|µ, ∣∣∣Mαt v ∣∣∣ L2(H0,H∗t ) ≤ K|v|µ, and 2⟨v,Lα,nt v + Λ α ft⟩µ + ∣∣∣Mαt v + gt(Λ α)∗ ∣∣∣ 2 L2(H0,H∗t ) ≤ − 2 n |v|2µ + L|v| 2 0 + f¯t. (4.2.8) In [Gyö82], the variational theory for monotone stochastic evolution equations driven by locally square integrable Hilbert-space-valued martingales was derived; it is worth mentioning that the càdlàg version of the variational solution in the pivot space and the uniqueness of the solution was obtained using Theorem 2 in [GK81]. The theorems and proofs given in [Gyö82] continue to hold for equations driven by the cylindrical martin- gales we consider in this paper. Therefore, by Theorems 2.9 and 2.10 in [Gyö82], for every n ∈ N, there exists a unique solution vn = (vnt )t≤T of the stochastic evolution equation given by vnt = Λ αφ + ∫ t 0 (Lα,ns v n s + Λ α fs)dVs + ∫ t 0 (Mαs v n s− + Λ αgs)dMs, t ≤ T. Furthermore, by virtue of Theorem 4.1 in [Gyö82], there is a constant N(n) = N(n, L, K,C) such that E [ sup t≤T |vnt | 2 0 ] + E ∫ T 0 |vnt | 2 µdVt ≤ N(n)E [ |φ|2α + ∫ T 0 f¯tdVt ] . (4.2.9) We will now use our assumptions to obtain bounds of the solutions vnt , n ∈ N, in the H0-norm independent of n. Applying Theorem 2 in [GK81], P-a.s. for all t ∈ [0,T ], we have |vnt | 2 0 = |φ| 2 α + ∫ t 0 2⟨vns ,L α,n t v n s + Λ α fs⟩µdVs + [ M˜ ] 0,t + mt, (4.2.10) where (M˜t)t≤T and (mt)t≤T are local martingales given by M˜t := ∫ t 0 (Mαs v n s− + Λ αgs)dMs, mt := 2 ∫ t 0 {vns−(M α s v n s− + Λ αgs)}H0dMs. Thus, taking the expectation of (4.2.10) and making use of Assumption 4.2.1(α, µ), (4.2.8), and (4.2.9), we find that for all τ ∈ T , E [ |vnτ| 2 0 ] ≤ E [ |φ|2α ] + E ∫ τ 0 ( 2⟨vnt ,L α,n t v n t + Λ α ft⟩µ + |Mαt v + Λ αgt|2L2(H0,Ht) ) dVt ≤ E [ |φ|2α + ∫ τ 0 ( L|vnt | 2 0 − 2 n |vnt | 2 µ + f¯t ) dVt ] . 100 Chapter 4. The L2-Sobolev theory for parabolic SIDEs This implies that for any τ ∈ T p, E [ |vnτ−| 2 0 ] + 2 n E ∫ τ 0 |vnt | 2 µdVt ≤ E [ |φ|2α + ∫ τ 0 ( L|vns | 2 0 + f¯s ) dVs ] . By virtue of Lemmas 2 and 3 in [GM83], we deduce that there is a constant N = N(L,K,C) such that for any τ ∈ T , E [ |vnτ| 2 0 ] + 1 n E ∫ τ 0 |vnt | 2 µdVt ≤ NE [ |φ|2α + ∫ τ 0 f¯sdVs ] , which implies that (4.2.5) holds since Vt is uniformly bounded by the constant C. Finally, owing to Corollary II in [Len77], we have (4.2.6). (2) Using Assumption 4.2.2(α, µ) and estimating (4.2.10), we get that P-a.s. d|vnt | 2 0 ≤ |φ| 2 α + |v n t−| 2 λdAt + |v n t−|λdBt + (Gt(v n) + G¯t (v n))dMt. Moreover, for any τ ∈ T , we obtain E [ sup t≤τ |vnt | 2 0 ] ≤ NE [ |φ|2α + ∫ τ 0 |vnt | 2 0dVt + ∫ τ 0 ( f¯t + g¯ 2 t )dVt + sup t≤τ |lnt | ] , where lnt := ∫ t 0 (Gs(vn)+G¯s(vn))dMs.Moreover, by the Burkholder-Davis-Gundy inequality and Young’s inequality, we have E sup t≤τ |lnt | ≤ NE   (∫ τ 0 (|vnt−| 4 α + |v n t−| 2 αg¯ 2 t )dVt ) 1 2   ≤ NE  sup t≤τ |vnt |0 (∫ τ 0 (|vnt−| 2 α + g¯ 2 t )dVt ) 1 2   ≤ 1 4N E [ sup t≤τ |vnt | 2 0 ] + NE ∫ τ 0 (|vnt | 2 α + g¯ 2 t )dVt, from which estimate (4.2.7) follows. □ Proof of Theorem 4.2.4 . (1) Let vn = (vnt )t≤T be the unique solution of (4.2.4) constructed in Lemma 4.2.6. Since vn ∈ W0,µ, it follows that un := Λ−αvn ∈ Wα,µ ⊆ W0,µ is a solution of (4.2.3) in the triple (H−µ,H0,Hµ). We will first show that (un)n∈N is Cauchy in H0. Letting un,m = un−um, for all n,m ∈ N, we have un,mt = ∫ t 0 (Lnt u n t − L m t u m t )dVt + ∫ t 0 Mtun,mdMt, t ≤ T. Applying Theorem 2 in [GK81], we obtain that P-a.s. for all t ∈ [0,T ], |unt − u m t | 2 0 = ∫ t 0 2⟨uns − u m s ,L n su n s − L m s u m s ⟩µ,0dVs + [M n,m]t + η n,m t (4.2.11) 4.2. Degenerate linear stochastic evolution equations 101 where (Mn,mt )t≤T and (η n,m t )t≤T are local martingales given by Mn,mt := ∫ t 0 Ms(uns− − u m s−)dMs, η m,n t := 2 ∫ t 0 {(uns− − u m s−)Ms(u n s− − u m s−)}H0dMs. Assumption 4.2.1(0, µ) (1) implies that for any τ ∈ T , E [ |un,mτ | 2 0 ] ≤ E ∫ τ 0 ( 2⟨un,ms ,L n su n,m s ⟩µ + |Mtu n,m s | 2 L2(H0,H∗t ) ) dVs ≤ E ∫ τ 0 ( L|un,ms | 2 0 + 1 n |uns | 2 µ + 1 m |ums | 2 µ ) dVs, and hence for any τ ∈ T p, we have E [ |un.mτ− | 2 0 ] ≤ E ∫ τ 0 ( L|un.ms | 2 0 + 1 n |uns | 2 µ + 1 m |ums | 2 µ ) dVs. By virtue of Lemmas 2 and 4 in [GM83] and (4.2.5) (noting that |unt |0 = |Λ −αvnt |0 ≤ N|v n t |0), there is a constant N such that for any τ ∈ T , E [ |un,mτ | 2 0 ] ≤ NE ∫ τ 0 ( 1 n |uns | 2 µ + 1 m |ums | 2 µ ) dVs ≤ N ( 1 n + 1 m ) E [ |φ|2α + ∫ T 0 f¯tdVt ] . Using Corollary II in [Len77], we have that for all p ∈ (0, 2), there is a constant N such that E [ sup t≤T |un,mt | p 0 ] ≤ N ( 1 n + 1 m ) p 2 [ E [ |φ|2α + ∫ T 0 f¯tdVt ]] p 2 . Therefore, lim n,m→∞ [ sup τ∈T E [ |un,mτ | 2 0 ] + E [ sup t≤T |un,mt | p 0 ]] = 0, (4.2.12) and hence there exists a strongly càdlàg H0-valued process u = (ut)t≤T such that dP − lim n→∞ sup t≤T |ut − unt |0 = 0. (4.2.13) Since for all n, un is solution of (4.2.3), we have that P-a.s. for all t ∈ [0,T ] and ϕ ∈ Hµ, (ϕ, unt )0 = (ϕ, φ)0 + ∫ t 0 ⟨ϕ,Lnsu n s + fs⟩µ,0dVs + ∫ t 0 {ϕ(Msuns− + gs)}H0dMs. (4.2.14) Owing to (4.2.13), we know that the left-hand-side of (4.2.14) converges P-a.s. for all t ∈ [0,T ] to (ϕ, ut)0 as n tends to infinity. Our aim, of course, is to pass to the limit as n tends to infinity on the right-hand-side. 102 Chapter 4. The L2-Sobolev theory for parabolic SIDEs This can be done quite simply when α > µ. Indeed, owing to the interpolation inequal- ity (4.2.1), for all ε > 0, α′ < α, and p ∈ (0, 2), there is a constant N = N(α, α′, ε, p) such that sup t≤T |un,mt | p α′ ≤ ε sup t≤T |un,mt | p α + N sup t≤T |un,mt | p 0 . (4.2.15) Since |unt |α = |Λ −αvnt |α ≤ N |v n t |0, by (4.2.7) and (4.2.15), we have that for all α ′ < α and p ∈ (0, 2), E [ sup t≤T |un,mt | p α′ ] ≤ εNE   ( |φ|2α + ∫ T 0 f¯tdVt ) p 2   + NE [ sup t≤T |un,mt | p 0 ] . (4.2.16) Using (4.2.12) and passing to the limit as n and m tend to infinity on both sides of (4.2.16), and then taking ε ↓ 0, we get that for all α′ < α and p ∈ (0, 2), lim n,m→∞ E [ sup t≤T |un,mt | p α′ ] = 0. Combining the above results, we conclude that for any α′ < α, u is an Hα ′ -valued strongly càdlàg process and dP − lim n→∞ sup t≤T |ut − unt |α′ = 0. (4.2.17) In particular, if α > µ, then taking α′ > µ in (4.2.17) and appealing to Assumption 4.2.1(0, µ) (2), (4.2.5), and the identity, ⟨ϕ,Λ2µuns⟩µ = (Λ µϕ,Λµuns)0, we can take the limit as n tends to infinity on the right-hand-side of (4.2.14) by the domi- nated convergence theorem to conclude that u is a solution of (4.2.2). The case α = µ must be handled with weak convergence. Let S (OT ) = (Ω × [0,T ],OT , dV¯tdP) and S (PT ) = (Ω × [0,T ],PT , dV¯tdP), where V¯t =: Vt +t. It follows from (4.2.5) that there exists a subsequence (unk)k∈N of (un)n∈N that converges weakly in L2(S (OT ); Hµ) to some u¯ ∈ L2(S (OT ); Hµ) which satisfies E ∫ ]0,T ] |u¯t|2µdV¯t ≤ NE [ |φ|2µ + ∫ T 0 f¯tdVt ] . For any ϕ ∈ H0 and bounded predictable process ξt, we have lim k→∞ E ∫ T 0 ξt⟨ϕ, u nk t ⟩µdV¯t = lim k→∞ E ∫ T 0 ξt(u nk t , ϕ)0dV¯t = E ∫ T 0 ξt(ut, ϕ)0dV¯t, 4.2. Degenerate linear stochastic evolution equations 103 and hence u = u¯ in L2(S (OT ); Hµ) and unk converges to u weakly in L2(S (OT ); Hµ) as k tends to infinity. Define unk− = (u nk t−)t≤T and u− = (ut−)t≤T , where the limits are taken in H0. By repeating the argument above, we conclude that unk− converges to u− weakly in L2(S (OT ); Hµ) as k tends to infinity. Fix ϕ ∈ Hµ and a PT -measurable process (ξt)t≤T bounded by the constant K. Define the linear functionals ΦL : L2(S (OT ); Hµ)→ R and ΦM : L2(S (PT ); Hµ)→ R by ΦL(v) = E ∫ ]0,T ] ξt ∫ ]0,t] ⟨ϕ,Lsvs⟩µ,0dVsdV¯t, ∀v ∈ L2(S (OT ); Hµ) and ΦM(v) = E ∫ ]0,T ] ξt ∫ ]0,t] {ϕMsvs}H0dMsdV¯t, ∀v ∈ L 2(S (PT ); Hµ), respectively. Owing to Assumption 4.2.1(0, µ) (2), the Burkholder-Davis-Gundy inequal- ity, and the fact that (V¯t)t≤T is uniformly bounded by the constant C, there is a constant N = N(K,C) such that |ΦL(v)| ≤ N |ϕ|µ ( E ∫ T 0 |vt|2µdV¯t ) 1 2 , ∀v ∈ L2(S (OT ); Hµ), and |ΦM(v)| ≤ N|ϕ|µ ( E ∫ ]0,T ] |vs|2µdV¯s ) 1 2 , ∀v ∈ L2(S (PT ); Hµ). This implies that ΦL is a continuous linear functional on L2(S (OT ); Hµ) and ΦM is a con- tinuous linear functional on L2(S (PT ); Hµ), and hence that lim k→∞ ΦL(unk) = ΦL(u), lim k→∞ ΦL(unk− ) = Φ M(u−). (4.2.18) For each k, we have that E ∫ T 0 ξt(ϕ, u nk t )0dV¯t = E ∫ T 0 ξt(ϕ, φ)0dV¯t + E ∫ T 0 ξt ∫ t 0 ⟨ϕ,Lnks u nk s + fs⟩µ,0dVsdV¯t(4.2.19) + E ∫ T 0 ξt ∫ t 0 {ϕ(Msuns− + gs)}H0dMsdV¯t. Passing to the limit as k tends to infinity on both sides of (4.2.19) using (4.2.18) and ⟨ϕ,Λ2µunks ⟩µ = (Λ µϕ,Λµunks )0, 104 Chapter 4. The L2-Sobolev theory for parabolic SIDEs we obtain that dV¯tdP-a.e. (ϕ, ut)0 = (ϕ, φ)0 + ∫ ]0,t] ⟨ϕ, (Lsus + fs)⟩µdVs + ∫ ]0,t] {ϕ(Msus− + gs)}H0dMs, t ≤ T. Therefore, for all α ≥ µ, u is a solution of (4.2.2). We will now show that sup τ∈T E [ |uτ|2α ] ≤ NE [ |φ|2α + ∫ T 0 f¯tdVt ] . (4.2.20) Let (hk)k∈N be a complete orthonormal basis in Hα such that the linear subspace spanned by (hk)k∈N is dense in H2α. Owing to (4.2.5), for all m ≥ 1 and τ ∈ T , E   m∑ k=1 |(unτ,Λ 2αhk)0|2   = E   m∑ k=1 |(unτ, h k)α|2   ≤ E [ |unτ| 2 α ] ≤ NE [ |φ|2α + ∫ T 0 f¯tdVt ] . Applying Fatou’s lemma first in n and then in m, we have that for all τ ∈ T , E   ∞∑ k=1 |(uτ,Λ2αhk)0|2   ≤ NE [ |φ|2α + ∫ T 0 f¯tdVt ] . Hence, for all t ∈ [0,T ], P-a.s. vt = ∑ k(ut,Λ 2αhk)0hk ∈ Hα. Since the linear subspace spanned by (Λ2αhk)k∈N is dense in H0 and for all t ∈ [0,T ], P-a.s., (ut − vt,Λ2αhk)0 = 0, for all k ∈ N, it follows that P-a.s. for all τ ∈ T , uτ = v and E [ |uτ|2α ] = E ∞∑ k=1 ∣∣∣(uτ,Λ 2αhk)0 ∣∣∣ 2 ≤ NE [ |φ|2α + ∫ T 0 f¯tdVt ] , which proves (4.2.20). Estimating (4.2.2) directly in the Hα−µ-norm, we easily derive that E [ sup t≤T |ut|2α−µ ] ≤ NE [ |φ|2α + ∫ T 0 f¯sdVt ] . If (vt)t≤T be another solution of (4.2.2), then by Theorem 2 in [GK81] and Assumption 4.2.1(0, µ)(1), P-a.s. for all t ∈ [0,T ], we have |ut − vt|20 ≤ L ∫ t 0 |us − vs|20dVs + mt, where (mt)t≤T is a local martingale with m0 = 0, and hence applying Lemmas 2 and 4 in 4.2. Degenerate linear stochastic evolution equations 105 [GM83], we get P ( sup t≤T |ut − vt|0 > 0 ) = 0, which implies that (ut)t≤T is the unique solution of (4.2.2). This completes the proof of part (1). (2) Estimating (4.2.11) using Assumption 4.2.2(0, µ), we get that P-a.s. d|un,mt | 2 0 ≤ |φ| 2 0 + ( 1 n + 1 m ) |un,m|20dVt + |u n,m t− | 2 λdAt + |u n,m t− |λdBt + (Gt(u n,m) + G¯t (u n,m))dMt. Then estimating the stochastic integrand as in the proof of part (2) of Lemma 4.2.6, for any τ ∈ T , we get E [ sup t≤τ |un.mt | 2 0 ] ≤ NE ∫ τ 0 ( |un.ms | 2 0 + 1 n |uns | 2 µ + 1 m |ums | 2 µ) ) dVs, and hence by Gronwall’s lemma and Lemma 4.2.6(2), E [ sup t≤τ |un.mt | 2 0 ] ≤ N ( 1 n + 1 m ) E [ |φ|2α + ∫ ]0,T ] ( f¯t + g¯ 2 t ) dVt ] Thus, lim n,m→∞ E [ sup t≤τ |un.mt | 2 0 ] = 0. Let (hk)k∈N be a complete orthonormal basis Hα such that the linear subspace spanned by (hk)k∈N is dense in H2α. Owing to (4.2.7), for all m ≥ 1 and τ ∈ T , E  sup t≤T m∑ k=1 (unt , h k)α   = E  sup t≤T m∑ k=1 |(unt ,Λ 2αhk)0|2   ≤ E [ sup t≤T |unt | 2 α ] ≤ NE [ |φ|2α + ∫ ]0,T ] ( f¯t + g¯ 2 t )dVt ] . Applying Fatou’s lemma first in n and then in m, we have that E  sup t≤T ∞∑ k=1 |(ut,Λ2αhk)0|2   ≤ NE [ |φ|2α + ∫ ]0,T ] ( f¯t + g¯ 2 t )dVt ] . Thus, v = ∑ k(u,Λ 2αhk)0hk is an Hα-valued weakly càdlàg process. Since the linear sub- space spanned on (Λ2αhk)k∈N is dense in H0 and ( ut − vt,Λ2αhk ) 0 = 0, for all k ∈ N, it follows that P-a.s. for all t ∈ [0,T ], ut = vt and E [ sup t≤T |ut|2α ] = E  sup t≤T ∞∑ k=1 |(ut,Λ2αhk)0|2   ≤ NE [ |φ|2α + ∫ ]0,T ] ( f¯t + g¯ 2 t )dVt ] . 106 Chapter 4. The L2-Sobolev theory for parabolic SIDEs □ 4.3 The L2-Sobolev theory for degenerate SIDEs 4.3.1 Statement of main results In this section, we consider the d2-dimensional system of SIDEs on [0,T ] × Rd1 given by dult = ( (L1;lt +L 2;l t )ut + b i t∂iu l t + c ll¯ t u l¯ t + f l t ) dVt + ( N lϱt ut + g lϱ t ) dwϱt (4.3.1) + ∫ Z1 ( Ilt,zu l¯ t− + h l t(z) ) η˜(dt, dz), ul0 = φ l, l ∈ {1, . . . , d2}, where for k ∈ {1, 2}, l ∈ {1, . . . , d2}, and ϕ ∈ C∞c (R d1; Rd2), Lk;lt ϕ(x) := 1 2 σ k;iϱ t (x)σ k; jϱ t (x)∂i jϕ l(x) + σk;iϱt (x)υ k;ll¯ϱ t (x)∂iϕ l¯(x) + ∫ Zk (( δll¯ + ρ k;ll¯ t (x, z) ) ( ϕl¯(x + ζkt (x, z)) − ϕ l¯(x) ) − ζk;it (x, z)∂iϕ l(x) ) πkt (dz) N lϱt ϕ(x) := σ 1;iϱ t (x)∂iϕ l(x) + υ1;ll¯ϱt (x)ϕ l¯(x), ϱ ∈ N, Ilt,zϕ(x) := ( δll¯ + ρ 1;ll¯ t (x, z) ) ϕl¯(x + ζ1t (x, z)) − ϕ l(x). We assume that σkt (x) = (σ k;iϱ ω,t (x))1≤i≤d1, ϱ∈N, bt(x) = (b i ω,t(x))1≤i≤d1 , ct(x) = (c ll¯ ω,t(x))1≤l,l¯≤d2 , υkt (x) = (υ k;ll¯ϱ ω,t (x))1≤l,l¯≤d2, ϱ∈N, ft(x) = ( f i ω,t(x))1≤i≤d2 , gt(x) = (g iϱ ω,t(x))1≤i≤d2, ϱ∈N are random fields on Ω × [0,T ] × Rd1 that are RT ⊗ B(Rd1)-measurable. Moreover, we assume that ζ1t (x, z) = (ζ 1;i ω,t(x, z))1≤i≤d1 , ρ 1 t (x, z) = (ρ 1;ll¯ ω,t (x, z))1≤l,l¯≤d2 , h i ω,t(x, z))1≤i≤d2 , are random fields on Ω × [0,T ] × Rd1 × Z1 that are PT ⊗ B(Rd1) ⊗Z1-measurable and ζ2t (x, z) = (ζ 2;i ω,t(x, z))1≤i≤d1 , ρ 2 t (x, z) = (ρ 2;ll¯ ω,t (x, z))1≤l,l¯≤d2 , are random fields on Ω× [0,T ]×Rd1 ×Z2 that are RT ⊗B(Rd1)⊗Z2-measurable. We also assume that there is a constant C such that Vt ≤ C for all (ω, t) ∈ Ω × [0,T ]. Let us introduce the following assumption for an integer m ≥ 1 and a real number α1 ∈ (1, 2). We remind the reader that the norms and seminorms | · |0 = [·]0 and | · |β, 4.3. The L2-Sobolev theory for degenerate SIDEs 107 β ∈ (0, 1], were defined in the beginning of Section 2.2. Let us introduce the following assumption for m ∈ N and a real number β ∈ [0, 2]. Assumption 4.3.1 (m, d2). Let N0 be a positive constant. (1) For all (ω, t, x) ∈ Ω × [0,T ] × Rd1 , the derivatives in x of the random fields bt, ct, σ2t , and υ2t up to order m and σ k t and υ k t , k ∈ {1, 2}, up to order m + 1 exist, and for all x ∈ Rd1 , max |γ|≤m ( |∂γbt(x)| + |∂γct(x)| + |∂γ∇σ1t (x)| + |∂ γσ2t (x)| + |∂ γ∇υ1t (x)| + |∂ γυ2t (x)| ) ≤ N0. (2) For each k ∈ {1, 2} and all (ω, t, x, z) ∈ Ω × [0,T ] × Rd1 × Zk, the derivatives in x of the random field ζkt (z) up to order m exist, and for all x ∈ R d1 , max |γ|≤m |∂γζ1t (x, z) | + max |γ|=m [ ∂γζ1t (·, z) ] β 2 ≤ K1t (z) , ∀z ∈ Z 1, max |γ|≤m |∂γζ2t (x, z) | ≤ K 2 t (z), ∀z ∈ Z 2, where K1t (resp. K 2 t ) are PT ⊗Z 1 (resp. PT ⊗Z1) -measurable processes satisfying sup z∈Zk Kkt (z) + ∫ Z1 K1t (z) βπ1t (dz) + ∫ Z2 K2t (z) 2π2(dz) ≤ N0. (3) There is a constant η < 1 such that for each k ∈ {1, 2} on the set all (ω, t, x, z) ∈ Ω × [0,T ] × Rd1 × Zk for which |∇ζkt (x, z)| > η, ∣∣∣∣ ( Id1 + ∇ζ k t (x, z) )−1∣∣∣∣ ≤ N0. (4) For each k ∈ {1, 2} and all (ω, t, x, z) ∈ Ω × [0,T ] × Rd1 × Zk, the derivatives in x of the random field ρkt (z) up to order m exist, and for all x ∈ R d1 , max |γ|≤m |∂γρ1t (x, z)| + max |γ|=m [ ρ1t (·, z) ] β 2 ≤ l1t (z), max |γ|≤m |∂γρ2t (x, z)| ≤ l 2 t (z) , where l1 (resp. l2) is PT ⊗Z1 (resp. PT ⊗Z2) -measurable function satisfying ∫ Z1 l1t (z) 2π1t (dz) + ∫ Z2 l2t (z) 2π2(dz) ≤ N0. Let L2 = L2(Rd1 ,B(Rd1), ν; R), where ν (differential is denoted by dx) is the Lebesgue measure. Let S = S(Rd1) be the Schwartz space of rapidly decreasing functions on Rd1 . 108 Chapter 4. The L2-Sobolev theory for parabolic SIDEs The Fourier transform of an element v ∈ S is defined by vˆ(ξ) = F v(ξ) = ∫ Rd1 v(x)e−i2πξ·xdx, ξ ∈ Rd1 . We denote by F −1 its inverse. Denote the space of tempered distributions by S′, the dual of S. Let ∆ := ∑d1 i=1 ∂ 2 i be the Laplace operator on R d1 . For α ∈ R, we define the Sobolev scale Hα(Rd1; Rd) = { v = (vi)1≤i≤d : v i ∈ S′ and ( 1 + 4π2 |ξ|2 )α/2 vˆi ∈ L2(Rd1), ∀i ∈ {1, . . . , d} } = { v = (vi)1≤i≤d : v i ∈ S′ and (I − ∆) α 2 vi ∈ L2(Rd1), ∀i ∈ {1, . . . , d} } with the norm and inner product given by ∥v∥α,d =   d∑ i=1 ∣∣∣∣ ( 1 + 4π2 |ξ|2 )α/2 vˆi ∣∣∣∣ 2 L2   1/2 =   d∑ i=1 ∣∣∣(I − ∆)α/2 vi ∣∣∣ 2 L2   1/2 and (v, u)α,d = d∑ i=1 ( (I − ∆)α/2 vi, (I − ∆)α/2 ui ) L2 , ∀u, v ∈ Hα(Rd1; Rd), where (I − ∆)α/2 vi = F −1 (( 1 + 4π2 |ξ|2 )α/2 vˆi ) . It is well-known that C∞c (R d1; Rd) is dense in Hα(Rd1; Rd) for all α ∈ R. For v ∈ H1(Rd1; Rd) and u ∈ H−1(Rd1; Rd), we let ⟨v, u⟩1,d = (Λ1v,Λ−1u)0,d, and identify the dual of H1(Rd1; Rd) with H−1(Rd1; Rd) through this bilinear form. More- over, all of the properties imposed in Section 4.2 for the abstract family of spaces (Hα)α∈R and operators (Λα)α∈R hold for the Sobolev scale. We refer the reader to [Tri10] for more details about the Sobolev scale (see the references therein as well). For each α ∈ R, let Hα(Rd1; Rd;F0) be the space of all F0-measurable Hα(Rd1; Rd)- valued random variables φ˜ satisfying E [ ∥φ˜∥2α ] < ∞. Let Hα(Rd1; Rd2) be the space of all Hα(Rd1; Rd2)-valued RT -measurable processes f : 4.3. The L2-Sobolev theory for degenerate SIDEs 109 Ω × [0,T ]→ Hα(Rd1; Rd2) such that E ∫ T 0 ∥ ft∥2αdVt < ∞. Let Hα(Rd1; ℓ2(Rd)) be the space of all sequences of Hα(Rd1; Rd)-valuedPT -measurable processes g˜ = (g˜ϱ)ϱ∈N, g˜ϱ : Ω × [0,T ]→ Hα(Rd1; Rd), satisfying E ∫ T 0 ∥g˜t∥2αdVt = E ∫ T 0 ∑ ρ∈N ∥g˜ϱt ∥ 2 αdVt < ∞. Let Hα(Rd1; Rd; π1) be the space of all Hα(Rd1; Rd)-valued PT ⊗Z1-measurable processes h˜ : Ω × [0,T ] ×Z1 → Hα(Rd1; Rd) such that E ∫ T 0 ∫ Z1 ∥h˜t (z) ∥2απ 1 t (dz)dVt < ∞.. For each α ∈ R, we set Hα = Hα(Rd1; Rd2), Hα(F0), Hα = Hα(Rd1; Rd2), Hα(ℓ2) = Hα(Rd1; ℓ2(Rd2)), Hα(π1) = Hα(Rd1; Rd2; π1), and ∥ · ∥α = ∥ · ∥α,d2 , (·, ·)α = (·, ·)α,d2 , ⟨·, ·⟩1 = ⟨·, ·⟩1,d2 . We also set C ∞ c = C ∞ c (R d1; Rd). Definition 4.3.1. Let φ ∈ H0(F0), f ∈ H−1, g ∈ H0(ℓ2), and h ∈ H0(π1). An H0-valued strongly càdlàg process u = (ut)t≤T is said to be a solution of the SIDE (4.3.1) if u ∈ L2(Ω × [0,T ],OT , dVtdP; H1) and P-a.s. for all t ∈ [0,T ], ut H−1 = φ + ∫ t 0 ( (L1;ls +L 2;l s )us + b i s∂iu l s + c ll¯ s u l¯ s + f l s ) dVs + ∫ t 0 ( N lϱs us + g lϱ s ) dwϱs + ∫ t 0 ∫ Z1 ( Ils,zu l¯ s− + h l s(z) ) η˜(ds, dz), where H−1 = indicates that the equality holds in the H−1. That is, P-a.s. for all t ∈ [0,T ] and v ∈ H1, (v, ut)0 = (v, u0) + ∫ t 0 ⟨v, (L1s +L 2 s)us + b i s∂ius + csus + fs⟩1dVs + ∫ t 0 ( v, ( N lϱs us + g lϱ s )) 0 dwϱs + ∫ t 0 ∫ Z1 ( v, ( Ils,zu l¯ s− + h l s(z) )) η˜(ds, dz). The coming theorem is our existence result for equation (4.3.1). In the next section, we prove this theorem by appealing to Theorem 4.2.4. Theorem 4.3.2. Let Assumption 4.3.1(m, d2) hold for m ∈ N and a real number β ∈ [0, 2]. Then for every φ ∈ Hm(F0), f ∈ Hm, g ∈ Hm+1(ℓ2), h ∈ Hm+ β 2 (π1), and there exists a unique solution u = (ut)t≤T of (4.3.1) that is weakly càdlàg as an Hm-valued process and 110 Chapter 4. The L2-Sobolev theory for parabolic SIDEs strongly càdlàg as an Hα ′ -valued process for any α′ < m. Moreover, there is a constant N = N(d1, d2,N0,m, η, β) such that E [ sup t≤T ∥ut∥2m ] ≤ NE [ ∥φ∥2m + ∫ T 0 ( ∥ ft∥2m + ∥gt∥ 2 m+1 + ∫ Z1 ∥ht(z)∥2m+ β2 π1t (dz) ) dVt ] . The following corollary can be proved in the same way as Corollary 1 (with p = 2) in Chapter 4, Section 2.2 of [Roz90] by making use of the Sobolev embedding theorem. Corollary 4.3.3. Let Assumption 4.3.1(m, d2) hold for an integer m > d1 2 and a real number β ∈ [0, 2]. Then the solution (ut)t≤T of (4.3.1) has a version with the following properties: (1) for every x ∈ Rd1 , ut(x) is a OT -measurable Rd2-valued process; (2) for β = (2m − d1)/2 and for all ω ∈ Ω, u ∈ D([0,T ];C β loc(R d1 ; Rd2)); (3) it possess all the properties mentioned in Theorem 4.3.2; (4) for each bounded subset Q ⊆ Rd, E [ sup t≤T |ut|2β;Q;Rd2 ] + E ∫ ]0,T ] ∥ut∥2β;Q;Rd2 dVt ≤ NE [ ∥φ∥2m + ∫ T 0 ( ∥ ft∥2m + ∥gt∥ 2 m+1 + ∫ Z1 ∥ht(z)∥2m+ β2 π1t (dz) ) dVt ] , where N = N(d1, d2,N0,m, η, β) is a constant; (5) if (u1t )t≤T and (u 2 t )t≤T are solutions of (4.3.1) possessing properties (1) and (2), then P   sup t≤T,x∈Rd1 |u1t (x) − u 2 t (x)| > 0   = 0. Remark 4.3.4. If m > α1∨α2 + d2 , then u ∈ D([0,T ];C α1∨α2 loc (R d1; Rd2)) and we can obtain a representation of the solution of (4.3.1) as in Theorem 3.2.2 by applying the Ito-Wentzell formula given in Proposition 3.4.16. However, there are two important points to notice. First, since we have only established an integer regularity theory for our equation, then the best we can hope for is a theory for classical solutions with integer assumptions on the coefficients, initial condition, and free terms. This is in stark contrast with Chapter 3. Moreover, the restriction to p = 2 has dire consequences, since the d12 term gets larger as d1 grows, and hence one must impose more regularity as the dimension grows. This is a strong motivation to consider the Lp-Sobolev theory for (4.3.1) in weighted spaces, so that the d12 can be replaced with d1 p . This way, by taking p large, the term d1 p can be made as small as one likes. It is worth mentioning that we could have considered weighted scale of Sobolev spaces here (see, e.g. [GK92]), but we will leave this for a future project. At the time of writing, the integer scale Lp-Sobolev theory for (degenerate) (4.3.1) is currently underway and will be available soon. 4.3. The L2-Sobolev theory for degenerate SIDEs 111 4.3.2 Proof of Theorem 4.3.2 By [MR99] (see Examples 2.3-2.4), the stochastic integrals in (4.3.1) can be written as stochastic integrals with respect to a cylindrical martingale. We will apply Theorem 4.2.4 to (4.3.1) with α = m and µ = 1 by checking that Assumptions 4.2.1(λ, 1) and 4.2.2(λ, 1) for λ ∈ {0,m} are implied by Assumption 4.3.1(m, d2). We start with λ = 0 as our base case and show that λ = m can be reduced to it. We introduce our base assumption for β ∈ [0, 2] . Assumption 4.3.2 (d2). Let N0 be a positive constant. (1) For all (ω, t, x) ∈ Ω × [0,T ] × Rd1 , the derivatives in x of the random fields bt, σ1t , σ 2 t , and divσ1t exist, and for all x ∈ R d1 , |∇ divσ1t (x) | + |σ k t (x) | + |∇σ k t (x)| + |divbt(x)| + |ct(x)| + |υ 2 t,sym (x) | + |∇υ 1 t (x) | ≤ N0. (2) For each k ∈ {1, 2} and all (ω, t, x, z) ∈ Ω × [0,T ] × Rd1 × Zk, the derivatives in x of the random fields ζkt (z) exist, and for all x ∈ R d1 , |ζ1t (x, z)| ≤ K 1 t (z) , |∇ζ 1 t (x, z) | ≤ K¯ 1 t (z) , [ div ζ1t (·, z) ] β 2 ≤ K˜1t (z) , ∀z ∈ Z 1, |ζ2t (x, z)| ≤ K 2 t (z) , |∇ζ 2 t (x, z) | ≤ K¯ 2 t (z) , ∀z ∈ Z 2, where K1t , K¯ 1 t , K˜ 1 t (resp. K 2 t , K¯ 2 t ) are PT ⊗ Z 1 (resp. PT ⊗ Z2) -measurable processes satisfying sup z∈Z1 ( K1t (z) + K¯ 1 t (z) + K˜ 1 t (z) ) + ∫ Z1 ( K1t (z) β + K¯1t (z) 2 + K˜1t (z) 2 ) π1t (dz) ≤ N0, sup z∈Z2 ( K2t (z) + K¯ 2 t (z) ) + ∫ Z2 K¯2t (z) 2π2(dz) ≤ N0. (3) There is a constant η < 1 such that for each k ∈ {1, 2} on the set all (ω, t, x, z) ∈ Ω × [0,T ] × Rd1 × Zk for which |∇ζkt (x, z)| > η, ∣∣∣∣ ( Id1 + ∇ζ k t (x, z) )−1∣∣∣∣ ≤ N0. (4) For each k ∈ {1, 2} and all (ω, t, x, z) ∈ Ω × [0,T ] × Rd1 × Zk, |ρkt (x, z)| ≤ l k t (z), and for all (ω, t, z) ∈ Ω × [0,T ] × Z1, [ ρ1t,sym(·, z) ] β 2 ≤ l˜1t (z) , where l k (resp., l˜1) is a PT ⊗ Zk- measurable (resp. PT ⊗Z1-measurable) functions satisfying ∫ Z1 ( l1t (z) 2 + l˜1t (z) 2 ) π1(dz) + ∫ Z2 l2t (z) 2π2(dz) ≤ N0. 112 Chapter 4. The L2-Sobolev theory for parabolic SIDEs Note that Assumption 4.3.2(d2) is weaker than Assumption 4.3.1(0, d2). Let us make the following convention for the remainder of this section. If we do not specify to which space the parameters ω, t, x, y, and z belong, then we mean ω ∈ Ω, t ∈ [0,T ], x ∈ Rd1 , and z ∈ Zk. Moreover, unless otherwise specified, all statements hold for all ω, t, x, y, and z independent of any constant N introduced is independent of ω, t, x, y, and z. We will also drop the dependence of processes t, x, and z when we feel it will not obscure our argument. Lastly, all derivatives and Hölder norms are taken with respect to x ∈ Rd1 . Remark 4.3.5. Let Assumption 4.3.2(d2) hold. For each k and θ ∈ [0, 1], on the set all ω, t, and z in which |Kkt (z)| ≤ η, we have ∣∣∣(Id1 + θ∇ζ k t (x, z)) −1 ∣∣∣ ≤ 1 1 − θη . Moreover, for each k and all ω, t, and z, we have ∣∣∣(Id1 + ∇ζ k t (x, z)) −1 ∣∣∣ ≤ max ( 1 1 − θη ,N0 ) . Therefore, by Hadamard’s theorem (see, e.g., Theorem 0.2 in [DMGZ94] or 51.5 in [Ber77]): • for each k and θ ∈ [0, 1], on the set all ω, t, and z in which |Kkt (z)| ≤ η, the mapping ζ˜kt,θ(x, z) := x + θζ k t (x, z) is a global diffeomorphism in x; • for each k and all ω, t, and z, the mapping ζ˜kt (x, z) = ζ˜ k t,1(x, z) = x + ζ k t (x, z) is a global diffeomorphism in x. When inverse of the mapping x 7→ ζ˜kt,θ(x, z) exists, we denote it by ζ˜ k;−1 t,θ (x, z) = ( ζ˜ k;−1; j t,θ (x, z) ) 1≤ j≤d1 and note that ζ˜ k;−1 t,θ (x, z) = x − θζ k t (ζ˜ k;−1 t,θ (x, z), z). Furthermore, for each k and θ ∈ [0, 1], on the set all ω, t, and z in which |Kkt (z)| ≤ η, there 4.3. The L2-Sobolev theory for degenerate SIDEs 113 is a constant N = N(d1,N0, η) such that |∇ζ˜k;−1t,θ (x, z)| ≤ N (4.3.2) and for each k and all ω, t, and z, |∇ζ˜k;−1t (x, z)| ≤ N. (4.3.3) Using simple properties of the determinant, we can easily show that there is a constant N = N(d1) such that for an arbitrary real-valued d1 × d1 matrix A, | det(Id1 + A) − 1| ≤ N|A| and | det(Id1 + A) − 1 − tr A| ≤ N|A| 2. Thus, there is a constant N = N(d1,N0, η) such that | det∇ζ˜k;−1 − 1| = ∣∣∣∣det ( Id − ζkt (ζ˜ k;−1 t ) ) − 1 ∣∣∣∣ ≤ N |∇ζk(ζ˜k;−1)| (4.3.4) and ∣∣∣∣det∇ζ˜k;−1 − 1 + div ( ζk(ζ˜k;−1) )∣∣∣∣ ≤ N |∇ζk(ζ˜k;−1)|2. Since ∂lζ˜k;−1; j = δl j − ∂mζk; j(ζ˜k;−1)∂lζ˜k;−1;m), we have | div(ζk(ζ˜k;−1)) − div ζk(ζ˜k;−1)| = |∂ jζk;l(ζ˜ k;−1 t )(∂lζ˜ k;−1; j − δl j)| ≤ N|∇ζk(ζ˜k;−1)|2, and thus ∣∣∣det∇ζ˜k;−1 − 1 + div ζk(ζ˜k;−1) ∣∣∣ ≤ N |∇ζk(ζ˜k;−1)|2. (4.3.5) In the following three lemmas, we will show that Assumptions 4.2.1(λ, 1) and 4.2.2(λ, 1) for λ ∈ {0,m} hold under Assumption 4.3.2(β) for any β ∈ [0, 2]. For each l ∈ {1, . . . , d2} and all ϕ ∈ C∞c , let Lltϕ = (L 1;l t +L 2;l t )ϕ + b i t∂iϕ l + cll¯t ϕ l¯, A1;lt ϕ = 1 2 σ 1;iϱ t σ 1; jϱ t ∂i jϕ l + σ 1;iϱ t υ 1;ll¯ϱ t ∂iϕ l¯, and J1t ϕ = L 1 t ϕ −A 1 t ϕ. Lemma 4.3.6. Let Assumption 4.3.2(d2) hold. Then there is a constant N = N(d1, d2,N0, η) such that for all (ω, t) ∈ Ω × [0,T ] and v ∈ H1, ∥Ltv∥−1 ≤ N∥v∥1, ∥Atv∥−1 ≤ N∥v∥1, ∥J1t v∥−1 ≤ N∥v∥1, ∥Ntv∥0 ≤ N∥ϕ∥1, and ∫ Z1 ∥It,zv∥20π 1 t (dz) ≤ N∥v∥ 2 1. 114 Chapter 4. The L2-Sobolev theory for parabolic SIDEs Proof. First we will show that there is a constant N such that (ψ,Ltϕ)0 ≤ N∥ψ∥1∥ϕ∥1, ∀ϕ ∈ C∞c . Once this is established, we know that L extends to a linear operator from H1 to H−1 (still denoted by L) and ∥Ltv∥−1 ≤ N∥v∥1, for all v ∈ H1. Using Taylor’s formula and the divergence theorem, we get that for all and all ϕ, ψ ∈ C∞c (ψ,Lϕ)0 = 2∑ k=1 ( (ψ,Lktϕ)0 + (∂iψ,Y k;i t ϕ)0 + (ψ, bt∂iϕ)0 + (ψ, ctϕ)0 ) , where for each k ∈ {1, 2}, l ∈ {1, . . . , d2}, and i ∈ {1, 2, . . . , d1}, Lk;lϕ : = − ∫ K¯k<η ∫ 1 0 ( ϕl(ζ˜kθ ) − ϕ l ) ∂iζ k;idθπk(dz) − ∫ K¯k<η ∫ 1 0 θ∂ jϕ l(ζ˜kθ )∂iζ k; jζk;idθπk(dz) + ∫ Zk ρk;ll¯ ( ϕl(ζ˜k) − ϕl ) πk(dz) + σk;iϱυk;ll¯ϱ∂iϕ l¯ + ∫ K¯k>η ( ϕl(ζ˜k) − ϕl − ζk;i∂iϕl ) πk(dz), Yk;liϕ : = − ∫ K¯k<η ∫ 1 0 ( ϕl(ζ˜kθ ) − ϕ l ) ζk;idθπk(dz) − 1 2 ∂i ( σk;iϱσk; jϱ ) ∂ jϕ l. For the remainder of the proof, we make the convention that statements hold for all ϕ, ψ ∈ C∞c and that all constants N are independent of ϕ. By Minkowski’s integral inequality and Hölder’s inequality, we have (using the notation of Remark 4.3.5) ||Lkϕ||0 ≤ (∫ K¯k<η (Kk)2πk(dz) ) 1 2 ∫ 1 0 (∫ Rd1 ∫ K¯k<η |ϕ(ζ˜kθ (z)) − ϕ| 2πk(dz)dx ) 1 2 dθ + ∫ K¯k<η Kk(z)2 ∫ 1 0 (∫ Rd1 |∇ϕ(ζ˜kθ )| 2dx ) 1 2 πk(dz)θdθ + (∫ Z1 (lk)2πk(dz) ) 1 2 (∫ Rd1 ∫ Z1 |ϕ(ζ˜k) − ϕ|2πk(dz)dx ) 1 2 +N∥∇ϕ∥0 + ∫ K¯k≥η ∫ 1 0 (∫ Rd1 |ϕ(ζ˜k) − ϕ − ζk;i∂iϕ|2dx ) 1 2 dθπk(dz), and for all i ∈ {1, . . . , d1}, ||Yk;iϕ||0 ≤ (∫ K¯k<η (Kk)2πk(dz) ) 1 2 ∫ 1 0 (∫ Rd1 ∫ K¯k<η |ϕ(ζ˜kθ ) − ϕ| 2πk(dz)dx ) 1 2 dθ + N∥∇ϕ∥0. 4.3. The L2-Sobolev theory for degenerate SIDEs 115 Applying the change of variable formula and appealing to (4.3.2), we find that ∫ Rd1 ∫ K¯k<η |ϕ(ζ˜kθ ) − ϕ| 2πk(dz)dx ≤ θ ∫ K¯k<η ∫ 1 0 ∫ Rd1 |∇ϕ(ζ˜k θθ¯ )|2|ζk|2dxdθ¯ ≤ θ ∫ K¯k<η (Kk)2 ∫ 1 0 ∫ Rd1 |∇ϕ|2 ∣∣∣det∇ζ˜k;−1 θθ¯ ∣∣∣ dxdθ¯ ≤ Nθ∥∇ϕ∥20. Similarly, since πk({z ∈ Zk : K¯k ≥ η}) ≤ N0, we have ∫ Rd1 ∫ K¯k≥η |ϕ(ζ˜k) − ϕ|2πk(dz)dx ≤ 2 ∫ Rd1 ∫ K¯k≥η ( |ϕ(ζ˜k)|2 + |ϕ|2 ) πk(dz)dx ≤ ∫ K¯k≥η ∫ Rd1 |ϕ|2 ( 1 + ∣∣∣det∇ζ˜k;−1 ∣∣∣ ) dxπk(dz) ≤ N∥ϕ∥20 and ∫ K¯k≥η (∫ Rd1 |ϕ(ζ˜k) − ϕ − ζk;i∂iϕ|2dx ) 1 2 πk(dz) ≤ N ∫ K¯k≥η (∫ Rd1 ( |ϕ|2 ( 1 + ∣∣∣det∇ζ˜k;−1 ∣∣∣ ) + (Kk)2|∇ϕ|2 ) dx ) 1 2 πk(dz) ≤ N∥ϕ∥1, where in the last inequality we used (4.3.3). Moreover, ∫ K¯k<η (Kk)2 (∫ Rd1 |∇ϕ(ζ˜kθ )| 2dx ) 1 2 πk(dz) ≤ ∫ K¯k<η (Kk)2 (∫ Rd1 |∇ϕ|2 det∇ζ˜k;−1 θ dx ) 1 2 πk(dz) ≤ N∥∇ϕ∥0. Combining the above estimates, we get that (ψ,Lϕ)0 ≤ N∥ψ∥1∥ϕ∥1. It is clear from the above computation that ∥A1t ϕ∥−1 ≤ N∥ϕ∥1, ∥J 1 t ϕ∥−1 ≤ N∥ϕ∥1, where actually A1 and J1 are actually extensions of the operators defined above. The inequality ∥Nϕ∥0 ≤ N∥ϕ∥1 can easily be obtained. Following similar calculations to ones we derived above (using (4.3.3) and (4.3.2)), we obtain ∫ Z1 ||Iϕ||20π 1(dz) ≤ N(A1 + A2), 116 Chapter 4. The L2-Sobolev theory for parabolic SIDEs where A1 := ∫ Rd1 ∫ K¯1≤η ∫ 1 0 |∇ϕ(ζ˜1θ )| 2|ξ1|2π1(dz)dθdx + ∫ Rd1 ∫ K1>η ( |ϕ(ζ˜1)|2 + |ϕ|2 ) π1(dz)dx ≤ N∥ϕ∥21 and A2 := ∫ Rd1 ∫ Z1 ρ1;ll¯ϕl¯(ζ˜1)π1 (dz) dx ≤ N∥ϕ∥20. □ Lemma 4.3.7. Let Assumption 4.3.2(d2) hold. Then there is a constant N = N(d1, d2,N0, η, β) such that for all (ω, t) ∈ Ω × [0,T ] and all v ∈ H1, 2⟨v,L2t v + b i t∂ivt + c ·l¯ t v l¯ t⟩1 + 1 4 (σ2;iϱt ∂iv, σ 2; jϱ t ∂ jv)0 + 1 4 ∫ Z2 ∥v(ζ˜2t (z)) − v∥ 2 0π 2 t (dz) ≤ N||v|| 2 0, (4.3.6) 2⟨v,A1t v⟩1 + ||Ntv|| 2 0 ≤ N||v|| 2 0, 2⟨v,J 1 t v⟩1 + ∫ Z1 ||It,zv||20π 1 t (dz) ≤ N ||v|| 2 0, (4.3.7) and 2⟨v,Ltv + ft⟩1 + ||Ntv + gt||20 + ∫ Z1 ||It,zv + ht (z) ||20π 1 (dz) + 1 4 (σ2;iϱt ∂iv, σ 2; jϱ t ∂ jv)0 + 1 4 ∫ Z2 ∥v(ζ˜2t (z)) − v∥ 2 0π 2 t (dz) ≤ N ( ∥v∥20 + || ft|| 2 0 + ||gt|| 2 1 + ∫ Z1 ||ht (z) ||2β 2 π1t (dz) ) . Proof. For the remainder of the proof, we make the convention that statements hold for all ϕ ∈ C∞c and that all constants N are independent of ϕ. Using the divergence theorem, we get 2⟨ϕ,A1ϕ⟩1 + ||Nϕ||20 = 1 2 ∫ Rd1 ( | divσ1|2 + 2σ1;i∂i divσ1 + ∂ jσ1;i∂iσ1; j ) |ϕ|2dx + ∫ Rd1 ( |υ1ϕ|2 − 2ϕl ( υ1;ll¯sym divσ 1 + σ1;i∂iυ 1;ll¯ sym ) ϕl¯ ) dx ≤ N∥ϕ∥20. Rearranging terms and using the identity 2a(b − a) = −|b − a|2 + |b|2 − |a|2, a, b ∈ R, we obtain 2⟨ϕ,J1ϕ⟩1 + ∫ Z1 ||Iϕ||20π 1 t (dz) = A1 + A2, 4.3. The L2-Sobolev theory for degenerate SIDEs 117 where A1 := ∫ Rd1 ∫ Z1 ( |ϕ(ζ˜1)|2 − |ϕ|2 − 2ϕζ1;i∂iϕ ) π1(dz)dx A2 := 2 ∫ Rd1 ∫ Z1 ( ϕl(ζ˜1)ρ1;ll¯ϕl¯(ζ˜1) − ϕlρ1;l¯ϕl ) π1(dz)dx + ∫ Rd1 ∫ Z1 |ρ1ϕ(ζ˜1)|2π1(dz)dx, Since | div ζ1(ζ˜1;−1) − div ζ1| ≤ [div ζ1] β 2 (K1) β 2 ≤ (K˜1)2 + (K1)β, changing the variable of integration and making use of the estimate (4.3.5), we obtain A1 ≤ ∫ Rd1 |ϕ|2 ∫ Z1 | det∇ζ˜1;−1 − 1 + div ζ1|π1(dz)dx ≤ N∥ϕ∥20 and A2 = 2 ∫ Rd1 ∫ Z1 ϕl ( ρ1;ll¯(ζ˜1;−1) − ρ1;ll¯ ) ϕl¯π1(dz)dx + ∫ Rd1 ∫ Z1 2ϕlρ1;ll¯(ζ˜1;−1)ϕl¯ ( det∇ζ˜1;−1 − 1 ) π1(dz)dx + ∫ Rd1 ∫ Z1 |ρ1(ζ˜1;−1)ϕ|2 det∇ζ˜1;−1π1(dz)dx =: A21 + A22 + A23. Owing to (4.3.4) and Hölder’s inequality, we have A22 + A23 ≤ N ∫ Z1 ((l1)2 + (K1)2)π1(dz)||ϕ||20. For β > 0, we have A21 ≤ N ∫ Z1 [ ρ1sym ] β 2 (K1) β 2π1(dz)∥ϕ∥20 ≤ N ∫ Z1 ( (l˜1)2 + (K1)β ) π1(dz)∥ϕ∥20 ≤ N∥ϕ∥ 2 0 and for β = 0, using Holder’s inequality, we get A21 ≤ N∥ϕ∥20 ∫ Z1 (l1)2π1(dz). By the divergence theorem, we have 2⟨ϕ,L2ϕ⟩0 = B1 + B2 + B3, where B1 := ∫ Rd1 ( ϕlσ2;iϱσ2; jϱ∂i jϕ l + 2σ2;iϱυ2;ll¯ϱ∂iϕ l¯ ) dx, 118 Chapter 4. The L2-Sobolev theory for parabolic SIDEs B2 := 2 ∫ Rd1 ∫ Z2 ϕl ( ϕl(ζ˜2) − ϕl − ζ2;i∂iϕl ) π2(dz)dx, B3 := 2 ∫ Rd1 ∫ Z2 ϕlρ2;ll¯ ( ϕl¯(ζ˜2) − ϕl¯ ) π2(dz)dx. Owing to the divergence theorem, we have (ϕ, σ2;iϱσ2; jϱ∂i jϕ)0 = − (( σ2;iϱσ2; jϱ∂iϕ + ϕ ( σ2; jϱ divσ2;ϱ + σ2;iϱ∂iσ 2; jϱ )) , ∂ jϕ ) 0 (ϕσ2;iϱ∂iσ 2; jϱ, ∂ jϕ)0 = − 1 2 ( ϕ ( ∂ jσ 2;iϱ∂iσ 2; jϱ + σ2; jϱ∂ j divσ 2;ϱ ) , ϕ ) 0 , (ϕσ2; jϱ∂ j divσ 2;ϱ, ϕ)0 = −(ϕ| divσ2|2, ϕ)0 + 2(ϕσ2;iϱ divσ2;ϱ, ∂ jϕ)0, and hence, (ϕ, σ2;iϱt σ 2; jϱ t ∂i jϕ)0 = − ( σ2;iϱσ2; jϱ∂iϕ l∂ jϕ l + 2ϕσ2; jϱ divσ2;ϱ, ∂ jϕ ) 0 + 1 2 ( ϕ ( ∂ jσ 2;iϱ∂iσ 2; jϱ − | divσ2|2 ) , ϕ ) 0 . Thus, by Young’s inequality, B1 ≤ − 1 2 ∫ ∂iϕ lσ2;iϱσ2; jϱ∂ jϕ ldx + N∥ϕ∥20. Once again making use of the identity 2a(b − a) = −|b − a|2 + |b|2 − |a|2, a, b ∈ R, we get 2ϕl(ϕl(ζ˜2) − ϕl − ζ2;i∂iϕl) = −|ϕ(ζ˜2) − ϕ|2 + |ϕ(ζ˜2)|2 − |ϕ|2 − ζ 2;i t ∂i|ϕ| 2. Changing the variable of integration and applying the divergence theorem, we obtain B2 = − ∫ Rd1 ∫ Z2 |ϕ(ζ˜2) − ϕ|2π2(dz)dx + ∫ Rd1 ∫ Z2 |ϕ|2 ( det∇ζ˜2;−1 − 1 + div ζ2(ζ˜2;−1) ) π2(dz)dx + ∫ Rd1 ∫ Z2 |ϕ|2 ( div ζ2 − div ζ2(ζ˜2;−1) ) π2(dz)dx. Changing the variable of integration in the last term of B2, we get ∫ Rd1 ∫ Z2 |ϕ|2 ( div ζ2 − div ζ2(ζ˜2;−1) ) π2(dz)dx = ∫ Rd1 ∫ Z2 ( |ϕ|2 − |ϕ(ζ˜)|2 det∇ζ˜2 ) div ζ2π2(dz)dx = ∫ Rd1 ∫ Z2 |ϕ(ζ˜2)|2 ( 1 − det∇ζ˜2 ) div ζ2π2(dz)dx 4.3. The L2-Sobolev theory for degenerate SIDEs 119 + ∫ Rd1 ∫ Z2 (ϕl − ϕl(ζ˜2))(ϕl + ϕl(ζ˜2)) div ζ2π2(dz)dx =: B21 + B22. Clearly, B21 ≤ N ∫ Z2 (K¯2)2π2(dz)∥ϕ∥20, and applying Hölder’s inequality, B22 ≤ N ∫ Rd1 (∫ Z2 |ϕ(ζ˜2) − ϕ|2π2(dz) ) 1 2 (∫ Z2 ( |ϕ|2 + |ϕ(ζ˜2)|2 ) (K¯2)2π2(dz) ) 1 2 dx. Hence, by Remark 4.3.5 and Young’s inequality, B2 ≤ − 1 2 ∫ Rd1 ∫ Z2 |ϕ(ζ˜2) − ϕ|2π2(dz)dx + N∥ϕ∥20. By Hölder’s inequality, B3 ≤ N ∫ Rd1 (∫ Z2 |ϕ(ζ˜2) − ϕ|2π2(dz) ) 1 2 (∫ Z2 (l2)2π2(dz) ) 1 2 |ϕ|dx. Applying Young’s inequality again and combining B2 and B3, we derive 2⟨ϕ,L2ϕ⟩1 ≤ N∥ϕ∥20 − 1 4 ∫ ∂iϕ lσ2;iϱσ2; jϱ∂ jϕ ldx − 1 4 ∫ ∫ Z2 |ϕ(ζ˜2) − ϕ|2π2(dz)dx. (4.3.8) By the divergence theorem, we have 2⟨ϕ, bi∂iϕ + c·l¯ϕl¯ + f ⟩0 = 2 (ϕ, f )0 + (ϕ, ϕ div b)0 + 2(ϕ, cϕ)0 ≤ N(∥ϕ∥ 2 0 + ∥ f ∥ 2 0). (4.3.9) Combining (4.3.8) and (4.3.9), we obtain (4.3.6). To obtain the estimate (4.3.7), we use (4.3.6) and (4.3.7), and estimate the additional terms: D := ( σ1;iϱ∂iϕ + υ 1;·l¯ϱϕl¯, gϱ ) 0 and 2 ∫ Z1 (( ϕ(ζ˜1) − ϕ, h ) 0 + (ρ1ϕ(ζ˜1), h)0 ) π1(dz) =: E1 + E2. By the divergence theorem and Hölder’s inequality,|D| ≤ N ( ||ϕ||20 + ||g|| 2 1 ) .Applying Hölder’s inequality and changing the variable of integration, we get E2 ≤ ∫ Rd1 ∫ Z1 ( |ρ1ϕ(ζ˜1)|2 + |h|2 ) π1(dz)dx ≤ N ( ||ϕ||20 + ∫ Z1 ||h (z) ||20π 1(dz) ) . 120 Chapter 4. The L2-Sobolev theory for parabolic SIDEs Then by (4.3.4), Hölder’s inequality, and Lemma 4.3.10, E1 = 2 ∫ Z1 ∫ Rd1 ϕl ( hl(ζ˜1;−1)(det∇ζ˜1;−1 − 1) + hl(ζ˜1;−1) − h ) dxπ1(dz) ≤ N ( ∥ϕ∥20 + ∫ Rd1 ∫ Z1 ( |h|2 + |h(ζ˜1;−1) − h|2 ) π1(dz)dx ) ≤ N ( ∥ϕ∥20 + ∫ Z1 ||h(z)||2β 2 π1(dz) ) . This completes the proof. □ In the following lemma, we verify that Assumption 4.2.2(0, 1) holds for (4.3.1). Recall thatW0,1 is the space of all H0-valued strongly càdlàg processes v : Ω × [0,T ]→ H0 that belong to L2(Ω × [0,T ],OT , dVtdP; H1). Lemma 4.3.8. Let Assumption 4.3.2(d2) hold. Then there is a constant N = N(d1, d2,N0, η, β) such that for all v ∈ W0,1, P-a.s.: (1) 2⟨vt,Ltvt⟩1dVt + ∥Ntvt∥20dVt + ∫ Z1 ∥It,zvt−∥20η(dt, dz) +2(vt,N ϱ t vt)0dw ϱ t + 2 ∫ Z1 (vt−,It,zvt−)0η˜ (dt, dz) ≤ ( N ||vt||20dVt + ∫ Z1 Nκt(z)||vt−||20η(dt, dz) + 2(vt,N ρ t vt)0dw ϱ t + ∫ Z1 Gt,z(v)η˜(dt, dz) ) , where |Gt,z(v)|dVt ≤ κ¯t(z)||vt−||20dVt, ∀z ∈ Z 1, |(vt,Ntvt)0|dVt ≤ N ||vt||20 dVt, and κt and κ¯t are PT ×Z1-measurable processes such that for all t ∈ [0,T ], ∫ Z1 ( κt(z) + κ˜t(z) 2 ) π1t (dz) ≤ N; (2) 2(Nϱt vt, g ϱ t )0dVt + 2 ∫ Z1 (It,zvt−, ht(z))0η(dt, dz) + 2(vt, g ϱ t )0dw ϱ t + 2 ∫ Z1 (vt−, ht(z))0η˜(dt, dz) ≤ ( N∥vt−∥0rtdVt + ∫ Z1 N∥vt−∥0∥ht(z)∥0κˆt(z)η(dt, dz) + 2(vt, g ϱ t )0dw ϱ t + 2 ∫ Z1 G¯t,z(v)η˜(dt, dz) ) , 4.3. The L2-Sobolev theory for degenerate SIDEs 121 where rt := ∥gt∥1 + ∣∣∣∣∣ ∣∣∣∣∣ ∫ Z1 ( ht(ζ˜ 1;−1 t (z), z) − ht(z) ) π1t (dz) ∣∣∣∣∣ ∣∣∣∣∣ 0 , t ∈ [0,T ], |(vt, gt)0|dVt ≤ N∥vt∥0∥gt∥0dVt, G¯t,z(v)dVt ≤ N ||vt−||0||ht(z)||0, dVt, ∀z ∈ Z1, and κˆt is a PT ×Z1-measurable process such that for all t ∈ [0,T ], ∫ Z1 κˆt(z) 2π1t (dz) ≤ N. Proof. (1) Owing to the divergence theorem, we have 2(vt,N ϱ t vt)0 = (vt, ut divσ 1ϱ t )0 + 2(vt, υ 1ϱ t vt)0, ∀ϱ ∈ N, and hence P-a.s., |2(vt,Ntvt)0|dVt ≤ N||vt||2dVt. By virtue of Lemma 4.3.7(1), it suffices to estimate Q := 2⟨vt,J1t,zvt⟩1dVt + ∫ Z1 ∥It,zvt−∥20η(dt, dz) + 2 ∫ Z1 (vt−,It,zvt−)0η˜ (dt, dz) . An application of divergence theorem shows that Q = ∫ Z1 Pt,z(u)η(dt, dz) + ∫ Z1 Gt,z(v)η˜(dt, dz), where Gt,z(v) := 2(vt−, ρ 1 t (z)vt−)0 − (vt−, vt− div ζ 1 t (z))0, and Pt,z(v) := D1 + D2 + D3 with D1 := 2(vt−(ζ˜ 1 t (z)), ρ 1 t (z)vt−(ζ˜ 1 t ))0 − 2(vt−, ρ 1 t (z)vt−)0, D2 := ∥vt−(ζ˜1t (z))∥ 2 0 − ∥vt−∥ 2 0 + (vt−, vt− div ζ 1 t (z))0, D3 := ∥ρ 1 t (z)vt−(ζ˜ 1 t (z))∥ 2 0. Given our assumptions, it is clear that P-a.s., Gt,z(v)dVt ≤ N ( l1t (z) + K¯ 1 t (z) ) ∥vt−∥20dVt and D3dVt ≤ l 1 t (z) 2 ||vt−||20dVt, where in the last inequality we used the change of variable formula. Changing the variable 122 Chapter 4. The L2-Sobolev theory for parabolic SIDEs of integration and using (4.3.4) and (4.3.5), we find that dP -a.s., D1dVt ≤ N ( vt−, vt− ∣∣∣ρ1t (ζ˜ 1;−1 t (z), z) det∇ζ˜ 1;−1 t (z) − ρ 1 t (z) ∣∣∣ ) 0 dVt ≤ N ( l1t (z)K¯ 1 t (z) + l˜ 1 t (z)K 1 t (z) β 2 ) ∥vt−∥20dVt, and D2dVt = ( vt−, vt−| det∇ζ˜ 1;−1 t (z) − 1 + div ζ 1 t (z)| ) 0 dVt ≤ ( K¯1t (z) 2 + K˜1t (z)K 1 t (z) β 2 ) N∥vt−∥20dVt. Setting κt(z) = l 1 t (z) 2 + l1t K¯ 1 t (z) + l˜ 1 t (z)K 1 t (z) β 2 + K¯1t (z) 2 + K˜1t (z)K 1 t (z) β 2 , κ¯t(z) = l 1 t (z) + K¯ 1 t (z), z ∈ Z 1, and appealing to our assumptions, we complete the proof (1). (2) By the divergence theorem, we have (gϱt ,N ϱ t vt)0 = (g ϱ t , divσ 1ϱ t vt)0 + (σ 1;iϱ∂ig ϱ t , vt)0, ∀ρ ∈ N, and thus by the Cauchy-Schwarz inequality, |(gt,Ntvt)0|dVt ≤ N∥vt∥0∥gt∥1dVt. Changing the variable of integration, we obtain (It,zvt−, ht(z))0 = ( ht(z), ( vt−(ζ˜ 1 t (z)) − vt− + ρ 1 t (z)vt−(ζ˜ 1 t (z)) )) 0 = (ht(ζ˜ 1;−1 t (z), z) − ht(z), vt−)0 + (ht(ζ˜ 1;−1 t (z), z), (det∇ζ˜ 1;−1 t (z) − 1)vt−)0 + (ht(z), ρ 1 t (z)vt−(ζ˜ 1 t (z)))0. A simple calculation shows that P-a.s., 2 ∫ Z1 (It,zvt−, ht(z))0η(dt, dz) + 2 ∫ Z1 (vt−, ht(z))0η˜(dt, dz) ≤ 2∥vt−∥0r1t dVt + ∫ Z1 P¯t,z(v)η(dt, dz) + ∫ Z1 G¯t,z(v)η˜(dt, dz), where r1t := ∣∣∣∣∣ ∣∣∣∣∣ ∫ Z1 (ht(ζ˜ 1;−1 t (z), z) − ht(z))π 1 t (dz) ∣∣∣∣∣ ∣∣∣∣∣ 0 , G¯t,z(v) = (ht(ζ˜ 1;−1 t (z), z), vt), and Pt,z(v) := (ht(ζ˜ 1;−1 t (z)), vt−(det∇ζ˜ 1;−1 t (z) − 1))0 + (ht(z), ρ 1 t (z)vt−(ζ˜ 1 t (z)))0. 4.3. The L2-Sobolev theory for degenerate SIDEs 123 Applying the change of variable formula and Hölder’s inequality, P-a.s. we obtain ∫ Z1 P˜t,z(u)η(dt, dz) ≤ N∥vt−∥0 ∫ Z1 (K¯1t (z) + l 1 t (z))∥ht(z)∥0η(dt, dz) and |G˜t,z(v)|dVt ≤ N∥vt−∥0∥ht(z)∥0dVt. This completes the proof. □ Let d ∈ N. For a function v ∈ Hm(Rd1 ,Rd), define the linear operator Dv ∈ Hm−1(Rd1; Rd(d1+1)) by Dv = ( ∂0v, ∂1v, . . . , ∂d1v ) = v˜ with v˜l0 = vl and v˜l j = ∂ jvl, 1 ≤ l ≤ d, 0 ≤ j ≤ d1 (recall ∂0v = v). We define Dnv for n ∈ N by iteratively applyingD n-times. Recall that Λ = (I − ∆) 1 2 . It is easy to check that for each n ∈ N and all u, v ∈ Hn+1(Rd1 ,Rd), (u, v)n,d = (Λ nu,Λnv)0,d = (Dnu,Dnv)0,dd¯n1 , (4.3.10) (Λu,Λ−1v)n,d = (Λ n+1u,Λn−1v)0,d = (DnΛu,DnΛ−1v)0,dd¯n1 , (Dnu,Dnv)−1,dd¯n1 = (D nΛ−1u,DnΛ−1v)0,dd¯n1 = (u, v)n−1,d, where d¯1 = d1 + 1. Let us introduce the operators E(L), E(N), and E(Iz) acting on ϕ = (ϕl j)1≤l≤d2,1≤ j≤d¯1 ∈ C ∞ c (R d1 ,Rd2d¯1) that are defined as L,N , and I, respectively, but with the d2 × d2-dimensional coefficients υkt , ρ k, and c replaced by the d2d¯1 × d2d¯1-dimensional coefficients given by υk;l j,l¯ j¯ϱ = υk;ll¯ϱδ j j¯ + 1 j≥1(∂ jσ k; j¯ϱδll¯ + ∂ jυ k;ll¯ϱδ j¯0), ρk;l j,l¯ j¯ = ρk;ll¯t δ j¯ j + 1 j≥1(∂ jρ k;ll¯δ j¯0 + (δll¯ + ρ k;ll¯)∂ jζ k; j¯), and cl j,l¯ j¯ = cll¯δ j¯ j + ∂ jb j¯δll¯ + ∂ jc ll¯δ j¯0 + 2∑ k=1 ( υk;ll¯ϱ∂ jσ k; j¯ϱ + ∫ Zk ρk;ll¯∂ jζ k; j¯πkt (dz) ) , for 1 ≤ l, l¯ ≤ d2 and 0 ≤ j, j¯ ≤ d1. The coefficients σk, b, and functions ζk, k ∈ {1, 2}, remain unchanged in the definition of E(L), E(N), and E(I). We define En(L), En(N), and En(I), for n ∈ N by iteratively applying E n-times by the rules above with σk, b, and ζk, k ∈ {1, 2}, unchanged. A simple calculation shows that for all v ∈ H2(Rd1; Rd2), D[Lv] = E(L)Dv, D[Nϱv] = E(Nϱ)Dv, ϱ ∈ N, D[Izv] = E(Iz)Dv. 124 Chapter 4. The L2-Sobolev theory for parabolic SIDEs Continuing, for all v ∈ Hn+1(Rd1; Rd) we have Dn[Lv] = En(L)Dnv, Dn[Nϱv] = En(Nϱ)Dnv, ϱ ∈ N, Dn[Izv] = En(Iz)Dnv. (4.3.11) If Assumption 4.3.1(m, d2) holds, it can readily be verified by induction and the definitions (4.3.2)-(4.3.2) that Assumption 4.3.2(0, d2d¯m1 ) holds for the coefficients of the operators Em(L), Em(N), and Em(I). Moreover, owing to our assumptions on the input data, we have Dmϕ ∈ H0(Rd1 ; Rd2d¯ m 1 ;F0), Dm f ∈ H0(Rd1; Rd2d¯ m 1 ) Dmg ∈ ζ1(Rd1; ℓ2(Rd2d¯ m 1 )), Dmh ∈ H β 2 (Rd1; Rd2d¯ m 1 ; π1). Making use of (4.3.10), (4.3.11) and applying Lemma 4.3.6 to Em(L), for all v ∈ Hm+1, we obtain ∥Lv∥m−1 = ∥Dm[Lv]∥−1 = ∥Em(L)Dmv∥−1,d2d¯m1 ≤ N∥D mv∥1,d2d¯m1 = N∥v∥m+1. Likewise, for all v ∈ Hm+1, we derive ∥Nv∥m ≤ N∥v∥m+1, ∫ Z1 ||Iv||2mπ 1(dz) ≤ N∥v∥2m+1. By virtue of Lemma 4.3.7, we have that for all v ∈ Hm+1, 2(Λv,Λ−1Ltv)m + ||Ntv||2m + ∫ Z1 ||It,zv||2mπ 1 (dz) = 2(DmΛv,DmΛ−1[Ltv])0 + ||Dn[Ntv]||20 + ∫ Z1 ||Dm[It,zv]||20π 1(dz) 2⟨Dmv,Em(Lt)Dmv⟩1 + ||Em(Nt)Dmv||20 + ∫ Z1 ||Em(It,z)Dnv||20π 1(dz) ≤ N ||Dmv||20,d2d¯m1 = N ||v|| 2 m Using a similar argument, we find that for all v ∈ Hm+1, 2(Λv,Λ−1(Ltv + ft))m + ||Ntv + gt||2m + ∫ Z1 ||It,zv + ht(z)||2mπ 1(dz) ≤ N||v||2m + N f¯t, where f¯t = ∥ ft∥2m + ∥gt∥ 2 m+1 + ∫ Z1 ∥ht(z)∥2m+ β2 π1t (dz). Therefore, Assumption 4.2.1(m, d2) holds for the equation (4.3.1). Similarly, using Lem- mas 4.3.8 and 4.3.10, we find that Assumption 4.2.2(m, d2) holds for equation (4.3.1) as well. The statement of the theorem then follows directly from Theorem 4.2.4. 4.3. The L2-Sobolev theory for degenerate SIDEs 125 4.3.3 Appendix For each κ ∈ (0, 1) and tempered distribution f on Rd1 , we define ∂κ f = F −1[| · |κF f (·)], where F denotes the Fourier transform and F −1 denotes the inverse Fourier transform. Lemma 4.3.9 (cf. Lemma 2.1 in [Kom84]). Let f : Rd1 → R be smooth and bounded. Then for all κ ∈ (0, 1), there are constants N1 = N1(d1, κ), N2 = N2(d1, κ), and N3 = N2(d1, κ) such that for all x, y, z ∈ Rd1 , ∂κ f (x) = N1 ∫ Rd ( f (x + z) − f (x)) dz |z|d+δ and f (x + y) − f (x) = N2 ∫ Rd1 ∂κ f (x − z)k(κ)(y, z)dz, where k(κ)(y, z) = |y + z|κ−d − |z|κ−d and ∫ Rd1 |k(κ)(y, z)|dz = N3|z|κ. Lemma 4.3.10. Let (Z,Z, π) be a sigma-finite measure space. Let H : Rd1 × Z → Rd1 be B(Rd) ⊗Z-measurable and assume that for all (x, z) ∈ Rd1 × Z, |ζ(x, z)| ≤ K(z) and |∇ζ(x, z)| ≤ K¯(z) where K, K¯ : Z → R+ is a Z-measurable function for which there is a positive constant N0 such that for some fixed β ∈ (0, 2], sup z∈Z K(z) + sup z∈Z K¯(z) + ∫ Z ( K(z)β + K¯(z)2 ) π(dz) < N0 Assume that there is a constant η < 1 such that (x, z) ∈ {(x, z) ∈ Rd1 × Z : |∇ζ(x, z)| > η}, | ( Id1 + ∇ζt(x, z) )−1 | ≤ N0. Then there is a constant N = N(d1,N0, β, η) such that for all B(Rd1) ⊗ Z-measurable h : Rd1 × Z → Rd2 with h ∈ L2(Z,Z, π; H β 2 (Rd1; Rd2)), ∫ Rd1 ∣∣∣∣∣ ∫ Z (h(x + ζ(x, z), z) − h(x, z)) π(dz) ∣∣∣∣∣ 2 dx ≤ N ∫ Z ∥h(z)∥2β 2 π(dz). 126 Chapter 4. The L2-Sobolev theory for parabolic SIDEs Proof. It is easy to see that for any B(Rd1) ⊗Z-measurable h : Rd1 × Z → Rd2 such that ∫ Z sup x∈Rd1 |∇h(x, z)|2π(dz) < ∞, (4.3.12) the integral ∫ Z (h(x+ζ(x, z), z)−h(x, z))π(dz) is well-defined. Moreover, for anyB(Rd1)⊗Z- measurable h : Rd1 × Z → R with h ∈ L2(Z,Z, π; H β 2 (Rd1; Rd2)), we can always find a sequence (hn)n∈N of B(Rd1) ⊗ Z-measurable processes such that each element of the sequence is smooth with compact support in x and satisfies (4.3.12) and lim n→∞ ∫ Z ∥h(z) − hn(z)∥2β 2 π(dz) = 0. Thus, if we prove this lemma for h that is smooth with compact support in x and satisfies (4.3.12), then we can conclude that the sequence ∫ Z (hn(x + ζ(x, z), z) − hn(x, z)) π(dz), n ∈ N, is Cauchy in H0(Rd1; Rd2). We then define ∫ Z (h(x + ζ(x, z), z) − h(x, z)) π(dz) for any B(Rd1) ⊗Z-measurable h : Rd1 × Z → R with h ∈ L2(Z,Z, π; H β 2 (Rd1; Rd2)) to be the unique H0(Rd1; Rd2) limit of the Cauchy sequence. Hence, it suffices to consider h that is smooth with compact support in x and satisfies (4.3.12). First, let us consider the case β ∈ (0, 2). By Lemma 4.3.9, we have ∫ Rd1 ∣∣∣∣∣ ∫ Z ( h(ζ˜(x, z), z) − h(x, z) ) π(dz) ∣∣∣∣∣ 2 dx = N22 ∫ Rd1 ∣∣∣∣∣ ∫ Z ∫ Rd1 ∂ β 2 h(x − y, z)k( β 2 )(ζ(x, z), y) dyπ(dz) ∣∣∣∣∣ 2 dx =: N22 ∫ Rd1 | ∫ Z A(x, z)π(dz)|2dx. Applying Hölder’s inequality and Lemma 4.3.9, for all x and z, we have A(x, z) ≤ (∫ Rd1 |∂β/2h(x − y, z)|2k( β 2 )(ζ(x, z), y)) dy ) 1 2 (∫ Rd1 k( β 2 )(ζ(x, z), y)) dy ) 1 2 = √ N3 (∫ Rd1 |∂β/2h(x − y, z)|2k( β 2 )(ζ(x, z), y) dy ) 1 2 |ζt(x, z)| β 4 4.3. The L2-Sobolev theory for degenerate SIDEs 127 ≤ K(z) β 2 √ N3 (∫ Rd1 |∂ β 2 h(x − y, z)|2k( β 2 )(ζ(x, z), y) dyK(z)− β 2 ) 1 2 . Using Hölder’s inequality again, for all x, we get ∣∣∣∣∣ ∫ Z A(x, z)π(dz) ∣∣∣∣∣ 2 ≤ N3N0 ∫ Z ∫ Rd1 |∂β/2h(x − y, z)|2k( β 2 )(ζ(x, z), y) dyK(z)− β 2π(dz). For each x and z, we set B(x, z) = ∫ |y|≤2K(z) |∂ β 2 h(x − y, z)|2k( β 2 )(ζ(x, z), y) dy C(x, z) = ∫ |y|>2K(z) |∂ β 2 h(x − y, z)|2k( β 2 )(ζ(x, z), y) dy. Changing the variable integration, for all x and z, we find B(x, z) ≤ ∫ |y+ζ(x,z)|≤3K(z) |∂ β 2 h(x − y, z)|2 dy |y + ζ(x, z)|d1− β 2 + ∫ |y|≤2K(z) |∂ β 2 h(x − y, z)|2 dy |y|d1− β 2 =: B1(x, z) + B2(x, z), and B1(x, z) ≤ ∫ |y|≤3K(z) |∂ β 2 h((ζ˜(x, z) − y), z)|2| dy |y|d1− β 2 ≤ K(z) β 2 ∫ |y|≤3 |∂ β 2 h((ζ˜(x, z) − yK(z)), z)|2 dy |y|d1− β 2 , B2(x, z) ≤ K(z) β 2 ∫ |y|≤2 |∂β/2h(x − yK(z), z)|2 dy |y|d1− β 2 . Owing to Remark 4.3.5, for all z, the map x 7→ x + ζ(x, z) = ζ˜(x, z) is a global diffeomor- phism and det∇ζ˜−1(x, z) ≤ N. for some constant N = N(N0, d1, η). Thus, by the change of variable formula, there is a constant N = N(d1,N0, β, η) such that ∫ Rd1 ∫ Z B1(x, z)K(z) − β2π(dz)dx ≤ ∫ Z ∫ |y|≤3 ∫ Rd1 |∂β/2h((ζ˜(x, z) − yK(z)), z)|2dx dy |y|d1− β 2 π(dz) ≤ ∫ Z ∫ |y|≤3 ∫ Rd1 |∂β/2h((x − yK(z)), z)|2| det∇ζ˜−1(x, z)|dx dy |y|d1− β 2 π(dz) 128 Chapter 4. The L2-Sobolev theory for parabolic SIDEs ≤ N ∫ Z ∫ Rd1 |∂ β 2 h(x, z)|2dxπ(dz) and ∫ Rd1 ∫ Z K(z) β 2 B2(x, z)dxπ(dz) ≤ N ∫ Z ∫ Rd1 |∂ β 2 h(x, z)|2dxπ(dz). For all x, y, and z such that |ζ(x, z)| ≤ K(z) ≤ 12 |y|, we have ∣∣∣∣∣∣∣ 1 |y + ζ(x, z)|d1− β 2 − 1 |y|d1−β/2 ∣∣∣∣∣∣∣ ≤ ∣∣∣∣∣d1 − β 2 ∣∣∣∣∣ ∣∣∣∣∣∣∣   1 |y + ζ(x, z)|1+d1− β 2 + 1 |y|1+d1− β 2   ∣∣∣∣∣∣∣ |ζ(x, z)| ≤ 3 ∣∣∣∣∣d1 − β 2 ∣∣∣∣∣ |ζ(x, z)| |y|1+d1− β 2 , and hence for all x and z, C(x, z) = ∫ |y|>2K(z) |∂ β 2 h(x − y, z)|2 ∣∣∣∣∣∣∣ 1 |y + ζ(x, z)|d− β 2 − 1 |y|d− β 2 ∣∣∣∣∣∣∣ dy ≤ N ∫ |y|>2K(z) |∂ β 2 h(x − y, z)|2 |K(z)| |y|1+d1− β 2 dy ≤ NK(z) β 2 ∫ |y|>2 |∂ β 2 h((x − K(z)y), z)|2 dy |y|1+d1− β 2 . Estimating as above, we find that there is a constant N = N(d1,N0, β) ∫ Z ∫ Rd1 K(z) β 2 C(x, z)dxπ(dz) ≤ N ∫ Z ∫ Rd1 |∂ β 2 h(x, z)|2dxπ(dz). Combining the above estimates, we obtain the desired estimate for β ∈ (0, 2). Let us now consider the case β = 2. It follows from Remark 4.3.5 that for all θ ∈ [0, 1], on the set of z ∈ {z : K¯(z) < 12 }, the map x 7→ x + θζ(x, z) = ζ˜θ(x, z) is a global diffeomorphism and det∇ζ˜−1θ (x, z) ≤ N, for some constant N = N(N0, d1). Hence, making use of Taylor’s theorem and the change of variable formula, we find ∫ Rd1 ∣∣∣∣∣ ∫ Z (h(x + ζ(x, z), z) − h(x, z)) π(dz) ∣∣∣∣∣ 2 dx ≤ ∫ Rd1 ∣∣∣∣∣∣ ∫ K¯(z)≥ 12 (h(x + ζ(x, z), z) − h(x, z)) π(dz) ∣∣∣∣∣∣ 2 dx + ∫ Rd1 ∣∣∣∣∣∣ ∫ K¯(z)< 12 ∫ 1 0 |∇h(x + θζ(x, z), z) ∣∣∣∣∣∣ dθK(z)π(dz)|2dx 4.3. The L2-Sobolev theory for degenerate SIDEs 129 ≤ π { K¯(z) ≥ 1 2 }∫ K¯(z)≥η ∫ Rd1 |h(x, z)|2| det ζ˜−1(x, z) + 1|dxπ(dz) +N0 ∫ K¯(z)< 12 ∫ Rd1 ∫ 1 0 |∇h(x, z)|2| det∇ζ˜−1θ (x, z)|dθdxπ(dz) ≤ N ∫ Z ∥h(z)∥21π(dz). This completes the proof. □ Chapter 5 A finite difference scheme for non-degenerate parabolic SIDEs 5.1 Introduction Let (Ω,F ,F,P), F = (Ft)t≥0, be a complete filtered probability space such that the filtration is right continuous and F0 contains all P-null sets of F . Let w ϱ t , t ≥ 0, ϱ ∈ N, be a sequence of independent real-valued F-adapted Wiener processes. Let π1(dz) and π2(dz) be a Borel sigma-finite measures on Rd satisfying ∫ Rd |z|2 ∧ 1 πr(dz) < ∞, r ∈ {1, 2}. Let q(dt, dz) = p(dt, dz) − π2(dz)dt be a compensated F-adapted Poisson random measure on R+ × Rd. Let T > 0 be an arbitrary fixed constant. On [0,T ] × Rd, we consider finite difference approximations for the following SIDE dut = ((Lt + I)ut + ft) dt + ∞∑ ϱ=1 ( Nϱt ut + g ϱ t ) dwϱt + ∫ Rd (I(z)ut− + ot(z)) q(dt, dz),(5.1.1) with initial condition u0(x) = φ(x), x ∈ Rd, where the operators are given by Ltϕ(x) := d∑ i, j=0 ai jt (x)∂i jϕ(x), Iϕ(x) := ∫ Rd  ϕ(x + z) − ϕ(x) − 1[−1,1](|z|) d∑ j=1 z j∂ jϕ(x)   π 1(dz), (5.1.2) Nϱt ϕ(x) := d∑ i=0 σ iϱ t (x)∂iϕ(x), I(z)ϕ(x) = ϕ(x + z) − ϕ(x). Here, we denote the identity operator by ∂0. 131 132 Chapter 5. A finite difference scheme for non-degenerate parabolic SIDEs Equation (5.1.1) arises naturally in non-linear filtering of jump-diffusion processes. We refer the reader to [Gri82] and [GM11] for more information about non-linear filter- ing of jump-diffusions and the derivation of the Zakai equation. Various methods have been developed to solve stochastic partial differential equations (SPDEs) numerically. For SPDEs driven by continuous martingale noise see, for example, [GK96, Gyö98, Gyö99, GM09, LR04, JK10, Yan05], and for SPDEs driven by discontinuous martingale noise, see [HM06, Hau08, Lan12, BL12]. Among the various methods considered in the literature is the method of finite differences. For second order linear SPDEs driven by continu- ous martingale noise it is well-known that the Lp(Ω)-pointwise error of approximation in space is proportional to the parameter h of the finite difference (see, e.g., [Yoo00]). In [GM09], I. Gyöngy and A. Millet consider abstract discretization schemes for stochastic evolution equations driven by continuous martingale noise in the variational framework and, as a particular example, show that the L2(Ω)-pointwise rate of convergence of an Euler-Maruyuma (explicit and implicit) finite difference scheme is of order one in space and one-half in time. More recently, it was shown by I. Gyöngy and N.V. Krylov that under certain regularity conditions, the rate of convergence in space of a semi-discretized finite difference approximation of a linear second order SPDE driven by continuous martingale noise can be accelerated to any order by Richardson’s extrapolation method. For the non- degenerate case, we refer to [GK10] and [GK11], and for the degenerate case, we refer to [Gyö11]. In [Hal12] and [Hal13], E. Hall proved that the same method of acceleration can be applied to implicit time-discretized SPDEs driven by continuous martingale noise. Also, for a pathwise convergence result of a spectral scheme for SPDEs on a bounded domain we refer the reader to [CJM92]. In the literature, finite element, spectral, and, more generally, Galerkin schemes have been studied for SPDEs driven by discontinuous martingale noise. One of the earliest works in this direction is a paper [HM06] by E. Hausenblas and I. Marchis concerning Lp(Ω)-convergence of Galerkin approximation schemes for abstract stochastic evolution equations in Banach spaces driven by Poisson noise of impulsive-type. As an applica- tion of their result, they study a spectral approximation of a linear SPDE in L2([0, 1]) with Neumann boundary conditions driven by Poisson noise of impulsive-type and derive Lp(Ω)-error estimates in the L2([0, 1])-norm. In [Hau08], E. Hausenblas considers finite element approximations of linear SPDEs in polyhedral domains D driven by Poisson noise of impulsive-type and derives Lp(Ω) error estimates in the Lp(D)-norm. In a more recent work [Lan12], A. Lang studied semi-discrete Galerkin approximation schemes for SPDEs of advection diffusion type in bounded domains D driven by cádlág square integrable mar- tingales in a Hilbert Space. A. Lang showed that the rate of convergence in the Lp(Ω) and almost-sure sense in the L2(D)-norm is of order two for a finite-element Galerkin scheme. In [BL12], A. Lang and A. Barth derive L2(Ω) and almost-sure estimates in the L2(D)- 5.1. Introduction 133 norm for the error of a Milstein-Galerkin approximation scheme for the same equation considered in [Lan12] and obtain convergence of order two in space and order one in time. In the articles [Lan12, BL12, HM06, Hau08], the authors make use of the semigroup theory of SPDEs (mild solution) and only consider SPDEs in which the principal part of the operator in the drift is non-random. Moreover, the authors there do not address the approximation of equations with non-local operators in the drift or noise. The principal part of the operator in the drift of the Zakai equation is, in general, random, and hence numerical schemes that approximate SPDEs or SIDEs with random-adapted principal part are of importance. More precisely, the coefficients of the Zakai equation are random if the coefficients of the SDE governing the signal depend on the observation. In this chapter, since we use the variational framework (L2-theory) of SPDEs, we are easily able to treat the case of random-coefficients, and hence the diffusion coefficients ai jt (x) appearing in (5.1.1) are random. In dimension one, a finite difference scheme for degenerate integro-differential equa- tions (deterministic) has been studied by R. Cont and E. Voltchkova in [CV05]. The authors in [CV05] first approximate the integral operator near the origin with a second derivative operator. The resulting PDE is then non-degenerate and has an integral operator of order zero. The error of this approximation is obtained by means of the probabilistic representation of the solution of both the original equation and the non-degenerate equa- tion. In the second step of their approximation, R. Cont and E. Voltchkova consider an implicit-explicit finite difference scheme and obtain pointwise error estimates of order one in space. As a consequence of the two-step approximation scheme, there are two separate errors for the approximation In this chapter, we consider the non-degenerate stochastic integro-differential equation (5.1.1) with random coefficients and apply the method of finite differences in the time and space variables. To the best of our knowledge, this article is the first to use the finite difference method to approximate SIDEs. The approximations of the non-local integral operators in the drift and in the noise of (5.1.1) we choose are both natural and relatively easy to implement. In particular, we are able to treat the singularity of the integral oper- ators near the origin directly. We consider a fully-explicit time-discretization scheme and an implicit-explicit time-discretization scheme, where we treat part of the approximation of the integral operator in the drift explicitly. To obtain error estimates for our approximations, we use the approach in [Yoo00], where the discretized equations are first solved as time-discretized SDEs in Sobolev spaces over Rd and an error estimate is obtained in Sobolev norms. After obtaining L2(Ω) error estimates in Sobolev norms, the Sobolev embedding theorem is used to obtain L2(Ω)- pointwise error estimates. So, in sum, we obtain two types of error estimates: in Sobolev norms and on the grid. Naturally, when using the Sobolev embedding to obtain the point- 134 Chapter 5. A finite difference scheme for non-degenerate parabolic SIDEs wise estimates, we do not need the equation to be differentiable to obtain pointwise error estimates, only continuous. Using the approach of first obtaining estimates in Sobolev spaces, we are also easily able to deduce that the more regularity on the coefficients and data we have, the stronger the error estimates we can obtain (see Corollaries 5.5.3 and 5.5.4). The chapter is organized as follows. In the next section, we introduce the notation that will be used throughout the chapter and state the main results. In the third section, we present a simulation that we did to confirm our rates of convergence. In the fourth section, we prove auxiliary results that will be used in the proof of the main theorems. In the fifth section, we prove the main theorems of the chapter. 5.2 Statement of main results Let us consider the following assumption for an integer m ≥ 0. Assumption 5.2.1 (m). For i, j ∈ {0, . . . , d}, ai jt = a i j t (x) are real-valued functions defined on Ω × [0,T ] × Rd that are RT ⊗ B(Rd)-measurable and σit = (σ iϱ t (x)) ∞ ϱ=1 are ℓ2-valued functions that are RT ⊗ B(Rd)-measurable. Moreover, (i) for each (ω, t) ∈ Ω× [0,T ], the functions ai jt are max(m, 1)-times continuously differ- entiable in x for all i, j ∈ {1, . . . , d}, ai0t and a 0i t are m-time continuously differentiable in x for all i ∈ {0, 1, . . . , d}, and σit are m-times continuously differentiable in x as ℓ2-valued functions for all i ∈ {0, . . . , d}. Furthermore, there is a constant K > 0 such that for all (ω, t, x) ∈ Ω × [0,T ] × Rd, |∂γai jt | ≤ K, ∀ i, j ∈ {1, . . . , d}, ∀ |γ| ≤ max(m, 1), |∂γai0t | + |∂ γa0it | + |∂ γσit|ℓ2 ≤ K, ∀ i ∈ {0, . . . , d}, ∀|γ| ≤ m; (ii) there exists a positive constant κ > 0 such that for all (ω, t, x) ∈ Ω × [0,T ] × Rd and η ∈ Rd d∑ i, j=1  2a i j t − ∞∑ ϱ=1 σ iϱ t σ jϱ t   ηiη j ≥ κ|η| 2. In this chapter, for each integer m ≥ 0, we set Hm = Hm(Rd; R), Hm(F0), Hm = Hm(Rd; R), Hm(ℓ2) = Hm(Rd; ℓ2), Hm(π2) = Hα(Rd; R; π2), and ∥ · ∥m = ∥ · ∥m,1, (·, ·)m = (·, ·)m,1, ⟨·, ·⟩1 = ⟨·, ·⟩1,1. We also set C∞c = C ∞ c (R d; R. Assumption 5.2.2 (m). We have ϕ ∈ Hm(F0), f ∈ Hm−1, g ∈ Hm(ℓ2), and o ∈ Hm(π2). Set κ2m = E [ ∥φ∥2m ] + E ∫ ]0,T ] ( ∥ ft∥2m−1 + ∥gt∥ 2 m,ℓ2 + ∫ Rd ∥ot(z)∥2mπ 2(dz) ) dt. 5.2. Statement of main results 135 For a real-valued twice continuous differentiable function ϕ on Rd, it is easy to see that for all x, z ∈ Rd, ϕ(x + z) − ϕ(x) − d∑ j=1 z j∂ jϕ(x) = ∫ 1 0 d∑ i, j=1 ziz j∂i jϕ(x + θz)(1 − θ)dθ. (5.2.1) For each δ ∈ (0, 1], let ς1(δ) = ∫ |z|≤δ |z|2π1(dz), ς2(δ) = ∫ |z|≤δ |z|2π2(dz), and ς(δ) = ς1(δ) + ς2(δ). Fix δ ∈ (0, 1] such that ς(δ) < κ, (5.2.2) and notice that 2∑ r=1 πr({|z| > δ}) < ∞. (5.2.3) We write I = Iδ + Iδc , where Iδϕ(x) := ∫ |z|≤δ ∫ 1 0 d∑ i, j=1 ziz j∂i jϕ(x + θz)(1 − θ)dθπ1(dz) and Iδc is defined as in (5.1.2) with integration over {|z| > δ} instead of Rd. Definition 5.2.1. An H0-valued càdlàg adapted process u is called a solution of (5.1.1) if (i) ut ∈ H1 for dP × dt-almost-every (ω, t) ∈ Ω × [0,T ]; (ii) E ∫ ]0,T ] ∥ut∥21dt < ∞; (iii) there exists a set Ω˜ ⊂ Ω of probability one such that for all (ω, t) ∈ [0,T ] × Ω˜ and ϕ ∈ C∞c (R d), (ut, ϕ)0 = (φ, ϕ)0 + ∫ ]0,t]   d∑ i, j=1 ( ∂ jus, ∂−i(a i j s ϕ) ) 0 + [ϕ, fs]0   ds + ∫ ]0,t] ∫ |z|≤δ ∫ 1 0 d∑ i, j=1 ( z j∂ jus(· + θz), zi∂−iϕ ) 0 (1 − θ)dθπ1(dz)ds + ∫ ]0,t] ∫ |z|>δ  us(· + z) − us − 1[−1,1](|z|) d∑ j=1 z j∂ jus, ϕ   0 π1(dz)ds 136 Chapter 5. A finite difference scheme for non-degenerate parabolic SIDEs + ∞∑ ϱ=1 ∫ ]0,t] d∑ i=0 ( σiϱs ∂ius + g ϱ s , ϕ ) 0 dwϱs + ∫ ]0,t] ∫ Rd (us−(· + z) − us− + ot(z), ϕ)0 q(dz, ds). Remark 5.2.2. In the above definition, instead of δ we may choose any other positive constant. The following existence theorem is a consequence of Theorems 2.9, 2.10, and 4.1 in [Gyö82] and will be verified in Section 4. Theorem 5.2.3. If Assumptions 5.2.1(m) and 5.2.2(m) hold with m ≥ 0, then there ex- ist a unique solution u of (5.1.1). Furthermore, u is a cádlág Hm-valued process with probability one and there is a constant N = N(d,m, κ,K,T ) such that E [ sup t≤T ∥ut∥2m ] + E ∫ ]0,T ] ∥us∥2m+1ds ≤ Nκ 2 m. (5.2.4) Remark 5.2.4. We have used the standard definition of solution for the variational (or L2) theory fo stochastic evolution equations. In what follows below, we will always assume m ≥ 2, and so we have enough regularity to formulate the solution in the weak sense in (H1,H0,H−1) without integrating by parts. The following proposition is needed to establish the rate of convergence in time of our approximation scheme and is proved in Section 4. Proposition 5.2.5. Let Assumptions 5.2.1(m) and 5.2.2(m) hold for some m ≥ 1 and u be the solution of (5.1.1). Moreover, assume that sup t≤T E [ ∥gt∥2m−1,ℓ2 ] + sup t≤T E ∫ Rd ∥ot(z)∥2m−1π 2(dz) ≤ K. Then there is a constant λ = λ(d,m,K,T, κ, κ2m) such that for all s, t ∈ [0,T ], E [ ∥ut − us∥2m−1 ] ≤ λ|t − s|. Assumption 5.2.3 (m). For m ≥ 3, in addition to Assumption 5.2.2(m), there exists a random variable ξ with Eξ < K such that for all ω ∈ Ω, t, s ∈ [0,T ], ∥gt∥2m−1,ℓ2 + ∫ Rd ∥ot(z)∥2m−1π 2(dz) ≤ ξ ∥ ft − fs∥2m−2 + ∥gt − gs∥ 2 m−2,ℓ2 + ∫ Rd ∥ot(z) − os(z)∥2m−1π 2(dz) ≤ ξ|t − s|. 5.2. Statement of main results 137 Assumption 5.2.4 (m). For m ≥ 3, in addition to Assumption 5.2.1 (i), there is a constant C such that for all (ω, x) ∈ Ω × Rd, s, t ∈ [0,T ], i, j ∈ {0, 1, . . . , d}, |∂γ ( ai jt − a i j s ) |2 + |∂γ ( σit − σ i s ) |2ℓ2 ≤ C|t − s|, ∀|γ| ≤ m − 2. We turn our attention to the discretisation of equation (5.1.1). For each h ∈ R−{0} and standard basis vector ei, i ∈ {1, . . . , d}, of Rd we define the first-order difference operator δh,i by δh,iϕ(x) := ϕ(x + hei) − ϕ(x) h , for all real-valued functions ϕ on Rd. We define δh,0 to be the identity operator. Notice that for all ψ, ϕ ∈ H0, we have (ϕ, δ−h,iψ)0 = −(δh,iϕ, ψ)0. (5.2.5) Set δhi := 1 2 (δh,i + δ−h,i) and observe that for all ϕ ∈ H0, (ϕ, δhi ϕ)0 = 0. (5.2.6) For each h , 0, we introduce the grid Gh := {hzk : zk ∈ Zd, k ∈ N0, z0 = 0} with step size |h|. Let ℓ2(Gh) be the Hilbert space of real-valued functions ϕ on Gh such that ∥ϕ∥2ℓ2(Gh) := |h| d ∑ x∈Gh |ϕ(x)|2 < ∞. We approximate the operators L and Nϱ by Lht ϕ(x) := d∑ i, j=0 ai jt (x)δh,iδ−h, jϕ(x) and N ϱ;h t ϕ(x) := d∑ i=0 σ iϱ t (x)δh,iϕ(x), respectively. In order to approximate I, we approximate Iδ and Iδc separately. For each k ∈ N ∪ {0} and h , 0, define the rectangles in Rd Ahk := ( z1k |h| − |h| 2 , z1k |h| + |h| 2 ] × · · · × ( zdk |h| − |h| 2 , zdk |h| + |h| 2 ] , where zik, i ∈ {1, ..., d}, are the coordinates of zk ∈ Z d, and set Bhk := A h k ∩ {|z| ≤ δ}, B¯ h k := A h k ∩ {|z| > δ}. 138 Chapter 5. A finite difference scheme for non-degenerate parabolic SIDEs We approximate Iδc by Ihδcϕ(x) := ∞∑ k=0  (ϕ(x + hzk) − ϕ(x)) ζ¯h,k − d∑ i=1 ξ¯ih,kδ h i ϕ(x)   , where ζ¯h,k := π 1(B¯hk) and ξ¯ i h,k := ∫ B¯hk∩[−1,1] ziπ1(dz). We continue with the approximation of the operator Iδ. By (5.2.1), for all x ∈ Gh, Iδϕ(x) = ∞∑ k=0 ∫ Bhk ∫ 1 0 d∑ i, j=1 ziz j∂i jϕ(x + θz)(1 − θ)dθπ1(dz), where there are only a finite number of non-zero terms in the infinite sum over k. The closest point in Gh to any point z ∈ Bhk is clearly hzk. This simple observation leads us to the following (intermediate) approximation of Iδϕ(x): ∞∑ k=0 ∫ 1 0 d∑ i, j=1 ∫ Bhk ziz jπ1(dz)∂i jϕ(x + θhzk)(1 − θ)dθ. However, in order to ensure that our approximation is well-defined for functions ϕ ∈ ℓ2(Gh), we need to approximate the integral over θ ∈ [0, 1]. Fix k ∈ N0 and h , 0. Consider the directed line segment {θhzk : θ ∈ [0, 1]} extending from the origin to the point hzk ∈ Rd. It is clear that this line segment intersects a unique finite sequence of rect- angles from the set {Ah k¯ }k¯∈N0 . Denote the number of rectangles by χ(h, k). Since the line’s start point is the origin, the first rectangle it intersects is Ah0, and since the line’s endpoint is hzk, the last rectangle it intersects is Ahk , the center of which is the point hzk. If χ(h, k) > 2, then in between these two rectangles, the line segment intersects χ(h, k) − 2 additional rectangles from the set {Ah k¯ }k¯∈N0 − {A h 0 ∪ A h k}. Denote the indices of these rectangles by r h,k l , l ∈ {2, . . . , χ(h, k)−1}, and set rh,k1 = 0 and r h,k χ(h,k) = k; that is, {θhzk; θ ∈ [0, 1]} ⊆ ∪ χ(h,k) l=1 A h rh,kl . Corresponding to the set of rectangles {Ah rh,kl }χ(h,k)l=1 is a partition 0 = θ h,k 0 ≤ · · · ≤ θ h,k χ(h,k) = 1 of the interval [0, 1] such that for each l ∈ {1, . . . , χ(h, k)} and θ ∈ (θh,kl−1, θ h,k l ), θhzk ∈ A h rh,kl . Since the diagonal of a d-dimensional hypercube with side length |h| has length √ dh, for each k ∈ N0, z ∈ Bhk , and l ∈ {1, . . . , χ(h, k)}, |θz − hzrh,kl | ≤ |θz − θhzk| + |θhzk − hzrh,kl | ≤ √ d|h|, (5.2.7) 5.2. Statement of main results 139 for all θ ∈ (θh,kl−1, θ h,k l ). Set ζ i j h,k = ∫ Bhk ziz jπ1(dz), θ¯h,kl = ∫ θh,kl θ h,k l−1 (1 − θ)dθ and define the operator Ihδϕ(x) =: ∞∑ k=0 χ(h,k)∑ l=1 θ¯ h,k l d∑ i, j=1 ζ i j h,kδh,iδ−h, jϕ(x + hzrh,kl ), where there are only a finite number of non-zero terms in the infinite sum over k. Set Ih = Ih δ + Ih δc and introduce the martingales ph,k,it = ∫ ]0,t] ∫ Bhk ziq(dt, dz), p¯h,kt = q(B¯ h k , ]0, t]). Moreover, set θ˜ h,k l := θ h,k l+1 − θ h,k l . Let T ≥ 1 be an integer and set τ = T/T and tn = nτ for i ∈ {0, 1, . . . ,T }. For any F-martingale (pt)t≤T , we use the notation ∆pn+1 := ptn+1 − ptn . Define recursively the ℓ2(Gh)-valued random variables (uˆ h,τ n )Tn=0 by uˆh,τn (x) =uˆ h,τ n−1(x) + ( (Lhtn−1 + I h)uˆh,τn−1(x) + ftn−1(x) ) τ + ∞∑ ϱ=1 (Nϱ;htn−1 uˆ h,τ n−1(x) + g ϱ tn−1(x))∆w ϱ n + ∞∑ k=0 d∑ i=1   χ(h,k)∑ l=1 θ˜ h,k l δh,iuˆ h,τ n−1(x + hzrh,kl )   ∆p h,k,i n + ∫ Rd otn−1(x, z)q(]tn−1, tn], dz) + ∞∑ k=0 ( uˆh,τn−1(x + hzk) − uˆ h,τ n−1(x) ) ∆p¯h,kn , n ∈ {1, . . . ,T }, (5.2.8) with initial condition uˆh,τ0 (x) = φ(x), x ∈ Gh It is clear that uˆh,τn is Ftn-measurable for every n ∈ {0, 1, . . . ,T }. Define the operators L˜ht ϕ = d∑ i, j=0 ai jt δh,iδ−h, jϕ − π 1({|z| > δ})ϕ − d∑ i=1 ∫ δ<|z|≤1 ziπ1(dz)δhi ϕ and I˜hδcϕ = ∞∑ k=0 ϕ(x + hzk)ζh,k and note that L˜h+ I˜h δc +Iδ = Lh+Ih. On Gh, we also consider the following implicit-explicit 140 Chapter 5. A finite difference scheme for non-degenerate parabolic SIDEs discretization scheme of (5.1.1): vˆh,τn (x) =vˆ h,τ n−1(x) + ( (L˜htn + I h δ )vˆ h,τ n (x) + I˜ h δcv h,τ n−1(x) + ftn(x) ) τ + 1n>1 ∞∑ ϱ=1 (Nϱ;htn−1 vˆ h,τ n−1(x) + g ϱ tn−1(x))∆w ϱ n + 1n>1 ∞∑ k=0 d∑ i=1   χ(h,k)∑ l=1 θ˜ h,k l δh,ivˆ h,τ n−1(x + hzrh,kl )   ∆p h,k,i n + ∫ Rd otn−1(x, z)q(]tn−1, tn], dz) + 1n>1 ∞∑ k=0 ( vˆh,τn−1(x + hzk) − vˆ h,τ n−1(x) ) ∆p¯h,kn , n ∈ {1, . . . ,T }, (5.2.9) with initial condition vˆh,τ0 (x) = φ(x), x ∈ Gh, where 1n>1 = 0 if n = 1 and 1n>1 = 1 if n ≥ 2. A solution (vˆ h,τ n )Mn=0 of (5.2.9) is understood as a sequence of ℓ2(Gh)-valued random variables such that vˆ h,τ n is Ftn-measurable for every n ∈ {0, 1, . . . ,M} and satisfies (5.1.1). Remark 5.2.6. Under Assumptions 5.2.2 and 5.2.3, for m > 2 + d/2, by virtue of the embedding Hm−2 ↪→ ℓ2(Gh), the free-terms f , g, and o(z) are continuous ℓ2(Gh) valued processes, and consequently the above schemes make sense. Moreover, for 0 < |h| < 1, there is a constant N independent of h such that - ∥ϕ∥ℓ2(Gh) ≤ N∥ϕ∥m−2. (5.2.10) Assumption 5.2.5. The parameters h , 0 and T are such that d τ h2 < κ − ς(δ) ( 2 ( supt,x,ω ∑d i, j=1 |a i j t (x)|2 )1/2 + ς1(δ) )2 . (5.2.11) Remark 5.2.7. We have assumed that the coefficients ai jt were bounded uniformly in ω, t, and x, so that quantity in in denominator (5.2.11) is well-defined. The following are our main theorems. Theorem 5.2.8. Let Assumptions 5.2.1(m) through 5.2.4(m) hold for some m > 2 + d2 and let Assumption 5.2.5 hold. Let u be the solution of (5.1.1) and let (uˆh,τn )Tn=0 be defined by (5.2.8). Then there is a constant N = N(d,m, κ,K,T,C, λ, κ2m, δ) such that for any real number h with 0 < |h| < 1, E [ max 0≤n≤T sup x∈Gh |utn(x) − uˆ h,τ n (x)| 2 ] + E [ max 0≤n≤T ∥utn − uˆ h,τ n ∥ 2 ℓ2(Gh) ] ≤ N ( |h|2 + τ ) . 5.3. Simulation 141 Theorem 5.2.9. Let Assumptions 5.2.1(m) through 5.2.4(m) hold for some m > 2 + d2 and let u be a solution of (5.1.1). There exists a constant R = R(d,m, κ,K, δ) such that if T > R, then there exists a unique solution (vˆh,τn )Tn=0 of (5.2.9) and a constant N = N(d,m, κ,K,T,C, λ, κ2m, δ) such that for any real number h with 0 < |h| < 1, E [ max 0≤n≤T sup x∈Gh |utn(x) − vˆ h,τ n (x)| 2 ] + E [ max 0≤n≤T ∥utn − vˆ h,τ n ∥ 2 ℓ2(Gh) ] ≤ N ( |h|2 + τ ) . Remark 5.2.10. In fact, with more regularity, as consequence of Corollaries 5.5.3 and 5.5.4, we can obtain stronger error estimates with difference operators in the error norms. This is an immediate consequence of the approach we use to obtaining these estimates. 5.3 Simulation Let us consider finite difference approximations for the following SIDE on [0,T ] × Rd: dut(x) = (( σ¯21 2 + σ¯22 2 ) ∂21ut(x) + ∫ R (ut(x + z) − ut(x) − ∂1ut(x)z)π(dz) ) dt + σ¯2∂1ut(x)dwt + ∫ R (u(x + z) − u(x)) q(dt, dz), u0(x) = 1 √ 2πσ¯0 exp ( − x2 σ¯21σ¯ 2 0 ) , (5.3.1) where π(dz) = c− exp (−β−z) dz|z|1+α− 1(−∞,0)(z) + c+ exp (−β+z) dz |z|1+α+ 1(0,∞)(z). It is easily veri- fied that for (t, x) ∈ [0,T ] × Rd, vt(x) = 1 √ π(2σ¯20 + 4t) exp ( x2 σ¯21(σ¯ 2 0 + 2t) ) solves dvt(x) = σ¯21 2 ∂21vt(x)dt, v0(x) = 1 √ 2πσ¯0 exp ( − x2 σ¯21σ¯ 2 0 ) . Moreover, applying Itô’s formula, we find that ut(x) = vt ( x + σ¯2wt + ∫ Rd zq(dt, dz) ) (5.3.2) solves (5.3.1). Thus, we can compare our finite difference approximations with (5.3.2). In our numerical simulations, we used MATLAB 2013a and made the following pa- 142 Chapter 5. A finite difference scheme for non-degenerate parabolic SIDEs rameter specification: σ¯1 = 1 2 , σ¯2 = 1 4 , σ¯0 = 1 2 , c− = c+ = 1, β− = β+ = 1, α− = α+ = 1.1, T = 1. We also made a few practical simplifications. Both the explicit and implicit-explicit ap- proximations were assumed to take the value zero on (−∞, 8] ∪ [8,∞). We also restricted the support of π(dz) to [−3, 3]. We would like to investigate the associated error with these reductions in the future. Regarding the first reduction, we mention that a good heuristic is to choose the size of domain according to the support of the free terms and the exit time of the diffusion for which the drift of the SIDE (up to zero order terms) is the infinitesimal generator of. In fact, it is more than a heuristic and we aim to address this in a future work. In our simulation, we took δ = 1100 . It follows that κ = σ¯ 2 1 = 1 2 and ς(δ) = c− ∫ δ 0 exp(−β−z)z1−α−dz + c+ ∫ δ 0 exp(−β+z)z1−α+dz + z = c−β α−−2 − γ(2 − α−, β−δ) + c+β α+−2 + γ(2 − α+, β+δ) ≈ 0.0082, where γ(η, z) denotes the lower incomplete gamma function. Thus, the right-hand-side of (5.2.11) is approximately 1.0559, and hence we can always set τ = h2. The quan- tities ζ11h,k, ζ¯h,k, and ξ 1 h,k can all be calculated using MATLAB’s built-in upper and lower incomplete gamma functions, or by implementing an appropriate numerical integration procedure. The calculation of θh,kl , θ¯ h,k l , and θ˜ h,k l are all straightforward in one-dimension. Some more thought would need to spent on how to calculate these quantities in higher dimensions. Of course as an alternative, one could set δ = h2 , but then the schemes are not guaranteed to converge as h tends to zero. This is the drawback of taking δ = h2 and not including the additional terms in Iδ (see the paragraph at the bottom of page 1620 in [CV05]). It does seem that the method we propose to discretise Iδ is novel in this respect. In our error analysis, we have considered h ∈ {2−2, 2−3, 2−4, 2−5, 2−6, 2−7} and τ = h2. The term ∫ |z|>δ (ut(x + z) − ut(x)) π(dz) in the drift of (5.3.1) can be cancelled with the compensator of the compensated Poisson random measure term. We get a similar cancellation in the corresponding finite difference equations, and thus we can replace p¯h,kt = q(B¯ h k , ]tn, tn+1]) with pˆ h,k t = p(B¯ h k , ]tn, tn+1]) in the explicit 5.2.8 and implicit-explicit (5.2.9) scheme. In order to simulate ∆ph,kn = ∫ ]tn,tn+1] ∫ Bhk zq(dt, dz), pˆh,kt = p(B¯ h k , ]tn, tn+1]), 5.3. Simulation 143 for the finest time step size τ = 2−14, we used the algorithm discussed in Section 4 of [KM11]. In this algorithm, a parameter ϵ is chosen for which the process ∆ph,0n =∫ ]0,t] ∫ |z|<ϵ zq(dt, dz) is approximated by a Wiener process with infinitesimal variance ∫ |z|<ϵ z2π(dz). We chose the parameter ϵ = 2−8, which is one-half times the smallest step size h under consideration in our error analysis. The process ∫ ]0,t] ∫ |z|>ϵ zq(dt, dz) = ∫ ]0,t] ∫ |z|>ϵ zp(dt, dz) (we have used symmetry of the measure π(dz))) is a compound Poisson process with jump intensity λ := 2 ∫ 3 ϵ π(dz) ≈ 68.9676 and jump-size density f¯ (z) = 1 λ ( c− exp (−β−z) dz |z|1+α− 1(−3,2−8)(z) + c+ exp (−β+z) dz |z|1+α+ 1(2−8,3)(z) ) . The underlying Poisson process was simulated using MATLAB’s built-in Poisson random variable generator; of course there are other simple methods that one can use as an alter- native (e.g. exponential times or uniform times for fixed number of jumps). We sampled random variables from the density f¯ by sampling the positive and negative parts separately and using an acceptance-rejection algorithm with a Pareto random variable. We refer to [KM11] for more details. Once we simulated the point process on [0,T ]× [−3,−ϵ]∪[ϵ, 3], we then computed ∫ ]0,t] ∫ |z|>ϵ zp(dt, dz). In order to compute pˆh,kt = p(B¯ h k , ]tn, tn+1]), we ran a histogram with the intervals B¯hk . The quantity ∆ph,kn = ∫ ]tn,tn+1] ∫ Bhk zq(dt, dz) is zero for k , 0 when h < δ2 (for h ∈ {2−2, 2−3, 2−4, 2−5}) since Bhk = ∅ for k , 0 when h < δ 2 . For h ∈ {2 −6, 2−7}, ∆ph,kn is non-zero for k ∈ {−1, 0, 1}. A similar analysis holds for the quantity ζ11h,k. As mentioned above, we set pˆh,0t equal to the Weiner process approximating the small jumps. To compute∫ ]tn,tn+1] ∫ Bhk zq(dt, dz) for k ∈ {−1, 1} in the case h ∈ {2−6, 2−7}, we summed the jump sizes in their respective bins and compensated. To obtain the above quantities for coarser time step sizes, we cumulatively summed the finer increments and took the union of jump sizes. Lastly, we made use of the Fast Fourier Transform to compute terms of the form ∞∑ k=0 ϕ(x + hzk)∆ pˆ h,k n , which would be quite computationally expensive otherwise. In our error analysis, we ran 3000 simulations of the explicit and implicit-explicit schemes on 30 CPUs and computed the following quantities: By our main theorems and the relation τ = h2, these errors should proportional to h (i.e. O(h)). This is precisely what we observe in Figure 5.1. The slight bump down at the finest two spatial step-sizes h ∈ {2−6, 2−7} is most likely due to the increase in the number 144 Chapter 5. A finite difference scheme for non-degenerate parabolic SIDEs -7 -6 -5 -4 -3 -2−6 −5.5 −5 −4.5 −4 −3.5 −3 −2.5 −2 −1.5 −1 −0.5 0 0.5 1 log2(h = τ2) log 2(e rro r(h )) Rate of convergence (3000 simulations) √∑3000m=1 max0≤n≤T supx∈Gh |utn(x) − uˆh,τn (x)|2√∑3000m=1 max0≤n≤T ∥utn − uˆh,τn ∥2ℓ2(Gh)√∑3000m=1 max0≤n≤T supx∈Gh |utn(x) − vˆh,τn (x)|2√∑3000m=1 max0≤n≤T ∥utn − vˆh,τn ∥2ℓ2(Gh)Line of slope 1 Figure 5.1 Simulated errors with respect to the space discretization and a line as reference slope on a log2 scale . of terms in the approximation of Ih δ (three to be precise) and the analogous small jump term in the noise. 5.4 Auxiliary results In this section, we present some results that will be needed for the proof of Theorems 5.2.8 and 5.2.9. Introduce the operators Iδ;h(z)ϕ(x) := ∞∑ k=0 1Bhk (z) χ(h,k)∑ l=1 d∑ i=1 θ˜ h,k l z iδh,iϕ(x + hzrh,kl ), Iδ c;h(z)ϕ(x) := ∞∑ k=0 1B¯hk (z)(ϕ(x + hzk) − ϕ(x)), Ih(z)ϕ(x) := Iδ;h(z)ϕ(x) + Iδ c;h(z)ϕ(x). Consider the following explicit and implicit-explicit schemes in H0: uh,τn =u h,τ n−1 + ( (Lhtn−1 + I h)uh,τn−1 + ftn−1 ) τ + ∞∑ ϱ=1 (Nϱ;htn−1u h,τ n−1 + g ϱ tn−1)∆w ϱ n + ∫ Rd ( Ih(z)uh,τn−1 + otn−1(z) ) q(dz, ]tn−1, tn]), n ∈ {1, . . . ,T }, (5.4.1) 5.4. Auxiliary results 145 and vh,τn =v h,τ n−1 + ( (L˜htn + I h δ )v h,τ n + I˜ h δcv h,τ n−1 + ftn ) τ + 1n>1 ∞∑ ϱ=1 (Nϱ;htn−1v h,τ n−1 + g ϱ tn−1)∆w ϱ n + 1n>1 ∫ Rd ( Ih(z)vh,τn−1 + otn−1(z) ) q(dz, ]tn−1, tn]), n ∈ {1, . . . ,T }, (5.4.2) with initial condition uh,τ0 (x) = v h,τ 0 (x) = φ(x), x ∈ R d. We now prove some lemmas that will help us to establish the consistency of our approxi- mations. The following lemma is well-known and we omit the proof (see, e.g., [GK10]). Lemma 5.4.1. For each integer m ≥ 0, there is a constant N = N(d,m) such that for all u ∈ Hm+2 and v ∈ Hm+3, ∥δh,iu − ∂iu∥m ≤ 1 2 |h|∥u∥m+2, ∥δh,iδ−h, jv − ∂i jv∥m ≤ N|h|∥v∥m+3. Lemma 5.4.2. For each integer m ≥ 0, there is a constant N = N(d,m, δ) such that for all u ∈ Hm+3, we have ∥Iu − Ihu∥m ≤ N|h|∥u∥m+3. (5.4.3) Proof. It suffices to show (5.4.3) for u ∈ C∞c (R d). We begin with m = 0. A simple calculation shows that Iδcu(x) − Ihδcu(x) = ∞∑ k=0 ∫ B¯hk ∫ 1 0 d∑ i=1 (zi − hzik)∂iu(x + hzk + θ(z − hzk))dθπ 1(dz) − ∞∑ k=0 ∫ B¯hk∩[−1,1] d∑ i=1 zi(∂iu(x) − δhi u(x))π 1(dz). By Minkowski’s inequality, we get ∥Iδcu − Ihδcu∥0 ≤ ∞∑ k=0 ∫ B¯hk d∑ i=1 |zi − hzik|∥∂iu∥0π 1(dz) + ∞∑ k=0 ∫ B¯hk∩[−1,1] d∑ i=1 |zi|∥∂iu(x) − δhi u(x)∥0π 1(dz) ≤ N|h|∥u∥3 + N d∑ i=1 ∥∂iu(x) − δhi u(x)∥0, since |z − hzk| ≤ |h| √ d/2 and (5.2.3) holds. Thus, by Lemma 5.4.1, we have ∥Iδcu − Ihδcu∥0 ≤ N |h|∥u∥3. (5.4.4) 146 Chapter 5. A finite difference scheme for non-degenerate parabolic SIDEs We also have Iδu(x) − Ihδu(x) = ∞∑ k=0 ∫ Bhk χ(h,k)∑ l=1 ∫ θh,kl θ h,k l−1 d∑ i, j=1 ziz j ( ∂i ju(x + θz) − δh,iδ−h, ju(x + hzrh,kl ) ) (1 − θ)dθπ1(dz). (5.4.5) Note that ∂i ju(x + θz) − δh,iδ−h, ju(x + hzrh,kl ) = ∂i ju(x + θz) − ∂i ju(x + hzrh,kl ) + ∂i ju(x + hzrh,kl ) − δh,iδ−h, ju(x + hzrh,kl ) = ∫ 1 0 d∑ q=1 ( θzq − hzq rh,kl ) ∂q∂i ju ( x + hzrh,kl + ρ(θz − hzrh,kl ) ) dρ +∂i ju(x + hzrh,kl ) − δh,iδ−h, ju(x + hzrh,kl ). By (5.2.7), we have |θzq − hzq rh,kl | ≤ N |h|. Hence, substituting the above relation in (5.4.5), using Minkowski’s inequality, (5.2.2), and Lemma 5.4.1, we obtain ∥Iδu − Ihδu∥0 ≤ |h|N∥u∥3. (5.4.6) Combining (5.4.4) and (5.4.6), we have (5.4.3) for m = 0. The case m > 0 follows from the case m = 0, since for a multi-index γ, we have ∂γ(Iu − Ihu) = I∂γu − Ih∂γu. □ Lemma 5.4.3. For each integer m ≥ 0, there is a constant N = N(d,m, δ), there is a constant such that for all u ∈ Hm+2, we have ∫ Rd ∥Ih(z)u − I(z)u∥2mπ 2(dz) ≤ N |h|2∥u∥2m+2. (5.4.7) Proof. It suffices to prove the lemma for u ∈ C∞c (R d) and m = 0. We have Iδ(z)u(x) − Iδ;h(z)u(x) = ∞∑ k=0 1Bhk (z) χ(h,k)∑ l=1 ∫ θh,kl θ h,k l−1 d∑ i=1 zi(∂iu(x + θz) − δh,iu(x + hzrh,kl ))dθ. 5.4. Auxiliary results 147 Notice that ∂iu(x + θz) − δh,iu(x + hzrh,kl ) = ∫ 1 0 d∑ i, j=1 ∂i ju(x + ρ(θz − hzrh,kl ))(θz j − hz j rh,kl )dρ +∂iu(x + hzrh,kl ) − δh,iu(x + hzrh,kl ). Thus, by Remark 5.2.7 and Lemma 5.4.1, we get ∥Iδ;h(z)u − Iδ(z)u∥20 ≤ 1|z|≤δ|z| 2N|h|2∥u∥22, and hence by (5.2.2), we obtain ∫ Rd ∥Iδ;h(z)u − Iδ(z)u∥20π 2(dz) ≤ N |h|2∥u∥22. (5.4.8) We also have |Iδ c (z)u(x) − Iδ c;h(z)u(x)| = ∞∑ k=0 1B¯hk (z)|u(x + z) − u(x + hzk)| ≤ ∞∑ k=0 1B¯hk (z) ∫ 1 0 d∑ i=1 |∂iu(x + hzk + ρ(z − hzk))∥zi − hzik|dρ. Consequently, ∥Iδ c;h(z)u − Iδ c (z)u∥20 ≤ 1|z|>δN |h| 2∥u∥21, which implies by (5.2.3) that ∫ Rd ∥Iδ c;h(z)u − Iδ c (z)u∥20π 2(dz) ≤ N |h|2∥u∥21. (5.4.9) Combining (5.4.9) and (5.4.8), we have (5.4.7) for m = 0. The case m > 0 follows from the case m = 0, since for a multi-index γ, we have ∂γ(Iu − Ihu) = I∂γu − Ih∂γu. □ Lemma 5.4.4. If Assumption 5.2.1(m) holds for some m ≥ 0, then for any ϵ ∈ (0, 1) there exists constants N1 = N1(d,m, κ,K, δ, ϵ) and N2 = N2(d,m, κ,K, δ, ϵ) such that for any u ∈ Hm, G(m)t (u) := 2(u,L h t u)m + ∥N h t u∥ 2 m,ℓ2 + 2(u, I hu)m + ∫ Rd ∥Ih(z)u∥2mπ 2(dz) 148 Chapter 5. A finite difference scheme for non-degenerate parabolic SIDEs ≤ −(κ − ς(δ) − ϵ) d∑ i=1 ∥δh,iu∥2m + N1∥u∥ 2 m, (5.4.10) and (u, L˜ht u)m + (u, I h δu)m ≤ −(κ − ς1(δ) − ϵ) d∑ i=1 ∥δh,iu∥2m + N2∥u∥ m n . (5.4.11) Proof. By virtue of Lemma 3.1 and Theorem 3.2 in [GK10], under Assumption 5.2.1, there is a constant N = N(d,m, κ) such that for any u ∈ Hm and ϵ > 0, 2(u,Lht u)m + ∥N h t u∥ 2 m,ℓ2 ≤ −(κ − ϵ) d∑ i=1 ∥δh,iu∥2m + N∥u∥ 2 m. Therefore, it suffices to show that there is a constant N = N(δ) such that for all u ∈ C∞c (R d), 2(u, Ihu)m + ∫ Rd ∥Ih(z)u∥2mπ 2(dz) ≤ ς(δ) d∑ i=1 ∥δh,iu∥2m + N∥u∥ 2 m. (5.4.12) We start with m = 0. Since (u, Ihδu)0 = ∞∑ k=0 ∫ Bhk χ(h,k)∑ l=1 d∑ i, j=1 θ¯ k,h l z iz j ∫ Rd δh,iδ−h, ju(x + hzrh,kl )u(x)dxπ 1(dz) and ∫ Rd δh,iδ−h, ju(x + hzrh,kl )u(x)dx = − ∫ Rd δh,iu(x + hzrh,kl )δh, ju(x)dx, by Hölder’s inequality, we get 2(u, Ihδu)0 ≤ ∫ |z|≤δ |z|2π1(dz) d∑ i=1 ∥δh,iu∥20 = ς1(δ) d∑ i=1 ∥δh,iu∥20. In addition, owing to Holder’s inequality and (5.2.6), we have 2(u, Ihδcu)0 = ∞∑ k=0 ∫ B¯hk ∫ Rd  u(x + hzk) − u(x) − 1[−1,1](z) d∑ i=1 ziδhi u(x)   u(x)dxπ 1(dz) ≤ 0. By Minkowski’s inequality, we have ∥Iδ;h(z)u∥2 ≤ ∞∑ k=0 1Bhk (z)|z| 2 d∑ i=1 ∥δh,iu∥20 and ∥I δc;h(z)u∥20 ≤ 4 ∞∑ k=0 1B¯hk (z)∥u∥ 2 0 and hence ∫ Rd ∥Ih(z)u∥20π 2(dz) ≤ ς2(δ) d∑ i=1 ∥δh,iu∥20 + 4π 1({|z| > δ})∥u∥20, 5.4. Auxiliary results 149 which proves (5.4.12) for m = 0. The case m > 0 follows by replacing u with ∂γu for |γ| ≤ m. This proves (5.4.10), which implies (5.4.11). □ Remark 5.4.5. It follows that for m ≥ 0, there is a constant N5 = N5(d,m,K, δ) such that for any u ∈ Hm, ∥Nϱ;ht u∥ 2 m + ∫ Rd ∥Ih(z)u∥2mπ 2(dz) ≤ N5 d∑ i=0 ∥δh,iu∥2m (5.4.13) ≤ N5 ( 1 + 4d h2 ) ∥u∥2m. (5.4.14) Lemma 5.4.6. For any m ≥ 0 and u ∈ Hm, ∥I˜hδcu∥ 2 m ≤ π 1({|z| > δ})2∥u∥2m. (5.4.15) Moreover, if Assumption 5.2.1 holds for some m ≥ 0, then for any ϵ > 0 and u ∈ Hm, ∥(Lht + I h)u∥2m ≤ (1 + ε) N3d h2 d∑ i=1 ∥δh,iu∥2m + N4 ( 1 + 1 h2 ) ∥u∥2m (5.4.16) where N3 :=   2  sup t,x,ω d∑ i, j=1 |ai j(x)|2   1/2 + ς1(δ)   2 and N4 is a constant depending only on d,m,K, δ, and ϵ. Proof. It suffices to prove the lemma for u ∈ C∞c (R d). It follows that (Lht + I h δ )u(x) = ∞∑ k=0 χ(h,k)∑ l=1 θ¯ h,k l d∑ i, j=1 ζˆ i j t,h,k(x)δh,iδ−h, ju(x + hzrh,kl ) + d∑ i, j=0 i or j=0 ai jt δh,iδ−h, ju(x) where ζˆ i jt,h,k(x) := ζ i j h,k for k , 0 and ζˆ i j t,h,0(x) := ζ i j h,0 + 2a i j t (x) (recall that θ¯ h,0 1 = 1 2 and χ(h, 0) = 1). Moreover, for each multi-index γ with 1 ≤ |γ| ≤ m, ∂γ(Lht + I h δ )u(x) = ∞∑ k=0 χ(h,k)∑ l=1 θ¯ h,k l d∑ i, j=1 ζˆ i j h,k(x)δh,iδ−h, j∂ γu(x + hzrh,kl ) + ∑ {β : β<γ} N(β, γ) d∑ i, j=1 ( ∂γ−βai jt (x) ) δh,iδ−h, j∂ βu(x) + ∑ {β : β≤γ} N(β, γ) d∑ i, j=0 i or j=0 (( ∂γ−βai jt (x) ) δh,iδ−h, j∂ βu(x) ) 150 Chapter 5. A finite difference scheme for non-degenerate parabolic SIDEs =: (A1(γ) + A2(γ) + A3(γ))u(x), where N(β, γ) are constants depending only on β and γ. By Young’s inequality and Jensen’s inequality, for any ϵ ∈ (0, 1), we have ∥(Lht + I h)u∥2m ≤ (1 + ϵ) ∑ |γ|≤m ∥A1(γ)u∥20 +3 ( 1 + 1 ϵ )  ∑ |γ|≤m ( ∥A2(γ)u∥20 + ∥A3(γ)u∥ 2 0 ) + ∥Ihδcu∥ 2 m   . Applying Minkowski’s inequality and the Cauchy-Bunyakovsky-Schwarz inequality and noting that ∑χ(h,k) l=1 θ¯ h,k l = 1 2 and ||δh,i∂βu||0 ≤ 2 h ||∂βu||0; ∀i ∈ {0, 1 . . . , d}, ∀|β| = m, we obtain ∥A1(γ)u∥0 ≤ ∞∑ k=0 χ(h,k)∑ l=1 θ¯ h,k l  sup t,x,ω d∑ i, j=1 ∣∣∣ζˆ i j h,k(x) ∣∣∣ 2   1/2   d∑ i, j=1 ∣∣∣∣ ∣∣∣∣δh,iδ−h, ju(· + hzrh,kl ) ∣∣∣∣ ∣∣∣∣ 2 m   1/2 ≤ √ d h ∞∑ k=0  sup t,x,ω d∑ i, j=1 |ζˆ i jh,k(x)| 2   1/2   d∑ i=1 ∥δh,i∂γu∥20   1/2 and ∞∑ k=0  sup t,x,ω d∑ i, j=1 |ζˆ i jh,k(x)| 2   1/2 =  sup t,x,ω d∑ i, j=1 ∣∣∣∣∣∣ ∫ Bh0 ziz jπ1(dz) + 2ai jt (x) ∣∣∣∣∣∣ 2   1/2 + ∞∑ k=1   d∑ i, j=1 ∣∣∣∣∣∣ ∫ Bhk ziz jπ1(dz) ∣∣∣∣∣∣ 2   1/2 ≤ 2  sup t,x,ω d∑ i, j=1 |ai jt (x)| 2   1/2 + ς(δ). Thus, ∑ |γ|≤m ||A1(γ)u||20 ≤ N3d h2 d∑ i=1 ∥∂h,iu∥2m. Another application of the Cauchy-Bunyakovsky-Schwarz inequality and Minkowski’s inequality, combined with the inequalities ||δh,i∂βu||0 ≤ ||∂i∂βu||0 ∀i ∈ {0, 1 . . . , d}, ∀|β| ≤ m − 1, ||δh,iδ−h, j∂βu||0 ≤ ||∂i j∂βu||0, ∀; i, j ∈ {1, . . . , d}, ∀|β| ≤ m − 2, 5.4. Auxiliary results 151 and ||δh,iδ−h, j∂βu||0 ≤ 2 h ∥δh,iu∥m, ∀; i, j ∈ {1, . . . , d}, ∀|β| = m − 1, yields ∑ |γ|≤m ( ∥A2(γ)u∥20 + ∥A3(γ)∥ 2 0 ) ≤ N ( 1 + 1 h2 ) ||u||2m . By Minkowski’s integral inequality, we have ∥Ihδcu∥m ≤ ∫ Rd ∞∑ k=0 1 B h k ∥u(· + hzk) − u − 1[−1,1](z) d∑ i=1 ziδh,iu∥mπ1(dz) ≤ 3  π 1({|z| > δ}) + 2d ∫ δ<|z|≤1 |z|π1(dz) h   ∥u∥m. It is also easy to see that (5.4.15) holds. Combining above inequalities, we obtain (5.4.16). □ The following theorem establishes the stability of the explicit approximate scheme (5.4.1). Theorem 5.4.7. Let Assumption 5.2.1 hold with m ≥ 0 and Assumption 5.2.5 hold. Let F i ∈ Hm for i ∈ {0, ..., d}, G ∈ Hm(ℓ2), and R ∈ Hm(π2). Consider the following scheme in Hm: uh,τn = u h,τ n−1 + ∫ ]tn−1,tn]  (L h tn−1 + I h)uh,τn−1 + d∑ i=0 δh,iF i t   dt + ∫ ]tn−1,tn] ( Nϱ;htn−1u h,τ n−1 + G ϱ t ) dwϱt + ∫ ]tn−1,tn] ∫ Rd ( Ih(z)uh,τn−1 + Rt(z) ) q(dt, dz), n ∈ {1, . . . ,T }, for any Hm−valued F0−measurable initial condition φ. If ϕ ∈ Hm(F0), then there is a constant N = N(d,m, κ,K,T, δ) such that E [ max 0≤n≤T ∥uh,τn ∥ 2 m ] + E T∑ n=0 τ d∑ i=0 ∥δh,iuh,τn ∥ 2 m ≤ NE [ ∥φ∥2m ] +NE ∫ T 0 ( d∑ i=0 ∥F it∥ 2 m + ∥Gt∥ 2 m + ∫ Rd ∥Rt(z)∥2mπ 2(dz) ) dt. (5.4.17) Proof. If E [ ∥φ∥2m ] < ∞, then proceeding by induction on n and using Young’s and Jensen’s inequality, Itô’s isometry, (5.4.16), and (5.4.14), we get that for all n ∈ {0, 1, . . . ,T }, E [ ∥uh,τn ∥2m ] < ∞. Applying the identity ∥y∥2m − ∥x∥ 2 m = 2(x, y − x)m + ∥y − x∥ 2 m, x, y ∈ H m, 152 Chapter 5. A finite difference scheme for non-degenerate parabolic SIDEs for each n ∈ {1, . . . ,T }, we obtain ∥uh,τn ∥ 2 m = ∥u h,τ n−1∥ 2 m + 6∑ i=1 Ii(tn), (5.4.18) where I1(tn) := 2τ(u h,τ n−1, ( Lhtn−1 + I h ) uh,τn−1)m + ∥η(tn)∥ 2 m, I2(tn) := 2 ∫ ]tn−1,tn] d∑ i=0 (uh,τn−1, δh,iF i t)mdt, I3(tn) := ∣∣∣∣∣∣∣ ∣∣∣∣∣∣∣ τ ( Lhtn−1 + I h ) uh,τn−1 + ∫ ]tn−1,tn] d∑ i=0 δh,iF i tdt ∣∣∣∣∣∣∣ ∣∣∣∣∣∣∣ 2 m , I4(tn) := 2 ∫ ]tn−1,tn] ( uh,τn−1,N ϱ;h tn−1u h,τ n−1 + G ϱ t ) m dwϱt , I5(tn) := 2 ∫ ]tn−1,tn] ∫ Rd ( uh,τn−1,I h(z)uh,τn−1 + Rt(z) ) m q(dt, dz), I6(tn) := 2 ( τ(Lhtn−1 + I h)uh,τn−1, η(tn) ) m + 2   ∫ ]tn−1,tn] d∑ i=0 δh,iF i tdt, η(tn)   m , and where η(tn) := ∫ ]tn−1,tn] ( Nϱ;htn−1u h,τ n−1 + G ϱ t ) dwϱt + ∫ ]tn−1,tn] ∫ Rd ( Ih(z)uh,τn−1 + Rt(z) ) q(dt, dz). By virtue of Assumption 5.2.5, we fix q˜ > 0 and ϵ > 0 small enough such that q := κ − ς(δ) − ϵ − (1 + ϵ)(1 + q˜)N3d τ h2 − q˜ > 0, where N3 is the constant in (5.4.6). Since the two stochastic integrals that define η are orthogonal square-integrable martingales, by Young’s inequality and (5.4.13), for all q > 0, E [ ∥η(tn)∥2m ] ≤ E [ τ∥Nhtn−1u h,τ n−1∥ 2 m,ℓ2 ] + Eτ ∫ Rd ∥Ih(z)uh,τn−1∥ 2 mπ 2(dz) + qEτ d∑ i=0 ∥δh,iu h,τ n−1∥ 2 m + ( 1 + N5 q ) E ∫ ]tn−1,tn] ( ∥Gt∥2m,ℓ2 + ∫ Rd ∥Rt(z)∥2mπ 2(dz) ) dt. (5.4.19) Thus, taking q = q˜3 in (5.4.19), we have EI1(tn) ≤ EτG (m) tl−1(u h,τ l−1) + q˜ 3 Eτ d∑ i=0 ∥δh,iu h,τ n−1∥ 2 m 5.4. Auxiliary results 153 + ( 1 + 3N5 q˜ ) E ∫ ]tn−1,tn] ( ∥Gt∥2m,ℓ2 + ∫ Rd ∥Rt(z)∥2mπ 2(dz) ) dt. Using (5.2.5) and Young’s inequality, we obtain EI2(tn) ≤ q˜ 3 Eτ d∑ i=0 ∥δh,iu h,τ n−1∥ 2 m + 3 q˜ E ∫ ]tn−1,tn] d∑ i=0 ∥F it∥ 2 mdt. An application of Young’s inequality and (5.4.16) yields EI3(tn) ≤ (1 + ϵ)(1 + q˜)N3d τ h2 Eτ d∑ i=1 ∥δh,iu h,,τ n−1∥ 2 m + (1 + q˜)N4 ( τ + τ h2 ) E [ τ∥uh,τn−1∥ 2 m ] +(d + 1) ( 1 + 1 q˜ ) E ∫ ]tn−1,tn]  τ∥F 0 t ∥ 2 m + 4dτ h2 d∑ i=1 ∥F it∥ 2 m   dt. Making use of the estimate (5.4.14) and noting that E∥uh,τn ∥2m < ∞, G ∈ H m(ℓ2), and R ∈ Hm(π2), we obtain EI4(tn) = EI5(tn) = 0. Moreover, as (Lhtn−1 + I h)uh,τn−1 is Ftn−1- measurable and E(η(tn)|Ftn−1) = 0, the expectation of first term in I6(tn) is zero, and hence by Young’s inequality, for any q1 > 0, EI6(tn) ≤ q1E [ ∥η(tn)∥2m ] + 1 q1 E ∣∣∣∣∣∣∣ ∣∣∣∣∣∣∣ ∫ ]tn−1,tn] d∑ i=0 δh,iF i tdt ∣∣∣∣∣∣∣ ∣∣∣∣∣∣∣ 2 m . Moreover, by Jensen’s inequality, (5.4.19), and (5.4.13), for any q1 > 0 and q > 0, EI6(tn) ≤ (q1q + q1N5)Eτ d∑ i=0 ∥δh,iu h,τ n−1∥ 2 m +E ∫ ]tn−1,tn]   (d + 1)τ q1 ∥F0t ∥ 2 m + 4d(d + 1)τ q1h2 d∑ i=1 ∥F it∥ 2 m   dt +q1 ( 1 + N5 q ) E ∫ ]tn−1,tn] ( ∥Gt∥2m,ℓ2 + ∫ Rd ∥Rt(z)∥2mπ 2(dz) ) dt. We choose q and q1 such that q1q + q1N5 ≤ q˜/3. Thus, owing to (5.4.10), we have EG(m)tn−1(u h,τ n−1) + ( q˜ + (1 + ϵ)(1 + q˜)N3d τ h2 ) Eτ d∑ i=1 ∥δh,iu h,τ n−1∥ 2 m ≤ −qEτ d∑ i=1 ∥δh,iu h,τ n−1∥ 2 m + N1E [ τ||uh,τn−1|| 2 m ] . Taking the expectation of both sides of (5.4.18), summing-up, and combining the above 154 Chapter 5. A finite difference scheme for non-degenerate parabolic SIDEs inequalities and identities, we find that there is a constant N = N(d,m, κ,K, δ) such that for all n ∈ {0, 1, . . . ,T }, E [ ∥uh,τn ∥ 2 m ] ≤ E [ ∥φ∥2m ] − qE n∑ l=1 τ d∑ i=1 ∥δh,iu h,τ l−1∥ 2 m + ( N1 + q˜ + (1 + q˜)N4 ( τ + τ h2 )) E n∑ l=1 τ∥uh,τl−1∥ 2 m +N ( τ + τ h2 ) E ∫ ]0,tn] d∑ i=0 ∥F it∥ 2 mdt + NE ∫ ]0,tn] ( ∥Gt∥2m,ℓ2 + ∫ Rd ∥Rt(z)∥2mπ 2(dz) ) dt. Therefore, by discrete Gronwall’s inequality, there is a constant N = N(d,m, κ,K,T, δ) such that E [ ∥uh,τn ∥ 2 m ] + E n∑ l=0 τ d∑ i=0 ∥δh,iu h,τ l ∥ 2 m ≤ NE [ ∥φ∥2m ] +NE ∫ ]0,T ]   d∑ i=0 ∥F it∥ 2 m + ∥Gt∥ 2 m,ℓ2 + ∫ Rd ∥Rt(z)∥2mπ 2(dz)   dt. (5.4.20) Now that we have proved (5.4.20), we will show (5.4.17). Estimating as we did above, we get that there is a constant N such that E max 0≤n≤T n∑ l=1 (I1(tl) + I2(tl) + I3(tl) + I6(tl)) ≤ NE T−1∑ l=0 τ d∑ i=0 ∥δh,iu h,τ l ∥ 2 m +NE ∫ ]0,T ]   d∑ i=0 ∥F it∥ 2 m + ∥Gt∥ 2 m,ℓ2 + ∫ Rd ∥Rt(z)∥2mπ 2(dz)   dt. Applying the Burkholder-Davis-Gundy inequality and Young’s inequality, we obtain E max 0≤n≤T n∑ l=1 I5(tl) ≤ 6E ∣∣∣∣∣∣∣ n∑ l=1 ∫ ]tn−1,tn] ∫ Rd ( uh,τn−1,I h(z)uh,τn−1 + Rt(z) )2 m π2(dz)dt ∣∣∣∣∣∣∣ 1/2 ≤ 1 4 E [ max 0≤n≤T ∥uh,τn ∥ 2 m ] + N  E T−1∑ l=0 τE∥δh,iu h,τ l ∥ 2 m + E T−1∑ l=0 τE∥uh,τl ∥ 2 m   +NE ∫ ]0,T ] ∫ Rd ∥Rt(z)∥2mπ 2(dz)dt. We can estimate E max0≤n≤T ∑n l=1 I4(tl) in similar way. Combining the above E max0≤n≤T - estimates and (5.4.20), we obtain (5.4.17). □ The following theorem establishes the existence and uniqueness of a solution to (5.4.2) and the stability of the implicit-explicit approximation scheme. 5.4. Auxiliary results 155 Theorem 5.4.8. Let Assumption 5.2.1 hold with m ≥ 0. Let F i ∈ Hm for i ∈ {0, ..., d}, G ∈ Hm(ℓ2) and R ∈ Hm(π2). Then there exists a constant R = R(d,m, κ,K, δ) such that if T > R, then for any h , 0, there exists a unique Hm-valued solution (vh,τn )Tn=0 of vh,τn = v h,τ n−1 + ∫ ]tn−1,tn]  (L˜ h tn + I h δ )v h,τ n + I˜ h δcv h,τ n−1 + d∑ i=0 δh,iF i t   dt + ∫ ]tn−1,tn] ( 1n>1N ϱ;h tn−1v h,τ n−1 + G ϱ t ) dwϱt + ∫ ]tn−1,tn] ∫ Rd ( 1n>1Ih(z)v h,τ n−1 + Rt(z) ) q(dt, dz), (5.4.21) for n ∈ {1, . . . ,T }, for any Hm−valued F0−measurable initial condition φ. Moreover, if ϕ ∈ Hm(F0), then there is a constant N = N(d,m, κ,K,T, δ) such that E [ max 0≤n≤T ∥vh,τn ∥ 2 m ] + E T∑ n=0 τ d∑ i=0 ∥δh,ivh,τn ∥ 2 m ≤ NE [ ∥φ∥2m ] +NE ∫ T 0   d∑ i=0 ∥F it∥ 2 m + ∥Gt∥ 2 m + ∫ Rd ∥Rt(z)∥2mπ 2(dz)   dt. (5.4.22) Proof. For each n ∈ {1, . . . ,T }, we write (5.4.21) as Dnv h,τ n = yn−1, where Dn is the operator defined by Dnϕ := ϕ − τ ( L˜htn + I h δ ) ϕ and yn−1 := v h,τ n−1 + ∫ ]tn−1,tn]  I˜ h δcv h,τ n−1 + d∑ i=0 δh,iF i t   dt + ∫ ]tn−1,tn] ( 1n>1N ϱ;h tn−1v h,τ n−1 + G ϱ t ) dwϱt + ∫ ]tn−1,tn] ∫ Rd ( (1n>1Ih(z)v h,τ n−1 + Rt(z) ) q(dt, dz). Fix ϵ1 and ϵ2 in (0, 1) such that q1 := κ − ς1(δ) − ϵ1 > 0. and q2 := κ − ς(δ) − ϵ2 > 0. 156 Chapter 5. A finite difference scheme for non-degenerate parabolic SIDEs Owing to Lemma 5.4.6, there is a constant N = N(d,m,K, δ) such that for all ϕ ∈ Hm, ∥Dnϕ∥2m ≤ N ( 1 + τ2 ( 1 h2 + 1 h4 )) ∥ϕ∥2m. (5.4.23) Assume T > T N2. By (5.4.11), for all ϕ ∈ Hm, we have (ϕ,Dnϕ)m ≥ (1 − τN2)∥ϕ∥2m + q1τ d∑ i=1 ∥δh,iϕ∥2m ≥ (1 − τN2)∥ϕ∥ 2 m. (5.4.24) Using Jensen’s inequality and (5.4.15), we get ∥y0∥2m ≤ 5 ( 1 + π1({|z| > δ})2τ2 ) ∥ϕ∥2m + 20τ h2 ∫ ]0,t1] d∑ i=0 ∥F it∥ 2 mdt + 5 ∣∣∣∣∣∣ ∣∣∣∣∣∣ ∫ ]0,t1] Gϱt dw ϱ t ∣∣∣∣∣∣ ∣∣∣∣∣∣ 2 m + 5 ∣∣∣∣∣∣ ∣∣∣∣∣∣ ∫ ]0,t1] ∫ Rd Rt(z)q(dt, dz) ∣∣∣∣∣∣ ∣∣∣∣∣∣ 2 m . (5.4.25) Since φ ∈ Hm, F i ∈ Hm, i ∈ {0, 1, . . . , d}, G ∈ Hm(ℓ2), and R ∈ Hm(π2), it follows that y0 ∈ Hm. By (5.4.23), and (5.4.24), owing to Proposition 3.4 in [GM05] (p = 2), there exists a unique vh,τ1 in H m such that D1v h,τ 1 = y0, and moreover ∥vh,τ1 ∥ 2 m ≤ 1 + ∥y0∥2m (1 − τN2)2 < ∞. (5.4.26) Proceeding by induction on n ∈ {1, . . . ,T }, one can show that there exists a unique vh,τn in Hm such that Dnv h,τ n = yn−1, and moreover ∥vh,τn ∥ 2 m ≤ 1 + ∥yn−1∥2m (1 − τN2)2 < ∞. (5.4.27) Assume that E∥φ∥2m < ∞. By (5.4.25) and (5.4.26) and the fact that f i ∈ Hm, i ∈ {0, 1, . . . , d}, g ∈ Hm(ℓ2), and R ∈ Hm(π2), it follows that E∥v h,τ 1 ∥ 2 m < ∞. By Jensen’s inequality, (5.4.15), and (5.4.14), we have E [ ∥yn−1∥2m ] ≤ 7N ( 1 + π1({|z| > δ})2τ2 + 1n>1τ ( 1 + 1 h2 )) E [ ∥vh,τn−1∥ 2 m ] + 28τ h2 E ∫ ]0,t1] d∑ i=0 ∥F it∥ 2 mdt + 7E ∫ ]0,t1] ∥Gt∥2m,ℓ2dt + 7E ∫ ]0,t1] ∫ Rd ∥Rt(z)∥2mπ 2(dz)dt.(5 4.28) Proceeding by induction on n and combining (5.4.27) and (5.4.28), we obtain E [ ∥vh,τn ∥ 2 m ] < ∞, ∀n ∈ {0, 1, . . . ,T }. (5.4.29) 5.4. Auxiliary results 157 Applying the identity ∥y∥2m−∥x∥ 2 m = 2(x, y−x)m+∥y−x∥ 2 m, x, y ∈ H m, for any n ∈ {1, . . . ,T }, we have ∥vh,τn ∥ 2 m = ∥v h,τ n−1∥ 2 m + 6∑ i=1 Ii(tn), where I1(tn) := 2τ(v h,τ n , ( L˜htn + I h δ ) vh,τn )m + 2τ(v h,τ n−1, I˜ h δcv h,τ n−1)m + ∥η(tn)∥ 2 m, I2(tn) := 2 ∫ ]tn−1,tn] d∑ i=0 (uh,τn , δh,iF i t)mdt, I3(tn) := − ∣∣∣∣∣∣∣ ∣∣∣∣∣∣∣ τ ( L˜htn + I h δ ) vh,τn + d∑ i=0 ∫ [tn−1,tn] δh,iF i tdt ∣∣∣∣∣∣∣ ∣∣∣∣∣∣∣ 2 m + ∣∣∣ ∣∣∣I˜hδcv h,τ n−1 ∣∣∣ ∣∣∣ 2 m τ2, I4(tn) := 2 ∫ ]tn−1,tn] ( vh,τn−1, 1n>1N ϱ;h tn−1v h,τ n−1 + G ϱ t ) m dwϱt , I5(tn) := 2 ∫ ]tn−1,tn] ∫ Rd ( vh,τn−1, 1n>1I h(z)vh,τn−1 + Rt(z) ) m q(dt, dz), I6(tn) := ( τI˜hδcv h,τ n−1, η(tn) ) m , and where η(tn) := ∫ ]tn−1,tn] ( 1n>1N ϱ;h tn−1v h,τ n−1 + G ϱ t ) dwϱt + ∫ ]tn−1,tn] ∫ Rd ( 1n>1Ih(z)v h,τ n−1 + Rt(z) ) q(dt, dz). As in the proof Theorem 5.4.7, by Young’s inequality, (5.4.10), and (5.4.15), we have E [ ∥vh,τn ∥ 2 m ] ≤ ( 1 + 2π1({|z| > δ}) ) E [ ∥φ∥2m ] − q2E n∑ l=1 τ d∑ i=1 ∥δh,iv h,τ l ∥ 2 m + E n∑ l=1 τ ( N2 + 2π 1({|z| > δ}) + τπ1({|z| > δ})2 ) ∥vh,τl ∥ 2 m + NE ∫ ]0,tn]   d∑ i=0 ∥F it∥ 2 mdt + ∥Gt∥ 2 m,ℓ2 + ∫ Rd ∥Rt(z)∥2mπ 2(dz)   dt. Set Z := N2 + 2π 1({|z| > δ}), R := max   2π1({|z| > δ})2 √ Z2 + 4π1({|z| > δ}2 − Z ,N2   T. Assume T > R. Making use of (5.4.29) and applying discrete Gronwall’s lemma, we get 158 Chapter 5. A finite difference scheme for non-degenerate parabolic SIDEs that there exist a constant N(d,m,K, κ,T, δ) such that E [ ∥vh,τn ∥ 2 m ] + E n∑ l=1 τ d∑ i=0 ∥δh,iv h,τ l ∥ 2 m ≤ NE [ ∥φ∥2m ] +NE ∫ ]0,T ]   d∑ i=0 ∥F it∥ 2 m + ∥Gt∥ 2 m,ℓ2 + ∫ Rd ∥Rt(z)∥2mπ 2(dz)   dt. (5.4.30) Using (5.4.15) instead of (5.4.16), we obtain (5.4.22) from (5.4.30) in the same manner as Theorem 5.4.7. Note that no bound on τ/h2 is needed in this case. □ 5.5 Proof of the main theorems Proof of Theorem 5.2.3. By virtue of Theorems 2.9, 2.10, and 4.1 in [Gyö82], in order to obtain the existence, uniqueness, regularity, and the estimate (5.2.4), we only need to show that (5.1.1) may be realized as an abstract stochastic evolution equation in a Gelfand triple and that the growth condition and coercivity condition are satisfied. Indeed, since (5.1.1) is a linear equation, the hemicontinuity condition is immediate and monotonicity follows directly from the coercivity condition. By Holder’s inequality and Assumption 5.2.1(i), for u, v ∈ H1, we have d∑ i, j=0 ( ∂ ju, (v∂−ia i j t + a i j∂−iv) ) 0 + ∫ |z|>δ  u(· + z) − u − 1[−1,1](z) d∑ j=1 z j∂ ju, v   0 π1(dz) + ∫ |z|≤δ ∫ 1 0 d∑ i, j=1 ( z j∂ ju(· + θz), zi∂−iv ) 0 (1 − θ)dθπ1(dz) ≤ N∥u∥1∥v∥1. Therefore, since the pairing ⟨·, ·⟩1 brings (H1)∗ and H−1 into isomorphism, for each (ω, t) ∈ [0,T ] ×Ω, there exists a linear operator A˜t : H1 → H−1 such that ⟨v, A˜tu⟩1 agrees with the left-hand-side of the above inequality and for u, v ∈ H1, ∥A˜tu∥−1 ≤ N∥u∥1. By Assumption 5.2.2, the operator A defined by A(u) = A˜u+ f , maps H1 to H−1 and for u ∈ H1, ∥At(u)∥−1 ≤ N(∥u∥1 + ∥ f ∥−1). For an integer m ≥ 1, with abuse of notation, we write (·, ·)m = ((1 − ∆)m/2·, (1 − ∆)m/2·)0. and ∥ · ∥m for the corresponding norm in Hm. It is well known that the above inner product and norm are equivalent to the ones introduced in Section 1. For each m ≥ 1 and for all u ∈ Hm+1 and v ∈ Hm, we have (u, v)m ≤ ∥u∥m+1∥v∥m−1. Since Hm+1 is dense in Hm−1, we may define the pairing [·, ·]m : Hm+1 × Hm−1 → R by [v, v′]m = limn→∞(v, vn)m for all 5.5. Proof of the main theorems 159 v ∈ Hm+1 and v′ ∈ Hm−1, where (vn)∞n=1 ⊂ H m+1 is such that ∥vn − v′∥m−1 → 0 as n→ ∞. It can be shown that the mapping from Hm−1 to (Hm+1)∗ given by v′ 7→ [·, v′]m is an isometric isomorphism. For more details, see [Roz90]. Therefore, for all m ≥ 0, (Hm+1,Hm,Hm−1) forms a Gelfand triple with the pairing [·, ·]m, where we make the convention that ⟨·, ·⟩1 = [·, ·]0. For m ≥ 1 and all u ∈ Hm+1 and v ∈ Hm, using integration by parts, we get ⟨v, At(u)⟩1 = ((Lt + It)u + f , v)0 = ⟨v, (Lt + It)u + f ⟩1. Since this is true for all v ∈ H m, which is dense in H1, the restriction of A to Hm+1 coincides with L + I + f . Moreover, it can easily be shown under Assumptions 5.2.1(i) and 5.2.2 that for all m ≥ 1 and u, v ∈ Hm+1, ∥At(u)∥m−1 ≤ N∥u∥m+1 + ∥ f ∥m−1, where N is a constant depending only on m, d,K, and ν, which shows that A satisfies the growth condition. For u ∈ Hm, m ≥ 1, define Bϱt (u) = b iϱ t ∂iu + g ϱ t , Bt = (B ϱ t ) ∞ ϱ=1, and Cz(u) = u(· + z) − u + ot(z), z ∈ R d. Owing to Assumption 5.2.1 (i), Bt is an operator from Hm+1 to Hm(ℓ2). Furthermore, C is an operator from Hm+1 to L2(Rd, π2(dz); Hm) (see (5.5.2)). It is also clear that A, B, and C are appropriately measurable. Thus, (5.1.1) may be realized as the following stochastic evolution equation in the Gelfand triple (Hm+1,Hm,Hm−1): ut = u0 + ∫ ]0,t] As(us)ds + ∫ ]0,t] Bϱs(us)dw ϱ s + ∫ ]0,t] Cz(us−)q(dz, ds), (5.5.1) for t ∈ [0,T ]. Let u ∈ C∞c . A simple calculation shows that there is a constant N = N(δ) such that ∫ Rd ∥u(· + z) − u∥2mπ 2(dz) ≤ ς2(δ)∥u∥2m+1 + N∥u∥ 2 m. (5.5.2) Applying Holder’s inequality and the identity (u, ∂ ju) = 0, we obtain ∫ |z|>δ′  u(· + z) − u − 1[−1,1](z) d∑ j=1 z j∂ ju, u   m π1(dz) ≤ 0. By the Holder’s inequality and the Cauchy-Bunyakovsky-Schwarz inequality, we have 2 ∫ |z|≤δ′ ∫ 1 0 d∑ i, j=1 ( z j∂ ju(· + θz), zi∂−iu ) m (1 − θ)dθπ1(dz) ≤ ς1(δ)∥u∥2m+1. There exists a constant ϵ = ϵ(κ, δ) such that q := κ − ς(δ) − ϵ > 0. 160 Chapter 5. A finite difference scheme for non-degenerate parabolic SIDEs As in Theorem 4.1.2 in [Roz90] and Lemma 5.4.4, using Holder’s and Young’s inequal- ities, the above estimates, and Assumption 5.2.1, we find that for each ϵ > 0, there is a constant N = N(d,m, κ,K,T, δ) such that for all (ω, t) ∈ Ω × [0,T ], 2[u, At(u)]m + ∥Bt(u)∥2m,l2 + ∫ Rd ∥Cz(u)∥2mπ 2(dz) + q∥u∥2m+1 ≤ N ( ∥u∥2m + ∥ ft∥m−1 + ∥gt∥m,ℓ2 + ∫ Rd ∥ot(z)∥2mπ 2(dz) ) . Using the self-adjointness of (1 − ∆)1/2, the properties of the CBF [·, ·]m, and Assumption 5.2.2, for all v ∈ C∞c and u ∈ H m+1, m ≥ 1, we have [v, A(u)]m = ((L + I)u, (1 − ∆)mv)0 + ( f , (1 − ∆)mv)0. (5.5.3) Owing to (5.5.3) and the denseness of (1 − ∆)−mC∞c in H 1, from Theorems 2.9, 2.10, and 4.1 in [Gyö82], we obtain the existence and uniqueness of a solution u of (5.1.1), such that u is a càdlàg Hm-valued process satisfying (5.2.4). □ Proof of Proposition 5.2.5. Let A, B, and C be as in (5.5.1). Owing to Assumption 5.2.1, the boundedness of the m−1-norm of g in expectation, and estimate (5.2.4), using Jensen’s inequality and Itô’s isometry, for s, t ∈ [0,T ], we get E   ∣∣∣∣∣∣ ∣∣∣∣∣∣ ∫ ]s,t] Ar(ur)ds ∣∣∣∣∣∣ ∣∣∣∣∣∣ 2 m−1   ≤ |t − s| ( NE ∫ ]0,T ] ∥ut∥2m+1dt + E ∫ ]0,T ] ∥ fr∥2m−1dr ) ≤ N|t − s|, E   ∣∣∣∣∣∣ ∣∣∣∣∣∣ ∫ ]s,t] Bϱr (ur)dw ρ r ∣∣∣∣∣∣ ∣∣∣∣∣∣ 2 m−1   = E ∫ ]s,t] ∥Br(ur)∥2m−1,ℓ2dr ≤ N|t − s| ( sup t≤T E∥ut∥2m + sup t≤T E∥gt∥m−1,ℓ2 ) ≤ N|t − s|, and E   ∣∣∣∣∣∣ ∣∣∣∣∣∣ ∫ ]s,t] ∫ Rd Cz(ur−)q(dr, dz) ∣∣∣∣∣∣ ∣∣∣∣∣∣ 2 m−1   = E ∫ ]s,t] ∫ Rd ∥Cz(ur)∥2m−1π 2(dz)ds ≤ N|t − s| ( sup t≤T E∥ut∥2m + sup t≤T E ∫ Rd ∥ot(z)∥2m−1π 2(dz) ) ≤ N|t − s|, which completes the proof of the proposition. □ Theorem 5.5.1. Let Assumptions 5.2.1 through 5.2.5 hold for some m ≥ 2. Let u be the solution of (5.1.1) and (uh,τn )Tn=0 be defined by (5.4.1). Then there is a constant N = N(d,m, κ,K,T,C, λ, κ2m, δ) such that E [ max 0≤n≤T ∥utn − u h,τ n ∥ 2 m−2 ] + E T∑ l=0 τ d∑ i=0 ∥δh,iutl − δh,iu h,τ l ∥ 2 m−2ds ≤ N(τ + |h| 2). (5.5.4) 5.5. Proof of the main theorems 161 Proof. For t ∈ [0,T ], let κ1(t) := tn−1 for t ∈]tn−1, tn], and set e h,τ n := u h,τ n − utn . One can easily verify that eh,τn satisfies in Hm−2, eh,τn = e h,τ n−1 + ∫ ]tn−1,tn]  (L h tn−1 + I h)eh,τn−1 + d∑ i=0 δh,iF i t   dt + ∫ ]tn−1,tn] ( Nϱ;htn−1e h,τ n−1 + G ϱ t ) dwϱt + ∫ ]tn−1,tn] ∫ Rd ( Ih(z)eh,τn−1 + Rt(z) ) q(dt, dz), where F0t := (L h κ1(t) − Lκ1(t))ut + (Lκ1(t) − Lt)ut + (I h − I)ut + ( fκ1(t) − ft) + I h δc(uκ1(t) − ut) + d∑ j=1 a0 j κ1(t) δ−h, j(uκ1(t) − ut) + d∑ i=0 ai0κ1(t)δh,i(uκ1(t) − ut) − d∑ i, j=1 δ−h, j(uκ1(t) − ut)(· + h)δh,ia i j κ1(t) , F it := d∑ j=1 ai j κ1(t) δ−h, j(uκ1(t) − ut) + ∞∑ k=0 χ(h,k)∑ l=1 θ¯ k,h l ζ i j k,hδ−h, j(uκ1(t) − ut)(· + hzrh,kl ) Gϱt : = (N ϱ κ1(t) − Nϱt )ut + (N ϱ;h κ1(t) − Nϱ κ1(t) )ut +N ϱ κ1(t) (uκ1(t) − ut) + (g ϱ κ1(t) − gϱt ) Rht (z) : = ( Ih(z) − I(z) ) ut− + Ih(z)(uκ1(t) − ut−) + ( oκ1(t)(z) − ot(z) ) . By Theorem 5.4.7, we have E [ max 0≤n≤T ∥eh,τn ∥ 2 m−2 ] + E M∑ n=0 τ d∑ i=0 ∥δh,ieh,τn ∥ 2 m−2 ≤ NE ∫ ]0,T ] ( d∑ i=0 ∥F it∥ 2 m−2 + ∥Gt∥ 2 m−2,ℓ2 + ∫ Rd ∥Rt(z)∥2m−2π 2(dz) ) dt. Using Lemmas 5.4.1, 5.4.2, and 5.4.3 and Assumptions 5.2.1(i) and 5.2.4, the right-hand- side of the above relation can be estimated by NE ∫ ]0,T ] ( |h|2∥ut∥2m+1 + |κ1(t) − t|∥ut∥ 2 m + ∥uκ1(t) − ut∥ 2 m−1 ) dt +NE ∫ ]0,T ] ( ∥ fκ1(t) − ft∥ 2 m−2 + ∥gκ1(t) − gt∥m−2,ℓ2 + ∫ Rd ∥oκ1(t)(z) − ot(z)∥m−2π 2(dz) ) dt where N depends only on d,m, κ,K,C, λ,T, δ and ν. By virtue of (5.2.4), Proposition 162 Chapter 5. A finite difference scheme for non-degenerate parabolic SIDEs 5.2.5, and Assumption 5.2.3, we obtain (5.5.4), which completes the proof. □ Theorem 5.5.2. Let Assumptions 5.2.1 through 5.2.4 hold with m ≥ 2 and let u be the solution of (5.1.1). There exists a constant R = R(d,m, κ,K, δ) such that if T > R, then there exists a unique solution (vh,τ)Tn=0 of (5.4.2) in H m−2. Moreover, there is a constant N = N(d,m, κ,K,T,C, λ, κ2m, δ) such that E [ max 0≤n≤T ∥utn − v h,τ n ∥ 2 m−2 ] + E T∑ l=0 τ d∑ i=0 ∥δh,iutl − δh,iv h,τ l ∥ 2 m−2ds ≤ N(τ + |h| 2). (5.5.5) Proof. The existence and uniqueness follows directly from Theorem 5.4.8. Let κ1(t) be as in the previous proof and set κ2(t) = tn for t ∈]tn−1, tn]. Let G and R be defined as in Theorem 5.5.1 and define F¯ i to be F i with κ1(t) replaced with κ2(t). Set e h,τ n = v h,τ n − utn . As in the proof of Theorem 5.5.1, we have eh,τn = e h,τ n−1 + ∫ ]tn−1,tn]  (L˜ h tn + I h δ )e h,τ n + I˜ h δce h,τ n−1 + d∑ i=0 δh,iF˜ i t   dt + ∫ ]tn−1,tn] ( 1n>1N ϱ;h tn−1e h,τ n−1 + G˜ ϱ t ) dwϱt + ∫ ]tn−1,tn] ∫ Rd ( 1n>1Ih(z)e h,τ n−1 + R˜t(z) ) q(dt, dz), where F˜ i = F¯ i, for i , 0, F˜0 = F¯0 + I˜hδc(uκ1(t) − uκ2(t)), G˜ϱt = 1t≤t1(N ϱ t ut + g ϱ t ) + 1t>t1G ϱ t , R˜t(z) = 1t≤t1I(z)ut− + 1t>t1Rt(z). By Theorem 5.4.8, we have E [ max 0≤n≤T ∥eh,τn ∥ 2 m−2 ] + E M∑ n=0 τ d∑ i=0 ∥δh,ieh,τn ∥ 2 m−2 ≤ N(A1 + A2 + A3), where A1 := E ∫ ]0,T ] d∑ i=0 ∥F¯ it∥ 2 m−2dt + ∫ ]t1,T ] ( ∥Gt∥2m−2,ℓ2 + ∫ Rd ∥Rt(z)∥2m−2π 2(dz) ) dt, A2 := E ∫ ]0,T ] ∥I˜hδc(uκ1(t) − uκ2(t))∥ 2 m−2dt A3 := E ∫ ]0,t1] ( ∥Mtut + gt∥2m−2,ℓ2 + ∫ Rd ∥I(z)ut + ot(z)∥2m−2π 2(dz) ) dt. 5.5. Proof of the main theorems 163 As in the proof of Theorem 5.5.1, we have A1 ≤ N(τ + |h|2). By Proposition 5.2.5, we get A2 ≤ NE ∫ T 0 ∥uκ1(t) − uκ2(t)∥ 2 m−1dt ≤ Nτ. Owing to (5.2.3), we have A3 ≤ NE ∫ t1 0 ( ∥ut∥2m−1 + ∥gt∥ 2 m−2,ℓ2 + ∫ Rd ∥ot(z)∥2m−2π 2(dz) ) dt ≤ NτE ∫ t1 0 ( sup t≤T ∥ut∥2m−1 + ξ ) dt ≤ Nτ. Combining the above estimates yields (5.5.5). □ By virtue of Sobolev’s embedding theorem and (5.2.10), as in [GK10], we obtain the following corollaries of Theorem 5.5.1 and Theorem 5.5.2. Corollary 5.5.3. Suppose the assumptions of Theorem 5.5.1 hold with m > n + 2 + d/2, where n is an integer with n ≥ 0. Then for all λ = (λ1, . . . , λn) ∈ {1 . . . , d}n and δh,λ = δh,λ1 · · · δh,λn , there is a constant N = N(d,m, κ,K,T,C, λ, κ2m, δ) such that E [ max 0≤n≤T sup x∈Rd |δh,λutn(x) − δh,λu h,τ n (x)| 2 ] + E [ max 0≤n≤T ∥δh,λutn − δh,λu h,τ n ∥ 2 ℓ2(Gh) ] ≤ N(τ + |h|2). Corollary 5.5.4. Suppose the assumptions of Theorem 5.5.2 hold with m > n + 2 + d/2, where n is an integer with n ≥ 0. Then for all λ = (λ1, . . . , λn) ∈ {1 . . . , d}n and δh,λ = δh,λ1 · · · δh,λn , there is a constant N = N(d,m, κ,K,T,C, λ, κ2m, δ) such that E [ max 0≤n≤T sup x∈Rd |δh,λutn(x) − δh,λv h,τ n (x)| 2 ] + E [ max 0≤n≤T ∥δh,λutn − δh,λv h,τ n ∥ 2 ℓ2(Gh) ] ≤ N(τ + |h|2). Proof of Theorems 5.2.8 and 5.2.9. Let (uˆh,τn )Mn=0 be defined by (5.2.8). Denote by (·, ·)ℓ2(Gh) the inner product of ℓ2(Gh). There exists a constant ϵ = ϵ(κ, δ) such that q := κ − ς1(δ) − ϵ > 0. As in (5.4.11), there is a constant N6 = N6(d, κ,K, δ) such that for all ϕ ∈ ℓ2(Gh), (ϕ, L˜ht ϕ)ℓ2(Gh) + (ϕ, I h δϕ)ℓ2(Gh) ≤ −q d∑ i=1 ∥δh,iϕ∥2ℓ2(Gh) + N6∥ϕ∥ 2 ℓ2(Gh). Following the arguments in the beginning of the proof of Theorem 5.4.8, we conclude that if T > N6T , then there exists a unique solution (vˆ h,τ n )Mn=0 in ℓ2(Gh) of (5.2.9). It is easy to see that N6 < N2 (for the same choice of ϵ) for all m > 0, where N2 is the constant 164 Chapter 5. A finite difference scheme for non-degenerate parabolic SIDEs appearing on the right-hand-side of (5.4.11), and hence N6 < R, where R is as in Theorem 5.4.8. Let (uh,τn )Mn=1 be defined by (5.4.1). By Theorem 5.5.2, there exists a unique solution (vh,τn )Mn=1 of (5.4.2). It suffices to show that almost surely, uh,τn (x) = uˆ h,τ n (x) (5.5.6) and vh,τn (x) = vˆ h,τ n (x), (5.5.7) for all n ∈ {0, ...,M} and x ∈ Gh. Let S : Hm−2 → ℓ2(Gh) denote the embedding from Remark 5.2.6. ApplyingS to both sides of (5.4.1), one can see thatS uh,τ and uˆh,τ satisfy the same recursive relation in ℓ2(Gh) with common initial condition φ, and hence (5.5.6) follows. Similarly, S vh,τ and vˆh,τ satisfy the same equation in ℓ2(Gh) and (5.5.7) follows from the uniqueness of the ℓ2(Gh) solution of (5.2.9). □ Remark 5.5.5. It follows from Corollaries 5.5.3, 5.5.4, and relations (5.5.6) and (5.5.7) that if more regularity is assumed of the coefficients and the data of the equation (5.1.1), then better estimates can be obtained than the ones presented in Theorems 5.2.8 and 5.2.9. Chapter 6 Conclusions and future work In this thesis, we investigated existence, uniqueness, and regularity of degenerate parabolic linear SIDEs in the whole space with adapted coefficients from a couple of standpoints. First, we used the method of stochastic characteristics to derive classical solutions directly. Second, we derived the integer scale L2-Sobolev theory for the equations using the varia- tional framework of stochastic evolution equations and the method of vanishing viscosity. We then established the rate of convergence of some finite difference schemes for a simple class of SIDEs under the assumption of non-degenerate stochastic parabolicity. There are myriad of future directions to consider. We will only discuss a few. As we mentioned in Chapter 4, the integer scale Lp-Sobolev theory for degenerate SIDEs is currently underway and will be available soon. Still though, it would be ideal to obtain an existence and regularity theory in the full scale of Bessel potential spaces with Hölder assumptions on the coefficients. It is also interesting to study non-degenerate linear SIDEs under weaker regularity conditions than those required for the fully degenerate theory. Currently, an existence theory is known only in a few special cases; we refer the reader to [MP09, MP11, MP12, MP13, KK12b, KK12a, KKK13] for some relatively recent results in this direction. With regards to approximations of SIDEs, we considered only equations with non-degenerate stochastic parabolicity and where the integral operators do not depend on the space variable. It would be interesting to relax these conditions in the future. Also, from a practical standpoint, the error from truncating the domain should be studied in con- junction with the numerical approximations of the equations. It is worth mentioning that a regularity theory for SIDEs in bounded domains with degenerate stochastic parabolicity is non-existent as far as the author knows. There is also no reason to confine oneself to finite difference schemes, since there are other numerical methods such as finite elements and wavelets. Although no simulations were done in this thesis, they will be considered in a future work along with some applications to the non-linear filtering problem. 165 References [Ber77] Melvin S. Berger. Nonlinearity and functional analysis. Academic Press [Harcourt Brace Jovanovich, Publishers], New York-London, 1977. Lec- tures on nonlinear problems in mathematical analysis, Pure and Applied Mathematics. [Bis81] Jean-Michel Bismut. Mécanique Aléatoire, volume 866 of Lecture Notes in Mathematics. Springer-Verlag, Berlin-New York, 1981. With an English summary. [BL12] Andrea Barth and Annika Lang. Milstein approximation for advection- diffusion equations driven by multiplicative noncontinuous martingale noises. Appl. Math. Optim., 66(3):387–413, 2012. [BvNVW08] Z. Brzez´niak, J. M. A. M. van Neerven, M. C. Veraar, and L. Weis. Itô’s formula in UMD Banach spaces and regularity of solutions of the Zakai equation. J. Differential Equations, 245(1):30–58, 2008. [CJM92] P. L. Chow, J.-L. Jiang, and J.-L. Menaldi. Pathwise convergence of ap- proximate solutions to Zakai’s equation in a bounded domain. In Stochastic partial differential equations and applications (Trento, 1990), volume 268 of Pitman Res. Notes Math. Ser., pages 111–123. Longman Sci. Tech., Har- low, 1992. [CK10] Zhen-Qing Chen and Kyeong-Hun Kim. An Lp-Theory of Non-Divergence Form SPDEs Driven by Lévy Processes. arXiv preprint arXiv:1007.3295, 2010. [CV05] Rama Cont and Ekaterina Voltchkova. A finite difference scheme for option pricing in jump diffusion and exponential Lévy models. SIAM J. Numer. Anal., 43(4):1596–1626 (electronic), 2005. [DMGZ94] Giuseppe De Marco, Gianluca Gorni, and Gaetano Zampieri. Global inver- sion of functions: an introduction. NoDEA Nonlinear Differential Equations Appl., 1(3):229–248, 1994. [DPMT07] Giuseppe Da Prato, Jose-Luis Menaldi, and Luciano Tubaro. Some results of backward Itô formula. Stoch. Anal. Appl., 25(3):679–703, 2007. [DPT98] Giuseppe Da Prato and Luciano Tubaro. Some remarks about backward Itô formula and applications. Stochastic Anal. Appl., 16(6):993–1003, 1998. [DPZ92] Giuseppe Da Prato and Jerzy Zabczyk. Stochastic Equations in Infinite Di- mensions, volume 44 of Encyclopedia of Mathematics and its Applications. Cambridge University Press, Cambridge, 1992. [FGP10] F. Flandoli, M. Gubinelli, and E. Priola. Well-posedness of the transport equation by stochastic perturbation. Invent. Math., 180(1):1–53, 2010. 167 [FK85] Tsukasa Fujiwara and Hiroshi Kunita. Stochastic differential equations of jump type and Lévy processes in diffeomorphisms group. J. Math. Kyoto Univ., 25(1):71–106, 1985. [GGK14] Máté Gerencsér, István Gyöngy, and N. V. Krylov. On the solvability of degenerate stochastic partial differential equations in Sobolev spaces. arXiv preprint arXiv:1404.4401, 2014. [GK81] István Gyöngy and N. V. Krylov. On stochastics equations with respect to semimartingales. II. Itô formula in Banach spaces. Stochastics, 6(3-4):153– 173, 1981. [GK92] István Gyöngy and Nicolai V. Krylov. On stochastic partial differential equations with unbounded coefficients. Potential Anal., 1(3):233–256, 1992. [GK96] W. Grecksch and P. E. Kloeden. Time-discretised Galerkin approximations of parabolic stochastic PDEs. Bull. Austral. Math. Soc., 54(1):79–85, 1996. [GK10] István Gyöngy and Nicolai Krylov. Accelerated finite difference schemes for linear stochastic partial differential equations in the whole space. SIAM J. Math. Anal., 42(5):2275–2296, 2010. [GK11] István Gyöngy and Nicolai Krylov. Accelerated numerical schemes for PDEs and SPDEs. In Stochastic analysis 2010, pages 131–168. Springer, Heidelberg, 2011. [GK81] István Gyöngy and N. V. Krylov. On stochastic equations with respect to semimartingales. I. Stochastics, 4(1):1–21, 1980/81. [GM83] B. Grigelionis and R. Mikulevicˇius. Stochastic evolution equations and den- sities of the conditional distributions. In Theory and Application of Random Fields, volume 49 of Lecture Notes in Control and Inform. Sci., pages 49– 88. Springer, Berlin, 1983. [GM05] István Gyöngy and Annie Millet. On discretization schemes for stochastic evolution equations. Potential Anal., 23(2):99–134, 2005. [GM09] István Gyöngy and Annie Millet. Rate of convergence of space time approx- imations for stochastic evolution equations. Potential Anal., 30(1):29–64, 2009. [GM11] B. Grigelionis and R. Mikulevicius. Nonlinear filtering equations for stochastic processes with jumps. In The Oxford Handbook of Nonlinear Filtering, pages 95–128. Oxford Univ. Press, Oxford, 2011. [Gri72] B. I. Grigelionis. Stochastic equations for the nonlinear filtering of random processes. Litovsk. Mat. Sb., 12(4):37–51, 233, 1972. [Gri76] B. Grigelionis. Reduced stochastic equations of nonlinear filtering of ran- dom processes. Litovsk. Mat. Sb., 16(3):51–63, 232, 1976. 168 [Gri82] B. Grigelionis. Stochastic nonlinear filtering equations and semimartin- gales. In Nonlinear filtering and stochastic control (Cortona, 1981), vol- ume 972 of Lecture Notes in Math., pages 63–99. Springer, Berlin-New York, 1982. [GT01] David Gilbarg and Neil S. Trudinger. Elliptic Partial Differential Equations of Second Order. Classics in Mathematics. Springer-Verlag, Berlin, 2001. Reprint of the 1998 edition. [Gyö82] I. Gyöngy. On stochastic equations with respect to semimartingales. III. Stochastics, 7(4):231–254, 1982. [Gyö98] István Gyöngy. Lattice approximations for stochastic quasi-linear parabolic partial differential equations driven by space-time white noise. I. Potential Anal., 9(1):1–25, 1998. [Gyö99] István Gyöngy. Lattice approximations for stochastic quasi-linear parabolic partial differential equations driven by space-time white noise. II. Potential Anal., 11(1):1–37, 1999. [Gyö11] I. Gyöngy. On finite difference schemes for degenerate stochastic parabolic partial differential equations. J. Math. Sci. (N. Y.), 179(1):100–126, 2011. Problems in mathematical analysis. No. 61. [Hal12] Eric Joseph Hall. Accelerated spatial approximations for time discretized stochastic partial differential equations. SIAM J. Math. Anal., 44(5):3162– 3185, 2012. [Hal13] Eric Joseph Hall. Higher order spatial approximations for degenerate parabolic stochastic partial differential equations. SIAM J. Math. Anal., 45(4):2071–2098, 2013. [Hau08] Erika Hausenblas. Finite element approximation of stochastic partial differ- ential equations driven by Poisson random measures of jump type. SIAM J. Numer. Anal., 46(1):437–471, 2007/08. [Hau05] Erika Hausenblas. Existence, uniqueness and regularity of parabolic SPDEs driven by Poisson random measure. Electron. J. Probab., 10:1496–1546, 2005. [HM06] Erika Hausenblas and Iuliana Marchis. A numerical approximation of parabolic stochastic partial differential equations driven by a Poisson ran- dom measure. BIT, 46(4):773–811, 2006. [HØUZ10] Helge Holden, Bernt Øksendal, Jan Ubøe, and Tusheng Zhang. Stochastic Partial Differential Equations: A Modeling, White Noise Functional Ap- proach. Universitext. Springer, New York, second edition, 2010. [Jac79] Jean Jacod. Calcul Stochastique et Problèmes de Martingales, volume 714 of Lecture Notes in Mathematics. Springer, Berlin, 1979. [JK10] Arnulf Jentzen and Peter Kloeden. Taylor expansions of solutions of stochastic partial differential equations with additive noise. Ann. Probab., 38(2):532–569, 2010. 169 [JS03] Jean Jacod and Albert N. Shiryaev. Limit Theorems for Stochastic Pro- cesses, volume 288 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer-Verlag, Berlin, second edition, 2003. [Kal97] Olav Kallenberg. Foundations of Modern Probability. Probability and its Applications (New York). Springer-Verlag, New York, 1997. [KK12a] Ildoo Kim and Kyeong-Hun Kim. A generalization of the Littlewood- Paley inequality for the fractional Laplacian (−∆)α/2. J. Math. Anal. Appl., 388(1):175–190, 2012. [KK12b] Kyeong-Hun Kim and Panki Kim. An Lp-theory of a class of stochastic equations with the random fractional Laplacian driven by Lévy processes. Stochastic Process. Appl., 122(12):3921–3952, 2012. [KKK13] Ildoo Kim, Kyeong-Hun Kim, and Panki Kim. Parabolic Littlewood-Paley inequality for ϕ(−∆)-type operators and applications to stochastic integro- differential equations. Adv. Math., 249:161–203, 2013. [KM11] Reiichiro Kawai and Hiroki Masuda. On simulation of tempered stable random variates. J. Comput. Appl. Math., 235(8):2873–2887, 2011. [Kom84] Takashi Komatsu. On the martingale problem for generators of stable pro- cesses with perturbations. Osaka J. Math., 21(1):113–132, 1984. [KR77] N. V. Krylov and B. L. Rozovskiı˘. The Cauchy problem for linear stochastic partial differential equations. Izv. Akad. Nauk SSSR Ser. Mat., 41(6):1329– 1347, 1448, 1977. [KR79] N. V. Krylov and B. L. Rozovskiı˘. Stochastic evolution equations. In Cur- rent Problems in Mathematics, Vol. 14 (Russian), pages 71–147, 256. Akad. Nauk SSSR, Vsesoyuz. Inst. Nauchn. i Tekhn. Informatsii, Moscow, 1979. [KR81] N. V. Krylov and B. L. Rozovskiı˘. On the first integrals and Liouville equa- tions for diffusion processes. In Stochastic differential systems (Visegrád, 1980), volume 36 of Lecture Notes in Control and Information Sci., pages 117–125. Springer, Berlin-New York, 1981. [KR82] N. V. Krylov and B. L. Rozovskiı˘. Characteristics of second-order degener- ate parabolic Itô equations. Trudy Sem. Petrovsk., 8:153–168, 1982. [Kry99] N. V. Krylov. An analytic approach to SPDEs. In Stochastic Partial Differ- ential Equations: Six Perspectives, volume 64 of Math. Surveys Monogr., pages 185–242. Amer. Math. Soc., Providence, RI, 1999. [Kry08] N. V. Krylov. Lectures on Elliptic and Parabolic Equations in Sobolev Spaces, volume 96 of Graduate Studies in Mathematics. American Mathe- matical Society, Providence, RI, 2008. [Kry11] N. V. Krylov. On the Itô-Wentzell formula for distribution-valued processes and related topics. Probab. Theory Related Fields, 150(1-2):295–319, 2011. 170 [Kun81] Hiroshi Kunita. On the decomposition of solutions of stochastic differential equations. In Stochastic Integrals (Proc. Sympos., Univ. Durham, Durham, 1980), volume 851 of Lecture Notes in Math., pages 213–255. Springer, Berlin-New York, 1981. [Kun86a] H. Kunita. Lectures on Stochastic Flows and Applications, volume 78 of Tata Institute of Fundamental Research Lectures on Mathematics and Physics. Published for the Tata Institute of Fundamental Research, Bom- bay; by Springer-Verlag, Berlin, 1986. [Kun86b] Hiroshi Kunita. Convergence of stochastic flows with jumps and Lévy pro- cesses in diffeomorphisms group. Ann. Inst. H. Poincaré Probab. Statist., 22(3):287–321, 1986. [Kun96] Hiroshi Kunita. Stochastic differential equations with jumps and stochas- tic flows of diffeomorphisms. In Itô’s Stochastic Calculus and Probability Theory, pages 197–211. Springer, Tokyo, 1996. [Kun97] Hiroshi Kunita. Stochastic Flows and Stochastic Differential Equations, volume 24 of Cambridge Studies in Advanced Mathematics. Cambridge University Press, Cambridge, 1997. Reprint of the 1990 original. [Kun04] Hiroshi Kunita. Stochastic differential equations based on Lévy processes and stochastic flows of diffeomorphisms. In Real and Stochastic Analysis, Trends Math., pages 305–373. Birkhäuser Boston, Boston, MA, 2004. [Lan12] Annika Lang. Almost sure convergence of a Galerkin approximation for SPDEs of Zakai type driven by square integrable martingales. J. Comput. Appl. Math., 236(7):1724–1732, 2012. [Len77] E. Lenglart. Relation de domination entre deux processus. Ann. Inst. H. Poincaré Sect. B (N.S.), 13(2):171–179, 1977. [LM14a] James-Michael Leahy and Remigijus Mikulevicius. On classical solu- tions of linear stochastic integro-differential equations. arXiv preprint arXiv:1404.0345, 2014. [LM14b] James-Michael Leahy and Remigijus Mikulevicius. On degenerate linear stochastic evolution equations driven by jump processes. arXiv preprint arXiv:1406.4541, 2014. [LM14c] James-Michael Leahy and Remigijus Mikulevicius. On some properties of space inverses of stochastic flows. arXiv preprint arXiv:1411.6277, 2014. [LR04] Gabriel J. Lord and Jacques Rougemont. A numerical scheme for stochastic PDEs with Gevrey regularity. IMA J. Numer. Anal., 24(4):587–604, 2004. [LS89] R. Sh. Liptser and A. N. Shiryayev. Theory of martingales, volume 49 of Mathematics and its Applications (Soviet Series). Kluwer Academic Pub- lishers Group, Dordrecht, 1989. Translated from the Russian by K. Dz- japaridze. [MB07] Thilo Meyer-Brandis. Stochastic Feynman-Kac equations associated to Lévy-Itô diffusions. Stoch. Anal. Appl., 25(5):913–932, 2007. 171 [Mey76] P. A. Meyer. La théorie de la prédiction de F. Knight. In Séminaire de Prob- abilités, X (Première partie, Univ. Strasbourg, Strasbourg, année universi- taire 1974/1975), pages 86–103. Lecture Notes in Math., Vol. 511. Springer, Berlin, 1976. [Mey81] P.A. Meyer. Flot D’une Equation Differentielle Stochastique (d’après Malliavin, Bismut, Kunita), volume 850, pages 103–117. Springer, Berlin- Heidelberg, 1981. [Mik83] R. Mikulevicˇius. Properties of solutions of stochastic differential equations. Litovsk. Mat. Sb., 23(4):18–31, 1983. [Mik00] R. Mikulevicˇius. On the Cauchy problem for parabolic SPDEs in Hölder classes. Ann. Probab., 28(1):74–103, 2000. [MP09] R. Mikulevicˇius and H. Pragarauskas. On Hölder solutions of the integro- differential Zakai equation. Stochastic Process. Appl., 119(10):3319–3355, 2009. [MP11] R. Mikulevicˇius and H. Pragarauskas. Model problem for integro- differential Zakai equation with discontinuous observation processes. Appl. Math. Optim., 64(1):37–69, 2011. [MP12] R. Mikulevicˇius and H. Pragarauskas. On Lp-estimates of some singular integrals related to jump processes. SIAM J. Math. Anal., 44(4):2305–2328, 2012. [MP13] R. Mikulevicˇius and H. Pragarauskas. On Lp- theory for stochastic parabolic integro-differential equations. Stoch PDE: Anal Comp, 1(2):282–324, 2013. [MP76] Michel Métivier and Giovanni Pistone. Une formule d’isométrie pour l’intégrale stochastique hilbertienne et équations d’évolution linéaires stochastiques. Z. Wahrscheinlichkeitstheorie und Verw. Gebiete, 33(1):1– 18, 1975/76. [MR99] R. Mikulevicius and B. L. Rozovskii. Martingale problems for stochas- tic PDE’s. In Stochastic partial differential equations: six perspectives, volume 64 of Math. Surveys Monogr., pages 243–325. Amer. Math. Soc., Providence, RI, 1999. [Nov75] A. A. Novikov. Discontinuous martingales. Teor. Verojatnost. i Primemen., 20:13–28, 1975. [Ole65] O. A. Oleı˘nik. On the smoothness of solutions of degenerating elliptic and parabolic equations. Dokl. Akad. Nauk SSSR, 163:577–580, 1965. [OP89] Daniel Ocone and Étienne Pardoux. A generalized Itô-Ventzell formula. Application to a class of anticipating stochastic differential equations. Ann. Inst. H. Poincaré Probab. Statist., 25(1):39–71, 1989. [OR71] O. A. Oleı˘nik and E. V. Radkevicˇ. Second order equations with non- negative characteristic form. In Mathematical Analysis, 1969 (Russian), pages 7–252. Akad. Nauk SSSR Vsesojuzn. Inst. Naucˇn. i Tehn. Informa- cii, Moscow, 1971. 172 [Par72] Étienne Pardoux. Sur des équations aux dérivées partielles stochastiques monotones. C. R. Acad. Sci. Paris Sér. A-B, 275:A101–A103, 1972. [Par75] Étienne Pardoux. Équations aux dérivées partielles stochastiques de type monotone. In Séminaire sur les Équations aux Dérivées Partielles (1974– 1975), III, Exp. No. 2, page 10. Collège de France, Paris, 1975. [Pri12] Enrico Priola. Pathwise uniqueness for singular SDEs driven by stable pro- cesses. Osaka J. Math., 49(2):421–447, 2012. [Pri14] Enrico Priola. Stochastic flow for SDEs with jumps and irregular drift term. arXiv preprint arXiv:1405.2575, 2014. [Pro05] Philip E. Protter. Stochastic integration and differential equations, vol- ume 21 of Stochastic Modelling and Applied Probability. Springer-Verlag, Berlin, 2005. Second edition. Version 2.1, Corrected third printing. [PV14] Matthijs Pronk and Mark Veraar. A new approach to stochastic evolution equations with adapted drift. J. Differential Equations, 256(11):3634–3683, 2014. [PZ07] S. Peszat and J. Zabczyk. Stochastic Partial Differential Equations with Lévy Noise: An Evolution Equation Approach, volume 113 of Encyclo- pedia of Mathematics and its Applications. Cambridge University Press, Cambridge, 2007. [PZ13] S. Peszat and J. Zabczyk. Time regularity of solutions to linear equa- tions with Lévy noise in infinite dimensions. Stochastic Process. Appl., 123(3):719–751, 2013. [QZ08] Huijie Qiao and Xicheng Zhang. Homeomorphism flows for non-Lipschitz stochastic differential equations with jumps. Stochastic Process. Appl., 118(12):2254–2268, 2008. [Roz90] B. L. Rozovskiı˘. Stochastic Evolution Systems: Linear Theory and Appli- cations to Nonlinear Filtering, volume 35 of Mathematics and its Applica- tions (Soviet Series). Kluwer Academic Publishers Group, Dordrecht, 1990. Translated from the Russian by A. Yarkho. [RZ07] Michael Röckner and Tusheng Zhang. Stochastic evolution equations of jump type: existence, uniqueness and large deviation principles. Potential Anal., 26(3):255–279, 2007. [Tin77a] E. Tinfavicˇius. Linearized stochastic equations of nonlinear filtering of ran- dom processes. Lithuanian Mathematical Journal, 17(3):321–334, 1977- 07-01 1977. [Tin77b] E. Tinfavicˇius. On the reduced stochastic equations for non-linear filtering of random processes. Litovsk. Mat. Sb., 17(3):53–72, 212, 1977. [Tri10] Hans Triebel. Theory of function spaces. Modern Birkhäuser Classics. Birkhäuser/Springer Basel AG, Basel, 2010. Reprint of 1983 edition [MR0730762], Also published in 1983 by Birkhäuser Verlag [MR0781540]. 173 [Ver12] Mark Veraar. The stochastic Fubini theorem revisited. Stochastics, 84(4):543–551, 2012. [Wal86] John B. Walsh. An Introduction to Stochastic Partial Differential Equations. In École d’été de Probabilités de Saint-Flour, XIV—1984, volume 1180 of Lecture Notes in Math., pages 265–439. Springer, Berlin, 1986. [Yan05] Yubin Yan. Galerkin finite element methods for stochastic parabolic partial differential equations. SIAM J. Numer. Anal., 43(4):1363–1384 (electronic), 2005. [Yoo00] Hyek Yoo. Semi-discretization of stochastic partial differential equations on R1 by a finite-difference method. Math. Comp., 69(230):653–666, 2000. [Zha13] Xicheng Zhang. Degenerate irregular SDEs with jumps and application to integro-differential equations of Fokker-Planck type. Electron. J. Probab., 18:no. 55, 25, 2013. [Zho13] Guoli Zhou. Global well-posedness of a class of stochastic equations with jumps. Adv. Difference Equ., page 2013:175, 2013. 174