1. Introduction
This paper aims to study the almost sure short-time behavior of the solution $(X_t)_{t\geq0}$ to a Lévy-driven stochastic differential equation (SDE) of the form
\begin{equation*}
X_t=x+\int_{0+}^t\sigma(X_{s-})\,\textrm{d} L_s,\qquad t\geq0, \tag{1}
\end{equation*}
by relating it to the behavior of the driving process. Here, L is an $\mathbb{R}^d$-valued Lévy process, $x\in\mathbb{R}^{n}$, and the function $\sigma\colon\mathbb{R}^{n}\rightarrow\mathbb{R}^{n\times d}$ is chosen to be twice continuously differentiable and at most of linear growth. The latter conditions ensure that (1) has a unique strong solution (see, e.g., [Reference Protter12, Theorem 7, p. 259]) and that Itō’s formula is applicable to $\sigma(X)$. Since the short-time behavior of a stochastic process is determined by its sample paths in an arbitrarily small neighborhood of zero, the linear growth condition may be omitted whenever the solution of the SDE is well defined on some interval $[0,\varepsilon]$ with $\varepsilon>0$.
We characterize the short-time behavior of X by comparing the sample paths of the process to suitable functions. For real-valued Lévy processes, early results [Reference Rogozin14, Reference Shtatland20] linked the almost sure convergence of the quotient $L_t/t$ for $t\downarrow0$ to the total variation of the sample paths of the process. This was generalized to determining the behavior of $L_t/t^{{\kern1pt}p}$ for arbitrary $p>0$ from the characteristic triplet of L in [Reference Bertoin, Doney and Maller2, Reference Blumenthal and Getoor3, Reference Pruitt13]. The exact scaling function f for law of the iterated logarithm type (LIL-type) results of the form $\limsup_{t\downarrow0}L_t/f(t)=c$ almost surely (a.s.) with a deterministic constant c was determined by Khintchine for Lévy processes with a Gaussian component (see, e.g., [Reference Sato15, Proposition 47.11]) and in, e.g., [Reference Savov16, Reference Savov17] for more general types of Lévy processes. The multivariate counterpart to these LIL-type results was derived in [Reference Einmahl6], showing that the short-time behavior of the driving process in (1) is well understood overall.
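For concreteness, the prototypical case is Khintchine’s LIL itself: if L is a real-valued Lévy process whose Gaussian part has variance $a>0$, then (see, e.g., [Reference Sato15, Proposition 47.11])
\begin{equation*}
\limsup_{t\downarrow0}\frac{L_t}{\sqrt{2t\ln(\ln(1/t))}}=\sqrt{a}\quad \mathrm{a.s.},
\end{equation*}
so that $f(t)=\sqrt{2t\ln(\ln(1/t))}$ and $c=\sqrt{a}$ in the above notation.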
For the solution X, the situation becomes less transparent. It was shown in [Reference Kühn9, Reference Schilling and Schnurr18] that, under suitable conditions on the Lévy measure of L, the solution X is a Lévy-type Feller process, i.e. the symbol of X can be expressed through a triplet $(A(x),\nu(x),\gamma(x))$ that depends on the characteristics of the driving process, the initial condition, and the function $\sigma$ . The short- and long-time behavior of these particular Feller processes can be characterized in terms of power-law functions using a generalization of Blumenthal–Getoor indices (see [Reference Schnurr19]), where the symbol now plays the role of the characteristic exponent. Using similar methods, an explicit short-time LIL in one dimension was derived in [Reference Knopova and Schilling8], and the techniques have been explored further in [Reference Kühn10]. The definition of a Lévy-type Feller process suggests that we can think of X as ‘locally Lévy’ and, since the short-time behavior of the process is determined by the sample paths in an arbitrarily small neighborhood of zero, the process X should directly reflect the short-time behavior of the driving Lévy process. We confirm this hypothesis in terms of power-law functions in Proposition 1 and Theorem 1 by showing that the almost sure finiteness of $\lim_{t\downarrow0}t^{-p}L_t$ implies the almost sure convergence of the quantity $t^{-p}(X_t-X_0)$ , and that similar results hold for $\limsup_{t\downarrow0} t^{-p}(X_t-X_0)$ and $\liminf_{t\downarrow0}t^{-p}(X_t-X_0)$ with probability 1 whenever $\lim_{t\downarrow0}t^{-p/2}L_t$ exists almost surely. The conditions on the driving process are readily checked from its characteristic triplet, e.g. from [Reference Bertoin, Doney and Maller2]. Using knowledge of the form of the scaling function for the driving Lévy process from [Reference Einmahl6], the limit theorems can be generalized to suitable functions $f\colon[0,\infty)\rightarrow\mathbb{R}$ to derive explicit LIL-type results for the solution of (1) that cover many frequently used models. As an application, we also briefly study convergence in distribution and in probability, showing that results on the short-time behavior of the driving process translate here as well. The results given partially overlap with characterizations obtained from other approaches such as the generalization of Blumenthal–Getoor indices for Lévy-type Feller processes discussed, e.g., in [Reference Schnurr19], while also covering new cases such as almost sure limits for $t\downarrow0$ . Compared to methods that rely on the symbol, the approach presented in this paper is less technical and more direct, as it only uses the behavior of the driving process as input. Whenever possible, we work with general semimartingales and include converse results to recover the limiting behavior of the driving process from the solution.
2. Preliminaries
A Lévy process $L=(L_t)_{t\geq0}$ is a stochastic process with stationary and independent increments, the paths of which are almost surely càdlàg, i.e. right-continuous with finite left limits, and start at zero with probability 1. In the following analysis, we consider both vector- and matrix-valued Lévy processes, identifying an $\mathbb{R}^{n\times d}$ -valued Lévy process with an $\mathbb{R}^{nd}$ -valued one by vectorization if needed. We follow the convention that a matrix m is vectorized by writing its entries columnwise into a vector $m^\mathrm{vec}$ . The symbols $\langle\cdot,\cdot\rangle$ and $\|\cdot\|$ denote the Euclidean scalar product and norm on $\mathbb{R}^d$ , respectively, and we write $x^\top$ for the transpose of a vector or matrix x.
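For instance, with this columnwise convention a matrix $m=(m_{i,j})_{i,j=1,2}\in\mathbb{R}^{2\times2}$ is identified with
\begin{equation*}
m^\mathrm{vec}=(m_{1,1},m_{2,1},m_{1,2},m_{2,2})^\top\in\mathbb{R}^{4}.
\end{equation*}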
By the Lévy–Khintchine formula (see, e.g., [Reference Sato15, Theorem 8.1]), the characteristic function of an $\mathbb{R}^d$-valued Lévy process L is given by $\varphi_{L_t}(z)=\mathbb{E} \mathrm{e}^{\mathrm{i} \langle z, L_t\rangle} = \exp (t \psi_L(z))$, $z\in \mathbb{R}^d$, where $\psi_L$ denotes the characteristic exponent satisfying
\begin{equation*}
\psi_L(z)=-\tfrac{1}{2}\langle z,A_Lz\rangle+\mathrm{i}\langle \gamma_L,z\rangle+\int_{\mathbb{R}^d}\bigl(\mathrm{e}^{\mathrm{i}\langle z,x\rangle}-1-\mathrm{i}\langle z,x\rangle\textbf{1}_{\{\|x\|\leq1\}}(x)\bigr)\,\nu_L(\textrm{d} x).
\end{equation*}
Here, $A_L\in\mathbb{R}^{d\times d}$ is the Gaussian covariance matrix, $\nu_L$ is the Lévy measure, and $\gamma_L\in\mathbb{R}^d$ is the location parameter of L. The characteristic triplet of L is denoted by $(A_L,\nu_L,\gamma_L)$. If $A_L=0$, i.e. if the Lévy process has no Gaussian component, we refer to it as purely non-Gaussian. Whenever the Lévy measure satisfies the condition $\int_{\{\|x\|\leq 1\}} \|x\| \nu_L(\textrm{d} x)<\infty$, we may also use the Lévy–Khintchine formula in the form
\begin{equation*}
\psi_L(z)=-\tfrac{1}{2}\langle z,A_Lz\rangle+\mathrm{i}\langle \gamma_0,z\rangle+\int_{\mathbb{R}^d}\bigl(\mathrm{e}^{\mathrm{i}\langle z,x\rangle}-1\bigr)\,\nu_L(\textrm{d} x),
\end{equation*}
and call $\gamma_0$ the drift of L.
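For instance, for a real-valued compound Poisson process with jump intensity $\lambda>0$ and jump distribution F to which a deterministic linear drift $bt$ is added, this form reads
\begin{equation*}
\psi_L(z)=\mathrm{i} bz+\lambda\int_{\mathbb{R}}\bigl(\mathrm{e}^{\mathrm{i} zx}-1\bigr)F(\textrm{d} x),
\end{equation*}
so that $A_L=0$, $\nu_L=\lambda F$, and the drift is $\gamma_0=b$.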
For any càdlàg process X, we denote by $X_{s-}$ the left-hand limit of X at time ${s\in(0,\infty)}$ , and by $\Delta X_s=X_s-X_{s-}$ its jumps. The process $X_{s-}$ is càglàd, i.e. left-continuous with finite right limits. Any integrals are interpreted as integrals with respect to semimartingales as, e.g., in [Reference Protter12], and we generally consider a filtered probability space $\bigl(\Omega,\mathcal{F},(\mathcal{F}_t)_{t\geq0},\mathbb{P})$ satisfying the usual hypotheses (see, e.g., [Reference Protter12, p. 3]). The integral bounds are assumed to be included when the notation $\int_a^b$ is used, and the exclusion of the left or right bound is denoted by $\int_{a+}^b$ or $\int_a^{b-}$ . For semimartingales X, Y, and Z taking values in $\mathbb{R}^{n\times d}$ , $\mathbb{R}^{d\times m}$ , and $\mathbb{R}^{m\times d}$ , respectively, matrix-valued integrals are interpreted as
and the integration by parts formula takes the form
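For concreteness, the componentwise conventions we have in mind can be written, for the integral appearing in (1), as
\begin{equation*}
\biggl(\int_{0+}^{t}\sigma(X_{s-})\,\textrm{d} L_s\biggr)_{j}=\sum_{l=1}^{d}\int_{0+}^{t}\sigma_{j,l}(X_{s-})\,\textrm{d}(L_l)_s,\qquad j=1,\dots,n,
\end{equation*}
while, for matrix-valued semimartingales U and V of compatible dimensions, integration by parts takes the form
\begin{equation*}
U_tV_t=U_0V_0+\int_{0+}^{t}U_{s-}\,\textrm{d} V_s+\int_{0+}^{t}(\textrm{d} U_s)\,V_{s-}+[U,V]_t,\qquad \bigl([U,V]_t\bigr)_{i,j}=\sum_{k}[U_{i,k},V_{k,j}]_t,
\end{equation*}
with all operations understood entrywise (cf. [Reference Protter12]).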
3. Main results and applications
The aim of this paper is a characterization of the almost sure short-time behavior of the solution to a Lévy-driven SDE by relating it to the behavior of the driving process. In Section 3.1, we first consider the setting of general semimartingales to show that the solution of (1) a.s. reflects the short-time behavior of the driving process. Specializing to Lévy processes in Section 3.2 then allows us to strengthen the results and, by referring to the explicit scaling functions obtained in [Reference Einmahl6], derive explicit LIL-type results for the solution process.
3.1. General SDEs
A key tool in the analysis is the following lemma, which gives the desired statement for a stochastic integral when the behavior of the integrand is known. We state the result in terms of power-law functions to match Proposition 1; however, the denominator $t^{{\kern1pt}p}$ may be replaced by an arbitrary continuous function $f\colon[0,\infty)\rightarrow\mathbb{R}$ that is increasing and satisfies $f(0)=0$ and $f(t)>0$ for all $t>0$ . The proofs of this lemma and all following results are given in Section 4.
Lemma 1. Let $X=(X_t)_{t\geq 0}$ be a real-valued semimartingale, $p>0$, and $\varphi=(\varphi_t)_{t\geq0}$ an adapted càglàd process such that $\lim_{t\downarrow0}t^{-p}\varphi_t$ exists and is finite with probability 1. Then
\begin{equation*}
\lim_{t\downarrow0}\frac{1}{t^{{\kern1pt}p}}\int_{0+}^t\varphi_s\,\textrm{d} X_s=0\quad \mathrm{a.s.} \tag{2}
\end{equation*}
Lemma 1 naturally extends to multivariate stochastic integrals, with the integrand $\varphi$ being $\mathbb{R}^{n\times d}$-valued and the semimartingale X being $\mathbb{R}^{d\times m}$-valued. We thus obtain a tool to derive the almost sure short-time behavior of the solution to (1) from the behavior of the driving process. Whenever we can ensure that $\sigma(X_{s-})$ is invertible, this implication is an equivalence which allows us to recover the behavior of the driving process from the solution. We hence get a counterpart to [Reference Schnurr19, Theorem 4.4] for almost sure limits at zero.
Proposition 1. Let L be an $\mathbb{R}^{d}$ -valued semimartingale with $L_0=0$ and ${\lim_{t\downarrow 0}t^{-p}L_t=v}$ a.s. for some $v\in\mathbb{R}^{d}$ , ${p>0}$ . Further, let $\sigma\colon\mathbb{R}^{d}\rightarrow\mathbb{R}^{n\times d}$ be twice continuously differentiable and at most of linear growth, and denote the solution of (1) by ${X=(X_t)_{t\geq0}}$ .
(i) The process X satisfies $\lim_{t\downarrow 0}({X_t-x})/{t^{{\kern1pt}p}}=\sigma(x)v$ a.s., and whenever $\sigma(X_{t-})$ with $X_{0-}=x$ has a.s. full rank for all t in a neighborhood of zero, we have the equivalence
\begin{equation*} \lim_{t\downarrow0}\frac{L_t}{t^{{\kern1pt}p}}=v\quad \mathrm{a.s.} \quad \Longleftrightarrow \quad \lim_{t\downarrow0}\frac{X_t-x}{t^{{\kern1pt}p}}=\sigma(x)v\quad \mathrm{a.s.} \end{equation*}
(ii) If, additionally, $[L_k,L_k]_t=o(t^{2p})$ a.s. for $k=1,\dots,d$ , then
\begin{equation*}
 \lim_{t\downarrow0}\frac{1}{t^{{\kern1pt}p}}\int_{0+}^tY_{s-}\,\textrm{d} L_s= Y_0v\quad \mathrm{a.s.}
\end{equation*}
for any $\mathbb{R}^{n\times d}$-valued semimartingale Y, and whenever $\lim_{t\downarrow0}t^{-p}Y_t=w$ a.s. with ${w\in\mathbb{R}^{n\times d}}$ satisfying ${wv=0}$, we have the stronger statement
\begin{equation*}
 \lim_{t\downarrow0}\frac{1}{t^{2p}}\int_{0+}^tY_{s-}\,\textrm{d} L_s= 0\quad \mathrm{a.s.}
\end{equation*}
The above results naturally extend to matrix-valued L. Note that Proposition 1(ii) holds in particular when $Y_t=\sigma(L_t)$ , but the dependence on the driving process is not needed to conclude the convergence. As an example, we apply Proposition 1 to matrix-valued stochastic exponentials.
Example 1. Recall that for an $\mathbb{R}^{d\times d}$ -valued semimartingale ${L=(L_t)_{t\geq0}}$ , the (strong) solution to the stochastic differential equation $\textrm{d} X_t=X_{t-}\textrm{d} L_t$ or $\textrm{d} Y_t=\textrm{d} L_tY_{t-}$ with the initial condition given by the identity matrix Id is referred to as left or right stochastic exponential, respectively. Since the relevant properties of the process X carry over to Y by transposition, we restrict the following discussion to $X=\mathcal{E}(L)$ . Assuming that ${\det(\mathrm{Id}+\Delta L_s)\neq0}$ for all $s\geq0$ , the inverse $\mathcal{E}(L)^{-1}$ is well defined [Reference Karandikar7] and Proposition 1 yields the equivalence
whenever one of the limits exists for some $v\in\mathbb{R}^{d\times d}$ and $p>0$ . If L is a Lévy process, the almost sure limit v appearing for $p=1$ is the drift of L. A result in [Reference Rogozin14, Reference Shtatland20] directly links the existence of this limit to the total variation of the sample paths of L. Denoting the set of stochastic processes having sample paths of bounded variation by BV, (3) allows us to extend this connection to
since $\mathcal{E}(L)$ has paths of bounded variation if and only if this holds for the paths of L.
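A further simple illustration (the parameters $\lambda$, Z, and v below are chosen purely for this example): consider the Lévy-driven Ornstein–Uhlenbeck-type equation $\textrm{d} X_t=-\lambda X_{t-}\,\textrm{d} t+\textrm{d} Z_t$, $X_0=x\in\mathbb{R}$, with $\lambda>0$ and a real-valued Lévy process Z. This fits into the framework of (1) with the $\mathbb{R}^{2}$-valued driving Lévy process $L_t=(t,Z_t)^\top$ and $\sigma(y)=(-\lambda y,1)$. If $\lim_{t\downarrow0}t^{-p}Z_t=v$ a.s. for some $v\in\mathbb{R}$ and $p\in(0,1]$, then $\lim_{t\downarrow0}t^{-p}L_t=(\textbf{1}_{\{p=1\}},v)^\top$ a.s., and Proposition 1(i) yields
\begin{equation*}
\lim_{t\downarrow0}\frac{X_t-x}{t^{{\kern1pt}p}}=\sigma(x)\begin{pmatrix}\textbf{1}_{\{p=1\}}\\ v\end{pmatrix}=v-\lambda x\,\textbf{1}_{\{p=1\}}\quad \mathrm{a.s.}
\end{equation*}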
Whenever L is a semimartingale satisfying both $\lim_{t\downarrow0}t^{-p}L_t=0$ and ${[L_k,L_k]_t=o(t^{2p})}$ a.s. for $k=1,\dots,d$ , an inspection of the proof of Proposition 1 shows that
for $t\downarrow0$ , which implies that $[\sigma(X),L]_t=o(t^{2p})$ a.s. in the limit $t\downarrow0$ . We further use the stronger assumption on the quadratic variation to consider the $\limsup$ and $\liminf$ behavior of the quotient $t^{-p}(X_t-x)$ , including the divergent case.
Theorem 1. Let L be an $\mathbb{R}^{d}$ -valued semimartingale such that $\lim_{t\downarrow0}t^{-p/2}L_t=0$ a.s. for some $p>0$ , and $[L_k,L_k]_t=o(t^{{\kern1pt}p})$ a.s. for $k=1,\dots,d$ . Further, let $\sigma\colon\mathbb{R}^{n}\rightarrow\mathbb{R}^{n\times d}$ be twice continuously differentiable and at most of linear growth, and denote by X the solution of the SDE (1). Then, almost surely,
In particular, if $\sigma(x)$ has rank d, we have
3.2. Lévy-driven SDEs
For the remainder of the section, we will assume that L is a Lévy process. In this case, the assumptions of Proposition 1 and Theorem 1 can be checked directly from the characteristic triplet of L. We give a brief overview of the behavior of $t^{-p}L_t$ for real-valued Lévy processes. Noting that the almost sure short-time behavior of L is dominated by Khintchine’s LIL (see, e.g., [Reference Sato15, Proposition 47.11]) whenever L contains a Gaussian component, and that the drift of L only occurs explicitly if $p=1$ and the process is of finite variation [Reference Shtatland20], we can further restrict the discussion to L being a pure-jump process and set its drift to zero whenever $\int_{\{\|x\|\leq 1\}}\|x\|\nu_L(\textrm{d} x)<\infty$. A first indicator of the behavior of $t^{-p}L_t$ as $t\downarrow0$ is the (upper) Blumenthal–Getoor index
\begin{equation*}
\beta=\inf\biggl\{\eta>0\colon\int_{\{\|x\|\leq1\}}\|x\|^{\eta}\,\nu_L(\textrm{d} x)<\infty\biggr\},
\end{equation*}
introduced in [Reference Blumenthal and Getoor3], which yields
\begin{equation*}
\lim_{t\downarrow0}\frac{L_t}{t^{{\kern1pt}p}}=0\ \text{ for } p<\frac{1}{\beta}\qquad\text{and}\qquad\limsup_{t\downarrow0}\frac{|L_t|}{t^{{\kern1pt}p}}=\infty\ \text{ for } p>\frac{1}{\beta}
\end{equation*}
with probability 1 (see, e.g., [Reference Pruitt13]). A more direct criterion is given in [Reference Bertoin, Doney and Maller2, Section 2], showing that the condition $\lim_{t\downarrow0}t^{-p}L_t=0$ is a.s. satisfied for $p>\frac12$ whenever
and for $p=\frac12$ whenever
If (6) or (7) fails, we have $\limsup_{t\downarrow0}t^{-p}|L_t|=\infty$ a.s. or ${\limsup_{t\downarrow0}t^{-1/2}|L_t|=\lambda^*\in(0,\infty]}$ a.s., respectively, instead.
Observe that the quadratic variation process of L is again a Lévy process, i.e. the integral test (6) in particular applies to [L, L]. For $d=1$ we have
\begin{equation*}
[L,L]_t=at+\sum_{0<s\leq t}(\Delta L_s)^2,\qquad t\geq0, \tag{8}
\end{equation*}
where a is the variance of the Gaussian part of L (if present). The additional assumption on the quadratic variation in Proposition 1(ii) and in Theorem 1 is always satisfied if L is a Lévy process.
Lemma 2. Let L be a real-valued Lévy process satisfying $\lim_{t\downarrow0}t^{-p}L_t=v$ a.s. for some $v\in\mathbb{R}$ and $p>0$ . Then, a.s., $[L,L]_t=o(t^{2p})$ as $t\downarrow0$ .
Note that Lemma 2 readily extends to the multivariate case by the Kunita–Watanabe inequality (see, e.g., [Reference Protter12, Theorem II.25]). So far, we have only characterized the almost sure short-time behavior of the solution to (1) in terms of power-law functions. To derive precise LIL-type results, we now consider more general scaling functions ${f\colon[0,\infty)\rightarrow\mathbb{R}}$ . Whenever the driving Lévy process has a Gaussian component, its almost sure short-time behavior is dominated by Khintchine’s LIL. In this setting, Lemma 2 directly generalizes to continuous increasing functions f with $f(0)=0$ and $f(t)>0$ for all $t>0$ . This is because any function f for which $\lim_{t\downarrow0}L_t/f(t)$ exists in $\mathbb{R}$ satisfies $\sqrt{2t\ln(\ln(1/t))}/f(t)\rightarrow0$ , implying
by [Reference Shtatland20, Theorem 1]. Hence, $[L,L]_t=o(f(t)^2)$ and we can replace the function $t^{p/2}$ in Theorem 1 by f, obtaining a precise short-time behavior for the solution whenever (1) includes a diffusion part. In the case that L does not include a Gaussian component, [L, L] is a finite variation process without drift satisfying ${\lim_{t\downarrow0}t^{-1}[L,L]_t=0}$ a.s. by [Reference Shtatland20, Theorem 1], but an argument similar to (9) is only applicable if f decays sufficiently fast as $t\downarrow0$ . For the general case, we combine Theorem 1 with the precise information on possible scaling functions derived in [Reference Einmahl6]. Note that the conditions of Corollary 1 immediately follow from Khintchine’s LIL whenever $h\equiv1$ , as the process does not include a Gaussian component by assumption.
Corollary 1. Let L be a purely non-Gaussian $\mathbb{R}^{d}$ -valued Lévy process and ${f\colon[0,\infty)\rightarrow\mathbb{R}}$ be of the form $f(t)=\sqrt{t\ln(\ln(1/t))}h(1/t)^{-1}$ , where $h\colon[0,\infty)\rightarrow[0,\infty)$ is a continuous and non-decreasing slowly varying function such that the set of cluster points of $L_t/f(t)$ as $t\downarrow0$ is bounded with probability 1. Further, let $\sigma\colon\mathbb{R}^{d}\rightarrow\mathbb{R}^{n\times d}$ be twice continuously differentiable and at most of linear growth, and denote by $X=(X_t)_{t\geq0}$ the solution of (1). Then, a.s.,
In particular, if $\sigma(x)$ has rank d, we have
The above results show that the almost sure short-time LIL-type behavior of the driving Lévy process directly translates to the solution of the SDE (1). We also note the following statement for the corresponding cluster set.
Corollary 2. Under the assumptions of Corollary 1, let $\limsup_{t\downarrow0}\|L_t\|/f(t)$ be bounded with probability 1. Then there exists an almost sure cluster set ${C_X=C(\{(X_t-x)/f(t)\colon t\downarrow0\})}$ for the solution X of (1), which is obtained from the cluster set ${C_L=C(\{L_t/f(t)\colon t\downarrow0\})}$ of the driving Lévy process L via $C_X=\sigma(x)C_L$.
Corollary 2 implies in particular that $C_X$ shares the properties of $C_L$ derived in [Reference Einmahl6, Theorem 2.4], and that there is a one-to-one correspondence between the cluster sets whenever $\sigma(x)$ has rank d, e.g. $C_X=C_L$ for the stochastic exponential in Example 1. Another important special case is stable Lévy processes.
Example 2. Recall that a Lévy process L is called (strictly) stable with index ${\alpha\in(0,2]}$ if and only if the random variables $L_t$ and $t^{1/\alpha} L_1$ have the same law for each $t>0$ . For the following discussion, let $\alpha\neq2$ , i.e. L is a non-Gaussian Lévy process, and pick $d=1$ . In this case, the short-time behavior of $L_t/f(t)$ with ${f\colon(0,\infty)\rightarrow(0,\infty)}$ is determined by the behavior of the integral $\int_0^1f(t)^{-\alpha}\,\textrm{d} t$ in the sense that $\lim_{t\downarrow0} L_t/f(t)=0$ a.s. if the integral converges, and $\limsup_{t\downarrow0}|L_t|/f(t)=\infty$ a.s. if it does not (see [Reference Bertoin1, Chapter VIII, Theorem 5]). Hence, Lemma 2 yields $[L,L]_t=o(t^{2p})$ a.s., $0<p<1/\alpha$ , and Corollary 1 implies that the almost sure short-time behavior of the solution to (1) can be determined by the same integral criterion as the behavior of the driving Lévy process whenever $f\colon(0,\infty)\rightarrow(0,\infty)$ is of the specified form. For power functions, we get a strong resemblance to the Blumenthal–Getoor index, as
by Proposition 1, while Theorem 1 implies that the almost sure lim sup behavior for $p>1/\alpha$ translates as well. See also [Reference Kühn10] for generalizations of the above integral criterion which allow us to apply similar reasoning to a larger class of processes.
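To spell out the power-function case (a short computation based on the integral criterion above): for $f(t)=t^{{\kern1pt}p}$ we have
\begin{equation*}
\int_0^1 f(t)^{-\alpha}\,\textrm{d} t=\int_0^1 t^{-p\alpha}\,\textrm{d} t<\infty\quad \Longleftrightarrow\quad p<\frac{1}{\alpha},
\end{equation*}
so that, whenever $\sigma(x)\neq0$, the solution satisfies $\lim_{t\downarrow0}t^{-p}(X_t-x)=0$ a.s. for $p<1/\alpha$ and $\limsup_{t\downarrow0}t^{-p}\|X_t-x\|=\infty$ a.s. for $p>1/\alpha$.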
As an application of Theorem 1, we consider two similar statements concerning convergence in distribution and convergence in probability, which are denoted by $\overset{\mathrm{D}}{\rightarrow}$ and $\overset{\mathrm{P}}{\rightarrow}$ , respectively. Since the short-time behavior of Brownian motion is well known, we can assume that L does not have a Gaussian component. We further set the drift of L, whenever it exists, to zero. Sufficient conditions for the attraction of a Lévy process to a stable law are given, e.g., in [Reference Maller and Mason11, Theorem 2.3], and the conditions for the driving process in Corollary 3 are thus readily checked from the characteristic triplet. The special case of attraction to normality is, e.g., considered in [Reference Doney and Maller5, Theorem 2.5], with the condition being satisfied in particular for a Lévy process with a symmetric Lévy measure such as $\nu_L(\textrm{d} x)=\exp(-|x|)\textbf{1}_{[-1,1]}(x)\,\textrm{d} x$ . Attraction of Lévy processes to a stable law with index $\alpha<2$ is studied in [Reference de Weert4].
Corollary 3. Let L be a real-valued non-Gaussian Lévy process such that the drift of L is equal to zero whenever it exists. Further, assume that there is a continuous increasing function $f\colon[0,\infty)\rightarrow[0,\infty)$ such that $f(t)^{-1}L_t\overset{\mathrm{D}}{\rightarrow}Y$ as $t\downarrow0$ , where the random variable Y follows a non-degenerate stable law with index $\alpha\in(0,2]$ . Also let $\sigma\colon\mathbb{R}\rightarrow\mathbb{R}$ be twice continuously differentiable and at most of linear growth, and choose $x\in\mathbb{R}^{n}$ such that $\sigma(x)\neq0$ . If X is the solution of (1) with initial condition x, then $({X_t-x})/{f(t)}\overset{\mathrm{D}}{\rightarrow}\sigma(x)Y$ . Whenever f is regularly varying with index $r\in\big(0,\frac12\big]$ at zero, this result also holds if the random variable Y is a.s. constant.
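As a simple instance of Corollary 3 (the choice of driving process here is ours, made for illustration): let L be a symmetric strictly $\alpha$-stable process with $\alpha\in(0,2)$ and take $f(t)=t^{1/\alpha}$. Then $f(t)^{-1}L_t$ has the same law as $L_1$ for every $t>0$, so that trivially $f(t)^{-1}L_t\overset{\mathrm{D}}{\rightarrow}L_1$ as $t\downarrow0$, and Corollary 3 yields
\begin{equation*}
\frac{X_t-x}{t^{1/\alpha}}\overset{\mathrm{D}}{\rightarrow}\sigma(x)L_1,\qquad t\downarrow0,
\end{equation*}
for any initial condition x with $\sigma(x)\neq0$.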
We give a further result for convergence in probability. The assumptions are again readily checked from the characteristic triplet of the driving Lévy process using [Reference Doney and Maller5, Theorem 2.2] and are, e.g., satisfied for finite variation Lévy processes. As the limiting random variable obtained is a.s. constant, the proof is immediate from Corollary 3.
Corollary 4. Let $d=1$ , L be as in Corollary 3, and assume that there is a continuous increasing function $f\colon[0,\infty)\rightarrow[0,\infty)$ that is regularly varying with index $r\in\big(0,\frac12\big]$ at zero and satisfies $f(t)^{-1}L_t\overset{\mathrm{P}}{\rightarrow}v$ for some finite value $v\in\mathbb{R}$ as $t\downarrow0$ . Also, let $\sigma\colon\mathbb{R}\rightarrow\mathbb{R}$ be twice continuously differentiable and at most of linear growth, and denote by X the solution of (1). Then $({X_t-x})/{f(t)}\overset{\mathrm{P}}{\rightarrow}\sigma(x)v$ .
4. Proofs of Section 3
Proof of Lemma 1. Define the process $\psi$ ($\omega$-wise) by
possibly setting $\psi_0(\omega)=0$ on the null set where the limit does not exist. By definition, $\psi$ is càglàd, and, as $\lim_{t\downarrow0}t^{-p}\varphi_t$ exists a.s. in $\mathbb{R}$, $\mathcal{F}_0$ contains all null sets by assumption, and the filtration is right-continuous, $\psi_0$ is $\mathcal{F}_0$-measurable. Therefore, $\psi$ is also adapted. This implies that the semimartingale $Y_t\;:\!=\;\int_{0+}^t\psi_s\,\textrm{d} X_s$ is well defined, allowing us to rewrite the process considered in (2) as
, using the associativity of the stochastic integral. Applying integration by parts, we get
As Y has almost sure càdlàg paths and satisfies $Y_0=0$ by definition, it follows that ${\lim_{t\downarrow0}Y_t=0}$ with probability 1. The term remaining on the right-hand side is a path-by-path Lebesgue–Stieltjes integral. Note that, as $p>0$ , the integrator is increasing, which implies the monotonicity of the corresponding integral. Hence,
Recalling that $\lim_{t\downarrow0}Y_t=0$ a.s., the claim follows.
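In summary, with $\psi_s=s^{-p}\varphi_s$ for $s>0$ and $Y_t=\int_{0+}^t\psi_s\,\textrm{d} X_s$ as above, the associativity of the stochastic integral and integration by parts give
\begin{equation*}
\frac{1}{t^{{\kern1pt}p}}\int_{0+}^t\varphi_s\,\textrm{d} X_s=\frac{1}{t^{{\kern1pt}p}}\int_{0+}^ts^{{\kern1pt}p}\,\textrm{d} Y_s=Y_t-\frac{1}{t^{{\kern1pt}p}}\int_{0+}^tY_{s-}\,\textrm{d}(s^{{\kern1pt}p}),\qquad \biggl|\frac{1}{t^{{\kern1pt}p}}\int_{0+}^tY_{s-}\,\textrm{d}(s^{{\kern1pt}p})\biggr|\leq\sup_{0<s\leq t}|Y_{s-}|,
\end{equation*}
where the covariation $[s^{{\kern1pt}p},Y]_t$ vanishes since $t\mapsto t^{{\kern1pt}p}$ is continuous and of finite variation; both terms on the right-hand side tend to zero a.s. as $t\downarrow0$ because $Y_0=0$.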
Proof of Proposition 1. (i) Let $\lim_{t\downarrow0}t^{-p}L_t=v$ with probability 1. Rearranging the integral equation for X and applying integration by parts to the components yields
As $\lim_{t\downarrow0}t^{-p}L_t=v$ a.s. by assumption, and $\lim_{t\downarrow0}X_t= x=X_0$ a.s. since X solves (1), the first term on the right-hand side of (11) converges a.s. to the desired limit. It thus remains to show that the other terms a.s. vanish as $t\downarrow0$ . For the second and third terms of (11), this follows from the assumption and Lemma 1, respectively. Finally, applying Itō’s formula for X in the quadratic covariation appearing in the last term yields
By linearity, we can treat the terms on the right-hand side here separately. Using the associativity of the stochastic integral and noting that the continuous finite variation terms do not contribute to the quadratic covariation, we are left with
As the integrators in the first term on the right-hand side of (12) are of finite variation, the corresponding integrals are path-by-path Lebesgue–Stieltjes integrals. Writing out
, and denoting integration with respect to the total variation measure of a process Y as $\textrm{d} TV_Y$ , the individual integrals can be estimated by
using the Kunita–Watanabe inequality and noting that the resulting integrals have increasing integrators. The above estimates show that the total variation of $\int_{0+}^t\sigma_{j,l}(X_{s-})\,\textrm{d}[L_l,L_k]_s$ also satisfies this estimate. For the quadratic variation terms, note that since ${(L_0)_{k,l}=0}$ a.s., it follows from the assumption and the one-dimensional version of Lemma 1 that
Therefore,
and a similar estimate holds true for the total variation of $\int_{0+}^t\sigma_{j,l}(X_{s-})\,\textrm{d} [L_l,L_k]_s$ . Denoting the total variation process of a process Y at t by $TV(Y)_t$ , we obtain the bound
for the first term on the right-hand side of (12), showing that it a.s. vanishes in the limit $t\downarrow0$ . Lastly, denote by
the jump term remaining in (12). Using the Kunita–Watanabe inequality and recalling that $[L_k,L_k]=o(t^{{\kern1pt}p})$ by the previous estimate, it remains to consider the quadratic variation of J. Evaluating
and noting that
for all $j_1,j_2=1,\dots,n$ and a fixed $t\geq0$ due to the càdlàg paths of X, it follows that
by Taylor’s formula. Noting further that
for some finite (random) constants $C'$, $C''$, we can estimate $[J,J]_t$ in terms of the quadratic variation of L. In particular, both terms in (12) are a.s. of order $o(t^{{\kern1pt}p})$ and vanish when the limit $t\downarrow0$ is considered in (11).
Next, let $\lim_{t\downarrow0}t^{-p}(X_t-x)=\sigma(x)v$ with probability 1. As $\sigma(X_{s-})$ has a.s. full rank for all s in a neighborhood of zero, we can recover $L_t$ for small values of t through $L_t=\int_{0+}^t\bigl(\sigma(X_{s-})\bigr)^{-1}\,\textrm{d} X_s$ . Since $\|\sigma(X_t)-\sigma(x)\|<1$ a.s. for sufficiently small $t>0$ , it follows that
where the Neumann series converges a.s. in norm. By Taylor’s formula we have
with a remainder that satisfies $r_{i,j}(t)=O((X_t-x)^2)=o(t^{{\kern1pt}p})$ . Thus,
exists a.s., and it follows that also $R_t=o(t^{{\kern1pt}p})$ as $t\downarrow0$ with probability 1. Rewriting
we conclude from Lemma 1 and the almost sure short-time behavior of X that the limit $t\downarrow0$ exists a.s. and is equal to $\sigma(x)v$ . In particular, the continuity of $\sigma$ and the assumption on the rank of $\sigma(X_{s-})$ ensure that the integrand $\sigma(x)(\sigma(X_{s-}))^{-1}-\mathrm{Id}$ converges a.s. as $t\downarrow0$ . Since $\sigma(x)$ is invertible, we recover the short-time behavior of L from the above convergence result for $\sigma(x)L$ , completing the proof of (i).
(ii) As $\lim_{t\downarrow0}t^{-p}Y_tL_t=Y_0v$ a.s. by assumption, consider $Y_t-Y_0$ and let $Y_0=0$ a.s. without loss of generality. Applying integration by parts to $t^{-p}\int_{0+}^tY_{s-}\,\textrm{d} L_s$ , we can estimate the covariation using the Kunita–Watanabe inequality. This yields
By assumption, we have $[L_k,L_k]_t=o(t^{2p})$ . This implies that $t^{-p}[Y,L]_t\rightarrow0$ a.s. for $t\downarrow0$ , yielding the first part of the proposition. Assume next that, additionally, ${\lim_{t\downarrow0}t^{-p}Y_t=w}$ for some $w\in\mathbb{R}^{n\times d}$ with $wv=0$ . Similar to the proof of Lemma 1, define an adapted stochastic process $\psi$ ( $\omega$ -wise) by
possibly setting $\psi_0(\omega)=0$ on the null set where the limit does not exist. Rewriting the integral using $Z_t=\int_{0+}^t\psi_{s-}\,\textrm{d} L_s$ and applying integration by parts yields
By the first part of the proposition we have
while the pathwise Lebesgue–Stieltjes integral can be estimated by
As $wv=0$ , both bounds converge to zero with probability 1, and hence
Thus, the limit for $t\downarrow0$ of (13) exists a.s. and is equal to zero.
Proof of Theorem 1. Similar to the proof of Proposition 1, we use integration by parts and rewrite
The claim follows by showing the desired limiting behavior for the right-hand side. For the covariation, this is immediate from the assumption and the previous calculations. Hence, it remains to study the behavior of the integral. Using Itō’s formula yields
where the jump term is again denoted by J and the component of $\sigma$ included in it is carried as a subscript. Observe that, by associativity, it follows that
which is a sum of pathwise Lebesgue–Stieltjes integrals. Thus,
As $\sigma$ is twice continuously differentiable, $\sup_{0<s\leq t}\big|(L_k)_{s-}M_{s-}\big|$ is bounded and the right-hand side of this inequality converges a.s. to zero. For the jump term we have
by definition. Using the assumption on $\sigma$ , it follows from Taylor’s formula that
for some finite (random) constants $C,C'\geq0$ . Thus,
which also converges a.s. to zero as $t\downarrow0$ . For the last term, observe first that
by associativity. This allows us to rewrite
Note that $\sup_{0<s\leq t}|(M_{i,k,l})_s|$ is bounded for any fixed small $t\geq0$ and continuous at zero due to the continuity of $\sigma$ and its derivatives. Since $\lim_{t\downarrow0}t^{-p/2}L_t=0$ a.s., it follows that $\lim_{t\downarrow0}t^{-p/2}(L_k)_t(M_{i,k,l})_t$ exists with probability 1. Thus, Proposition 1(ii) is applicable, implying that the integral a.s. vanishes in the limit. Since $\lim_{t\downarrow0}t^{-p/2}L_t=0$ a.s., we have
The assumptions on $\sigma$ ensure that the supremum on the right-hand side of (14) stays bounded as $t\downarrow0$ , while Proposition 1(i) applies for $t^{-p/2}(X_t-x)$ . Since ${\lim_{t\downarrow0}t^{-p/2}L_t=0}$ a.s., the right-hand side of (14) converges to zero a.s. as $t\downarrow0$ . In particular, the limits in (4) are indeed a.s. equal. If $\sigma(x)$ has rank d, (5) follows immediately from the convergence result in (4).
Proof of Lemma 2. Note that $([L,L]_t)_{t\geq0}$ always has sample paths of bounded variation. We distinguish three cases for $p>0$ . Recall that a denotes the diffusion coefficient of L.
Case 1 ( $p<\frac12$ ): Applying Khintchine’s LIL implies that ${\lim_{t\downarrow0}t^{-p}L_t=0}$ a.s. holds for any Lévy process. Since we also have $2p<1$ , [Reference Shtatland20, Theorem 1] yields, regardless of the value of a,
Case 2 ($p=\frac12$): Here, Khintchine’s LIL yields $\limsup_{t\downarrow0} L_t/\sqrt{t}=\infty$ a.s. if the Gaussian part of L is non-zero. As the limit is assumed to be finite, the process L must satisfy $a=0$. This implies that the quadratic variation process in (8) has no drift, so ${[L,L]_t=o(t)}$ a.s. by [Reference Shtatland20, Theorem 1].
Case 3 ($p>\frac12$): Consider L with its drift (if present) subtracted from the process. This neither changes the structure of the quadratic variation nor the assumption on the almost sure convergence, but ensures that [Reference Bertoin, Doney and Maller2, Theorem 2.1] is applicable. Note that, whenever $p>1$ and L is of finite variation with non-zero drift, we have $\lim_{t\downarrow0}t^{-p}|L_t|=\infty$ by [Reference Rogozin14], showing that this case is excluded by the assumption. The almost sure existence of $\lim_{t\downarrow0}t^{-p}L_t$ further implies that the Lévy measure $\nu_L$ of L satisfies (cf. [Reference Bertoin, Doney and Maller2]) $\int_{[-1,1]}|x|^{1/p}\nu_L(\textrm{d} x)<\infty$. Noting that $\Delta[L,L]_t=f(\Delta L_t)$ for $f(x)=x^2$, it follows that $\nu_{[L,L]}(B)=\nu_L(f^{-1}(B))$ for all sets $B\subseteq[-1,1]$. Treating $\nu_{[L,L]}$ as an image measure, we obtain
\begin{equation*}
\int_{[-1,1]}|y|^{1/(2p)}\,\nu_{[L,L]}(\textrm{d} y)=\int_{[-1,1]}|x^2|^{1/(2p)}\,\nu_L(\textrm{d} x)=\int_{[-1,1]}|x|^{1/p}\,\nu_L(\textrm{d} x)<\infty.
\end{equation*}
Thus, the quadratic variation satisfies the same integral condition with 2p instead of p. As $[L,L]_t$ is a bounded variation Lévy process without drift, part (i) of [Reference Bertoin, Doney and Maller2, Theorem 2.1] yields the claim in the last case.
Proof of Corollary 1. As the scaling function is of the form $f(t)=t^{1/2}\ell(1/t)$ with a slowly varying function $\ell$ by assumption, the almost sure boundedness of the cluster points of $L_t/f(t)$ in $\mathbb{R}^d$ implies that, for all $\varepsilon\in\big(0,\frac12\big)$ ,
Thus, Theorem 1 is applicable with $p/2=\frac12-\varepsilon$ , yielding
Using the explicit form of f and choosing $\varepsilon\in(0,1/4)$ , it follows that
which is (10), and the remaining claims follow by analogy with the proof of Theorem 1.
Now let L be a real-valued Lévy process with Lévy measure $\nu_L$ . For $x>0$ , we write $\overline{\Pi}^{(+)}_L(x) = \nu_L((x,\infty))$ , $\overline{\Pi}^{(-)}_L(x) = \nu_L((-\infty,-x))$ , and $\overline{\Pi}_L(x) = \overline{\Pi}^{(+)}_L(x) + \overline{\Pi}^{(-)}_L(x)$ for the tail function of $\nu_L$ .
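For instance, for the symmetric Lévy measure $\nu_L(\textrm{d} x)=\exp(-|x|)\textbf{1}_{[-1,1]}(x)\,\textrm{d} x$ mentioned in Section 3, we have, for $x\in(0,1)$,
\begin{equation*}
\overline{\Pi}^{(+)}_L(x)=\overline{\Pi}^{(-)}_L(x)=\int_x^1\mathrm{e}^{-u}\,\textrm{d} u=\mathrm{e}^{-x}-\mathrm{e}^{-1},\qquad \overline{\Pi}_L(x)=2\bigl(\mathrm{e}^{-x}-\mathrm{e}^{-1}\bigr),
\end{equation*}
while $\overline{\Pi}_L(x)=0$ for $x\geq1$.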
Proof of Corollary 3. If $\alpha=2$, i.e. Y is normally distributed, the convergence of $L_t/f(t)$ implies that
for all $x>0$ and $\#\in\{+,-\}$ by [Reference Maller and Mason11, Proposition 4.1]. Choosing $x=1$, note that the condition (15) is not sufficient to imply the integrability of $\overline{\Pi}^{(\#)}_L(f(t))$ over [0,1]. However, since the distribution of Y is non-degenerate, the scaling function f is regularly varying with index $\frac12$ at zero (see [Reference Doney and Maller5, Theorem 2.5]). Thus, we also have $\lim_{t\downarrow0}t\overline{\Pi}^{(\#)}_L(t^{1/2-\varepsilon})=0$. This yields the estimate
where $C_t$ is bounded as $t\downarrow0$ and the function is thus integrable over [0,1] for $0\leq k<1$. By assumption, L does not have a Gaussian component and the drift of the process is equal to zero whenever it is defined. Hence, $\int_{0+}^1\overline{\Pi}^{(\#)}_L\big(t^{(1/2-\varepsilon)k}\big)\,\textrm{d} t<\infty$ for both $\#=+$ and $\#=-$, and thus $\lim_{t\downarrow0}t^{-(1/2-\varepsilon)k}L_t=0$ a.s. by [Reference Bertoin, Doney and Maller2, Theorem 2.1]. Applying Theorem 1, we obtain
It now follows for $k-2\varepsilon k>\frac12$ that
which yields the desired convergence of $f(t)^{-1}(X_t-x)$ . If Y follows a non-degenerate stable law with index $\alpha\in(0,2)$ , the right-hand side of (15) is replaced by the tail function $\overline{\Pi}^{\#}_Y(x)$ (see [Reference Maller and Mason11, Proposition 4.1]), and it follows from the proof of [Reference Maller and Mason11, Theorem 2.3] that the scaling function f is regularly varying with index $1/\alpha$ at zero in this case. Thus, we can derive a bound similar to (16) and argue as before. Noting that [Reference Maller and Mason11, Proposition 4.1] does not require the law of the limiting random variable to be non-degenerate, the argument is also applicable if Y is a.s. constant and f is regularly varying with index $r\in\big(0,\frac12\big]$ at zero.
Acknowledgement
I would like to thank Alexander Lindner for suggesting this topic and supervising my work on the project.
Funding information
There are no funding bodies to thank relating to the creation of this article.
Competing interests
There are no competing interests to declare.