1. Introduction
We initiate the computation of the Esscher martingale measure for a class of general semimartingales whose predictable compensators have shot-noise and Hawkes process flavors.
Many different models to describe the evolution of financial assets have been proposed in the literature. These include the exponential Lévy processes (Merton, Reference Merton1976; Carr and Geman, Reference Carr and Geman2002) with their complements of stochastic volatility counterparts (cf. e.g. Hull and White, Reference Hull and White1987; Barndorff-Nielsen and Shephard, Reference Barndorff-Nielsen and Shephard2001). One approach to modelling the stock price process using a class of pure-jump stochastic processes has been launched by Eberlein and Jacod (Reference Eberlein and Jacod1997), with favourable arguments addressing the empirical realities and their implications of the no-arbitrage principle (Madan et al., Reference Madan, Carr and Chang1998). The price process we consider is capable of capturing contagion and jump clustering, a phenomenon that is empirically relevant when financial markets are in distress. Under these circumstances, stock price crashes occur more frequently than predicted by standard stochastic volatility models (Aït-Sahalia et al., Reference Aït-Sahalia, Cacho-Diaz and Laeven2015). In particular, asset price crashes tend to cluster over short time spans, and standard models are unable to replicate this pattern of crash clustering. Therefore, related Hawkes models have been proposed to better replicate the empirical patterns found in asset returns and lead to an improved fit for option pricing models (Boswijk and Lalu, 2016).
In particular, we study the Esscher martingale measure when the asset price follows a stochastic differential equation driven by both a Brownian motion and the counting process martingale whose predictable compensator follows shot-noise and Hawkes processes (Brémaud and Massoulié, Reference Brémaud and Massoulié2002; Dassios and Zhao, Reference Dassios and Zhao2011; Boumezoued, Reference Boumezoued2016). Such models typically generate markets which are incomplete, wherein there exist infinitely many martingale measures which are equivalent to the physical measure describing the evolution of the underlying stock price process. Each equivalent martingale measure corresponds to a set of derivative prices respecting the no-arbitrage principle. The Esscher martingale measure for Lévy processes and stochastic volatility models is well studied (Hubalek and Sgarra, Reference Hubalek and Sgarra2009). On another spectrum, Dassios and Zhao (Reference Dassios and Zhao2012) discussed the Esscher transform for a generalized contagion model for the purpose of importance sampling and ruin probability computations. The authors showed that under the Esscher transform, ruin becomes certain, and this facilitates simulation more efficiently than under the original measure, where the ruin is not certain and is even rare. Similar ideas for improving the simulation of rare events in the spirit of measure changes can be found in Asmussen (Reference Asmussen1985).
However, the Esscher martingale measure for a new class of compound shot-noise and Hawkes point processes has not been examined. Augmented with a Brownian component in a stochastic logarithm framework, we formulate the asset price as a general semimartingale whose drift is modulated by a one-layer neural network (Schmidhuber, Reference Schmidhuber2015). This model may introduce flexibility and extend the standard log-Brownian paradigm behaviour, capturing unusual business cycles or other economic benchmarks. We determine the Esscher martingale measure in concert with the procedures outlined in Kallsen and Shiryaev (Reference Kallsen and Shiryaev2002).
Section 2 summarizes the basic definitions and main results concerning that of shot-noise and Hawkes processes. In Section 3, we present our asset model as well as the basic properties of cumulant processes. In Section 4 we explicate our martingale approach to uncover the Esscher martingale measure. The conditions for absence of arbitrage and existence of solutions are discussed. Finally, the no-arbitrage valuation of derivatives is detailed in Section 5. We give some concluding remarks in Section 6. In the present text, we generally use the notation of Jacod and Shiryaev (Reference Jacod and Shiryaev2003), where $\mu_N$ and $\nu_N(dx,dt)$ respectively denote the jump measure associated to N and its compensator.
2. Preliminaries and point processes
Let $(\Omega,\mathcal{G},{\mathbb{P}})$ be a probability space. We fix a finite time horizon T, and all stochastic processes are defined on [0, T]. Let N be a simple point process on $\mathbb{R}$ , that is, a family $\{N(W)\}_{W\in\mathcal{B}(\mathbb{R})}$ of random variables with values in $\{0,1,2,...\}\,\cup\,\{\infty\}$ indexed by the Borel $\sigma$ -algebra $\mathcal{B}(\mathbb{R})$ of the real line $\mathbb{R}$ , where $N(W)=\sum_{n\in\mathbb{Z}}\mathbb{1}_W(\tau_n)$ and $(\tau_n)_{n\in\mathbb{Z}}$ is a sequence of extended real-valued random variables such that almost surely (a.s.) $\tau_0\leq 0<\tau_1$ and $\tau_n<\tau_{n+1}$ on $\{\tau_n<\infty\}\cap\{\tau_{n+1}>-\infty\}$ for every $n\in\mathbb{Z}$ . We further assume that it is nonexplosive, wherein $\tau_n\rightarrow \infty$ a.s. as $n\rightarrow\infty$ (see p. 47 of Daley and Vere-Jones, Reference Daley and Vere-Jones2003 and p. 8 of Last and Brandt, Reference Last and Brandt1995, respectively). Let $\mathcal{H}_t=\sigma(N(W),\ W\in\mathcal{B}(\mathbb{R}),\ W\subset(\!-\!\infty,t))$ . We also assume that our probability space supports a Brownian motion B, and denote its ${\mathbb{P}}$ -augmented natural filtration by $(\mathcal{F}_t)$ . Let $(\mathcal{G}_t)=(\mathcal{H}_t)\vee(\mathcal{F}_t)$ . We assume the following martingale invariance property: all $(\mathcal{F}_t)$ - and $(\mathcal{H}_t)$ -martingales remain martingales in the larger filtration $(\mathcal{G}_t)$ .
The process $\lambda_t$ is called the $\mathcal{G}_t$ -conditional intensity of N if for all intervals (s,t], we have
We use the notation $N_t\;:\!=\;N(0,t]$ to denote the number of points in the interval (0, t]. The linear Hawkes process with levels of excitation $h\;:\;\mathbb{R}_+\mapsto\mathbb{R}_+$ is a simple point process N admitting the $\mathcal{G}_t$ -intensity
where $\bar{\lambda}_t>0$ , h (resp. $\breve{h}$ ) is such that $h(\!\cdot\!)\;:\;\mathbb{R}^{+}\rightarrow\mathbb{R}^+$ , and we always assume that $\|h\|_{L^1}=\int_0^{\infty}h(t)dt<\infty$ , which ensures stationarity in the point process N (Daley and Vere-Jones, Reference Daley and Vere-Jones2003). $\breve{N}$ is an inhomogeneous Poisson process with non-random and bounded intensity $\sigma$ . The sequence of event times u gives the jump times of N, and this models the Hawkes or self-exciting property; they are endogenous by construction. By the same token, exogenous events occur at times $\breve{u}$ ; this is known as the shot-noise or commonly referred to as the ‘Cox component’ (Cox and Isham, Reference Cox and Isham1980). Furthermore, the point process martingale $\widetilde{N}$ associated with the Hawkes process is given by
Characterizations of h and $\breve{h}$ . One such function for h (and that of $\breve{h}$ ) takes the following form: $h(t)=\alpha e^{-\delta t}$ , $t\in[0,T]$ , where $\alpha>0$ and $\delta>0$ . Furthermore, we assume that $\delta>\alpha$ (resp. $\breve{\alpha}$ ), thus guaranteeing stationarity of N (Brémaud and Massoulié, Reference Brémaud and Massoulié2002; Dassios and Zhao, Reference Dassios and Zhao2011). In the case of
where $\lambda_0>0$ is the initial intensity at time $t=0$ , and ${\lambda^{\flat}}\geq 0$ is a constant with ${\lambda}^{\flat}<\lambda_0$ , we remark that $\lambda_t$ is Markovian and the pair $(N_t,\lambda_t)_{t\in[0,T]}$ forms a Markov process (Oakes, Reference Oakes1975). Another widely used function is the so-called power decay function, which is particularly used in the field of seismology; it takes the form $h(t)=\beta(t+\gamma)^{-\eta-1}$ , where $\beta\geq 0$ , $\gamma>0$ , $\eta>0$ , and $\eta\gamma^{\eta}>\beta$ for stationary of the point process to hold. When $\eta=0$ , we recover the Omori formula (Omori, Reference Omori1894). When no constraints are placed on $\eta$ , we obtain the modified Omori formula (Utsu et al., Reference Utsu, Ogata and Matsu’ura1995). For further discussion of this type of function, we refer the reader to Ogata (Reference Ogata1998).
3. Structure condition and cumulant processes
We choose the money market account as numéraire, and the interest rate is taken to be null. We assume that the asset price dynamics $\tilde{S}$ under ${\mathbb{P}}$ is given by
where $\mu_N$ denotes the jump measure of N while $\nu_N$ denotes its predictable compensator. In our framework of compound shot-noise and Hawkes point processes, we no end have
The deterministic function $a^{{\boldsymbol{\theta}}}_t$ , representing the drift of the stock price process, is formulated to be a flexible family of functions in the form of a one-layer neural network,
where $g_1(\!\cdot\!), g_2(\!\cdot\!)$ are bounded activation functions, and K is some number of hidden units (Zell, Reference Zell1997). The quantity ${\boldsymbol{\theta}}$ denotes the set of parameters $\{\alpha,(\beta_k,\gamma_k,\pi_k)_{k=1}^K\}$ , which modulate the drift of the stock price process. Each term $g_2(\gamma_k\cdot t+\pi_k)$ may be seen as a hidden representation of the input time. By choosing K sufficiently large, we can model complex nonlinear drift of the stock price process. Some choices of activation functions include the sigmoid $g_2(z)=(1+e^{-z})^{-1}$ , the inverse square $g_2(z)=(1+z^2)^{-1}$ , the hyperbolic tangent $g_2(z)=\tanh\!(z)$ , and the bounded rectified linear activation function (or ReLU for short), given by $g_2(z)=\min\!(\!\max\!(0,z),D)$ for some predetermined $D>0$ that defines the maximal output value that this function could produce (Schmidhuber, Reference Schmidhuber2015). Furthermore, we assume that $b>\epsilon$ and $f^{H}> -1$ are deterministic and bounded.
To sum up, we work with the following restrictions on the parameters $(a^{{\boldsymbol{\theta}}},b,f^H)$ .
Assumption 1. The quantities $a_t,b_t>\epsilon$ and $f^H_t$ are deterministic, bounded functions with
We often suppress the time variable in the notation, in accordance with convention. Note that $\tilde{S}$ is the stochastic exponential of S, and thus solving the differential equation yields
where $\mathcal{E}$ is the Doléans-Dade exponential. Our main goal in this manuscript is to calculate as explicitly as possible the Esscher martingale measure in the sense of Kallsen and Shiryaev (Reference Kallsen and Shiryaev2002) for the stochastic logarithm $S=\int d\tilde{S}/\tilde{S}_{-}$ , with probability measures ${\mathbb{Q}}$ such that S is a local ${\mathbb{Q}}$ -martingale. By Theorem III.33 of Protter (Reference Protter2005), the price process $\tilde{S}=\int \tilde{S}_{-} dS$ is a local ${\mathbb{Q}}$ -martingale as well. Moreover we assume that the asset price process S satisfies the following structure condition: the decomposition of S can be uniquely written in the form $S = M + A$ , where
and $\rho$ is a predictable process satisfying $K_T\;:\!=\;\int_0^T \rho_t^2\,d\langle M,M\rangle_t <\infty$ , ${\mathbb{P}}$ -a.s. (Ansel and Stricker, Reference Ansel and Stricker1992). The notation $\langle M,M\rangle$ denotes the predictable compensator of the quadratic variation process [M, M]. In our Hawkes–Cox-driven price process from Equation (3.1), it is readily computed that
as well as that
since both the point processes N and $\breve{N}$ are nonexplosive and $\lambda_t$ is finite for $t\in[0,T]$ . Hence S is a special semimartingale (see p. 209 in He et al. (Reference He, Wang and Yan1998)).
One potential approach for the evaluation of a pricing measure is related to the Esscher martingale measure (Madan and Milne, Reference Madan and Milne1991; Gerber and Shiu, Reference Gerber and Shiu1994; Eberlein and Keller, Reference Eberlein and Keller1995; Chan, Reference Chan1999). In this section, we recall some definitions from Kallsen and Shiryaev (Reference Kallsen and Shiryaev2002), which lays out an extended abstract theory for Esscher martingale transforms for general semimartingales.
We remark that if X is a semimartingale, then L(X) denotes the set of predictable X-integrable processes. Recall that a semimartingale X is called special if it can be decomposed as $X = X_0 + M + A$ for some local martingale M and some predictable process A of finite variation, null at $0.$ We denote by $\mathcal{M}_{\textit{loc}}$ the class of all local martingales.
Definition 1. Let X be a real-valued semimartingale. X is called exponentially special if $\exp\!(X-X_0)$ is a special semimartingale. A predictable process A is called the exponential compensator of X if $\exp (X - {X_0} - A) \in {{\cal M}_{{\rm{loc}}}}$ .
Definition 2. The modified Laplace cumulant process $K^X(\vartheta)$ of X in $\vartheta$ is defined to be the exponential compensator of $\int\vartheta dX$ .
Lemma 1. Let $\vartheta\in L(S)$ be such that $\int \vartheta dS$ is exponentially special. Then the modified Laplace cumulant process of S in $\vartheta$ is given by
where
Proof. This follows immediately from Theorem 2.18 of Kallsen and Shiryaev (Reference Kallsen and Shiryaev2002) following the semimartingale characteristics.
The derivative of the cumulant process $DK^{S}(\vartheta)=\int D\tilde{\kappa}^S(\vartheta)\lambda dt$ in the sense of Definition 2.22 in Kallsen and Shiryaev (Reference Kallsen and Shiryaev2002) is given by
Assume that there exists a solution $\vartheta^{\sharp}$ to the martingale equation
In the case when
is a martingale, we can then define a probability measure ${\mathbb{Q}}^{\sharp}$ by
The density process $(G^{\sharp}_t)_{t\in[0,T]}$ now defines a unique equivalent martingale measure ${\mathbb{Q}}^{\sharp}$ for S, known as the Esscher martingale measure for the process S (Theorems 4.4 and 4.5 in Kallsen and Shiryaev, Reference Kallsen and Shiryaev2002).
4. Esscher martingale measure
In this section, we spell out the necessary and sufficient conditions for $(G^{\sharp}_t)_{t\in[0,T]}$ to be an Esscher martingale measure.
Lemma 2. Define the following functions:
Then there exists a bounded function $\vartheta_t^{\sharp}\;:\;[0,T]\rightarrow\mathbb{R}$ solving $m_1(\vartheta_t^{\sharp})=m_2(\vartheta_t^{\sharp})$ for all $t\in[0,T]$ and hence Equation (3.12).
Proof. Observe that for a fixed $t\in[0,T]$ , $m_2$ is decreasing in $\vartheta$ , since
using the fact that $\lambda>0$ is well defined for each $t\in[0,T]$ and b is bounded. Also, note that $m_2(\!-\!\infty)=\infty$ , $m_2(\infty)=-\infty$ . Moreover, $m_1$ is bounded since $a^{\boldsymbol{\theta}}$ is bounded. Hence we conclude by the mean value theorem that there exists for all $t\in[0,T]$ a number $\vartheta_t^{\sharp}$ such that $m_1(\vartheta_t^{\sharp})=m_2(\vartheta_t^{\sharp})$ . Note that $(m_2)^{-1}(\!\sup a^{\boldsymbol{\theta}})\leq\vartheta^{\sharp}\leq (m_2)^{-1}(\!\inf a^{\boldsymbol{\theta}})$ , where $(m_2)^{-1}$ is the inverse function of $m_2$ ; hence $t\mapsto\vartheta_t^{\sharp}$ is a bounded function on [0, T].
We now turn our attention to proving that the process $G^{\sharp}$ is a martingale. For this we will need the following criterion, due to the criterion of Theorem 9 in Protter and Shimbo (Reference Protter and Shimbo2008).
Proposition 1. Let $\widetilde{M}$ be a locally square-integrable martingale such that $\Delta\widetilde{M}>-1$ . If
where $\widetilde{M}^c$ and $\widetilde{M}^d$ are the continuous and purely discontinuous martingale parts of $\widetilde{M}$ , then the Doléans-Dade $\mathcal{E}(\widetilde{M})$ is a strictly positive martingale on [0, T].
Lemma 3. Define the following processes:
with
Then $G^{\sharp}$ is a martingale.
Proof. We proceed to evaluate the following quantity:
with probability 1, by the boundedness of the coefficients and that of $\vartheta^{\sharp}$ , and using the fact that our point processes are nonexplosive. It follows that ${\mathbb{E}}[\!\exp(\frac{1}{2}\langle N,N\rangle^c_T+\langle N,N\rangle^d_T)]<\infty$ . Hence the assertion follows from Proposition 1.
Following the existence of solutions with the martingale property ensured, we state the main result with regard to the Esscher measure.
Theorem 2. Let the conditions of Assumption 1 hold. Then there exists a function $\vartheta^{\sharp}\;:\;[0,T]\rightarrow\mathbb{R}$ with $\vartheta^{\sharp}_t\;:\!=\;\vartheta^{\sharp}(t)$ which solves the martingale equation (3.12) for any $t\in[0,T]$ , i.e.
Furthermore, the derivative
defines a probability measure ${\mathbb{Q}}^{\sharp}\sim{\mathbb{P}}$ on $\mathcal{G}_T$ , which is called the Esscher martingale measure for S. Moreover, under ${\mathbb{Q}}^{\sharp}$ , we have that
is a ${\mathbb{Q}}^{\sharp}$ -Brownian motion, and the predictable compensator $\nu^{\sharp}_N$ under ${\mathbb{Q}}^{\sharp}$ is given by
Proof. From Lemma 2, there exists a function $\vartheta^{\sharp}$ solving Equation (3.12). To complete the proof, we apply Theorem 4.4 in Kallsen and Shiryaev (Reference Kallsen and Shiryaev2002) to conclude that the density in Equation (4.6) defines an equivalent local martingale measure for $\mathcal{E}(S)$ . By Lemma 3, we have that $G^{\sharp}$ is a proper martingale and thus defines a density process. They dynamics under ${\mathbb{Q}}^{\sharp}$ follows from Girsanov’s theorem (cf. Theorem III.40 in Protter (Reference Protter2005)).
Remark 1.
-
(1) The role of $\vartheta^{\sharp}$ and $a^{\boldsymbol{\theta}}$ . For the Esscher martingale measure to exist, the choice of $\boldsymbol{\theta}$ and the concrete specification of the drift $a^{\boldsymbol{\theta}}$ must be such that there exists a solution $\vartheta^{\sharp}$ that solves the martingale equation (3.12); otherwise we say that the Esscher martingale measure does not exist.
-
(2) Family of drifts and existence of $\vartheta^{\sharp}$ . It is well known that the dynamics of asset returns cannot be adequately described with constant drift and volatility (Mandelbrot and Taylor, Reference Mandelbrot and Taylor1967; Clark, Reference Clark1973). There have been many attempts to extend the model by describing the evolution of drift and volatility beyond constants (cf. e.g. Masi et al., Reference Masi, Kabanov and Runggaldier1995, and references therein).
-
(3) Special cases. In our case, the formulation of the flexible family of drifts in Equation (3.1) is an interesting one provided that $\vartheta^{\sharp}$ exists. This one-layer neural network drift naturally subsumes the standard constant drift as a special case if we let $\beta_k=\gamma_k=\pi_k=0$ . Second, for identity activation of $g_1(\!\cdot\!)$ and $g_2(\!\cdot\!)$ a suitable indicator function, we get an $a^{\boldsymbol{\theta}}_t$ that is piecewise constant on intervals and could be utilized as part of modelling a regime switching; see for example Çetin and Verschuere (Reference Çetin and Verschuere2009) and Elliott and Siu (Reference Elliott and Siu2010).
Example (Hawkes with neural network drift). Suppose the stock price process takes the form
where f is a bounded function independent of time t and b is a constant. We set $a_t^{\boldsymbol{\theta}}$ to be
where $\boldsymbol{\theta}=(\alpha,\beta_1,\beta_2,\gamma_1,\gamma_2,\pi_1,\pi_2)$ , all of which are taken to be positive constants. We remark that Assumption 1 is satisfied for these choices of $a^{\boldsymbol{\theta}},b,$ and f. We set $g_1(\!\cdot\!)$ and $g_2(\!\cdot\!)$ to be the identity and the sigmoid activation functions, respectively, while setting the number of hidden units to $K = 2$ . We first check for existence of a solution $\vartheta^{\sharp}$ to the martingale equation (3.12), which in this example returns
Define the quantities
Invoke Lemma 2 to ensure the existence of a solution $\vartheta^{\sharp}$ to Equation (4.11). An illustration is given in Figure 1.
From Theorem 2, we see that the process
is a ${\mathbb{Q}}^{\sharp}$ -Brownian motion, where we have defined
Furthermore, the quantity
is the predictable compensator $\nu^{\sharp}_N$ of $\mu_N$ under ${\mathbb{Q}}^{\sharp}$ . The process $\sigma^{\sharp}$ represents the risk premium associated with the continuous martingale part B, and the predictable function $e^{\vartheta_t^{\sharp}(e^{f^H(x)}-1)}-1$ is interpreted as the risk premium associated with the jumps of the discontinuous part of $\mu_N$ (Theorem III.3.23 in Jacod and Shiryaev, Reference Jacod and Shiryaev2003), where $\vartheta^{\sharp}$ solves Equation (4.11).
5. Valuation of contingent claims
Having computed the Esscher martingale measure, we now detail the optimal hedging strategy as well as the differential equations for the prices of derivatives under this measure.
5.1. Mean-variance hedging
One criterion for hedging is mean-variance hedging (Bouleau and Lamberton, Reference Bouleau and Lamberton1989). Consider a contingent claim with maturity $T>0$ , defined by a $\mathcal{G}_T$ -measurable random variable H. Define the initial capital $V_0$ and a trading strategy which will be defined by a portfolio $(\psi^0_t,\psi_t)_{t\in[0,T]}$ taking values in $\mathbb{R}^2$ . As before, we choose the savings account as numéraire and work with a zero interest rate. In mean-variance hedging, we look for a self-financing trading strategy given by an initial capital $V_0$ and a portfolio over the lifetime of the contingent claim which minimizes the shortfall at the terminal date T in a mean square sense:
where
In particular, we have chosen a pricing rule given by the Esscher martingale measure ${\mathbb{Q}}^{\sharp}$ . Let $H\in \mathcal{L}^2(\Omega,\mathbb{G},{{\mathbb{Q}}^{\sharp}})$ ; H has finite variance and $(\tilde{S}_t)_{t\in[0,T]}$ is a square-integrable ${{\mathbb{Q}}^{\sharp}}$ -martingale. Let us consider those $(\psi_t)_{t\in[0,T]}$ whose terminal values satisfy
Define $\mathcal{L}^2(\tilde{S})$ as the set of portfolios $\psi$ satisfying ( $5.1$ ). Since $\psi\in\mathcal{L}^2(\tilde{S})$ , and using the fact that $\tilde{S}$ is a martingale under ${{\mathbb{Q}}^{\sharp}}$ , the gains process $G(\psi)=\int_0^{\cdot} \psi d\tilde{S}$ is also a square-integrable martingale. The mean-variance hedging problem can now be recast as
or equivalently
The condition ( $5.1$ ) implies that the value process V is a square-integrable martingale, and we have ${\mathbb{E}_{\mathbb{Q}^{\sharp}}}[V_T]=V_0$ . Applying the identity $\mathbb{E}(Z^2)=(\mathbb{E}(Z))^2+\mathbb{V}(Z)$ to the random variable $Z\;:\!=\;H-V_0-G_T(\psi)$ , we obtain
If the writer of the contingent claim tries to minimize the residual risk $J_0(V_0,\psi)$ , the optimal value that he/she will ask for is a premium
We see that ${\mathbb{E}_{\mathbb{Q}^{\sharp}}}[H]$ is the initial value of any strategy $\psi\in \mathcal{L}^2(\tilde{S})$ designed to minimize the shortfall at maturity, and we take this as the definition of the price associated with our contingent claim H at time 0. By the same token, if the writer sells the option at time $t>0$ and intends to minimize the remaining risk $J_t(V_t,\psi)\;:\!=\;{\mathbb{E}_{\mathbb{Q}^{\sharp}}}\left( \left|H-V_0-{\int_0^T}\psi_td\tilde{S}_t \right| ^2 \big| \mathcal{G}_t\right)$ , he/she will ask a premium $V_t={\mathbb{E}_{\mathbb{Q}^{\sharp}}}(H\,|\,\mathcal{G}_t)$ . We will take this quantity to define the price of the contingent claim H at time t under the Esscher measure ${{\mathbb{Q}}^{\sharp}}$ .
For illustrative purposes, in the framework of pricing and hedging derivatives, we work with the asset price process in Equation (3.1), setting $a^{\boldsymbol{\theta}}_t$ and $b_t$ to be constants $a>0$ and $b>0$ , respectively, and setting $f^H_t(x)=f^H(x)$ to be a bounded function independent of time. Furthermore, we assume that $h(t)=\alpha e^{-\delta t}$ and, without loss of generality, that $\breve{h}=0$ in Equation (2.2). With this, we see that $(N,\lambda)$ is jointly Markov and $\lambda_t$ can be recorded with an alternative expression
to Equation (2.2) when $\bar\lambda$ takes the form in Equation (2.4). From Theorem 2, the Esscher martingale measure ${{\mathbb{Q}}^{\sharp}}$ exists and the dynamics of $\tilde{S}$ takes the form
where the quantities ${{\mathbb{Q}}^{\sharp}}$ and $\nu^{\sharp}$ are defined in Theorem 2, and $\vartheta^{\sharp}$ solves Equation (3.12).
5.1.1. Pricing
Consider a contingent claim of European type with maturity T and payoff $H(\tilde{S}_T)$ . The payoff H is assumed to satisfy
for some $k>0$ . For put and call options, this assumption is satisfied with the choice of $k=1$ . The value $C_t$ of such a derivative claim is given by $C_t=C(t,\tilde{S}_t,\lambda_t)$ , where
Let H be a contingent claim with terminal payoff $H=H(\tilde{S}_T)$ . Then the price of H at time t is given by $C(t,s,\lambda)$ where
Assume that $C\in \mathcal{C}^{1,2}$ , i.e., the functions $(t,s,\lambda)\mapsto C(t,\tilde{S},\lambda)$ are continuously differentiable with respect to t and twice continuously differentiable with respect to s. Hence the Itô formula can be applied to $C_t=C(t,\tilde{S}_t,\lambda_t)$ between 0 and T. Define
since $H\geq0$ . Applying the Doob inequality to the martingale C yields
since H has finite variance. By the Burkholder–Davis–Gundy inequality, there exists a constant $c>0$ such that
Hence we obtain
This implies that C is a square-integrable ${{\mathbb{Q}}^{\sharp}}$ -martingale. With the expression of (5.2) and by Corollary I-3.16 of Jacod and Shiryaev (Reference Jacod and Shiryaev2003), we conclude that the finite-variation term vanishes, giving us the PDE
with terminal condition
Having derived the Esscher price for European-type contracts, we now characterize the optimal hedging strategy.
5.1.2. Optimal hedging strategy
Recall that under ${{\mathbb{Q}}^{\sharp}}$ , the stock price process $\tilde{S}$ is a square-integrable martingale. Consider now a self-financing trading strategy with $\psi\in L^2(\tilde{S})$ ; the value process V of the portfolio is also a martingale whose value at maturity $T>0$ is
Also, $C(T,\tilde{S}_T,\lambda_T)=H(\tilde{S}_T)=H$ and $C(0,\tilde{S}_0,\lambda_0)={\mathbb{E}_{\mathbb{Q}^{\sharp}}}[H(\tilde{S}_T)]=V_0$ , so that
Taking into consideration Equation (5.5), we evaluate the quantity
so that
To obtain the optimal risk-minimizing hedge, we minimize the above expression with respect to $\psi$ :
Hence we have that the minimal-risk hedge amounts to holding a position in the underlying ${(S_t)_{t>0}}$ equal to $\psi_t=\Delta(t,{\tilde{S}_{t-}},\lambda_{t-})$ , where
A sanity check for convexity yields that
6. Postlude
(1) Integro-differential equations. The assumption that the kernel is of exponential form, i.e., $h(t)=\alpha e^{-\delta t}$ , in Equation (2.2) is instrumental to the results derived in Section 5, owing to the joint Markovian structure of N and $\lambda$ . For general kernels h, it is unclear whether explicit expressions can be obtained. This is due to the non-Markovian nature of the pair $(N,\lambda)$ induced by such kernel functions (cf. discussions in Boumezoued, Reference Boumezoued2016, and Gao et al., Reference Gao, Zhou and Zhu2018).
Our derivations and heuristics suggest that the same result should hold for a wide class of parameter specifications satisfying Assumption 1, provided that Lemma 2 is respected, guaranteeing the existence of $\vartheta^{\sharp}$ to Equation (3.12). In Section 5, we stick to simple specifications for the sake of illustration. Furthermore, since the payoff functions H satisfy the Lipschitz condition in Equation (5.4), these results hold for put and call options, as well as any combinations of these, such as butterfly spreads and strangles, to name a few.
(2) Other martingale measures. Given the emergence of new financial markets where an established market for exchange-traded derivatives has yet to emerge, it is often not possible to identify the statistical martingale measure (see Chapter 6 in Schoutens, Reference Schoutens2003) for the purpose of derivative valuation. The statistical martingale measure is a pricing measure ‘chosen by the market’ through the minimization of some distance between the observed and theoretical option prices. It may be tricky to compute this measure in the absence of a liquid market with actively traded vanilla options written on the underlying asset.
To overcome this problem, we propose the use of the Esscher martingale measure, which, under general conditions, minimizes the entropy-Hellinger distance. This observation has been documented; see Remark 10 in Hubalek and Sgarra, Reference Hubalek and Sgarra2009, which explicates the relationships (cf. Theorem 4.3 in Choulli and Stricker, Reference Choulli and Stricker2005, and Theorem 4.4 in Kallsen and Shiryaev, Reference Kallsen and Shiryaev2002). Other martingale measures, such as the minimal entropy martingale measure (Delbaen et al., Reference Delbaen2002), are often chosen as well. That said, the identification of this measure is quite involved and goes beyond the scope of this paper, so it is left to future research.
(3) Open problems. We have studied the Esscher martingale measure for a general stochastic differential equation augmented by a counting process whose predictable compensator has shot-noise and Hawkes elements. The techniques of proof are fairly general and may be applied to various extensions of the proposed framework. One possibility may be to generalize the Cox element $\breve{N}$ to a Lévy subordinator, which is a.s. increasing.
Acknowledgements
We are thankful to three anonymous referees for their helpful comments on improving the manuscript. This work was conducted while Y. L. was at Harvard University.
Competing interests
There were no competing interests to declare which arose during the preparation or publication process of this article.