Hydrodynamics of Markets: Hidden Links between Physics and Finance

Alexander Lipton

doi:10.1017/9781009503129

6accdae13eff 7i3l9n4o4qrr4s8t12ux

Letter from Isaac Newton to Henry Oldenburg, 24 October 1676¹

Bob Montagnet:

Yeah, good choice, Vlad.

Get back to the security system. How does it work?

Vlad:

The way everything works. Mathematics.

“The Good Thief,” screenplay by Neil Jordan, 2002

1 Introduction

1.1 Background

Newton’s discovery of differential equations and calculus was crucial in developing classical mechanics because it allowed for the mathematical description of the motion of objects. This discovery took a groundbreaking step in unifying mathematics with physics, enabling the prediction of planetary orbits, the motion of objects under various forces, and much more, and marked the beginning of a new era in mathematics and science, laying the cornerstone for over three centuries of advancements.

Newton understood the immediate impact of his discoveries and their potential to transform the understanding of the natural world. To establish and protect his intellectual property rights at the same time, he concealed his discovery in the fundamental anagram of calculus, which he included in his 1676 letter to Oldenburg. This anagram contained a Latin statement describing the method of fluxions (his term for calculus) when decoded. The need for an anagram reflected that Newton was competitive and cautious in equal measure by balancing the desire for recognition with the fear of disclosure. The number of occurrences of each Latin character in Newton’s sentence agrees with his anagram, thus proving that the actual sentence was written in 1676.Footnote ² The original letter is shown in Figure 1.

Figure 1 Newton’s letter to Oldenburg, 1676.

Reproduced by kind permission of the Syndics of Cambridge University Library.

The fact that differential equations are instrumental in mathematics and physics alike was firmly established in the late seventeenth century. However, methods for solving these equations remained ad hoc for more than a century until the work by Lagrange, Laplace, Fourier, and many other mathematicians and physicists. In particular, the Fourier transform stands out as the most potent tool in an applied mathematician’s toolkit, enabling the solving of linear partial differential equations (PDEs) and partial pseudo-differential equations (PPDEs) with spatially constant coefficients; it is also invaluable for analyzing time series and tackling other critical tasks (Reference FourierFourier (1822); Reference Morse and FeshbachMorse & Feschbach (1953)).

At the heart of the $n$ -dimensional Fourier method are wave functions, expressed as follows:

ℱ (t, x, k) = a (t) \exp (i k \cdot x) ​,

(1.1)

where $x$ and $k$ are $n$ -dimensional vectors, $\cdot$ denotes the scalar product, $a (t)$ is the amplitude, and $k \cdot x$ is the phase. Depending on the particular problem at hand, the amplitude $a (t)$ can be a scalar or a vector, hence the notation. Substituting $F$ into a PDE with spatially constant coefficients, one reduces the problem of interest to a system of ordinary differential equations (ODEs) or a single ODE when $a (t)$ is scalar. Of course, this system parametrically depends on $k .$

This Element studies PDEs and PPDEs with coefficients linearly dependent on $x,$ which are called affine. Hence, one must use a more general approach and consider wave functions with time-dependent wave vectors: Reference KelvinKelvin (1887) and Reference OrrOrr (1907) were the first to use such waves to analyze the stability of the steady motions of an incompressible fluid.

\begin{matrix} K (t, x, β (t)) = a (t) exp (i β (t) \cdot x) . \end{matrix}

(1.2)

Affine problems are not artificial constructs. They appear organically in several situations, for example, when the linear description of the underlying physical mechanism is either exact or provides an excellent approximation to reality or when the evolution in the phase space is studied; see Section 3.

Subsequently and independently, affine PDEs and the associated wave functions were used by many researchers in various areas, including the theory of stochastic processes, physics, biology, and mathematical finance, to mention a few. The Ornstein–Uhlenbeck (OU) and Feller processes are the simplest but extremely important examples of affine processes; see Reference Uhlenbeck and OrnsteinUhlenbeck and Ornstein (1930), Reference ChandrasekharChandresekhar (1943), and Reference FellerFeller (1951, Reference Feller1952). For financial applications of affine processes see Reference Duffie and KanDuffie and Kan (1996), Reference Duffie, Pan and SingletonDuffie et al. (2000), Reference Dai and SingletonDai and Singleton (2000), Reference LiptonLipton (2001), Reference Duffie, Filipovic and SchachermayerDuffie et al. (2003), Reference SeppSepp (2007), Reference Lipton and SeppLipton and Sepp (2008), and Reference FilipovicFilipovic (2009), among others.

This Element uses Kelvin waves of the form (1.2) to study transition probability density functions (t.p.d.fs) for affine stochastic processes. These processes can be either degenerate, namely, have more independent components than the sources of uncertainty, or nondegenerate, when every component has its source of uncertainty. Recall that the t.p.d.f. for a stochastic process describes the likelihood of a system transitioning from one state to another over a specified period. Knowing the iterated t.p.d.f. is fundamental for understanding the dynamics and behavior of stochastic processes over time and is tantamount to knowing the process itself.

In this Element, Kelvin waves are also used to solve several essential and intricate problems occurring in financial applications. These include pricing options with stochastic volatility, path-dependent options, and Asian options with geometric averaging, among many others.

The main objective is to link various financial engineering topics with their counterparts in hydrodynamics and molecular physics and showcase the interdisciplinary nature of quantitative finance and economic modeling. Finding such connections allows us to understand better how to model, price, and risk-manage various financial instruments, derive several new results, and provide additional intuition regarding their salient features. This Element continues previous efforts in this direction; see Reference Lipton and SeppLipton and Sepp (2008) and Reference LiptonLipton (2018), chapter 12.

There are several approaches one can use to solve affine equations efficiently. For instance, Lie symmetries are a powerful tool for studying certain classes of affine equations. Numerous authors describe general techniques based on Lie symmetries; see, for example, Reference OvsiannikovOvsiannikov (1982), Reference IbragimovIbragimov (1985), Reference OlverOlver (1986), and Reference Bluman and KumeiBluman and Kumei (1989), while their specific applications to affine equations are covered by Reference BerestBerest (1993), Reference AksenovAksenov (1995), Reference Craddock and PlatenCraddock and Platen (2004), Reference CraddockCraddock (2012), and Reference Kovalenko, Stogniy and TertychnyiKovalenko et al. (2014), among many others. However, Lie symmetry techniques are exceedingly cumbersome and might be challenging to use in practice, especially when complicated affine equations are considered.

Laplace transform of spatial variables can be used in some cases, for instance, for Feller processes; see, for example, Reference FellerFeller (1951, Reference Feller1952). However, they are hard to use for solving generic affine equations.

Reductions of a given equation to a simpler, solvable form is another powerful method that can be successfully used in many instances; see, for example, Reference ChandrasekharChandresekhar (1943), Reference Carr, Lipton and MadanCarr et al. (2002), Reference Lipton, Gal and LasisLipton et al. (2014), and Reference LiptonLipton (2018), chapter 9. Although the reduction method is quite powerful, experience suggests it is often hard to use in practice.

Finally, the affine ansatz based on Kelvin waves provides yet another approach, which is the focus of the present Element; see also Reference Duffie and KanDuffie and Kan (1996), Reference Dai and SingletonDai and Singleton (2000), Reference Duffie, Filipovic and SchachermayerDuffie et al. (2003), Reference Lipton and SeppLipton and Sepp (2008), Reference FilipovicFilipovic (2009), and Reference LiptonLipton (2018), chapter 12. Undoubtedly, the affine framework, also known as the affine ansatz, is the most potent among the abovementioned techniques due to its comprehensive nature, versatility, and (relative) ease of use, even in complex situations. In practice, applications of Kelvin waves consist of three steps:

Effectively separating variables for the evolution problems with pseudo-differential generators linearly dependent on spatial coordinates;
Solving ODEs parametrized by time-dependent wave vectors; see (1.2);
Aggregating their solutions together to get the solution to the original problem.

However, despite being a ruthlessly efficient tool, Kelvin waves have limitations – using them to solve evolution problems supplied with external boundary conditions is challenging. This exciting topic is being actively researched now; it will be discussed elsewhere in due course.

1.2 Main Results

This Element develops a coherent, unified mathematical framework using Kelvin waves as a powerful and versatile tool for studying t.p.d.fs in the context of generic affine processes. It discovers previously hidden connections among large classes of apparently unrelated problems from hydrodynamics, molecular physics, and financial engineering. All these problems require solving affine (pseudo-) differential equations, namely, equations with coefficients, which linearly depend on spatial variables. The Element discusses some classical results and derives several original ones related to:

small wave-like perturbations of linear flows of ideal and viscous fluids described by Euler and Navier–Stokes equations, respectively;
motions of free and harmonically bound particles under the impact of random external white-noise forces described by the Klein–Kramers equations and the hypoelliptic Kolmogorov equation, which play an essential role in statistical physics;
Gaussian and non-Gaussian affine processes, such as the Ornstein–Uhlenbeck and Feller processes, which are the archetypal mean-reverting processes, and their generalizations;
dynamics of financial markets, particularly derivative products.

To solve some of the more complicated problems, one must augment primary processes by introducing subordinate processes for auxiliary variables, such as integrals over the original stochastic variable, and develop a uniform mathematical formalism to construct t.p.d.fs for the abovementioned processes.

Quite unexpectedly, the analysis identifies and rectifies an error in the original solution of the Kolmogorov equation. The rectified solution is dimensionally correct, properly scales when the process parameters change, and agrees with numerical results.

Furthermore, this Element derives many original results and extends and reinterprets some well-known ones. For instance, it develops a concise and efficient expression for t.p.d.fs in the case of processes with stochastic volatility. Moreover, the analysis reveals an unexpected similarity between the propagation of vorticity in two-dimensional flows of viscous incompressible fluid and the motion of a harmonically bound particle, which is used to find a new explicit expression for the vorticity of a two-dimensional flow in terms of the Gaussian density.

Finally, the Element applies the new methodology to various financial engineering topics, such as pricing options with stochastic volatility, options with path-dependent volatility, Asian options, volatility and variance swaps, options on stocks with path-dependent volatility, and bonds and bond options. In contrast to the classical approach, the Element treats primary fixed-income products, such as bonds and bond options, as path-dependent, allowing us to gain additional intuition regarding such products’ pricing and risk management. It also highlights the flexibility of the interdisciplinary framework by incorporating additional complexities into the picture, such as jump-diffusion processes and, more generally, processes driven by affine pseudo-differential processes frequently used in financial applications.

1.3 Element Structure

Section 2 introduces Kelvin waves. Section 2.1 introduces the Euler equations, which describe the dynamics of a perfect fluid, alongside the Navier–Stokes equation for viscous incompressible fluids. Section 2.2 discusses the exact equilibria of these equations, focusing on states where velocity varies linearly and pressure quadratically with spatial coordinates, referred to as linear flows. Section 2.3 illustrates that the renowned Kelvin waves provide solutions to the linearized Euler and Navier–Stokes equations for small perturbations of the linear flows. This section also explores the use of Kelvin waves in analyzing the stability of these flows.

The Element uses Kelvin waves as a fundamental tool in the analytical arsenal, demonstrating their applicability across various study areas. For instance, they allow one to discover profound and surprising links between the viscous two-dimensional vorticity equations and the Klein–Kramers equation, a cornerstone of stochastic physics; see Section 6.6. This connection results in a novel formula representing vorticity as a Gaussian density and the stream function as the solution to the associated Poisson equation.

Section 3 investigates the degenerate stochastic process introduced by Kolmogorov in 1934, alongside the associated Fokker–Planck equation and its solution proposed by Kolmogorov. Further connections between the Kolmogorov and Klein–Kramers equations are explored in Section 4. To start with, Section 3 summarizes Kolmogorov’s original findings. Surprisingly, the Fokker–Planck equation, as used by Kolmogorov in his seminal paper, is inconsistent with his initial assumptions regarding the underlying process. Moreover, his proposed solution has dimensional inconsistencies and, as a result, does not satisfy the Fokker–Planck equation and initial conditions. However, there is a silver lining; Kolmogorov’s solution can be corrected via several complementary methods, which the section outlines. It concludes with an example of a representative corrected solution to the Kolmogorov problem.

Section 4 explores a selection of representative affine stochastic processes in statistical physics. First, it introduces the Langevin equation, which describes the dynamics of an underdamped Brownian particle in a potential field. Following this, it derives the Klein–Kramers equation, capturing the probabilistic aspects of the motion of such a particle. It turns out that the Kolmogorov equation derived in Section 3 is a particular case of the Klein–Kramers equation. The section presents Chandrasekhar’s solutions to the Klein–Kramers equations describing free and harmonically bound particles. The Klein–Kramers equation is inherently degenerate, with white noise impacting the particle’s velocity but not its position. It is shown in Section 8 that many path-dependent problems share this characteristic in mathematical finance. For instance, financial variables like the geometric price averages, which serve as the underlying instruments for a particular class of Asian options, can be conceptualized as path integrals, fitting into the category of degenerate stochastic processes.

Section 5 describes backward (Kolmogorov) and forward (Fokker–Planck) equations for t.p.d.fs of multidimensional stochastic jump-diffusion processes. The section explains the significance of studying t.p.d.fs. It sets up the general framework for Kolmogorov and Fokker–Planck equations and identifies the subset of affine stochastic processes amenable to analysis using the Kelvin-wave formalism. Subsequently, the section introduces an augmentation technique, providing a natural approach to tackle degenerate problems. Finally, it illustrates methods for transforming specific nonaffine processes into affine form through coordinate transformations, enhancing the scope of problems accessible by the Kelvin-wave methodology.

Section 6 studies Gaussian stochastic processes. It introduces a general formula for regular Gaussian processes, accommodating both degenerate scenarios and nondegenerate cases, as in Kolmogorov’s example. It expands this formula to address the practically significant scenario of killed Gaussian processes, followed by several illustrative examples. Then, the section presents the derivation of the t.p.d.f. for the Kolmogorov process with time-varying coefficients and explores the OU process with time-dependent coefficients and its extension, the augmented OU process, which models the combined dynamics of the process and its integral. Although the results are classical, their derivation through Kelvin-wave expansions provides a novel and enriching angle, offering an alternative viewpoint for understanding and deriving these established results. Next, the section examines free and harmonically bound particles, contrasting the Kelvin-wave method with Chandrasekhar’s classical approach. Finally, it revisits the basic concepts introduced in Section 2, demonstrating the akin nature of the temporal-spatial evolution of vorticity in the two-dimensional flow of a viscous fluid to the dynamics of a harmonically bound particle. This finding is intriguing and unexpected, forging a connection between seemingly unrelated physical phenomena.

Section 7 considers non-Gaussian processes. It starts with a general formula for non-Gaussian dynamics, accommodating degenerate and nondegenerate processes. Then, it expands this formula to killed processes. Several interesting examples are studied. These examples include a Kolmogorov process driven by anomalous diffusion, Feller processes with constant and time-dependent coefficients, and degenerate and nondegenerate augmented Feller processes. A novel method for investigating finite-time explosions of t.p.d.fs for augmented Feller processes is developed as a helpful by-product of the analysis. In addition, arithmetic Brownian motions with path-dependent volatility and degenerate and nondegenerate arithmetic Brownian motions with stochastic volatility are analyzed in detail.

Section 8 illustrates the application of the methodology to financial engineering. To start with, it lays the foundation of financial engineering, providing a primer for the uninitiated. Then, the section introduces the geometric Brownian motion, a staple in financial modeling, and discusses the modifications necessary to reflect the complexities of financial markets better. Several traditional models, such as Bachelier, Black–Scholes, Heston, and Stein–Stein models, and a novel path-dependent volatility model are explored via the Kelvin-wave formalism. In addition, it is shown how to price Asian options with geometric averaging via the Kolmogorov’s solution described in Section 3. Besides, volatility and variance swaps and swaptions, bonds and bond options are investigated by linking financial formulas to those used in physics for underdamped Brownian motion.

Section 9 succinctly outlines potential future expansions of the work presented in this Element and summarizes the conclusions. Finally, this Element is a revised and expanded version of Reference LiptonLipton (2023).

A note on notation: Given the wide-ranging scope of this Element, from hydrodynamics to molecular physics, probability theory, and financial engineering, adopting a unified notation system is impractical. Each field has its conventions carved in stone, leading to inevitable variations in notation. Notation is designed for consistency within and, where possible, across sections. However, readers are encouraged to remain vigilant to maintain coherence in their understanding.

2 Fluid Flows

2.1 Euler and Navier–Stokes Equations

Hydrodynamics studies how fluids (liquids and gases) move, primarily relying on fluid motion’s fundamental equations: the Euler and Navier–Stokes equations, with the Euler equations applicable to inviscid (frictionless) flow and the Navier–Stokes equations describing viscous fluids. Hydrodynamics has numerous applications across various fields, including engineering, astrophysics, oceanography, and climate change, among many others.

Recall that the Euler system of partial differential equations (PDEs) describing the motion of an inviscid, incompressible fluid has the form

\begin{matrix} \frac{\partial V}{\partial t} + (V \cdot \nabla) V + \nabla (\frac{P}{ρ}) & = 0, \\ \nabla \cdot V & = 0; \end{matrix}

(2.1)

where $t$ is time, $x$ is the position, $V (t, x)$ is the velocity vector, $P (t, x)$ is the pressure, $ρ$ is the constant density, $\nabla$ is the gradient, and $\cdot$ denotes the scalar product; see, for example, Reference ChandrasekharChandrasekhar (1961). In Cartesian coordinates, the equations in (2.1) can be written as follows:

\begin{matrix} \frac{\partial V_{i}}{\partial t} + V_{j} \frac{\partial V_{i}}{\partial x_{j}} + \frac{\partial}{\partial x_{i}} (\frac{P}{ρ}) & = 0, \\ \frac{\partial V_{i}}{\partial x_{i}} & = 0. \end{matrix}

(2.2)

Here and in what follows, Einstein’s summation convention over repeated indices is used.

The motion of the incompressible viscous fluid is described by the classical Navier–Stokes equations of the form:

\begin{matrix} \frac{\partial V}{\partial t} + (V \cdot \nabla) V - ν Δ V + \nabla (\frac{P}{ρ}) = 0, \\ \nabla \cdot V = 0; \end{matrix}

(2.3)

where $ν$ is the kinematic viscosity; see, for example, Reference ChandrasekharChandrasekhar (1961). Explicitly,

\begin{matrix} \frac{\partial V_{i}}{\partial t} + V_{j} \frac{\partial V_{i}}{\partial x_{j}} - ν \frac{\partial^{2} V_{i}}{\partial x_{j} \partial x_{j}} + \frac{\partial}{\partial x_{i}} (\frac{P}{ρ}) = 0, \\ \frac{\partial V_{i}}{\partial x_{i}} = 0. \end{matrix}

(2.4)

The diffusive term $- ν Δ V$ in (2.4) describes frictions ignored in (2.3). Due to their greater generality, the Navier–Stokes equations are fundamental to understanding important phenomena, such as the transition from laminar to turbulent flow.

2.2 Linear Flows

This section studies exact solutions of the Euler and Navier–Stokes equations known as linear flows. These solutions are valuable for several reasons: (a) exact solutions provide precise, analytical descriptions of fluid flow patterns under specific conditions; (b) they serve as benchmarks for understanding fundamental hydrodynamics phenomena like wave propagation; (c) they provide a bridge which is crucial for more complex studies by simplifying the inherently complex and nonlinear nature of hydrodynamics, and making it possible to understand the behavior of more general fluid flows. Linear solutions of the Euler and Navier–Stokes equations help to study fluid flow stability. This understanding is crucial in predicting and controlling flow behavior in various engineering applications, from aerospace to hydraulic engineering. By starting with linear solutions, one can incrementally introduce nonlinear effects, allowing for a systematic study of nonlinear phenomena in hydrodynamics. This approach can uncover the mechanisms behind complex flows, including turbulence and chaotic flow behaviors. Exact linear solutions of the Euler equations provide a clear, analytical framework for exploring the behavior of fluids and validating more complicated models.

It is easy to show that the equations in (2.1) have a family of solutions $(V (t, x), P (t, x)),$ linearly depending on spatial coordinates:

\begin{matrix} V (t, x) = L (t) x, \frac{P (t, x)}{ρ} = \frac{P_{0}}{ρ} + \frac{1}{2} M (t) x \cdot x, \end{matrix}

(2.5)

where the $3 \times 3$ matrices $L (t),$ $M (t),$ are such that

\begin{matrix} \frac{d L (t)}{d t} + L^{2} (t) + M (t) & = 0, \\ T r (L (t)) = 0, M (t) & = M^{*} (t) . \end{matrix}

(2.6)

It is clear that linear flows, given by (2.5), are unaffected by viscosity, hence they satisfy (2.14).

Flows (2.5) have stagnation points at the origin. Typical examples are planar flows of the form

\begin{matrix} V_{1} & = \frac{1}{2} (s x_{1} - w x_{2}), V_{2} = \frac{1}{2} (w x_{1} - s x_{2}), V_{3} = 0, \\ \frac{P}{ρ} & = \frac{P_{0}}{ρ} + \frac{1}{4} (w^{2} - s^{2}) (x_{1}^{2} + x_{2}^{2}) . \end{matrix}

(2.7)

These flows are elliptic when $s < w,$ and hyperbolic otherwise; see, for example, Reference Friedlander and Lipton-LifschitzFriedlander and Lipton-Lifschitz (2003).

2.3 Kelvin Waves in an Incompressible Fluid

The study of small perturbations of exact solutions of the Euler and Navier–Stokes equations is the core of the stability analysis in fluid dynamics. Examining their behavior is essential for predicting how fluid flows evolve under slight disturbances. One can determine whether a particular flow is stable or unstable by introducing small perturbations to an exact solution and observing the system’s response. If these perturbations grow over time, the flow is considered unstable; if they decay or remain bounded, the flow is stable. One of this analysis’s most critical applications is understanding the transition from laminar (smooth and orderly) to turbulent (chaotic and unpredictable) flows. Small perturbations can exhibit exponential growth, leading to the onset of turbulence. For more detailed investigations, direct numerical simulations of the perturbed Navier–Stokes equations can be used to study the nonlinear evolution of perturbations. This approach can capture the complete transition from initial instability to fully developed turbulence, offering insights into the complex interactions that drive flow dynamics. The study of perturbations offers theoretical insights into the fundamental nature of fluid dynamics, including the mechanisms of flow instability, transition, and turbulence structure. It helps in developing reduced-order models and theories that explain complex fluid phenomena. Here, Kelvin waves are used as the primary tool for studying small perturbations of linear flows. In the rest of this Element, Kelvin waves are used for other purposes. This section is dedicated to their brief description.

It is necessary to study the behavior of perturbations of solutions given by (2.5), which are denoted by $(v (t, x), p (t, x)) .$ By neglecting the quadratic term ( $v \cdot \nabla) v,$ one can write the system of PDEs for $(v, p)$ as follows:

\begin{matrix} \frac{\partial v}{\partial t} + (L (t) x \cdot \nabla) v + L (t) v + \nabla (\frac{p}{ρ}) & = 0, \\ \nabla \cdot v & = 0. \end{matrix}

(2.8)

It has been known for a long time that linear PDEs (2.8) have wavelike solutions of the form:

\begin{matrix} (v (t, x), \frac{p (t, x)}{ρ}) = (a (t), a (t)) exp (i β (t) \cdot (x - r (t))), \end{matrix}

(2.9)

where $(a (t), a (t))$ are time-dependent amplitudes, and $β (t)$ is the time-dependent wave vector; see Reference KelvinKelvin (1887), Reference OrrOrr (1907), Reference Craik and CriminaleCraik and Criminale (1986), and Reference Friedlander and Lipton-LifschitzFriedlander and Lipton-Lifschitz (2003). In this Element, these solutions are called the Kelvin waves. It should be emphasized that the so-called affine ansatz is a special instance of Kelvin wave. This observation allows one to discover similarities among seemingly unrelated topics, which, in turn, facilitates their holistic and comprehensive study. An excerpt from Kelvin’s original paper is shown in Figure 2.

Figure 2 An excerpt from Kelvin’s original paper, where Kelvin waves are introduced for the first time; see Reference KelvinKelvin (1887). Public domain.

As one can see from Figure 2, Kelvin considered the special case of the so-called shear linear flow of the form

\begin{matrix} V (t, x) = (V_{1} (x_{2}), 0, 0) = (l_{12} x_{2}, 0, 0), \end{matrix}

(2.10)

between two plates, $x_{2} = 0$ and $x_{2} = L,$ the first one at rest and the second one moving in parallel.

The triplet $r (t),$ $β (t),$ $a (t)$ satisfies the following system of ODEs:

\begin{matrix} \frac{d r (t)}{d t} - L (t) r (t) = 0, r (0) & = r_{0}, \\ \frac{d β (t)}{d t} + L^{*} (t) β (t) = 0, β (0) & = β_{0}, \\ \frac{d a (t)}{d t} + L (t) a (t) - 2 \frac{L (t) a (t) \cdot β (t)}{β (t) \cdot β (t)} β (t) = 0, a (0) & = a_{0}, \\ β_{0} \cdot a_{0} & = 0. \end{matrix}

(2.11)

Here and in what follows, the superscript $*$ stands for transpose. The corresponding $p (t)$ can be found via the incompressibility condition. It is easy to show that for $t \geq 0,$

\begin{matrix} β (t) \cdot r (t) = β_{0} \cdot r_{0}, β (t) \cdot a (t) = 0. \end{matrix}

(2.12)

Thus, the Kelvin-wave formalism results in ingenious separation of variables and allows us to solve a system of ODEs (2.11), rather than PDEs (2.8).

Typically, the equations in (2.11) are used to study the stability of the linear flow. Such a flow is unstable whenever $(a (t)) \to \infty$ for some choices of $β_{0}, a_{0}$ ; see Reference BaylyBayly (1986), Reference LifschitzLifschitz (1995), and Reference Bayly, Holm and LifschitzBayly et al. (1996). Moreover, it can be shown that the same instabilities occur in general three-dimensional flows, because locally they are equivalent to linear flows; see Reference Lifschitz and HameiriLifschitz and Hameiri (1991a), Reference Friedlander and VishikFriedlander and Vishik (1991), Reference Lifschitz and HameiriLifschitz and Hameiri (1991b), and Reference Friedlander and Lipton-LifschitzFriedlander and Lipton-Lifschitz (2003).

Interestingly, Reference ChandrasekharChandrasekhar (1961) pointed out that the superposition of the linear flow (2.5) and the Kelvin wave (2.9), namely,

\begin{matrix} \tilde{V} (t, x) & = L (t) x + v (t, x), \\ \frac{\tilde{P} (t, x)}{ρ} & = \frac{1}{2} M (t) x \cdot x + \frac{p (t, x)}{ρ}, \end{matrix}

(2.13)

satisfies the nonlinear Euler equations (2.1) since the nonlinear term $(v \cdot \nabla) v$ vanishes identically due to incompressibility.Footnote ³ Studying secondary instabilities of flows with elliptic streamlines, that is, instabilities of Kelvin waves is an important and intricate topic; see Reference Fabijonas, Holm and LifschitzFabijonas et al. (1997).

Viscosity does affect small perturbations of linear flows. For viscous incompressible fluids, Kelvin waves are governed by the following equations:

\begin{matrix} \frac{\partial v}{\partial t} + (L (t) x \cdot \nabla) v + L (t) v - ν Δ v + \nabla (\frac{p}{ρ}) & = 0, \\ \nabla \cdot v & = 0. \end{matrix}

(2.14)

The viscous version of (2.11) has the following form; see Reference LifschitzLifschitz (1991):

\begin{matrix} \frac{d r (t)}{d t} - L (t) r (t) = 0, r (0) = r_{0}, \\ \frac{d β (t)}{d t} + L^{*} (t) β (t) = 0, β (0) = β_{0}, \\ \frac{d a (t)}{d t} + L (t) a (t) - 2 \frac{L (t) a (t) \cdot β (t)}{β (t) \cdot β (t)} β (t) + ν {(β (t))}^{2} a (t) = 0, a (0) = a_{0}, \\ β_{0} \cdot a_{0} = 0. \end{matrix}

(2.15)

It is shown in Section 6.5 that in the two-dimensional case, the Navier–Stokes equations for small perturbations of linear flows are more or less identical to the Fokker–Planck equations for harmonically bound articles, which is surprising.

The evolution of a typical Kelvin wave parameters triplet $r (t),$ $β (t),$ $a (t)$ is illustrated in Figure 3. The impact of viscosity is illustrated in Figure 4. These figures show that depending on the initial orientation of the wave vector $β (t),$ the amplitude $a (t)$ can be either bounded or unbounded. For elliptic flows, unbounded amplitudes are always present for specific orientations, so all of them are unstable; see Reference BaylyBayly (1986), Reference Bayly, Holm and LifschitzBayly et al. (1996), Reference Friedlander and Lipton-LifschitzFriedlander and Lipton-Lifschitz (2003), and references therein.

Figure 3 Kelvin waves corresponding to two different orientations of the initial wave vector $β (0)$ and $a (0) .$ (a), (b) $β (0) = (sin (π / 4), 0, cos (π / 4)),$ $a (0) = (0, sin (π / 4), 0)$ ; (c), (d) $β (0) = (sin (π / 3), 0, cos (π / 3)),$ $a (0) = (0, sin (π / 3), 0) .$ Other parameters are as follows: $T = 100,$ $ω = 1,$ $s = 0.5 .$ In the first case, $a (t)$ stays bounded, while $a (t)$ explodes in the second case. This explosion means that the underlying elliptic flow is unstable. Author’s graphics.

Figure 4 Kelvin waves in the viscous fluid with viscosity $ν = 0.07 .$ Other parameters and initial conditions are the same as in Figure 3. Viscosity dampens the instability but, generally, does not suppress it entirely. Author’s graphics.

3 Kolmogorov Stochastic Process

3.1 Background

The Kolmogorov equation studies the evolution of a particle in the phase space. The particle’s position and velocity evolve in time due to the interplay between the deterministic drift and stochastic force affecting only its velocity. Since only the particle’s velocity is affected by the random force, the PDE describing the evolution of the t.p.d.f. in the phase space is degenerate. The Kolmogorov equation is a particular case of the Klein–Kramers equation studied in Section 4.

The significance of the Kolmogorov equation lies in its ability to model the intricate balance between deterministic behavior and stochastic dynamics, providing a basic framework for studying the evolution of systems in phase space. It has important applications in various fields, including physics for understanding particle dynamics, finance for modeling asset prices, and beyond. It demonstrates the profound interplay between stochastic processes and differential equations.

The Kolmogorov equation is hypoelliptic; as such, it serves as a prototype for a broad class of hypoelliptic PDEs. Although it does not meet the exact criteria for ellipticity (due to the second-order derivatives not being present in all directions of the phase space), the solutions to the equation are still smooth, which is particularly important in the context of stochastic processes, where hypoellipticity ensures that the probability density function remains smooth and well-behaved, facilitating the analysis of the system’s dynamics over time.

3.2 Summary of Kolmogorov’s Paper

In a remarkable (and remarkably concise) note, Kolmogorov considers a system of particles in $n$ -dimensional space with coordinates $q_{1}, \dots, q_{n},$ and velocities ${\dot{q}}_{1}, \dots, {\dot{q}}_{n},$ assuming the probability density function

g (t, q_{1}, \dots, q_{n}, {\dot{q}}_{1}, \dots, {\dot{q}}_{n}, t^{'}, q_{1}^{'}, \dots, q_{n}^{'}, {\dot{q}}_{1}^{'}, \dots, {\dot{q}}_{n}^{'})

exists for some time $t^{'} > t,$ and reveals (without any explanation) an analytical expression for $g$ in the one-dimensional case; see Reference KolmogoroffKolmogoroff (1934).Footnote ⁴ This note is the third in a series of papers, the previous two being Reference KolmogoroffKolmogoroff (1931), Reference Kolmogoroff(1933).

Kolmogorov makes the following natural assumptions:

\begin{matrix} E (Δ q_{i} - {\dot{q}}_{i} Δ t) & = o (Δ t), \end{matrix}

(3.1)

\begin{matrix} E {(Δ q_{i})}^{2} & = o (Δ t), \end{matrix}

(3.2)

where $Δ t = t^{'} - t .$ Equations (3.1) and (3.2) imply

\begin{matrix} E (Δ q_{i}) = {\dot{q}}_{i} Δ t + o (Δ t), \end{matrix}

(3.3)

\begin{matrix} E (Δ q_{i} Δ q_{j}) \leq \sqrt{E {(Δ q_{i})}^{2} E {(Δ q_{j})}^{2}} = o (Δ t) . \end{matrix}

(3.4)

Furthermore, under very general assumptions, the following relationships hold:

\begin{matrix} E (Δ {\dot{q}}_{i}) & = f_{i} (t, q, \dot{q}) Δ t + o (Δ t), \end{matrix}

(3.5)

\begin{matrix} E {(Δ {\dot{q}}_{i})}^{2} & = k_{i i} (t, q, \dot{q}) Δ t + o (Δ t), \end{matrix}

(3.6)

\begin{matrix} E (Δ {\dot{q}}_{i} Δ {\dot{q}}_{j}) & = k_{i j} (t, q, \dot{q}) Δ t + o (Δ t), \end{matrix}

(3.7)

where $f$ and $k$ are continuous functions. Equations (3.2), and (3.6) imply

\begin{matrix} E (Δ {\dot{q}}_{i} Δ {\dot{q}}_{j}) \leq \sqrt{E {(Δ {\dot{q}}_{i})}^{2} E {(Δ {\dot{q}}_{j})}^{2}} = o (Δ t) . \end{matrix}

(3.8)

Under some natural physical assumptions, it follows that $g$ satisfies the following differential equation of the Fokker–Planck type:

\begin{matrix} \frac{\partial g}{\partial t^{'}} = - \sum {\dot{q}}_{i}^{'} \frac{\partial g}{\partial q_{i}^{'}} - \sum \frac{\partial}{\partial {\dot{q}}_{i}^{'}} (f_{i} (t, q, \dot{q}) g) + \sum \sum \frac{\partial^{2}}{\partial {\dot{q}}_{i}^{'} \partial {\dot{q}}_{j}^{'}} (k (t, q, \dot{q}) g) . \end{matrix}

(3.9)

In the one-dimensional case, one has

\begin{matrix} \frac{\partial g}{\partial t^{'}} = - {\dot{q}}^{'} \frac{\partial g}{\partial q^{'}} - \frac{\partial}{\partial {\dot{q}}^{'}} (f (t, q, \dot{q}) g) + \frac{\partial^{2}}{\partial {\dot{q}}^{' 2}} (k (t, q, \dot{q}) g) . \end{matrix}

(3.10)

These equations are known as ultra-parabolic Fokker–Plank–Kolmogorov equations due to their degeneracy.

When $f$ and $k$ are constants, (3.10) becomes

\begin{matrix} \frac{\partial g}{\partial t^{'}} = - {\dot{q}}^{'} \frac{\partial g}{\partial q^{'}} - f \frac{\partial g}{\partial {\dot{q}}^{'}} + k \frac{\partial^{2} g}{\partial {\dot{q}}^{' 2}} . \end{matrix}

(3.11)

The corresponding fundamental solution of has the following form:

\begin{matrix} g = \frac{2 \sqrt{3}}{π k^{2} {(t^{'} - t)}^{2}} exp (- \frac{{({\dot{q}}^{'} - \dot{q} - f (t^{'} - t))}^{2}}{4 k (t^{'} - t)} - \frac{3 {(q^{'} - q - \frac{{\dot{q}}^{'} + \dot{q}}{2} (t^{'} - t))}^{2}}{k^{3} {(t^{'} - t)}^{3}}) . \end{matrix}

(3.12)

One can see that $Δ \dot{q}$ is of the order ${(Δ t)}^{1 / 2} .$ At the same time

\begin{matrix} Δ q = \dot{q} Δ t + O {(Δ t)}^{3 / 2} . \end{matrix}

(3.13)

One can prove that a similar relation holds for the general (3.9).

Kolmogorov’s original paper is shown in Figure 5.

Reproduced by kind permission of the Editors of Annals of Mathematics.

Figure 5 Kolmogorov’s original paper, presented here for the inquisitive reader to enjoy; see Reference KolmogoroffKolmogoroff (1934).

Kolmogorov equations fascinated mathematicians for a long time and generated a great deal of research; see, for example, Reference WeberWeber (1951), Reference HörmanderHörmander (1967), Reference KuptsovKuptsov (1972), Reference Lanconelli, Pascucci, Polidoro, Sh, Birman, Solonnikov and UraltsevaLanconelli et al. (2002), Reference Pascucci and Parabolic ProblemsPascucci (2005), Reference Ivasishen and MedynskyIvasishen and Medynsky (2010), and Reference Duong and TranDuong & Tran (2018), among others.

It is worth mentioning that physicists derived Equations (3.9) and (3.10) at least a decade earlier than Kolmogorov; see Section 4.

3.3 Challenge and Response

Despite its undoubted brilliance, Kolmogorov’s original paper has several issues.

First, Equations (3.9) and (3.10) are not the Fokker–Planck equations associated with the process described by Equations (3.5)–(3.7), since they lack the prefactor $1 / 2$ in front of the diffusion terms. The corrected multivariate equation has the form

\begin{matrix} \frac{\partial g}{\partial t^{'}} = & - \sum {\dot{q}}_{i}^{'} \frac{\partial}{\partial q_{i}^{'}} g - \sum \frac{\partial}{\partial {\dot{q}}_{i}^{'}} (f_{i} (t, q, \dot{q}) g) \\ + \frac{1}{2} \sum \sum \frac{\partial^{2}}{\partial {\dot{q}}_{i}^{'} \partial {\dot{q}}_{j}^{'}} ((k (t, q, \dot{q}) g)), \end{matrix}

(3.14)

while the corresponding one-dimensional equation has the form

\begin{matrix} \frac{\partial g}{\partial t^{'}} = - {\dot{q}}^{'} \frac{\partial}{\partial q^{'}} g - \frac{\partial}{\partial {\dot{q}}^{'}} (f (t, q, \dot{q}) g) + \frac{1}{2} \frac{\partial^{2}}{\partial {\dot{q}}^{' 2}} ((k (t, q, \dot{q}) g)) . \end{matrix}

(3.15)

Alternatively, Equations (3.6) and (3.7) can be altered as follows:

\begin{matrix} E {(Δ {\dot{q}}_{i})}^{2} & = 2 k_{i i} (t, q, \dot{q}) Δ t + o (Δ t), \end{matrix}

(3.16)

\begin{matrix} E (Δ {\dot{q}}_{i} Δ {\dot{q}}_{j}) & = 2 k_{i j} (t, q, \dot{q}) Δ t + o (Δ t) . \end{matrix}

(3.17)

In the following discussion, the Fokker–Planck equation is updated.

Second, $g$ given by (3.12) does not solve (3.10). It also does not satisfy the (implicit) initial condition

\begin{matrix} g (t, q, \dot{q}, t, q^{'}, {\dot{q}}^{'}) = δ (q^{'} - q) δ ({\dot{q}}^{'} - \dot{q}), \end{matrix}

(3.18)

where $δ (.)$ is the Dirac $δ$ -function. The fact that expression (3.12) does not solve (3.10) can be verified by substitution. However, it is easier to verify this statement via dimensional analysis. The dimensions of the corresponding variables and coefficients are as follows:

\begin{matrix} (t) = (t^{'}) = T, (q) = (q^{'}) = L, (\dot{q}) = ({\dot{q}}^{'}) = \frac{L}{T}, (g) = \frac{T}{L^{2}}, (f) = \frac{L}{T^{2}}, \\ (k) = \frac{L^{2}}{T^{3}} . \end{matrix}

(3.19)

It is easy to show that $g$ is scale-invariant, so that

\begin{matrix} g (λ^{2} t, λ^{3} q, λ \dot{q}, λ^{2} t^{'}, λ^{3} q^{'}, λ {\dot{q}}^{'}; λ^{- 1} f, k) = λ^{- 4} g (t, q, \dot{q}, t^{'}, q^{'}, {\dot{q}}^{'}; f, k) . \end{matrix}

(3.20)

The original Kolmogorov formula contains two typos, making it dimensionally incorrect since the term

\begin{matrix} \frac{3 {(q^{'} - q - \frac{{\dot{q}}^{'} + \dot{q}}{2} (t^{'} - t))}^{2}}{k^{3} {(t^{'} - t)}^{3}} \end{matrix}

in the exponent is not nondimensional, as it should be, and has dimension $T^{6} L^{- 1},$ while the prefactor

\begin{matrix} \frac{2 \sqrt{3}}{π k^{2} {(t^{'} - t)}^{2}} \end{matrix}

has dimension $T^{4} L^{- 1},$ instead of the right dimension, $T L^{- 2} .$

Third, due to yet another typo, the solution given by (3.12) does not converge to the initial condition in the limit $t^{'} \to t .$ Indeed, asymptotically, one has

\begin{matrix} g \sim H (\frac{k^{3} {(t^{'} - t)}^{3}}{6}, q^{'} - q) H (2 k (t^{'} - t), {\dot{q}}^{'} - \dot{q}) \to 4 δ (q^{'} - q) δ ({\dot{q}}^{'} - \dot{q}), \end{matrix}

(3.21)

where $H (μ, ν)$ is the standard heat kernel:

\begin{matrix} H (μ, ν) = \frac{exp (- \frac{ν^{2}}{2 μ})}{\sqrt{2 π μ}} . \end{matrix}

(3.22)

However, not all is lost. Dimensional analysis shows that the correct solution $g (t, q, \dot{q}, t^{'}, q^{'}, {\dot{q}}^{'}; f, k)$ of (3.10) has the following form:

\begin{matrix} g = \frac{\sqrt{3}}{2 π k {(t^{'} - t)}^{2}} exp (- \frac{{({\dot{q}}^{'} - \dot{q} - f (t^{'} - t))}^{2}}{4 k (t^{'} - t)} - \frac{3 {(q^{'} - q - \frac{{\dot{q}}^{'} + \dot{q}}{2} (t^{'} - t))}^{2}}{k {(t^{'} - t)}^{3}}), \end{matrix}

(3.23)

which is not far from Kolmogorov’s formula. Similarly, the correct solution of (3.15) has the following form:

\begin{matrix} g = \frac{\sqrt{3}}{π k {(t^{'} - t)}^{2}} exp (- \frac{{({\dot{q}}^{'} - \dot{q} - f (t^{'} - t))}^{2}}{2 k (t^{'} - t)} - \frac{6 {(q^{'} - q - \frac{{\dot{q}}^{'} + \dot{q}}{2} (t^{'} - t))}^{2}}{k {(t^{'} - t)}^{3}}) . \end{matrix}

(3.24)

3.4 Direct Verification

In order to avoid confusion, from now on, the notation is changed to make the formulas easier to read. Specifically, it is assumed that $\overset{ˉ}{x}$ represents the position of a particle at time $\overset{ˉ}{t}$ and $x$ its position at time $t,$ while $\overset{ˉ}{y}$ represents its velocity at time $\overset{ˉ}{t},$ and $y$ its velocity at time $t,$ so that

\begin{matrix} (t, q, \dot{q}) \to (t, x, y), (t^{'}, q^{'}, {\dot{q}}^{'}) \to (\overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) . \end{matrix}

(3.25)

One of our objectives is deriving the (corrected) Kolmogorov formula from first principles using Kelvin waves. Subsequently, it is shown how to use it in the financial mathematics context. The governing SDE can be written as

\begin{matrix} d {\hat{x}}_{t} = {\hat{y}}_{t} d t, {\hat{x}}_{t} = x, \\ d {\hat{y}}_{t} = b d t + σ d {\hat{W}}_{t}, {\hat{y}}_{t} = y . \end{matrix}

(3.26)

The corresponding Fokker–Planck–Kolmogorov problem for the t.p.d.f. $ϖ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y})$ has the form:

\begin{matrix} ϖ_{\overset{ˉ}{t}} & (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) - \frac{1}{2} σ^{2} ϖ_{\overset{ˉ}{y} \overset{ˉ}{y}} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) \\ + \overset{ˉ}{y} ϖ_{\overset{ˉ}{x}} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) + b ϖ_{\overset{ˉ}{y}} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) = 0, \\ ϖ (t, x, y, t, \overset{ˉ}{x}, \overset{ˉ}{y}) = δ (\overset{ˉ}{x} - x) δ (\overset{ˉ}{y} - y) . \end{matrix}

(3.27)

The solution of (3.27) is as follows:

\begin{matrix} ϖ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) = \frac{\sqrt{3}}{π σ^{2} T^{2}} exp (- Φ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y})), \end{matrix}

(3.28)

where

\begin{matrix} Φ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) = \frac{{(\overset{ˉ}{y} - y - b T)}^{2}}{2 σ^{2} T} + \frac{6 {(\overset{ˉ}{x} - x - \frac{(\overset{ˉ}{y} + y) T}{2})}^{2}}{σ^{2} T^{3}} = \frac{A^{2}}{2} + 6 B^{2}, \end{matrix}

(3.29)

and

\begin{matrix} A = \frac{(\overset{ˉ}{y} - y - b T)}{\sqrt{σ^{2} T}}, (A) = 1, B = \frac{(\overset{ˉ}{x} - x - \frac{(\overset{ˉ}{y} + y) T}{2})}{\sqrt{σ^{2} T^{3}}}, (B) = 1. \end{matrix}

(3.30)

Here and in what follows, the following shorthand notation is used:

\begin{matrix} T = \overset{ˉ}{t} - t . \end{matrix}

(3.31)

Let us check that $ϖ$ satisfies the Fokker–Planck equation and the initial conditions. A simple calculation yields:

\begin{matrix} Φ_{\overset{ˉ}{t}} & = - (\frac{A^{2}}{2 T} + \frac{b A}{\sqrt{σ^{2} T}} + \frac{18 B^{2}}{T} + \frac{6 (\overset{ˉ}{y} + y) B}{\sqrt{σ^{2} T^{3}}}), \\ Φ_{\overset{ˉ}{x}} & = \frac{12 B}{\sqrt{σ^{2} T^{3}}}, Φ_{\overset{ˉ}{y}} = \frac{A - 6 B}{\sqrt{σ^{2} T}}, Φ_{\overset{ˉ}{y} \overset{ˉ}{y}} = \frac{4}{σ^{2} T}, \end{matrix}

(3.32)

\begin{matrix} \frac{ϖ_{\overset{ˉ}{t}}}{ϖ} = - \frac{2}{T} - Φ_{\overset{ˉ}{t}}, \frac{ϖ_{\overset{ˉ}{x}}}{ϖ} = - Φ_{\overset{ˉ}{x}}, \frac{ϖ_{\overset{ˉ}{y}}}{ϖ} = - Φ_{\overset{ˉ}{y}}, \frac{ϖ_{\overset{ˉ}{y} \overset{ˉ}{y}}}{ϖ} = - Φ_{\overset{ˉ}{y} \overset{ˉ}{y}} + Φ_{\overset{ˉ}{x}}^{2}, \end{matrix}

(3.33)

so that

\begin{matrix} ϖ_{\overset{ˉ}{t}}^{K} - & \frac{1}{2} σ^{2} ϖ_{\overset{ˉ}{y} \overset{ˉ}{y}}^{K} + \overset{ˉ}{y} ϖ_{\overset{ˉ}{x}}^{K} + b ϖ_{\overset{ˉ}{y}}^{K} \\ = & ϖ^{K} (- \frac{2}{T} - Φ_{\overset{ˉ}{t}} + \frac{1}{2} σ^{2} (Φ_{\overset{ˉ}{y} \overset{ˉ}{y}} - Φ_{\overset{ˉ}{y}}^{2}) - \overset{ˉ}{y} Φ_{\overset{ˉ}{x}} - b Φ_{\overset{ˉ}{y}}) \\ = & ϖ^{K} (- \frac{2}{T} + \frac{A^{2}}{2 T} + \frac{b A}{\sqrt{σ^{2} T}} + \frac{18 B^{2}}{T} + \frac{6 (\overset{ˉ}{y} + y) B}{\sqrt{σ^{2} T^{3}}}) \\ (+ \frac{2}{T} - \frac{{(A - 6 B)}^{2}}{2 T} - \frac{12 \overset{ˉ}{y} B}{\sqrt{σ^{2} T^{3}}} - \frac{b (A - 6 B)}{\sqrt{σ^{2} T}}) = 0. \end{matrix}

(3.34)

When $T \to 0,$ one has the following asymptotic expression:

\begin{matrix} ϖ^{K} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) \sim H (\frac{σ^{2} T^{3}}{12}, \overset{ˉ}{x} - x) H (σ^{2} T, \overset{ˉ}{y} - y) \to δ (\overset{ˉ}{x} - x) δ (\overset{ˉ}{y} - y) . \end{matrix}

(3.35)

3.5 Solution via Kelvin Waves

Now, Kolmogorov’s formula is derived by using Kelvin waves (or an affine ansatz), which requires solving the problem of the following form:

\begin{matrix} K_{\overset{ˉ}{t}} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}, k, l) - \frac{1}{2} σ^{2} K_{\overset{ˉ}{y} \overset{ˉ}{y}} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}, k, l) \\ + & \overset{ˉ}{y} K_{\overset{ˉ}{x}} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}, k, l) + b K_{\overset{ˉ}{y}} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}, k, l) = 0, \\ K (t, \overset{ˉ}{x}, \overset{ˉ}{y}, t, x, y, k, l) = exp (i k (\overset{ˉ}{x} - x) + i l (\overset{ˉ}{y} - y)) . \end{matrix}

(3.36)

Here

\begin{matrix} (k) = \frac{1}{L}, (l) = \frac{T}{L}, (K) = 1. \end{matrix}

(3.37)

By using the well-known results concerning the inverse Fourier transform of the $δ$ -function, one gets the following expression for the t.p.d.f. $ϖ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y})$ :

\begin{matrix} ϖ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) = \frac{1}{{(2 π)}^{2}} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} K (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}, k, l) d k d l . \end{matrix}

(3.38)

To calculate $K,$ one can use the affine ansatz and represent it in the following form:

\begin{matrix} K (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}, k, l) = exp (Ψ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}, k, l)), \end{matrix}

(3.39)

where

\begin{matrix} Ψ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}, k, l) = α (t, \overset{ˉ}{t}) + i k (\overset{ˉ}{x} - x) + i γ (t, \overset{ˉ}{t}) \overset{ˉ}{y} - i l y . \end{matrix}

(3.40)

and

\begin{matrix} \frac{K_{\overset{ˉ}{t}}}{K} & = Ψ_{\overset{ˉ}{t}} = (α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + i γ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) \overset{ˉ}{y}), \frac{K_{\overset{ˉ}{x}}}{K} = Ψ_{\overset{ˉ}{x}} = i k, \\ \frac{K_{\overset{ˉ}{y}}}{K} & = Ψ_{\overset{ˉ}{y}} = i γ (t, \overset{ˉ}{t}), \frac{K_{\overset{ˉ}{y} \overset{ˉ}{y}}}{K} = Ψ_{\overset{ˉ}{y}}^{2} = - γ^{2} (t, \overset{ˉ}{t}) . \end{matrix}

(3.41)

Accordingly,

\begin{matrix} α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + \frac{1}{2} σ^{2} γ^{2} (t, \overset{ˉ}{t}) + i γ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) \overset{ˉ}{y} + i k \overset{ˉ}{y} + i b γ (t, \overset{ˉ}{t}) & = 0, \\ α (t, t) = 0, γ (t, t) & = l, \end{matrix}

(3.42)

so that

\begin{matrix} α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + \frac{1}{2} σ^{2} γ^{2} (t, \overset{ˉ}{t}) + i b γ (t, \overset{ˉ}{t}) = 0, α (t, t) = 0, \\ γ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + k = 0, γ (t, t) = l . \end{matrix}

(3.43)

Straightforward calculation shows that:

\begin{matrix} γ (t, \overset{ˉ}{t}) & = - k T + l, \\ α (t, \overset{ˉ}{t}) & = - \frac{1}{2} σ^{2} (\frac{k^{2} T^{3}}{3} - k l T^{2} + l^{2} T) - i b (- \frac{k T^{2}}{2} + l T) . \end{matrix}

(3.44)

Equations (3.38), (3.39), (3.40) and (3.44) yield

\begin{matrix} ϖ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) \\ = \frac{1}{{(2 π)}^{2}} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} exp (- \frac{1}{2} σ^{2} (\frac{k^{2} T^{3}}{3} - k l T^{2} + l^{2} T)) \\ (+ i k (\overset{ˉ}{x} - x - \overset{ˉ}{y} T + \frac{b T^{2}}{2}) + i l (\overset{ˉ}{y} - y - b T)) d k d l . \end{matrix}

(3.45)

It is clear that $ϖ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y})$ can be viewed as the characteristic function of the Gaussian density in the $(k, l)$ space, evaluated at the point $(\overset{ˉ}{x} - x - \overset{ˉ}{y} T)$ $(+ \frac{b T^{2}}{2}, \overset{ˉ}{y} - y - b T)$ :

\begin{matrix} ϖ & (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) \\ = \frac{{(det (C))}^{1 / 2}}{2 π} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} G (T, k, l) \\ \times exp (i k (\overset{ˉ}{x} - x - \overset{ˉ}{y} T + \frac{b T^{2}}{2}) + i l (\overset{ˉ}{y} - y - b T)) d k d l, \end{matrix}

(3.46)

where

\begin{matrix} G (T, k, l) = \frac{1}{2 π {(det (C))}^{1 / 2}} exp (- \frac{1}{2} (\begin{matrix} k \\ l \end{matrix}) \cdot C^{- 1} (T) (\begin{matrix} k \\ l \end{matrix})), \end{matrix}

(3.47)

and

\begin{matrix} C (T) & = (\begin{matrix} \frac{12}{σ^{2} T^{3}} & \frac{6}{σ^{2} T^{2}} \\ \frac{6}{σ^{2} T^{2}} & \frac{4}{σ^{2} T} \end{matrix}), \\ det (C (T)) & = \frac{12}{σ^{4} T^{4}} . \end{matrix}

(3.48)

As before, $\cdot$ denotes the scalar product. Accordingly,

\begin{matrix} ϖ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) = \frac{\sqrt{3}}{π σ^{2} T^{2}} exp (- Ω (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y})), \end{matrix}

(3.49)

where

\begin{matrix} Ω (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) = \\ = & \frac{1}{2} (\begin{matrix} \overset{ˉ}{x} - x - \overset{ˉ}{y} T + \frac{b T^{2}}{2} \\ \overset{ˉ}{y} - y - b T \end{matrix}) \cdot C (\begin{matrix} \overset{ˉ}{x} - x - \overset{ˉ}{y} T + \frac{b T^{2}}{2} \\ \overset{ˉ}{y} - y - b T \end{matrix}) \\ = & \frac{6 {(\overset{ˉ}{x} - x - \overset{ˉ}{y} T + \frac{b T^{2}}{2})}^{2}}{σ^{2} T^{3}} + \frac{6 (\overset{ˉ}{x} - x - \overset{ˉ}{y} T + \frac{b T^{2}}{2}) (\overset{ˉ}{y} - y - b T)}{σ^{2} T^{2}} \\ + \frac{2 {(\overset{ˉ}{y} - y - b T)}^{2}}{σ^{2} T} \\ = & \frac{A^{2}}{2} + 6 B^{2} = Φ (\overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}, x, y), \end{matrix}

(3.50)

as expected. This calculation completes the derivation of the corrected Kolmogorov formula.

Note that the t.p.d.f. $ϖ$ is a bivariate Gaussian distribution. Completing the square, one can write

\begin{matrix} Φ & = \frac{{(\overset{ˉ}{y} - y - b T)}^{2}}{2 σ^{2} T} + \frac{6 {(\overset{ˉ}{x} - x - \frac{(\overset{ˉ}{y} + y) T}{2})}^{2}}{σ^{2} T^{3}} \\ = \frac{6}{σ^{2} T^{3}} {(\overset{ˉ}{x} - p)}^{2} - \frac{6}{σ^{2} T^{2}} (\overset{ˉ}{x} - p) (\overset{ˉ}{y} - q) + \frac{2}{σ^{2} T} {(\overset{ˉ}{y} - q)}^{2}, \end{matrix}

(3.51)

and represent $ϖ$ the form:

\begin{matrix} ϖ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) = \frac{exp (- \frac{1}{2 (1 - ρ^{2})} (\frac{{(\overset{ˉ}{x} - p)}^{2}}{σ_{x}^{2}} - \frac{2 ρ (\overset{ˉ}{x} - p) (\overset{ˉ}{y} - q)}{σ_{x} σ_{y}} + \frac{{(\overset{ˉ}{y} - q)}^{2}}{σ_{y}^{2}}))}{2 π σ_{x} σ_{y} \sqrt{1 - ρ^{2}}}, \end{matrix}

(3.52)

where

\begin{matrix} σ_{x} & = \sqrt{\frac{σ^{2} T^{3}}{3}}, σ_{y} = \sqrt{σ^{2} T}, ρ = \frac{\sqrt{3}}{2}, \\ p & = x + y T + \frac{b T^{2}}{2}, q = y + b T . \end{matrix}

(3.53)

Equation (3.28) can be derived by using the Hankel transform. Since

\begin{matrix} σ^{- 2} C^{- 1} (T) = (\begin{matrix} \frac{T^{3}}{3} & - \frac{T^{2}}{2} \\ - \frac{T^{2}}{2} & T \end{matrix}) = {(\begin{matrix} \frac{T^{3 / 2}}{2} & - \frac{T^{1 / 2}}{2} \\ - \frac{T^{3 / 2}}{2 \sqrt{3}} & \frac{\sqrt{3} T^{1 / 2}}{2} \end{matrix})}^{*} (\begin{matrix} \frac{T^{3 / 2}}{2} & - \frac{T^{1 / 2}}{2} \\ - \frac{T^{3 / 2}}{2 \sqrt{3}} & \frac{\sqrt{3} T^{1 / 2}}{2} \end{matrix}), \end{matrix}

(3.54)

one can introduce

\begin{matrix} (\begin{matrix} \overset{ˉ}{k} \\ \overset{ˉ}{l} \end{matrix}) & = (\begin{matrix} \frac{T^{3 / 2}}{2} & - \frac{T^{1 / 2}}{2} \\ - \frac{T^{3 / 2}}{2 \sqrt{3}} & \frac{\sqrt{3} T^{1 / 2}}{2} \end{matrix}) (\begin{matrix} k \\ l \end{matrix}), \\ (\begin{matrix} k \\ l \end{matrix}) & = (\begin{matrix} 3 T^{- 3 / 2} & \sqrt{3} T^{- 3 / 2} \\ T^{- 1 / 2} & \sqrt{3} T^{- 1 / 2} \end{matrix}) (\begin{matrix} \overset{ˉ}{k} \\ \overset{ˉ}{l} \end{matrix}), \end{matrix}

(3.55)

and rewrite (3.46) as follows:

\begin{matrix} ϖ & (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) = \frac{\sqrt{3}}{2 π^{2} T^{2}} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} exp (- \frac{1}{2} σ^{2} (\overset{ˉ}{k}^{2} + {\overset{ˉ}{l}}^{2})) \\ (+ i (\begin{matrix} \overset{ˉ}{k} \\ \overset{ˉ}{l} \end{matrix}) \cdot (\begin{matrix} 3 T^{- 3 / 2} & T^{- 1 / 2} \\ \sqrt{3} T^{- 3 / 2} & \sqrt{3} T^{- 1 / 2} \end{matrix}) (\begin{matrix} \overset{ˉ}{x} - x - \overset{ˉ}{y} T + \frac{b T^{2}}{2} \\ \overset{ˉ}{y} - y - b T \end{matrix})) d \overset{ˉ}{k} d \overset{ˉ}{l} \\ = \frac{\sqrt{3}}{2 π^{2} T^{2}} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} exp (- \frac{1}{2} σ^{2} (\overset{ˉ}{k}^{2} + {\overset{ˉ}{l}}^{2})) \\ (+ i \overset{ˉ}{k} \frac{3 (\overset{ˉ}{x} - x - \overset{ˉ}{y} T + \frac{b T^{2}}{2}) + (\overset{ˉ}{y} - y - b T) T}{T^{3 / 2}}) \\ (+ i \overset{ˉ}{l} \frac{\sqrt{3} (\overset{ˉ}{x} - x - \overset{ˉ}{y} T + \frac{b T^{2}}{2}) + \sqrt{3} (\overset{ˉ}{y} - y - b T) T}{T^{3 / 2}}) d \overset{ˉ}{k} d \overset{ˉ}{l} . \end{matrix}

(3.56)

Thus, $ϖ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y})$ is the Fourier transform of a radially symmetric function of ${(\overset{ˉ}{k}, \overset{ˉ}{l})}^{*} .$ Accordingly, it can be calculated via the Hankel transform of the function $exp (- σ^{2} {\overset{ˉ}{r}}^{2} / 2) :$

\begin{matrix} ϖ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) = \frac{\sqrt{3}}{π T^{2}} H_{0} (e^{- \frac{σ^{2} {\overset{ˉ}{r}}^{2}}{2}}) (\overset{ˉ}{s}) = \frac{\sqrt{3}}{π σ^{2} T^{2}} e^{- \frac{{\overset{ˉ}{s}}^{2}}{2 σ^{2}}}, \end{matrix}

(3.57)

where

\begin{matrix} {\overset{ˉ}{r}}^{2} & = {\overset{ˉ}{k}}^{2} + {\overset{ˉ}{l}}^{2}, \\ {\overset{ˉ}{s}}^{2} & = \frac{4 (3 {(\overset{ˉ}{x} - x - \overset{ˉ}{y} T + \frac{b T^{2}}{2})}^{2} + 3 (\overset{ˉ}{x} - x - \overset{ˉ}{y} T + \frac{b T^{2}}{2}) (\overset{ˉ}{y} - y - b T) T + {(\overset{ˉ}{y} - y - b T)}^{2} T^{2})}{T^{3}} . \end{matrix}

(3.58)

See, for example, Reference Piessens and PoularikasPiessens (2000). As expected, the corresponding expression coincides with the one given by (3.52).

3.6 Solution via Coordinate Transform

This section briefly considers the method of coordinate transformations, reducing the original Fokker–Planck equation for the Kolmogorov problem to a Fokker–Planck equation with spatially independent coefficients. To this end, the following ansatz is used:

\begin{matrix} (\tilde{x}, \tilde{y}) = (\overset{ˉ}{x} - (\overset{ˉ}{t} - t) \overset{ˉ}{y}, \overset{ˉ}{y}) . \end{matrix}

(3.59)

This choice is explained in more detail in Section 6. Straightforward calculation yields

\begin{matrix} \frac{\partial}{\partial \overset{ˉ}{t}} = \frac{\partial}{\partial \overset{ˉ}{t}} - \tilde{y} \frac{\partial}{\partial \tilde{x}}, \frac{\partial}{\partial \overset{ˉ}{x}} = \frac{\partial}{\partial \tilde{x}}, \frac{\partial}{\partial y} = - (\overset{ˉ}{t} - t) \frac{\partial}{\partial \tilde{x}} + \frac{\partial}{\partial \tilde{y}}, \end{matrix}

(3.60)

so that (3.27) becomes

\begin{matrix} (\frac{\partial}{\partial \overset{ˉ}{t}} - \tilde{y} \frac{\partial}{\partial \tilde{x}}) ϖ (t, x, y, \overset{ˉ}{t}, \tilde{x}, \tilde{y}) - \frac{1}{2} σ^{2} {(- (\overset{ˉ}{t} - t) \frac{\partial}{\partial \tilde{x}} + \frac{\partial}{\partial \tilde{y}})}^{2} ϖ (t, x, y, \overset{ˉ}{t}, \tilde{x}, \tilde{y}) \\ + \tilde{y} ϖ_{\tilde{x}} (t, x, y, \overset{ˉ}{t}, \tilde{x}, \tilde{y}) + b (- (\overset{ˉ}{t} - t) \frac{\partial}{\partial \tilde{x}} + \frac{\partial}{\partial \tilde{y}}) ϖ (t, x, y, \overset{ˉ}{t}, \tilde{x}, \tilde{y}) = 0, \\ ϖ (t, x, y, t, \tilde{x}, \tilde{y}) = δ (\tilde{x} - x) δ (\tilde{y} - y) . \end{matrix}

(3.61)

Further calculations show that coefficients of the preceding equation are spatially independent:

\begin{matrix} \frac{\partial}{\partial \overset{ˉ}{t}} ϖ (t, x, y, \overset{ˉ}{t}, \tilde{x}, \tilde{y}) - \frac{1}{2} σ^{2} {(- (\overset{ˉ}{t} - t) \frac{\partial}{\partial \tilde{x}} + \frac{\partial}{\partial \tilde{y}})}^{2} ϖ (t, x, y, \overset{ˉ}{t}, \tilde{x}, \tilde{y}) \\ + b (- (\overset{ˉ}{t} - t) \frac{\partial}{\partial \tilde{x}} + \frac{\partial}{\partial \tilde{y}}) ϖ (t, x, y, \overset{ˉ}{t}, \tilde{x}, \tilde{y}) = 0, \\ ϖ (t, x, y, t, \tilde{x}, \tilde{y}) = δ (\tilde{x} - x) δ (\tilde{y} - y) . \end{matrix}

(3.62)

Accordingly, one can use the classical Fourier transform and represent the solution of (3.62) in the form

\begin{matrix} ϖ (t, x, y, \overset{ˉ}{t}, \tilde{x}, \tilde{y}) \\ = \frac{1}{{(2 π)}^{2}} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} exp (- \frac{1}{2} σ^{2} (\frac{k^{2} T^{3}}{3} - k l T^{2} + l^{2} T)) \\ (+ i k (\tilde{x} - x + \frac{b T^{2}}{2}) + i l (\tilde{y} - y - b T)) d k d l, \end{matrix}

(3.63)

similar to (3.45). Thus, $ϖ$ has the form given by (3.52) with $(\overset{ˉ}{x}, \overset{ˉ}{y})$ replaced by $(\tilde{x}, \tilde{y}) .$ The exact form is recovered once $(\tilde{x}, \tilde{y})$ are expressed in terms of $(\overset{ˉ}{x}, \overset{ˉ}{y})$ by virtue of (3.59).

3.7 A Representative Example

A typical solution of the Kolmogorov equation is illustrated in Figure 6. This figure clearly shows that there is a good agreement between a Monte Carlo simulation of the stochastic process $({\hat{x}}_{t}, {\hat{y}}_{t})$ given by the equations in (3.26) and the corrected Kolmogorov formula (3.24).

Figure 6 A thousand trajectories of a typical Kolmogorov process. Parameters are as follows: $T = 5,$ $d t = 0.01,$ $f = 0.2,$ $σ = 0.8 .$ (a) $x (t),$ (b) $y (t),$ (c) $(\overset{ˉ}{x} (T), \overset{ˉ}{y} (T)),$ (d) contour lines of $ϖ (0, 0, 0, T, \tilde{x}, \tilde{y}) .$ Author’s graphics.

4 Klein–Kramers Stochastic Process

4.1 Background

The Klein–Kramers equation plays a vital role in statistical physics by offering a detailed mathematical framework for studying the dynamics of particles in a viscous, random medium. Specifically, it describes the evolution of the t.p.d.f. of a particle’s momentum and position in the phase plane, accounting for deterministic forces arising from potential and stochastic thermal forces arising from random collisions with the medium’s molecules. This equation is particularly important for studying nonequilibrium systems, which cannot be analyzed via traditional equilibrium statistical mechanics tools. By incorporating frictional forces, which tend to dampen the motion of particles, potential forces, which push them deterministically, and random thermal forces, which inject randomness into the system, the Klein–Kramers equation bridges the gap between microscopic laws of motion and the macroscopic observable phenomena, such as diffusion, thermal conductivity, and viscosity. Moreover, the Klein–Kramers equation serves as a foundation for exploring more complex phenomena in nonequilibrium statistical mechanics, including the study of transition state theory in macrokinetics of chemical reactions, the behavior of particles in external fields, and the exploration of noise-induced transitions and stochastic resonance in physical and biological systems. It also arises in financial engineering, for instance, in pricing volatility and variance swaps.

4.2 Langevin Equation

Start with the Langevin equation for particles moving in a potential field and impacted by random forces; see Reference LangevinLangevin (1908). This section uses the standard notation, rather that the original notation used in Reference ChandrasekharChandresekhar (1943). Hopefully, the diligent reader will not be easily confused. The stochastic Langevin equation describes the evolution of systems under the influence of deterministic forces and random fluctuations. Because of its versatility, it is widely used in physics and other disciplines to model the dynamics of particles subjected to systematic forces derived from potential energy and random forces representing thermal fluctuations. This equation describes a particle experiencing frictional resistance proportional to its velocity (a deterministic component) and random kicks from the surrounding molecules (a stochastic component capturing the essence of Brownian motion). The Langevin equation thus provides a robust framework for studying the behavior of systems subject to noise, enabling insights into phenomena such as diffusion, thermal equilibrium, and the statistical properties of microscopic systems.

Consider an underdamped Brownian particle. In contrast to the standard Brownian motion, which is overdamped, it is assumed that the frictions are finite, so that one must treat the particle’s velocity as an independent degree of freedom. Hence, the particle’s state is described by a pair $(x, y),$ where $x$ and $y$ are its position and velocity, respectively. Consider a $d$ -dimensional space, with $d = 1$ and $d = 3$ of particular interest, and write the corresponding Langevin equations in the following form:

\begin{matrix} \frac{d {\hat{x}}_{t}}{d t} & = {\hat{y}}_{t}, {\hat{x}}_{t} = x, \\ \frac{d {\hat{y}}_{t}}{d t} & = - κ {\hat{y}}_{t} - \frac{\nabla V ({\hat{x}}_{t})}{m} + \sqrt{\frac{2 κ k_{B} T}{m}} \frac{d {\hat{W}}_{t}}{d t}, {\hat{y}}_{t} = y, \end{matrix}

(4.1)

where ${\hat{W}}_{t}$ is a standard $d$ -dimensional Wiener process. Here $m$ is the particle mass, $κ$ is the friction coefficient, $k_{B}$ is the Boltzmann constant, $T$ is the temperature, $V (x)$ is the external potential, and $d {\hat{W}}_{t} / d t$ is a $d$ -dimensional Gaussian white noise. Below, the ratio $κ k_{B} T / m$ is denoted as $a .$

Of course, one can rewrite the equations of (4.1) as a system of stochastic differential equations (SDEs):

\begin{matrix} d {\hat{x}}_{t} & = {\hat{y}}_{t} d t, {\hat{x}}_{t} = x, \\ d {\hat{y}}_{t} & = - κ {\hat{y}}_{t} d t - \frac{\nabla V ({\hat{x}}_{t})}{m} d t + \sqrt{2 a} d {\hat{W}}_{t}, {\hat{y}}_{t} = y . \end{matrix}

(4.2)

For a $1$ -dimensional particle, (4.2) becomes:

\begin{matrix} d {\hat{x}}_{t} & = {\hat{y}}_{t} d t, {\hat{x}}_{t} = x, \\ d {\hat{y}}_{t} & = - κ {\hat{y}}_{t} d t - \frac{V_{x} ({\hat{x}}_{t})}{m} d t + \sqrt{2 a} d {\hat{W}}_{t}, {\hat{y}}_{t} = y . \end{matrix}

(4.3)

It is clear that the Kolmogorov equation (3.11) is a special case of (4.3) with $κ = 0,$ $V (x) = m f x,$ $k = a .$

4.3 Klein–Kramers Equation

Fokker, Planck, and their numerous followers derived and studied the forward parabolic equation for the t.p.d.f. $ϖ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y})$ associated with a stochastic process. For the stochastic process governed by SDEs (4.2), the corresponding equation, called the Klein–Kramers equation, has the following form:

\begin{matrix} ϖ_{\overset{ˉ}{t}} - a ϖ_{\overset{ˉ}{y} \overset{ˉ}{y}} + \overset{ˉ}{y} ϖ_{\overset{ˉ}{x}} - {((κ \overset{ˉ}{y} + \frac{\nabla V (\overset{ˉ}{x})}{m}) ϖ)}_{\overset{ˉ}{y}} = 0, \\ ϖ (t, x, y, t, \overset{ˉ}{x}, \overset{ˉ}{y}) = δ (\overset{ˉ}{x} - x) δ (\overset{ˉ}{y} - y) ​ . \end{matrix}

(4.4)

The backward parabolic Kolmogorov equation can be written as follows:

\begin{matrix} ϖ_{t} + a ϖ_{y y} + y ϖ_{x} - (κ y + \frac{\nabla V (x)}{m}) ϖ_{y} = 0, \\ ϖ (\overset{ˉ}{t}, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) = δ (\overset{ˉ}{x} - x) δ (\overset{ˉ}{y} - y) . \end{matrix}

(4.5)

Details are given in Reference FokkerFokker (1914), Reference PlanckPlanck (1917), Reference KleinKlein (1921), Reference ChapmanChapman (1928), Reference KolmogoroffKolmogoroff (1931, Reference Kolmogoroff1933, Reference Kolmogoroff1934); Reference KramersKramers (1940), Reference ChandrasekharChandresekhar (1943), Reference RiskenRisken (1989), and Reference Hänggi, Talkner and BorkovecHänggi et al. (1990), as well as a multitude of subsequent sources. For fascinating historical details, see Reference Ebeling, Gudowska-Nowak and SokolovEbeling et al. (2008). The Klein–Kramers equation (occasionally called Klein–Kramers–Chandrasekhar equation) describes the dynamics of a particle’s probability distribution in phase space (position and momentum) for systems subjected to friction and random forces, typically at the mesoscopic scale. The Klein–Kramers equation provides a comprehensive framework for modeling and understanding complex systems far from equilibrium, linking microscopic physics with macroscopic observables. Accordingly, it is used in various fields, such as materials science, chemistry, and astrophysics, to predict the evolution of systems over time, accounting for both deterministic dynamics and the effects of randomness.

4.4 Chandrasekhar’s Solutions

In a well-known survey article, Reference ChandrasekharChandresekhar (1943) described elegant solutions of (4.4) for a free particle and a harmonically bound particle, which he derived by using ingenious changes of coordinates. For a free particle, Reference ChandrasekharChandresekhar (1943) writes the corresponding Klein–Kramers equation as follows:

\begin{matrix} ϖ_{\overset{ˉ}{t}} - a ϖ_{\overset{ˉ}{y} \overset{ˉ}{y}} + \overset{ˉ}{y} ϖ_{\overset{ˉ}{x}} - κ \overset{ˉ}{y} ϖ_{\overset{ˉ}{y}} - κ ϖ = 0, \\ ϖ (t, x, y, t, \overset{ˉ}{x}, \overset{ˉ}{y}) = δ (\overset{ˉ}{x} - x) δ (\overset{ˉ}{y} - y) . \end{matrix}

(4.6)

By using ingenious coordinate transforms, he shows that

\begin{matrix} ϖ = \frac{1}{2 π {(F G - H^{2})}^{1 / 2}} exp (- \frac{(F R^{2} - 2 H R S + G S^{2})}{2 (F G - H^{2})}), \end{matrix}

(4.7)

where

\begin{matrix} R & = \overset{ˉ}{y} - e^{- κ T} y, \\ S & = (\overset{ˉ}{x} - x) - \frac{(1 - e^{- κ T})}{κ} y, \\ F & = \frac{a}{κ^{3}} (- 3 + 4 e^{- κ T} - e^{- 2 κ T} + 2 κ T), \\ G & = \frac{a}{κ} (1 - e^{- 2 κ T}), \\ H & = \frac{a}{κ^{2}} {(1 - e^{- κ T})}^{2} . \end{matrix}

(4.8)

Here the original Chandrasekhar’s notation is slightly changed to make the exposition more internally consistent.

Since it is assumed that stochastic drivers are uncorrelated, the t.p.d.f. $ϖ^{(3)}$ can be presented as a product of three $1$ -dimensional t.p.d.f. $ϖ^{(1)}$ :

\begin{matrix} ϖ^{(3)} & = ϖ_{1}^{(1)} ϖ_{2}^{(1)} ϖ_{3}^{(1)} \\ = \frac{1}{8 π^{3} {(F G - H^{2})}^{3 / 2}} exp (- \frac{(F {(R)}^{2} - 2 H R \cdot S + G {(S)}^{2})}{2 (F G - H^{2})}), \end{matrix}

(4.9)

where ${(R)}^{2} = {({\overset{ˉ}{y}}_{1} - e^{- κ T} y_{1})}^{2} + {({\overset{ˉ}{y}}_{2} - e^{- κ T} y_{2})}^{2} + {({\overset{ˉ}{y}}_{3} - e^{- κ T} y_{3})}^{2},$ and so on.

Chandrasekhar generalized (4.7) to the case of harmonically bound particles. We shall revisit Chandrasekhar’s formulas for free and bound particles later. While Reference ChandrasekharChandresekhar (1943) stopped at (4.7), for practical applications, it is more useful to represent the exponent as an explicit quadratic form of $\overset{ˉ}{x}$ and $\overset{ˉ}{y},$ which is done in Section 6.5.

5 Transition Probability Densities for Stochastic Processes

5.1 Motivation

The problems considered in Sections 3 and 4 are used in what follows to develop a general theory. For that, one needs to know some foundational information about stochastic processes discussed in this section. Stochastic processes play a crucial role across various scientific disciplines, which is fundamental for modeling systems influenced by randomness and uncertainty. These processes are pivotal in fields ranging from physics and chemistry to biology, economics, and financial engineering. They help to understand phenomena where outcomes are not deterministic but probabilistic, capturing the dynamics of complex systems over time. The analysis of stochastic processes enables scientists and engineers to predict behavior, assess risks, and make informed decisions based on the likelihood of future events.

The backward Kolmogorov and forward Fokker–Planck equations offer a mathematical description of how systems evolve under the influence of stochastic factors. This capability to model the t.p.d.fs of diverse processes underlines the equations’ fundamental importance in scientific research and practical applications across disciplines.

The Kolmogorov and Fokker–Planck equations are adjoint partial differential equations that describe how the probability density of a system’s state evolves in time. The Kolmogorov equation focuses on calculating the expected value at a given time of random outcomes, which become known sometime in the future. Conversely, the Fokker–Planck equation is concerned with the evolution of the conditional probability density function of a process’s state at a future time, given its current state.

The Kolmogorov and Fokker–Planck equations are applied in physics and chemistry to study the random motion of particles in fluids, the statistical behavior of thermodynamic systems, and the kinetics of chemical reactions. In biology, these equations model population dynamics, genetic variation, and the spread of diseases, among other processes, providing insights into how randomness affects biological phenomena. In financial engineering, they are used to model the evolution of asset prices, interest rates, and other economic indicators, underpinning the valuation of derivatives and the management of financial risks.

5.2 Backward and Forward Equations

Start with a jump-diffusion process driven by the SDE of the form

\begin{matrix} d {\hat{z}}_{t} = b (t, {\hat{z}}_{t}) d t + σ (t, {\hat{z}}_{t}) d {\hat{W}}_{t} + υ d {\hat{Π}}_{t} (t, {\hat{z}}_{t}), {\hat{z}}_{t} = z, \end{matrix}

(5.1)

with smooth coefficients $b, σ .$ This process is driven by the standard Wiener process ${\hat{W}}_{t}$ and the Poisson process ${\hat{Π}}_{t} (t, z)$ with intensity $λ (t, z)$ such that

\begin{matrix} E ((d Π_{t} (t, z)) {\hat{z}}_{t} = z) = λ (t, z) d t, \end{matrix}

(5.2)

while $υ$ is drawn from a distribution with density $ϕ (υ, t, z),$ which (in general) is $(t, z)$ -dependent.

More generally, it is possible to consider the so-called general compound or marked Poisson processes, such that $υ = υ (t, z, q)$ , where $υ$ is monotonic in $z,$ and $q$ is a random mark variable drawn from a distribution with density $ϕ (q, t, z),$ which (in general) is $(t, z)$ -dependent. However, since this Element is interested in a particular class of stochastic processes, solvable via Kelvin waves ansatz this generalization is not particularly useful.

It is well-known that for suitable test functions $\tilde{u} (z)$ the expectation

\begin{matrix} u (t, z) = E ((\tilde{u} ({\hat{z}}_{\overset{ˉ}{t}})) {\hat{z}}_{t} = z) \end{matrix}

(5.3)

solves the following integro-differential backward Kolmogorov problem:

\begin{matrix} u_{t} (t, z) + a (t, z) u_{z z} (t, z) + b (t, z) u_{z} (t, z) \\ + λ (t, z) \int_{- \infty}^{\infty} u (t, z + υ) ϕ (υ, t, z) d υ - λ (t, z) u (t, z) = 0, \\ u (\overset{ˉ}{t}, z) = \tilde{u} (z), \end{matrix}

(5.4)

where

\begin{matrix} a (t, z) = \frac{1}{2} σ^{2} (t, z) . \end{matrix}

(5.5)

In particular, the t.p.d.f. $ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z})$ such that

\begin{matrix} Pr o b ((\overset{ˉ}{z} < {\hat{z}}_{\overset{ˉ}{t}} < \overset{ˉ}{z} + d \overset{ˉ}{z}) {\hat{z}}_{t} = z) = ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) d \overset{ˉ}{z}, \end{matrix}

(5.6)

solves the following backward Kolmogorov problem:

\begin{matrix} ϖ_{t} (t, z) + a (t, z) ϖ_{z z} (t, z) + b (t, z) ϖ_{z} (t, z) - \\ + λ (t, z) \int_{- \infty}^{\infty} ϖ (t, z + υ) ϕ (υ, t, z) d υ - λ (t, z) ϖ (t, z) = 0, \\ ϖ (\overset{ˉ}{t}, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = δ (z - \overset{ˉ}{z}) . \end{matrix}

(5.7)

It is possible to derive a forward problem for $ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}),$ which $ϖ$ satisfies as a function of $(\overset{ˉ}{t}, \overset{ˉ}{z}),$ which is called Fokker–Planck or forward Kolmogorov problem. This problem has the following form:

\begin{matrix} ϖ_{\overset{ˉ}{t}} (\overset{ˉ}{t}, \overset{ˉ}{z}) - {(a (\overset{ˉ}{t}, \overset{ˉ}{z}) ϖ (\overset{ˉ}{t}, \overset{ˉ}{z}))}_{\overset{ˉ}{z} \overset{ˉ}{z}} + {(b (\overset{ˉ}{t}, \overset{ˉ}{z}) ϖ (\overset{ˉ}{t}, \overset{ˉ}{z}))}_{\overset{ˉ}{z}} \\ - \int_{- \infty}^{\infty} λ (\overset{ˉ}{t}, \overset{ˉ}{z} - υ) ϖ (\overset{ˉ}{t}, \overset{ˉ}{z} - υ) ϕ (\overset{ˉ}{t}, \overset{ˉ}{z} - υ, υ) d υ + λ (\overset{ˉ}{t}, \overset{ˉ}{z}) ϖ (\overset{ˉ}{t}, \overset{ˉ}{z}) = 0, \\ ϖ (t, z, t, \overset{ˉ}{z}) = δ (\overset{ˉ}{z} - z) . \end{matrix}

(5.8)

One can generalize backward Kolmogorov and forward Fokker–Planck equation to the multidimensional case. The underlying $n_{z}$ -dimensional process ${\hat{z}}_{t} = ({\hat{z}}_{i, t}),$ $i = 1, \dots, n_{z},$ has the form

\begin{matrix} d {\hat{z}}_{t} = b (t, {\hat{z}}_{t}) d t + Σ (t, {\hat{z}}_{t}) d {\hat{W}}_{t} + υ d {\hat{Π}}_{t} (t, z_{t}), \end{matrix}

(5.9)

where ${\hat{W}}_{t} = ({\hat{W}}_{j, t})$ is an $n_{W}$ -dimensional Wiener process, $j = 1, \dots, n_{W},$ and $\hat{Π} = ({\hat{Π}}_{k, t})$ is an $n_{Π}$ -dimensional state-dependent Poisson process, $k = 1, \dots, n_{Π}$ with intensity $λ .$ The corresponding state-dependent coefficients are as follows:

\begin{matrix} b (t, z) & = (b_{i} (t, z)), i = 1, \dots, n_{z}, \\ Σ (t, z) & = (Σ_{i j} (t, z)), i = 1, \dots, n_{z}, j = 1, \dots, n_{W}, \\ λ (t, z) & = (λ_{i} (t, z)), i = 1, \dots, n_{Π}, \\ υ & = (υ_{i k}), i = 1, \dots, n_{z}, k = 1, \dots, n_{Π}, \end{matrix}

(5.10)

while $υ_{k}$ are drawn from distributions with densities $ϕ_{k} (υ, t, z),$ which (in general) are $(t, z)$ -dependent. Explicitly, the equations in (5.9) can be written as follows:

\begin{matrix} d {\hat{z}}_{i, t} = b_{i} (t, {\hat{z}}_{t}) d t + Σ_{i j} (t, {\hat{z}}_{t}) d {\hat{W}}_{j, t} + υ_{i k} d {\hat{Π}}_{k} (t, {\hat{z}}_{t}) . \end{matrix}

(5.11)

The backward and forward equations for the t.p.d.f. $ϖ$ can be written as follows:

\begin{matrix} ϖ_{t} (t, z) + a_{i j} (t, z) ϖ_{z_{i} z_{j}} (t, z) + b_{i} (t, z) ϖ_{z_{i}} (t, z) \\ + λ_{k} (t, z) \int_{- \infty}^{\infty} ϖ (t, z + υ_{k}) ϕ_{k} (υ_{k}, t, z) d υ_{k} - Λ (t, z) ϖ (t, z) = 0, \\ ϖ (\overset{ˉ}{t}, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = δ (z - \overset{ˉ}{z}), \\ A (t, z) = (a_{i i^{'}} (t, z)) = \frac{1}{2} {Σ (t, z) Σ}^{*} (t, z) = \frac{1}{2} σ_{i j} (t, z) σ_{i^{'} j} (t, z), \\ Λ (t, z) = \sum_{k = 1}^{n_{Π}} λ_{k} (t, z) . \end{matrix}

(5.12)

For the generic terminal condition $\tilde{u} (z),$ the corresponding backward problem has the following form:

\begin{matrix} u_{t} (t, z) + a_{i j} (t, z) u_{z_{i} z_{j}} (t, z) + b_{i} (t, z) u_{z_{i}} (t, z) \\ + λ_{k} (t, z) \int_{- \infty}^{\infty} u (t, z + υ_{k}) ϕ_{k} (υ_{k}, t, z) d υ_{k} - Λ (t, z) u (t, z) = 0, \\ u (\overset{ˉ}{t}, z) = \tilde{u} (z), \end{matrix}

(5.14)

The forward equations for the t.p.d.f. $ϖ$ can be written as follows:

\begin{matrix} ϖ_{\overset{ˉ}{t}} (\overset{ˉ}{t}, \overset{ˉ}{z}) - {(a_{i j} (\overset{ˉ}{t}, \overset{ˉ}{z}) ϖ (\overset{ˉ}{t}, \overset{ˉ}{z}))}_{{\overset{ˉ}{z}}_{i} {\overset{ˉ}{z}}_{j}} + {(b_{i} (\overset{ˉ}{t}, \overset{ˉ}{z}) ϖ (\overset{ˉ}{t}, \overset{ˉ}{z}))}_{{\overset{ˉ}{z}}_{i}} \\ - \int_{- \infty}^{\infty} λ_{k} (\overset{ˉ}{t}, \overset{ˉ}{z} - υ_{k}) ϖ (\overset{ˉ}{t}, \overset{ˉ}{z} - υ_{k}) ϕ_{k} (υ_{k}, \overset{ˉ}{t}, \overset{ˉ}{z} - υ_{k}) d υ_{k} + Λ (\overset{ˉ}{t}, \overset{ˉ}{z}) ϖ (\overset{ˉ}{t}, \overset{ˉ}{z}) = 0, \\ ϖ (t, z, t, \overset{ˉ}{z}) = δ (\overset{ˉ}{z} - z) . \end{matrix}

(5.15)

Further details can be found in Reference Bharucha-ReidBharucha-Reid (1960), Reference FellerFeller (1971), Reference Gihman and SkorohodGihman and Skorohod (1972), Reference ArnoldArnold (1974), and Reference HansonHanson (2007), among others.

Although, depending on the actual problem at hand, it might be preferable to work with either the backward or the forward problem, experience suggests that in the context of mathematical finance the backward problem is easier to deal with, not least because they are meaningful for the generic terminal value $\tilde{u} (\overset{ˉ}{z}) .$

Since the preceding definitions are very general, it is necessary to be more specific in defining the class of problems which can be solved by using Kelvin waves. Consider processes such that

\begin{matrix} A (t, z) & = A^{0} (t) + z_{i} A^{i} (t), b (t, z) = b^{0} (t) + z_{i} b^{i} (t), \\ λ (t, z) & = λ^{0} (t) + z_{i} λ^{i} (t), ϕ (υ, t, z) = ϕ (υ, t), \end{matrix}

(5.16)

so that the corresponding backward Kolmogorov problem has the form

\begin{matrix} u_{t} (t, z) & + (a_{i j}^{0} (t) + z_{l} a_{i j}^{l} (t)) u_{z_{i} z_{j}} (t, z) + (b_{i}^{0} (t) + z_{l} b_{i}^{l} (t)) u_{z_{i}} (t, z) \\ + (λ_{k}^{0} (t) + z_{l} λ_{k}^{l} (t)) \int_{- \infty}^{\infty} u (t, z + υ_{k}) ϕ_{k} (υ_{k}, t) d υ_{k} \\ - (Λ^{0} (t) + z_{l} Λ^{l} (t)) u (t, z) = 0, \\ u (\overset{ˉ}{t}, z) & = \tilde{u} (z) . \end{matrix}

(5.17)

Symbolically, (5.17) can be written as follows:

\begin{matrix} u_{t} (t, z) + L^{(0)} (u) (t, z) + \sum_{l = 1}^{n_{z}} z_{l} L^{(l)} (u) (t, z) = 0, \\ u (\overset{ˉ}{t}, z) = \tilde{u} (z), \end{matrix}

(5.18)

where $L^{(0)},$ $L^{(l)}$ are spatially homogeneous operators, with coefficients depending only on time (at most):

\begin{matrix} L^{(0)} (u) (t, z) = a_{i j}^{0} (t) u_{z_{i} z_{j}} (t, z) + b_{i}^{0} (t) u_{z_{i}} (t, z) \\ + λ_{k}^{0} (t) \int_{- \infty}^{\infty} u (t, z + υ_{k}) ϕ_{k} (υ_{k}, t) d υ_{k} - Λ^{0} (t) u (t, z), \\ L^{(l)} (u) (t, z) = a_{i j}^{l} (t) u_{z_{i} z_{j}} (t, z) + b_{i}^{l} (t) u_{z_{i}} (t, z) \\ + λ_{k}^{l} (t) \int_{- \infty}^{\infty} u (t, z + υ_{k}) ϕ_{k} (υ_{k}, t) d υ_{k} - Λ^{l} (t) u (t, z) . \end{matrix}

(5.19)

For the t.p.d.f. $ϖ,$ one has

\begin{matrix} ϖ_{t} (t, z) + L^{(0)} ϖ (t, z) + \sum_{l = 1}^{n_{z}} z_{l} L^{(l)} (u) u (t, z) = 0, \\ ϖ (\overset{ˉ}{t}, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = δ (z - \overset{ˉ}{z}) . \end{matrix}

(5.20)

Moreover, to cover interesting and important cases, such as anomalous diffusions and the like, generalize (5.18) and consider pseudo-differential operators $L^{(\overset{ˉ}{l})},$ $\overset{ˉ}{l} = 0, \dots, n_{z} .$ Recall that a translationally invariant pseudo-differential operator $L$ is defined as follows:

\begin{matrix} L (u) (z) = \frac{1}{{(2 π)}^{n_{z}}} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} L (m) u (z^{'}) e^{i m (z - z^{'})} d z^{'} d m, \end{matrix}

(5.21)

where $L (m)$ is called the symbol of a pseudo-differential operator; see, for example, Reference CordesCordes (1995) and Reference WongWong (2014). It is clear that all diffusion operators belong to this category, and so do jump-diffusion operators. The symbol of the operator $L^{(\overset{ˉ}{l})} (t)$

\begin{matrix} L^{(\overset{ˉ}{l})} (t, m) = - a_{i j}^{(\overset{ˉ}{l})} (t) m_{i} m_{j} + i b_{i}^{(\overset{ˉ}{l})} (t) m_{i} + λ_{k}^{(\overset{ˉ}{l})} (t) ψ_{k} (t, m) - Λ^{(\overset{ˉ}{l})} (t), \end{matrix}

(5.22)

where $ψ_{k} (m)$ is the characteristic function of $ϕ_{k} (υ)$ :

\begin{matrix} ψ_{k} (t, m) = \int_{- \infty}^{\infty} e^{i {m υ}_{k}} ϕ_{k} (t, υ_{k}) d υ_{k} . \end{matrix}

(5.23)

While frequently studied in the pure and applied mathematical context, in the financial engineering context pseudo-differential operators are seldom discussed; see, however, Reference Jacob, Schilling and Barndorff-NielsenJacob and Schilling (2001).

By definition, Fourier and Kelvin modes are eigenfunctions of the operators $L^{(0)},$ $L^{(l)} .$ Accordingly, when all $L^{(l)} = 0,$ one can solve the corresponding backward problem via the standard Fourier modes $F$ given by (1.1):

\begin{matrix} u (t, z) = \frac{1}{{(2 π)}^{n_{z}}} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} \tilde{u} (z^{'}) e^{α (t, \overset{ˉ}{t}, m) + i m (z - z^{'})} d z^{'} d m, \end{matrix}

(5.24)

where

\begin{matrix} α_{t} (t, \overset{ˉ}{t}, m) + L^{(0)} (t, m) = 0, α (\overset{ˉ}{t}, \overset{ˉ}{t}, m) = 0, \end{matrix}

(5.25)

so that

\begin{matrix} α (t, \overset{ˉ}{t}, m) = \int_{t}^{\overset{ˉ}{t}} L^{(0)} (s, m) d s . \end{matrix}

(5.26)

However, in general, one needs to use Kelvin modes $K,$ given by (1.2):

\begin{matrix} u (t, z) = \frac{1}{{(2 π)}^{n_{z}}} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} \tilde{u} (z^{'}) e^{α (t, \overset{ˉ}{t}, m) + i δ (t, \overset{ˉ}{t}, m) z - i {m z}^{'}} d z^{'} d m, \end{matrix}

(5.27)

where

\begin{matrix} α_{t} (t, \overset{ˉ}{t}, m) + L^{(0)} (t, δ (t, \overset{ˉ}{t}, m)) = 0, α (\overset{ˉ}{t}, \overset{ˉ}{t}, m) = 0, \\ δ_{l, t} (t, \overset{ˉ}{t}, m) + L^{(l)} (t, δ (t, \overset{ˉ}{t}, m)) = 0, δ (\overset{ˉ}{t}, \overset{ˉ}{t}, m) = m . \end{matrix}

(5.28)

Of course, finding explicit solutions of ODEs (5.28) is possible only in exceptional cases, some of which are discussed below. However, it is always possible to solve them numerically, which is much easier than trying to solve the corresponding PDEs directly.

As mentioned earlier, three archetypal stochastic processes are arithmetic Wiener processes (or Brownian motions), Ornstein-Uhlenbeck (OU) and Feller processes; see Reference Uhlenbeck and OrnsteinUhlenbeck and Ornstein (1930), Reference ChandrasekharChandresekhar (1943), and Reference FellerFeller (1951), Reference Feller(1952). These processes are described by the following SDEs:

\begin{matrix} d {\hat{y}}_{t} & = χ d t + ε d {\hat{W}}_{t}, {\hat{y}}_{t} = y, \end{matrix}

(5.29)

\begin{matrix} d {\hat{y}}_{t} & = (χ - κ {\hat{y}}_{t}) d t + ε d {\hat{W}}_{t}, {\hat{y}}_{t} = y, \end{matrix}

(5.30)

\begin{matrix} d {\hat{y}}_{t} & = (χ - κ {\hat{y}}_{t}) d t + ε \sqrt{{\hat{y}}_{t}} d {\hat{W}}_{t}, {\hat{y}}_{t} = y, \end{matrix}

(5.31)

respectively. It is clear that the corresponding $L^{(0)}, L^{(1)}$ are:

\begin{matrix} L^{(0)} (u) & = \frac{1}{2} ε^{2} u_{y y} + χ u_{y}, L^{(1)} (u) = 0, \end{matrix}

(5.32)

\begin{matrix} L^{(0)} (u) & = \frac{1}{2} ε^{2} u_{y y} + χ u_{y}, L^{(1)} (u) = - κ u_{y}, \end{matrix}

(5.33)

\begin{matrix} L^{(0)} (u) & = χ u_{y}, L^{(1)} (u) = \frac{1}{2} ε^{2} u_{y y} - κ u_{y} . \end{matrix}

(5.34)

There are important differences among these processes. For an arithmetic Brownian motion, the operator $L^{(0)}$ is a second-order differential operator, while $L^{(1)}$ is zero, and the process is defined on the whole axis. For an OU process the operator $L^{(0)}$ is a second-order differential operator, while $L^{(1)}$ is a first-order operator; accordingly, this process is defined on the entire axis. In contrast, for a Feller process $L^{(0)}$ is a first-order differential operator, while $L^{(1)}$ is a second-order operator; hence, the process is only defined on a positive semiaxis.Footnote ⁵

5.3 Augmentation Procedure

While covering a lot of useful applications, OU and Feller processes are not sufficient to study all the practically important problems. Hence, one needs to enrich them via the so-called augmentation procedure; see Reference LiptonLipton (2001) . The underlying idea is straightforward. Given a stochastic process, say, an arithmetic Brownian motion, or an OU process, one can expand it by introducing additional stochastic variables depending on the original process. For example, an augmented Brownian motion (5.29) becomes a one-dimensional Kolmogorov process:

\begin{matrix} d {\hat{x}}_{t} = {\hat{y}}_{t} d t, {\hat{x}}_{t} = x, \\ d {\hat{y}}_{t} = χ d t + ε d {\hat{W}}_{t}, {\hat{y}}_{t} = y . \end{matrix}

(5.35)

Similarly, one can augment OU and Feller processes as follows:

\begin{matrix} d {\hat{x}}_{t} = {\hat{y}}_{t} d t, x_{t} = x, \\ d {\hat{y}}_{t} = (χ - κ {\hat{y}}_{t}) d t + ε d {\hat{W}}_{t}, y_{t} = y, \end{matrix}

(5.36)

\begin{matrix} d {\hat{x}}_{t} = {\hat{y}}_{t} d t, x_{t} = x, \\ d {\hat{y}}_{t} = (χ - κ {\hat{y}}_{t}) d t + ε \sqrt{{\hat{y}}_{t}} d {\hat{W}}_{t}, y_{t} = y, \end{matrix}

(5.37)

respectively. Of course, many other possibilities are practically important. In what follows, the Element analyzes several practically relevant and mathematically interesting augmented stochastic processes.

5.4 Reduction Procedure

Stochastic processes, which are not inherently affine, can often be transformed into an affine form through appropriate modifications. While some transformations are readily apparent, others demand significant effort and inspiration to identify, as highlighted by Reference Carr, Lipton and MadanCarr et al. (2002) and referenced works.

Consider the geometric Brownian motion, the cornerstone of mathematical finance and other disciplines. The associated stochastic process is not affine and is described by

\begin{matrix} d {\hat{X}}_{t} = μ (t) {\hat{X}}_{t} d t + ν (t) {\hat{X}}_{t} d {\hat{W}}_{t}, {\hat{X}}_{t} = X . \end{matrix}

(5.38)

Applying a logarithmic transformation,

\begin{matrix} {\hat{X}}_{t} \to {\hat{x}}_{t} = ln ({\hat{X}}_{t}), \end{matrix}

(5.39)

converts it into an arithmetic Brownian motion, which is affine:

\begin{matrix} d {\hat{x}}_{t} = (μ (t) - \frac{1}{2} ν^{2} (t)) d t + ν (t) d {\hat{W}}_{t}, {\hat{x}}_{t} = x = ln (X) . \end{matrix}

(5.40)

This example illustrates that, with some ingenuity, even nonaffine processes like the geometric Brownian motion can be adapted for use with the existing analytical frameworks.

Another helpful example is transforming the Rayleigh process into the Feller process. Recall that the Rayleigh process describes a stochastic process on the positive semiaxis. We write this process as follows:

\begin{matrix} d {\hat{σ}}_{t} = (\frac{A}{{\hat{σ}}_{t}} - B {\hat{σ}}_{t}) d t + C d {\hat{W}}_{t}, {\hat{σ}}_{t} = σ, \end{matrix}

(5.41)

where $A, B, C > 0 .$ Define ${\hat{v}}_{t} = {\hat{σ}}_{t}^{2}$ ; then, according to Ito’s lemma, the dynamics of the process ${\hat{v}}_{t}$ have the following form:

\begin{matrix} d {\hat{v}}_{t} = (2 A + C^{2} - 2 B {\hat{v}}_{t}) d t + 2 C \sqrt{{\hat{v}}_{t}} d {\hat{W}}_{t}, {\hat{v}}_{t} = v = σ^{2} . \end{matrix}

(5.42)

In financial applications considered in Section 8, the pair $σ,$ $v$ represents the volatility and variance of a price process.

6 Gaussian Stochastic Processes

6.1 Regular Gaussian Processes

Consider the governing system of SDEs, which might or might not be degenerate, and write the governing system of SDEs as follows:

\begin{matrix} d {\hat{z}}_{t} = (b + B {\hat{z}}_{t}) d t + Σ d {\hat{W}}_{t}, {\hat{z}}_{t} = z, \end{matrix}

(6.1)

where ${\hat{z}}_{t}, b$ are $(M \times 1)$ vectors, and $B$ and $Σ$ are $(M \times M)$ matrices. Below, it is assumed that the corresponding coefficients are time-dependent.

The Fokker–Plank equation has the following form:

\begin{matrix} ϖ_{\overset{ˉ}{t}} (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) - \sum \sum A ϖ_{\overset{ˉ}{z} \overset{ˉ}{z}} (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) \\ + (b + B \overset{ˉ}{z}) \cdot ϖ_{\overset{ˉ}{z}} (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) + b ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = 0, \\ ϖ (t, z, t, \overset{ˉ}{z}) = δ (\overset{ˉ}{z} - z), \end{matrix}

(6.2)

where, in agreement with the general (5.13), $A$ is proportional to the covariance matrix,

\begin{matrix} A = (a_{m m^{'}}) = \frac{1}{2} {Σ Σ}^{*} = \frac{1}{2} σ_{m k} σ_{m^{'} k}, b = T r (B) = b_{m m} . \end{matrix}

(6.3)

Recall that Einstein’s summation rule is used throughout the Element. Explicitly,

\begin{matrix} \partial_{\overset{ˉ}{t}} ϖ - a_{m m^{'}} \partial_{{\overset{ˉ}{z}}_{m}} \partial_{{\overset{ˉ}{z}}_{m^{'}}} ϖ + (b_{m} + b_{m m^{'}} {\overset{ˉ}{z}}_{m^{'}}) \partial_{{\overset{ˉ}{z}}_{m}} ϖ + b ϖ = 0, \\ ϖ (t, z, t, \overset{ˉ}{z}) = δ (\overset{ˉ}{z} - z) . \end{matrix}

(6.4)

The general Kolmogorov-type SDE, solvable via the Kelvin (or affine) ansatz, can be written in the following form:

\begin{matrix} d {\hat{x}}_{t} & = (b^{(x)} + B^{(x x)} {\hat{x}}_{t} + B^{(x y)} {\hat{y}}_{t}) d t, {\hat{x}}_{t} = x, \\ d {\hat{y}}_{t} & = (b^{(y)} + B^{(y x)} {\hat{x}}_{t} + B^{(y y)} {\hat{y}}_{t}) d t + Σ^{(y y)} d {\hat{W}}_{t}^{(y)}, y_{t} = y, \end{matrix}

(6.5)

where ${\hat{x}}_{_{t}}$ and $b^{(x)}$ are $(K \times 1)$ column vectors, ${\hat{y}}_{t}$ and $b^{(y)}$ are $(L \times 1)$ column vectors, $B^{(x x)},$ $B^{(x y)},$ $B^{(y x)},$ $B^{(y y)},$ and $Σ^{(y y)}$ are $(K \times K),$ $(K \times L),$ $(L \times K),$ $(L \times L),$ and $(L \times L)$ matrices, respectively. In what follows, it is assumed that the corresponding coefficients are time-dependent. As usual, ${\hat{W}}_{t}$ is a standard $L$ -dimensional Brownian motion.

More compactly, one can write the system of SDEs as follows:

\begin{matrix} d {\hat{z}}_{t} = (b^{(z)} + B^{(z z)} {\hat{z}}_{t}) d t + (\begin{matrix} 0 \\ Σ^{(y y)} d {\hat{W}}_{t}^{(y)} \end{matrix}), & {\hat{z}}_{t} = (\begin{matrix} x \\ y \end{matrix}), \end{matrix}

(6.6)

where

\begin{matrix} {\hat{z}}_{t} = (\begin{matrix} {\hat{x}}_{t} \\ {\hat{y}}_{t} \end{matrix}), b^{(z)} = (\begin{matrix} b^{(x)} \\ b^{(y)} \end{matrix}), B^{(z z)} = (\begin{matrix} B^{(x x)} & B^{(x y)} \\ B^{(y x)} & B^{(y y)} \end{matrix}), \end{matrix}

(6.7)

so that ${\hat{z}}_{t}$ and $b^{(z)}$ are $(M \times 1)$ column vectors, and $B^{(z z)}$ is a $(M \times M)$ matrix, with $M = K + L .$ In addition, define a scalar $b^{(z)} = T r (B^{(z z)}) = T r (B^{(x x)}) + T r (B^{(y x)}) .$

The corresponding Fokker–Plank problem has the following form:

\begin{matrix} ϖ_{\overset{ˉ}{t}} (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) - \sum \sum A ϖ_{y y} (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) \\ + (b^{(z)} + B^{(z)} \overset{ˉ}{z}) \cdot ϖ_{\overset{ˉ}{z}} (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) + b^{(z)} ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = 0, \\ ϖ (t, z, t, \overset{ˉ}{z}) = δ (x - x) δ (y - y) ​, \end{matrix}

(6.8)

where $A$ has the following form:

\begin{matrix} A = (a_{l l^{'}}) = \frac{1}{2} σ_{l \overset{ˉ}{l}} σ_{l^{'} \overset{ˉ}{l}} = \frac{1}{2} Σ^{(y y)} Σ^{(y y) *} . \end{matrix}

(6.9)

Explicitly,

\begin{matrix} \partial_{\overset{ˉ}{t}} ϖ - a_{l l^{'}} \partial_{{\overset{ˉ}{z}}_{K + l}} \partial_{{\overset{ˉ}{z}}_{K + l^{'}}} ϖ + (b_{m}^{(z)} + b_{m m^{'}}^{(z z)} {\overset{ˉ}{z}}_{m^{'}}) \partial_{{\overset{ˉ}{z}}_{m}} ϖ + b^{(z)} ϖ = 0, \end{matrix}

(6.10)

6.1.1 Solution via Kelvin Waves

By using the Kelvin-inspired ansatz, one can represent $ϖ$ in the following form:

\begin{matrix} ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) & = \frac{1}{{(2 π)}^{M}} \int_{- \infty}^{\infty} \dots \int_{- \infty}^{\infty} K (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}, m) d m, \\ K (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}, m) & = exp (Ψ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}, m)), \\ Ψ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}, m) & = α (t, \overset{ˉ}{t}) + i δ (t, \overset{ˉ}{t}) \cdot \overset{ˉ}{z} - i m \cdot z, \end{matrix}

(6.11)

where $m$ is an $(M \times 1)$ column vector, $δ$ is an $(M \times 1)$ column vector, and

\begin{matrix} α (t, t) = 0, δ (t, t) = m . \end{matrix}

(6.12)

Accordingly:

\begin{matrix} \frac{K_{\overset{ˉ}{t}}}{K} & = Ψ_{\overset{ˉ}{t}} = (α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + i δ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) \cdot \overset{ˉ}{z}), \\ \frac{K_{\overset{ˉ}{z}}}{K} & = Ψ_{\overset{ˉ}{z}} = i δ (t, \overset{ˉ}{t}), \frac{K_{\overset{ˉ}{z} \overset{ˉ}{z}}}{K} = Ψ_{\overset{ˉ}{z}}^{2} = - δ (t, \overset{ˉ}{t}) δ^{*} (t, \overset{ˉ}{t}) . \end{matrix}

(6.13)

The coupled equations for $α,$ $δ$ have the following form:

\begin{matrix} α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + i δ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) \cdot \overset{ˉ}{z} + δ (t, \overset{ˉ}{t}) \cdot A δ (t, \overset{ˉ}{t}) + i δ (t, \overset{ˉ}{t}) \cdot (b + B \overset{ˉ}{z}) + b = 0, \end{matrix}

(6.14)

so that

\begin{matrix} α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + δ (t, \overset{ˉ}{t}) \cdot A δ (t, \overset{ˉ}{t}) + i δ (t, \overset{ˉ}{t}) \cdot b + b = 0, α (t, t) = 0, \end{matrix}

(6.15)

\begin{matrix} δ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + B^{*} δ (t, \overset{ˉ}{t}) = 0, δ (t, t) = m . \end{matrix}

(6.16)

Let $L (t, \overset{ˉ}{t})$ be the fundamental solution of the homogeneous system of ODEs (6.16), namely, the matrix such that

\begin{matrix} \partial_{\overset{ˉ}{t}} L (t, \overset{ˉ}{t}) + B^{*} (\overset{ˉ}{t}) L (t, \overset{ˉ}{t}) = 0, L (t, t) = I . \end{matrix}

(6.17)

The solution of (6.16) has the following form:

\begin{matrix} δ (t, \overset{ˉ}{t}) = L (t, \overset{ˉ}{t}) m . \end{matrix}

(6.18)

Thus,

\begin{matrix} α (t, \overset{ˉ}{t}) = - \frac{1}{2} m \cdot C^{- 1} (t, \overset{ˉ}{t}) m - i m \cdot d (t, \overset{ˉ}{t}) - ς (t, \overset{ˉ}{t}), \end{matrix}

(6.19)

where $C^{- 1}$ is an $M \times M$ positive-definite matrix of the following form:

\begin{matrix} C^{- 1} (t, \overset{ˉ}{t}) = 2 \int_{t}^{\overset{ˉ}{t}} L^{*} (t, s) A (s) L (t, s) d s, \end{matrix}

(6.20)

while $d$ is an $(M \times 1)$ column vector,

\begin{matrix} d (t, \overset{ˉ}{t}) = \int_{t}^{\overset{ˉ}{t}} L^{*} (t, s) b (s) d s, \end{matrix}

(6.21)

and $ς$ is a scalar,

\begin{matrix} ς (t, \overset{ˉ}{t}) = \int_{t}^{\overset{ˉ}{t}} b (s) d s . \end{matrix}

(6.22)

Accordingly,

\begin{matrix} Ψ (t, \overset{ˉ}{t}, \overset{ˉ}{z}, m) = - \frac{1}{2} m \cdot C^{- 1} (t, \overset{ˉ}{t}) m + i m \cdot (L^{*} (t, \overset{ˉ}{t}) \overset{ˉ}{z} - d (t, \overset{ˉ}{t}) - z) - ς (t, \overset{ˉ}{t}) . \end{matrix}

(6.23)

Thus,

\begin{matrix} ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = \frac{det {(C (t, \overset{ˉ}{t}))}^{1 / 2} exp (- ς (t, \overset{ˉ}{t}))}{{(2 π)}^{M / 2}} \\ \times \int_{- \infty}^{\infty} \dots \int_{- \infty}^{\infty} G (t, \overset{ˉ}{t}, m) exp (i m \cdot (L^{*} (t, \overset{ˉ}{t}) \overset{ˉ}{z} - d (t, \overset{ˉ}{t}) - z)) d m, \end{matrix}

(6.24)

where $G (t, \overset{ˉ}{t}, m)$ is the density of a multivariate Gaussian distribution in the $m$ -space. It is clear that $ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z})$ is proportional to the characteristic function of $G$ evaluated at the point $(L^{*} (t, \overset{ˉ}{t}) \overset{ˉ}{z} - d (t, \overset{ˉ}{t}) - z),$ so that

\begin{matrix} ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = \frac{det {(C (t, \overset{ˉ}{t}))}^{1 / 2} exp (- ς (t, \overset{ˉ}{t}))}{{(2 π)}^{M / 2}} \\ \times exp (- \frac{1}{2} (L^{*} (t, \overset{ˉ}{t}) \overset{ˉ}{z} - d (t, \overset{ˉ}{t}) - z) \cdot C (t, \overset{ˉ}{t}) (L^{*} (t, \overset{ˉ}{t}) \overset{ˉ}{z} - d (t, \overset{ˉ}{t}) - z)) . \end{matrix}

(6.25)

Thus, $ϖ$ can be represented in the form:

\begin{matrix} ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = N (r (t, \overset{ˉ}{t}), H (t, \overset{ˉ}{t})), \end{matrix}

(6.26)

where

\begin{matrix} H (t, \overset{ˉ}{t}) & = {(L^{*} (t, \overset{ˉ}{t}))}^{- 1} C^{- 1} (t, \overset{ˉ}{t}) L^{- 1} (t, \overset{ˉ}{t}), \\ r (t, \overset{ˉ}{t}) & = {(L^{*} (t, \overset{ˉ}{t}))}^{- 1} (d (t, \overset{ˉ}{t}) + z) . \end{matrix}

(6.27)

These results are applicable to the general Kolmogorov-type SDE solvable via the Kelvin (or affine) ansatz, which have the form (6.5). By using the same Kelvin ansatz as before, one can represent $ϖ$ in the form (6.11):

\begin{matrix} ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) & = \frac{1}{{(2 π)}^{M}} \int_{- \infty}^{\infty} \dots \int_{- \infty}^{\infty} K (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}, m) d m, \\ K (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}, m) & = exp (Ψ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}, m)), \\ Ψ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}, m) & = α (t, \overset{ˉ}{t}) + i δ (t, \overset{ˉ}{t}) \cdot \overset{ˉ}{z} - i m \cdot z, \end{matrix}

(6.28)

where $m$ is an $(M \times 1)$ column vector, $m = {(k, l)}^{*},$ $k$ is a $(K \times 1)$ column vector, $l$ is an $(L \times 1)$ column vector, $δ$ is an $(M \times 1)$ column vector, $δ = {(β, γ)}^{*},$ $β$ is a $(K \times 1)$ column vector, $γ$ is an $(L \times 1)$ column vector, and

\begin{matrix} α (t, t) = 0, δ (t, t) = {(β (t, t), γ (t, t))}^{*} = m = {(k, l)}^{*} . \end{matrix}

(6.29)

As before:

\begin{matrix} \frac{K_{t}}{K} & = Ψ_{t} = (α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + i δ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) \cdot \overset{ˉ}{z}), \frac{K_{x}}{K} = Ψ_{x} = i β (t, \overset{ˉ}{t}), \\ \frac{K_{y}}{K} & = Ψ_{y} = i γ (t, \overset{ˉ}{t}), \frac{K_{y y}}{K} = Ψ_{y}^{2} = - γ (t, \overset{ˉ}{t}) γ^{*} (t, \overset{ˉ}{t}) . \end{matrix}

(6.30)

The equations for $α, δ$ have the following form:

\begin{matrix} α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + i δ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) \cdot \overset{ˉ}{z} + γ (t, \overset{ˉ}{t}) \cdot A γ (t, \overset{ˉ}{t}) + i δ (t, \overset{ˉ}{t}) \cdot (b^{(z)} + B^{(z z)} \overset{ˉ}{z}) + b^{(z)} = 0. \end{matrix}

(6.31)

Accordingly,

\begin{matrix} α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + γ (t, \overset{ˉ}{t}) \cdot A γ (t, \overset{ˉ}{t}) + i δ (t, \overset{ˉ}{t}) \cdot b^{(z)} + b^{(z)} = 0, α (t, t) = 0, \end{matrix}

(6.32)

\begin{matrix} δ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + B^{(z z) *} δ (t, \overset{ˉ}{t}) = 0, δ (t, t) = m = {(k, l)}^{*} . \end{matrix}

(6.33)

Let $L (t, \overset{ˉ}{t})$ be the fundamental solution of the homogeneous system of ODEs (6.33), namely, the matrix such that

\begin{matrix} \partial_{\overset{ˉ}{t}} L (t, \overset{ˉ}{t}) + B^{(z z) *} (\overset{ˉ}{t}) L (t, \overset{ˉ}{t}) = 0, L (t, t) = I, \end{matrix}

(6.34)

where $I$ is the identity matrix. The well-known Liouville’s formula yields

\begin{matrix} det (L (t, \overset{ˉ}{t})) = exp (- \int_{t}^{\overset{ˉ}{t}} b^{(z)} (s) d s) . \end{matrix}

(6.35)

The solution of (6.32) is

\begin{matrix} δ (t, \overset{ˉ}{t}) = L (t, \overset{ˉ}{t}) m . \end{matrix}

(6.36)

It is convenient to write $L (t, \overset{ˉ}{t})$ in the block form:

\begin{matrix} L (t, \overset{ˉ}{t}) = (\begin{matrix} L^{(x x)} (t, \overset{ˉ}{t}) & L^{(x y)} (t, \overset{ˉ}{t}) \\ L^{(y x)} (t, \overset{ˉ}{t}) & L^{(y y)} (t, \overset{ˉ}{t}) \end{matrix}) . \end{matrix}

(6.37)

It follows from (6.33) that

\begin{matrix} α (t, \overset{ˉ}{t}) = - \frac{1}{2} m \cdot C^{- 1} (t, \overset{ˉ}{t}) m - i {m \cdot d}^{(z)} (t, \overset{ˉ}{t}) - ς (t, \overset{ˉ}{t}), \end{matrix}

(6.38)

where $C^{- 1}$ is an $M \times M$ positive-definite matrix split into four blocks of the form:

\begin{matrix} C^{- 1} (t, \overset{ˉ}{t}) \\ = 2 (\begin{matrix} \int_{t}^{\overset{ˉ}{t}} L^{(y x) *} (t, s) A (s) L^{(y x)} (t, s) d s & \int_{t}^{\overset{ˉ}{t}} L^{(y x) *} (t, s) A (s) L^{(y y)} (t, s) d s \\ \int_{t}^{\overset{ˉ}{t}} L^{(y y) *} (t, s) A (s) L^{(y x)} (t, s) d s & \int_{t}^{\overset{ˉ}{t}} L^{(y y) *} (t, s) A (s) L^{(y y)} (t, s) d s \end{matrix}), \end{matrix}

(6.39)

while $d^{(z)} = {(d^{(x)}, d^{(y)})}^{*},$ $d^{(x)}$ and $d^{(y)}$ are $(M \times 1)$ and $(N \times 1)$ column vectors, and $ς$ is a scalar:

\begin{matrix} d^{(z)} (t, \overset{ˉ}{t}) & = \int_{t}^{\overset{ˉ}{t}} L^{*} (t, s) b^{(z)} (s) d s, \end{matrix}

(6.40)

\begin{matrix} ς (t, \overset{ˉ}{t}) & = \int_{t}^{\overset{ˉ}{t}} b^{(z)} (s) d s . \end{matrix}

(6.41)

Accordingly,

\begin{matrix} Ψ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}, m) \\ = - \frac{1}{2} m \cdot C^{- 1} (t, \overset{ˉ}{t}) m + i L (t, \overset{ˉ}{t}) m \cdot \overset{ˉ}{z} - i m \cdot (d^{(z)} (t, \overset{ˉ}{t}) + z) - ς (t, \overset{ˉ}{t}) \\ = - \frac{1}{2} m \cdot C^{- 1} (t, \overset{ˉ}{t}) m + i m \cdot (L^{*} (t, \overset{ˉ}{t}) \overset{ˉ}{z} - d^{(z)} (t, \overset{ˉ}{t}) - z) - ς (t, \overset{ˉ}{t}) . \end{matrix}

(6.42)

Thus,

\begin{matrix} ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = & \frac{det {(C)}^{1 / 2} exp (- ς (t, \overset{ˉ}{t}))}{{(2 π)}^{M / 2}} \int_{- \infty}^{\infty} \dots \int_{- \infty}^{\infty} G (t, \overset{ˉ}{t}, m) \\ \times exp (i m \cdot (L^{*} (t, \overset{ˉ}{t}) \overset{ˉ}{z} - d^{(z)} (t, \overset{ˉ}{t}) - z)) d m, \end{matrix}

(6.43)

where $G (t, \overset{ˉ}{t}, m)$ is the density of a multivariate Gaussian distribution in the $m$ -space. It is clear that $ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z})$ is proportional to the characteristic function of $G$ evaluated at the point $(L^{*} (t, \overset{ˉ}{t}) \overset{ˉ}{z} - d^{(z)} (t, \overset{ˉ}{t}) - z),$ so that

\begin{matrix} ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = \frac{det {(C)}^{1 / 2} exp (- ς (t, \overset{ˉ}{t}))}{{(2 π)}^{M / 2}} \\ \times exp (- \frac{1}{2} (L^{*} (t, \overset{ˉ}{t}) \overset{ˉ}{z} - d^{(z)} (t, \overset{ˉ}{t}) - z) \cdot C (L^{*} (t, \overset{ˉ}{t}) \overset{ˉ}{z} - d^{(z)} (t, \overset{ˉ}{t}) - z)) . \end{matrix}

(6.44)

By using (6.35), one can rewrite (6.44) in the standard Gaussian form:

\begin{matrix} ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = N (r (t, \overset{ˉ}{t}), H (t, \overset{ˉ}{t})), \end{matrix}

(6.45)

where the covariance matrix $H$ and the mean $r$ are as follows:

\begin{matrix} H (t, \overset{ˉ}{t}) & = {(L^{*} (t, \overset{ˉ}{t}))}^{- 1} C^{- 1} (t, \overset{ˉ}{t}) L^{- 1} (t, \overset{ˉ}{t}), \\ r (t, \overset{ˉ}{t}) & = {(L^{*} (t, \overset{ˉ}{t}))}^{- 1} (d^{(z)} (t, \overset{ˉ}{t}) + z) . \end{matrix}

(6.46)

6.1.2 Solution via Coordinate Transform

Consider the Fokker–Planck problem (6.4). Introduce new variables:

\begin{matrix} (\overset{ˉ}{t}, \overset{ˉ}{z}) \to (\overset{ˉ}{t}, \tilde{z}) = (\overset{ˉ}{t}, R (\overset{ˉ}{t}) \overset{ˉ}{z}), {\tilde{z}}_{m} = r_{m m^{'}} (\overset{ˉ}{t}) {\overset{ˉ}{z}}_{m^{'}}, r_{m m^{'}} (0) = δ_{m m^{'}} . \end{matrix}

(6.47)

Then

\begin{matrix} \partial_{\overset{ˉ}{t}} = \partial_{\overset{ˉ}{t}} + \partial_{\overset{ˉ}{t}} r_{m m^{'}} {\overset{ˉ}{z}}_{m^{'}} \partial_{{\tilde{z}}_{m}}, \partial_{{\overset{ˉ}{z}}_{m}} = r_{m^{'} m} \partial_{{\tilde{z}}_{m^{'}}} . \end{matrix}

(6.48)

The transformed Fokker–Planck problem becomes

\begin{matrix} \partial_{\overset{ˉ}{t}} \tilde{ϖ} - a_{m m^{'}} r_{n m} r_{n^{'} m^{'}} \partial_{{\tilde{z}}_{n}} \partial_{{\tilde{z}}_{n^{'}}} \tilde{ϖ} + ((b_{m m^{'}} {\overset{ˉ}{z}}_{m^{'}} + b_{m}) r_{n m} + \partial_{\overset{ˉ}{t}} r_{n m^{'}} {\overset{ˉ}{z}}_{m^{'}}) \partial_{{\tilde{z}}_{n}} \tilde{ϖ} \\ + b \tilde{ϖ} = 0, \\ \tilde{ϖ} (t, z, t, \tilde{z}) = δ (\tilde{z} - z) . \end{matrix}

(6.49)

To simplify the drift term, it is required that

\begin{matrix} \partial_{\overset{ˉ}{t}} r_{m m^{'}} (t, \overset{ˉ}{t}) + b_{n m^{'}} (t, \overset{ˉ}{t}) r_{m n} (t, \overset{ˉ}{t}) = 0, r_{m m^{'}} (t, t) = δ_{n m^{'}} . \end{matrix}

(6.50)

In matrix notation:

\begin{matrix} \partial_{\overset{ˉ}{t}} R (t, \overset{ˉ}{t}) + R (t, \overset{ˉ}{t}) B (t) = 0, R (t, t) = I . \end{matrix}

(6.51)

Thus, $R = L^{*},$ $r_{m m^{'}} = l_{m^{'} m},$ where $L$ is given by (6.34). It is easy to see that $\tilde{ϖ}$ satisfies the Fokker–Planck problem of the following form:

\begin{matrix} \partial_{\overset{ˉ}{t}} \tilde{ϖ} - {\tilde{a}}_{n n^{'}} (t, \overset{ˉ}{t}) \partial_{{\tilde{z}}_{n}} \partial_{{\tilde{z}}_{n^{'}}} \tilde{ϖ} + {\tilde{b}}_{n} (t, \overset{ˉ}{t}) \partial_{{\tilde{z}}_{n}} \tilde{ϖ} + b (t, \overset{ˉ}{t}) \tilde{ϖ} = 0, \\ \tilde{ϖ} (t, z, t, \tilde{z}) = δ (\tilde{z} - z), \end{matrix}

(6.52)

with

\begin{matrix} {\tilde{a}}_{n n^{'}} (t, \overset{ˉ}{t}) = l_{m n} (t, \overset{ˉ}{t}) a_{m m^{'}} (t, \overset{ˉ}{t}) l_{m^{'} n^{'}} (t, \overset{ˉ}{t}), \\ {\tilde{b}}_{n} (\overset{ˉ}{t}) = l_{m^{'} n} (t, \overset{ˉ}{t}) b_{m^{'}} (\overset{ˉ}{t}) . \end{matrix}

(6.53)

In matrix notation:

\begin{matrix} \tilde{A} = L^{*} (t, \overset{ˉ}{t}) A (t, \overset{ˉ}{t}) L (t, \overset{ˉ}{t}), \tilde{b} = L^{*} (t, \overset{ˉ}{t}) b . \end{matrix}

(6.54)

Accordingly,

\begin{matrix} \tilde{ϖ} (t, z, \overset{ˉ}{t}, \tilde{z}) = exp (- \int_{t}^{\overset{ˉ}{t}} b (s) d s) N ((\tilde{z}) z + \int_{t}^{\overset{ˉ}{t}} {\tilde{b}}_{n} (s) d s, \int_{t}^{\overset{ˉ}{t}} \tilde{C} (s) d s) . \end{matrix}

(6.55)

Reverting back to the original variables, $(\overset{ˉ}{t}, \tilde{z}) \to (\overset{ˉ}{t}, \overset{ˉ}{z}),$ one recovers (6.45), as expected.

6.2 Killed Gaussian Processes

Consider a process governed by a system of SDEs (6.1), which is killed with intensity $\overset{ˉ}{c}$ linearly depending on $\overset{ˉ}{z},$ namely,

\begin{matrix} \overset{ˉ}{c} = c + c \cdot \overset{ˉ}{z}, \end{matrix}

(6.56)

where $c$ is a scalar, and $c^{(z)}$ is an $(M \times 1)$ column vector. Thus, $\overset{ˉ}{c}$ is the intensity at which the process goes into a “killed” state at some random time. The Fokker–Planck equation for a killed process has the following form:

\begin{matrix} ϖ_{\bar{t}} (t, z, \bar{t}, \bar{z}) - \sum \sum A ϖ_{\bar{z} \bar{z}} (t, z, \bar{t %}, \bar{z}) \\ + (b + B \bar{z}) \cdot ϖ_{% \bar{z}} (t, z, \bar{t}, \bar{z}) + (b + c + c \cdot \bar{z}) ϖ (t, z, \bar{t}, % \bar{z}) = 0, \\ ϖ (t, \bar{z}, t, z) = δ (% \bar{z} - z) ​ . \end{matrix}

(6.57)

Explicitly,

\begin{matrix} ϖ_{\overset{ˉ}{t}} - a_{m m^{'}} ϖ_{{\overset{ˉ}{z}}_{m} {\overset{ˉ}{z}}_{m^{'}}} + (b_{m} + b_{m m^{'}} {\overset{ˉ}{z}}_{m^{'}}) ϖ_{{\overset{ˉ}{z}}_{m}} + (b + c + c_{m} {\overset{ˉ}{z}}_{m}) ϖ = 0, \\ ϖ (t, \overset{ˉ}{z}, t, z) = δ (\overset{ˉ}{z} - z) . \end{matrix}

(6.58)

This problem can be solved by the same technique as before.

6.2.1 Solution via Kelvin Waves

The familiar Kelvin ansatz yields

\begin{matrix} α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + δ (t, \overset{ˉ}{t}) \cdot A δ (t, \overset{ˉ}{t}) + i δ (t, \overset{ˉ}{t}) \cdot b + b + c = 0, α (t, t) = 0, \end{matrix}

(6.59)

\begin{matrix} δ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + B^{*} δ (t, \overset{ˉ}{t}) - i c = 0, δ (t, t) = m . \end{matrix}

(6.60)

Let $L (t, \overset{ˉ}{t})$ be the fundamental solution of the homogeneous system of ODEs (6.60), namely, the matrix such that

\begin{matrix} \partial_{\overset{ˉ}{t}} L (t, \overset{ˉ}{t}) + B^{*} (\overset{ˉ}{t}) L (t, \overset{ˉ}{t}) = 0, L (t, t) = I, \end{matrix}

(6.61)

The solution of (6.60) has the following form:

\begin{matrix} δ (t, \overset{ˉ}{t}) & = L (t, \overset{ˉ}{t}) m + i L (t, \overset{ˉ}{t}) \int_{t}^{\overset{ˉ}{t}} L^{- 1} (t, s) c (s) d s \equiv L (t, \overset{ˉ}{t}) (m + i e (t, \overset{ˉ}{t})), \\ e (t, \overset{ˉ}{t}) & = \int_{t}^{\overset{ˉ}{t}} L^{- 1} (t, s) c (s) d s . \end{matrix}

(6.62)

Thus,

\begin{matrix} α = - \frac{1}{2} m \cdot C^{- 1} m - i m \cdot d - ς, \end{matrix}

(6.63)

where $C^{- 1}$ is an $M \times M$ positive-definite matrix of the form:

\begin{matrix} C^{- 1} (t, \overset{ˉ}{t}) = 2 \int_{t}^{\overset{ˉ}{t}} L^{*} (t, s) A (s) L (t, s) d s, \end{matrix}

(6.64)

while $d$ is an $(M \times 1)$ column vector,

\begin{matrix} d (t, \overset{ˉ}{t}) = \int_{t}^{\overset{ˉ}{t}} L^{*} (t, s) (b (s) + A (s) L (t, s) e (s)) d s, \end{matrix}

(6.65)

and $ς = ς_{0} + ς_{1}$ is a scalar,

\begin{matrix} ς_{0} (t, \overset{ˉ}{t}) = & \int_{t}^{\overset{ˉ}{t}} b (s) d s, \\ ς_{1} (t, \overset{ˉ}{t}) = & \int_{t}^{\overset{ˉ}{t}} (c (s) - \frac{1}{2} e (t, s) \cdot L^{*} (t, s) A (s) L (t, s) e (s)) \\ (- e (t, s) \cdot L^{*} (t, s) b (s)) d s . \end{matrix}

(6.66)

Accordingly,

\begin{matrix} Ψ (t, \overset{ˉ}{t}, \overset{ˉ}{z}, m) = & - \frac{1}{2} m \cdot C^{- 1} (t, \overset{ˉ}{t}) m + i m \cdot (L^{*} (t, \overset{ˉ}{t}) \overset{ˉ}{z} - d (t, \overset{ˉ}{t}) - z) \\ - L (t, \overset{ˉ}{t}) e (t, \overset{ˉ}{t}) \cdot \overset{ˉ}{z} - ς (t, \overset{ˉ}{t}) . \end{matrix}

(6.67)

Thus,

\begin{matrix} ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = \frac{det {(C (t, \overset{ˉ}{t}))}^{1 / 2} exp (- L (t, \overset{ˉ}{t}) e (t, \overset{ˉ}{t}) \cdot \overset{ˉ}{z} - ς_{0} (t, \overset{ˉ}{t}) - ς_{1} (t, \overset{ˉ}{t}))}{{(2 π)}^{M / 2}} \\ \times \int_{- \infty}^{\infty} \dots \int_{- \infty}^{\infty} G (t, \overset{ˉ}{t}, m) exp (i m \cdot (L^{*} (t, \overset{ˉ}{t}) \overset{ˉ}{z} - d (t, \overset{ˉ}{t}) - z)) d m, \end{matrix}

(6.68)

where $G (t, \overset{ˉ}{t}, m)$ is the density of a multivariate Gaussian distribution in the $m$ -space. It is clear that $ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z})$ is proportional to the characteristic function of $G$ evaluated at the point $(L^{*} (t, \overset{ˉ}{t}) \overset{ˉ}{z} - d (t, \overset{ˉ}{t}) - z),$ so that

\begin{matrix} ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = \frac{det {(C (t, \overset{ˉ}{t}))}^{1 / 2} exp (- L (t, \overset{ˉ}{t}) e (t, \overset{ˉ}{t}) \cdot \overset{ˉ}{z} - ς_{0} (t, \overset{ˉ}{t}) - ς_{1} (t, \overset{ˉ}{t}))}{{(2 π)}^{M / 2}} \\ \times exp (- \frac{1}{2} (L^{*} (t, \overset{ˉ}{t}) \overset{ˉ}{z} - d (t, \overset{ˉ}{t}) - z) \cdot C (t, \overset{ˉ}{t}) (L^{*} (t, \overset{ˉ}{t}) \overset{ˉ}{z} - d (t, \overset{ˉ}{t}) - z)) . \end{matrix}

(6.69)

It is often convenient to rewrite (6.69) as follows:

\begin{matrix} ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = Q (t, \overset{ˉ}{t}, \overset{ˉ}{z}) N (q (t, \overset{ˉ}{t}), H (t, \overset{ˉ}{t})), \end{matrix}

(6.70)

where

\begin{matrix} H (t, \overset{ˉ}{t}) & = {(L^{*} (t, \overset{ˉ}{t}))}^{- 1} C^{- 1} (t, \overset{ˉ}{t}) L^{- 1} (t, \overset{ˉ}{t}), \\ q (t, \overset{ˉ}{t}) & = {(L^{*} (t, \overset{ˉ}{t}))}^{- 1} (d (t, \overset{ˉ}{t}) + z), \\ Q (t, \overset{ˉ}{t}, \overset{ˉ}{z}) & = exp (- L (t, \overset{ˉ}{t}) e (t, \overset{ˉ}{t}) \cdot \overset{ˉ}{z} - ς_{1} (t, \overset{ˉ}{t})) . \end{matrix}

(6.71)

As could be expected, the probability $ϖ$ is no longer conserved due to a prefactor $Q,$ reflecting the fact that the process is killed with intensity $\overset{ˉ}{c} .$

It is worth noting that $Q$ depends on $\overset{ˉ}{z}$ but does not depend on $z .$ Completing the square, one can represent $ϖ$ in the form:

\begin{matrix} ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = R (t, z, \overset{ˉ}{t}) N (r (t, \overset{ˉ}{t}), H (t, \overset{ˉ}{t})), \end{matrix}

(6.72)

where

\begin{matrix} r (t, \overset{ˉ}{t}) & = {(L^{*} (t, \overset{ˉ}{t}))}^{- 1} (d (t, \overset{ˉ}{t}) + z - C^{- 1} (t, \overset{ˉ}{t}) e (t, \overset{ˉ}{t})) \\ = q (t, \overset{ˉ}{t}) - H (t, \overset{ˉ}{t}) L (t, \overset{ˉ}{t}) e (t, \overset{ˉ}{t}), \\ R (t, z, \overset{ˉ}{t}) & = exp (- e (t, \overset{ˉ}{t}) \cdot (d (t, \overset{ˉ}{t}) + z) + \frac{1}{2} e (t, \overset{ˉ}{t}) \cdot C^{- 1} (t, \overset{ˉ}{t}) e (t, \overset{ˉ}{t}) - ς_{1} (t, \overset{ˉ}{t})) . \end{matrix}

(6.73)

It is clear that $R$ depends on $z$ but does not depend on $\overset{ˉ}{z} .$ Accordingly, (6.72) is easier to use than (6.70) when future expectations are calculated.

The same formulas can be derived via the method of coordinate transforms. Details are left to the interested reader.

6.3 Example: Kolmogorov Process

Extend the Kolmogorov formula to the case when $b$ and $σ$ are functions of time, $b (t)$ and $σ (t) .$ The corresponding SDE has the following form:

\begin{matrix} d {\hat{x}}_{t} = {\hat{y}}_{t} d t, {\hat{x}}_{t} = x, \\ d {\hat{y}}_{t} = b (t) d t + σ (t) d {\hat{W}}_{t}, {\hat{y}}_{t} = y . \end{matrix}

(6.74)

Accordingly, (6.34) can be written as follows:

\begin{matrix} L^{'} (t, \overset{ˉ}{t}) + (\begin{matrix} 0 & 0 \\ 1 & 0 \end{matrix}) L (t, \overset{ˉ}{t}) = 0, L (t, t) = (\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}), \end{matrix}

(6.75)

so that

\begin{matrix} L (t, \overset{ˉ}{t}) = (\begin{matrix} 1 & 0 \\ - T & 1 \end{matrix}), L^{- 1} (t, \overset{ˉ}{t}) = (\begin{matrix} 1 & 0 \\ T & 1 \end{matrix}) . \end{matrix}

(6.76)

Once $L (t, \overset{ˉ}{t})$ is known, one can compute $C^{- 1} (t, \overset{ˉ}{t}),$ $d^{(z)} (t, \overset{ˉ}{t}),$ $ς (t, \overset{ˉ}{t})$ :

\begin{matrix} C^{- 1} (t, \overset{ˉ}{t}) & = (\begin{matrix} ψ_{2} (t, \overset{ˉ}{t}) & - ψ_{1} (t, \overset{ˉ}{t}) \\ - ψ_{1} (t, \overset{ˉ}{t}) & ψ_{0} (t, \overset{ˉ}{t}) \end{matrix}), \\ d^{(z)} (t, \overset{ˉ}{t}) & = (\begin{matrix} d^{(x)} (t, \overset{ˉ}{t}) \\ d^{(y)} (t, \overset{ˉ}{t}) \end{matrix}) = (\begin{matrix} - ϕ_{1} (t, \overset{ˉ}{t}) \\ ϕ_{0} (t, \overset{ˉ}{t}) \end{matrix}), \\ ς (t, \overset{ˉ}{t}) & = 0, \end{matrix}

(6.77)

where

\begin{matrix} ϕ_{i} (t, \overset{ˉ}{t}) = \int_{t}^{\overset{ˉ}{t}} {(s - t)}^{i} b (s) d s, ψ_{i} (t, \overset{ˉ}{t}) = \int_{t}^{\overset{ˉ}{t}} {(s - t)}^{i} σ^{2} (s) d s . \end{matrix}

(6.78)

Next, the covariance matrix $H (t, \overset{ˉ}{t}),$ and the mean $r (t, \overset{ˉ}{t})$ are calculated as follows:

\begin{matrix} H (t, \overset{ˉ}{t}) & = {(L^{*} (t, \overset{ˉ}{t}))}^{- 1} C^{- 1} (t, \overset{ˉ}{t}) L^{- 1} (t, \overset{ˉ}{t}) \\ = (\begin{matrix} 1 & T \\ 0 & 1 \end{matrix}) (\begin{matrix} ψ_{2} (t, \overset{ˉ}{t}) & - ψ_{1} (t, \overset{ˉ}{t}) \\ - ψ_{1} (t, \overset{ˉ}{t}) & ψ_{0} (t, \overset{ˉ}{t}) \end{matrix}) (\begin{matrix} 1 & 0 \\ T & 1 \end{matrix}) \end{matrix}

(6.79)

\begin{matrix} = (\begin{matrix} ψ_{0} (t, \overset{ˉ}{t}) T^{2} - 2 ψ_{1} (t, \overset{ˉ}{t}) T + ψ_{2} (t, \overset{ˉ}{t}) & ψ_{0} (t, \overset{ˉ}{t}) T - ψ_{1} (t, \overset{ˉ}{t}) \\ ψ_{0} (t, \overset{ˉ}{t}) T - ψ_{1} (t, \overset{ˉ}{t}) & ψ_{0} (t, \overset{ˉ}{t}) \end{matrix}), \\ r (t, \overset{ˉ}{t}) & = (\begin{matrix} - ϕ_{1} (t, \overset{ˉ}{t}) + x + (ϕ_{0} (t, \overset{ˉ}{t}) + y) T \\ ϕ_{0} (t, \overset{ˉ}{t}) + y \end{matrix}) . \end{matrix}

(6.80)

Accordingly, $ϖ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y})$ is a bivariate Gaussian distribution of the form (6.26), with

\begin{matrix} σ_{x}^{2} (t, \overset{ˉ}{t}) & = ψ_{0} (t, \overset{ˉ}{t}) T^{2} - 2 ψ_{1} (t, \overset{ˉ}{t}) T + ψ_{2} (t, \overset{ˉ}{t}), σ_{y}^{2} = ψ_{0} (t, \overset{ˉ}{t}), \\ ρ (t, \overset{ˉ}{t}) & = \frac{(ψ_{0} (t, \overset{ˉ}{t}) T - ψ_{1} (t, \overset{ˉ}{t}))}{\sqrt{ψ_{0} (t, \overset{ˉ}{t}) (ψ_{0} (t, \overset{ˉ}{t}) T^{2} - 2 ψ_{1} (t, \overset{ˉ}{t}) T + ψ_{2} (t, \overset{ˉ}{t}))}}, \\ p (t, \overset{ˉ}{t}) & = - ϕ_{1} (t, \overset{ˉ}{t}) + x + (ϕ_{0} (t, \overset{ˉ}{t}) + y) T, q (t, \overset{ˉ}{t}) = ϕ_{0} (t, \overset{ˉ}{t}) + y . \end{matrix}

(6.81)

It is left to the interested reader to verify that (6.81) coincides with (3.52) when $σ$ and $b$ are constant. Therefore, the classical Kolmogorov solution can be extended to the case of time-dependent parameters.

6.4 Example: OU Process

6.4.1 OU Process

It is worth deriving the well-known t.p.d.f. for the OU process using Kelvin waves for benchmarking purposes. The following SDE governs the OU process:

\begin{matrix} d {\hat{y}}_{t} = (χ (t) - κ (t) {\hat{y}}_{t}) d t + ε (t) d {\hat{W}}_{t}, {\overset{ˉ}{y}}_{t} = y . \end{matrix}

(6.82)

Equivalently,

\begin{matrix} d {\hat{y}}_{t} = κ (t) (θ (t) - {\hat{y}}_{t}) d t + ε (t) d {\hat{W}}_{t}, {\overset{ˉ}{y}}_{t} = y, \end{matrix}

(6.83)

where $θ (t) = χ (t) / κ (t) .$

The corresponding Fokker–Planck problem has the following form:

\begin{matrix} ϖ_{\overset{ˉ}{t}} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) - \frac{1}{2} ε^{2} ϖ_{\overset{ˉ}{y} \overset{ˉ}{y}} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) + (χ (\overset{ˉ}{t}) - κ (\overset{ˉ}{t}) \overset{ˉ}{y}) ϖ_{\overset{ˉ}{y}} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) \\ - κ (\overset{ˉ}{t}) ϖ (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) = 0, \\ ϖ (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) = δ (\overset{ˉ}{y} - y) . \end{matrix}

(6.84)

The associated function $K (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}, l)$ has the following form:

\begin{matrix} K = exp (α (t, \overset{ˉ}{t}) + i γ (t, \overset{ˉ}{t}) \overset{ˉ}{y} - i l y), \end{matrix}

(6.85)

so that

\begin{matrix} α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + \frac{1}{2} ε^{2} (\overset{ˉ}{t}) γ^{2} (t, \overset{ˉ}{t}) + i χ (\overset{ˉ}{t}) γ (t, \overset{ˉ}{t}) - κ (\overset{ˉ}{t}) = 0, α (t, t) = 0, \\ γ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) - κ (\overset{ˉ}{t}) γ (t, \overset{ˉ}{t}) = 0, γ (t, t) = l . \end{matrix}

(6.86)

Thus,

\begin{matrix} γ (t, \overset{ˉ}{t}) & = e^{η (t, \overset{ˉ}{t})} l, \\ α (t, \overset{ˉ}{t}) & = - \frac{1}{2} ψ_{0} (t, \overset{ˉ}{t}) l^{2} - (\int_{t}^{\overset{ˉ}{t}} e^{η (t, s)} χ (s) d s) i l + η (t, \overset{ˉ}{t}) . \end{matrix}

(6.87)

where

\begin{matrix} η (t, \overset{ˉ}{t}) & = \int_{t}^{\overset{ˉ}{t}} κ (s) d s, \end{matrix}

(6.88)

ψ_{0} (t, \bar{t}) = \int_{t}^{\bar{t}} e^{2 η (t, s)} ε^{2} (s) d s .

(6.89)

Since the same quantities will appear regularly throughout the Element, it is convenient to introduce the following notation:

\begin{matrix} A_{κ} (t, \overset{ˉ}{t}) & = e^{- η (t, \overset{ˉ}{t})}, B_{κ} (t, \overset{ˉ}{t}) = \int_{t}^{\overset{ˉ}{t}} e^{- η (t, s)} d s, {\overset{ˉ}{B}}_{κ} (t, \overset{ˉ}{t}) = \int_{t}^{\overset{ˉ}{t}} e^{- η (s, \overset{ˉ}{t})} d s, \\ A_{- κ} (t, \overset{ˉ}{t}) & = e^{η (t, \overset{ˉ}{t})}, B_{- κ} (t, \overset{ˉ}{t}) = \int_{t}^{\overset{ˉ}{t}} e^{η (t, s)} d s, {\overset{ˉ}{B}}_{- κ} (t, \overset{ˉ}{t}) = \int_{t}^{\overset{ˉ}{t}} e^{η (s, \overset{ˉ}{t})} d s, \end{matrix}

(6.90)

In particular, for constant $κ,$ one has

\begin{matrix} A_{κ} (t, \overset{ˉ}{t}) & = e^{- κ T} = A_{κ} (T), A_{- κ} (t, \overset{ˉ}{t}) = e^{κ T} = A_{- κ} (T), \\ B_{κ} (t, \overset{ˉ}{t}) & = {\overset{ˉ}{B}}_{κ} (t, \overset{ˉ}{t}) = \frac{1 - e^{- κ T}}{κ} = B_{κ} (T) = {\overset{ˉ}{B}}_{κ} (T), \\ B_{- κ} (t, \overset{ˉ}{t}) & = {\overset{ˉ}{B}}_{- κ} (t, \overset{ˉ}{t}) = \frac{e^{κ T} - 1}{κ} = B_{- κ} (T) = {\overset{ˉ}{B}}_{- κ} (T), \end{matrix}

(6.91)

and

\begin{matrix} A_{0} (t, \overset{ˉ}{t}) = 1, B_{0} (t, \overset{ˉ}{t}) = {\overset{ˉ}{B}}_{0} (t, \overset{ˉ}{t}) = T . \end{matrix}

(6.92)

In this notation, $ψ_{0}$ can be written as follows:

\begin{matrix} ψ_{0} (t, \overset{ˉ}{t}) = \int_{t}^{\overset{ˉ}{t}} A_{- 2 κ} (t, s) ε^{2} (s) d s . \end{matrix}

(6.93)

Thus, the following well-known expression is obtained:

\begin{matrix} ϖ (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) \\ = \frac{1}{2 π} \int_{- \infty}^{\infty} exp (- \frac{ψ_{0} (t, \overset{ˉ}{t}) l^{2}}{2} + (e^{η (t, \overset{ˉ}{t})} \overset{ˉ}{y} - \int_{t}^{\overset{ˉ}{t}} e^{η (t, s)} χ (s) d s - y) i l + η (t, \overset{ˉ}{t})) d l \\ = \frac{A_{- κ} (t, \overset{ˉ}{t})}{\sqrt{2 π ψ_{0} (t, \overset{ˉ}{t})}} exp (- \frac{{(A_{- κ} (t, \overset{ˉ}{t}) \overset{ˉ}{y} - \int_{t}^{\overset{ˉ}{t}} A_{- κ} (t, s) χ (s) d s - y)}^{2}}{2 ψ_{0} (t, \overset{ˉ}{t})}) \\ = \frac{1}{\sqrt{2 π {\hat{ψ}}_{0} (t, \overset{ˉ}{t})}} exp (- \frac{{(\overset{ˉ}{y} - \int_{t}^{\overset{ˉ}{t}} A_{κ} (t, s) χ (s) d s - A_{κ} (t, \overset{ˉ}{t}) y)}^{2}}{2 {\hat{ψ}}_{0} (t, \overset{ˉ}{t})}), \end{matrix}

(6.94)

where

\begin{matrix} {\hat{ψ}}_{0} (t, \overset{ˉ}{t}) = A_{2 κ} (t, \overset{ˉ}{t}) ψ_{0} (t, \overset{ˉ}{t}) = \int_{t}^{\overset{ˉ}{t}} A_{2 κ} (s, \overset{ˉ}{t}) ε^{2} (s) d s . \end{matrix}

(6.95)

For further discussion, see the original paper by Reference Uhlenbeck and OrnsteinUhlenbeck and Ornstein (1930), as well as Reference ChandrasekharChandresekhar (1943), Reference RiskenRisken (1989), and references therein.

For time-independent parameters, (6.94) has the form:

\begin{matrix} ϖ (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) = \frac{1}{\sqrt{2 π Σ^{2} (t, \overset{ˉ}{t})}} exp (- \frac{{(\overset{ˉ}{y} - θ - A_{κ} (T) (y - θ))}^{2}}{2 Σ^{2} (t, \overset{ˉ}{t})}), \end{matrix}

(6.96)

with

\begin{matrix} Σ^{2} (t, \overset{ˉ}{t}) = \frac{ε^{2} (1 - e^{- 2 κ T})}{2 κ} = ε^{2} B_{2 κ} (T) . \end{matrix}

(6.97)

6.4.2 Gaussian Augmented OU Process

This subsection considers an augmented one-dimensional OU process of the form:

\begin{matrix} d {\hat{x}}_{t} & = {\hat{y}}_{t} d t, {\hat{x}}_{t} = x, \\ d {\hat{y}}_{t} & = (χ (t) - κ (t) {\hat{y}}_{t}) d t + ε (t) d {\hat{W}}_{t}, {\hat{y}}_{t} = y . \end{matrix}

(6.98)

To align the analysis with the existing body of work, switch from the general notation, used above, to a specific one customarily used for the OU process. Here and in what follows, the word “augmentation” means that one expands the original process by incorporating its integral or other path-dependent characteristics, such as running maximum or minimum as part of the process; see Section 5. The augmentation is a very useful tool. In particular, in financial engineering it is used for handling large classes of path-dependent options; details can be found in Reference LiptonLipton (2001), chapter 13.

For an OU process, (6.34) can be written as follows:

\begin{matrix} L_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + (\begin{matrix} 0 & 0 \\ 1 & - κ (\overset{ˉ}{t}) \end{matrix}) L (t, \overset{ˉ}{t}) = 0, L (t, t) = (\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}), \end{matrix}

(6.99)

so that

\begin{matrix} L (t, \overset{ˉ}{t}) = (\begin{matrix} 1 & 0 \\ - {\overset{ˉ}{B}}_{- κ} (t, \overset{ˉ}{t}) & A_{- κ} (t, \overset{ˉ}{t}) \end{matrix}), L^{- 1} (t, \overset{ˉ}{t}) = (2 p t \begin{matrix} 1 & 0 \\ B_{κ} (t, \overset{ˉ}{t}) & A_{κ} (t, \overset{ˉ}{t}) \end{matrix}) . \end{matrix}

(6.100)

Now, one can compute $C^{- 1} (t, \overset{ˉ}{t}),$ $d^{(z)} (t, \overset{ˉ}{t}),$ and $ς (t, \overset{ˉ}{t})$ :

\begin{matrix} C^{- 1} (t, \overset{ˉ}{t}) = (\begin{matrix} ψ_{2} (t, \overset{ˉ}{t}) & - ψ_{1} (t, \overset{ˉ}{t}) \\ - ψ_{1} (t, \overset{ˉ}{t}) & ψ_{0} (t, \overset{ˉ}{t}) \end{matrix}), \end{matrix}

(6.101)

where

\begin{matrix} ψ_{0} (t, \overset{ˉ}{t}) & = \int_{t}^{\overset{ˉ}{t}} A_{- κ}^{2} (t, s) ε^{2} (s) d s, \\ ψ_{1} (t, \overset{ˉ}{t}) & = - \int_{t}^{\overset{ˉ}{t}} {\overset{ˉ}{B}}_{- κ} (t, s) A_{- κ} (t, s) ε^{2} (s) d s, \end{matrix}

(6.102)

\begin{matrix} ψ_{2} (t, \overset{ˉ}{t}) & = \int_{t}^{\overset{ˉ}{t}} {\overset{ˉ}{B}}_{- κ}^{2} (t, s) ε^{2} (s) d s . \\ d^{(z)} (t, \overset{ˉ}{t}) & = (\begin{matrix} d^{(x)} (t, \overset{ˉ}{t}) \\ d^{(y)} (t, \overset{ˉ}{t}) \end{matrix}) = (\begin{matrix} - \int_{t}^{\overset{ˉ}{t}} {\overset{ˉ}{B}}_{- κ} (t, s) χ (s) d s \\ \int_{t}^{\overset{ˉ}{t}} A_{- κ} (t, s) χ (s) d s, \end{matrix}), \end{matrix}

(6.103)

\begin{matrix} ς (t, \overset{ˉ}{t}) & = - η (t, \overset{ˉ}{t}) . \end{matrix}

(6.104)

Next, one can calculate the covariance matrix $H (t, \overset{ˉ}{t}),$ and mean vector $r (t, \overset{ˉ}{t})$ as follows:

\begin{matrix} H (t, \overset{ˉ}{t}) = {(L^{*} (t, \overset{ˉ}{t}))}^{- 1} C^{- 1} (t, \overset{ˉ}{t}) L^{- 1} (t, \overset{ˉ}{t}) \\ = (\begin{matrix} 1 & B_{κ} (t, \overset{ˉ}{t}) \\ 0 & A_{κ} (t, \overset{ˉ}{t}) \end{matrix}) (\begin{matrix} ψ_{2} (t, \overset{ˉ}{t}) & - ψ_{1} (t, \overset{ˉ}{t}) \\ - ψ_{1} (t, \overset{ˉ}{t}) & ψ_{0} (t, \overset{ˉ}{t}) \end{matrix}) (\begin{matrix} 1 & 0 \\ B_{κ} (t, \overset{ˉ}{t}) & A_{κ} (t, \overset{ˉ}{t}) \end{matrix}) \\ = (\begin{matrix} h_{0} (t, \overset{ˉ}{t}) & h_{1} (t, \overset{ˉ}{t}) \\ h_{1} (t, \overset{ˉ}{t}) & h_{2} (t, \overset{ˉ}{t}) \end{matrix}), \end{matrix}

(6.105)

\begin{matrix} r (t, \overset{ˉ}{t}) = {(L^{*} (t, \overset{ˉ}{t}))}^{- 1} (d^{(z)} (t, \overset{ˉ}{t}) + (\begin{matrix} x \\ y \end{matrix})) = (\begin{matrix} p (t, \overset{ˉ}{t}) \\ q (t, \overset{ˉ}{t}) \end{matrix}) . \end{matrix}

(6.106)

Here

\begin{matrix} h_{0} (t, \overset{ˉ}{t}) = ψ_{0} B_{κ}^{2} (t, \overset{ˉ}{t}) - 2 ψ_{1} B_{κ} (t, \overset{ˉ}{t}) + ψ_{2}, \\ h_{1} (t, \overset{ˉ}{t}) = (ψ_{0} B_{κ} (t, \overset{ˉ}{t}) - ψ_{1}) A_{κ} (t, \overset{ˉ}{t}), \\ h_{2} (t, \overset{ˉ}{t}) = ψ_{0} A_{κ}^{2} (t, \overset{ˉ}{t}), \end{matrix}

(6.107)

\begin{matrix} p (t, \overset{ˉ}{t}) = - \int_{t}^{\overset{ˉ}{t}} {\overset{ˉ}{B}}_{- κ} (t, s) χ (s) d s + x + B_{κ} (t, \overset{ˉ}{t}) (\int_{t}^{\overset{ˉ}{t}} A_{- κ} (t, s) χ (s) d s + y), \\ q (t, \overset{ˉ}{t}) = A_{κ} (t, \overset{ˉ}{t}) (\int_{t}^{\overset{ˉ}{t}} A_{- κ} (t, s) χ (s) d s + y) . \end{matrix}

(6.108)

Thus, $ϖ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y})$ is a bivariate Gaussian distribution of the form (6.26) with the covariance matrix $H,$ given by (6.105) centered at the point $r = {(p, q)}^{*}$ given by (6.106). Explicitly, one has

\begin{matrix} σ_{x}^{2} (t, \overset{ˉ}{t}) = h_{0} (t, \overset{ˉ}{t}), σ_{y}^{2} (t, \overset{ˉ}{t}) = h_{2} (t, \overset{ˉ}{t}), ρ (t, \overset{ˉ}{t}) = \frac{h_{1} (t, \overset{ˉ}{t})}{\sqrt{h_{0} (t, \overset{ˉ}{t}) h_{2} (t, \overset{ˉ}{t})}} . \end{matrix}

(6.109)

When $χ, κ, θ, ε$ are constant, the preceding formulas become significantly simpler. Namely,

\begin{matrix} L (T) = (\begin{matrix} 1 & 0 \\ - B_{- κ} (T) & A_{- κ} (T) \end{matrix}), L^{- 1} (T) = (\begin{matrix} 1 & 0 \\ B_{κ} (T) & A_{κ} (T) \end{matrix}), \end{matrix}

(6.110)

\begin{matrix} C^{- 1} (T) = (\begin{matrix} \frac{ε^{2}}{κ^{2}} (B_{0} (T) - 2 B_{- κ} (T) + B_{- 2 κ} (T)) & - \frac{ε^{2}}{2} B_{- κ} (T) \\ - \frac{ε^{2}}{2} B_{- κ} (T) & ε^{2} B_{- 2 κ} (T) \end{matrix}), \end{matrix}

(6.111)

\begin{matrix} d^{(z)} (T) = (\begin{matrix} (T - B_{- κ} (T)) θ \\ B_{- κ} (T) χ \end{matrix}), \end{matrix}

(6.112)

\begin{matrix} ς (T) = - κ T, \end{matrix}

(6.113)

\begin{matrix} H (T) = (\begin{matrix} \frac{ε^{2}}{κ^{2}} (B_{0} (T) - 2 B_{κ} (T) + B_{2 κ} (T)) & \frac{ε^{2}}{2} B_{κ} (T) \\ \frac{ε^{2}}{2} B_{κ} (T) & ε^{2} B_{2 κ} (T) \end{matrix}), \end{matrix}

(6.114)

\begin{matrix} r (T) = (\begin{matrix} x + θ T - B_{κ} (T) (θ - y) \\ θ - A_{κ} (T) (θ - y) \end{matrix}) . \end{matrix}

(6.115)

Thus, when coefficients are constant, $ϖ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y})$ is a bivariate Gaussian distribution of the form (6.26) with the covariance matrix $H,$ given by (6.114) and the mean vector $r = {(p, q)}^{*}$ given by (6.115).

Calculate the marginal distribution of $\overset{ˉ}{x},$ denoted by $ϖ^{(x)} (t, y, \overset{ˉ}{t}, \overset{ˉ}{x}),$ which is used on several occasions in what follows. It is well known that marginal distributions of a multivariate Gaussian distribution are also Gaussian, so that

\begin{matrix} ϖ^{(x)} (t, y, \overset{ˉ}{t}, \overset{ˉ}{x}) = \frac{1}{\sqrt{2 π h_{0} (t, \overset{ˉ}{t})}} exp (\frac{{(\overset{ˉ}{x} - p (t, \overset{ˉ}{t}))}^{2}}{2 h_{0} (t, \overset{ˉ}{t})}), \end{matrix}

(6.116)

where $h_{0}$ is given by the equations in (6.114). At the same time, the density of marginal distribution for $\overset{ˉ}{y}$ has the form

\begin{matrix} ϖ^{(y)} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) = \frac{1}{\sqrt{2 π h_{2} (t, \overset{ˉ}{t})}} exp (\frac{{(\overset{ˉ}{y} - q (t, \overset{ˉ}{t}))}^{2}}{2 h_{2} (t, \overset{ˉ}{t})}), \end{matrix}

(6.117)

where $h_{2}$ is given by the equations in (6.114), which is the familiar density of the OU process derived in the previous section.

6.5 Example: Diffusion of Free and Harmonically Bound Particles

The preceding results can be used to revisit the motion of free and harmonically bound particles considered in Section 3.

To describe a free particle, it is assumed that $χ = 0 .$ Equation (6.114) does not change, while (6.115) can be simplified as follows:

\begin{matrix} (\begin{matrix} p (T) \\ q (T) \end{matrix}) = (\begin{matrix} x + B_{κ} (T) y \\ A_{κ} (T) y \end{matrix}) . \end{matrix}

(6.118)

It is clear that Equations (4.7), (4.8) and (6.114), (6.118) are in agreement. A typical free particle behavior is illustrated in Figure 7.

Figure 7 A thousand trajectories of a typical free particle. Parameters are as follows: $T = 5,$ $d t = 0.01,$ $κ = 0.8,$ $σ = 1.0 .$ (a) $x (t),$ (b) $y (t),$ (c) $(\overset{ˉ}{x} (T), \overset{ˉ}{y} (T)),$ (d) contour lines of $ϖ (0, 0, 0, T, \tilde{x}, \tilde{y}) .$ Author’s graphics.

Analysis of a harmonically bound particle requires additional efforts. In the case in question, (6.34) can be written as follows:

\begin{matrix} L^{'} (t, \overset{ˉ}{t}) + (\begin{matrix} 0 & - ω^{2} \\ 1 & - κ \end{matrix}) L (t, \overset{ˉ}{t}) = 0, L (t, t) = (\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}) . \end{matrix}

(6.119)

The corresponding characteristic equation and its solutions are as follows:

\begin{matrix} λ^{2} - κ λ + ω^{2} = 0, \end{matrix}

(6.120)

\begin{matrix} λ_{\pm} = μ \pm ζ, \\ μ = \frac{κ}{2}, ζ = \frac{\sqrt{κ^{2} - 4 ω^{2}}}{2} . \end{matrix}

(6.121)

Introduce

\begin{matrix} E_{0} (T) = e^{μ T} = e^{κ T / 2}, E_{\pm} (T) = e^{\pm ζ T} . \end{matrix}

(6.122)

It is left to the reader to check that

\begin{matrix} L & = \frac{E_{0}}{\sqrt{κ^{2} - 4 ω^{2}}} (\begin{matrix} - (λ_{-} E_{+} - λ_{+} E_{-}) & ω^{2} (E_{+} - E_{-}) \\ - (E_{+} - E_{-}) & (λ_{+} E_{+} - λ_{-} E_{-}) \end{matrix}), \end{matrix}

(6.123)

\begin{matrix} L^{- 1} & = \frac{E_{0}^{- 1}}{\sqrt{κ^{2} - 4 ω^{2}}} (\begin{matrix} (λ_{+} E_{+} - λ_{-} E_{-}) & - ω^{2} (E_{+} - E_{-}) \\ (E_{+} - E_{-}) & - (λ_{-} E_{+} - λ_{+} E_{-}) \end{matrix}), \end{matrix}

(6.124)

\begin{matrix} det L & = {(det L^{- 1})}^{- 1} = E_{0}^{2} = e^{κ T}, \end{matrix}

(6.125)

Accordingly,

\begin{matrix} ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = N (r (t, \overset{ˉ}{t}), H (t, \overset{ˉ}{t})), \end{matrix}

(6.126)

with

\begin{matrix} H & = {(L^{*})}^{- 1} C^{- 1} L^{- 1}, \\ r & = {(L^{*})}^{- 1} z . \end{matrix}

(6.127)

Here,

\begin{matrix} C^{- 1} = (\begin{matrix} ψ_{2} & - ψ_{1} \\ - ψ_{1} & ψ_{0} \end{matrix}), \end{matrix}

(6.128)

where

\begin{matrix} ψ_{0} & = \frac{ε^{2}}{(κ^{2} - 4 ω^{2})} \int_{t}^{\overset{ˉ}{t}} {(λ_{+} e^{λ_{+} (s - t)} - λ_{-} e^{λ_{-} (s - t)})}^{2} d s \\ = \frac{ε^{2}}{2 κ (κ^{2} - 4 ω^{2})} (E_{0}^{2} (κ λ_{+} E_{+}^{2} - 4 ω^{2} + κ λ_{-} E_{-}^{2}) - (κ^{2} - 4 ω^{2})), \\ ψ_{1} & = \frac{ε^{2}}{(κ^{2} - 4 ω^{2})} \int_{t}^{\overset{ˉ}{t}} (e^{λ_{+} (s - t)} - e^{λ_{-} (s - t)}) (λ_{+} e^{λ_{+} (s - t)} - λ_{-} e^{λ_{-} (s - t)}) d s \\ = \frac{ε^{2}}{2 (κ^{2} - 4 ω^{2})} E_{0}^{2} {(E_{+} - E_{-})}^{2}, \\ ψ_{2} & = \frac{ε^{2}}{(κ^{2} - 4 ω^{2})} \int_{t}^{\overset{ˉ}{t}} {(e^{λ_{+} (s - t)} - e^{λ_{-} (s - t)})}^{2} d s \\ = \frac{ε^{2}}{2 κ ω^{2} (κ^{2} - 4 ω^{2})} (E_{0}^{2} (κ λ_{-} E_{+}^{2} - 4 ω^{2} + κ λ_{+} E_{-}^{2}) - (κ^{2} - 4 ω^{2})) . \end{matrix}

(6.129)

Further,

\begin{matrix} H = & \frac{E_{0}^{- 2}}{(κ^{2} - 4 ω^{2})} (\begin{matrix} (λ_{+} E_{+} - λ_{-} E_{-}) & (E_{+} - E_{-}) \\ - ω^{2} (E_{+} - E_{-}) & - (λ_{-} E_{+} - λ_{+} E_{-}) \end{matrix}) \\ \times (\begin{matrix} ψ_{2} & - ψ_{1} \\ - ψ_{1} & ψ_{0} \end{matrix}) (\begin{matrix} (λ_{+} E_{+} - λ_{-} E_{-}) & - ω^{2} (E_{+} - E_{-}) \\ (E_{+} - E_{-}) & - (λ_{-} E_{+} - λ_{+} E_{-}) \end{matrix}) . \end{matrix}

(6.130)

Straightforward but tedious calculation yields

\begin{matrix} h_{0} & = \frac{ε^{2}}{2 κ ω^{2}} (1 - \frac{E_{0}^{- 2} (ω^{2} {(E_{+} - E_{-})}^{2} + {(λ_{+} E_{+} - λ_{-} E_{-})}^{2})}{(κ^{2} - 4 ω^{2})}), \\ h_{1} & = \frac{ε^{2}}{2} \frac{E_{0}^{- 2} {(E_{+} - E_{-})}^{2}}{(κ^{2} - 4 ω^{2})}, \\ h_{2} & = \frac{ε^{2}}{2 κ} (1 - \frac{E_{0}^{- 2} (ω^{2} {(E_{+} - E_{-})}^{2} + {(λ_{-} E_{+} - λ_{+} E_{-})}^{2})}{(κ^{2} - 4 ω^{2})}) . \end{matrix}

(6.131)

In the limit $ω^{2} \to 0,$

\begin{matrix} h_{0} = \frac{ε^{2}}{κ^{2}} (B_{0} - 2 B_{κ} + B_{2 κ}), h_{1} = \frac{ε^{2}}{2} B_{κ}^{2}, h_{2} = ε^{2} B_{2 κ}, \end{matrix}

(6.132)

so that Equations (6.114) and (6.132) are in agreement.

Here

\begin{matrix} r = (\begin{matrix} p \\ q \end{matrix}) = (\begin{matrix} \frac{E_{0}^{- 1} ((λ_{+} E_{+} - λ_{-} E_{-}) x + (E_{+} - E_{-}) y)}{\sqrt{κ^{2} - 4 ω^{2}}} \\ - \frac{E_{0}^{- 1} (ω^{2} (E_{+} - E_{-}) x + (λ_{-} E_{+} - λ_{+} E_{-}) y)}{\sqrt{κ^{2} - 4 ω^{2}}} \end{matrix}) . \end{matrix}

(6.133)

In the limit $ω^{2} \to 0,$

\begin{matrix} r = (\begin{matrix} p \\ q \end{matrix}) = (\begin{matrix} x + B_{κ} (T) y \\ A_{κ} (T) y \end{matrix}) . \end{matrix}

(6.134)

Moreover, while it is easy to show that Chandrasekhar’s solution given in Reference ChandrasekharChandresekhar (1943) is in agreement with the solution given by (6.126), the solution is more convenient from a practical standpoint, since it is explicitly written as a Gaussian density in the $(\overset{ˉ}{x}, \overset{ˉ}{y})$ space. A typical bounded particle behavior is shown in Figure 8.

Figure 8 A thousand trajectories of a harmonically bounded particle. Parameters are as follows: $T = 5,$ $d t = 0.01,$ $κ = 0.2,$ $ω = 0.5,$ $σ = 0.5 .$ (a) $x (t),$ (b) $y (t),$ (c) $(\overset{ˉ}{x} (T), \overset{ˉ}{y} (T)),$ (d) contour lines of $ϖ (0, 0, 0, T, \tilde{x}, \tilde{y}) .$ Author’s graphics.

6.6 Example: Vorticity of Two-Dimensional Flows

Briefly return to the starting point and consider strictly two-dimensional flows; see Reference Friedlander and Lipton-LifschitzFriedlander and Lipton-Lifschitz (2003). Velocity fields of such flows have the following form:

\begin{matrix} V (\overset{ˉ}{t}, {\overset{ˉ}{x}}_{1}, {\overset{ˉ}{x}}_{2}) & = (V_{1} (\overset{ˉ}{t}, {\overset{ˉ}{x}}_{1}, {\overset{ˉ}{x}}_{2}), V_{2} (\overset{ˉ}{t}, {\overset{ˉ}{x}}_{1}, {\overset{ˉ}{x}}_{2})), \\ v (\overset{ˉ}{t}, {\overset{ˉ}{x}}_{1}, {\overset{ˉ}{x}}_{2}) & = (v_{1} (\overset{ˉ}{t}, {\overset{ˉ}{x}}_{1}, {\overset{ˉ}{x}}_{2}), v_{2} (\overset{ˉ}{t}, {\overset{ˉ}{x}}_{1}, {\overset{ˉ}{x}}_{2})) . \end{matrix}

(6.135)

By virtue of incompressibility, one can introduce the so-called stream functions such that

\begin{matrix} V_{1} = - \frac{\partial Ψ}{\partial {\overset{ˉ}{x}}_{2}}, V_{2} = \frac{\partial Ψ}{\partial {\overset{ˉ}{x}}_{1}}, v_{1} = - \frac{\partial ψ}{\partial {\overset{ˉ}{x}}_{2}}, v_{2} = \frac{\partial ψ}{\partial {\overset{ˉ}{x}}_{1}}, \end{matrix}

(6.136)

and define the scalar vorticity as follows:

\begin{matrix} Ω = Δ Ψ, ω = Δ ψ . \end{matrix}

(6.137)

Contour lines of $Ψ$ are called streamlines of the flow.

By using the preceding definitions, the two-dimensional Navier–Stokes equations can be written as equations for the stream and vorticity:

\begin{matrix} \frac{\partial Ω}{\partial \overset{ˉ}{t}} - \frac{\partial Ψ}{\partial {\overset{ˉ}{x}}_{2}} \frac{\partial Ω}{\partial {\overset{ˉ}{x}}_{1}} + \frac{\partial Ψ}{\partial {\overset{ˉ}{x}}_{1}} \frac{\partial Ω}{\partial {\overset{ˉ}{x}}_{2}} - ν Δ Ω = 0, \\ Δ Ψ - Ω = 0. \end{matrix}

(6.138)

Time-independent quadratic stream functions $Ψ ({\overset{ˉ}{x}}_{1}, {\overset{ˉ}{x}}_{2})$ generate exact equilibrium solutions of the equations in (6.138). Consider fields consisting of pure strain and pure rotation. The corresponding $Ψ$ have the following form:

\begin{matrix} Ψ ({\overset{ˉ}{x}}_{1}, {\overset{ˉ}{x}}_{2}) = \frac{1}{4} (w ({\overset{ˉ}{x}}_{1}^{2} + {\overset{ˉ}{x}}_{2}^{2}) - 2 s {\overset{ˉ}{x}}_{1} {\overset{ˉ}{x}}_{2}), \end{matrix}

(6.139)

where $w > s,$ to ensure that streamlines are elliptic rather than hyperbolic, so that

\begin{matrix} V_{1} = - \frac{\partial Ψ}{\partial {\overset{ˉ}{x}}_{2}} = \frac{1}{2} (s {\overset{ˉ}{x}}_{1} - w {\overset{ˉ}{x}}_{2}), V_{2} = \frac{\partial Ψ}{\partial {\overset{ˉ}{x}}_{1}} = \frac{1}{2} (w {\overset{ˉ}{x}}_{1} - s {\overset{ˉ}{x}}_{2}) . \end{matrix}

(6.140)

Recall that these flows were introduced in Section 2, Equation (2.7).

Small perturbations $ψ$ of the time-independent quadratic stream function $Ψ$ satisfy the following equations:

\begin{matrix} \frac{\partial ω}{\partial \overset{ˉ}{t}} - \frac{\partial Ψ}{\partial {\overset{ˉ}{x}}_{2}} \frac{\partial ω}{\partial {\overset{ˉ}{x}}_{1}} + \frac{\partial Ψ}{\partial {\overset{ˉ}{x}}_{1}} \frac{\partial ω}{\partial {\overset{ˉ}{x}}_{2}} - ν Δ ω = 0, \\ Δ ψ - ω = 0. \end{matrix}

(6.141)

It is helpful to study the first equation (6.141) in isolation, by writing it explicitly as follows:

\begin{matrix} \frac{\partial ω}{\partial \overset{ˉ}{t}} + \frac{1}{2} (s {\overset{ˉ}{x}}_{1} - w {\overset{ˉ}{x}}_{2}) \frac{\partial ω}{\partial {\overset{ˉ}{x}}_{1}} + \frac{1}{2} (w {\overset{ˉ}{x}}_{1} - s {\overset{ˉ}{x}}_{2}) \frac{\partial ω}{\partial {\overset{ˉ}{x}}_{2}} - ν Δ ω = 0, \end{matrix}

(6.142)

and supplying it with the initial condition at time $t$ :

\begin{matrix} ω (t, {\overset{ˉ}{x}}_{1}, {\overset{ˉ}{x}}_{2}) = δ ({\overset{ˉ}{x}}_{1} - x_{1}) δ ({\overset{ˉ}{x}}_{2} - x_{2}) . \end{matrix}

(6.143)

Once the solution of Equations (6.142) and (6.143) is found, one can find $ψ$ by solving the corresponding Laplace equation.

Surprisingly, this equation is identical to the Fokker–Planck equation associated with the following SDEs for ${\hat{z}}_{t} = ({\hat{x}}_{1 t}, {\hat{x}}_{2 t})$ :

\begin{matrix} d {\hat{z}}_{t} = B {\hat{z}}_{t} d t + Σ d {\hat{W}}_{t}, {\hat{z}}_{t} = (\begin{matrix} x_{1} \\ x_{2} \end{matrix}), \end{matrix}

(6.144)

where

\begin{matrix} B = \frac{1}{2} (\begin{matrix} s & - w \\ w & - s \end{matrix}), Σ = \sqrt{2 ν} (\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}) . \end{matrix}

(6.145)

Thus, one can use Section 6.1 results. Equation (6.34) becomes

\begin{matrix} L^{'} (t, \overset{ˉ}{t}) + \frac{1}{2} (\begin{matrix} s & w \\ - w & - s \end{matrix}) L (t, \overset{ˉ}{t}) = 0, L (t, t) = (\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}) . \end{matrix}

(6.146)

The corresponding characteristic equation has the following form:

\begin{matrix} λ^{2} + \frac{1}{4} (w^{2} - s^{2}) = 0. \end{matrix}

(6.147)

Its solutions are

\begin{matrix} λ_{\pm} = \pm ζ, ζ = \frac{i \sqrt{w^{2} - s^{2}}}{2} . \end{matrix}

(6.148)

Simple but tedious calculations, omitted for the sake of brevity, show that

\begin{matrix} L & = (\begin{matrix} c_{1} - \frac{s}{2 (ζ)} s_{1} & - \frac{w}{2 (ζ)} s_{1} \\ \frac{w}{2 (ζ)} s_{1} & c_{1} + \frac{s}{2 (ζ)} s_{1} \end{matrix}), det (L) = 1 \\ L^{- 1} & = (\begin{matrix} c_{1} + \frac{s}{2 (ζ)} s_{1} & \frac{w}{2 (ζ)} s_{1} \\ - \frac{w}{2 (ζ)} s_{1} & c_{1} - \frac{s}{2 (ζ)} s_{1} \end{matrix}), det (L^{- 1}) = 1, \end{matrix}

(6.149)

where

\begin{matrix} c_{1} (t, \overset{ˉ}{t}) = cos ((ζ) T), s_{1} (t, \overset{ˉ}{t}) = sin ((ζ) T) . \end{matrix}

(6.150)

Next, (6.39) yields

\begin{matrix} C^{- 1} = 2 ν \int_{t}^{\overset{ˉ}{t}} L^{*} (t, s) L (t, s) d s = (\begin{matrix} ψ_{2} & - ψ_{1} \\ - ψ_{1} & ψ_{0} \end{matrix}), \end{matrix}

(6.151)

where

\begin{matrix} ψ_{0} & = 2 ν \int_{t}^{\overset{ˉ}{t}} (1 + \frac{s}{2 (ζ)} s_{2} (t, s) + \frac{s^{2}}{4 {(ζ)}^{2}} (1 - c_{2} (t, s))) d s \\ = 2 ν ((1 + \frac{s^{2}}{4 {(ζ)}^{2}}) T - \frac{s}{4 {(ζ)}^{2}} c_{2} - \frac{s^{2}}{8 {(ζ)}^{3}} s_{2}), \\ ψ_{1} & = - \frac{ν s w}{2 {(ζ)}^{2}} \int_{t}^{\overset{ˉ}{t}} (1 - c_{2} (t, s)) d s = - \frac{ν s w}{2 {(ζ)}^{2}} (T - \frac{1}{2 (ζ)} s_{2}), \\ ψ_{2} & = 2 ν \int_{t}^{\overset{ˉ}{t}} (1 - \frac{s}{2 (ζ)} s_{2} (t, s) + \frac{s^{2}}{4 {(ζ)}^{2}} (1 - c_{2} (t, s))) d s \\ = 2 ν ((1 + \frac{s^{2}}{4 {(ζ)}^{2}}) T + \frac{s}{4 {(ζ)}^{2}} c_{2} - \frac{s^{2}}{8 {(ζ)}^{3}} s_{2}), \end{matrix}

(6.152)

and

\begin{matrix} c_{2} (t, \overset{ˉ}{t}) = cos (2 (ζ) T), s_{2} (t, \overset{ˉ}{t}) = sin (2 (ζ) T) . \end{matrix}

(6.153)

Finally, Equations (6.26) and (6.27) yield:

\begin{matrix} ω (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = N (r (t, \overset{ˉ}{t}), H (t, \overset{ˉ}{t})) . \end{matrix}

(6.154)

The corresponding covariance matrix $H$ and mean $r$ are as follows:

\begin{matrix} H = (\begin{matrix} h_{0} & h_{1} \\ h_{1} & h_{2} \end{matrix}), \end{matrix}

(6.155)

where

\begin{matrix} h_{0} = & (\frac{w^{2}}{8 {(ζ)}^{2}} + \frac{(4 {(ζ)}^{2} - s^{2})}{8 {(ζ)}^{2}} c_{2} + \frac{s}{2 (ζ)} s_{2}) ψ_{2} \\ + \frac{w}{2 (ζ)} (\frac{s}{2 (ζ)} (1 - c_{2}) + s_{2}) ψ_{1} + \frac{w^{2}}{8 {(ζ)}^{2}} (1 - c_{2}) ψ_{0}, \\ h_{1} = & \frac{w}{4 (ζ)} (\frac{s}{2 (ζ)} (1 - c_{2}) + s_{2}) ψ_{2} \\ - (1 + \frac{w^{2}}{4 {(ζ)}^{2}} (1 - c_{2})) ψ_{1} + \frac{w}{4 (ζ)} (\frac{s}{2 (ζ)} (1 - c_{2}) - s_{2}) ψ_{0}, \\ h_{2} = & \frac{w^{2}}{8 {(ζ)}^{2}} (1 - c_{2}) ψ_{2} + \frac{w}{2 (ζ)} (\frac{s}{2 (ζ)} (1 - c_{2}) - s_{2}) ψ_{1} \\ + (\frac{w^{2}}{8 {(ζ)}^{2}} + \frac{(4 {(ζ)}^{2} - s^{2})}{8 {(ζ)}^{2}} c_{2} - \frac{s}{2 (ζ)} s_{2}) ψ_{0}, \end{matrix}

(6.156)

and

\begin{matrix} r = (\begin{matrix} r_{1} \\ r_{2} \end{matrix}) = (\begin{matrix} (c_{1} + \frac{s}{2 (ζ)} s_{1}) x_{1} - \frac{w}{2 (ζ)} s_{1} x_{2} \\ \frac{w}{2 (ζ)} s_{1} x_{1} + (c_{1} - \frac{s}{2 (ζ)} s_{1}) x_{2} \end{matrix}) . \end{matrix}

(6.157)

The equations in (6.156) are symmetric, namely $h_{0} \to h_{2}$ when $(a, b) \to (- a, - b)$ and $(ψ_{0}, ψ_{2}) \to (ψ_{2}, ψ_{0}) .$ The second of the equations in (6.138), which is a static Poisson equation, allows us to find $ψ,$ since $ω$ is known. Its analytical solution is not easy to derive and is not presented here due to lack of space. However, the special case of purely rotational flow, $s = 0,$ can be done easily; see (6.165).

It is interesting to note that

\begin{matrix} Ψ (r_{1}, r_{2}) = Ψ (x_{1}, x_{2}), \end{matrix}

(6.158)

so that the location of the Gaussian distribution $ω$ moves along streamlines of the flow defined by the stream function $Ψ .$

When the flow is purely rotational, so that $s = 0,$ the preceding formulas considerably simplify. Specifically, one has the following:

\begin{matrix} ψ_{0} & = 2 ν T, ψ_{1} = 0, ψ_{2} = 2 ν T, \\ h_{0} & = 2 ν T, ψ_{1} = 0, h_{2} = 2 ν T, \\ r_{1} & = c_{1} x_{1} - s_{1} x_{2}, r_{2} = s_{1} x_{1} + c_{1} x_{2}, \end{matrix}

(6.159)

so that

\begin{matrix} ω (t, x_{1}, x_{2}, \overset{ˉ}{t}, {\overset{ˉ}{x}}_{1}, {\overset{ˉ}{x}}_{2}) \\ = \frac{1}{4 π ν T} exp (- \frac{{({\overset{ˉ}{x}}_{1} - c_{1} x_{1} + s_{1} x_{2})}^{2} + {({\overset{ˉ}{x}}_{2} - s_{1} x_{1} - c_{1} x_{2})}^{2}}{4 ν T}) . \end{matrix}

(6.160)

The stream function $ψ$ can be calculated directly by solving the corresponding Poisson equation.Footnote ⁶ To start, notice that both $ω$ and $ψ$ are rotational symmetric around the point $(x_{1}, x_{2}) .$ Thus, $ω$ and $ψ$ have the following form:

\begin{matrix} ω = ω (R) = \frac{1}{4 π ν T} exp (- \frac{R^{2}}{2}), ψ = ψ (R), \end{matrix}

(6.161)

where

\begin{matrix} R^{2} = \frac{{({\overset{ˉ}{x}}_{1} - c_{1} x_{1} + s_{1} x_{2})}^{2} + {({\overset{ˉ}{x}}_{2} - s_{1} x_{1} - c_{1} x_{2})}^{2}}{2 ν T} . \end{matrix}

(6.162)

Then $ψ (R)$ solves a radially symmetric Poisson equation of the following form:

\begin{matrix} \frac{1}{R} {(R ψ_{R} (R))}_{R} = \frac{1}{2 π} exp (- \frac{R^{2}}{2}) . \end{matrix}

(6.163)

Thus,

\begin{matrix} R ψ_{R} (R) = - \frac{1}{2 π} exp (- \frac{R^{2}}{2}) + C, \end{matrix}

(6.164)

where $C$ is an arbitrary constant. Next,

\begin{matrix} ψ (R) = \frac{1}{2 π} (ln (R) + \frac{1}{2} E_{1} (\frac{R^{2}}{2})), \end{matrix}

(6.165)

where the choice of $C$ guarantees that $ψ$ has the right behavior when $R \to 0$ and $R \to \infty .$ Here $E_{1} (η)$ is the exponential integral of the following form:

\begin{matrix} E_{1} (η) = \int_{η}^{\infty} \frac{e^{- η^{'}}}{η^{'}} d η^{'} . \end{matrix}

(6.166)

7 Non-Gaussian Stochastic Processes

7.1 Regular Non-Gaussian Processes

In many situations, it is useful to consider processes governed by more general SDEs of the following form:

\begin{matrix} d {\hat{z}}_{t} = & (b (t) + B (t) {\hat{z}}_{t}) d t \\ + Σ (t) {(d i a g (d^{(0)} (t) + D (t) {\hat{z}}_{t}))}^{1 / 2} d {\hat{W}}_{t}^{(z)}, \\ {\hat{z}}_{t} = & z . \end{matrix}

(7.1)

Here, in addition to the functions $b (t),$ $B (t)$ introduced in the previous section, define an $(M \times 1)$ column vector $d^{(0)},$ and an $(M \times M)$ matrix $D .$ It is convenient to introduce auxiliary vectors $d^{(i)}$ equal to the $i$ th column of $D .$

Since the corresponding $(M \times M)$ covariance matrix $A$ has the form:

\begin{matrix} A = \frac{1}{2} Σ {(d i a g (d^{(0)} (t) + D (t) z_{t}))}^{1 / 2} {(Σ {(d i a g (d^{(0)} (t) + D (t) z_{t}))}^{1 / 2})}^{*}, \end{matrix}

(7.2)

it linearly depends on $z$ :

\begin{matrix} A = \frac{1}{2} Σ (d i a g (d^{(0)} + D z)) Σ^{*} = \frac{1}{2} A^{(0)} + \frac{1}{2} A^{(m)} z_{m}, \end{matrix}

(7.3)

where

\begin{matrix} A^{(0)} = \frac{1}{2} Σ d i a g (d^{(0)}) Σ^{*}, A^{(i)} = \frac{1}{2} Σ d i a g (d^{(i)}) Σ^{*} . \end{matrix}

(7.4)

In contrast to the Gaussian case, the equations in (7.2) have to be defined in the domain $D$ such that

\begin{matrix} D = ((z) d^{(0)} + D z \geq 0), \end{matrix}

(7.5)

rather than in the whole space. In financial engineering, covariance matrices of the form (7.2) were introduced by Reference Dai and SingletonDai and Singleton (2000), and discussed by Reference Duffie, Filipovic and SchachermayerDuffie et al. (2003), Reference FilipovicFilipovic (2009), and many others.

The corresponding Fokker–Plank problem has the following form:

\begin{matrix} ϖ_{\overset{ˉ}{t}} (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) - \sum \sum (A^{0} + {\overset{ˉ}{z}}_{m} A^{(m)}) ϖ_{\overset{ˉ}{z} \overset{ˉ}{z}} (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) \\ + (\hat{b} + B \overset{ˉ}{z}) \cdot ϖ_{\overset{ˉ}{z}} (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) + b ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = 0, \\ ϖ (t, z, t, \overset{ˉ}{z}) = δ (\overset{ˉ}{z} - z) ​, \end{matrix}

(7.6)

where

\begin{matrix} {\hat{b}}_{m} & = b_{m} - (2 a_{m m}^{(m)} + a_{m m^{'}}^{(m^{'})} + a_{m^{'} m}^{(m^{'})}), (no summation over m), \\ b & = T r (B) . \end{matrix}

(7.7)

Equation (6.11) expressing $ϖ$ in terms of $K$ holds. The equations for $α, δ$ have the following form:

\begin{matrix} α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + i δ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) \cdot \overset{ˉ}{z} + δ (t, \overset{ˉ}{t}) \cdot A δ (t, \overset{ˉ}{t}) + i δ (t, \overset{ˉ}{t}) \cdot (\hat{b} + B \overset{ˉ}{z}) + b = 0, \end{matrix}

(7.8)

or, more explicitly,

\begin{matrix} α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + & i δ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) \cdot \overset{ˉ}{z} + δ (t, \overset{ˉ}{t}) \cdot A^{(0)} δ (t, \overset{ˉ}{t}) + δ (t, \overset{ˉ}{t}) \cdot A^{(k)} δ (t, \overset{ˉ}{t}) {\overset{ˉ}{z}}_{k} \\ + i δ (t, \overset{ˉ}{t}) \cdot (\hat{b} + B \overset{ˉ}{z}) + b = 0. \end{matrix}

(7.9)

Thus, the system of ODEs for $α, δ$ can be written as follows:

\begin{matrix} α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + δ (t, \overset{ˉ}{t}) \cdot A^{(0)} (t, \overset{ˉ}{t}) δ (t, \overset{ˉ}{t}) + i δ (t, \overset{ˉ}{t}) \cdot \hat{b} (t, \overset{ˉ}{t}) + b (t, \overset{ˉ}{t}) = 0, α (t, t) = 0, \\ i δ_{i}^{'} (t, \overset{ˉ}{t}) + δ (t, \overset{ˉ}{t}) \cdot A^{(i)} δ (t, \overset{ˉ}{t}) + i B_{i j} δ_{j} (t, \overset{ˉ}{t}) = 0, δ_{i} (t, t) = m_{i} . \end{matrix}

(7.10)

In the case in question, the equation for $δ$ is no longer linear. Instead, $δ$ satisfies the so-called matrix Riccati equation. Such equations are important for several applications, such as optimal control. Solving a matrix Riccati equation is quite hard, so it is more an art than a science; some of the results in this direction are reported here. However, in the one-dimensional case, the corresponding Riccati equation can be converted into the second-order ODE, and then solved explicitly when the coefficients $A,$ $b,$ $b$ are time-independent.

In case of an augmented process, one must consider an SDE of the following form:

\begin{matrix} d {\hat{x}}_{t} = & (b^{(x)} (t) + B^{(x x)} (t) {\hat{x}}_{t} + B^{(x y)} (t) {\hat{y}}_{t}) d t, \\ d {\hat{y}}_{t} = & (b^{(y)} (t) + B^{(y x)} (t) {\hat{x}}_{t} + B^{(y y)} (t) {\hat{y}}_{t}) d t \\ + Σ^{(y y)} (t) {(d i a g (d^{(0)} (t) + D (t) {\hat{z}}_{t}))}^{1 / 2} d {\hat{W}}_{t}^{(y)}, \\ {\hat{x}}_{t} = & {x, \hat{y}}_{t} = y, \end{matrix}

(7.11)

or, more compactly,

\begin{matrix} d {\hat{z}}_{t} & = (b (t) + B (t) {\hat{z}}_{t}) d t + (\begin{matrix} 0 \\ Σ^{(y y)} (t) {(d i a g (d^{(0)} (t) + D (t) {\hat{z}}_{t}))}^{1 / 2} d {\hat{W}}_{t}^{(y)} \end{matrix}), \\ {\hat{z}}_{t} & = z = (\begin{matrix} x \\ y \end{matrix}) . \end{matrix}

(7.12)

Here $b^{(x)},$ $b^{(y)},$ $b = {(b^{(x)}, b^{(y)})}^{*},$ $d^{(0)}$ are column vectors, and $B^{(x x)},$ $B^{(x y)},$ $B^{(y x)},$ $B^{(y y)},$ $B,$ and $D$ are matrices of appropriate dimensions.

The equations for $α, δ = (β, γ)$ have the following form:

\begin{matrix} α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + i δ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) \cdot \overset{ˉ}{z} + γ (t, \overset{ˉ}{t}) \cdot A γ (t, \overset{ˉ}{t}) + i δ (t, \overset{ˉ}{t}) \cdot (\hat{b} + B \overset{ˉ}{z}) + b = 0, \end{matrix}

(7.13)

or, more explicitly,

\begin{matrix} α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + & i δ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) \cdot \overset{ˉ}{z} + γ (t, \overset{ˉ}{t}) \cdot A^{(0)} γ (t, \overset{ˉ}{t}) + γ (t, \overset{ˉ}{t}) \cdot A^{(k)} γ (t, \overset{ˉ}{t}) {\overset{ˉ}{z}}_{k} \\ + i δ (t, \overset{ˉ}{t}) \cdot (b^{(z)} + B^{(z z)} \overset{ˉ}{z}) + b = 0. \end{matrix}

(7.14)

Thus, the system of ODEs for $α, δ$ can be written as follows:

\begin{matrix} α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + γ (t, \overset{ˉ}{t}) \cdot A^{(0)} (t, \overset{ˉ}{t}) γ (t, \overset{ˉ}{t}) + i δ (t, \overset{ˉ}{t}) \cdot {\hat{b}}^{(z)} (t, \overset{ˉ}{t}) + b (t, \overset{ˉ}{t}) = 0, \\ α (t, t) = 0, \\ i δ_{i, \overset{ˉ}{t}} (t, \overset{ˉ}{t}) + γ (t, \overset{ˉ}{t}) \cdot A^{(i)} γ (t, \overset{ˉ}{t}) + i B_{i j}^{(z z)} δ_{j} (t, \overset{ˉ}{t}) = 0, δ_{i} (t, t) = m_{i} . \end{matrix}

(7.15)

7.2 Killed Non-Gaussian Processes

The non-Gaussian governing SDE has the following form:

\begin{matrix} d {\hat{z}}_{t} = (b + B {\hat{z}}_{t}) d t + Σ {(d i a g (d^{(0)} + D {\hat{z}}_{t}))}^{1 / 2} d {\hat{W}}_{t}, \end{matrix}

(7.16)

where ${\hat{z}}_{t},$ $b,$ $d^{(0)}$ are $(M \times 1)$ vectors, and $Σ,$ $B,$ $D$ are the $(M \times M)$ matrices defined previously. As before, the correlation matrix $Σ$ can be a full-rank (nondegenerate) matrix. Once again, it is assumed that the process is killed with intensity $\overset{ˉ}{c}$ linearly depending on $z,$ namely,

\begin{matrix} \overset{ˉ}{c} = c + c \cdot z, \end{matrix}

(7.17)

where $c$ is a scalar, and $c^{(z)}$ is an $(M \times 1)$ column vector.

The corresponding Fokker–Plank problem has the following form:

\begin{matrix} ϖ_{\overset{ˉ}{t}} (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) - \sum \sum (A^{0} + {\overset{ˉ}{z}}_{i} A^{i}) ϖ_{\overset{ˉ}{z} \overset{ˉ}{z}} (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) \\ + (\hat{b} + B \overset{ˉ}{z}) \cdot ϖ_{\overset{ˉ}{z}} (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) + (b + c + c \cdot \overset{ˉ}{z}) ϖ (t, z, \overset{ˉ}{t}, \overset{ˉ}{z}) = 0, \\ ϖ (t, z, t, \overset{ˉ}{z}) = δ (\overset{ˉ}{z} - z) ​ . \end{matrix}

(7.18)

The equations for $α, δ$ generalize the equations in (7.10). They can be written in the following form:

\begin{matrix} α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + δ (t, \overset{ˉ}{t}) \cdot A^{(0)} (t, \overset{ˉ}{t}) δ (t, \overset{ˉ}{t}) + i δ (t, \overset{ˉ}{t}) \cdot b (t, \overset{ˉ}{t}) + b (t, \overset{ˉ}{t}) + c (t, \overset{ˉ}{t}) = 0, \\ α (t, t) = 0, \\ i δ_{i}^{'} (t, \overset{ˉ}{t}) + δ (t, \overset{ˉ}{t}) \cdot A^{(i)} δ (t, \overset{ˉ}{t}) + i B_{i j} δ_{j} (t, \overset{ˉ}{t}) + c_{i} = 0, δ_{i} (t, t) = m_{i} . \end{matrix}

(7.19)

As in the case without killing, finding an analytical solution to a multidimensional Riccati equation is generally impossible. However, in the time-independent one-dimensional case, it can be done. Solution becomes particularly simple in the special case when $A^{(0)} = 0 .$ The most important case is the killed one-dimensional Feller process, used, for example, to price bonds in the Cox–IngersolI–Ross (CIR) model; see Section 8.

7.3 Example: Anomalous Kolmogorov Process

Anomalous diffusion is a phenomenon in which the random motion of particles or molecules deviates from the classical Brownian motion and, as a result, exhibits non-Gaussian probability distributions, such as power-law or exponential tails. One can distinguish between subdiffusions (slower spreading) and superdiffusions (faster spreading). Anomalous diffusion often involves long-range correlations in particle motion, meaning that the movement of a particle at a one-time step depends on its previous positions over longer time scales. Anomalous diffusion frequently displays scale-invariant properties, meaning that the statistical properties of motion remain the same across different time or spatial scales. Anomalous diffusion has applications in physics, chemistry, financial engineering, biology, and geophysics.

Fractional Brownian motion (fBm) is used to model anomalous diffusion because it possesses several relevant characteristics. In particular, it exhibits long memory, which means that the process’s future values are influenced by its past values over long time scales. Additionally, fBm can produce non-Gaussian behavior while preserving scale-invariance. By adjusting the Hurst exponent and other parameters, fBm can be tailored to model different anomalous diffusions, including both subdiffusions and superdiffusions.

This section studies a fractional Kolmogorov equation of the following form:

\begin{matrix} ϖ_{\overset{ˉ}{t}} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) + a {(- \frac{\partial^{2}}{\partial {\overset{ˉ}{y}}^{2}})}^{ν} ϖ (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) \\ + \overset{ˉ}{y} ϖ_{\overset{ˉ}{x}} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) + b ϖ_{\overset{ˉ}{y}} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) = 0, \\ ϖ (t, \overset{ˉ}{x}, \overset{ˉ}{y}, t, x, y) = δ (\overset{ˉ}{x} - x) δ (\overset{ˉ}{y} - y), \end{matrix}

(7.20)

where $0 < ν < 1 .$ The pseudo-differential operator ${(- (\partial^{2}) \partial {\overset{ˉ}{y}}^{2})}^{ν}$ is defined as follows:

\begin{matrix} {(- \frac{\partial^{2}}{\partial {\overset{ˉ}{y}}^{2}})}^{ν} ϖ = F^{- 1} ({(l)}^{2 ν} F (ϖ)) . \end{matrix}

(7.21)

Here $F$ and $F^{- 1}$ denote the direct and inverse Fourier transforms, respectively. Despite its complexity, problem (7.20) can be solved by using Kelvin waves. For particular solutions of the form (3.38), (3.39), and (3.40), the corresponding characteristic equations are

\begin{matrix} α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + a {(γ)}^{2 ν} (t, \overset{ˉ}{t}) + i γ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) \overset{ˉ}{y} + i k \overset{ˉ}{y} + i b γ (t, \overset{ˉ}{t}) = 0, \\ α (t, t) = 0, γ (t, t) = l, \end{matrix}

(7.22)

so that

\begin{matrix} α_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + a {(γ)}^{2 ν} (t, \overset{ˉ}{t}) + i b γ (t, \overset{ˉ}{t}) = 0, α (t, t) = 0, \\ γ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) + k = 0, γ (t, t) = l, \end{matrix}

(7.23)

\begin{matrix} γ (t, \overset{ˉ}{t}) = - k T + l, \\ α (t, \overset{ˉ}{t}) = - a \int_{t}^{\overset{ˉ}{t}} {(- k (s - t) + l)}^{2 ν} d s - i b (- \frac{k T^{2}}{2} + l T) . \end{matrix}

(7.24)

Thus,

\begin{matrix} Ψ = & α + i k (\overset{ˉ}{x} - x) + i γ \overset{ˉ}{y} - i l y \\ = & - a \int_{t}^{\overset{ˉ}{t}} {(- k (s - t) + l)}^{2 ν} d s \\ + i k (\overset{ˉ}{x} - x - \overset{ˉ}{y} T + \frac{b T^{2}}{2}) + i l (\overset{ˉ}{y} - y - b T) . \end{matrix}

(7.25)

Now, assume that $ν = 1 / 2 .$ The key is to calculate the integral

\begin{matrix} I = \int_{t}^{\overset{ˉ}{t}} {(- k (s - t) + l)}^{2 ν} d s = \int_{0}^{T} (- k s + l) d s, \end{matrix}

(7.26)

for different values of $(k, l) .$ Depending on $(k, l),$ this integral can be calculated as follows:

\begin{matrix} I_{1} & = \int_{0}^{l / k} (- k s + l) d s + \int_{l / k}^{T} (k s - l) d s = \frac{k T^{2}}{2} - l T + \frac{l^{2}}{k}, 0 \leq k < \infty, \\ 0 \leq l \leq k T, \\ I_{2} & = - \frac{k T^{2}}{2} + l T, 0 \leq k < \infty, k T \leq l < \infty, \\ I_{3} & = - \frac{k T^{2}}{2} + l T, - \infty < k \leq 0, 0 \leq l < \infty, \\ I_{4} & = \int_{0}^{l / k} (k s - l) d s + \int_{l / k}^{T} (- k s + l) d s = - \frac{k T^{2}}{2} + l T - \frac{l^{2}}{k}, - \infty < k \leq 0, \\ k T \leq l \leq 0, \\ I_{5} & = \frac{k T^{2}}{2} - l T, - \infty < k \leq 0, - \infty < l \leq k T, \\ I_{6} & = \frac{k T^{2}}{2} - l T, 0 \leq k < \infty, - \infty < l \leq 0. \end{matrix}

(7.27)

Thus,

\begin{matrix} {(2 π)}^{2} J_{1} = \int_{0}^{\infty} \int_{0}^{k T} exp (- a (\frac{k T^{2}}{2} - l T + \frac{l^{2}}{k}) + i k a T^{2} ζ + i l a T η) d k d l \\ = T \int_{0}^{1} \int_{0}^{\infty} exp ((- p + i q) k) k d χ d k = - T \int_{0}^{1} \frac{\partial}{\partial p} (\int_{0}^{\infty} exp ((- p + i q) k) d k) d χ \\ = T \int_{0}^{1} \frac{d χ}{{(p - i q)}^{2}} = \frac{1}{a^{2} T^{3}} \int_{0}^{T} \frac{d χ}{{((χ - f_{+}) (χ - f_{-}))}^{2}}, \end{matrix}

(7.28)

where $(ζ, η)$ are nondimensional variables:

\begin{matrix} ζ & = \frac{\overset{ˉ}{x} - x - \overset{ˉ}{y} T + \frac{b T^{2}}{2}}{a T^{2}}, η = \frac{\overset{ˉ}{y} - y - b T}{a T}, \end{matrix}

(7.29)

\begin{matrix} l & = T χ k, p (χ) = a T^{2} (\frac{1}{2} - χ + χ^{2}) > 0, q (χ) = a T^{2} (ζ + χ η), \end{matrix}

(7.30)

and $f_{\pm}$ are roots of the quadratic equation

\begin{matrix} χ^{2} - (1 + i η) χ + (\frac{1}{2} - i ζ) = 0. \end{matrix}

(7.31)

One can check that

\begin{matrix} f_{\pm} & = \frac{(1 + i η) \pm \sqrt{{(1 + i η)}^{2} - 2 + 4 i ζ}}{2} = \frac{(1 + i η) \pm i \sqrt{D}}{2}, \\ f_{+} f_{-} & = \frac{1}{2} - i ζ, f_{+} + f_{-} = 1 + i η, f_{+} - f_{-} = i \sqrt{D}, \end{matrix}

(7.32)

with

\begin{matrix} D = 1 + η^{2} - 4 i ζ - 2 i η . \end{matrix}

(7.33)

The roots $f_{\pm}$ are never equal, since $D$ does not vanish when $ζ, η$ are real.

Thus, one has

\begin{matrix} {(2 π)}^{2} J_{1} = & \frac{1}{a^{2} T^{3}} \int_{0}^{1} \frac{d χ}{{((χ - f_{+}) (χ - f_{-}))}^{2}} \\ = \frac{1}{a^{2} T^{3} {(f_{+} - f_{-})}^{2}} \int_{0}^{1} {(\frac{1}{χ - f_{+}} - \frac{1}{χ - f_{-}})}^{2} d χ \\ = & - \frac{1}{a^{2} T^{3} {(f_{+} - f_{-})}^{2}} (((\frac{1}{1 - f_{+}} + \frac{1}{f_{+}}) + (\frac{1}{1 - f_{-}} + \frac{1}{f_{-}}))) \\ (+ \frac{2}{(f_{+} - f_{-})} ln (\frac{f_{-} (1 - f_{+})}{f_{+} (1 - f_{-})})) \\ = & \frac{1}{a^{2} T^{3} D} (\frac{4 (D + 2 i ζ + i η)}{(1 - 2 i ζ) (1 - 2 i ζ - 2 i η)} - \frac{2 i}{\sqrt{D}} ln (\frac{2 ζ + η - \sqrt{D}}{2 ζ + η + \sqrt{D}})) . \end{matrix}

(7.34)

By symmetry,

\begin{matrix} J_{4} (ζ, η) = J_{1} (- ζ, - η) = \overline{J_{1} (ζ, η)} . \end{matrix}

(7.35)

Next,

\begin{matrix} {(2 π)}^{2} J_{2} & = \int_{0}^{\infty} \int_{k T}^{\infty} exp (- a (- \frac{k T^{2}}{2} + l T) + i k a T^{2} ζ + i l a T η) d k d l \\ = \frac{1}{a T (1 - i η)} \int_{0}^{\infty} exp (- \frac{k a T^{2}}{2} + i k a T^{2} (ζ + η)) d k \\ = \frac{1}{a^{2} T^{3} ((1 - i η)) (\frac{1}{2} - i (ζ + η))} . \end{matrix}

(7.36)

Similarly, it is easy to show that

\begin{matrix} {(2 π)}^{2} J_{3} & = \int_{- \infty}^{0} \int_{0}^{\infty} exp (- a (- \frac{k T^{2}}{2} + l T) + i k a T^{2} ζ + i l a T η) d k d l \\ = \frac{1}{a^{2} T^{3} (1 - i η) (\frac{1}{2} + i ζ)}, \end{matrix}

(7.37)

while, by symmetry, one gets

\begin{matrix} J_{5} (ζ, η) & = \frac{1}{{(2 π)}^{2} a^{2} T^{3} (1 + i η) (\frac{1}{2} + i (ζ + η))} = \overline{J_{2} (ζ, η)}, \\ J_{6} (ζ, η) & = \frac{1}{{(2 π)}^{2} a^{2} T^{3} (1 + i η) (\frac{1}{2} - i ζ)} = \overline{J_{3} (ζ, η)}, \end{matrix}

(7.38)

so that

\begin{matrix} ϖ = & \frac{1}{π^{2} a^{2} T^{3}} (\frac{1}{(1 + η^{2})} (\frac{(1 - 2 η (ζ + η))}{(1 + 4 {(ζ + η)}^{2})} + \frac{(1 + 2 η ζ)}{(1 + 4 ζ^{2})})) \\ (+ R e (\frac{1}{D} (\frac{2 (D + 2 i ζ + i η)}{(D - {(2 ζ + η)}^{2})} - \frac{i}{\sqrt{D}} ln (\frac{2 ζ + η - \sqrt{D}}{2 ζ + η + \sqrt{D}})))), \end{matrix}

(7.39)

and

\begin{matrix} ϖ & (\overset{ˉ}{x}, \overset{ˉ}{y}) d \overset{ˉ}{x} d \overset{ˉ}{y} = \frac{1}{π^{2} a^{2}} (\frac{1}{(1 + η^{2})} (\frac{(1 - 2 η (ζ + η))}{(1 + 4 {(ζ + η)}^{2})} + \frac{(1 + 2 η ζ)}{(1 + 4 ζ^{2})})) \\ (+ R e (\frac{1}{D} (\frac{2 (D + 2 i ζ + i η)}{(D - {(2 ζ + η)}^{2})} - \frac{i}{\sqrt{D}} ln (\frac{2 ζ + η - \sqrt{D}}{2 ζ + η + \sqrt{D}})))) d ζ d η \\ \equiv ϖ (ζ, η) d ζ d η, \end{matrix}

(7.40)

which shows that, as expected, in the nondimensional variables there is no explicit dependence on $T .$ Footnote ⁷

A typical anomalous Kolmogorov process is depicted in Figure 9. The difference between the anomalous diffusion shown in Figure 9 and the pure diffusion shown in Figure 6 is clear.

Figure 9 Contour lines of $ϖ (0, 0, 0, T, \tilde{x}, \tilde{y})$ for an anomalous Kolmogorov process with $T = 1.5,$ $a = 2.5,$ $b = 1.5$ . Author’s graphics.

It is worth comparing Equations (7.39) and (3.28). To this end, rewrite $Φ$ given by (3.29) in the following form:

\begin{matrix} Φ = \frac{{(\overset{ˉ}{y} - y - b T)}^{2}}{2 a T} + \frac{6 {(\overset{ˉ}{x} - x - \frac{(\overset{ˉ}{y} + y) T}{2})}^{2}}{a T^{3}} = \frac{η^{2}}{2} + \frac{3 {(2 ζ + η)}^{2}}{2}, \end{matrix}

(7.41)

where $ζ, η$ are nondimensional variables of the form:

\begin{matrix} ζ = \frac{\overset{ˉ}{x} - x - \overset{ˉ}{y} T + \frac{b T^{2}}{2}}{\sqrt{a T^{3}}}, η = \frac{\overset{ˉ}{y} - y - b T}{\sqrt{a T}}, \end{matrix}

(7.42)

and $a$ is the diffusion coefficient; its dimension is $(a) = L^{2} / T^{3} .$ Thus,

\begin{matrix} ϖ = \frac{\sqrt{3}}{π a T^{2}} exp (- \frac{η^{2} + 3 {(2 ζ + η)}^{2}}{2}), \end{matrix}

(7.43)

and

\begin{matrix} ϖ (\overset{ˉ}{x}, \overset{ˉ}{y}) d \overset{ˉ}{x} d \overset{ˉ}{y} = \frac{\sqrt{3}}{π a} exp (- \frac{η^{2} + 3 {(2 ζ + η)}^{2}}{2}) d ζ d η \equiv ϖ (ζ, η) d ζ d η . \end{matrix}

(7.44)

Comparing Equations (7.39) and (7.43), one can see that the scaling of $ϖ$ and its asymptotic behavior at infinity is completely different.

7.4 Example: Feller Process

7.4.1 Feller Process

Feller Process with Constant Parameters

For benchmarking purposes, it is useful to start with deriving the well-known t.p.d.f. for the Feller process with constant coefficients; see Reference FellerFeller (1951), Reference Feller(1952):

\begin{matrix} d {\hat{y}}_{t} = (χ - κ {\hat{y}}_{t}) d t + ε \sqrt{{\hat{y}}_{t}} d {\hat{Z}}_{t}, {\hat{y}}_{t} = y . \end{matrix}

(7.45)

Initially, the process with time-independent parameters is considered; the time-dependent case is analyzed later in this section.

To start with, it is assumed that

\begin{matrix} \frac{2 χ}{ε^{2}} - 1 \equiv ϑ > 0. \end{matrix}

(7.46)

This condition guarantees that the process ${\hat{y}}_{t}$ does not hit zero, which is one of the main reasons to use the Feller process in practice; it is relaxed shortly.

The corresponding Fokker–Planck problem has the form:

\begin{matrix} ϖ_{\overset{ˉ}{t}} - \frac{1}{2} ε^{2} {(\overset{ˉ}{y} ϖ)}_{\overset{ˉ}{y} \overset{ˉ}{y}} + {((χ - κ \overset{ˉ}{y}) ϖ)}_{\overset{ˉ}{y}} = 0, \\ ϖ (t, y, t, \overset{ˉ}{y}) = δ (\overset{ˉ}{y} - y) . \end{matrix}

(7.47)

This equation can be written as a conservation law:

\begin{matrix} ϖ_{\overset{ˉ}{t}} + F_{\overset{ˉ}{y}} = 0, \\ ϖ (t, y, t, \overset{ˉ}{y}) = δ (\overset{ˉ}{y} - y), \end{matrix}

(7.48)

where the probability flux $F$ is given by

\begin{matrix} F = - \frac{1}{2} ε^{2} {(\overset{ˉ}{y} ϖ)}_{\overset{ˉ}{y}} + (χ - κ \overset{ˉ}{y}) ϖ . \end{matrix}

(7.49)

However, experience suggests that solving the backward Kolmogorov problem is more expedient. It can be formulated as follows:

\begin{matrix} ϖ_{t} + \frac{1}{2} ε^{2} y ϖ_{y y} + (χ - κ y) ϖ_{y} = 0, \\ ϖ (\overset{ˉ}{t}, y, \overset{ˉ}{t}, \overset{ˉ}{y}) = δ (y - \overset{ˉ}{y}) . \end{matrix}

(7.50)

The associated Kelvin wave function $K (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}, l)$ has the following form:

\begin{matrix} K = exp (α (t, \overset{ˉ}{t}) + i γ (t, \overset{ˉ}{t}) y - i l \overset{ˉ}{y}), \end{matrix}

(7.51)

where $α, γ$ solve the following system of backward ODEs:

\begin{matrix} α_{t} (t, \overset{ˉ}{t}) + χ i γ (t, \overset{ˉ}{t}) = 0, α (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \\ i γ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) - \frac{1}{2} ε^{2} γ^{2} (t, \overset{ˉ}{t}) - κ i γ (t, \overset{ˉ}{t}) = 0, γ (\overset{ˉ}{t}, \overset{ˉ}{t}) = l . \end{matrix}

(7.52)

Thus, $γ$ solves a nonlinear Riccati equation, which can be linearized via the standard substitution

\begin{matrix} γ (t, \overset{ˉ}{t}) = - \frac{2 i Ω^{'} (t, \overset{ˉ}{t})}{ε^{2} Ω (t, \overset{ˉ}{t})} . \end{matrix}

(7.53)

As a result, one gets the following equations:

\begin{matrix} Ω_{t t} (t, \overset{ˉ}{t}) - κ Ω_{t} (t, \overset{ˉ}{t}) = 0, Ω (\overset{ˉ}{t}, \overset{ˉ}{t}) = 1, Ω^{'} (\overset{ˉ}{t}, \overset{ˉ}{t}) = \frac{i ε^{2} l}{2}, \end{matrix}

(7.54)

\begin{matrix} α_{t} (t, \overset{ˉ}{t}) + \frac{2 χ}{ε^{2}} {(ln (Ω (t, \overset{ˉ}{t})))}_{t} = 0, α (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0. \end{matrix}

(7.55)

Accordingly,

\begin{matrix} Ω (t, \overset{ˉ}{t}) & = 1 - \frac{ε^{2}}{2} B_{κ} (T) i l, \end{matrix}

(7.56)

\begin{matrix} Ω^{'} (t, \overset{ˉ}{t}) & = \frac{ε^{2}}{2} A_{κ} (T) i l, \end{matrix}

(7.57)

\begin{matrix} γ (t, \overset{ˉ}{t}) & = \frac{A_{κ} (T) l}{(1 - \frac{ε^{2}}{2} B_{κ} (T) i l)}, \end{matrix}

(7.58)

\begin{matrix} α (t, \overset{ˉ}{t}) & = - (ϑ + 1) ln (1 - \frac{ε^{2}}{2} B_{κ} (T) i l), \end{matrix}

(7.59)

and

\begin{matrix} K = exp (- (ϑ + 1) ln (1 - \frac{ε^{2}}{2} B_{κ} (T) i l) + (\frac{\frac{ε^{2}}{2} A_{κ} (T)}{(1 - \frac{ε^{2}}{2} B_{κ} (T) i l)} y - \overset{ˉ}{y}) i l) . \end{matrix}

(7.60)

To analyze the problem further, it is helpful to define

\begin{matrix} M = \frac{2}{ε^{2} B_{κ} (T)}, \end{matrix}

(7.61)

introduce a new variable, $l \to \hat{l}$ :

\begin{matrix} \hat{l} = \frac{l}{2 M}, l = 2 M \hat{l}, \end{matrix}

(7.62)

and rescale $K,$ $K d l \to \hat{K} d \hat{l}$ :

\begin{matrix} \hat{K} = & 2 M exp (- M (Y + \overset{ˉ}{y})) \\ \times exp (- (ϑ + 1) ln (1 - 2 i \hat{l}) + M (\frac{Y}{1 - 2 i \hat{l}} + \overset{ˉ}{y} (1 - 2 i \hat{l}))), \end{matrix}

(7.63)

where $M$ appears due to the change of variables, and

\begin{matrix} Y = e^{- κ T} y . \end{matrix}

(7.64)

Finally,

\begin{matrix} ϖ (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) = \frac{M}{π} e^{- M (\overset{ˉ}{y} + Y)} \int_{- \infty}^{\infty} e^{- (ϑ + 1) ln (1 - 2 i \hat{l}) + M (\frac{Y}{1 - 2 i \hat{l}} + \overset{ˉ}{y} (1 - 2 i \hat{l}))} d \hat{l} . \end{matrix}

(7.65)

Equation (7.65) allows us to understand the true meaning of condition (7.46). When this condition is satisfied, the corresponding integral converges absolutely when $\hat{l} \to \pm \infty .$ A well-known formula yields

\begin{matrix} ϖ^{(ϑ)} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) = M e^{- M (\overset{ˉ}{y} + Y)} {(\frac{\overset{ˉ}{y}}{Y})}^{ϑ / 2} I_{ϑ} (2 M \sqrt{\overset{ˉ}{y} Y}) . \end{matrix}

(7.66)

See, for example, Reference LiptonLipton (2001) and references therein. The probability flux $F$ has the form

\begin{matrix} F^{(ϑ)} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) = - \frac{1}{2} ε^{2} {(\overset{ˉ}{y} ϖ^{(ϑ)} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}))}_{\overset{ˉ}{y}} + (χ - κ \overset{ˉ}{y}) ϖ^{(ϑ)} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) \\ = - \frac{1}{2} ε^{2} M Y ϖ^{(ϑ + 1)} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) + (\frac{1}{2} ε^{2} - \frac{κ}{M}) M \overset{ˉ}{y} ϖ^{(ϑ)} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) . \end{matrix}

(7.67)

It is important to note that the density $ϖ (\overset{ˉ}{y})$ integrates to one:

\begin{matrix} \int_{0}^{\infty} ϖ (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) d \overset{ˉ}{y} = \int_{0}^{\infty} e^{- u - v} {(\frac{v}{u})}^{ϑ / 2} I_{ϑ} (2 \sqrt{u v}) d v = 1, \end{matrix}

(7.68)

where $u = M Y, v = M \overset{ˉ}{y} .$ This fact is used in the following discussion.

Using the asymptotic expansion of the modified Bessel function, one can show that $ϖ^{(ϑ)}$ and $F$ vanish on the boundary, since

\begin{matrix} ϖ^{(ϑ)} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) & = \frac{M e^{- M Y}}{Γ (ϑ + 1)} {(M \overset{ˉ}{y})}^{ϑ} (1 + O (\overset{ˉ}{y})), \\ F^{(ϑ)} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) & = (\frac{ε^{2}}{2} (1 - \frac{M Y}{(ϑ + 1)}) - \frac{κ}{M}) M e^{- M Y} \frac{{(M \overset{ˉ}{y})}^{(ϑ + 1)}}{Γ (ϑ + 1)} (1 + O (\overset{ˉ}{y})) . \end{matrix}

(7.69)

Now assume that condition (7.46) is violated, so that $- 1 < ϑ < 0 .$ In this case, the integral in (7.65) is no longer absolutely convergent, so one needs to regularize it. There are two ways of regularizing the corresponding integral: (I) integration by parts, (II) change of variables. Not surprisingly, they produce different results.

Start with integration by parts and write

\begin{matrix} I n t_{ϑ} \equiv & \frac{1}{π} \int_{- \infty}^{\infty} e^{- (ϑ + 1) ln (1 - 2 i \hat{l}) + \frac{M Y}{1 - 2 i \hat{l}}} d (\frac{e^{M \overset{ˉ}{y} (1 - 2 i \hat{l})}}{- 2 i M \overset{ˉ}{y}}) \\ = & \frac{1}{π} \frac{(ϑ + 1)}{M \overset{ˉ}{y}} \int_{- \infty}^{\infty} e^{- (ϑ + 2) ln (1 - 2 i \hat{l}) + M (\frac{Y}{1 - 2 i \hat{l}} + \overset{ˉ}{y} (1 - 2 i \hat{l}))} d \hat{l} \\ + \frac{1}{π} \frac{Y}{\overset{ˉ}{y}} \int_{- \infty}^{\infty} e^{- (ϑ + 3) ln (1 - 2 i \hat{l}) + M (\frac{Y}{1 - 2 i \hat{l}} + \overset{ˉ}{y} (1 - 2 i \hat{l}))} d \hat{l}, \end{matrix}

(7.70)

where the integrals are absolutely convergent. Thus, (7.66) yields

\begin{matrix} I n t_{ϑ} = {(\frac{\overset{ˉ}{y}}{Y})}^{\frac{ϑ}{2}} (\frac{2 (ϑ + 1)}{Z} I_{ϑ + 1} (Z) + I_{ϑ + 2} (Z)) = {(\frac{\overset{ˉ}{y}}{Y})}^{\frac{ϑ}{2}} I_{ϑ} (Z), \end{matrix}

(7.71)

where $Z = 2 M \sqrt{\overset{ˉ}{y} Y},$ and a well-known recurrent relation for the modified Bessel functions is used; Reference Abramowitz and StegunAbramowitz and Stegun (1964), Eq. 9.6.26. Thus, Equations (7.66) and (7.67) hold for $- 1 < ϑ < 0$ :

\begin{matrix} ϖ^{(ϑ, I)} = ϖ^{(ϑ)}, F^{(ϑ, I)} = F^{(ϑ)} . \end{matrix}

(7.72)

It is important to note that $ϖ^{(ϑ, I)} (\overset{ˉ}{y} \to 0) \to \infty$ when $ϑ < 0$ (the corresponding singularity is integrable), while $ϖ^{(ϑ)}$ is bounded at $\overset{ˉ}{y} = 0,$ when $ϑ > 0 .$ While the t.p.d.f. itself blows up at the natural boundary $\overset{ˉ}{y} = 0,$ the probability flux $F^{(ϑ, I)}$ vanishes on the boundary, so that the total probability of staying on the positive semiaxis is conserved.

Now, use change of variables to regularize $I n t_{ϑ} .$ Specifically, introduce $\tilde{l},$ such that

\begin{matrix} 1 - 2 i \hat{l} = \frac{1}{1 - 2 i \tilde{l}}, \end{matrix}

(7.73)

and formally write $I n t_{ϑ}$ as follows:

\begin{matrix} I n t_{ϑ} & = \frac{1}{π} \int_{- \infty}^{\infty} e^{- (- ϑ + 1) ln (1 - 2 i \tilde{l}) + M Y (1 - 2 i \tilde{l}) + \frac{M \overset{ˉ}{y}}{(1 - 2 i \tilde{l})}} d \tilde{l} \\ = {(\frac{Y}{\overset{ˉ}{y}})}^{- \frac{ϑ}{2}} I_{- ϑ} (2 M \sqrt{\overset{ˉ}{y} Y}) = {(\frac{\overset{ˉ}{y}}{Y})}^{\frac{ϑ}{2}} I_{- ϑ} (2 M \sqrt{\overset{ˉ}{y} Y}) . \end{matrix}

(7.74)

Accordingly,

\begin{matrix} ϖ^{(ϑ, I I)} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) = M e^{- M (\overset{ˉ}{y} + Y)} {(\frac{\overset{ˉ}{y}}{Y})}^{\frac{ϑ}{2}} I_{- ϑ} (2 M \sqrt{\overset{ˉ}{y} Y}) . \end{matrix}

(7.75)

A straightforward calculation yields

\begin{matrix} F^{(ϑ, I I)} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) = & M e^{- M (\overset{ˉ}{y} + Y)} {(\frac{\overset{ˉ}{y}}{Y})}^{ϑ / 2} (- \frac{1}{2} ε^{2} M \sqrt{\overset{ˉ}{y} Y} I_{- ϑ + 1} (2 M \sqrt{\overset{ˉ}{y} Y})) \\ (+ (\frac{1}{2} ε^{2} ϑ + (\frac{1}{2} ε^{2} - \frac{κ}{M}) M \overset{ˉ}{y}) I_{- ϑ} (2 M \sqrt{\overset{ˉ}{y} Y})) . \end{matrix}

(7.76)

It is easy to see that both $ϖ^{(ϑ, I I)}$ and $F^{(ϑ, I I)}$ are bounded at $\overset{ˉ}{y} = 0$ :

\begin{matrix} ϖ^{(ϑ, I I)} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) & = \frac{M e^{- M Y}}{Γ (- ϑ + 1) {(M Y)}^{ϑ}} (1 + O (\overset{ˉ}{y})), \\ F^{(ϑ, I I)} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) & = \frac{ε^{2} ϑ M e^{- M Y}}{2 Γ (- ϑ + 1) {(M Y)}^{ϑ}} (1 + O (\overset{ˉ}{y})) . \end{matrix}

(7.77)

Since there is a probability flux across the natural boundary $\overset{ˉ}{y} = 0,$ the total probability on the positive semiaxis $(0, \infty)$ is less than one.

Representative t.p.d.fs for Feller processes with different values of $ϑ$ are illustrated in Figure 10.

Figure 10 T.p.d.fs for three Feller processes with different parameters and regularity conditions. (a) $χ = 0.1,$ $κ = 1.2,$ $ε = 0.2,$ $y_{0} = 0.15,$ ${\overset{ˉ}{t}}_{m a x} = 3$ ; (b), (c) $χ = 0.1,$ $κ = 1.2,$ $ε = 0.6,$ $y_{0} = 0.15,$ ${\overset{ˉ}{t}}_{m a x} = 3 .$ For the first and second processes, the probability of $\overset{ˉ}{y} \geq 0$ is equal to one. For the third process, this probability, shown as a function of time in (d), is less than one. Author’s graphics.

Feller Process with Time-Dependent Parameters

Surprisingly, studying the Feller process with time-dependent coefficients is viewed as a difficult problem, which remains an active area of research; see, for example, Reference MasoliverMasoliver (2016), Reference Giorno and NobileGiorno and Nobile (2021), and references therein. However, using Kelvin wave formalism allows one to find an expression for the t.p.d.f. in a very natural way.

For the process with time-dependent parameters, the problem of interest has the form:

\begin{matrix} ϖ_{t} + \frac{1}{2} ε^{2} (t) y ϖ_{y y} + (χ (t) - κ (t) y) ϖ_{y} = 0, \\ ϖ (\overset{ˉ}{t}, y) = δ (y - \overset{ˉ}{y}) . \end{matrix}

(7.78)

Here it is assumed that the following regularity condition is satisfied:

\begin{matrix} ϑ (t) = \frac{2 χ (t)}{ε^{2} (t)} - 1 > 0. \end{matrix}

(7.79)

This condition guarantees that the corresponding integrals converge at infinity.

As usual, $ϖ$ can be written as a superposition of Kelvin waves of the form

\begin{matrix} K = exp (α (t, \overset{ˉ}{t}) + i γ (t, \overset{ˉ}{t}) y - i l \overset{ˉ}{y}), \end{matrix}

(7.80)

where $α, γ$ solve the following system of backward ODEs:

\begin{matrix} α_{t} (t, \overset{ˉ}{t}) + χ (t, \overset{ˉ}{t}) i γ (t, \overset{ˉ}{t}) = 0, α (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \\ i γ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) - \frac{1}{2} ε^{2} (t) γ^{2} (t, \overset{ˉ}{t}) - κ (t) i γ (t, \overset{ˉ}{t}) = 0, γ (\overset{ˉ}{t}, \overset{ˉ}{t}) = l . \end{matrix}

(7.81)

Introducing $Ω (t, \overset{ˉ}{t}),$ such that

\begin{matrix} γ (t, \overset{ˉ}{t}) = - \frac{2 i Ω^{'} (t, \overset{ˉ}{t})}{ε^{2} (t) Ω (t, \overset{ˉ}{t})}, \end{matrix}

(7.82)

one gets the following second-order equation for $Ω (t, \overset{ˉ}{t})$ :

\begin{matrix} Ω_{t t} (t, \overset{ˉ}{t}) - (κ + 2 ln {(ε)}^{'}) Ω_{t} (t, \overset{ˉ}{t}) = 0, Ω (\overset{ˉ}{t}, \overset{ˉ}{t}) = 1, Ω^{'} (\overset{ˉ}{t}, \overset{ˉ}{t}) = \frac{ε^{2} (\overset{ˉ}{t})}{2} i l . \end{matrix}

(7.83)

Solving this equation, one gets

\begin{matrix} Ω (t, \overset{ˉ}{t}) & = 1 - \frac{ε^{2} (t)}{2} {\overset{ˉ}{B}}_{κ} (t, \overset{ˉ}{t}) i l, \\ Ω_{t} (t, \overset{ˉ}{t}) & = \frac{ε^{2} (t)}{2} (A_{κ} (t, \overset{ˉ}{t}) - \frac{2 ε^{'} (t)}{ε (t)} {\overset{ˉ}{B}}_{κ} (t, \overset{ˉ}{t})) i l . \end{matrix}

(7.84)

Accordingly,

\begin{matrix} γ (t, \overset{ˉ}{t}) = & \frac{(A_{κ} (t, \overset{ˉ}{t}) - \frac{2 ε^{'} (t)}{ε (t)} {\overset{ˉ}{B}}_{κ} (t, \overset{ˉ}{t})) l}{(1 - \frac{ε^{2} (t)}{2} {\overset{ˉ}{B}}_{κ} (t, \overset{ˉ}{t}) i l)}, \end{matrix}

(7.85)

\begin{matrix} α (t, \overset{ˉ}{t}) = & - \frac{2 χ (t)}{ε^{2} (t)} ln (1 - \frac{ε^{2} (t)}{2} {\overset{ˉ}{B}}_{κ} (t, \overset{ˉ}{t}) i l) \\ - \int_{t}^{\overset{ˉ}{t}} {(\frac{2 χ (s)}{ε^{2} (s)})}^{'} ln (1 - \frac{ε^{2} (s)}{2} {\overset{ˉ}{B}}_{κ} (s, \overset{ˉ}{t}) i l) d s . \end{matrix}

(7.86)

Thus, the Kelvin wave becomes

\begin{matrix} K = & exp (- \frac{2 χ (t)}{ε^{2} (t)} ln (1 - \frac{ε^{2} (t)}{2} {\overset{ˉ}{B}}_{κ} (t, \overset{ˉ}{t}) i l)) \\ (- \int_{t}^{\overset{ˉ}{t}} {(\frac{2 χ (s)}{ε^{2} (s)})}^{'} ln (1 - \frac{ε^{2} (s)}{2} {\overset{ˉ}{B}}_{κ} (s, \overset{ˉ}{t}) i l) d s) \\ (+ (\frac{(A_{κ} (t, \overset{ˉ}{t}) - \frac{2 ε^{'} (t)}{ε (t)} {\overset{ˉ}{B}}_{κ} (t, \overset{ˉ}{t}))}{(1 - \frac{ε^{2} (t)}{2} {\overset{ˉ}{B}}_{κ} (t, \overset{ˉ}{t}) i l)} y - \overset{ˉ}{y}) i l) . \end{matrix}

(7.87)

By analogy with (7.61), (7.62), and (7.63), define

\begin{matrix} M (t, \overset{ˉ}{t}) = \frac{2}{ε^{2} (t) {\overset{ˉ}{B}}_{κ} (t, \overset{ˉ}{t})}, \hat{l} = \frac{l}{2 M (t, \overset{ˉ}{t})}, l = 2 M (t, \overset{ˉ}{t}) \hat{l} \end{matrix}

(7.88)

and represent $\hat{K}$ as follows:

\begin{matrix} \hat{K} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}, \hat{l}) = 2 M exp (- M (t, \overset{ˉ}{t}) (Y + \overset{ˉ}{y})) exp (- \frac{2 χ (t)}{ε^{2} (t)} ln (1 - 2 i \hat{l})) \\ (- \int_{t}^{\overset{ˉ}{t}} {(\frac{2 χ (s)}{ε^{2} (s)})}^{'} ln (1 - \frac{M (t, \overset{ˉ}{t})}{M (s, \overset{ˉ}{t})} 2 i \hat{l}) d s + M (t, \overset{ˉ}{t}) (\frac{Y}{(1 - 2 i \hat{l})} + \overset{ˉ}{y} (1 - 2 i \hat{l}))), \end{matrix}

(7.89)

where

\begin{matrix} Y = (A_{κ} (t, \overset{ˉ}{t}) - \frac{4 ε^{'} (t)}{ε^{3} (t) M (t, \overset{ˉ}{t})}) y . \end{matrix}

(7.90)

Finally,

\begin{matrix} ϖ (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) = \frac{1}{2 π} \int_{- \infty}^{\infty} \hat{K} (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}, \hat{l}) d \hat{l} . \end{matrix}

(7.91)

Therefore, finding $ϖ (t, y, \overset{ˉ}{t}, \overset{ˉ}{y})$ is reduced to solving some very simple ODEs and calculating a one-dimensional integral, which is theoretically appealing and numerically efficient.

Feller Process with Jumps

Consider a jump-diffusion process ${\hat{y}}_{t}$ with constant coefficients governed by the following equation:

\begin{matrix} d {\hat{y}}_{t} = (χ - κ {\hat{y}}_{t}) d t + ε \sqrt{{\hat{y}}_{t}} d {\hat{Z}}_{t} + J d {\hat{Π}}_{t}, {\hat{y}}_{t} = y, \end{matrix}

(7.92)

where ${\hat{Z}}_{t}$ is a standard Wiener process, and ${\hat{Π}}_{t}$ is a Poisson process with intensity $λ .$ To preserve tractability, it is assumed that jumps are positive and exponentially distributed with parameter $ϕ$ ; for additional insights, see Reference Lipton and SheltonLipton and Shelton (2012).

The backward Kolmogorov problem can be written as

\begin{matrix} ϖ_{t} + \frac{1}{2} ε^{2} y ϖ_{y y} + (χ - k y) ϖ_{y} + λ (ϕ \int_{0}^{\infty} ϖ (t, y + J) e^{- ϕ J} d J - ϖ (t, y)) = 0, \\ ϖ (\overset{ˉ}{t}, y, \overset{ˉ}{t}, \overset{ˉ}{y}) = δ (y - \overset{ˉ}{y}) ​ . \end{matrix}

(7.93)

The corresponding Kelvin wave has the familiar form:

\begin{matrix} K (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}, l) = exp (α (t, \overset{ˉ}{t}) + i γ (t, \overset{ˉ}{t}) y - i l \overset{ˉ}{y}), \end{matrix}

(7.94)

where $α, γ$ satisfy the following system of ODEs:

\begin{matrix} α_{t} (t, \overset{ˉ}{t}) + χ i γ (t, \overset{ˉ}{t}) + \frac{λ i γ (t, \overset{ˉ}{t})}{ϕ - i γ (t, \overset{ˉ}{t})}, α (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \\ i γ_{\overset{ˉ}{t}} (t, \overset{ˉ}{t}) - \frac{1}{2} ε^{2} γ^{2} (t, \overset{ˉ}{t}) - κ i γ (t, \overset{ˉ}{t}) = 0, γ (\overset{ˉ}{t}, \overset{ˉ}{t}) = l . \end{matrix}

(7.95)

The expression for $γ$ is given by (7.58), while $α$ can be split as follows:

\begin{matrix} α (t, \overset{ˉ}{t}) = α_{0} (t, \overset{ˉ}{t}) + λ α_{1} (t, \overset{ˉ}{t}) . \end{matrix}

(7.96)

In this setting, $α_{0}$ has the familiar form:

\begin{matrix} α_{0} (t, \overset{ˉ}{t}) = - (ϑ + 1) ln (1 - \frac{ε^{2}}{2} {\overset{ˉ}{B}}_{κ} (T) i l), \end{matrix}

(7.97)

while $α_{1}$ can be represented as follows:

\begin{matrix} α_{1} (t, \overset{ˉ}{t}) & = \int_{t}^{\overset{ˉ}{t}} \frac{A_{κ} (\overset{ˉ}{t} - s) i l}{ϕ - (\frac{ϕ ε^{2}}{2} {\overset{ˉ}{B}}_{κ} (\overset{ˉ}{t} - s) + A_{κ} (\overset{ˉ}{t} - s)) i l} d s \\ = \frac{1}{(κ - \frac{ϕ ε^{2}}{2})} ln (\frac{ϕ - (\frac{ϕ ε^{2}}{2} {\overset{ˉ}{B}}_{κ} (T) + A_{κ} (T)) i l}{ϕ - i l}) . \end{matrix}

(7.98)

Thus, jumps do profoundly affect the dynamics of the underlying stochastic process.

7.4.2 Augmented Feller Process, I

This section studies the joint dynamics of a Feller process ${\hat{y}}_{t}$ and its integral ${\hat{x}}_{t} .$ The corresponding combined process is described by the following equations:

\begin{matrix} d {\hat{x}}_{t} & = {\hat{y}}_{t} d t, {\hat{x}}_{t} = x, \\ d {\hat{y}}_{t} & = (χ - κ {\hat{y}}_{t}) d t + ε \sqrt{{\hat{y}}_{t}} d {\hat{Z}}_{t}, {\hat{y}}_{t} = y . \end{matrix}

(7.99)

Depending on the interpretation, these equations can describe the joint evolution of a particle’s position and its velocity, the integral of variance and variance, among other possibilities.

The forward Fokker–Planck has the following form:

\begin{matrix} ϖ_{\overset{ˉ}{t}} - \frac{1}{2} ε^{2} {(\overset{ˉ}{y} ϖ)}_{\overset{ˉ}{y} \overset{ˉ}{y}} + \overset{ˉ}{y} ϖ_{\overset{ˉ}{x}} + {((χ - κ \overset{ˉ}{y}) ϖ)}_{\overset{ˉ}{y}} = 0, \\ ϖ (t, x, y, t, \overset{ˉ}{x}, \overset{ˉ}{y}) = δ (\overset{ˉ}{x} - x) δ (\overset{ˉ}{y} - y), \end{matrix}

(7.100)

while the backward Kolmogorov problem can be written as follows:

\begin{matrix} ϖ_{t} + \frac{1}{2} ε^{2} y ϖ_{y y} + y ϖ_{x} + (χ - κ y) ϖ_{y} = 0 \\ ϖ (\overset{ˉ}{t}, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) = δ (x - \overset{ˉ}{x}) δ (y - \overset{ˉ}{y}) . \end{matrix}

(7.101)

In the following discussion the backward problem is considered, which allows one to derive the desired formula more efficiently. The corresponding function $K$ has the following form:

\begin{matrix} K = exp (α (t, \overset{ˉ}{t}) + i k (x - \overset{ˉ}{x}) + i γ (t, \overset{ˉ}{t}) y - i l \overset{ˉ}{y}), \end{matrix}

(7.102)

where

\begin{matrix} α_{t} (t, \overset{ˉ}{t}) + i χ γ (t, \overset{ˉ}{t}) = 0, α (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \\ i γ_{t} (t, \overset{ˉ}{t}) - \frac{1}{2} ε^{2} γ^{2} (t, \overset{ˉ}{t}) - i κ γ (t, \overset{ˉ}{t}) + i k = 0, γ (\overset{ˉ}{t}, \overset{ˉ}{t}) = l . \end{matrix}

(7.103)

As before, one can linearize the Riccati equation for $γ$ by using substitution given by (7.53), with $Ω (t, \overset{ˉ}{t})$ solving the second-order equation of the following form:

\begin{matrix} Ω_{t t} (t, \overset{ˉ}{t}) - κ Ω_{t} (t, \overset{ˉ}{t}) + \frac{i ε^{2} k}{2} Ω (t, \overset{ˉ}{t}) = 0, Ω (\overset{ˉ}{t}, \overset{ˉ}{t}) = 1, Ω^{'} (\overset{ˉ}{t}, \overset{ˉ}{t}) = \frac{i ε^{2} l}{2} . \end{matrix}

(7.104)

One can represent $Ω (t, \overset{ˉ}{t})$ in the following form:

\begin{matrix} Ω (t, \overset{ˉ}{t}) = ω_{+} e^{λ_{+} (\overset{ˉ}{t} - t)} + ω_{-} e^{λ_{+} (\overset{ˉ}{t} - t)}, \end{matrix}

(7.105)

where $λ_{\pm}$ are solutions of the characteristic equation:

\begin{matrix} λ^{2} + κ λ + \frac{i ε^{2} k}{2} = 0, \end{matrix}

(7.106)

and $ω_{\pm}$ satisfy the following system of linear equations:

\begin{matrix} ω_{+} + ω_{-} & = 1, \\ λ_{+} ω_{+} + λ_{-} ω_{-} & = - \frac{i ε^{2} l}{2} . \end{matrix}

(7.107)

Thus,

\begin{matrix} λ_{\pm} & = μ \pm ζ, \\ μ & = - \frac{κ}{2}, ζ = \frac{\sqrt{κ^{2} - 2 i ε^{2} k}}{2}, \end{matrix}

(7.108)

\begin{matrix} ω_{+} & = - \frac{(2 λ_{-} + i ε^{2} l)}{4 ζ}, ω_{-} = \frac{(2 λ_{+} + i ε^{2} l)}{4 ζ} . \end{matrix}

(7.109)

It is useful to note that

\begin{matrix} λ_{+} λ_{-} = μ^{2} - ζ^{2} = \frac{i ε^{2} k}{2} . \end{matrix}

(7.110)

For the sake of brevity, notation (6.122) is used:

\begin{matrix} Ω (t, \overset{ˉ}{t}) & = \frac{E_{0} (- (2 λ_{-} + i ε^{2} l) E_{+} + (2 λ_{+} + i ε^{2} l) E_{-})}{4 ζ}, \end{matrix}

(7.111)

\begin{matrix} Ω_{t} (t, \overset{ˉ}{t}) & = \frac{E_{0} (λ_{+} (2 λ_{-} + i ε^{2} l) E_{+} - λ_{-} (2 λ_{+} + i ε^{2} l) E_{-})}{4 ζ}, \end{matrix}

(7.112)

\begin{matrix} γ & = \frac{2 i (λ_{+} (2 λ_{-} + i ε^{2} l) E_{+} - λ_{-} (2 λ_{+} + i ε^{2} l) E_{-})}{ε^{2} ((2 λ_{-} + i ε^{2} l) E_{+} - (2 λ_{+} + i ε^{2} l) E_{-})}, \end{matrix}

(7.113)

\begin{matrix} α & = \frac{χ κ T}{ε^{2}} - (ϑ + 1) ln (\frac{- (2 λ_{-} + i ε^{2} l) E_{+} + (2 λ_{+} + i ε^{2} l) E_{-}}{4 ζ}) . \end{matrix}

(7.114)

Accordingly, $K$ can be written in the following form:

\begin{matrix} K = & exp (\frac{χ κ T}{ε^{2}} + i k (x - \overset{ˉ}{x})) \\ - (ϑ + 1) ln (\frac{2 (- λ_{-} E_{+} + λ_{+} E_{-}) - i ε^{2} l (E_{+} - E_{-})}{4 ζ}) \\ (+ \frac{2 (2 λ_{+} λ_{-} (E_{+} - E_{-}) + i ε^{2} l (λ_{+} E_{+} - λ_{-} E_{-}))}{ε^{2} (2 (- λ_{-} E_{+} + λ_{+} E_{-}) - i ε^{2} l (E_{+} - E_{-}))} y - i l \overset{ˉ}{y}) . \end{matrix}

(7.115)

Define a new variable $\hat{l},$ such that

\begin{matrix} \hat{l} = \frac{l}{2 M}, l = 2 M \hat{l}, \end{matrix}

(7.116)

where

\begin{matrix} M = \frac{2 (- λ_{-} E_{+} + λ_{+} E_{-})}{ε^{2} (E_{+} - E_{-})} . \end{matrix}

(7.117)

Rescaled $\hat{K}$ can be factorized as follows:

\begin{matrix} \hat{K} = {\hat{K}}_{1} {\hat{K}}_{2}, \end{matrix}

(7.118)

where

\begin{matrix} {\hat{K}}_{1} = & exp (\frac{χ κ T}{ε^{2}} + i k (x - \overset{ˉ}{x}) - (ϑ + 1) ln (\frac{- λ_{-} E_{+} + λ_{+} E_{-}}{2 ζ})) \\ (+ \frac{2 λ_{+} λ_{-} (E_{+} - E_{-})}{ε^{2} (- λ_{-} E_{+} + λ_{+} E_{-})} y), \\ {\hat{K}}_{2} = & 2 M exp (- M (Y + \overset{ˉ}{y})) \\ \times exp (- (ϑ + 1) ln (1 - 2 i \hat{l}) + M (\frac{Y}{1 - 2 i \hat{l}} + \overset{ˉ}{y} (1 - 2 i \hat{l}))), \end{matrix}

(7.119)

with

\begin{matrix} Y = \frac{4 ζ^{2}}{{(- λ_{-} E_{+} + λ_{+} E_{-})}^{2}} y . \end{matrix}

(7.120)

Integration with respect to $\hat{l}$ can be done analytically:

\begin{matrix} \frac{M}{π} \int_{- \infty}^{\infty} {\hat{K}}_{2} d \hat{l} = M e^{- M (\overset{ˉ}{y} + Y)} {(\frac{\overset{ˉ}{y}}{Y})}^{\frac{ϑ}{2}} I_{ϑ} (2 M \sqrt{\overset{ˉ}{y} Y}), \end{matrix}

(7.121)

which allows one to calculate $ϖ$ via a single inverse Fourier transform:

\begin{matrix} ϖ = & \frac{1}{2 π} \int_{- \infty}^{\infty} exp (\frac{χ κ T}{ε^{2}} + i k (x - \overset{ˉ}{x}) - (ϑ + 1) ln (\frac{- λ_{-} E_{+} + λ_{+} E_{-}}{2 ζ})) \\ (+ \frac{2 λ_{+} λ_{-} (E_{+} - E_{-})}{ε^{2} (- λ_{-} E_{+} + λ_{+} E_{-})} y) M e^{- M (\overset{ˉ}{y} + Y)} {(\frac{\overset{ˉ}{y}}{Y})}^{\frac{ϑ}{2}} I_{ϑ} (2 M \sqrt{\overset{ˉ}{y} Y}) d k . \end{matrix}

(7.122)

A typical t.p.d.f. for a degenerate augmented Feller process is illustrated in Figure 11.

Figure 11 A thousand trajectories of a representative t.p.d.f. for the degenerate augmented Feller process. Parameters are $T = 3,$ $d t = 0.01,$ $χ = 0.1,$ $κ = 1.2,$ $ε = 0.2,$ $x = 0$ ; $y_{0} = 0.15 .$ (a) $x (t),$ (b) $y (t),$ (c) $(\overset{ˉ}{x} (T), \overset{ˉ}{y} (T)),$ (d) contour lines of $ϖ (0, 0.15, 0, T, \tilde{x}, \tilde{y}) .$ Author’s graphics.

Since the integral over $\overset{ˉ}{y}$ is equal to one, one can represent the marginal distribution of $\overset{ˉ}{x}$ in the following form:

\begin{matrix} ϖ^{(x)} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}) = \frac{1}{2 π} \int_{- \infty}^{\infty} ϝ (t, y, \overset{ˉ}{t}, k) e^{i k (x - \overset{ˉ}{x})} d k, \end{matrix}

(7.123)

where

\begin{matrix} ϝ (t, y, \overset{ˉ}{t}, k) = & exp (\frac{χ κ T}{ε^{2}} - (ϑ + 1) ln (\frac{- λ_{-} E_{+} + λ_{+} E_{-}}{2 ζ})) \\ (+ \frac{2 λ_{+} λ_{-} (E_{+} - E_{-})}{ε^{2} (- λ_{-} E_{+} + λ_{+} E_{-})} y), \end{matrix}

(7.124)

with $μ, ζ$ given by the equations in (7.106). It is easy to check that $ϖ^{(x)} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x})$ integrates to one:

\begin{matrix} \int_{- \infty}^{\infty} ϖ^{(x)} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}) d \overset{ˉ}{x} = \frac{1}{2 π} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} ϝ (t, y, \overset{ˉ}{t}, k) exp (i k (x - \overset{ˉ}{x})) d k d \overset{ˉ}{x} \\ = \int_{- \infty}^{\infty} ϝ (t, y, \overset{ˉ}{t}, k) δ (k) d k = ϝ (t, y, \overset{ˉ}{t}, 0) = exp (\frac{χ κ T}{ε^{2}} - \frac{2 χ}{ε^{2}} \frac{κ T}{2}) = 1. \end{matrix}

(7.125)

The expected value of $\overset{ˉ}{x}$ has the following form:

\begin{matrix} X & = \int_{- \infty}^{\infty} ϖ^{(x)} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}) \overset{ˉ}{x} d \overset{ˉ}{x} = x + \int_{- \infty}^{\infty} ϖ^{(x)} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}) (\overset{ˉ}{x} - x) d \overset{ˉ}{x} \\ = x + \frac{1}{2 π} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} ϝ (t, y, \overset{ˉ}{t}, k) exp (i k (\overset{ˉ}{x} - x)) (\overset{ˉ}{x} - x) d k d \overset{ˉ}{x} \\ = x + lim_{ϵ \to 0} (\frac{d}{d ϵ} (\frac{1}{2 π} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} ϝ (t, y, \overset{ˉ}{t}, k) exp ((i k + ϵ) (\overset{ˉ}{x} - x)) d k d \overset{ˉ}{x})) \\ = x + lim_{ϵ \to 0} (\frac{d}{d ϵ} (\int_{- \infty}^{\infty} ϝ (t, y, \overset{ˉ}{t}, k) δ (k - i ϵ) d k)) \\ = x + {(\frac{d}{d ϵ} ϝ (t, y, \overset{ˉ}{t}, i ϵ))}_{ϵ = 0} . \end{matrix}

(7.126)

A calculation left to the reader yields

\begin{matrix} X = x + \frac{χ}{κ} T - {\overset{ˉ}{B}}_{κ} (T) (\frac{χ}{κ} - y), \end{matrix}

(7.127)

which agrees with (6.115).

It is worth noting that $ϖ^{(x)} (\overset{ˉ}{t}, \overset{ˉ}{x})$ has fat tails, since some of the exponential moments of $\overset{ˉ}{x}$ have finite-time explosions; see Reference Andersen and PiterbargAndersen and Piterbarg (2007), Reference Friz and Keller-ResselFriz and Keller-Ressel (2010), and references therein.Footnote ⁸ Specifically, one needs to analyze if $I_{p} (t, \overset{ˉ}{t})$ of the following form:

\begin{matrix} I_{p} (t, \overset{ˉ}{t}) = \int_{- \infty}^{\infty} ϖ^{(x)} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}) e^{p \overset{ˉ}{x}} d \overset{ˉ}{x} \\ = \frac{e^{p x}}{2 π} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} ϝ (t, y, \overset{ˉ}{t}, k) exp (i k (x - \overset{ˉ}{x}) - p (x - \overset{ˉ}{x})) d k d \overset{ˉ}{x} \\ = e^{p x} \int_{- \infty}^{\infty} ϝ (t, y, \overset{ˉ}{t}, k) δ (i k - p) d k = e^{p x} ϝ (t, y, \overset{ˉ}{t}, - i p), \end{matrix}

(7.128)

blows up for some finite $\overset{ˉ}{t} > t .$ Indeed,

\begin{matrix} ϝ (t, y, \overset{ˉ}{t}, - i p) \\ = exp (\frac{χ κ T}{ε^{2}} - (ϑ + 1) ln (\frac{- λ_{-} E_{+} + λ_{+} E_{-}}{2 ζ}) + \frac{2 λ_{+} λ_{-} (E_{+} - E_{-})}{ε^{2} (- λ_{-} E_{+} + λ_{+} E_{-})} y), \end{matrix}

(7.129)

where

\begin{matrix} λ_{\pm} & = μ \pm ζ, \\ μ & = - \frac{κ}{2}, ζ = \frac{\sqrt{κ^{2} - 2 ε^{2} p}}{2}, \\ λ_{+} λ_{-} & = \frac{ε^{2} p}{2} . \end{matrix}

(7.130)

Thus, when $ζ > 0$ is real:

\begin{matrix} ℐ_{p} = & {(\frac{2 ζ}{- μ \sinh (| ζ | T) + ζ \cosh (| ζ | T)})}^{(ϑ + 1)} \\ \exp (\frac{χ κ T}{ε^{2}} + p x + \frac{p \sinh (ζ T)}{(- μ \sinh (| ζ | T) + ζ \cosh (| ζ | T))} y) ​, \end{matrix}

(7.131)

and, when $ζ = i (ζ)$ is imaginary:

\begin{matrix} I_{p} = & {(\frac{(ζ)}{- μ sin ((ζ) T) + (ζ) cos ((ζ) T)})}^{(ϑ + 1)} \\ exp (\frac{χ κ T}{ε^{2}} + p x + \frac{p sin ((ζ) T)}{(- μ sin ((ζ) T) + (ζ) cos ((ζ) T))} y) . \end{matrix}

(7.132)

For $p \in (- \infty, \hat{p}),$ $ζ$ is real, and for $p \in (\hat{p}, \infty),$ it is imaginary. Here

\begin{matrix} \hat{p} = \frac{κ^{2}}{2 ε^{2}} > 0. \end{matrix}

(7.133)

There is no blowup when $ζ$ is real. When $ζ$ is imaginary, the blowup time $t^{*}$ is the smallest positive root of the equation

\begin{matrix} κ sin (\sqrt{2 ε^{2} p - κ^{2}} (t^{*} - t)) + \sqrt{2 ε^{2} p - κ^{2}} cos (\sqrt{2 ε^{2} p - κ^{2}} (t^{*} - t)) = 0, \end{matrix}

(7.134)

\begin{matrix} t^{*} = t + \frac{π - arctan (\frac{\sqrt{2 ε^{2} p - κ^{2}}}{κ})}{\sqrt{2 ε^{2} p - κ^{2}}} . \end{matrix}

(7.135)

It is clear that $I_{- 1}$ does not blow up. This fact in used in the next section.

The marginal distribution of $\overset{ˉ}{y},$ $ϖ^{(y)} (\overset{ˉ}{t}, \overset{ˉ}{y})$ is the standard Feller distribution given by (7.66).

7.4.3 Augmented Feller Process, II

This section studies the joint dynamics of an arithmetic Brownian ${\hat{x}}_{t}$ whose stochastic variance is driven by a Feller process ${\hat{y}}_{t},$ and considers the following system of affine SDEs:

\begin{matrix} d {\hat{x}}_{t} = \sqrt{{\hat{y}}_{t}} d {\hat{W}}_{t}, {\hat{x}}_{t} = x, \\ d {\hat{y}}_{t} = (χ - κ {\hat{y}}_{t}) d t + ε \sqrt{{\hat{y}}_{t}} d {\hat{Z}}_{t}, {\hat{y}}_{t} = y . \end{matrix}

(7.136)

Studying such a process is very helpful for finding option prices and solving other important problems in the financial engineering context.

The associated forward Fokker–Planck problem can be written as follows:

\begin{matrix} ϖ_{\overset{ˉ}{t}} - \frac{1}{2} \overset{ˉ}{y} ϖ_{\overset{ˉ}{x} \overset{ˉ}{x}} - ρ ε {(\overset{ˉ}{y} ϖ)}_{\overset{ˉ}{x} \overset{ˉ}{y}} - \frac{1}{2} ε^{2} {(\overset{ˉ}{y} ϖ)}_{\overset{ˉ}{y} \overset{ˉ}{y}} + {((χ - κ \overset{ˉ}{y}) ϖ)}_{\overset{ˉ}{y}} = 0, \\ ϖ (t, x, y, t, \overset{ˉ}{x}, \overset{ˉ}{y}) = δ (\overset{ˉ}{x} - x) δ (\overset{ˉ}{y} - y), \end{matrix}

(7.137)

while the backward Kolmogorov problem has the following form:

\begin{matrix} ϖ_{t} + \frac{1}{2} y ϖ_{x x} + ρ ε y ϖ_{x y} + \frac{1}{2} ε^{2} y ϖ_{y y} + (χ - κ y) ϖ_{y} = 0, \\ ϖ (\overset{ˉ}{t}, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) = δ (x - \overset{ˉ}{x}) δ (y - \overset{ˉ}{y}) . \end{matrix}

(7.138)

As before, concentrate on problem (7.138).

The Kelvin function $K$ has the form (7.102). The governing ODEs for $α, γ$ are as follows:

\begin{matrix} α_{t} (t, \overset{ˉ}{t}) + i χ γ (t, \overset{ˉ}{t}) = 0, α (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \\ i γ_{t} (t, \overset{ˉ}{t}) - \frac{1}{2} ε^{2} γ^{2} (t, \overset{ˉ}{t}) - (κ - i ρ ε k) i γ (t, \overset{ˉ}{t}) - \frac{1}{2} k^{2} = 0, γ (\overset{ˉ}{t}, \overset{ˉ}{t}) = l . \end{matrix}

(7.139)

Formulas (7.111)–(7.114) hold; however, the corresponding characteristic equation is

\begin{matrix} λ^{2} + (κ - i ρ ε k) λ - \frac{ε^{2}}{4} k^{2} = 0, \end{matrix}

(7.140)

so that

\begin{matrix} λ_{\pm} & = μ \pm ζ, \\ μ & = - \frac{1}{2} (κ - i ρ ε k), ζ = \frac{1}{2} \sqrt{ε^{2} {\overset{ˉ}{ρ}}^{2} k^{2} - 2 i ρ ε κ k + κ^{2}}, \\ λ_{+} λ_{-} & = μ^{2} - ζ^{2} = - \frac{ε^{2}}{4} k^{2}, \end{matrix}

(7.141)

where ${\overset{ˉ}{ρ}}^{2} = 1 - ρ^{2} .$ Subsequent calculations are very similar to the ones performed in the previous subsection, so they are omitted for brevity. The final expressions for $ϖ$ and $ϖ^{(x)}$ are given by Equations (7.122), (7.123), and (7.124), with $μ, ζ$ given by the equations in (7.141). These expressions are similar to the formulas originally derived by Lipton as part of his analysis of the Heston stochastic volatility model; see Reference LiptonLipton (2001).Footnote ⁹

A typical t.p.d.f. for a nondegenerate augmented Feller process is shown in Figure 12.

Figure 12 A thousand trajectories of a representative nondegenerate augmented Feller process. Parameters are $T = 3,$ $d t = 0.01,$ $χ = 0.2,$ $κ = 2.0,$ $ε = 0.2,$ $ρ = - 0.5,$ $x = 0$ , $y_{0} = 0.15 .$ (a) $x (t),$ (b) $y (t),$ (c) $(\overset{ˉ}{x} (T), \overset{ˉ}{y} (T)),$ (d) contour lines of $ϖ (0, 0.15, 0, T, \tilde{x}, \tilde{y}) .$ Author’s graphics.

As before, $ϖ^{(x)} (t, x, y, \overset{ˉ}{t}, \overset{ˉ}{x})$ has fat tails. Consider $I_{p} (t, \overset{ˉ}{t})$ given by (7.128). The corresponding $λ_{\pm}$ have the following form:

\begin{matrix} λ_{\pm} & = μ \pm ζ, \\ μ & = - \frac{1}{2} (κ - ρ ε p), ζ = \frac{1}{2} \sqrt{- ε^{2} {\overset{ˉ}{ρ}}^{2} p^{2} - 2 ρ ε κ p + κ^{2}}, \\ λ_{+} λ_{-} & = μ^{2} - ζ^{2} = \frac{ε^{2}}{4} p^{2} . \end{matrix}

(7.142)

Thus, when $ζ > 0$ is real,

\begin{matrix} I_{p} = & {(\frac{ζ}{- μ sinh (ζ T) + ζ cosh (ζ T)})}^{(ϑ + 1)} \\ exp (\frac{2 χ μ T}{ε^{2}} + p x + \frac{p (p - 1) sinh (ζ T)}{2 (- μ sinh (ζ T) + ζ cosh (ζ T))} y), \end{matrix}

(7.143)

and when $ζ = i (ζ)$ is imaginary,

\begin{matrix} I_{p} = & {(\frac{(ζ)}{- μ sin ((ζ) T) + (ζ) cos ((ζ) T)})}^{(ϑ + 1)} \\ exp (\frac{2 χ μ T}{ε^{2}} + p x + \frac{p (p - 1) sin ((ζ) T)}{2 (- μ sin ((ζ) T) + (ζ) cos ((ζ) T))} y) . \end{matrix}

(7.144)

One needs to determine when $ζ$ becomes imaginary. The corresponding quadratic equation has the form:

\begin{matrix} {\overset{ˉ}{ρ}}^{2} ε^{2} p^{2} + 2 ρ ε κ p - κ^{2} = 0, \end{matrix}

(7.145)

its roots are as follows:

\begin{matrix} p_{\pm} = \frac{- ρ ε κ \pm \sqrt{ρ^{2} ε^{2} κ^{2} + {\overset{ˉ}{ρ}}^{2} ε^{2} κ^{2}}}{{\overset{ˉ}{ρ}}^{2} ε^{2}} = \frac{(- ρ \pm 1) κ}{{\overset{ˉ}{ρ}}^{2} ε}, \end{matrix}

(7.146)

so that

\begin{matrix} p_{+} > 1, p_{-} < 1. \end{matrix}

(7.147)

For $p \in (p_{-}, p_{+}),$ $ζ$ is real, for $p \notin (p_{-}, p_{+}),$ it is imaginary. There is no blowup when $ζ$ is real. When $ζ$ is imaginary, the blowup time $t^{*}$ is the smallest positive root of the equation

\begin{matrix} - μ sin ((ζ) (t^{*} - t)) + (ζ) cos ((ζ) (t^{*} - t)) = 0, \end{matrix}

(7.148)

\begin{matrix} t^{*} = (\begin{matrix} t + \frac{arctan (\frac{(ζ)}{(μ)})}{(ζ)}, & μ > 0, \\ t + \frac{π - arctan (\frac{(ζ)}{(μ)})}{(ζ)}, & μ < 0. \end{matrix}) \end{matrix}

(7.149)

7.5 Example: Path-Dependent Process

Let ${\hat{y}}_{t}$ be a stochastic process and ${\hat{x}}_{t}$ be its moving average. Then

\begin{matrix} {\hat{x}}_{t} = κ \int_{- \infty}^{t} e^{- κ (t - s)} {\hat{y}}_{s} d s . \end{matrix}

(7.150)

A simple calculation yields

\begin{matrix} d {\hat{x}}_{t} = κ ({\hat{y}}_{t} - {\hat{x}}_{t}) d t . \end{matrix}

(7.151)

The process ${\hat{y}}_{t}$ is path-dependent, because its volatility ${\hat{σ}}_{t}$ depends on its moving average ${\hat{x}}_{t}$ :

\begin{matrix} {\hat{σ}}_{t} = \sqrt{a_{0} + a_{1} ({\hat{y}}_{t} - {\hat{x}}_{t})}, \end{matrix}

(7.152)

where $a_{0} > 0,$ $a_{1} < 0,$ in order to capture the effect of leverage. Thus, one can write the governing degenerate system of SDEs as follows:

\begin{matrix} d {\hat{x}}_{t} & = κ ({\hat{y}}_{t} - {\hat{x}}_{t}) d t, {\hat{x}}_{t} = x, \\ d {\hat{y}}_{t} & = \sqrt{a_{0} + a_{1} ({\hat{y}}_{t} - {\hat{x}}_{t})} d {\hat{W}}_{t}, {\hat{y}}_{t} = y . \end{matrix}

(7.153)

The Fokker–Planck and Kolmogorov problems are

\begin{matrix} ϖ_{\overset{ˉ}{t}} - \frac{1}{2} {((a_{0} + a_{1} (\overset{ˉ}{y} - \overset{ˉ}{x})) ϖ)}_{\overset{ˉ}{y} \overset{ˉ}{y}} + {(κ (\overset{ˉ}{y} - \overset{ˉ}{x}) ϖ)}_{\overset{ˉ}{x}} = 0, \\ ϖ (t, x, y, t, \overset{ˉ}{x}, \overset{ˉ}{y}) = δ (\overset{ˉ}{x} - x) δ (\overset{ˉ}{y} - y), \end{matrix}

(7.154)

\begin{matrix} ϖ_{t} + \frac{1}{2} (a_{0} + a_{1} (y - x)) ϖ_{y y} + κ (y - x) ϖ_{x} = 0, \\ ϖ (\overset{ˉ}{t}, x, y, \overset{ˉ}{t}, \overset{ˉ}{x}, \overset{ˉ}{y}) = δ (x - \overset{ˉ}{x}) δ (y - \overset{ˉ}{y}), \end{matrix}

(7.155)

respectively.

A representative Kelvin mode has the following form:

\begin{matrix} K = exp (α (t, \overset{ˉ}{t}) + i β (t, \overset{ˉ}{t}) x - i k \overset{ˉ}{x} + i γ (t, \overset{ˉ}{t}) y - i l \overset{ˉ}{y}) . \end{matrix}

(7.156)

The system of backward ODEs for $α, γ, β$ is as follows:

\begin{matrix} α_{t} (t, \overset{ˉ}{t}) - \frac{a_{0}}{2} γ^{2} (t, \overset{ˉ}{t}) = 0, α (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \\ i β_{t} (t, \overset{ˉ}{t}) + \frac{a_{1}}{2} γ^{2} (t, \overset{ˉ}{t}) - i κ β (t, \overset{ˉ}{t}) = 0, β (\overset{ˉ}{t}, \overset{ˉ}{t}) = k, \\ i γ_{t} (t, \overset{ˉ}{t}) - \frac{a_{1}}{2} γ^{2} (t, \overset{ˉ}{t}) + i κ β (t, \overset{ˉ}{t}) = 0, γ (\overset{ˉ}{t}, \overset{ˉ}{t}) = l . \end{matrix}

(7.157)

The equations in (7.157) are matrix Riccati equations, as opposed to the scalar Riccati equations considered earlier. In general, such equations are very difficult to solve. However, the case under consideration is one of the relatively rare instances when a matrix Riccati equation can be solved explicitly. Start with an observation:

\begin{matrix} γ_{t} (t, \overset{ˉ}{t}) + β_{t} (t, \overset{ˉ}{t}) = 0, \end{matrix}

(7.158)

so that

\begin{matrix} β (t, \overset{ˉ}{t}) = - γ (t, \overset{ˉ}{t}) + k + l . \end{matrix}

(7.159)

Accordingly,

\begin{matrix} i γ_{t} (t, \overset{ˉ}{t}) - \frac{a_{1}}{2} γ^{2} (t, \overset{ˉ}{t}) - i κ γ (t, \overset{ˉ}{t}) + i κ (k + l) = 0, γ (\overset{ˉ}{t}, \overset{ˉ}{t}) = l . \end{matrix}

(7.160)

One can use Equations (7.111)–(7.113) with $(β, k)$ replaced by $(γ, l),$ and

\begin{matrix} λ^{2} + κ λ + \frac{i a_{1} κ (k + l)}{2} = 0, \end{matrix}

(7.161)

so that

\begin{matrix} λ_{\pm} & = μ \pm ζ, \\ μ & = - \frac{κ}{2}, ζ = \frac{\sqrt{κ^{2} - 2 i a_{1} κ (k + l)}}{2} . \end{matrix}

(7.162)

Equation (7.159) yields

\begin{matrix} β (t, \overset{ˉ}{t}) = \frac{2 i Ω^{'} (t, \overset{ˉ}{t})}{a_{1} Ω (t, \overset{ˉ}{t})} + k + l, \end{matrix}

(7.163)

and

\begin{matrix} γ^{2} (t, \overset{ˉ}{t}) & = \frac{2 i}{a_{1}} (γ_{t} (t, \overset{ˉ}{t}) + κ β (t, \overset{ˉ}{t})) \\ = \frac{2 i}{a_{1}} (γ_{t} (t, \overset{ˉ}{t}) - κ γ (t, \overset{ˉ}{t}) + κ (κ + k)) . \end{matrix}

(7.164)

Thus,

\begin{matrix} α_{t} (t, \overset{ˉ}{t}) = \frac{i a_{0}}{a_{1}} (γ_{t} (t, \overset{ˉ}{t}) - κ γ (t, \overset{ˉ}{t}) + κ (k + l)), α (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0. \end{matrix}

(7.165)

Accordingly,

\begin{matrix} α (t, \overset{ˉ}{t}) = \frac{i a_{0}}{a_{1}} (γ (t, \overset{ˉ}{t}) - l) - \frac{2 a_{0} κ}{a_{1}^{2}} ln (Ω (t, \overset{ˉ}{t})) + \frac{a_{0} κ T}{a_{1}} i (k + l) . \end{matrix}

(7.166)

Finally,

\begin{matrix} K = & exp (α + i β (t, \overset{ˉ}{t}) x - i k \overset{ˉ}{x} + i γ (t, \overset{ˉ}{t}) y - i l \overset{ˉ}{y}) \\ = & exp (- \frac{2 a_{0} κ}{a_{1}^{2}} ln (Ω (t, \overset{ˉ}{t})) + i γ (t, \overset{ˉ}{t}) (y - x + \frac{a_{0}}{a_{1}})) \\ (+ i l (x - \overset{ˉ}{y} + \frac{a_{0}}{a_{1}} (κ T - 1)) + i k (x - \overset{ˉ}{x} + \frac{a_{0} κ T}{a_{1}})) . \end{matrix}

(7.167)

To make sure that ${\hat{σ}}_{t}$ given by (7.152) and the integrand (7.167) are well defined, it is assumed that

\begin{matrix} a_{0} + a_{1} (y - x) > 0, a_{0} + a_{1} (\overset{ˉ}{y} - \overset{ˉ}{x}) > 0. \end{matrix}

(7.168)

7.6 Example: OU-Like Process

This section considers several instances when an OU-inspired process becomes non-Gaussian. This can happen for a variety of reasons, such as effects of anomalous diffusion, the presence of jumps, effects of augmentation, and the likes.

7.6.1 Anomalous OU Process

This section considers a mean-reverting process driven by a non-Gaussian anomalous diffusion. For brevity, it is assumed that coefficients are time-independent. The fractional forward Fokker–Planck and backward Kolmogorov problems can be written as follows:

\begin{matrix} ϖ_{\overset{ˉ}{t}} + a {(- \frac{\partial^{2}}{\partial {\overset{ˉ}{y}}^{2}})}^{1 / 2} ϖ + {((χ - κ \overset{ˉ}{y}) ϖ)}_{\overset{ˉ}{y}} = 0, \\ ϖ (t, y, t, \overset{ˉ}{y}) = δ (\overset{ˉ}{y} - y), \end{matrix}

(7.169)

\begin{matrix} ϖ_{t} - a {(- \frac{\partial^{2}}{\partial y^{2}})}^{1 / 2} ϖ + (χ - κ y) ϖ_{y} = 0, \\ ϖ (\overset{ˉ}{t}, y, \overset{ˉ}{t}, \overset{ˉ}{y}) = δ (y - \overset{ˉ}{y}), \end{matrix}

(7.170)

respectively. Here $a > 0$ is the anomalous diffusion coefficient.

As before, one can use Kelvin waves to solve (7.170) by choosing a particular solution of the form (7.51). The corresponding $(α, γ)$ satisfy the following ODEs:

\begin{matrix} α_{t} (t, \overset{ˉ}{t}) - a (γ (t, \overset{ˉ}{t})) + i χ γ (t, \overset{ˉ}{t}) = 0, α (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \\ γ_{t} (t, \overset{ˉ}{t}) - κ γ (t, \overset{ˉ}{t}) = 0, γ (\overset{ˉ}{t}, \overset{ˉ}{t}) = l, \end{matrix}

(7.171)

so that

\begin{matrix} α (t, \overset{ˉ}{t}) & = - {\overset{ˉ}{B}}_{κ} (T) (a (l) - i χ l), \\ γ (t, \overset{ˉ}{t}) & = e^{- κ T} l . \end{matrix}

(7.172)

Accordingly,

\begin{matrix} ϖ (t, y, \overset{ˉ}{t}, \overset{ˉ}{y}) & = \frac{1}{2 π} \int_{- \infty}^{\infty} exp (- {\overset{ˉ}{B}}_{κ} (T) a (l) + ({\overset{ˉ}{B}}_{κ} (T) χ + e^{- κ T} y - \overset{ˉ}{y}) i l) d l \\ = \frac{1}{π} \frac{{\overset{ˉ}{B}}_{κ} (T) a}{({({\overset{ˉ}{B}}_{κ} (T) a)}^{2} + {(e^{- κ T} (y - \frac{χ}{κ}) - (\overset{ˉ}{y} - \frac{χ}{κ}))}^{2})} . \end{matrix}

(7.173)

Thus, in sharp contrast to the classical OU process, which is described by a Gaussian distribution, the fractional OU process is described by a Cauchy distribution. This distribution has fat tails and no first and second moments.

7.6.2 Non-Gaussian Augmented OU Process, I

On occasion, problems seemingly not of the type given by (7.11) can be cast in the proper form via a suitable trick. Consider, for example, the following system of SDEs:

\begin{matrix} d {\hat{x}}_{t} & = {\hat{y}}_{t}^{2} d t, {\hat{x}}_{t} = x, \\ d {\hat{y}}_{t} & = (χ - κ {\hat{y}}_{t}) d t + ε d {\hat{Z}}_{t}, {\hat{y}}_{t} = y . \end{matrix}

(7.174)

Superficially, it does not belong to the class of processes studied earlier. However, by introducing new variables $z_{1} = x,$ $z_{2} = y^{2},$ $z_{3} = y,$ one can augment the equations in (7.127) as follows:

\begin{matrix} d {\hat{z}}_{1, t} & = {\hat{z}}_{2, t} d t, {\hat{z}}_{1, t} = x \equiv z_{1}, \\ d {\hat{z}}_{2, t} & = (ε^{2} - 2 κ {\hat{z}}_{2, t} + 2 χ {\hat{z}}_{3, t}) + 2 ε {\hat{z}}_{3, t} d {\hat{Z}}_{t}, {\hat{z}}_{2, t} = y^{2} \equiv z_{2}, \\ d {\hat{z}}_{3, t} & = (χ - κ {\hat{z}}_{3, t}) d t + ε d {\hat{Z}}_{t}, {\hat{z}}_{3, t} = z_{3} \equiv y . \end{matrix}

(7.175)

These equations are “almost” in the suitable form. The only snag is that one cannot claim that ${\hat{z}}_{3, t} = \sqrt{{\hat{z}}_{2, t}}$ since ${\hat{z}}_{3, t}$ is not always positive.

The corresponding Fokker–Planck and Kolmogorov problems can be written as follows:

\begin{matrix} ϖ_{\overset{ˉ}{t}} - 2 ε^{2} {({\overset{ˉ}{z}}_{2} ϖ)}_{{\overset{ˉ}{z}}_{2} {\overset{ˉ}{z}}_{2}} - 2 ε^{2} {({\overset{ˉ}{z}}_{3} ϖ)}_{{\overset{ˉ}{z}}_{2} {\overset{ˉ}{z}}_{3}} - \frac{1}{2} ε^{2} ϖ_{{\overset{ˉ}{z}}_{3} {\overset{ˉ}{z}}_{3}} \\ + {\overset{ˉ}{z}}_{2} ϖ_{{\overset{ˉ}{z}}_{1}} + {((ε^{2} - 2 κ {\overset{ˉ}{z}}_{2} + 2 χ {\overset{ˉ}{z}}_{3}) ϖ)}_{{\overset{ˉ}{z}}_{2}} + {((χ - κ {\overset{ˉ}{z}}_{3}) ϖ)}_{{\overset{ˉ}{z}}_{3}} = 0, \end{matrix}

(7.176)

\begin{matrix} ϖ (t, x, y^{2}, y, t, {\overset{ˉ}{z}}_{1}, {\overset{ˉ}{z}}_{2}, {\overset{ˉ}{z}}_{3}) = δ ({\overset{ˉ}{z}}_{1} - x) δ ({\overset{ˉ}{z}}_{2} - y^{2}) δ ({\overset{ˉ}{z}}_{3} - y), \\ ϖ_{t} + 2 ε^{2} z_{2} ϖ_{z_{2} z_{2}} + 2 ε^{2} z_{3} ϖ_{z_{2} z_{3}} + \frac{1}{2} ε^{2} ϖ_{z_{3} z_{3}} \\ + z_{2} ϖ_{z_{1}} + (ε^{2} - 2 κ z_{2} + 2 x z_{3}) ϖ_{z_{2}} + (χ - κ z_{3}) ϖ_{z_{3}} = 0, \\ ϖ (\overset{ˉ}{t}, z_{1}, z_{2}, z_{3}, \overset{ˉ}{t}, {\overset{ˉ}{z}}_{1}, {\overset{ˉ}{z}}_{3}^{2}, {\overset{ˉ}{z}}_{3}) = δ (z_{1} - {\overset{ˉ}{z}}_{1}) δ (z_{2} - {\overset{ˉ}{z}}_{3}^{2}) δ (z_{3} - {\overset{ˉ}{z}}_{3}) . \end{matrix}

(7.177)

As usual, $K$ has the form:

\begin{matrix} K (t, \overset{ˉ}{t}, z, m) \\ = exp (α (t, \overset{ˉ}{t}) + i m_{1} (z_{1} - {\overset{ˉ}{z}}_{1}) + i δ_{2} (t, \overset{ˉ}{t}) z_{2} - i m_{2} {\overset{ˉ}{z}}_{3}^{2} + i δ_{3} (t, \overset{ˉ}{t}) z_{3} - i m_{3} {\overset{ˉ}{z}}_{3}) . \end{matrix}

(7.178)

The corresponding set of ODEs for $α, δ_{2}, δ_{3}$ is as follows:

\begin{matrix} α_{t} (t, \overset{ˉ}{t}) - \frac{ε^{2}}{2} δ_{3}^{2} (t, \overset{ˉ}{t}) + i ε^{2} δ_{2} (t, \overset{ˉ}{t}) + i χ δ_{3} (t, \overset{ˉ}{t}) = 0, α (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \\ i δ_{2}^{'} (t, \overset{ˉ}{t}) - 2 ε^{2} δ_{2}^{2} (t, \overset{ˉ}{t}) - 2 i κ δ_{2} (t, \overset{ˉ}{t}) + i m_{1} = 0, δ_{2} (\overset{ˉ}{t}, \overset{ˉ}{t}) = m_{2}, \\ i δ_{3}^{'} (t, \overset{ˉ}{t}) - 2 ε^{2} δ_{2} (t, \overset{ˉ}{t}) δ_{3} (t, \overset{ˉ}{t}) + 2 i χ δ_{2} v - i κ δ_{3} (t, \overset{ˉ}{t}) = 0, δ_{3} (\overset{ˉ}{t}, \overset{ˉ}{t}) = m_{3} . \end{matrix}

(7.179)

These are matrix Riccati equations.

Once again, the corresponding matrix Riccati equation can be solved explicitly. Since the second equation is separable and hence can be viewed as a scalar Riccati equation, one can start with a familiar ansatz and use Equations (7.111)–(7.113) with $(γ, l)$ replaced by $(δ_{2}, m_{2}),$ and the corresponding characteristic equation is as follows:

\begin{matrix} λ^{2} + 2 κ λ + 2 i ε^{2} m_{1} = 0, \end{matrix}

(7.180)

and its solutions have the familiar form:

\begin{matrix} λ_{\pm} & = μ \pm ζ, \\ μ & = - κ, ζ = \sqrt{κ^{2} - 2 i ε^{2} m_{1}} . \end{matrix}

(7.181)

To linearize the equations in (7.179) as a whole, use the following ansatz:

\begin{matrix} Ω & = E_{0} (ω_{+} E_{+} + ω_{-} E_{-}), \\ α & = - \frac{1}{2} ln (Ω) + \frac{E_{0} (a_{0} + a_{+} E_{+} + a_{-} E_{-})}{Ω} + g (\overset{ˉ}{t} - t), \\ δ_{2} & = \frac{E_{0} (b_{+} E_{+} + b_{-} E_{-})}{Ω}, δ_{3} = \frac{E_{0} (c_{0} + c_{+} E_{+} + c_{-} E_{-})}{Ω}, \end{matrix}

(7.182)

where $a_{0},$ $a_{\pm},$ $b_{0},$ $b_{\pm},$ and $g$ are constants to be determined. This ansatz is useful since terms proportional to $\sim E_{0}, E_{+}, E_{-}$ balance each other, which allows us to find the coefficients explicitly. Initial conditions complete the picture. The actual calculation is omitted for brevity. The result is as follows:

\begin{matrix} ω_{\pm} & = \mp \frac{(λ_{\mp} + 2 i ε^{2} m_{2})}{2 ζ}, b_{\pm} = \frac{i λ_{\pm} ω_{\pm}}{2 ε^{2}}, \\ c_{\pm} & = \pm \frac{i χ λ_{\pm} ω_{\pm}}{ε^{2} ζ}, c_{0} = m_{3} - c_{+} - c_{-}, g = \frac{χ^{2} λ_{+} λ_{-}}{2 ε^{2} ζ^{2}}, \\ a_{0} & = - \frac{i κ χ c_{0}}{ζ^{2}}, a_{\pm} = - ω_{\pm} a_{0} \mp (\frac{ε^{2} c_{0}^{2}}{4 ζ} + \frac{χ^{2} κ^{2} ω_{+} ω_{-}}{ε^{2} ζ^{3}}), \end{matrix}

(7.183)

where $λ_{\pm}$ are given by the equations in (7.181). These expressions can be substituted in the function $K$ to obtain the corresponding t.p.d.f.

7.6.3 Non-Gaussian Augmented OU Process, II

This section studies an affine process of the following form:

\begin{matrix} d {\hat{x}}_{t} = {\hat{y}}_{t} d {\hat{W}}_{t}, {\hat{x}}_{t} = x, \\ d {\hat{y}}_{t} = (χ - κ {\hat{y}}_{t}) d t + ε d {\hat{Z}}_{t}, {\hat{y}}_{t} = y . \end{matrix}

(7.184)

The killed process is studied in Section 8 in the context of the Stein–Stein model.

Precisely as before, one can introduce the new variables $z_{1} = x,$ $z_{2} = y^{2},$ $z_{3} = y,$ and expand the equations in (7.184) as follows:

\begin{matrix} d {\hat{z}}_{1, t} = {\hat{z}}_{3, t} d {\hat{W}}_{t}, {\hat{z}}_{1, t} = x \equiv z_{1}, \\ d {\hat{z}}_{2, t} = (ε^{2} - 2 κ {\hat{z}}_{2, t} + 2 χ {\hat{z}}_{3, t}) + 2 ε {\hat{z}}_{3, t} d {\hat{Z}}_{t}, {\hat{z}}_{2, t} = y^{2} \equiv z_{2}, \\ d {\hat{z}}_{3, t} = (χ - κ {\hat{z}}_{3, t}) d t + ε d {\hat{Z}}_{t}, {\hat{z}}_{3, t} = z_{3} \equiv y . \end{matrix}

(7.185)

It is clear that the equations in (7.185) are affine.

The corresponding Fokker–Planck and Kolmogorov problems can be written as follows:

\begin{matrix} ϖ_{\overset{ˉ}{t}} - \frac{1}{2} {\overset{ˉ}{z}}_{2} ϖ_{{\overset{ˉ}{z}}_{1} {\overset{ˉ}{z}}_{1}} - 2 ρ ε {({\overset{ˉ}{z}}_{2} ϖ)}_{{\overset{ˉ}{z}}_{1} {\overset{ˉ}{z}}_{2}} - ρ ε {({\overset{ˉ}{z}}_{3} ϖ)}_{{\overset{ˉ}{z}}_{1} {\overset{ˉ}{z}}_{3}} \\ - 2 ε^{2} {({\overset{ˉ}{z}}_{2} ϖ)}_{{\overset{ˉ}{z}}_{2} {\overset{ˉ}{z}}_{2}} - 2 ε^{2} {({\overset{ˉ}{z}}_{3} ϖ)}_{{\overset{ˉ}{z}}_{2} {\overset{ˉ}{z}}_{3}} - \frac{1}{2} ε^{2} ϖ_{{\overset{ˉ}{z}}_{3} {\overset{ˉ}{z}}_{3}} \\ + {((ε^{2} - 2 κ {\overset{ˉ}{z}}_{2} + 2 χ {\overset{ˉ}{z}}_{3}) ϖ)}_{{\overset{ˉ}{z}}_{2}} + {((χ - κ {\overset{ˉ}{z}}_{3}) ϖ)}_{{\overset{ˉ}{z}}_{3}} = 0, \end{matrix}

(7.186)

\begin{matrix} ϖ (t, x, y^{2}, y, t, {\overset{ˉ}{z}}_{1}, {\overset{ˉ}{z}}_{2}, {\overset{ˉ}{z}}_{3}) = δ ({\overset{ˉ}{z}}_{1} - x) δ ({\overset{ˉ}{z}}_{2} - y^{2}) δ ({\overset{ˉ}{z}}_{3} - y), \\ ϖ_{t} + \frac{1}{2} z_{2} ϖ_{z_{1} z_{1}} + 2 ρ ε z_{2} ϖ_{z_{1} z_{2}} + ρ ε z_{3} ϖ_{z_{1} z_{3}} \\ + 2 ε^{2} z_{2} ϖ_{z_{2} z_{2}} + 2 ε^{2} z_{3} ϖ_{z_{2} z_{3}} + \frac{1}{2} ε^{2} ϖ_{z_{3} z_{3}} \\ + (ε^{2} - 2 κ z_{2} + 2 χ z_{3}) ϖ_{z_{2}} + (χ - κ z_{3}) ϖ_{z_{3}} = 0, \\ ϖ (\overset{ˉ}{t}, z_{1}, z_{2}, z_{3}, \overset{ˉ}{t}, {\overset{ˉ}{z}}_{1}, {\overset{ˉ}{z}}_{3}^{2}, {\overset{ˉ}{z}}_{3}) = δ (z_{1} - {\overset{ˉ}{z}}_{1}) δ (z_{2} - {\overset{ˉ}{z}}_{3}^{2}) δ (z_{3} - {\overset{ˉ}{z}}_{3}) . \end{matrix}

(7.187)

One can use $K$ given by (7.178) and write the set of ODEs for $α, δ_{2}, δ_{3}$ as follows:

\begin{matrix} α_{t} (t, \overset{ˉ}{t}) - \frac{ε^{2}}{2} δ_{3}^{2} (t, \overset{ˉ}{t}) + i ε^{2} δ_{2} (t, \overset{ˉ}{t}) + i χ δ_{3} (t, \overset{ˉ}{t}) = 0, α (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \\ i δ_{2}^{'} (t, \overset{ˉ}{t}) - 2 ε^{2} δ_{2}^{2} (t, \overset{ˉ}{t}) - 2 i (κ - i ρ ε m_{1}) δ_{2} (t, \overset{ˉ}{t}) - \frac{1}{2} m_{1}^{2} = 0, δ_{2} (\overset{ˉ}{t}, \overset{ˉ}{t}) = m_{2}, \\ i δ_{3}^{'} (t, \overset{ˉ}{t}) - 2 ε^{2} δ_{2} (t, \overset{ˉ}{t}) δ_{3} (t, \overset{ˉ}{t}) + 2 i χ δ_{2} (t, \overset{ˉ}{t}) - i (κ - i ρ ε m_{1}) δ_{3} (t, \overset{ˉ}{t}) = 0, \\ δ_{3} (\overset{ˉ}{t}, \overset{ˉ}{t}) = m_{3} . \end{matrix}

(7.188)

As before, this system can be linearized and solved analytically, which was pointed out by Reference Stein and SteinStein and Stein (1991), Reference Schöbel and ZhuSchöbel and Zhu (1999). One can repeat the result obtained in the previous section verbatim, except for (7.181). The corresponding characteristic equation has the following form:

\begin{matrix} λ^{2} + 2 (κ - i ρ ε m_{1}) λ - ε^{2} m_{1}^{2} = 0, \end{matrix}

(7.189)

and its solutions can be written as follows:

\begin{matrix} λ_{\pm} & = μ \pm ζ, \\ μ & = - (κ - i ρ ε m_{1}), ζ = \sqrt{{\overset{ˉ}{ρ}}^{2} ε^{2} m_{1}^{2} - 2 i ρ ε κ m_{1} + κ^{2}} . \end{matrix}

(7.190)

The rest of the formal analysis is the same. But the asymptotic behavior of the t.p.d.f. is, of course, different.

8 Pricing of Financial Instruments

8.1 Background

The formulas derived in Sections 6 and 7 can be used to solve numerous problems of financial engineering within a consistent framework based on Kelvin waves. Here are some representative examples.

Payoffs of European options depend solely on the terminal value of $\overset{ˉ}{S} = {\hat{S}}_{\overset{ˉ}{t}}$ of the underlying price at the option’s maturity. The most common European options are calls and puts, but, on occasion, binary options and other types are traded as well. Since the hedging and speculation needs of market participants cannot be satisfied by European options alone, the whole industry emerged to design, price, and hedge the so-called exotic options, with payoffs depending on the entire price trajectory between inception and maturity.

Prices of the fundamental financial instruments, such as forwards and European calls and puts, depend on the underlying prices only at maturity. However, the prices of many other instruments depend on the entire underlying price history between the instrument’s inception and maturity. Typical examples are barrier, American, Asian, lookback, and passport options; see, for example, Reference Lipton-LifschitzLipton-Lifschitz (1999), Reference LiptonLipton (2001), and references therein. Moreover, the prices of bonds also depend on the history of the interest rates and credit spreads throughout their life. This section shows how to price some path-dependent financial instruments using the methodology developed in the previous sections.

8.2 The Underlying Processes

The original approach to modeling financial assets was developed by Bachelier, who assumed that prices ${\hat{S}}_{t}$ of such instruments are governed by an arithmetic Brownian motion; see Reference BachelierBachelier (1900):

\begin{matrix} d {\hat{S}}_{t} = r {\hat{S}}_{t} d t + \hat{σ} d {\hat{W}}_{t}, {\hat{S}}_{t} = S . \end{matrix}

(8.1)

Here, $r$ is the risk-neutralized drift, $\hat{σ}$ is the volatility, and ${\hat{W}}_{t}$ is a Wiener process; $r,$ $\hat{σ}$ are dimensional quantities, $(r) = T^{- 1},$ $(σ) = $ T^{- 1 / 2} .$ The process for ${\hat{S}}_{t}$ given by (8.1) is affine; in fact, it is an OU process with zero mean and mean-repulsion instead of mean-reversion.

Subsequently, the academic community concluded that using a geometric Brownian motion as a driver is more appropriate; see Reference BonessBoness (1964), Reference SamuelsonSamuelson (1965), Reference Black and ScholesBlack and Scholes (1973), and Reference MertonMerton (1973). At present, the basic assumption is that the price ${\hat{S}}_{t}$ of an underlying financial instrument follows a geometric Brownian motion process with constant coefficients:

\begin{matrix} \frac{d {\hat{S}}_{t}}{{\hat{S}}_{t}} = r d t + σ d {\hat{W}}_{t}, {\hat{S}}_{t} = S . \end{matrix}

(8.2)

Here, $r$ is the risk-neutralized drift, and $σ$ is the volatility. These are dimensional quantities, $(r) = T^{- 1},$ $(σ) = T^{- 1 / 2} .$

The choice between using the Bachelier and the Black–Scholes models often depends on the nature of the underlying asset and the market’s specific characteristics. Since the Bachelier model assumes that the underlying asset prices follow a normal distribution, it can be more appropriate for assets whose price changes are additive and can theoretically go below zero, like interest rates, some commodities, or certain types of bonds. Generally, the price movements of the underlying asset are relatively small for short periods, so the Bachelier model provides a good description of these movements. The Bachelier model is often used for pricing commodities, some interest-rate derivatives, and studying the optimal execution. In markets with relatively low volatility, the Bachelier model’s assumption of additive price movements can provide a better fit for pricing and hedging derivatives than the multiplicative approach of the Black–Scholes model.

It was realized, very soon after the seminal paper by Reference Black and ScholesBlack and Scholes (1973) was published, that in practice it provides a rather poor description of reality. Hence, considerable efforts were dedicated to developing more adequate models. Such models include the jump-diffusion, local volatility, path-dependent volatility, stochastic volatility, local-stochastic volatility, rough volatility, and culminate in the universal volatility model; see Reference MertonMerton (1976), Reference Stein and SteinStein and Stein (1991), Reference Bick and ReismanBick and Reisman (1993), Reference HestonHeston (1993), Reference Derman and KaniDerman and Kani (1994), Reference DupireDupire (1994), Reference RubinsteinRubinstein (1994), Reference Hobson and RogersHobson and Rogers (1998), Reference Jex, Henderson and WangJex et al. (1999), Reference LewisLewis (2000), Reference LiptonLipton (2000, Reference Lipton2001); Reference Boyarchenko and LevendorskiiBoyarchenko and Levendorsky (2002), Reference Hagan, Kumar, Lesniewski and WoodwardHagan et al. (2002), Reference LiptonLipton (2002), Reference BergomiBergomi (2015), Reference ReghaiReghai (2015), Reference Gatheral, Jaisson and RosenbaumGatheral et al. (2018), Reference Gershon, Lipton, Rosenbaum and WienerGershon et al. (2022), and references therein.

Replacing constant volatility for a geometric Brownian motion with stochastic volatility driven by a Feller process results in the popular Heston model; see Reference HestonHeston (1993). This model has numerous applications, particularly for pricing equity and foreign exchange derivatives. The governing SDEs are as follows:

\begin{matrix} \frac{d {\hat{S}}_{t}}{{\hat{S}}_{t}} & = r d t + \sqrt{{\hat{v}}_{t}} d {\hat{W}}_{t}, {\hat{S}}_{t} = S, \\ d {\hat{v}}_{t} & = (χ - κ {\hat{v}}_{t}) d t + ε \sqrt{{\hat{v}}_{t}} d {\hat{Z}}_{t}, {\hat{v}}_{t} = ν, \end{matrix}

(8.3)

where $d {\hat{W}}_{t} d {\hat{Z}}_{t} = ρ d t .$ The logarithmic change of variables, given by (8.3), yields the equations of (7.136).

Replacing constant volatility with stochastic volatility driven by an OU process results in the (less popular) Stein–Stein model; see Reference Schöbel and ZhuSchöbel and Zhu (1999); Reference Stein and SteinStein and Stein (1991). The corresponding SDEs have the form:

\begin{matrix} \frac{d {\hat{S}}_{t}}{{\hat{S}}_{t}} & = r d t + {\hat{σ}}_{t} d {\hat{W}}_{t}, {\hat{S}}_{t} = S, \\ d {\hat{σ}}_{t} & = (χ - κ {\hat{σ}}_{t}) d t + ε d {\hat{Z}}_{t}, {\hat{v}}_{t} = ν, \end{matrix}

(8.4)

Reference Stein and SteinStein and Stein (1991) considered the special case of zero correlation, $d {\hat{W}}_{t} d {\hat{Z}}_{t} = 0,$ while Reference Schöbel and ZhuSchöbel and Zhu (1999) studied the general case of arbitrary correlation, $d {\hat{W}}_{t} d {\hat{Z}}_{t} = ρ d t .$

Now, it is shown how to use formulas derived in Sections 6 and 7 in the context of financial engineering.

8.3 European Derivatives

8.3.1 Forwards, Calls, Puts, and Covered Calls

The most basic derivatives are forwards. Recall that a forward contract obligates the buyer (seller) to buy (to sell) an underlying asset for an agreed price at a specified future date. These contracts are not standardized and are traded over-the-counter (OTC), not on exchanges. Typical underlying assets are commodities, currencies, and financial instruments. The choice of an asset depends on the needs of the contracting parties. The price agreed upon in a forward contract is called the forward price. This price is derived based on the spot price of the underlying asset, adjusted for factors like time to maturity, interest rates, and dividends. Forward contracts are primarily used for hedging price fluctuations of the underlying asset or speculation. The payoff of a forward contract with maturity $\overset{ˉ}{t}$ and strike $K$ has the following form:

\begin{matrix} U^{(F)} (\overset{ˉ}{S}, K) = \overset{ˉ}{S} - K, \end{matrix}

(8.5)

where the strike is chosen in such a way that today’s price of the forward contract is equal to zero. This price can be found without knowing the actual stochastic process $\hat{S} .$ The hedging argument shows that the only way to deliver the price of a non-dividend-paying stock at maturity $\overset{ˉ}{t}$ is to buy it outright at inception $t .$ Similarly, to deliver the strike $K$ at time $\overset{ˉ}{t},$ one has to buy a zero coupon bond at time $t .$ Let $Z_{t, \overset{ˉ}{t}}$ be the price of a bond paying unity at maturity $\overset{ˉ}{t} .$ Then

\begin{matrix} F_{t, \overset{ˉ}{t}} \equiv K = \frac{S}{Z_{t, \overset{ˉ}{t}}} . \end{matrix}

(8.6)

In contrast to forwards, a European call option grants the holder the right, but imposes no obligation, to buy an underlying asset at the option maturity for a predetermined strike price. Similarly, a European put option grants the holder the right to sell an underlying asset. Theoretically, buyers utilize calls and puts to hedge future risks; however, they often buy options for speculative purposes. American options can be exercised at any time of the buyer’s choice before the option’s maturity. Bermudan options are exercisable at fixed times between their inception and maturity. A call option is a contract between two parties – a buyer and a seller. Typically, the buyer takes the long position on the underlying (i.e., she expects that at maturity, the underlying price will exceed the strike price) and does not hedge her position. On the other hand, the seller or writer of the option (typically a bank) does hedge and, hence, maintains a market-neutral position. The seller receives cash up-front but incurs potential liabilities at option maturity if the option is exercised. In contrast, the buyer pays money up front in exchange for the potential for future gains. For a put option, the buyer takes a short position, while the seller is still market-neutral.

Payoffs of call and put options with maturity $\overset{ˉ}{t}$ and strike $K$ have the form

\begin{matrix} U^{(C)} (\overset{ˉ}{S}, K) & = max (\overset{ˉ}{S} - K, 0), \\ U^{(P)} (\overset{ˉ}{S}, K) & = max (K - \overset{ˉ}{S}, 0), \\ U^{(C, P)} (\overset{ˉ}{S}, K) & = max (ϕ (\overset{ˉ}{S} - K), 0), \end{matrix}

(8.7)

where $ϕ = 1$ for a call, and $ϕ = - 1$ for a put. Put-call parity implies that their difference is linear in $\overset{ˉ}{S}$ and represents a forward contract:

\begin{matrix} U^{(C)} (\overset{ˉ}{S}, K) - U^{(P)} (\overset{ˉ}{S}, K) = \overset{ˉ}{S} - K . \end{matrix}

(8.8)

Several popular models, including Bachelier, Black–Scholes, Heston, and Stein–Stein, are considered below. While the Bachelier model is not scale invariant, all the other models are. A general driver for a scale-invariant model can be written as follows:

\begin{matrix} \frac{d {\hat{S}}_{t}}{{\hat{S}}_{t}} = r d t + σ_{t} d {\hat{W}}_{t} + υ d {\hat{Π}}_{t}, {\hat{S}}_{t} = S, \end{matrix}

(8.9)

where, potentially, the volatility ${\hat{σ}}_{t}$ and the intensity ${\hat{λ}}_{t}$ of the Poisson process ${\hat{Π}}_{t}$ are driven by SDEs of their own. For such models, it is convenient to decompose call and put payoffs (8.43) into parts, which are easier to study via Kevin waves; see Reference LiptonLipton (2001), Reference Lipton(2002). To this end, introduce the covered call with the payoff of the form

\begin{matrix} U^{(C C)} (\overset{ˉ}{S}, K) = min (\overset{ˉ}{S}, K) . \end{matrix}

(8.10)

The call and put payoffs can be decomposed as follows:

\begin{matrix} U^{(C)} (\overset{ˉ}{S}, K) = \overset{ˉ}{S} - U^{(C C)} (\overset{ˉ}{S}, K), U^{(P)} (\overset{ˉ}{S}, K) = K - U^{(C C)} (\overset{ˉ}{S}, K) \end{matrix}

(8.11)

Thus, the call price is the difference between the forward price and the covered call price, while the put price is the difference between the bond price and the covered call price. In both cases, the covered call is the source of optionality.

8.3.2 Black–Scholes Model

For the standard log-normal process, the backward pricing problem for covered calls can be written as follows:

\begin{matrix} U_{t} + \frac{1}{2} σ^{2} S^{2} U_{S S} + r U_{S} - r U = 0, \\ U (\overset{ˉ}{t}, S) = min (S, K) . \end{matrix}

(8.12)

It is helpful to rewrite it by using forward rather than spot prices:

\begin{matrix} {\hat{U}}_{t} + \frac{1}{2} σ^{2} F^{2} {\hat{U}}_{F F} = 0, \\ \hat{U} (\overset{ˉ}{t}, F) = min (F, K), \end{matrix}

(8.13)

where

\begin{matrix} {\hat{F}}_{t, \overset{ˉ}{t}} = e^{r (\overset{ˉ}{t} - t)} {\hat{S}}_{t}, \hat{U} (t, F) = e^{r (\overset{ˉ}{t} - t)} U (t, S) . \end{matrix}

(8.14)

Change of variables,

\begin{matrix} {\hat{F}}_{t, \overset{ˉ}{t}} \to {\hat{x}}_{t, \overset{ˉ}{t}}, {\hat{F}}_{t, \overset{ˉ}{t}} = K e^{{\hat{x}}_{t, \overset{ˉ}{t}}}, \end{matrix}

(8.15)

results in the following process for ${\hat{x}}_{t}$ :

\begin{matrix} d {\hat{x}}_{t, \overset{ˉ}{t}} = - \frac{1}{2} σ^{2} d t + σ d {\hat{W}}_{t}, {\hat{x}}_{t, \overset{ˉ}{t}} = x = ln (\frac{F_{t, \overset{ˉ}{t}}}{K}) . \end{matrix}

(8.16)

The t.p.d.f. for this process is Gaussian:

\begin{matrix} ϖ (t, x, \overset{ˉ}{t}, \overset{ˉ}{x}) = \frac{1}{\sqrt{2 π σ^{2} T}} exp (- \frac{{(\overset{ˉ}{x} - x + σ^{2} / 2 T)}^{2}}{2 σ^{2} T}) . \end{matrix}

(8.17)

Since the the nondimensional payoff of the covered call has the form

\begin{matrix} {\tilde{U}}^{(C C)} (x) = min (e^{x}, 1), \end{matrix}

(8.18)

where $\tilde{U} = \hat{U} / K,$ one obtains the following expression for ${\tilde{U}}^{(C C)}$ :

\begin{matrix} {\tilde{U}}^{(C C)} (t, x) = e^{x} N (- \frac{x}{σ \sqrt{T}} - \frac{σ \sqrt{T}}{2}) + N (\frac{x}{σ \sqrt{T}} - \frac{σ \sqrt{T}}{2}), \end{matrix}

(8.19)

where $N (.)$ is the cumulative normal function.

By using (8.19), one can represent call and put prices as follows:

\begin{matrix} {\hat{U}}^{(C, P)} (t, F_{T}) & = ϕ (F_{T} N (ϕ d_{+}) - K N (ϕ d_{-})), \\ d_{\pm} & = \frac{ln (F_{T} / K)}{σ \sqrt{T}} \pm \frac{σ \sqrt{T}}{2} . \end{matrix}

(8.20)

See Reference BlackBlack (1976).

Returning to the original variables, write the classical Reference Black and ScholesBlack and Scholes (1973) closed-form formula for the time $t$ prices of calls and puts in its original form:

\begin{matrix} U^{(C, P)} (t, S) & = ϕ (S N (ϕ d_{+}) - e^{- r T} K N (ϕ d_{-})), \\ d_{\pm} & = \frac{ln (e^{r T} S / K)}{σ \sqrt{T}} \pm \frac{σ \sqrt{T}}{2} . \end{matrix}

(8.21)

Further transforming,

\begin{matrix} \tilde{U}^{(C C)} (t, x) = e^{x / 2} V^{(C C)} (t, x), \end{matrix}

(8.22)

yields the following backward problem:

\begin{matrix} V_{t}^{(C C)} + \frac{1}{2} σ^{2} V_{x x}^{(C C)} - \frac{1}{8} σ^{2} V^{(C C)} = 0, \\ V^{(C C)} (\overset{ˉ}{t}, x) = e^{- (x) / 2}, \end{matrix}

(8.23)

with symmetric “peakon” payoff, which is proportional to the Laplace distribution density. This transform removes the drift in the $x$ direction at the expense of adding killing with intensity $σ^{2} / 8 .$ Equation (8.19) implies

\begin{matrix} V (t, x) = e^{x / 2} N (- \frac{x}{σ \sqrt{T}} - \frac{σ \sqrt{T}}{2}) + e^{- x / 2} N (\frac{x}{σ \sqrt{T}} - \frac{σ \sqrt{T}}{2}) . \end{matrix}

(8.24)

The Fourier transform of the “peakon” payoff yields

\begin{matrix} \int_{- \infty}^{\infty} e^{- (x) / 2 - i k x} d x = \frac{1}{k^{2} + 1 / 4} . \end{matrix}

(8.25)

By using this formula, one can derive an alternative expression for $U^{(C, P)}$ based on Kelvin waves; see Reference LiptonLipton (2002). It is clear that Kelvin waves associated with the killed arithmetic Brownian motion described by (8.16) are the standard Fourier waves of the following form:

\begin{matrix} K (t, x, k) = e^{- (k^{2} + 1 / 4) σ^{2} T / 2 + i k x} . \end{matrix}

(8.26)

Equations (8.25) and (8.26) yield the following alternative expression for the price of covered calls given by (8.24):

\begin{matrix} V^{(C C)} (t, x) = \frac{1}{2 π} \int_{- \infty}^{\infty} \frac{e^{- (k^{2} + 1 / 4) σ^{2} T / 2 + i k x}}{k^{2} + 1 / 4} d k . \end{matrix}

(8.27)

See Reference LiptonLipton (2002). Equation (8.27) is central for the subsequent developments. For a single strike, this formula is less efficient than its classical counterpart; however, for a set of strikes, it is faster, because all the prices can be computed in one go, via the Fast Fourier Transform.

As one shall see shortly, these formulas help to handle affine pricing models very naturally.

8.3.3 Heston Model

The transformed forward pricing problem for the Heston model with the “peakon” payoff has the following form:

\begin{matrix} V_{t}^{(C C)} + \frac{1}{2} y (V_{x x}^{(C C)} + 2 ρ ε V_{x y}^{(C C)} + ε^{2} V_{y y}^{(C C)}) \\ + (χ - \hat{κ} y) V_{y}^{(C C)} - \frac{y}{8} V^{(C C)} = 0, \\ V^{(C C)} (\overset{ˉ}{t}, x, y) = e^{- (x) / 2}, \end{matrix}

(8.28)

where $\hat{κ} = κ - ρ ε / 2 .$ Thus, one is dealing with the killed stochastic process given by the equations in (7.136). Adapting the corresponding equations to accommodate the updated mean-reversion rate and the presence of the killing term, one gets the following system of ODEs for the corresponding Kelvin wave parameters:

\begin{matrix} α_{t} (t, \overset{ˉ}{t}) + i χ γ (t, \overset{ˉ}{t}) = 0, α (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \\ i γ_{t} (t, \overset{ˉ}{t}) - \frac{1}{2} ε^{2} γ^{2} (t, \overset{ˉ}{t}) - (\hat{κ} - i ρ ε k) i γ (t, \overset{ˉ}{t}) - \frac{1}{2} (k^{2} + \frac{1}{4}) = 0, γ (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0. \end{matrix}

(8.29)

Formulas (7.111)–(7.114) are still applicable. However, the corresponding characteristic equation and its solution are:

\begin{matrix} λ^{2} + (\hat{κ} - i ρ ε k) λ - \frac{ε^{2}}{4} (k^{2} + \frac{1}{4}) = 0, \\ λ_{\pm} = μ \pm ζ, \\ μ = - \frac{(\hat{κ} - i ρ ε k)}{2}, ζ = \frac{\sqrt{{\overset{ˉ}{ρ}}^{2} ε^{2} k^{2} - 2 i ρ ε k + {\hat{κ}}^{2} + ε^{2} / 4}}{2} . \end{matrix}

(8.30)

It is convenient to write $(α, γ)$ as follows:

\begin{matrix} α (T, k) & = - \frac{2 χ}{ε^{2}} ((μ + ζ) T + ln (\frac{- μ + ζ + (μ + ζ) e^{- 2 ζ T}}{2 ζ})), \end{matrix}

(8.32)

\begin{matrix} γ (T, k) & = (k^{2} + \frac{1}{4}) \frac{i (1 - e^{- 2 ζ T})}{2 (- μ + ζ + (μ + ζ) e^{- 2 ζ T})} \equiv (k^{2} + \frac{1}{4}) i ς (T, k) . \end{matrix}

(8.33)

Hence, the price of the “peakon” has the following form:

\begin{matrix} V^{(C C)} (t, x, y) = \frac{1}{2 π} \int_{- \infty}^{\infty} \frac{e^{α (T, k) - (k^{2} + 1 / 4) ς (T, k) y + i k x}}{k^{2} + 1 / 4} d k . \end{matrix}

(8.34)

Equation (8.34) is frequently called the Lewis–Lipton formula; see, for example, Reference LewisLewis (2000), Reference LiptonLipton (2000), Reference LewisLewis (2001), Reference LiptonLipton (2001), Reference Lipton(2002); Reference SchmelzleSchmelzle (2010), Reference Janek, Kluge, Weron and WystupJanek et al. (2011).

The implied volatility surface generated by a representative Heston model is shown in Figure 13. Recall that the implied volatility $Σ (T, K)$ is the volatility one must substitute into the Black–Scholes formula to reproduce the market price of a call (or put) option with maturity $T$ and strike $K .$ Thus, the deviation of the volatility surface from the flat surface $Σ (T, K) = Σ_{0}$ shows how far a given market (or model) is from the idealized Black–Scholes framework.

Figure 13 A representative implied volatility surface generated by the Heston model. Parameters are the same as in Figure 12. Author’s graphics.

8.3.4 Stein–Stein Model

The transformed forward pricing problem for the Stein–Stein model with the “peakon” payoff has the following form:

\begin{matrix} V_{t}^{(C C)} + \frac{1}{2} z_{2} V_{z_{1} z_{1}}^{(C C)} + 2 ρ ε z_{2} V_{z_{1} z_{2}}^{(C C)} + ρ ε z_{3} V_{z_{1} z_{3}}^{(C C)} \\ + 2 ε^{2} z_{2} V_{z_{2} z_{2}}^{(C C)} + 2 ε^{2} z_{3} V_{z_{2} z_{3}}^{(C C)} + \frac{1}{2} ε^{2} V_{z_{3} z_{3}}^{(C C)} \\ + (ε^{2} - 2 \hat{κ} z_{2} + 2 χ z_{3}) V_{z_{2}}^{(C C)} + (χ - \hat{κ} z_{3}) V_{z_{3}}^{(C C)} - \frac{z_{2}}{8} V^{(C C)} = 0, \\ V^{(C C)} (\overset{ˉ}{t}, z_{1}, z_{2}, z_{3}) = e^{- (z_{1}) / 2}, \end{matrix}

(8.35)

which corresponds to the killed stochastic process described by the equations in (7.184). By incorporating the killing term, one gets the following set of ODEs for the Kelvin wave parameters

\begin{matrix} α_{t} (t, \overset{ˉ}{t}) & - \frac{ε^{2}}{2} δ_{3}^{2} (t, \overset{ˉ}{t}) + i ε^{2} δ_{2} (t, \overset{ˉ}{t}) + i χ δ_{3} (t, \overset{ˉ}{t}) = 0, α (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \\ i δ_{2}^{'} (t, \overset{ˉ}{t}) & - 2 ε^{2} δ_{2}^{2} (t, \overset{ˉ}{t}) - 2 i (\hat{κ} - i ρ ε m_{1}) δ_{2} (t, \overset{ˉ}{t}) \\ - \frac{1}{2} (m_{1}^{2} + \frac{1}{4}) = 0, δ_{2} (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \\ i δ_{3}^{'} (t, \overset{ˉ}{t}) & - 2 ε^{2} δ_{2} (t, \overset{ˉ}{t}) δ_{3} (t, \overset{ˉ}{t}) + 2 i χ δ_{2} (t, \overset{ˉ}{t}) \\ - i (\hat{κ} - i ρ ε m_{1}) δ_{3} (t, \overset{ˉ}{t}) = 0, δ_{3} (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0. \end{matrix}

(8.36)

The corresponding solution has the form given by the equations in (7.182) with

\begin{matrix} λ_{\pm} = μ \pm ζ, \\ μ = - (\hat{κ} - i ρ ε m_{1}), ζ = \sqrt{{\overset{ˉ}{ρ}}^{2} ε^{2} m_{1}^{2} - 2 i ρ ε κ m_{1} + κ^{2} + ε^{2} / 4}, \\ ω_{\pm} = \mp \frac{λ_{\mp}}{2 ζ}, b_{\pm} = \frac{i λ_{\pm} ω_{\pm}}{2 ε^{2}}, \\ c_{\pm} = \pm \frac{i χ λ_{\pm} ω_{\pm}}{ε^{2} ζ}, c_{0} = - c_{+} - c_{-}, g = \frac{χ^{2} λ_{+} λ_{-}}{2 ε^{2} ζ^{2}}, \\ a_{0} = - \frac{i κ χ c_{0}}{ζ^{2}}, a_{\pm} = - ω_{\pm} a_{0} \mp (\frac{ε^{2} c_{0}^{2}}{4 ζ} + \frac{χ^{2} κ^{2} ω_{+} ω_{-}}{ε^{2} ζ^{3}}) . \end{matrix}

(8.37)

The generic expression for the price of the “peakon” has the following form:

\begin{matrix} V^{(C C)} (t, z_{1}, z_{3}^{2}, z_{3}) = \frac{1}{2 π} \int_{- \infty}^{\infty} \frac{e^{α (T, m_{1}) + i δ_{2} (T, m_{1}) z_{3}^{2} + i δ_{3} (T, m_{1}) z_{3} + i m_{1} z_{1}}}{m_{1}^{2} + 1 / 4} d m_{1} . \end{matrix}

(8.38)

It is clear that this price is a function of $\overset{ˉ}{t},$ $z_{1},$ $z_{3} .$

8.3.5 Path-Dependent Volatility Model

Reference Hobson and RogersHobson and Rogers (1998) initially proposed path-dependent volatility models; subsequently, they were studied by many authors; see Reference DavisDavis (2004), Reference Di Francesco and PascucciDi Francesco and Pascucci (2004, Reference Di Francesco and Pascucci2005); Guyon Reference Guyon(2014), and Reference Lipton and ReghaiLipton and Reghai (2023), among others. They present a viable alternative to the more popular local volatility models developed by Reference Bick and ReismanBick and Reisman (1993), Reference Derman and KaniDerman and Kani (1994), Reference DupireDupire (1994), and Reference RubinsteinRubinstein (1994).

The main advantage of path-dependent volatility models compared to their local volatility brethren is that the former deal with volatility functions depending on a nondimensional argument, such as ${\hat{S}}_{t} / {\hat{A}}_{t},$ where ${\hat{S}}_{t}$ is the stock price, and ${\hat{A}}_{t}$ is its average, say, $σ = σ ({\hat{S}}_{t} / {\hat{A}}_{t}),$ while the latter use volatilities depending on a dimensional argument ${\hat{S}}_{t},$ $σ = σ ({\hat{S}}_{t}),$ which is conceptually unsound and results in model dynamics deviating from the one observed in the market. The problem with path-dependent models is that building an analytically tractable path-dependent model is exceedingly tricky, so gaining the necessary intuition or benchmarking numerical solutions is complicated. However, this section develops such a model using results derived in Section 7.3.

Here, an original path-dependent model with a semianalytical solution is presented for the first time. The dynamics is adapted from Section 7.3, Equation (7.153) as follows:

\begin{matrix} {\hat{A}}_{t} & = exp (κ \int_{- \infty}^{t} e^{- κ (t - t^{'})} ln {\hat{S}}_{t^{'}} d t^{'}), {\hat{A}}_{t} = A, \\ \frac{d {\hat{S}}_{t}}{{\hat{S}}_{t}} & = \sqrt{c_{0} + c_{1} ln (\frac{{\hat{S}}_{t}}{A_{t}})} d {\hat{W}}_{t}, {\hat{S}}_{t} = S . \end{matrix}

(8.39)

It is not necessary to describe in detail how ${\hat{S}}_{\tilde{t}},$ and, hence, ${\hat{A}}_{\tilde{t}},$ behave when $\tilde{t} < t,$ since it becomes unimportant provided that $κ T$ is sufficiently large. For instance, one can assume that ${\hat{S}}_{\tilde{t}} \equiv A,$ when $\tilde{t} < t,$ then $A = S .$ Additionally, it is assumed that $r = 0,$ so that spot and forward prices coincide, ${\hat{S}}_{t} = {\hat{F}}_{t, \overset{ˉ}{t}} .$

In logarithmic variables ${\hat{x}}_{t} = ln ({\hat{A}}_{t}),$ ${\hat{y}}_{t} = ln ({\hat{S}}_{t}),$ the equations in (8.39) assume the form given by the equations in (7.153). Accordingly, the pricing equation for the path-dependent model with the symmetric “peakon” payoff can be written as follows:

\begin{matrix} V_{t}^{(C C)} + \frac{1}{2} (a_{0} + a_{1} (y - x)) (V_{y y}^{(C C)} - \frac{1}{4} V^{(C C)}) + κ (y - x) V_{x}^{(C C)} = 0, \\ V^{(C C)} (\overset{ˉ}{t}, x, y) = e^{- (y) / 2} . \end{matrix}

(8.40)

The Kelvin wave parameters are governed by the equations of the following form:

\begin{matrix} α_{t} (t, \overset{ˉ}{t}) - \frac{a_{0}}{2} (γ^{2} (t, \overset{ˉ}{t}) + \frac{1}{4}) = 0, α (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \\ i β_{t} (t, \overset{ˉ}{t}) + \frac{a_{1}}{2} (γ^{2} (t, \overset{ˉ}{t}) + \frac{1}{4}) - i κ β (t, \overset{ˉ}{t}) = 0, β (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \\ i γ_{t} (t, \overset{ˉ}{t}) - \frac{a_{1}}{2} (γ^{2} (t, \overset{ˉ}{t}) + \frac{1}{4}) + i κ β (t, \overset{ˉ}{t}) = 0, γ (\overset{ˉ}{t}, \overset{ˉ}{t}) = l . \end{matrix}

(8.41)

8.3.6 Bachelier Model

In the Bachelier model, the corresponding discounted t.p.d.f. is given by a modified (6.96):

\begin{matrix} ϖ (t, S, \overset{ˉ}{t}, \overset{ˉ}{S}) = \frac{1}{\sqrt{2 π Σ^{2} (t, \overset{ˉ}{t})}} exp (- \frac{{(\overset{ˉ}{S} - F_{T})}^{2}}{2 Σ^{2} (t, \overset{ˉ}{t})}), \end{matrix}

(8.42)

where

\begin{matrix} Σ^{2} (t, \overset{ˉ}{t}) = \frac{{\hat{σ}}^{2} (e^{2 r T} - 1)}{2 r} . \end{matrix}

(8.43)

By virtue of (8.7), one can price European calls and puts as follows:

\begin{matrix} V (t, F_{T}) = e^{- r T} (ϕ (F_{T} - K) N (ϕ \frac{F_{T} - K}{Σ (T)}) + Σ (T) n (\frac{F_{T} - K}{Σ (T)})), \end{matrix}

(8.44)

or, in spot terms:

\begin{matrix} V (t, S) = ϕ (S - e^{- r T} K) N (ϕ \frac{S - e^{- r T} K}{\tilde{Σ} (T)}) + \tilde{Σ} (T) n (\frac{S - e^{- r T} K}{\tilde{Σ} (T)}), \end{matrix}

(8.45)

where

\begin{matrix} {\tilde{Σ}}^{2} (T) = \frac{{\hat{σ}}^{2} (1 - e^{- 2 r T})}{2 r} . \end{matrix}

(8.46)

See Reference BachelierBachelier (1900), Reference Schachermayer and TeichmannSchachermayer & Teichmann (2008), and Reference TerakadoTerakado (2019) for further details.

8.4 Asian Options with Arithmetic and Geometric Averaging

The most basic path-dependent options are fixed strike Asian calls and puts, whose payoff depends on the underlying value averaged between the inception and maturity. Such options are popular for commodity and energy trading and in many other circumstances. The average ${\hat{A}}_{t, \overset{ˉ}{t}}$ on the interval $(t, \overset{ˉ}{t})$ can be defined in several ways. The simplest and, as a result, the most popular is an arithmetic average:

\begin{matrix} {\hat{A}}_{t, \overset{ˉ}{t}} = \frac{1}{T} \int_{t}^{\overset{ˉ}{t}} {\hat{S}}_{s} d s . \end{matrix}

(8.47)

A less frequent, but technically easier to deal with, alternative is a geometric average:

\begin{matrix} {\hat{A}}_{t, \overset{ˉ}{t}} = exp (\frac{1}{T} \int_{t}^{\overset{ˉ}{t}} ln ({\hat{S}}_{s}) d s) . \end{matrix}

(8.48)

The payoff of an Asian option with maturity $\overset{ˉ}{t}$ and fixed strike $K$ is

\begin{matrix} U ({\overset{ˉ}{A}}_{t, \overset{ˉ}{t}}) = max (ϕ ({\overset{ˉ}{A}}_{t, \overset{ˉ}{t}} - K), 0), \end{matrix}

(8.49)

as before, $ϕ = 1$ for a call, and $ϕ = - 1$ for a put. For the floating strike, the payoff is

\begin{matrix} U ({\overset{ˉ}{S}}_{\overset{ˉ}{t}}, {\overset{ˉ}{A}}_{t, \overset{ˉ}{t}}) = max (ϕ ({\overset{ˉ}{S}}_{\overset{ˉ}{t}} - k {\overset{ˉ}{A}}_{t, \overset{ˉ}{t}}), 0), \end{matrix}

(8.50)

where the nondimensional parameter $k$ is called weighting; typically, $k = 1 .$

Start with the Bachelier model. Equations for pricing Asian Options with an arithmetic average are as follows:

\begin{matrix} d {\hat{A}}_{t} & = {\hat{S}}_{t} d t, {\hat{A}}_{t} = 0, \\ d {\hat{S}}_{t} & = r {\hat{S}}_{t} d t + σ d {\hat{W}}_{t}, {\hat{S}}_{t} = S . \end{matrix}

(8.51)

Thus, (6.114) and (6.115) are applicable. All one needs is the marginal distribution for ${\overset{ˉ}{A}}_{t, \overset{ˉ}{t}},$ which is Gaussian:

\begin{matrix} ϖ (\overset{ˉ}{A}) \sim N (R, Σ^{2}), \end{matrix}

(8.52)

where

\begin{matrix} R = B_{- r} (T) S, Σ^{2} = \frac{σ^{2}}{r} (B_{0} (T) - 2 B_{- r} (T) + B_{- 2 r} (T)) . \end{matrix}

(8.53)

Consider the discounted payoff of the Asian call option (say):

\begin{matrix} U (t, \overset{ˉ}{A}) = {(\frac{\overset{ˉ}{A}}{T} - K)}_{+} . \end{matrix}

(8.54)

The corresponding calculation is straightforward:

\begin{matrix} U (t, S) & = e^{- r T} \int_{T K}^{\infty} \frac{(\frac{\overset{ˉ}{A}}{T} - K) e^{- \frac{{(\overset{ˉ}{A} - R)}^{2}}{2 Σ^{2}}}}{\sqrt{2 π Σ^{2}}} d \overset{ˉ}{A} \\ = \frac{e^{- r T} Σ}{T} \int_{\frac{(T K - R)}{Σ}}^{\infty} \frac{η e^{- \frac{η^{2}}{2}}}{\sqrt{2 π}} d η - \frac{e^{- r T} (T K - R)}{T} \int_{\frac{(T K - R)}{Σ}}^{\infty} \frac{e^{- \frac{η^{2}}{2}}}{\sqrt{2 π}} d η \\ = \frac{e^{- r T} Σ}{T} n (\frac{R - T K}{Σ}) - \frac{e^{- r T} (T K - R)}{T} N (\frac{R - T K}{Σ}) . \end{matrix}

(8.55)

Analytical pricing of Asian options with arithmetic averaging for the Black–Scholes model is notoriously tricky; see Reference Geman and EydelandGeman and Eydeland (1995), Reference Rogers and ShiRogers and Shi (1995), and Reference LiptonLipton (1999, Reference Lipton2001). At the same time, pricing Asian options with geometric averaging can be done quickly; see Reference Barucci, Polidoro and VespriBarrucci et al. (2001), Reference LiptonLipton (2001), and Reference Di Francesco and PascucciDi Francesco and Pascucci (2005) , and references therein. Such options can be priced using formula (6.45) derived in Section 6. An alternative approach based on the path integral method is discussed in Reference Devreese, Lemmens and TempereDevreese et al. (2010). Define

\begin{matrix} {\hat{x}}_{t} = \int_{t}^{t} {\hat{y}}_{s} d s, {\hat{y}}_{t} = ln ({\hat{S}}_{t}) . \end{matrix}

(8.56)

Then

\begin{matrix} d {\hat{x}}_{t} & = {\hat{y}}_{t} d t, {\hat{x}}_{t} = 0, \\ d {\hat{y}}_{t} & = (r - \frac{σ^{2}}{2}) d t + σ d {\hat{W}}_{t}, {\hat{y}}_{t} = ln ({\hat{S}}_{t}) \equiv y . \end{matrix}

(8.57)

The value of the option can be written as follows:

\begin{matrix} U (t, S) = e^{- r T} \int_{x^{*}}^{ϕ \infty} ϖ (\overset{ˉ}{x}) (exp (\frac{\overset{ˉ}{x}}{T}) - exp (ln K)) d \overset{ˉ}{x}, \end{matrix}

(8.58)

where

\begin{matrix} x^{*} = T ln K . \end{matrix}

(8.59)

Since (8.57) is a special case of (6.74), one can use the equations in (6.81) to obtain the marginal distribution for $\overset{ˉ}{x},$ which is a Gaussian distribution of the form:

\begin{matrix} ϖ (\overset{ˉ}{x}) & = \frac{exp (- \frac{{(\overset{ˉ}{x} - p)}^{2}}{2 σ_{x}^{2}})}{\sqrt{2 π σ_{x}^{2}}}, \\ σ_{x}^{2} & = \frac{σ^{2} T^{3}}{3}, p = ln (S) T + \frac{1}{2} (r - \frac{σ^{2}}{2}) T^{2} . \end{matrix}

(8.60)

Thus,

\begin{matrix} U (t, S) = J_{1} (t, S) - J_{2} (t, S), \end{matrix}

(8.61)

where

\begin{matrix} J_{1} (t, S) & = e^{- r T} \int_{x^{*}}^{ϕ \infty} \frac{exp (- \frac{{(\overset{ˉ}{x} - p)}^{2}}{2 σ_{x}^{2}} + \frac{\overset{ˉ}{x}}{T})}{\sqrt{2 π σ_{x}^{2}}} d \overset{ˉ}{x} = ϕ e^{- \frac{1}{2} (r + \frac{σ^{2}}{6}) T} S N (ϕ d_{+}), \\ J_{2} (t, S) & = e^{- r T} \int_{x^{*}}^{ϕ \infty} \frac{exp (- \frac{{(\overset{ˉ}{x} - p)}^{2}}{2 σ_{x}^{2}} + ln (K))}{\sqrt{2 π σ_{x}^{2}}} d \overset{ˉ}{x} = ϕ e^{- r T} K N (ϕ d_{-}), \end{matrix}

(8.62)

where

\begin{matrix} d_{\pm} = \frac{ln (S / K) + \frac{1}{2} (r - \frac{σ^{2}}{6} \pm \frac{σ^{2}}{3}) T}{\sqrt{σ^{2} T / 3}} . \end{matrix}

(8.63)

Finally, one obtains a well-known formula for the price of a fixed strike Asian option with geometric averaging:

\begin{matrix} U (t, S) = ϕ (e^{- \frac{1}{2} (r + \frac{σ^{2}}{6}) T} S N (ϕ d_{+}) - e^{- r T} K N (ϕ d_{-})) . \end{matrix}

(8.64)

Of course, a similar formula holds when $r, σ$ are time-dependent. The derivation, although very simple, seems to be new.

8.5 Volatility and Variance Swaps and Swaptions

8.5.1 Volatility Swaps and Swaptions

Recall that the Stein–Stein stochastic volatility model assumes that the volatility is driven by an OU process; see Reference Stein and SteinStein and Stein (1991). One needs to find Green’s function associated with the following augmented SDEs:

\begin{matrix} d {\hat{x}}_{t} & = {\hat{y}}_{t} d t, {\hat{x}}_{t} = 0, \\ d {\hat{y}}_{t} & = (χ^{(V o l)} - κ^{(V o l)} {\hat{y}}_{t}) d t + ε^{(V o l)} d {\hat{W}}_{t}, {\hat{y}}_{t} = y^{(V o l)}, \end{matrix}

(8.65)

or, equivalently,

\begin{matrix} d {\hat{x}}_{t} & = {\hat{y}}_{t} d t, {\hat{x}}_{t} = 0, \\ d {\hat{y}}_{t} & = κ^{(V o l)} (θ^{(V o l)} - {\hat{y}}_{t}) d t + ε^{(V o l)} d {\hat{W}}_{t}, {\hat{y}}_{t} = y^{(V o l)}, \end{matrix}

(8.66)

which describe the evolution of the volatility ${\hat{σ}}_{t} \equiv {\hat{y}}_{t}$ and its integral ${\hat{x}}_{t}$ ; the equations of (8.65) are identical to the equations of (6.98).

It can be shown that the pair $(\overset{ˉ}{x}, \overset{ˉ}{y})$ has the bivariate Gaussian distribution with the covariance matrix $H$ given by (6.113), and mean $(p, q)$ given by (6.114):

\begin{matrix} (\begin{matrix} p \\ q \end{matrix}) = (\begin{matrix} T θ^{(V o l)} + {\overset{ˉ}{B}}_{κ^{(V o l)}} (T) (y^{(V o l)} - θ^{(V o l)}) \\ θ^{(V o l)} + A_{κ^{(V o l)}} (T) (y^{(V o l)} - θ^{(V o l)}) \end{matrix}) . \end{matrix}

(8.67)

Since the marginal distribution of ${\hat{x}}_{t}$ given by (6.115 ) is Gaussian, the fair strike of a volatility swap with maturity $t$ is simply the expected value of ${\hat{x}}_{t} / T$ :

\begin{matrix} V o l S w a p = θ^{(V o l)} + (y^{(V o l)} - θ^{(V o l)}) \frac{{\overset{ˉ}{B}}_{κ^{(V o l)}} (T)}{T} . \end{matrix}

(8.68)

Here

\begin{matrix} (V o l S w a p) = (\frac{χ^{(V o l)}}{κ^{(V o l)}}) = (θ^{(V o l)}) = (y) = \frac{1}{T^{1 / 2}} . \end{matrix}

(8.69)

Of course, one can calculate the expected value of ${\hat{x}}_{t} / T$ via more straightforward means. To this end, (8.68) can be derived directly by taking expectations of SDE (8.65). However, as we shall see in the following subsection, (6.115) for the marginal distribution $ϖ^{(x)} (t, y^{(V o l)}, \overset{ˉ}{t}, \overset{ˉ}{x})$ allows one to solve more interesting problems, such as calculating prices of bonds and bond options; see the discussion that follows.

Moreover, by using this equation, one can price volatility swaptions with payoffs of the form:

\begin{matrix} U (\overset{ˉ}{t}, \overset{ˉ}{x}) = max (ϕ (\overset{ˉ}{x} - x^{*}), 0) . \end{matrix}

(8.70)

The price $U (t, y^{(V o l)})$ becomes:

\begin{matrix} U (t, y^{(V o l)}) & = e^{- r T} ϕ \int_{x^{*}}^{ϕ \infty} (\overset{ˉ}{x} - x^{*}) ϖ^{(x)} (t, y^{(V o l)}, \overset{ˉ}{t}, \overset{ˉ}{x}) d \overset{ˉ}{x} \\ = \frac{e^{- r T} ϕ}{\sqrt{2 π h_{0} (t, \overset{ˉ}{t})}} \int_{x^{*}}^{ϕ \infty} (\overset{ˉ}{x} - x^{*}) exp (\frac{{(\overset{ˉ}{x} - p)}^{2}}{2 h_{0}}) d \overset{ˉ}{x} \\ = e^{- r T} (ϕ (p - x^{*}) N (ϕ \frac{(p - x^{*})}{\sqrt{h_{0}}}) + \sqrt{h_{0}} n (\frac{(p - x^{*})}{\sqrt{h_{0}}})) . \end{matrix}

(8.71)

It is clear that formula (8.71) is a variant of the Bachelier formula (8.44).

8.5.2 Variance Swaps and Swaptions

In contrast to volatility, which, despite common misconceptions, can be negative, variance must be nonnegative since it is a square of a real-valued quantity. Accordingly, the easiest way to model it is by using the augmented Feller process with $ϑ > 0$ ; see (7.99).

Using (7.127), one can immediately obtain the following expression for the fair value of a variance swap for the Feller process:

\begin{matrix} V a r S w a p = θ^{(V a r)} + (y^{(V a r)} - θ^{(V a r)}) \frac{{\overset{ˉ}{B}}_{κ^{(V a r)}} (T)}{T}, \end{matrix}

(8.72)

where $θ^{(V a r)} = χ^{(V a r)} / κ^{(V a r)} .$ Here

\begin{matrix} (V a r S w a p) = (θ^{(V a r)}) = (y^{(V a r)}) = \frac{1}{T} . \end{matrix}

(8.73)

While formulas (8.68) and (8.72) look the same but deal with the volatility and variance, respectively, the corresponding parameters have different meanings.

Alternatively, one can use the degenerate augmented OU process, see the equations of (7.174). Averaging away stochastic terms, one gets the following formula for the fair price of the variance swap:

\begin{matrix} V a r S w a p = {(θ^{(V o l)})}^{2} + ({(y^{(V o l)})}^{2} - {(θ^{(V o l)})}^{2}) \frac{{\overset{ˉ}{B}}_{κ^{(V o l)}} (T)}{T} . \end{matrix}

(8.74)

It is clear that Equations (8.72) and (8.74) provide different fair values for a variance swap, although these values asymptotically agree. This fact reflects the so-called model risk – by using different models, one gets different answers to the same question.

Equation (7.123) can be used to calculate the price of a variance swaption:

\begin{matrix} U (t, y^{(V a r)}) & = \frac{1}{2 π} \int_{x^{*}}^{ϕ \infty} \int_{- \infty}^{\infty} ϕ (\overset{ˉ}{x} - x^{*}) ϝ (\overset{ˉ}{t}, k) e^{i k \overset{ˉ}{x}} d k d \overset{ˉ}{x} \\ = \frac{1}{2 π} \int_{- \infty}^{\infty} ϝ (\overset{ˉ}{t}, k) (ϕ \int_{x^{*}}^{ϕ \infty} (\overset{ˉ}{x} - x^{*}) e^{i k \overset{ˉ}{x}} d \overset{ˉ}{x}) d k \\ = \frac{1}{2 π} lim_{ϵ \to 0} \int_{- \infty}^{\infty} ϝ (\overset{ˉ}{t}, k) e^{i k x^{*}} (- \frac{\partial}{\partial ϵ} \int_{x^{*}}^{ϕ \infty} e^{(i k - ϕ ϵ) \overset{ˉ}{x}} d \overset{ˉ}{x}) \\ = \frac{1}{2 π} lim_{ϵ \to 0} \int_{- \infty}^{\infty} \frac{ϝ (\overset{ˉ}{t}, k) e^{i k x^{*}}}{{(i k - ϕ ϵ)}^{2}} d k, \end{matrix}

(8.75)

where $ϝ (\overset{ˉ}{t}, k)$ is given by (7.124).

8.6 Automated Market Makers

Variance and volatility swaps had long occupied a specific niche within the financial product landscape. Recently, they experienced an unexpected surge in interest due to the influence of cryptocurrency trading. These swaps have proven effective in hedging impermanent loss, a phenomenon generated by automated market makers; see Reference Lipton and HardjonoLipton and Hardjono (2021), Reference Lipton and TreccaniLipton and Treccani (2021), Reference Lipton and SeppLipton and Sepp (2022), Reference Cartea, Drissi and MongaCartea et al. (2023), Reference Fukasawa, Maire and WunschFukasawa et. al (2023), and others. This section closely follows Reference Lipton and HardjonoLipton and Hardjono (2021).

Let us consider a smart contract (SC), called an automated market maker (AMM) designed to facilitate exchanges of two tokens, $T N_{1}$ and $T N_{2} .$ The analytical formula for the price of the second token in terms of the first defines the nature of the contract. AMMs have gained significant traction in recent years. Initially, anyone can participate as a market maker and liquidity provider by depositing $T N_{1}$ and $T N_{2}$ simultaneously and in the correct ratio into the collateral pool. Subsequently, participants can withdraw one token from the pool by delivering the other token according to the rules established by the underlying SC. While AMMs excel in facilitating stablecoin swaps, they can easily accommodate the exchange of various tokens, such as swapping a stablecoin, say USDT, for ethereum (ETH).

The actual exchange rate is determined by rules that rely on prior agreement. The available options are the constant sum, constant product, and mixture rules. Sources including Reference Angeris, Kao, Chiang, Noyes and ChitraAngeris et al. (2019), Reference EgorovEgorov (2019), Reference Zhang, Chen and ParkZhang et al. (2018), Reference Lipton and HardjonoLipton and Hardjono (2021), Reference Lipton and SeppLipton and Sepp (2022), and references therein offer detailed coverage of AMMs and comprehensive insights into their mechanisms.

Assuming that initially tokens $T N_{1},$ $T N_{2}$ are equal in value, one can define a constant sum AMM:

\begin{matrix} X + Y = Σ_{0}, X_{0} = Y_{0} = N, Σ_{0} = 2 N . \end{matrix}

(8.76)

Here $X, Y$ are the quantities of $T N_{1},$ $T N_{2}$ in the pool. Equation (8.76) yields

\begin{matrix} Y = Σ_{0} - X, (\frac{d Y}{d X}) = 1. \end{matrix}

(8.77)

As per (8.77), the pool reaches depletion at $X = Σ_{0},$ as it becomes advantageous for an arbitrageur to increase $X$ from $N$ to $2 N$ when $T N_{2}$ surpasses $T N_{1}$ in value. The marginal price of $T N_{2}$ relative to $T N_{1},$ as expressed in the second equation (8.77), remains consistent and equal to one. A constant price is optimal for a constant sum AMM, particularly when dealing with stablecoins like $T N_{1}$ and $T N_{2},$ whose prices fluctuate mildly around their equilibrium values. Depleting the pool is rational in scenarios where transaction fees are nonexistent, even with a minimal deviation from equilibrium. However, under more realistic conditions with nonzero transaction fees, arbitrage becomes profitable only if the deviation surpasses a certain threshold.

The constant product rule defines more intricate and, importantly, practical AMMs:

\begin{matrix} X Y = Π_{0}, X_{0} = Y_{0} = N, Π_{0} = N^{2} . \end{matrix}

(8.78)

It is clear that

\begin{matrix} Y = \frac{Π_{0}}{X}, (\frac{d Y}{d X}) = \frac{Π_{0}}{X^{2}} . \end{matrix}

(8.79)

Consequently, an arbitrageur is unable to deplete such a pool, allowing it to persist indefinitely. In this scenario, it becomes evident that the price of $T N_{2}$ relative to $T N_{1}$ is no longer steady; instead, it rises (or falls) as $X$ decreases (or increases).

To make liquidity provision more attractive to potential market makers, one can generalize the constant sum and constant product rules. Expressions (8.76) and (8.78) representing these rules can be formulated as follows:

\begin{matrix} (\frac{Σ}{Σ_{0}} - 1) & = 0, X_{0} = Y_{0} = N, Σ_{0} = 2 N, \\ (\frac{Π_{0}}{Π} - 1) & = 0, X_{0} = Y_{0} = N, Π_{0} = N^{2} . \end{matrix}

(8.80)

where $Σ = X + Y,$ $Π = X Y$ are the current sum and product, respectively. These rules can be combined as follows:

\begin{matrix} (\frac{Π_{0}}{Π} - 1) + α (\frac{Σ}{Σ_{0}} - 1) = 0, \\ X_{0} = Y_{0} = N, Σ_{0} = 2 N, Π_{0} = N^{2} . \end{matrix}

(8.81)

Here, $α > 0$ is an adaptive parameter, characterizing the transition from the constant product to the constant sum rule. The product $Π$ is in the denominator to avoid the possibility of exhausting the entire pool and ensuring that

\begin{matrix} Y (X) \underset{X \to 0}{\to} \infty, X (Y) \underset{Y \to 0}{\to} \infty . \end{matrix}

(8.82)

Certainly, when AMM liquidity providers are exposed to arbitragers, they face potential losses stemming from a decline in collateral value below its buy-and-hold threshold. In financial terms, an AMM liquidity provider is an option seller experiencing negative convexity, so that they must impose transaction fees to offset these losses. The losses incurred by AMMs are (somewhat misleadingly) termed “impermanent” because they tend to vanish under the assumption of mean reversion. However, the validity of the mean-reversion assumption in real-world scenarios can vary. Introducing variables $x$ and $y$ where $X = N x$ and $Y = N y,$ one can express the constant sum rule described by Equations (8.76) and (8.77) as follows:

\begin{matrix} x + y & = 2, x_{0} = y_{0} = 1, \end{matrix}

(8.83)

\begin{matrix} y (x) & = 2 - x, (\frac{d y}{d x}) = 1. \end{matrix}

(8.84)

In terms of $x$ and $y,$ the constant product rule given by Equations (8.78) and (8.79) can be written in the following form:

\begin{matrix} x y & = 1, x_{0} = y_{0} = 1, \end{matrix}

(8.85)

\begin{matrix} y (x) & = \frac{1}{x}, (\frac{d y}{d x}) = \frac{1}{x^{2}} . \end{matrix}

(8.86)

Finally, the mixed-rule equations of (8.81) written in terms of $x$ and $y$ become

\begin{matrix} (\frac{1}{x y} - 1) + α (\frac{x + y}{2} - 1) = 0, x_{0} = y_{0} = 1. \end{matrix}

(8.87)

Straightforward algebra yields

\begin{matrix} y_{α} & = \frac{1}{2 α} (- (2 (1 - α) + α x) + {({(2 (1 - α) + α x)}^{2} + \frac{8 α}{x})}^{1 / 2}), \\ \frac{d y_{α}}{d x} & = \frac{1}{2} (- 1 + \frac{2 (1 - α) + α x - (4) x^{2}}{{({(2 (1 - α) + α x)}^{2} + \frac{8 α}{x})}^{1 / 2}}), \\ \frac{d^{2} y_{α}}{d x^{2}} & = \frac{1}{2} (\frac{α + (8) x^{3}}{{({(2 (1 - α) + α x)}^{2} + \frac{8 α}{x})}^{1 / 2}} - \frac{α (2 (1 - α) + α x - (4) x^{2})}{{({(2 (1 - α) + α x)}^{2} + \frac{8 α}{x})}^{3 / 2}}) . \end{matrix}

(8.88)

Assume that the external exchange price $S$ of $T N_{2}$ expressed in terms of $T N_{1}$ moves away from its equilibrium value $S_{0} = 1 .$ Let $S > 1 .$ For the constant sum contract, an arbitrageur can choose a number $x,$ $1 < x \leq 2,$ and deliver $(x - 1)$ of $T N_{1}$ tokens to the pool in exchange for getting $(x - 1)$ of $T N_{2}$ tokens. The profit or loss ( $P & L$ ) is given by

\begin{matrix} Ω (x) = (S - 1) (x - 1) . \end{matrix}

(8.89)

Since $Ω$ is a linear function of $x,$ it is rational to exhaust the entire pool by choosing the following optimal values $(x^{*}, y^{*}, Ω^{*})$ :

\begin{matrix} x^{*} = 2, y^{*} = 0, Ω^{*} = (S - 1) . \end{matrix}

(8.90)

Similarly, when $S < 1$ :

\begin{matrix} x^{*} = 0, y^{*} = 2, Ω^{*} = - (S - 1) . \end{matrix}

(8.91)

The arbitraged portfolio’s value is $π^{*} (S),$ where

\begin{matrix} π^{*} (S) = (\begin{matrix} 2, S \geq 1, \\ 2 S, S < 1. \end{matrix}), \end{matrix}

(8.92)

while the buy-and-hold portfolio’s value is $(S + 1) .$ The difference $ω$ has the form

\begin{matrix} ω = (S + 1) - π^{*} (S) = (S - 1) . \end{matrix}

(8.93)

In the DeFi parlance, $ω$ is termed as impermanent loss. However, this description can be misleading as the loss can swiftly become permanent when $S$ moves away from its assumed “equilibrium” value of one. The percentage loss in the actual portfolio compared to the buy-and-hold portfolio is structured as follows:

\begin{matrix} λ = 1 - \frac{(S - 1)}{S + 1} . \end{matrix}

(8.94)

A similar calculation can be performed for the constant product contract. When $S$ deviates from one, an arbitrageur can choose a number $x > 1$ and deliver $(x - 1)$ tokens $T N_{1}$ to the pool, while taking $(1 - y)$ tokens $T N_{2}$ from the pool, where $y = 1 / x .$ The $P & L$ has the form:

\begin{matrix} Ω (x) = (S (1 - \frac{1}{x}) - (x - 1)) . \end{matrix}

(8.95)

The optimality condition has the form

\begin{matrix} Ω^{'} (x) = (\frac{S}{x^{2}} - 1) = 0, \end{matrix}

(8.96)

so that the corresponding optimal values $(x^{*}, y^{*}, Ω^{*})$ are

\begin{matrix} x^{*} = \sqrt{S}, y^{*} = \frac{1}{\sqrt{S}}, Ω^{*} = {(\sqrt{S} - 1)}^{2} . \end{matrix}

(8.97)

Hence, a constant product collateral pool remains inexhaustible. Throughout each phase, the ideal quantities of $T N_{1}$ and $T N_{2}$ maintained in the portfolio are both $\sqrt{S} .$ As both tokens’ values within the portfolio must equate, the suggested optimal value of $T N_{2}$ in terms of $T N_{1}$ is $S^{*} = x^{*} / y^{*} = S .$ The value of the arbitrage-driven portfolio stands at $π^{*} = 2 \sqrt{S},$ whereas the value of the buy-and-hold portfolio amounts to $(S + 1) .$ The difference is given by

\begin{matrix} ω = (S + 1) - 2 \sqrt{S} = {(\sqrt{S} - 1)}^{2} . \end{matrix}

(8.98)

The corresponding percentage loss is

\begin{matrix} λ = 1 - \frac{2 \sqrt{S}}{(S + 1)} = \frac{{(\sqrt{S} - 1)}^{2}}{(S + 1)} . \end{matrix}

(8.99)

For the mixed-rule AMM, the arbitrageur’s profit for $S > 1$ has the form

\begin{matrix} Ω (x) = (S (1 - y_{α} (x)) - (x - 1)), \end{matrix}

(8.100)

with the optimum achieved at $x_{α}^{*}, y_{α}^{*}, Ω_{α}^{*}$ of the form

\begin{matrix} y_{α}^{'} (x_{α}^{*}) = - \frac{1}{S}, y_{α}^{*} = y_{α} (x_{α}^{*}), Ω_{α}^{*} = (S (1 - y_{α}^{*}) - (x_{α}^{*} - 1)), \end{matrix}

(8.101)

with the optimal $x_{α}^{*}$ via the Newton–Raphson method starting with a suitable $x_{α}^{(0)}$ :

\begin{matrix} x_{α}^{(n + 1)} = x_{α}^{(n)} - \frac{y_{α}^{'} (x_{α}^{(n)}) + \frac{1}{S}}{y_{α}^{''} (x_{α}^{(n)})} . \end{matrix}

(8.102)

Here $y_{α}^{'},$ $y_{α}^{''}$ are given by the equations of (8.88). Due to quadratic convergence of the Newton–Raphson method, ten iterations provide machine accuracy, so that one can set $x_{α}^{*} = x_{α}^{(10)} .$ The value of the arbitraged portfolio is

\begin{matrix} π^{*} = x_{α}^{*} + S y_{α} (x_{α}^{*}) . \end{matrix}

(8.103)

Figure 14 shows the constant sum, constant product, and mixed-rule curves, along with the relative prices of $T N_{2}$ in terms of $T N_{1}$ and the associated impermanent losses. It demonstrates that deviations from the tokens’ equilibrium values result in losses for the market maker. Impermanent loss is relatively minor for the constant product rule, moderate for the mixed rule, and notably high for the constant sum rule. Even when the price $S$ sways by a factor of five from its equilibrium, the impermanent loss within the constant product rule remains manageable, especially compared to the mixed rule.

Figure 14 The constant sum, constant product, and mixed-rule curves, along with the relative prices of $T N_{2}$ in terms of $T N_{1}$ and the associated impermanent losses; $α = 10 .$ Author’s graphics.

One can use variance swaps to hedge impermanent loss. For brevity, consider the constant product rule. The corresponding impermanent loss, shown in Figure 14, is given by (8.98). It can be viewed as a payoff of a nonstandard European option. The hedging approach is straightforward – one approximates this payoff with payoffs of options, which can be priced explicitly. Specifically, one can use two such options: the log and entropy contracts. The corresponding payoffs are as follows:

\begin{matrix} U^{L C} (S) & = c^{L C} (S - 1 - ln (S)), \end{matrix}

(8.104)

\begin{matrix} U^{E C} (S) & = c^{E C} (S ln (S) - (S - 1)) . \end{matrix}

(8.105)

The prefactors $c^{L C},$ $c^{E C}$ are chosen in such a way that the value of the impermanent loss (8.98) and the hypothetical payoffs (8.104) and (8.105) agree at the point $S = 1$ up to the third derivative, so that

\begin{matrix} c^{L C} = c^{E C} = \frac{1}{2} . \end{matrix}

(8.106)

Assuming that $S$ is driven by the geometric Brownian motion with stochastic volatility, one can find the value of the log and entropy contracts at time $t$ at the point $S = 1,$ by solving the following problems:

\begin{matrix} U_{t} + \frac{1}{2} v (S^{2} U_{S S} + 2 ε ρ S U_{S v} + ε^{2} U_{v v}) + (χ - κ v) U_{v} = 0, \end{matrix}

(8.107)

supplied with terminal conditions of the form

\begin{matrix} U^{L C} (\overset{ˉ}{t}, S, v) = (S - 1 - ln (S)), \end{matrix}

(8.108)

and

\begin{matrix} U^{E C} (\overset{ˉ}{t}, S, v) = (S ln (S) - (S - 1)), \end{matrix}

(8.109)

respectively.

The corresponding solutions are well-known and easy to find. One can present $U^{L C} (t, S)$ as follows:

\begin{matrix} U^{L C} (t, S, v, \overset{ˉ}{t}) = Φ^{L C} (t, v, \overset{ˉ}{t}) + (S - 1 - ln (S)), \end{matrix}

(8.110)

where

\begin{matrix} Φ_{t}^{L C} + \frac{1}{2} v (1 + ε^{2} Φ_{v v}^{L C}) + (χ - κ v) Φ_{v}^{L C} = 0, \\ Φ^{L C} (\overset{ˉ}{t}, v, \overset{ˉ}{t}) = 0. \end{matrix}

(8.111)

Accordingly,

\begin{matrix} Φ^{L C} (t, v, \overset{ˉ}{t}) = α^{L C} (t, \overset{ˉ}{t}) + β^{L C} (t, \overset{ˉ}{t}) v, \end{matrix}

(8.112)

where

\begin{matrix} α_{t}^{L C} (t, \overset{ˉ}{t}) + χ β^{L C} (t, \overset{ˉ}{t}) & = 0, α^{L C} (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \\ β_{t}^{L C} (t, \overset{ˉ}{t}) - κ β^{L C} (t, \overset{ˉ}{t}) + \frac{1}{2} & = 0, β^{L C} (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0. \end{matrix}

(8.113)

Thus,

\begin{matrix} α^{L C} (t, \overset{ˉ}{t}) & = \frac{χ}{2 κ} (T - {\overset{ˉ}{B}}_{κ} (T)), \\ β^{L C} (t, \overset{ˉ}{t}) & = \frac{{\overset{ˉ}{B}}_{κ} (T)}{2}, \end{matrix}

(8.114)

so that

\begin{matrix} U^{L C} (t, S, v, \overset{ˉ}{t}) = \frac{1}{2} (\frac{χ T}{κ} + (v - \frac{χ}{2 κ}) {\overset{ˉ}{B}}_{κ} (T)) + (S - 1 - ln (S)) . \end{matrix}

(8.115)

It is clear that $U^{L C} (t, 1, v, \overset{ˉ}{t})$ is in agreement with (8.72).

One can calculate $U^{E C} (t, S, v)$ in a similar fashion by representing it in the form:

\begin{matrix} U^{E C} (t, S, v, \overset{ˉ}{t}) = Φ^{E C} (t, v, \overset{ˉ}{t}) S + (S ln (S) - (S - 1)), \end{matrix}

(8.116)

where, once the common factor $S$ is omitted,

\begin{matrix} Φ_{t}^{E C} + \frac{1}{2} v (1 + 2 ε ρ Φ_{v}^{E C} + ε^{2} Φ_{v v}^{E C}) + (χ - κ v) Φ_{v}^{E C} = 0, \\ Φ^{L C} (\overset{ˉ}{t}, v, \overset{ˉ}{t}) = 0. \end{matrix}

(8.117)

As before,

\begin{matrix} Φ^{E C} (t, v, \overset{ˉ}{t}) = α^{E C} (t, \overset{ˉ}{t}) + β^{E C} (t, \overset{ˉ}{t}) v, \end{matrix}

(8.118)

where

\begin{matrix} α_{t}^{E C} (t, \overset{ˉ}{t}) + χ β^{E C} (t, \overset{ˉ}{t}) = 0, α^{E C} (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \\ β_{t}^{E C} (t, \overset{ˉ}{t}) - (κ - ε ρ) β^{E C} (t, \overset{ˉ}{t}) + \frac{1}{2} = 0, β^{E C} (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0. \end{matrix}

(8.119)

Thus,

\begin{matrix} α^{E C} (t, \overset{ˉ}{t}) = \frac{χ}{2 (κ - ε ρ)} (T - {\overset{ˉ}{B}}_{κ - ε ρ} (T)), \end{matrix}

(8.120)

\begin{matrix} β^{E C} (t, \overset{ˉ}{t}) = \frac{{\overset{ˉ}{B}}_{κ - ε ρ} (T)}{2}, \\ U^{E C} (t, S, v, \overset{ˉ}{t}) = \frac{1}{2} (\frac{χ T}{κ_{1}} + (v - \frac{χ}{2 κ_{1}}) {\overset{ˉ}{B}}_{κ_{1}} (T)) S + (S ln (S) - (S - 1)) . \end{matrix}

(8.121)

where $κ_{1} = κ - ε ρ .$ Equations (8.115) and (8.121) allow us to estimate the amount a liquidity provider needs to collect to cover the expected impermanent loss.

However, it turns out (which comes as a surprise, at least to the present author) that one can solve the pricing problem (8.107) with the exact terminal condition (8.98) explicitly, since the impermanent loss does not have any optionality and is a linear combination of the so-called power contracts with payoffs of the form $S, \sqrt{S}, 1 .$ Footnote ¹⁰

Thus, by using an appropriate Kelvin wave, one can solve the problem (8.107) with the power terminal condition:

\begin{matrix} U^{(ν)} (S) = S^{ν} . \end{matrix}

(8.122)

Of course, for $ν = 0, 1,$ the solution is trivial; for other values of $ν,$ additional efforts are needed. To be concrete, it is assumed that $0 < ν < 1$ ; for other values of $ν,$ the solution can blow up in finite time. The price of the power contract with the payoff $S^{ν}$ (even when the interest rate $r \neq 0$ ) is given by a Kelvin wave:

\begin{matrix} V (t, S, \overset{ˉ}{t}) = e^{α (t, \overset{ˉ}{t}) + β (t, \overset{ˉ}{t}) v} S^{ν}, \end{matrix}

(8.123)

where $α (t), β (t)$ solve the following system of ODEs:

\begin{matrix} α_{t} (t, \overset{ˉ}{t}) + χ β (t, \overset{ˉ}{t}) = 0, α (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \\ β_{t} (t, \overset{ˉ}{t}) + \frac{ε^{2}}{2} β^{2} (t, \overset{ˉ}{t}) - (κ - ν ρ ε) β (t, \overset{ˉ}{t}) + \frac{ν (ν - 1)}{2} = 0, β (\overset{ˉ}{t}, \overset{ˉ}{t}) = 0, \end{matrix}

(8.124)

which has an explicit solution given by Equations (7.111)–(7.114) with

\begin{matrix} λ_{\pm}^{2} + (κ - ν ρ ε) λ_{\pm} + \frac{ε^{2} ν (ν - 1)}{4} = 0, \\ λ_{\pm} = μ \pm ζ, \end{matrix}

(8.125)

\begin{matrix} μ = - \frac{(κ - ν ρ ε)}{2}, ζ = \frac{\sqrt{{(κ - ν ρ ε)}^{2} - ε^{2} ν (ν - 1)}}{2} . \end{matrix}

(8.126)

Thus, both $μ$ and $ζ$ are real. Accordingly, one can represent $α$ and $β$ as follows:

\begin{matrix} α (T) & = - \frac{2 χ}{ε^{2}} (- \frac{(κ - ν ρ ε) T}{2} + ln (\frac{ζ cosh (T) - μ sinh (T)}{ζ})), \\ β (T) & = \frac{ν (ν - 1) (sinh (T))}{2 (ζ cosh (T) - μ sinh (T))} . \end{matrix}

(8.127)

The exact impermanent loss and its approximations are shown in Figure 15. This figure shows that $max (U^{L C}, U^{E C})$ strictly dominates the exact solution $U^{E X},$ but, as time of liquidity provision grows, the corresponding upper bound becomes inaccurate.

Figure 15 The exact impermanent loss and its approximations via log and enthropy contracts. The corresponding parameters are $T = 3,$ $d t = 0.01,$ $χ = 0.2,$ $κ = 2.0,$ $ε = 0.2,$ $ρ = - 0.5,$ $v = 0.15 .$ Author’s graphics.

The calculation of the mixed-rule impermanent loss and its approximations is left to the reader as a difficult exercise.

In $P & L$ modeling for AMMs, the primary aim is to ensure that the liquidity provider makes a profit or, at least, does not incur a loss. This profit stems from transaction fees charged by the pool, which must exceed the impermanent loss caused by collateral value dropping below its buy-and-hold threshold. These fees must exceed the impermanent loss. An arbitrageur needs to add more tokens to the pool than the rule dictates to account for transaction fees. In the presence of nonzero transaction costs, the actual composition of the pool is time- and path-dependent. Given the stochastic nature of the log price, the analysis of $\overline{P & L}$ can only be conducted probabilistically through Monte Carlo simulations; see Reference Lipton and HardjonoLipton and Hardjono (2021) and Reference Lipton and SeppLipton and Sepp (2022). For the parameter selection used by these authors, automated liquidity provision is profitable on average. This profitability arises because the AMM accumulates more tokens by the process’s conclusion than initially possessed.

8.7 Bonds and Bond Options

8.7.1 Background

We now use the machinery developed in Sections 6 and 7 for pricing bonds and bond options in some popular fixed-income models, including Vasicek–Hull–White and Cox–Ingersoll–Ross.

8.7.2 Vasicek Model

One can use formulas derived in the previous subsection to price bonds and bond options in the popular Vasicek and Hull–White models; see Reference Hull and WhiteHull and White (1990); Reference VasicekVasicek (1977). Recall that Vasicek postulated the following dynamics for the short interest rate ${\hat{y}}_{t}$ :

\begin{matrix} d {\hat{y}}_{t} = (χ - κ {\hat{y}}_{t}) d t + ε d {\hat{W}}_{t}, {\hat{y}}_{t} = y, \end{matrix}

(8.128)

or, alternatively,

\begin{matrix} d {\hat{y}}_{t} = κ (θ - {\hat{y}}_{t}) d t + ε d {\hat{W}}_{t}, {\hat{y}}_{t} = y, \end{matrix}

(8.129)

where $κ θ = χ .$

At time $t,$ the price of a bond maturing at time $\overset{ˉ}{t},$ which is denoted by $Z (t, y, \overset{ˉ}{t}),$ boils down to solving the following classical backward problem:

\begin{matrix} Z_{t} (t, y, \overset{ˉ}{t}) + (χ - κ y) Z_{y} (t, y, \overset{ˉ}{t}) + \frac{1}{2} ε^{2} Z_{y y} (t, y, \overset{ˉ}{t}) - y Z (t, y, \overset{ˉ}{t}) = 0, \\ Z (\overset{ˉ}{t}, y, \overset{ˉ}{t}) = 1. \end{matrix}

(8.130)

The standard affine ansatz yields

\begin{matrix} Z (t, y, \overset{ˉ}{t}) & = exp (C - B_{κ} y) \\ C & = (θ - \frac{ε^{2}}{2 κ^{2}}) (B_{κ} - T) - \frac{ε^{2}}{4 κ} B_{κ}^{2} \\ = (B_{κ} - T) θ + \frac{h_{0}}{2}, \end{matrix}

(8.131)

where $h_{0}$ is given by (6.114).

One can use formulae derived in the previous section to come up with an alternative derivation. Introduce ${\hat{x}}_{t} = \int_{t}^{t} {\hat{y}}_{s} d s .$ The distribution of $({\hat{x}}_{t}, {\hat{y}}_{t})$ is given by (6.45) with the covariance matrix $H,$ given by (6.114) and the expected value $r$ given by (6.115). Accordingly, the price of a bond can be written as follows:

\begin{matrix} Z (t, y, \overset{ˉ}{t}) & = E (e^{- \overset{ˉ}{x}}) = \frac{1}{\sqrt{2 π h_{0}}} \int_{- \infty}^{\infty} e^{- \overset{ˉ}{x} - \frac{{(\overset{ˉ}{x} - p)}^{2}}{2 h_{0}}} d \overset{ˉ}{x} \\ = e^{- p + \frac{h_{0}}{2}} = exp (C (T) - B_{κ} (T) y), \end{matrix}

(8.132)

so that Equations (8.131) and (8.132) are in agreement.

Knowing the joint Gaussian distribution for $({\hat{x}}_{t}, {\hat{y}}_{t}),$ one can price an option on zero coupon bond maturing at time $\overset{˘}{t} > t,$ $\overset{˘}{t} - t = \overset{˘}{T} .$ The payoff of a European option with strike $K$ has the form:

\begin{matrix} U (\overset{ˉ}{t}, \overset{ˉ}{y}) = max (ϕ (exp (C (\overset{˘}{T}) - B_{κ} (\overset{˘}{T}) \overset{ˉ}{y}) - exp (ln K)), 0) . \end{matrix}

(8.133)

At maturity $\overset{ˉ}{t},$ the payoff is independent of $\overset{ˉ}{x}$ ; however, at inception it does depend on the realized value of $\overset{ˉ}{x} .$ By using Equations (6.45), (6.114), and (6.115), one can write $U (t, y)$ (recall that here $x = 0$ ) as follows:

\begin{matrix} U (t, y) = J_{1} (t, y) - J_{2} (t, y), \end{matrix}

(8.134)

where

\begin{matrix} J_{1} (t, y) & = \frac{1}{2 π det {(H)}^{1 / 2}} \int_{- \infty}^{\infty} \int_{- ϕ \infty}^{y^{*}} exp (- Λ (\overset{ˉ}{x}, \overset{ˉ}{y}) - \overset{ˉ}{x} + C (\overset{˘}{T}) - B_{κ} (\overset{˘}{T}) \overset{ˉ}{y}) d \overset{ˉ}{x} d \overset{ˉ}{y}, \end{matrix}

(8.135)

\begin{matrix} J_{2} (t, y) & = \frac{1}{2 π det {(H)}^{1 / 2}} \int_{- \infty}^{\infty} \int_{- ϕ \infty}^{y^{*}} exp (- Λ (\overset{ˉ}{x}, \overset{ˉ}{y}) - \overset{ˉ}{x} + ln K) d \overset{ˉ}{x} d \overset{ˉ}{y}, \end{matrix}

(8.136)

\begin{matrix} Λ (\overset{ˉ}{x}, \overset{ˉ}{y}) & = \frac{(h_{2} {(\overset{ˉ}{x} - p)}^{2} - 2 h_{1} (\overset{ˉ}{x} - p) (\overset{ˉ}{y} - q) + h_{0} {(\overset{ˉ}{y} - q)}^{2})}{2 det (H)}, \end{matrix}

(8.137)

with $h_{i}$ given by (6.114), $det (H) = h_{0} h_{2} - h_{1}^{2} .$ Here $y^{*}$ is defined as follows:

\begin{matrix} y^{*} = \frac{C (\overset{˘}{T}) - ln K}{B_{κ} (\overset{˘}{T})} . \end{matrix}

(8.138)

First, consider $J_{1} .$ Completing the square, one gets

\begin{matrix} - Λ (\overset{ˉ}{x}, \overset{ˉ}{y}) - \overset{ˉ}{x} + C (\overset{˘}{T}) - B_{κ} (\overset{˘}{T}) \overset{ˉ}{y} \\ = & - \frac{(h_{2} {((\overset{ˉ}{x} - p) - \frac{Ξ (y)}{\sqrt{h_{2}}})}^{2} - Ξ^{2} (\overset{ˉ}{y}) + h_{0} {(\overset{ˉ}{y} - q)}^{2})}{2 det (H)} \\ - B_{κ} (\overset{˘}{T}) (\overset{ˉ}{y} - q) - p + C (\overset{˘}{T}) - B_{κ} (\overset{˘}{T}) q, \end{matrix}

(8.139)

where

\begin{matrix} Ξ (\overset{ˉ}{y}) = \frac{(h_{1} (\overset{ˉ}{y} - q) - det (H))}{\sqrt{h_{2}}} . \end{matrix}

(8.140)

Integrating over $\overset{ˉ}{x},$ one obtains the following expression for $J_{1}$ :

\begin{matrix} J_{1} (t, y) = & \frac{e^{- p + C (\overset{˘}{T}) - B_{κ} (\overset{˘}{T}) q}}{\sqrt{2 π h_{2}}} \\ \int_{- ϕ \infty}^{y^{*}} exp (- \frac{(- Ξ^{2} (\overset{ˉ}{y}) + h_{0} {(\overset{ˉ}{y} - q)}^{2} + 2 det (H) B_{κ} (\overset{˘}{T}) (\overset{ˉ}{y} - q))}{2 det (H)}) d \overset{ˉ}{y} . \end{matrix}

(8.141)

Completing the square one more time, one gets:

\begin{matrix} - \frac{- Ξ^{2} + h_{0} {(\overset{ˉ}{y} - q)}^{2} + 2 det (H) B_{κ} (\overset{˘}{T}) (\overset{ˉ}{y} - q)}{2 det (H)} \\ = - \frac{{(\overset{ˉ}{y} - q + h_{1} + B_{κ} (\overset{˘}{T}) h_{2})}^{2}}{2 h_{2}} + \frac{h_{0}}{2} + B_{κ} (\overset{˘}{T}) h_{1} + \frac{B_{κ}^{2} (\overset{˘}{T}) h_{2}}{2}, \end{matrix}

(8.142)

so that

\begin{matrix} J_{1} (t, y) \\ = & \frac{e^{- p + C (\overset{˘}{T}) {- B}_{κ} (\overset{˘}{T}) q + \frac{h_{0}}{2} + B_{κ} (\overset{˘}{T}) h_{1} + \frac{B_{κ}^{2} (\overset{˘}{T}) h_{2}}{2}}}{\sqrt{2 π h_{2} (t, t)}} \\ \int_{- ϕ \infty}^{y^{*}} exp (- \frac{{(\overset{ˉ}{y} - q + h_{1} + B_{κ} (\overset{˘}{T}) h_{2})}^{2}}{2 h_{2}}) d \overset{ˉ}{y} \\ = & ϕ e^{- p + C (\overset{˘}{T}) {- B}_{κ} (\overset{˘}{T}) q + \frac{h_{0}}{2} + B_{κ} (\overset{˘}{T}) h_{1} + \frac{B_{κ}^{2} (\overset{˘}{T}) h_{2}}{2}} N (\frac{ϕ (y^{*} - q + h_{1} + B_{κ} (\overset{˘}{T}) h_{2})}{\sqrt{h_{2}}}) . \end{matrix}

(8.143)

It is easy to see that $Z (t, y, \overset{˘}{t})$ is given by (8.143) with $ϕ = 1$ and $y^{*} = \infty,$ so that

\begin{matrix} Z (t, y, \overset{˘}{t}) = e^{- p + C (\overset{˘}{T}) {- B}_{κ} (\overset{˘}{T}) q + \frac{h_{0}}{2} + B_{κ} (\overset{˘}{T}) h_{1} + \frac{B_{κ}^{2} (\overset{˘}{T}) h_{2}}{2}} . \end{matrix}

(8.144)

Thus,

\begin{matrix} J_{1} (t, y) = ϕ Z (t, y, \overset{˘}{t}) \\ N (\frac{ϕ (C (\overset{˘}{T}) - ln K - B_{κ} (\overset{˘}{T}) q + B_{κ} (\overset{˘}{T}) h_{1} + B_{κ}^{2} (\overset{˘}{T}) h_{2})}{\sqrt{h_{2}} B_{κ} (\overset{˘}{T})}) . \end{matrix}

(8.145)

Direct verification of (8.143) is left to the reader as a useful exercise. By using this equation, it is easy but tedious to show that

\begin{matrix} J_{1} (t, y) & = ϕ Z (t, y, \overset{˘}{t}) N (ϕ d_{+}), \\ d_{+} & = \frac{ln (\frac{Z (t, y, \overset{˘}{t})}{Z (t, y, \overset{ˉ}{t}) K})}{Σ (t, \overset{ˉ}{t}, \overset{˘}{t})} + \frac{Σ (t, \overset{ˉ}{t}, \overset{˘}{t})}{2}, \end{matrix}

(8.146)

where

\begin{matrix} Σ (t, \overset{ˉ}{t}, \overset{˘}{t}) = \sqrt{h_{2}} B_{κ} (\overset{˘}{T}) . \end{matrix}

(8.147)

Second, consider $J_{2},$ proceed in the same way as before, and represent $J_{2} (t, y)$ in the following form:

\begin{matrix} J_{2} (t, y) & = ϕ Z (t, y, \overset{ˉ}{t}) N (ϕ d_{-}), \\ d_{-} & = \frac{ln (\frac{Z (t, y, \overset{˘}{t})}{Z (t, y, \overset{ˉ}{t}) K})}{Σ (t, \overset{ˉ}{t}, \overset{˘}{t})} - \frac{Σ (t, \overset{ˉ}{t}, \overset{˘}{t})}{2} . \end{matrix}

(8.148)

Finally, one arrives at the following familiar expression for the bond option price:

\begin{matrix} U (t, y) = ϕ (Z (t, y, \overset{˘}{t}) N (ϕ d_{+}) - Z (t, y, \overset{ˉ}{t}) K N (ϕ d_{-})) . \end{matrix}

(8.149)

8.7.3 CIR Model

The CIR model postulates that the short rate follows the Feller process; see Reference Cox, Ingersoll Jr. and RossCox et al. (1985). Accordingly, the bond price can be calculated by using (7.123) with $x = 0,$ and $k = - i$ :

\begin{matrix} Z (t, y, \overset{ˉ}{t}) & = \int_{- \infty}^{\infty} ϖ^{(x)} (t, y, \overset{ˉ}{t}, \overset{ˉ}{x}) e^{- \overset{ˉ}{x}} d \overset{ˉ}{x} \\ = \frac{1}{2 π} \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} ϝ (t, y, \overset{ˉ}{t}, k) e^{(i k - 1) \overset{ˉ}{x}} d k d \overset{ˉ}{x} = ϝ (t, y, \overset{ˉ}{t}, - i), \end{matrix}

(8.150)

where

\begin{matrix} ϝ (t, y, \overset{ˉ}{t}, - i) = exp (\frac{2 χ μ T}{ε^{2}} - \frac{2 χ}{ε^{2}} ln (\frac{- λ_{-} E_{+} + λ_{+} E_{-}}{2 ζ}) + \frac{2 λ_{+} λ_{-} (E_{+} - E_{-}) y}{ε^{2} (- λ_{-} E_{+} + λ_{+} E_{-})}), \end{matrix}

(8.151)

with

\begin{matrix} λ_{\pm} & = μ \pm ζ, \\ μ & = - \frac{κ}{2}, ζ = \frac{\sqrt{κ^{2} + 2 ε^{2}}}{2} . \end{matrix}

(8.152)

Thus,

\begin{matrix} Z (t, y, \overset{ˉ}{t}) = e^{\tilde{C} - \tilde{B} y}, \end{matrix}

(8.153)

where

\begin{matrix} \tilde{C} & = \frac{χ κ T}{ε^{2}} - \frac{2 χ}{ε^{2}} ln (\frac{- λ_{-} E_{+} + λ_{+} E_{-}}{2 ζ}), \\ \tilde{B} & = \frac{(E_{+} - E_{-})}{(- λ_{-} E_{+} + λ_{+} E_{-})}, \end{matrix}

(8.154)

which coincides with the standard expressions given by Reference Cox, Ingersoll Jr. and RossCox et al. (1985).

8.8 European Options with Stochastic Interest Rates

This section shows how to price equity options with stochastic interest rates. While the formulation of this problem may appear straightforward, its solution proves to be tedious. It is assumed that interest rate is governed by the Ornstein–Uhlenbeck–Vasicek processes.

\begin{matrix} d {\hat{y}}_{t} = (χ - κ {\hat{y}}_{t}) d t + ε d {\hat{Z}}_{t}, {\hat{y}}_{t} = y, \end{matrix}

(8.155)

where ${\hat{Z}}_{t}$ is the standard Wiener processes. The risk-neutral evolution of the foreign exchange is governed by the following equation:

\begin{matrix} \frac{d {\hat{S}}_{t}}{{\hat{S}}_{t}} = {\hat{y}}_{t} d t + σ d {\hat{W}}_{t}, {\hat{S}}_{t} = S, \end{matrix}

(8.156)

or, equivalently,

\begin{matrix} d {\hat{x}}_{t} = ({\hat{y}}_{t} - \frac{1}{2} σ^{2}) d t + σ d {\hat{W}}_{t}, {\hat{x}}_{t} = x, \end{matrix}

(8.157)

where $\hat{x} = ln (\hat{S} / K) .$ In general, $d {\hat{Z}}_{t}$ and $d {\hat{W}}_{t}$ are correlated, so that $d {\hat{Z}}_{t} d {\hat{W}}_{t} = ρ d t .$

Consider the familiar backward Kolmogorov problem for European calls and puts:

\begin{matrix} U_{t} + \frac{1}{2} ε^{2} U_{r r} + ρ ε σ U_{r x} + \frac{1}{2} σ^{2} U_{x x} \\ + (χ - κ r) U_{y} + (y - \frac{1}{2} σ^{2}) U_{x} - r U = 0, \\ U (\overset{ˉ}{t}, y, x) = K {(ϕ (e^{x} - 1))}_{+} . \end{matrix}

(8.158)

As usual, start with the change of the dependent variable:

\begin{matrix} U = K B_{1} V . \end{matrix}

(8.159)

where $B = exp (α_{1} - β_{1} y)$ is the domestic bond price, given by (8.131), so that

\begin{matrix} B_{t} + \frac{1}{2} ε^{2} B_{r r} + (χ - κ r) B_{y} - r B = 0. \end{matrix}

(8.160)

Hence,

\begin{matrix} V_{t} + \frac{1}{2} ε^{2} V_{r r} + ρ ε σ V_{r x} + \frac{1}{2} σ^{2} V_{x x} \\ + (x - \frac{1}{2} ε^{2} β_{1} - κ r) V_{y} + (y - \frac{1}{2} σ^{2} - ρ ε σ β_{1}) V_{x} = 0, \\ V (\overset{ˉ}{t}, y, x) = {(ϕ (e^{x} - 1))}_{+} . \end{matrix}

(8.161)

Now, change independent variables $(t, y, x) \to (t, η_{1}, η_{2}),$ where

\begin{matrix} η_{1} = y, η_{2} = - α_{1} + β_{1} y + x . \end{matrix}

(8.162)

Thus,

\begin{matrix} \frac{\partial}{\partial t} & = \frac{\partial}{\partial t} + (- α_{1}^{'} + β_{1}^{'} η_{1}) \frac{\partial}{\partial η_{2}}, \\ \frac{\partial}{\partial y} & = \frac{\partial}{\partial η_{1}} + β_{1} \frac{\partial}{\partial η_{2}}, \frac{\partial}{\partial x} = \frac{\partial}{\partial η_{2}} . \end{matrix}

(8.163)

so that

\begin{matrix} V_{t} + (- α_{1}^{'} + β_{1}^{'} η_{1}) V_{η_{2}} \\ + \frac{1}{2} ε^{2} (V_{η_{1} η_{1}} + 2 β_{1} V_{η_{1} η_{2}} + β_{1}^{2} V_{η_{2} η_{2}}) + ρ ε σ (V_{η_{1} η_{2}} + β_{1} V_{η_{2} η_{2}}) + \frac{1}{2} σ^{2} V_{η_{2} η_{2}} \\ + (χ - \frac{1}{2} ε^{2} β_{1} - κ η_{1}) (V_{η_{1}} + β_{1} V_{η_{2}}) + (η_{1} - \frac{1}{2} σ^{2} - ρ ε σ β_{1}) V_{η_{2}} = 0, \\ V (\overset{ˉ}{t}, η_{1}, η_{2}) = {(ϕ (e^{η_{2}} - 1))}_{+} . \end{matrix}

(8.164)

Assume that $V (t, η_{1}, η_{2})$ only depends on $t, η_{2},$ $V (t, η_{1}, η_{2}) = V (t, η_{2}),$ which is consistent with the terminal condition. Thus,

\begin{matrix} V_{t} + (\frac{1}{2} ε^{2} β_{1}^{2} + ρ ε σ β_{1} + \frac{1}{2} σ^{2}) V_{η_{2} η_{2}} \\ + (- α_{1}^{'} + β_{1}^{'} η_{1} + (b_{1} - \frac{1}{2} ε^{2} β_{1} - κ η_{1}) β_{1} - \frac{1}{2} σ^{2} - ρ ε σ β_{1} + η_{1}) V_{η_{2}} = 0 \\ V (\overset{ˉ}{t}, η_{1}, η_{2}, η_{2}) = {(ϕ (e^{η_{2}} - 1))}_{+} . \end{matrix}

(8.165)

But

\begin{matrix} α_{1}^{'} - β_{1}^{'} η_{1} + \frac{1}{2} ε^{2} β_{1}^{2} - (χ - κ η_{1}) β_{1} - η_{1} = 0, \end{matrix}

(8.166)

so that

\begin{matrix} V_{t} + (\frac{1}{2} ε^{2} β_{1}^{2} + ρ ε σ β_{1} + \frac{1}{2} σ^{2}) (V_{η_{2} η_{2}} - V_{η_{2}}) = 0, \\ V (\overset{ˉ}{t}, η_{1}, η_{2}) = {(ϕ (e^{η_{2}} - 1))}_{+} . \end{matrix}

(8.167)

This is the classical Black–Scholes problem with time-dependent volatility:

\begin{matrix} V_{t} + \frac{1}{2} Σ^{2} (V_{η_{2} η_{2}} - V_{η_{2}}) = 0, \\ V (\overset{ˉ}{t}, η_{2}) = {(ϕ (e^{η_{2}} - 1))}_{+}, \end{matrix}

(8.168)

where

\begin{matrix} Σ^{2} = ε^{2} B_{κ}^{2} + 2 ρ ε σ B_{κ} + σ^{2} . \end{matrix}

(8.169)

Thus, the price is

\begin{matrix} U = B_{1} U^{(C, P)} (\frac{B_{2} S}{B_{1}}; T, K, \sqrt{\frac{\int Σ^{2} d s}{T}}), \end{matrix}

(8.170)

where $U^{(C, P)}$ are given by (8.20).

A similar technique can be used for the Heston model and the Stein–Stein model with stochastic interest rates. However, there is one significant difference between these two models - the former model works only when volatility and rate innovations are uncorrelated, while the latter model can handle arbitrary correlations.

9 Conclusions

Due to the space constraints, the discussion must be concluded here. It is left to the reader to explore further the application of mathematical tools and techniques based on Kelvin waves in financial engineering. Three particularly compelling problems are

the pricing and risk management of credit derivatives;
the exploration of mean-reverting trading strategies, such as pairs trading;
the examination of affine jump-diffusion and pseudo-differential processes.

References such as Reference Lipton and SheltonLipton and Shelton (2012), Reference Lipton and Lopez de PradoLipton and Lopez de Prado (2020), and others provide additional insights into these problems.

This Element has established a unified methodology for determining t.p.d.fs and expectations for affine processes through integral representations based on Kelvin waves. This approach has bridged various disciplines, uncovering profound connections between hydrodynamics, molecular physics, stochastic processes, and financial engineering. Both degenerate problems, which possess more independent variables than sources of uncertainty, and their nondegenerate counterparts are covered, showcasing the versatility of the method.

A surprising link is established between the Langevin equation for underdamped Brownian motion and the vorticity equation for two-dimensional flows in viscous incompressible fluids. Utilizing Kelvin wave expansions, the book solves several relevant financial problems, including the deriving convenient formulas for t.p.d.fs and expectations for processes with stochastic volatility, developing an analytically solvable model for path-dependent volatility, pricing of Asian options with geometric averaging, and pricing bonds and bond options by augmenting the short-rate process with its integral process.

The methodology introduced in this book can address a wide spectrum of complex problems, significantly enhancing the comprehension and modeling of stochastic systems across diverse fields.

Acknowledgments

I am grateful to my ADIA colleagues Majed Alromaithi, Marcos Lopez de Prado, Koushik Balasubramanian, Andrey Itkin, Oleksiy Kondratiev, Arthur Maghakian, Dmitry Muravey, Adil Reghai, other Q-team colleagues, my ADIA Lab colleague Horst Simon, and a former Bank of America colleague, Artur Sepp, for their encouragement and council. The kind invitation by Riccardo Rebonato to contribute to Cambridge Elements in Quantitative Finance is much appreciated. I am grateful to Drs. Nicola Ghazi and Piergiorgio Neri from Cleveland Clinic Abu Dhabi for saving the vision in my left eye, thus allowing me to finish this Element. Last but not least, the help of my wife, Marsha Lipton, especially her editorial suggestions and financial insights, has been critical in producing this Element.

Alexander Lipton is a Global Head of Research & Development at Abu Dhabi Investment Authority, an Advisory Board member at ADIA Lab, a Professor of Practice at Khalifa University, and a Connection Science Fellow at MIT. He is a Co-Founder of Sila, a company providing digital wallet & ACH payment services, and an advisory board member at several companies worldwide. From 2006 to 2016, Alexander was Co-Head of the Global Quantitative Group and Quantitative Solutions Executive at Bank of America. Before that, he held senior managerial positions at several leading financial institutions. Additionally, Alexander held visiting professorships at EPFL, NYU, Oxford, and Imperial College. Earlier, Alexander was a Full Professor at the University of Illinois and a Consultant at the Los Alamos National Laboratory. Risk Magazine awarded him the Inaugural Quant of the Year Award in 2000 and the Buy-side Quant of the Year Award in 2021. Alexander has authored/edited thirteen books and over a hundred scientific papers on nuclear fusion, astrophysics, applied mathematics, financial engineering, distributed ledgers, and quantum computing. He holds several US patents.

Riccardo Rebonato
EDHEC Business School
Editor Riccardo Rebonato is Professor of Finance at EDHEC Business School and holds the PIMCO Research Chair for the EDHEC Risk Institute. He has previously held academic positions at Imperial College, London, and Oxford University and has been Global Head of Fixed Income and FX Analytics at PIMCO, and Head of Research, Risk Management and Derivatives Trading at several major international banks. He has previously been on the Board of Directors for ISDA and GARP, and he is currently on the Board of the Nine Dot Prize. He is the author of several books and articles in finance and risk management, including Bond Pricing and Yield Curve Modelling (2017, Cambridge University Press).

About the series

Cambridge Elements in Quantitative Finance aims for broad coverage of all major topics within the field. Written at a level appropriate for advanced undergraduate or graduate students and practitioners, Elements combines reports on original research covering an author’s personal area of expertise, tutorials and masterclasses on emerging methodologies, and reviews of the most important literature.

Element contents

Hydrodynamics of Markets

Summary

Keywords

1 Introduction

1.1 Background

1.2 Main Results

1.3 Element Structure

2 Fluid Flows

2.1 Euler and Navier–Stokes Equations

2.2 Linear Flows

2.3 Kelvin Waves in an Incompressible Fluid

3 Kolmogorov Stochastic Process

3.1 Background

3.2 Summary of Kolmogorov’s Paper

3.3 Challenge and Response

3.4 Direct Verification

3.5 Solution via Kelvin Waves

3.6 Solution via Coordinate Transform

3.7 A Representative Example

4 Klein–Kramers Stochastic Process

4.1 Background

4.2 Langevin Equation

4.3 Klein–Kramers Equation

4.4 Chandrasekhar’s Solutions

5 Transition Probability Densities for Stochastic Processes

5.1 Motivation

5.2 Backward and Forward Equations

5.3 Augmentation Procedure

5.4 Reduction Procedure

6 Gaussian Stochastic Processes

6.1 Regular Gaussian Processes

6.1.1 Solution via Kelvin Waves

6.1.2 Solution via Coordinate Transform

6.2 Killed Gaussian Processes

6.2.1 Solution via Kelvin Waves

6.3 Example: Kolmogorov Process

6.4 Example: OU Process

6.4.1 OU Process

6.4.2 Gaussian Augmented OU Process

6.5 Example: Diffusion of Free and Harmonically Bound Particles

6.6 Example: Vorticity of Two-Dimensional Flows

7 Non-Gaussian Stochastic Processes

7.1 Regular Non-Gaussian Processes

7.2 Killed Non-Gaussian Processes

7.3 Example: Anomalous Kolmogorov Process

7.4 Example: Feller Process

7.4.1 Feller Process

Feller Process with Constant Parameters

Feller Process with Time-Dependent Parameters

Feller Process with Jumps

7.4.2 Augmented Feller Process, I

7.4.3 Augmented Feller Process, II

7.5 Example: Path-Dependent Process

7.6 Example: OU-Like Process

7.6.1 Anomalous OU Process

7.6.2 Non-Gaussian Augmented OU Process, I

7.6.3 Non-Gaussian Augmented OU Process, II

8 Pricing of Financial Instruments

8.1 Background

8.2 The Underlying Processes

8.3 European Derivatives

8.3.1 Forwards, Calls, Puts, and Covered Calls

8.3.2 Black–Scholes Model

8.3.3 Heston Model

8.3.4 Stein–Stein Model

8.3.5 Path-Dependent Volatility Model

8.3.6 Bachelier Model

8.4 Asian Options with Arithmetic and Geometric Averaging

8.5 Volatility and Variance Swaps and Swaptions

8.5.1 Volatility Swaps and Swaptions

8.5.2 Variance Swaps and Swaptions

8.6 Automated Market Makers

8.7 Bonds and Bond Options

8.7.1 Background

8.7.2 Vasicek Model

8.7.3 CIR Model

8.8 European Options with Stochastic Interest Rates

9 Conclusions

Acknowledgments