1. Introduction
For the last six and a half decades, an enormous effort has been devoted to the study of microinstabilities in magnetically confined plasmas. Mathematically, such instabilities can be described by the Boltzmann equation for the plasma particles coupled to Maxwell's equations for the electric and magnetic fields, but it is often sufficient to consider the somewhat simpler gyrokinetic system of equations (Rutherford & Frieman Reference Rutherford and Frieman1968; Taylor & Hastie Reference Taylor and Hastie1968; Catto Reference Catto1978; Antonsen & Lane Reference Antonsen and Lane1980; Catto, Tang & Baldwin Reference Catto, Tang and Baldwin1981; Frieman & Chen Reference Frieman and Chen1982; Brizard & Hahm Reference Brizard and Hahm2007; Krommes Reference Krommes2012; Catto Reference Catto2019). These equations apply if the instability wavelength perpendicular to the magnetic field is comparable to the ion or electron gyroradius, but the wavelength is much longer in the direction along the field, which is normally the case for the most important microinstabilities and turbulence afflicting magnetised plasmas in the laboratory. Gyrokinetics also finds fruitful application in other parts of plasma physics, such as astrophysics (Schekochihin et al. Reference Schekochihin, Cowley, Dorland, Hammett, Howes, Quataert and Tatsuno2009), and has been the subject of thousands of publications. Several millions of lines of computer code has been written for the purpose of numerically simulating gyrokinetic instabilities and turbulence (Kotschenreuther, Rewoldt & Tang Reference Kotschenreuther, Rewoldt and Tang1995; Garbet et al. Reference Garbet, Idomura, Villard and Watanabe2010).
As a result of this effort, a great deal of knowledge about various microinstabilities has been accumulated. Ion-temperature-gradient- (ITG-) and electron-temperature-gradient- driven modes, trapped-electron modes, kinetic ballooning modes and microtearing modes have, for instance, been found to be unstable and cause turbulence in tokamaks, stellarators and other fusion devices. However, a basic problem is that these and other instabilities tend to be sensitive to assumptions made about plasma parameters and the magnetic-field geometry. A cylindrical plasma does not have the same stability properties as a plasma slab, toroidal plasmas are different from cylindrical plasmas, and tokamaks and stellarators are also substantially different. As a result, little is known in general about gyrokinetic microinstabilities, despite the great effort devoted to their study.
In a recent publication (Helander & Plunk Reference Helander and Plunk2021), universal upper bounds on the growth rates of local gyrokinetic instabilities could nevertheless be derived in such a way that the results hold in any low-beta plasma, regardless of the magnetic geometry, number of particle species and collisions. The reason why these bounds are so general is they result from thermodynamic considerations. It is the budget of the Helmholtz free energy that constrains all instability growth rates to lie below the bounds in question. In the present paper, we provide more mathematical details of this calculation and extend it by showing how the bounds can be sharpened. In particular, we calculate the lowest possible bound on the growth rate that can be obtained from the free-energy budget of a plasma with ‘adiabatic’ electrons and a single kinetic ion species. In subsequent publications, such rates of ‘optimal growth’ will be derived in more complex cases that include both electrostatic and magnetic fluctuations. We will also show how the bounds can be lowered by simultaneously considering the budget of free energy and electrostatic energy, and compare them with gyrokinetic simulations. The present paper serves as an introduction to this series of publications.
2. Gyrokinetic system of equations
The mathematical setting of our considerations is that of local gyrokinetics. The distribution function of each species $a$ is written as (Catto Reference Catto1978)
where ${\boldsymbol r}$ denotes the particle position and ${\boldsymbol R} = {\boldsymbol r} - {\boldsymbol b} \times {\boldsymbol v} / \varOmega _a$ the gyrocentre position. Here, the magnetic field has been written as ${\boldsymbol B} = B {\boldsymbol b} = \boldsymbol {\nabla } \psi \times \boldsymbol {\nabla } \alpha$ in terms of Clebsch coordinates $(\psi,\alpha )$. If the magnetic field lines trace out toroidal surfaces, as in tokamaks and stellarators, a ballooning transform is necessary unless all field lines close on themselves. The gyrofrequency is $\varOmega _a = e_a B / m_a$, where $m_a$ denotes mass and $e_a$ charge. The equilibrium distribution function is taken to be Maxwellian, with density $n_a(\psi )$ and temperature $T_a(\psi )$ constant on magnetic surfaces, and no mean flow velocity. The particle velocity is denoted ${\boldsymbol v} = v_\| {\boldsymbol b} + {\boldsymbol v}_\perp$, the unperturbed energy by $E_a = m_a v^2 / 2 + e_a \varPhi (\psi )$, and the magnetic moment $\mu _a = m_a v_\perp ^2 / (2B)$ is a lowest-order constant of the motion. The geometry is taken to be that of a ‘flux tube’, i.e. a slender volume of plasma aligned with the magnetic field, with a rectangular cross-section in the $(\psi,\alpha )$-plane. Periodic boundary conditions on the fluctuations will be applied in this plane, so that all perturbations can be Fourier decomposed. For instance, the electrostatic potential fluctuations $\delta \phi$ are
where ${\boldsymbol k} = {\boldsymbol k}_\perp = k_\psi \boldsymbol {\nabla } \psi + k_\alpha \boldsymbol {\nabla } \alpha$ with $k_\psi$ and $k_\alpha$ independent of the arc length $l$ along the magnetic field. The Fourier coefficients must satisfy $\delta \phi ^\ast _{\boldsymbol k} = \delta \phi _{-\boldsymbol k}$ in order that the potential be real.
The ‘non-adiabatic’ part of the distribution function $g_a$ evolves according to the nonlinear gyrokinetic equation (Frieman & Chen Reference Frieman and Chen1982)
where $\omega _d = {\boldsymbol k} \boldsymbol {\cdot } {\boldsymbol v}_d$ denotes the drift frequency (with ${\boldsymbol v}_d$ being the unperturbed drift velocity),
and ${\rm J}_0$ and ${\rm J}_1$ are Bessel functions. The gyro-averaged and linearised collision operator between species $a$ and $b$ is denoted by $C_{ab}$, and the field perturbations are given by
Here and in the following, we write $\lambda _a = {n_a e_a^2}/{T_a}$ and ${\rm J}_{na} = {\rm J}_n(k_\perp v_\perp / \varOmega _a)$. Equation (2.7) expresses quasineutrality, (2.8) Ampère's law and (2.9) the condition that the sum of the thermal pressure and the magnetic pressure should be constant on the short length scale of the fluctuations. The volume element in velocity space is
where the sum is taken over both values of $\sigma = v_\| / | v_\| | = \pm 1$.
Note that we restrict our attention to the original gyrokinetic equation (2.3) of Frieman & Chen (Reference Frieman and Chen1982), which does not include equilibrium flows. We thus only consider instabilities caused by density and temperature gradients, but not those associated with velocity-space anisotropy or non-Maxwellian distribution functions, such as fast-ion-driven instabilities (Chen & Zonca Reference Chen and Zonca2016). Moreover, stabilisation or destabilisation associated with flow-velocity shear is not included in the analysis although it can be quite important in practice (see, e.g., Barnes et al. Reference Barnes, Parra, Highcock, Schekochihin, Cowley and Roach2011). It should be possible to include such effects by adding an appropriate term to (2.3), at least in the case that the equilibrium is axisymmetric (Artun & Tang Reference Artun and Tang1994; Parra, Barnes & Peeters Reference Parra, Barnes and Peeters2011). In non-symmetric equilibria, the situation is fundamentally more complicated because any equilibrium flow must be small (Helander Reference Helander2014).
As we show in the following, it is advantageous to introduce the function
where all quantities are evaluated at the gyrocentre position $\boldsymbol R$. The quasineutrality condition then becomes
where $\varGamma _0(x) = {\rm I}_0(x) {\rm e}^{-x}$, $b_a = k_\perp ^2 \rho _a^2 = k_\perp ^2 T_a / (m_a \varOmega _a^2)$ and we have used an integral given in appendix A. In the following, we sometimes write $\varGamma _{0a}$ instead of $\varGamma _0(b_a)$.
3. Helmholtz free energy
The budget of Helmholtz free energy has been considered by several authors, e.g. Krommes & Hu (Reference Krommes and Hu1993), Brizard (Reference Brizard1994), Sugama et al. (Reference Sugama, Okamoto, Horton and Wakatani1996), Garbet et al. (Reference Garbet, Dubuit, Asp, Sarazin, Bourdelle, Ghendrih and Hoang2005), Schekochihin et al. (Reference Schekochihin, Cowley, Dorland, Hammett, Howes, Quataert and Tatsuno2009), Banon Navarro et al. (Reference Banon Navarro, Morel, Albrecht-Marc, Carati, Merz, Goerler and Jenko2011), Hatch et al. (Reference Hatch, Jenko, Navarro, Bratanov, Terry and Pueschel2016) and Stoltzfus-Dueck & Scott (Reference Stoltzfus-Dueck and Scott2017), and is obtained by multiplying the gyrokinetic equation (2.3) by $T_a g_a^\ast / F_{a0}$, taking the real part, summing over all species and wavenumbers, integrating over velocity space and, finally, taking an average over the volume of the flux tube, which we denote by angular brackets,
We note that the average could also be defined keeping $L$ finite, e.g. for periodic systems, without affecting what follows. In order for the integral to converge, we require that the functions $\bar {\chi }_{\boldsymbol k}(l)$ should be bounded. On the left-hand side of (2.3), this operation,
annihilates the second term because
where we have used (2.10) and assumed that $|g_{a, {\boldsymbol k}}|^2$ remains bounded as $l \rightarrow \infty$.Footnote 1 The operation also eliminates the third term because $\omega _{da}$ is real, and the fourth term because
where ${\boldsymbol q} = {\boldsymbol k}' - {\boldsymbol k}$ and we have used $g^\ast _{a,\boldsymbol k} = g_{a,-\boldsymbol k}$. Because the right-hand side changes sign if ${\boldsymbol k}$ and $\boldsymbol q$ are interchanged, the result vanishes upon summation over $\boldsymbol k$ and ${\boldsymbol q}$. The remainder of the equation thus becomes
where
is negative or vanishes by Boltzmann's $H$-theorem. By using the field equations (2.7)–(2.9), we find
where $| \delta {\boldsymbol B}_{\boldsymbol k} |^2 = | k_\perp \delta A_{\|\boldsymbol k} |^2 + | \delta B_{\|\boldsymbol k} |^2$ and, thus, we obtain our key equation:
where we have written
It is helpful to write $H$ in terms of $\delta F_a$, defined in (2.11), instead of $g_a$:
which makes it clear that $H$ can never be negative and only vanishes if all distribution-function perturbations $\delta F_a$ vanish everywhere in phase space. The first term in $H$ is recognised from the Gibbs entropy formula: if $F = F_0 + \delta F$, then to second order in $\delta F$,
which motivates us to define
Furthermore, we write
and note that, in the short-wavelength limit, $b_a = (k_\perp \rho _a)^2 \ll 1$, $\varGamma _0(b_a) = 1 - b_a + O(b_a^2)$, so that
where the first term represents the kinetic energy of ${\boldsymbol E} \times {\boldsymbol B}$ motion and the second term magnetic energy. We thus arrive at the formula
with $U$ denoting the energy of the fluctuations and $S_a$ their entropy, suggesting that $H$ describes the Helmholtz free energy of the fluctuations and (3.8) the budget of this energy. Indeed, on the right-hand side of this equation $C$ reflects the increase in entropy due to collisions, and $D$ can be written as
Here
describes the gyrocentre velocity perturbation due to the fluctuations, and the radial particle and heat fluxes are
The term in (3.17) involving $\varGamma _a$ is thus suggestive of the thermodynamic work performed by the particle flux against the pressure gradient, and the term involving $q_a$ relates to entropy production due to a heat flux down the temperature gradient.
Thanks to the nonlinear term in the gyrokinetic equation, free energy can be transferred between different wavenumbers and be ‘cascaded’ to small scales, where it is dissipated by collisions, much like kinetic energy in Navier–Stokes turbulence. The way in which this occurs and gives rise to a turbulent spectrum of fluctuations has been studied extensively in the literature (Schekochihin et al. Reference Schekochihin, Cowley, Dorland, Hammett, Howes, Quataert and Tatsuno2009; Tatsuno et al. Reference Tatsuno, Dorland, Schekochihin, Plunk, Barnes, Cowley and Howes2009; Banon Navarro et al. Reference Banon Navarro, Morel, Albrecht-Marc, Carati, Merz, Goerler and Jenko2011; Stoltzfus-Dueck & Scott Reference Stoltzfus-Dueck and Scott2017). We shall use the free-energy budget (3.8) for a different purpose, namely, to derive rigorous upper bounds on linear and nonlinear growth rates. Outside the realm of gyrokinetics, this has earlier been accomplished for linear instabilities by Fowler and co-workers (Fowler Reference Fowler1964, Reference Fowler1968; Brizard et al. Reference Brizard, Fowler, Hua and Morrison1991).
4. Cauchy–Schwarz inequalities
For simplicity, we restrict our considerations to low-beta plasmas, where fluctuations in the magnetic-field strength can be neglected, $\delta B_\| = 0$. This approximation is common in the literature but will be removed in the next publication in this series of papers.
Our basic mathematical tools are the triangle and Cauchy–Schwarz inequalities, which limit the amplitude of field fluctuations that are possible given a certain entropy budget. For instance, it follows from the field equation (2.12) that the electrostatic potential is bounded by
Thus, if we measure the relative entropy perturbation at the scale $\boldsymbol k$ of each species $a$ by the dimensionless quantity
then it follows that the electrostatic potential is subject to the bound
Analogously, it follows from Ampère's law (2.8) that the magnetic potential is limited by
i.e.
where $\beta _a(l) = 2 \mu _0 n_a T_a / B^2$. In the last, approximate equality, we have recognised the fact that the sum is usually dominated by the contribution from the electrons thanks to their small gyroradius. Because $k_\perp \rho _e$ is small for most instabilities of interest, the inequality (4.5) is not very restrictive, but it is nevertheless valuable as it implies that gyrokinetic instabilities are electrostatic in the limit $\beta _e \rightarrow 0$. Moreover, it is used in the following to demonstrate that the growth rate remains bounded in the limit $k_\perp \rho _e \rightarrow \infty$.
We can also apply the triangle and Cauchy–Schwarz inequalities to the free-energy production rate (3.9):
where the functions
can be expressed in terms of modified Bessel functions as
using integrals given in appendix A. In the limits of very small and very large wavelength, respectively, the asymptotic forms of these functions are
5. Upper bounds on linear growth rates
In this section, we temporarily consider linear instabilities and thus focus on a single pair of wavenumbers $(k_\psi,k_\alpha )$. Thanks to Boltzmann's $H$-theorem, the quantity $C({\boldsymbol k},t)$ is always negative and the relation (3.8) thus implies an upper bound on the linear growth rate
As we have already bounded $D$ from above, we merely need to find a suitable bound on
from below to derive an upper bound on $\gamma ({\boldsymbol k})$. Some care is needed to construct reasonably tight bounds, but all results are largely independent of the geometry of the magnetic field because the second and third terms from (2.3) do not contribute to the free-energy balance equation (3.8). The bound (5.1) therefore only depends on the magnetic geometry through the two quantities $B(l)$ and $k_\perp (l) = | k_\psi \boldsymbol {\nabla } \psi + k_\alpha \boldsymbol {\nabla } \alpha |$.
5.1. Adiabatic electrons
We begin by considering the simplest case of a hydrogen plasma with a Boltzmann-distributed, or so-called ‘adiabatic’, electron response, where $g_e$ is taken to vanish. This is the traditionally simplest gyrokinetic model of ITG and trapped-ion instabilities, which account for a substantial fraction of the turbulence and transport in tokamaks and stellarators, and therefore has been the subject of hundreds, if not thousands, of publications. As $g_e$ vanishes and there are no magnetic fluctuations [in the approximation used in (4.5)], the free energy becomes
where $n = n_i = n_e$ and $\tau = T_i / T_e$. Furthermore, the quasineutrality condition (2.12) reduces to
and the bound (4.3) is thus replaced by the more stringent condition
Thanks to this inequality, the free energy satisfies
The free-energy production term can be simplified somewhat because the quasineutrality condition (5.4) in the case of adiabatic electrons implies that there is no particle flux. Indeed, the flux from (3.19),
vanishes because of (2.12), and $D$ thus becomes
As a result, in the inequality (4.6), the function $M(\eta,b)$ can be replaced by
and the bound (5.1) becomes
Here, $\tilde M (\eta _i, b_i)$ is a decreasing function of $b_i$, and the denominator is an increasing function of the same quantity. The right-hand side is thus maximised by choosing $|\delta \phi _{\boldsymbol k}(l)|^2 = \delta (l-l_0)$, where $l_0$ is the position along the field line where $b_i(l) = k_\perp ^2 \rho _i^2 \propto (k_\perp /B)^2$ is minimised. We thus obtain
where $b_\textrm {min} = b_i(l_0)$. The result is plotted in figure 1. Note that all dependence on the geometry of the magnetic field has disappeared: our limit on the growth rate is spatially local in nature and only depends on the minimum value of $k_\perp \rho _i$.
This bound, which applies to all local gyrokinetic instabilities in a plasma with adiabatic electrons, is not optimal and can be improved by a factor of approximately two, as we show in the next section. Nevertheless, it displays scalings that have been seen in many publications and numerical simulations over the years. For long wavelengths, $b_i \rightarrow 0$, it reduces to
Note that all dependence on the magnetic geometry has disappeared, and because $\omega _{\ast i} \propto k_\alpha$ the growth rate is proportional to $k_\alpha$ in this limit. For short wavelengths, $k_\perp \rho _i \gg 1$, the bound remains finite,
because
is a positive-definite quadratic form in $k_\psi$ and $k_\alpha$. Indeed, $\gamma (k_\psi,k_\alpha )$ approaches a finite constant in the limit $k_\alpha \rightarrow \infty$ and vanishes if $k_\psi \rightarrow \infty$ at fixed $k_\alpha$. Moreover, at constant ion temperature, the bound (5.11) increases with the electron temperature through the scaling with $\tau$, which is a well-known feature of numerical simulations and analytical dispersion relations in explicitly tractable limits (Biglari, Diamond & Rosenbluth Reference Biglari, Diamond and Rosenbluth1989; Romanelli Reference Romanelli1989; Plunk et al. Reference Plunk, Helander, Xanthopoulos and Connor2014; Zocco et al. Reference Zocco, Plunk, Xanthopoulos and Helander2018). This unfortunate scaling is thought to degrade energy confinement in electron-heated tokamaks and stellarators.
5.2. Electromagnetic instabilities
We now turn to the more general case of an arbitrary number of kinetic species, but still restrict our attention to instabilities with $\delta B_\| = 0$. No attempt will be made to make the bound as low as possible. Our main concern is to show that an upper bound exists and that it is itself bounded as a function of $\boldsymbol k$, so that there is a universal upper bound on the growth rate at any wavelength. This result will be of crucial importance when we consider nonlinear growth in a subsequent section. In the next publication of this series, we show how to extend the calculation to include fluctuations of the magnetic field strength and how to compute the lowest possible bounds in this context.
We begin by seeking lower bounds on $H$ under the constraints (4.3) and (4.5), which lead us to a simple quadratic minimisation problem treated in appendix B, where the minimum
subject to the constraint
is found for the case that $q_a$ and $p_a$ are positive real numbers. In terms of this notation, we first choose $x_a = \sqrt {s_a}$, $p_a = n_a |e_a| \sqrt {\varGamma _{0a}}$, $q_a = n_a T_a$ and
and then obtain
As a result of this inequality, we conclude from (5.2) that $H \ge \left \langle L |\delta \phi _{\boldsymbol k} |^2 \right \rangle$ with
Similarly, by instead choosing $c = |k_\perp \delta A_{\|\boldsymbol k} |/\mu _0$ and
we find
where $\beta _a = 2 \mu _0 n_a T_a / B^2$. Because the gyroradius of the electrons is usually much smaller than that of any ion species and $\varGamma _{a0} = \varGamma _0(b_a)$ is a decreasing function of particle mass, only the electrons need to be kept in the sum over species, and we conclude that $H$ is bounded from below by
with
We are now ready to apply our basic upper bound (5.1), where we use (4.6) and
to conclude that
where the contribution from ions to the electromagnetic term in $D$ has been neglected, being a factor of order $m_e/m_i$ smaller than the electron contribution. As $L$ is an increasing function of the quantities $b_a$, which are all proportional to $(k_\perp / B)^2$, the first term on the right is maximised if $|\delta \phi _{\boldsymbol k}(l)|^2$ is chosen to be a delta function in the point $l_0$ where the function $k_\perp (l) / B(l)$ attains its minimum. Similarly, the second term is maximised by choosing $| \delta A_{\| \boldsymbol k}(l) |^2 \propto \delta (l-l_1)$ where $l_1$ is the point where $K(l)/N(l)$ is minimised. We thus arrive at the result
Apart from the neglect of terms of order $m_e/m_i$ and fluctuations in the magnetic-field strength, $\delta B_\|$, this upper bound on the growth rate is completely general and applies to any local gyrokinetic instability. It applies to ITG and electron-temperature-gradient modes, kinetic and resistive ballooning modes, trapped-ion and trapped-electron modes and microtearing modes, as well as to the so-called universal and ubiquitous instabilities.
A particularly simple and important case is that of a hydrogen plasma without other ions and $k_\perp \rho _e \ll 1$. Noting that $\omega _{\ast i} = - \tau \omega _{\ast e}$ and using the asymptotic forms (4.11) and (4.12), we find
where the first term on the right is evaluated at $l=l_0$ and the second term (which is proportional to $\beta _e$) at $l=l_1$. Both terms give an upper bound on $\gamma$ that remains finite in the long-wavelength limit because $\omega _{\ast e}$ is proportional to $k_\alpha$ and
in the limit $b_i \ll 1$. Furthermore, as long as $k_\perp \rho _e \ll 1$, the growth rate is subject to a bound equal to
where $C_0$ and $C_1$ are numbers of order unity, $v_{Ti}$ denotes the ion thermal speed and $L_\perp$ the length scale of the equilibrium density and temperature gradients. In the opposite limit, $k_\perp \rho _e \gg 1$, the term proportional to $\beta _e$ can be neglected and we instead obtain
where $v_{Te}$ denotes the electron thermal speed and $C_2$ is a number of order unity.
6. Optimal bounds
The bounds (5.11) and (5.27) are not optimal and can be improved. In this section, we derive the best possible bound, in a sense that will be made precise, for the simplest case of a hydrogen plasma with adiabatic electrons. If $\varphi = e \delta \phi _{\boldsymbol k} / T_i$ and $g = g_{i\boldsymbol k}$, we have
where $x^2 = m_i v^2 / 2 T_i$. Here $D$ and $H$ are thus quadratic functionals of $g$, and the challenge is to maximise the ratio $D[g]/H[g]$ over all such functions.
In order to do so, we first note that $D$ and $\varphi$ only depend on two moments of $g$, namely,
where $j = 0$ or 1. We can therefore begin by minimising $H[g]$ over all functions with given values of these two moments. Using Lagrange multipliers, $c_0$ and $c_1$, we are thus led to minimise the functional
which gives
We have thus reduced our problem to that of finding the maximum value of $D/H$ expressed as a ratio of two quadratic forms in the coefficients $c_j$. Note that conventional eigenmodes, i.e. functions satisfying the linearised version of (2.3) are, in general, not of the form (6.6). This equation describes modes of optimal free-energy growth, which are distinct from eigenmodes and will be studied in greater detail in Part 2 of this series of papers.
If we write
so that
then
where
and
In order to maximise the ratio and calculate
we consider the variations
where $\textrm {c.c.}$ stands for the complex conjugate, and we note that the maximum is reached when
which gives a system of equations
which has non-zero solutions if
This is the ‘optimal’ bound on the growth rate that can be obtained within our formalism in the sense that no lower bound is possible. Indeed, growth of the free energy at this rate is realised if no collisions are present and the distribution function is chosen as dictated by (6.6) with $c_0$ and $c_1$ satisfying the eigenvalue problem (6.18). The bound (6.20) is shown in figure 1 and is lower than our previous result (5.11) by a factor of 2 and $\sqrt {5}$ in the limits of long and short wavelengths, respectively,
7. Bounds on nonlinear growth
Our most general bound (5.27) is not optimal and will be improved substantially in our next publication, but its most important implication follows already from this crude form. The right-hand side is a bounded function of the mode numbers $(k_\psi, k_\alpha )$, and the linear growth rate can therefore never exceed the maximum
As we now show, this conclusion also holds for nonlinear growth.
Consider the evolution of a set of fluctuations governed by the gyrokinetic system of equations starting from some arbitrary initial condition, specified by the distribution functions $\delta F_a$ of all species at $t=0$. According to (3.8) the instantaneous growth of the total free energy,
is bounded by
where each term is subject to the bound
The growth rate of the total free energy is therefore limited by twice the maximum linear growth
This bound holds for fluctuations of arbitrary amplitude within the gyrokinetic formalism. In particular, it must hold in any gyrokinetic simulation of turbulence.
Moreover, if collisions are absent, then instantaneous growth of the free energy is possible at any positive rate up to the ‘optimal’ one, which for the particularly simple case of adiabatic electrons was derived in the previous subsection. To see this, suppose the bounds on the right-hand side of (7.1) are chosen optimally in the sense that
where $D$ and $H$ are now considered to be quadratic functionals of the distribution functions $g = \{ g_a \}$ of all species. This means, then, that there is a choice of wavenumber and initial data such that the free energy grows at a rate arbitrarily close to $2 \gamma _\textrm {max}$. Conversely, there is a similar limit on the rate at which the free energy can decay in the absence of collisions,
as follows from the observation that $D[g,{\boldsymbol k}]$ is odd in the wavenumber $\boldsymbol k$ at fixed $g$ whereas $H$ is even. The transformation ${\boldsymbol k} \rightarrow - {\boldsymbol k}$ thus changes the sign of the ratio ${D[g,{\boldsymbol k}]}/{H[g,{\boldsymbol k}]}$ if $g$ is held constant.Footnote 2 Any upper bound on this ratio therefore automatically implies a similar lower bound when collisions are absent.
8. Conclusions
As we have shown, it is possible to derive rigorous upper bounds on the growth rate of linear instabilities and on the nonlinear growth of free energy in gyrokinetics. Unlike most other results in the field, these bounds are universal and hold in plasmas with any number of particle species regardless of collisionality and magnetic-field geometry. For simplicity, we have taken the plasma pressure (beta) to be sufficiently small that fluctuations in the magnetic-field strength can be neglected, $\delta B_\| = 0$, but this restriction will be removed in Part 2 in the present series of papers.
In the case of a plasma with a single kinetic ion species and ‘adiabatic’ electrons, the bound is given by (6.20) and is of order
for $k_\perp \rho _i < 1$ and
for shorter wavelengths. The dependence on the parameter $\tau = T_i/T_e$ reflects a well-known unfavourable dependence of the ITG growth rate on electron temperature.
The bound (5.27) we found on instabilities with kinetic electrons is less restrictive and remains finite in the limit $k_\perp \rho _i \rightarrow 0$. It is a sum of two distinct contributions: an electrostatic term and an electromagnetic term that vanishes if $\beta _e \rightarrow 0$. As we shall show in the next publication of this series, this result is not qualitatively affected by the inclusion of parallel magnetic fluctuations.
Actual microinstability growth rates must lie below these bounds. For instance, toroidal ITG modes with adiabatic electrons and $k_\perp \rho _i \ll 1$ have growth rates
in the strongly driven limit (Biglari et al. Reference Biglari, Diamond and Rosenbluth1989; Romanelli Reference Romanelli1989; Plunk et al. Reference Plunk, Helander, Xanthopoulos and Connor2014; Zocco et al. Reference Zocco, Plunk, Xanthopoulos and Helander2018), and trapped-ion modes have a similar growth rate (Biglari et al. Reference Biglari, Diamond and Rosenbluth1989). Here $R$ denotes the radius of curvature of the magnetic field, so that $\omega _{di} \sim (k_\perp \rho _i) v_{Ti} / R$. Due to the assumption $| \omega _{di} /\omega _{\ast i} |\sim L_\perp / R \ll 1$ (corresponding to strong instability drive) made in the derivation of this estimate, the growth rate is smaller than our upper bound. Similarly, in the theory of kinetic ballooning modes, the assumption $L_\perp / R \ll 1$ leads to growth rates of order (Tang, Connor & Hastie Reference Tang, Connor and Hastie1980; Aleynikova et al. Reference Aleynikova, Zocco, Xanthopoulos, Helander and Nührenberg2018)
This growth rate never exceeds our bound (5.28) and scales as our estimate (5.30). In less strongly driven cases, the growth rate is lower.
Although all our results are quite general, they do not encompass all instabilities of interest. Kink modes and tearing modes sometimes need a gyrokinetic treatment in a thin layer around a resonant magnetic surface, where magnetic reconnection may occur, but take their energy from the exterior region and depend on the overall plasma current profile (Hazeltine, Dobrott & Wang Reference Hazeltine, Dobrott and Wang1975; Drake & Lee Reference Drake and Lee1977). Such instabilities cannot adequately be described in the geometry of a magnetic flux tube (Connor et al. Reference Connor, Hastie, Pusztai, Catto and Barnes2014, Reference Connor, Ham, Hastie and Zocco2019) and are not subject to the bounds derived in the present paper. Mathematically, they are not covered by our treatment because the solution of the gyrokinetic equation involves matching to the exterior region, whose destabilising influence is usually described by a parameter $\varDelta '$, making these modes non-local in nature. However, microtearing modes which are driven by local gradients are subject to our bound (5.27) on electromagnetic instabilities.
As already remarked, instabilities driven by equilibrium flow shear would need an additional term in the gyrokinetic equation. The parallel-velocity-gradient instability, which could then be treated, is known to be capable of causing subcritical turbulence below the linear stability threshold due to transiently growing modes (Barnes et al. Reference Barnes, Parra, Highcock, Schekochihin, Cowley and Roach2011). The latter would, however, be subject to bounds similar to those we have derived, because these also apply to nonlinear growth and thus limit the possibility of subcritical turbulence excitation.
Acknowledgements
Editor A. Schekochihin thanks the referees for their advice in evaluating this article.
Funding
This work was partly supported by a grant from the Simons Foundation (560651, PH).
Declaration of interests
The authors report no conflict of interest.
Appendix A. Gaussian integrals involving Bessel functions
The following integrals are used in several places
where $\varGamma _n(b) = \textrm {I}_n(b) \textrm {e}^{-b}$ and $\textrm {I}_n$ denotes modified Bessel functions. These functions have the asymptotic forms
Appendix B. A quadratic minimisation problem
Consider the problem of minimising
where ${\boldsymbol x} = (x_1, x_2, \ldots )$ subject to the constraint
where $q_a$ and $p_a$ are positive real numbers. This problem is not difficult to solve by considering the function
where $\lambda$ is a Lagrange multiplier. The conditions
lead to
and