Application of Lagrangian techniques for calculating the on-axis rotational transform

S. Guinchard; W. Sengupta; S.R. Hudson

doi:10.1017/S0022377824001648

Application of Lagrangian techniques for calculating the on-axis rotational transform

Published online by Cambridge University Press: 12 February 2025

and

S. Guinchard*: Affiliation:
Princeton Plasma Physics Laboratory, Princeton, NJ 08540, USA
W. Sengupta: Affiliation:
Department of Astrophysical Sciences, Princeton University, Princeton, NJ 08543, USA
S.R. Hudson: Affiliation:
Princeton Plasma Physics Laboratory, Princeton, NJ 08540, USA
*: †Email address for correspondence: [email protected]

Article contents

Abstract
The need for and the origin of the rotational transform
Description of the magnetic field-line action
Verification of Mercier's formula for the Floquet exponent
Discrete formalism: piecewise action
Conclusion
Funding
Declaration of interests
Data availability
References

Rights & Permissions

Abstract

The Floquet exponents of periodic field lines are studied through the variations of the magnetic action on the magnetic axis, which is assumed to be elliptical. The near-axis formalism developed by Mercier, Solov'ev and Shafranov is combined with a Lagrangian approach. The on-axis Floquet exponent is shown to coincide with the on-axis rotational transform. A discrete solution suitable for numerical implementation is introduced, which gives the Floquet exponents as solutions to an eigenvalue problem. This discrete formalism expresses the exponents as the eigenvalues of a $6\times 6$ matrix.

Keywords

magnetic action rotational transform near-axis expansion Lagrangian integration Mercier Solov'ev-Shafranov variation

Type: Research Article
Information: Journal of Plasma Physics , Volume 91 , Issue 1 , February 2025 , E34

DOI: https://doi.org/10.1017/S0022377824001648 [Opens in a new window]

NASA ADS Abstract Service [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: Copyright © The Author(s), 2025. Published by Cambridge University Press

1 The need for and the origin of the rotational transform

Magnetostatic equilibria are characterized by the following equations:

(1.1a-c)\begin{equation} \boldsymbol{\nabla} p = {\boldsymbol{J}} \times {\boldsymbol{B}}, \quad \boldsymbol{\nabla} \times {\boldsymbol{B}} = {\boldsymbol{J}}, \quad \boldsymbol{\nabla}\boldsymbol{\cdot} {\boldsymbol{B}} = 0, \end{equation}

where ${\boldsymbol {B}}$ is the magnetic field, ${\boldsymbol {J}}$ is the current density and $p$ is the scalar pressure. With the assumption that $p$ is not constant in a small region and the surface lies in a bounded volume of space, the boundary must be topologially toroidal (Kruskal & Kulsrud Reference Kruskal and Kulsrud1958). The phase portrait of ${\boldsymbol {B}}$, where the magnetic field lines are treated like integral curves of a Hamiltonian dynamical system, is characterized by the topology of the level sets of $p$. A magnetic field line action can be defined (Cary & Littlejohn Reference Cary and Littlejohn1983), which serves as a starting point for the Lagrangian integration carried out in this paper.

For magnetic confinement of plasmas in toroidal geometries, that the magnetic field lines rotate poloidally (the short way) as they rotate toroidally (the long way) around the torus is essential for cancelling charged particle drifts, which would otherwise lead to loss (Spitzer Reference Spitzer1958). The number of poloidal rotations that a field line achieves per toroidal period is called the rotational transform, $\iota$ (Spitzer Reference Spitzer1958).

Kruskal & Kulsrud consider the case where ‘$p$ is reasonably smooth and not constant in any region’, and this entails that the magnetic field lines lie on nested flux surfaces. In the following, we do not place such a constraint on the pressure and we do not require that the magnetic field has nested flux surfaces. The existence of nested magnetic surfaces, their link with continuous symmetries and the preservation of magnetic surfaces under small symmetry-destroying perturbation are widely discussed in Kolmogorov (Reference Kolmogorov1954), Arnol'd (Reference Arnol'd1963) and Moser (Reference Moser1962) and also in texts and reviews: Moser (Reference Moser1973), Arnold (Reference Arnold1978), Lichtenberg & Lieberman (Reference Lichtenberg and Lieberman1992) and Meiss (Reference Meiss1992). Herein, we only require that the magnetic field is ‘toroidal’, which we describe as follows. We consider a magnetic configuration in the domain $\varOmega$, which is assumed to be a solid torus, with ${\boldsymbol {B}}\boldsymbol {\cdot } \boldsymbol {\nabla } \phi >0$ everywhere where $\phi$ is the toroidal angle, and with ${\boldsymbol {B}}\boldsymbol {\cdot }{\boldsymbol {n}}=0$ where ${\boldsymbol {n}}$ is normal to the boundary. For any poloidal section $\varSigma$ of $\varOmega$, the Poincaré return map is defined by the intersection of field lines and $\varSigma$ after one toroidal period. Brouwer's fixed-point theorem (Brouwer Reference Brouwer1910) ensures that there will be at least one fixed point of the Poincaré first return map.

Taking such a point as the origin of polar-like coordinates, e.g. $(r,\theta )$, we may consider the effect of iterating the mapping on nearby points, where the point $(r_n,\theta _n)$ gets mapped to $(r_{n+1},\theta _{n+1})$. If, upon iterating the return map, the points rotate around the fixed point, then the fixed point is called elliptic. (A more rigorous definition of elliptic fixed points is given in Meiss (Reference Meiss1992).) There may be more than one fixed point, and not all fixed points are elliptic. Even though the methods of Lagrangian integration as described below can be applied more generally, for clarity of exposition we restrict our attention to the case of an isolated elliptic fixed point. Stellarators are typically designed to have one easily identifiable elliptic fixed point about which most field lines rotate, and this is called the magnetic axis.

This sequence of angular displacements, $\Delta \theta = \theta _{n+1}-\theta _n$, enables one to define the rotational transform of a field line:

(1.2)\begin{equation} \iota := \lim_{N\uparrow \infty}\frac{1}{2{\rm \pi} N}\sum_n^N\Delta\theta_n. \end{equation}

Even though the angle coordinate becomes degenerate at the origin in polar-like coordinates, the common approach is to define the on-axis rotational transform by taking the limit as the starting point gets closer to the axis.

Note that the this definition of the rotational transform is purely geometric, in that the rotational transform effectively measures how many times a given magnetic field line links the magnetic axis. A discussion of the rotational transform as an asymptotic linking of neighbouring field lines can be found in Arnold (Reference Arnold2014).

Mercier (see Mercier Reference Mercier1964; Helander Reference Helander2014) expressed the on-axis rotational transform as an integral along the axis:

(1.3)\begin{equation} \iota = N + \frac{1}{2{\rm \pi} } \oint \frac{{\rm d}\ell}{\cosh{\eta}} \left( \frac{J_0}{2B_0}+\delta' - \tau\right) , \end{equation}

with $\ell$ denoting the arc length. The on-axis current density and magnetic field are denoted by $J_0$ and $B_0$, respectively, and the torsion of the axis by $\tau$. The eccentricity of flux surfaces is described by $\eta$ and $\delta$ is a parameter describing their rotation around the axis, with $\delta ' := {\rm d}\delta /{\rm d}\ell$. Here $N$ is an integer coming from the phase of the rotation term $\delta$ (see Pfefferlé et al. Reference Pfefferlé, Gunderson, Hudson and Noakes2018). Note that Spitzer (Reference Spitzer1958) had identified independently that a way to generate some rotational transform is to give torsion to an axis. This expression provides invaluable insight, showing how rotational transform can be produced by plasma currents, as is used by tokamaks, or by geometrical shaping, as is used by stellarators.

One may also encounter different definitions of the rotational transform. Assuming that the magnetic configuration possesses magnetic surfaces, ensuring that the toroidal and poloidal magnetic fluxes $\psi$ and $\chi$ respectively, can be properly defined, then as shown by Mercier et al. (Reference Mercier1974), $\iota$ can also be expressed as the ratio of differential fluxes:

(1.4)\begin{equation} \iota = \frac{{\rm d}\chi}{{\rm d}\psi}. \end{equation}

Mercier's expression (1.3) for instance was derived starting from the flux definition (1.4). The fluxes were expanded in a power series of the distance to the axis by constructing custom polar coordinates, so-called Mercier coordinates, which are described in Appendix C.

In this paper, we present a derivation of Mercier's formula using Lagrangian integration. This approach expresses the rotational transform as a Floquet exponent of the field lines. The relationship between the rotational transform and the Floquet exponent was described by Greene (Reference Greene1979), and more recently by Duignan & Meiss (Reference Duignan and Meiss2021). In § 2, the magnetic field-line action is defined. Stationary curves of the action are shown to be field lines, enabling one to identify a magnetic axis. Assumption is made in this paper that the axis is elliptical, but the formalism can be applied to the hyperbolic case. In § 3, Mercier's formula (1.3) is derived from the second variation of the action. The result is obtained through a near-axis expansion of the null eigenspace of the second variation operator. The periodicity enables the use of a Floquet description of the solutions. This same result is derived through the theory of the Hill infinite determinant, analogous to Schrödinger's equation in a periodic potential. In both cases, the rotational transform is shown to be a Floquet exponent of the null eigenspace of the second variation operator. In § 4, a discrete formalism is introduced so that the Floquet exponents can be solved for numerically.

2 Description of the magnetic field-line action

Let us start by defining the magnetic field line action, as introduced by Cary & Littlejohn (Reference Cary and Littlejohn1983). This is defined as a line integral that, for a given magnetic field, depends only the integration contour. Herein, the integration contours are assumed to be curves that close after one toroidal period. Let us consider a closed, differentiable curve $\mathcal {C} \subset \mathbb {R}^3$ with total length $L$, closing after one toroidal transit. The latter assumption that the curve closes after one toroidal period is a necessary condition for a magnetic axis. Curve ${\mathcal {C}}$ is parametrized by the $C^1$, $L$-periodic vector-valued function

(2.1)\begin{equation} \left. \begin{gathered} {\boldsymbol{x}}:[0,L]\rightarrow \mathbb{R}^3\\ \ell \mapsto {\boldsymbol{x}}(\ell)\in {\mathcal{C}}, \end{gathered} \right\} \end{equation}

$\ell$ being the arc length and we note ${\boldsymbol {x}}' := \textrm {d}{\boldsymbol {x}}/\textrm {d}\ell$. The action is defined as the circulation along $\mathcal {C}$ of the magnetic vector potential ${\boldsymbol {A}}$, with ${\boldsymbol {B}} = \boldsymbol {\nabla }\times {\boldsymbol {A}}$ in $\varOmega$:

(2.2)\begin{equation} \mathcal{S} := \oint_{\mathcal{C}}\,{\rm d}\ell\, {\boldsymbol{A}} \boldsymbol{\cdot} {\boldsymbol{x}}'. \end{equation}

Properties of the magnetic field are accessible through calculus of variations, performed on the action. The variations are performed with respect to changes in the geometry of the curve. The first variation with respect to a variation $\delta {\boldsymbol {x}}$ is

(2.3)\begin{equation} \delta \mathcal{S}[\delta {\boldsymbol{x}}] = \oint_{\mathcal{C}} \,{\rm d}\ell\, {\boldsymbol{x}}^\prime \times {{\boldsymbol{B}}} \boldsymbol{\cdot} \delta {\boldsymbol{x}}, \end{equation}

which shows that stationary curves are tangential to the magnetic field and hence field lines. We used that $\delta {\boldsymbol {A}} [\delta {\boldsymbol {x}}]=\delta {\boldsymbol {x}} \boldsymbol {\cdot }\boldsymbol {\nabla } {\boldsymbol {A}}$. From (2.3), a magnetic axis can be identified. We focus in particular on elliptical axes, but one can apply the following formalism to hyperbolic axes as well. The difference would be seen in the flux-surface functions around the axis, and in the nature of the Floquet exponents. Let us denote an elliptical axis by ${\mathcal {C}}_a$. Additional properties of the field appear at higher orders of variations of $\mathcal {S}$. For the rotational transform in particular, the second-order variation $\delta ^2 {\mathcal {S}}$ is of interest.

3 Verification of Mercier's formula for the Floquet exponent

3.1 The second variation as an operator

In order to express the on-axis rotational transform, the second-order variation of the action applied to the axis needs to be derived. Assume that an elliptical axis ${\mathcal {C}}_a$ has been found as a stationary curve of the action. The formalism is identical for hyperbolic axes, but in this paper, we focus on the elliptic case. The second variation of the action performed from the axis ${\mathcal {C}}_a$ is

(3.1)\begin{equation} \delta^2 \mathcal{S}[\delta {\boldsymbol{x}}] = \oint_{\mathcal{C}_a} \,{\rm d}\ell\,\delta({\boldsymbol{x}}^\prime \times {{\boldsymbol{B}}} \boldsymbol{\cdot} \delta{{\boldsymbol{x}}}) = \oint_{\mathcal{C}_a} \,{\rm d}\ell\,\delta {\boldsymbol{x}} \boldsymbol{\cdot} (\delta {\boldsymbol{x}}^\prime \times {\boldsymbol{B}} + {\boldsymbol{x}}' \times \delta {\boldsymbol{B}}), \end{equation}

where $\boldsymbol {f}':=\textrm {d}\boldsymbol {f}/\textrm {d}\ell$ for any $\boldsymbol {f}$. Using $\delta {\boldsymbol {B}} = \delta {\boldsymbol {x}} \boldsymbol {\cdot } \boldsymbol {\nabla } {\boldsymbol {B}}$ and the Einstein summation convention, we write the second variation as an operator:

(3.2)\begin{equation} \delta^2 \mathcal{S} = \oint_{{\mathcal{C}}_a} \,{\rm d}\ell\, \delta {\boldsymbol{x}}^i \frac{\delta^2 S}{\delta {\boldsymbol{x}}^i \delta {\boldsymbol{x}}^j} \delta {\boldsymbol{x}}^j , \end{equation}

where

(3.3)\begin{equation} \frac{\delta^2 \mathcal{S}}{\delta {\boldsymbol{x}}^i \delta {\boldsymbol{x}}^j} = \epsilon_{ijk}{\boldsymbol{B}}^k\frac{{\rm d}}{{\rm d}\ell} + \epsilon_{imk}{\boldsymbol{x}}'^m \partial_j {\boldsymbol{B}}^k, \end{equation}

and $i,j,k,m \in \{1,2,3\}$, which in matrix form reads

(3.4)\begin{equation} {\boldsymbol{\mathsf{M}}} := \frac{\delta^2 \mathcal{S}}{\delta {\boldsymbol{x}} \delta {\boldsymbol{x}}}={-}\left(\boldsymbol{\mathsf{I}}\times {\boldsymbol{B}} \right) \frac{{\rm d}}{{\rm d}\ell}+ {\boldsymbol{x}}' \times (\boldsymbol{\nabla} {\boldsymbol{B}})^\mathsf{T}. \end{equation}

The identity tensor is denoted by $\boldsymbol{\mathsf{I}}$. Note that the covariant tensor $\boldsymbol{\mathsf{M}}$ can be easily symmetrized (see Hudson & Dewar Reference Hudson and Dewar2009). However, the direction in which the derivative $\textrm {d}/\textrm {d}\ell$ is taken has to be chosen carefully. We continue with the non-symmetric form (3.4).

We show that the null eigenspace of $\boldsymbol{\mathsf{M}}$ is of particular interest. Indeed, by periodicity of the field lines considered to compute the action, the eigenspaces of the operator $\boldsymbol{\mathsf{M}}$ are related to geometric properties of the neighbouring field lines, including the rotational transform. Let ${\boldsymbol {v}}$ be a null eigenfunction of $\boldsymbol{\mathsf{M}}$ such that

(3.5)\begin{equation} \boldsymbol{\mathsf{M}}{\boldsymbol{v}} =\boldsymbol{0}. \end{equation}

For ${\boldsymbol {v}}$ to be non-trivial, (3.5) rewrites

(3.6)\begin{equation} \det\left( \boldsymbol{\mathsf{M}} \right) =0. \end{equation}

We will demonstrate that the condition (3.5) is satisfied by solutions $\boldsymbol {v}$, such that their Floquet exponent is the on-axis rotational transform, and (3.6) will serve in the discrete formalism introduced in § 4. To solve (3.5), the near-axis formalism developed by Mercier, Solov'ev and Shafranov (Solov'ev & Shafranov Reference Solov'ev and Shafranov1970; Mercier et al. Reference Mercier1974) is used. Near-axis formalism in the inverse coordinate approach, where the flux surface $\psi$ is used as a coordinate, has proved to be very successful in understanding quasi-symmetry (see Garren & Boozer Reference Garren and Boozer1991; Landreman & Sengupta Reference Landreman and Sengupta2018, Reference Landreman and Sengupta2019; Rodríguez, Sengupta & Bhattacharjee Reference Rodríguez, Sengupta and Bhattacharjee2023). However, for this work, the near-axis formalism in direct or Mercier–Solov'ev–Shafranov coordinates (Jorge, Sengupta & Landreman Reference Jorge, Sengupta and Landreman2020b,Reference Jorge, Sengupta and Landremana; Sengupta et al. Reference Sengupta, Rodriguez, Jorge, Bhattacharjee and Landreman2024) is more relevant.

3.2 Derivation of the Floquet exponent from the second variation

The operator is expanded in the Solov'ev–Shafranov coordinates, introduced in Appendix C.2. The latter set of coordinates is closely related to the Mercier coordinates, presented in Appendix C.1.

In what follows, it is assumed that the magnetic axis is a closed curve ${\mathcal {C}}_a\subset \mathbb {R}^3$ parametrized by the vector-valued function ${\boldsymbol {r}}_0$, with the arc length $\ell$ as parameter, and a total length $L$. The basis of expansion is taken to be $\{\boldsymbol {e}_1, \boldsymbol {e}_2, \boldsymbol {e}_3\}$, whose expressions in terms of $\boldsymbol {\mathcal {N}}$, $\boldsymbol {\mathcal {B}}$ and ${\boldsymbol {t}}$ are given in Appendix C.2, but the results will always be expressed in terms of the Solov'ev–Shafranov vectors $\{\boldsymbol {\mathcal {N}}, \boldsymbol {\mathcal {B}}, {\boldsymbol {t}}\}$. Additionally, the following notation is adopted: $f'=\textrm {d}f/\textrm {d}\ell$ for any $f$.

3.2.1 Magnetic field expansion near an elliptic magnetic axis

From (3.4), the magnetic field ${\boldsymbol {B}}$ needs to be described in the vicinity of the axis ${\boldsymbol {r}}_0$ for $\boldsymbol{\mathsf{M}}$ to be expanded in the near-axis formalism. The expansion of ${\boldsymbol {B}}$ in Solov'ev–Shafranov coordinates is carried in the limit $x,y\ll 1$, or in other words, is limited to linear terms only. The magnetic field can be written in contravariant form as

(3.7)\begin{equation} \sqrt{|g|}{\boldsymbol{B}}= \sqrt{|g|}B^1 \boldsymbol{e}_1+\sqrt{|g|}B^2 \boldsymbol{e}_2+\sqrt{|g|}B^3 \boldsymbol{e}_3 , \end{equation}

with

(3.8a-c)\begin{equation} \sqrt{|g|}B^1= a_1 x+a_2 y , \quad \sqrt{|g|}B^2= b_1 x+b_2 y, \quad \sqrt{|g|}B^3= B_0+c_1 x+c_2 y, \end{equation}

where $a_1,a_2,b_1,b_2, c_1,c_2$ are periodic functions of $\ell$, $B_0$ the zeroth-order magnitude of ${\boldsymbol {B}}$ and $\sqrt {|g|} = h$ as defined in Appendix C.2. Therefore, in the limit $x,y\ll 1$, $h=1$ and the magnetic field reads

(3.9)\begin{gather} {\boldsymbol{B}} = \left(B_0 +c_1 x+c_2 y\right){\boldsymbol{t}} + \left[a_1 x+(a_2 +u' B_0 )y\right]\boldsymbol{\mathcal{N}}\nonumber\\ +\left[(b_1-u' B_0) x+b_2 y\right]\boldsymbol{\mathcal{B}}+ \boldsymbol{\mathcal{O}}(x^2+y^2). \end{gather}

A direct calculation shows that

(3.10)\begin{equation} \left. \begin{aligned} \boldsymbol{\nabla} {\boldsymbol{B}} & =\kappa B_0 {\boldsymbol{t}} {\boldsymbol{n}} +a_1 \boldsymbol{\mathcal{N}}\boldsymbol{\mathcal{N}}+b_2 \boldsymbol{\mathcal{B}}\boldsymbol{\mathcal{B}}+B_0' {\boldsymbol{t}}{\boldsymbol{t}} + (a_2+u' B_0) \boldsymbol{\mathcal{B}}\boldsymbol{\mathcal{N}}\\ & \quad +(b_1-u' B_0)\boldsymbol{\mathcal{N}}\boldsymbol{\mathcal{B}} + (c_1 \boldsymbol{\mathcal{N}}+c_2 \boldsymbol{\mathcal{B}})\boldsymbol{t},\\ \boldsymbol{x'}\times (\boldsymbol{\nabla} \boldsymbol{B})^\mathsf{T} & =\kappa B_0 {\boldsymbol{b}} {\boldsymbol{t}} +\boldsymbol{\mathcal{B}} \left[a_1 \boldsymbol{\mathcal{N}}+(a_2 +u' B_0) \boldsymbol{\mathcal{B}} \right]-\boldsymbol{\mathcal{N}} \left[(b_1-u' B_0) \boldsymbol{\mathcal{N}}+b_2 \boldsymbol{\mathcal{B}} \right], \end{aligned} \right\} \end{equation}

which yields, using dyadic algebra

(3.11)\begin{equation} \left. \begin{gathered} \boldsymbol{\nabla}\boldsymbol{\cdot}{\boldsymbol{B}} = \boldsymbol{\mathsf{I}}\boldsymbol{:}\boldsymbol{\nabla} {\boldsymbol{B}} = B_0' + a_1+b_2,\\ \boldsymbol{\nabla}\times {\boldsymbol{B}} ={-}\boldsymbol{\mathsf{I}}\boldsymbol{\overset{\times}{.}}\boldsymbol{\nabla} {\boldsymbol{B}} = (b_1-a_2-2 u' B_0) {\boldsymbol{t}}+ (c_2 -\kappa B_0 \sin\delta )\boldsymbol{\mathcal{N}}\\ - (c_1-\kappa B_0 \cos \delta)\boldsymbol{\mathcal{B}}. \end{gathered} \right\} \end{equation}

Here $\kappa$ denotes the local curvature of the axis ${\boldsymbol {r}}_0$. Additionally, (C10a,b) has been used to rewrite ${\boldsymbol {b}}$ in terms of $\boldsymbol {\mathcal {N}}$ and $\boldsymbol {\mathcal {B}}$. Requiring that $\boldsymbol {\nabla } \boldsymbol {\cdot } {\boldsymbol {B}}=0$ and $\boldsymbol {\nabla } \times {\boldsymbol {B}}=J_0 {\boldsymbol {t}}$, one finds

(3.12a-d)\begin{equation} a_1+b_2={-}B_0', \quad \frac{b_1-a_2}{2B_0}=u'+\frac{J_0}{2B_0}, \quad c_1 =\kappa B_0 \cos \delta, \quad c_2 =\kappa B_0 \sin \delta. \end{equation}

So far, only two constraints have been derived for the four expansion functions $a_1,a_2,b_1,b_2$. Turning to the Mercier representation (D2a,b) and (D7a–c) of ${\boldsymbol {B}}$ brings in additional conditions. Together with the definitions of $\boldsymbol {\mathcal {N}}$ and $\boldsymbol {\mathcal {B}}$,

(3.13)\begin{align} \frac{{\boldsymbol{B}}}{B_0} & =\quad {\boldsymbol{t}}\left[1+\kappa \sqrt{x^2+y^2} \cos{(u-\delta)}\right] \nonumber\\ & \quad+ \boldsymbol{\mathcal{N}} \left[ -\frac{1}{2}\left(\frac{B_0'}{B_0}+\eta'\right)x + \left(\frac{-J_0}{2 B_0}+ \frac{\dfrac{J_0}{2B_0}- \tau +\delta'}{ \cosh{\eta}} \sinh{\eta}\right) y\right]\nonumber\\ & \quad+ \boldsymbol{\mathcal{B}} \left[ \left(\frac{J_0}{2 B_0}+ \frac{\dfrac{J_0}{2B_0}- \tau +\delta'}{ \cosh{\eta}} \sinh{\eta}\right) x-\frac{1}{2}\left(\frac{B_0'}{B_0}-\eta'\right)y \right]\nonumber\\ & \quad + \boldsymbol{\mathcal{O}}(x^2+y^2), \end{align}

where $\delta$, as explained in Appendix C, describes the rotation of elliptical flux surfaces around the expansion axis ${\boldsymbol {r}}_0$. As for $\eta$, it describes the eccentricity of the flux surfaces around ${\boldsymbol {r}}_0$. Defining

(3.14)\begin{equation} \varOmega_0(\ell)=\frac{\dfrac{J_0}{2B_0}- \tau +\delta'}{ \cosh{\eta}}, \end{equation}

the comparison between the Mercier representation (3.13) and the Solov'ev–Shafranov representation (3.9) yields for the expansion coefficients

(3.15)\begin{equation} \left. \begin{aligned} \frac{a_1}{B_0} & ={-}\frac{1}{2}\left(\frac{B_0'}{B_0}+\eta'\right), \quad \frac{a_2}{B_0}={-}\left(u'+\frac{J_0/2}{ B_0}\right)+\varOmega_0\sinh{\eta},\\ \frac{b_2}{B_0} & ={-}\frac{1}{2}\left(\frac{B_0'}{B_0}-\eta'\right), \quad \frac{b_1}{B_0}=\left(u'+\frac{J_0/2}{ B_0}\right)+\varOmega_0\sinh{\eta}, \end{aligned} \right\} \end{equation}

which satisfy the relations (3.12a–d). From the definition of $\varOmega _0$, we can further simplify $a_2$ and $b_1$ as

(3.16a,b)\begin{equation} \frac{a_2}{B_0}={-}\varOmega_0 {\rm e}^{-\eta}, \quad \frac{b_1}{B_0}=\varOmega_0 {\rm e}^{\eta}. \end{equation}

As a check of correctness for the above expressions, one can show that ${\boldsymbol {B}}\boldsymbol {\cdot } \boldsymbol {\nabla } \psi =0$, where

(3.17)\begin{align} \sqrt{g} {\boldsymbol{B}}\boldsymbol{\cdot} \boldsymbol{\nabla} & = \sqrt{g}B^1\partial_x +\sqrt{g}B^2\partial_y+\sqrt{g}B^3\partial_\ell\nonumber\\ & = B_0\partial_\ell + (a_1 x + a_2 y)\partial_x + (b_1 x + b_2 y)\partial_y. \end{align}

The flux surface function $\psi$ for the elliptic case is given in the Mercier representation by (D10) or equivalently in the Solov'ev–Shafranov representation by

(3.18)\begin{equation} \psi(x,y)=B_0({\rm e}^{\eta}x^2 + {\rm e}^{-\eta}y^2). \end{equation}

Note that in the case of a hyperbolic axis, the plus sign from the flux function (3.18) would be replaced by a minus sign.

3.2.2 The second variation tensor and its null eigenspace

Now that the magnetic field has been properly expressed and expanded in Solov'ev–Shafranov coordinates, the obtained ${\boldsymbol {B}}$ and $\boldsymbol {\nabla } {\boldsymbol {B}}$ can be substituted in (3.4) to expand the second variation tensor. From (3.10), we find that it is given by

(3.19)\begin{equation} \left. \begin{aligned} \boldsymbol{\mathsf{M}} & := \boldsymbol{\mathsf{M}}_{{1}} \frac{{\rm d}}{{\rm d}\ell}+\boldsymbol{\mathsf{M}}_{{2}},\\ \boldsymbol{\mathsf{M}}_{{1}} & = (\boldsymbol{\mathcal{N}} \boldsymbol{\mathcal{B}}-\boldsymbol{\mathcal{B}} \boldsymbol{\mathcal{N}})B_0,\\ \boldsymbol{\mathsf{M}}_{{2}} & = \kappa B_0 \left[\boldsymbol{\mathcal{B}}\cos{\delta} - \boldsymbol{\mathcal{N}}\sin{\delta} \right]{\boldsymbol{t}} +\boldsymbol{\mathcal{B}} \left[a_1 \boldsymbol{\mathcal{N}}+(a_2+u' B_0) \boldsymbol{\mathcal{B}} \right]\\ & \quad -\boldsymbol{\mathcal{N}} \left[(b_1-u' B_0) \boldsymbol{\mathcal{N}}+b_2 \boldsymbol{\mathcal{B}} \right]. \end{aligned} \right\} \end{equation}

Observing the dyadic form of $\boldsymbol{\mathsf{M}}$, it can be shown that tangential components $v^t$ of the eigenvectors ${\boldsymbol {v}}=(v^t, v^{{\mathcal {N}}}, v^{{\mathcal {B}}})^{\mathsf {T}}$ will not contribute to $\boldsymbol{\mathsf{M}}{\boldsymbol {v}} =\boldsymbol {0}$. Alternatively, the underlying argument is that the arc length parametrization of ${\mathcal {C}}$ constrains the admissible variations $\delta {\boldsymbol {x}} = {\boldsymbol {v}}$, removing the dependence in the tangential component. Therefore, $v^t$ can be absorbed by redefining $v^{{\mathcal {N}}}$ and $v^{{\mathcal {B}}}$. Thus, the null eigenvectors are chosen to be of the form ${\boldsymbol {v}}= v^{{\mathcal {N}}} \boldsymbol {\mathcal {N}}+ v^{{\mathcal {B}}} \boldsymbol {\mathcal {B}}$. We note that

(3.20)\begin{equation} \frac{{\rm d}}{{\rm d}\ell}{\boldsymbol{v}} = \left(v^{{\mathcal{N}}\prime}+u' v^{{\mathcal{B}}}\right)\boldsymbol{\mathcal{N}} + \left(v^{{\mathcal{B}}\prime}-u' v^{{\mathcal{N}}}\right)\boldsymbol{\mathcal{B}} +\kappa\left[v^{{\mathcal{N}}} \cos{(u-\delta)} + v^{{\mathcal{B}}} \sin{(u-\delta)}\right]{\boldsymbol{t}}. \end{equation}

The tangential terms $\propto {\boldsymbol {t}}$ are not relevant since they do not contribute to $\boldsymbol{\mathsf{M}} \boldsymbol {v} =\boldsymbol {0}$. The equations for $v^{{\mathcal {N}}},v^{{\mathcal {B}}}$ then read

(3.21)\begin{equation} B_0 \begin{pmatrix} 0 & +1 \\-1 & 0 \end{pmatrix} \dfrac{{\rm d}}{{\rm d} \ell} \begin{pmatrix} v^{{\mathcal{N}}}\\ v^{{\mathcal{B}}} \end{pmatrix} + \begin{pmatrix} -b_1 & -b_2\\ a_1 & a_2 \end{pmatrix} \begin{pmatrix} v^{{\mathcal{N}}}\\ v^{{\mathcal{B}}} \end{pmatrix} =0. \end{equation}

From (3.21), by periodicity of $a_i/B_0$ and $b_i/B_0$, $i=1,2$, the system can immediately be rewritten in the form of a periodic system:

(3.22a,b)\begin{equation} \frac{{\rm d}{\boldsymbol{v}}}{{\rm d}\ell}=\boldsymbol{\mathsf{A}}(\ell) {\boldsymbol{v}}, \quad {\boldsymbol{v}} = \left(\begin{array}{c}v^{{\mathcal{N}}} \\v^{{\mathcal{B}}}\end{array}\right). \end{equation}

Here, $\boldsymbol{\mathsf{A}}(\ell )$ is periodic in $\ell$. The Floquet theorem, (A6), can be applied to conclude that the solution must be of the form

(3.23)\begin{equation} {\boldsymbol{v}} = \boldsymbol{\mathsf{U}}(\ell){\rm e}^{\boldsymbol{\mathsf{C}} \ell/L}, \end{equation}

where $\boldsymbol{\mathsf{U}}$ is a symplectic periodic matrix (with period $L$) and $\boldsymbol{\mathsf{C}}$ is a constant Hamiltonian matrix (Duignan & Meiss Reference Duignan and Meiss2021). The eigenvalues of $\boldsymbol{\mathsf{C}}$ are the Floquet exponents, which must be purely imaginary near an elliptic axis and real in the hyperbolic case.

Although the system is already in a form that allows one to solve for the exponents $\nu$ by means of (3.23), identifying the matrices $\boldsymbol{\mathsf{C}}$ and $\boldsymbol{\mathsf{U}}$ can be somewhat troublesome. For that reason, (3.21) can be alternatively rewritten in the following ordinary differential equation (ODE) form:

(3.24)\begin{equation} \frac{{\rm d}v^{{\mathcal{N}}}}{a_1 v^{{\mathcal{N}}} + a_2 v^{{\mathcal{B}}}}= \frac{{\rm d}v^{{\mathcal{B}}}}{b_1 v^{{\mathcal{N}}} + b_2 v^{{\mathcal{B}}}} = \frac{{\rm d} \ell}{B_0}, \end{equation}

which shows that the eigenvector ${\boldsymbol {v}}$ satisfies the general characteristic equation along the magnetic field:

(3.25)\begin{equation} \frac{{\rm d}{\mathcal{X}}}{\sqrt{g}B^1({\mathcal{X}},{\mathcal{Y}},\ell)} = \frac{{\rm d}{\mathcal{Y}}}{\sqrt{g}B^2({\mathcal{X}},{\mathcal{Y}},\ell)}= \frac{{\rm d}\ell}{\sqrt{g}B^3({\mathcal{X}},{\mathcal{Y}},\ell)}. \end{equation}

The eigenvector ${\boldsymbol {v}}$ components $v^{{\mathcal {N}}},v^{{\mathcal {B}}}$ can be identified with ${\mathcal {X}},{\mathcal {Y}}$, the displacements of the magnetic field line from the closed field line ${\boldsymbol {r}}_0$ along the rotated normal and binormal directions. The expansion coordinates $(x,y)$ ought not to be confused with $({\mathcal {X}},{\mathcal {Y}})$, which are the solutions of the characteristic ODEs (3.25). The equations for $({\mathcal {X}},{\mathcal {Y}})$ are

(3.26)\begin{equation} \left. \begin{aligned} & {\mathcal{X}}'+\frac{1}{2}\left(\frac{B_0'}{B_0}+\eta'\right){\mathcal{X}} + \varOmega_0 {\rm e}^{-\eta}{\mathcal{Y}}=0,\\ & {\mathcal{Y}}'+\frac{1}{2}\left(\frac{B_0'}{B_0}-\eta'\right){\mathcal{Y}} - \varOmega_0 {\rm e}^{+\eta}{\mathcal{X}}=0. \end{aligned} \right\} \end{equation}

Introducing the variables $X,Y$ defined through

(3.27a,b)\begin{equation} {\mathcal{X}}= \frac{1}{\sqrt{B_0}}{\rm e}^{-\eta/2}X(\ell), \quad {\mathcal{Y}}= \frac{1}{\sqrt{B_0}}{\rm e}^{+\eta/2}Y(\ell), \end{equation}

which is possible since the periodicity is conserved, the system reduces to the simple harmonic oscillator:

(3.28a-c)\begin{equation} X' + \varOmega_0 Y =0, \quad Y'-\varOmega_0 X =0, \quad \varOmega_0 = \frac{\dfrac{J_0/2}{B_0}- \tau +\delta'}{ \cosh{\eta}}, \end{equation}

with ‘time-dependent’ frequency $\varOmega _0(\ell )$, $\ell$ being the time-like parameter. Using the complex variable $Z=X+\textrm {i}\, Y$, the system reshapes as a single complex ODE:

(3.29)\begin{equation} Z'-{\rm i}\, \varOmega_0 Z =0 \quad \Rightarrow \quad Z(\ell)= Z_0\exp{\int_0^\ell {\rm i}\,\varOmega_0(s) \,{\rm d}s}. \end{equation}

Separating the periodic and non-periodic parts of the exponential, we obtain

(3.30)\begin{equation} \left. \begin{gathered} Z(\ell) = Z_p(\ell) {\rm e}^{{\rm i}\, \nu \ell/L}, \quad Z_p(\ell)= Z_0 \exp{\int_0^\ell {\rm i}\,\tilde{\varOmega}_0(s) \,{\rm d}s},\\ \bar{\varOmega} := \frac{1}{L}\int_0^L \varOmega_0(s) \,{\rm d}s, \quad \tilde{\varOmega}_0= \varOmega_0 - \bar{\varOmega}. \end{gathered} \right\} \end{equation}

Comparing (3.30) with the Floquet form (3.23), we find that $\nu$ is given by

(3.31)\begin{equation} \nu = \int_0^L \varOmega_0(s) \,{\rm d}s \pmod{2{\rm \pi}} = \oint \frac{\dfrac{J_0(s)/2}{B_0(s)}- \tau(s) +\delta'(s)}{ \cosh{\eta(s)}} \,{\rm d}s \pmod{2{\rm \pi}} = 2{\rm \pi} \iota.\end{equation}

This matches the expression for the rotational transform (1.3) up to a factor $2{\rm \pi}$, as $\nu$ represents an angle and $\iota$ the number of turns that this angle constitutes. It follows that solving for the null eigenspace of the second variation tensor expanded in the Solov'ev–Shafranov near-axis formalism yields the correct on-axis rotational transform. The exact same approach can be followed in the hyperbolic case.

3.2.3 Derivation of the Floquet exponent from the Hill's determinant equation

Here we pursue an alternative approach to obtain the Floquet exponent, using the theory of the Hill's infinite determinant. This way of solving for the Floquet exponents of a periodic system has been known for a long time (Magnus Reference Magnus1953); however, to the authors’ knowledge, it is the first time that such an approach is used from the magnetic action. We start with the system (3.26) written as

(3.32a,b)\begin{equation} {\mathcal{X}}'=\frac{1}{B_0}(a_1 {\mathcal{X}} +a_2 {\mathcal{Y}}), \quad {\mathcal{Y}}'=\frac{1}{B_0}(b_1 {\mathcal{X}} +b_2 {\mathcal{Y}}). \end{equation}

Eliminating ${\mathcal {Y}}$ from (3.32a,b), we obtain the following second-order ODE for ${\mathcal {X}}$:

(3.33)\begin{equation} {\mathcal{X}}'' +2 C_1 {\mathcal{X}}' + C_2{\mathcal{X}} =0, \end{equation}

where

(3.34a-c)\begin{equation} 2C_1 ={-}\frac{a_2'}{a_2}+2\frac{B_0'}{B_0}, \quad C_2 = {\mathcal{D}} -\frac{a_2}{B_0}\left(\frac{a_1}{a_2}\right)', \quad {\mathcal{D}} = \frac{1}{B_0^2}(a_1 b_2 -a_2 b_1). \end{equation}

We can eliminate the first derivative term from (3.33) by the change of variables

(3.35)\begin{equation} {\mathcal{X}} = \exp{\left(-\int C_1 \,{\rm d}\ell\right)}\varPsi = \frac{\sqrt{a_2}}{B_0}X, \end{equation}

which leads to

(3.36a,b)\begin{equation} \varPsi'' + \omega^2 \varPsi=0, \quad \omega^2 \equiv C_2 -C_1'-C_1^2. \end{equation}

Equation (3.36a,b) is in the form of Hill's equation or a Schrödinger equation with a periodic potential $\omega ^2$. We note that the linear transformation (3.35) implies that both ${\mathcal {X}}$ and $\varPsi$ have the same Floquet exponent. This is because the multiplication factor $\sqrt {a_2}/B_0$ is periodic in nature and therefore does not change the Floquet exponent.

Leveraging the periodicity of $\omega ^2$, $\omega ^2(\ell +L)=\omega ^2(\ell )$, we can Fourier expand

(3.37)\begin{equation} \omega^2(\ell)=\sum_{k\in \mathbb{Z}}\varOmega_k\,{\rm e}^{{\rm i}\,\ell({2{\rm \pi}}/{L})k}, \end{equation}

where $\{\varOmega _k\}_{k\in \mathbb {Z}}$ denote the Fourier coefficients of $\omega ^2$. Let the fundamental solutions of (3.36a,b) be given by $\varPsi _{\pm }(\ell )$ such that

(3.38a-d)\begin{equation} \varPsi_+(0)=1, \quad \varPsi_{-}(0)=0, \quad \varPsi'_+(0)=0, \quad \varPsi'_{-}(0)=1, \end{equation}

where these conditions have been chosen so that the basis functions $\varPsi _{\pm }$ are linearly independent and with unit Wronskian. The Floquet solutions are given by

(3.39)\begin{equation} \left. \begin{aligned} \varPsi_+(\ell) & = {\rm e}^{+{\rm i}\, \nu \ell}\sigma_+(\ell), \quad \sigma_+(\ell+L)=\sigma_+(\ell), \quad \sigma_+(0)=1,\\ \varPsi_{-}(\ell) & = {\rm e}^{-{\rm i}\, \nu \ell}\sigma_{-}(\ell), \quad \sigma_{-}(\ell+L)=\sigma_{-}(\ell), \quad \sigma_-(0)=0, \end{aligned} \right\} \end{equation}

where $\sigma _+$ and $\sigma _{-}$ satisfy the periodicity condition and the linear independence, and $\nu$ denotes the Floquet exponents for which we seek. Nothing more needs to be known about the functions $\sigma _{\pm }$ since their coefficients do not appear in the final expression for the Floquet exponents. Let us now Fourier expand the Floquet solution as

(3.40)\begin{equation} \varPsi_+{=} {\rm e}^{{\rm i}\, \nu \ell}\sum_{n\in \mathbb{Z}}b_n\, {\rm e}^{{\rm i}\,({2{\rm \pi} n }/{L})\ell} = \sum_{n\in \mathbb{Z}} b_n \exp\left({{\rm i}\, \ell \left(\frac{2{\rm \pi} n }{L}+ \nu\right)}\right), \end{equation}

where $\{b_n\}_{n\in \mathbb {Z}}$ is the set of Fourier coefficients of $\sigma _+$. The Fourier expansion of (3.36a,b) reads

(3.41)\begin{align} & \varPsi_+^{''}(\ell)+\omega^2(\ell)\varPsi_+(\ell)={-}\sum_{n\in \mathbb{Z}}b_n \left( \nu + \frac{2{\rm \pi} n}{L}\right)^2 \exp\left({{\rm i}\, \ell \left(\nu + \frac{2n{\rm \pi}}{L}\right)}\right)\nonumber\\ & \qquad + \left(\sum_{k\in \mathbb{Z}}\varOmega_k\,{\rm e}^{{\rm i}\,\ell({2{\rm \pi}}/{L})k} \right)\left(\sum_{n\in \mathbb{Z}} b_n \exp\left({{\rm i}\, \ell \left(\frac{2{\rm \pi} n }{L}+ \nu\right)}\right)\right)\nonumber\\ & \quad =\sum_{n\in \mathbb{Z}}\left(\sum_{k}\varOmega_kb_{n-k}-\left(\frac{2{\rm \pi} n }{L}+ \nu\right)^2b_n\right) \exp\left({{\rm i}\,\ell\left(\frac{2{\rm \pi} n }{L}+ \nu\right)}\right)\nonumber\\ & \quad =0. \end{align}

In order for (3.41) to be verified, the following condition has to hold:

(3.42)\begin{equation} \sum_{k\in \mathbb{Z}}\varOmega_kb_{n-k}-\left(\frac{2{\rm \pi} n }{L}+ \nu\right)^2b_n=0,\quad n \in \mathbb{Z}, \end{equation}

which can be rewritten in the following matrix form, after dividing (3.42) by $\varOmega _0-(2{\rm \pi} n/L+\nu )^2$:

(3.43)\begin{equation} \sum_{m\in \mathbb{Z}}B_{nm}b_m=0, \quad n \in \mathbb{Z}, \end{equation}

where the matrix $B$ is defined as

(3.44)\begin{equation} B_{nn}=1, \quad B_{nm}=\frac{\varOmega_{n-m}}{\varOmega_0-\left(\dfrac{2{\rm \pi} n}{L}+\nu\right)^2}. \end{equation}

This yields the following determinant equation:

(3.45)\begin{equation} \det{B_{nm}} = 0. \end{equation}

The above determinant can be considered a function of the Floquet exponents $\nu$:

(3.46)\begin{equation} \det{B_{nm}(\nu)}:=\Delta(\nu)\equiv \det{\left(\delta_{nm} + \frac{\varOmega_{n-m}}{\varOmega_0-\left(\dfrac{2{\rm \pi} n}{L}+\nu\right)^2}\right)}, \quad n,m \in \mathbb{Z}. \end{equation}

For (3.45) to be verified, we seek for the Floquet exponents $\nu$ such that $\Delta (\nu )=0$. Following Wang, Guo & Xia (Reference Wang, Guo and Xia1989), we now show that the exponents can be deduced from the very simple expression (3.50). One notes that $\Delta (\nu )$ is a $2{\rm \pi}$-periodic function, with poles in $\nu = \pm \sqrt {\varOmega _0} - 2{\rm \pi} n/L$, $n \in \mathbb {Z}$. It can be shown that the determinant is absolutely convergent in the whole $\nu$-plane, except in these poles. Therefore, $\Delta$ is meromorphic. Moreover, as the imaginary part $\mathrm {Im}{(\nu )}\rightarrow \pm \infty$, $\Delta (\nu )\rightarrow 1$. Let us now define the complex function $f$ as

(3.47)\begin{equation} f(\nu):= \cot{\frac{L}{2}(\nu - \sqrt{\varOmega_0})} - \cot{\frac{L}{2}(\nu + \sqrt{\varOmega_0})}. \end{equation}

It is useful to introduce $f$ since it has the same poles and periodicity as $\Delta$. Moreover, it is bounded as $\mathrm {Im}{(\nu )}\rightarrow \pm \infty$. This way, there has to exist a constant $K\in \mathbb {C}$ such that

(3.48)\begin{equation} D(\nu) \equiv \Delta(\nu) + Kf(\nu) \end{equation}

has no singularity in the whole $\nu$ plane. Together with the fact that it is bounded as $|\nu |\rightarrow \infty$, according to Liouville's theorem, $D$ is a constant function. In the limit $|\nu |\rightarrow \infty$, we see that $D=1$. To determine $K$, take $\nu =0$:

(3.49)\begin{equation} \nu =0 \implies K = \frac{1-\Delta(0)}{f(0)} = \frac{1-\Delta(0)}{2\cot{\dfrac{\sqrt{\varOmega_0}L}{2}}}.\end{equation}

Using the value of $K$ from (3.49), the Floquet exponent equation reduces to

(3.50)\begin{equation} \sin^2{\nu\frac{L}{2}} = \Delta(0)\sin^2{\frac{\sqrt{\varOmega_0}L}{2}}. \end{equation}

Although (3.50) is very simple, one obstacle remains to compute the exponents: one has to evaluate the infinite determinant, $\Delta (0)$. We refer to Wang et al. (Reference Wang, Guo and Xia1989) for some approximations of $\Delta (0)$. For instance, when $\varOmega _n$ are sufficiently small, $\Delta (0)$ can be approximated by the order-3 determinant with $B_{00}$ as central element (we recall that the determinant involves summation over all $\mathbb {Z}$), providing

(3.51)\begin{equation} \Delta(0)\simeq 1+\frac{2 \varOmega_1^2}{\varOmega_0\left( 4 - \varOmega_0\right)^2}+\frac{2 \varOmega_1^2 \varOmega_2}{\varOmega_0\left( 4 - \varOmega_0\right)^2}-\frac{\varOmega_2^2}{\left( 4 - \varOmega_0\right)^2}. \end{equation}

However, this approximation breaks down when the coefficients become too large as $|n|$ increases. As of the exponents computed, they might not all be suitable for $\iota$. One has to discard the results that are not relevant. Moreover, we emphasize that the exponents may be shifted by $2k {\rm \pi}/L$, with $k$ an integer, without changing the mathematics of the system, by periodicity. However, one has to carefully choose the adapted value for $\iota$, by setting the appropriate phase shift – see (1.3).

In a nutshell, the second variation of the magnetic action, at an elliptical axis, has been expanded in the Solov'ev–Shafranov near-axis formalism. It has been shown that the null eigenspace of this operator yields the correct on-axis rotational transform, upon applying Floquet theory to solve for the latter. On the other hand, solving for the null eigenspace has led to a system that could be rewritten in the form of a Hill equation. The Floquet exponents appear as solutions of an infinite determinant equation, that can only be approximated analytically. The derivation of the on-axis rotational transform as a Floquet exponent of the null eigenvectors of $\delta ^2{\mathcal {S}}$ has also been carried on in Mercier coordinates, to support the previous conclusions – see Appendix D.

Magnetic confinement device design needs fast and accurate computation of $\iota$. Currently, the most widely used method to compute $\iota$ is field-line tracing (Todoroki Reference Todoroki2003). Field-line tracing methods compute $\iota$ as an infinite time limit of following a field line. In practice, it can be done by following a field line sufficiently long to achieve convergence. As an alternative, one can also compute Greene's residue of the axis, which is related to $\iota$ (see Greene Reference Greene1979; Hanson & Cary Reference Hanson and Cary1984; Hudson Reference Hudson2004). Even though it still involves ODE integration, we can limit ourselves to just one circuit around the torus.

In the following section, we introduce a new method to compute $\iota$ as a solution to a discrete problem involving the magnetic action. The action is discretized in a way similar to that in Mackay & Meiss (Reference Mackay and Meiss1983) and Hudson & Suzuki (Reference Hudson and Suzuki2014).

4 Discrete formalism: piecewise action

We have seen that the problem described in the previous section, involving the second-order variation of the magnetic action to determine the on-axis Floquet exponent, leads to the operator equation (3.5). In another approach, the magnetic axis can be discretized, and the rotational transform can be determined from the multipliers of the latter curve. The multipliers have been shown to be linked to the residue of the curve (see Greene Reference Greene1979; Mackay & Meiss Reference Mackay and Meiss1983). This discrete approach is explored in what follows. We consider

(4.1)\begin{align} \mathcal{S} & = \sum_{i=1}^{n-1} \int_{\mathcal{C}_i} {\boldsymbol{A}} \boldsymbol{\cdot} {\rm d}\boldsymbol{\ell}\nonumber\\ & = \sum_{i=1}^{n-1} S({\boldsymbol{x}}_{i},{\boldsymbol{x}}_{i+1}), \end{align}

meaning that the curve has been discretized with $n\in \mathbb {N}$ points. So far, the way that ${\mathcal {C}}_i$ are defined is not important. Only the endpoints matter. The particular case where ${\mathcal {C}}_i$ are segments is given in Appendix E. The following notation is adopted for the derivatives:

(4.2)\begin{equation} \left. \begin{aligned} \boldsymbol{S}_1^{[i,i+1]} & := \boldsymbol{\nabla}_{{\boldsymbol{x}}_i} S^{[i,i+1]}=\boldsymbol{\nabla}_{{\boldsymbol{x}}_i} S({\boldsymbol{x}}_i, {\boldsymbol{x}}_{i+1}),\\ \boldsymbol{S}_2^{[i,i+1]} & := \boldsymbol{\nabla}_{{\boldsymbol{x}}_{i+1}} S^{[i,i+1]}=\boldsymbol{\nabla}_{{\boldsymbol{x}}_{i+1}} S({\boldsymbol{x}}_i, {\boldsymbol{x}}_{i+1}), \end{aligned} \right\} \end{equation}

as well as for the second-order derivatives:

(4.3)\begin{equation} \boldsymbol{\mathsf{S}}_{21}^{[i,i+1]} := \boldsymbol{\nabla}_{{\boldsymbol{x}}_{i+1}}\boldsymbol{S}_1^{[i,i+1]}, \end{equation}

and similarly for $\boldsymbol{\mathsf{S}}_{12}$, $\boldsymbol{\mathsf{S}}_{11}$ and $\boldsymbol{\mathsf{S}}_{22}$. Let us also define generalized periodic orbits of type $(q)$, as orbits with

(4.4)\begin{equation} {\boldsymbol{x}}_{i+q} = {\boldsymbol{x}}_i, \end{equation}

for some $q\in \mathbb {N}$. Therefore, the magnetic axis as discretized above is a general periodic orbit of type $(n)$. The terminology ‘orbit’ for field lines is justified by the Hamiltonian behaviour of the magnetic field. Moreover, the discretized action (4.1) satisfies the periodicity condition

(4.5)\begin{equation} \sum_{i=1}^{n-1} S({\boldsymbol{x}}_{i+n},{\boldsymbol{x}}_{i+n+1}) = \sum_{i=1}^{n-1} S({\boldsymbol{x}}_{i},{\boldsymbol{x}}_{i+1}) + C, \end{equation}

where the constant $C=0$. In the one-dimensional case, as dealt with in Mackay & Meiss (Reference Mackay and Meiss1983), the existence of extrema of the action is ensured by an additional convexity condition on the Lagrangian, $-L_{12}>0$. In our three-dimensional case, this can be generalized in that the second-order derivative tensors $\boldsymbol{\mathsf{S}}_{12}^{[i,i+1]}$ ought to be negative definite for all $i$:

(4.6)\begin{equation} {\boldsymbol{x}}^{\mathsf{T}}{\boldsymbol{\mathsf{S}}_{12}}^{[i,i+1]}{\boldsymbol{x}} <0, \quad {\boldsymbol{x}} \in \mathbb{R}^{*3}, \quad 1\leq i \leq n.\end{equation}

The fact that $C=0$ together with the condition that $\boldsymbol{\mathsf{S}}_{12}^{[i,i+1]}$ guarantees that the action of periodic orbits of type $(n)$ is bounded below and the Poincaré–Birkhoff theorem (Birkhoff Reference Birkhoff1913) ensures the existence of at least two stationary trajectories among the space of all generalized periodic paths of type $(n)$, one that minimizes the action and is elliptic, and one that is a saddle called minimax and is usually hyperbolic but can be alternating hyperbolic, where hyperbolic and elliptic describe the behaviour of nearby trajectories.

For an $(n)$-periodic curve that extremizes the action, the latter has to be stationary with respect to an arbitrary variation in its geometry $\delta {\boldsymbol {x}}_i$, and the stationarity condition can be expressed in terms of the previously defined derivatives:

(4.7)\begin{equation} \left. \begin{gathered} \delta S [\delta {\boldsymbol{x}}_i] = \left[\boldsymbol{\nabla}_{{\boldsymbol{x}}_i} S^{[i-1,i]}+ \boldsymbol{\nabla}_{{\boldsymbol{x}}_i} S^{[i,i+1]} \right]\cdot \delta {\boldsymbol{x}}_i=0,\\ \Leftrightarrow\quad \boldsymbol{S}_2^{[i-1,i]} + \boldsymbol{S}_1^{[i,i+1]} = \boldsymbol{0}. \end{gathered} \right\} \end{equation}

Note that (4.7) holds for any $1\leq i \leq n$, and the stationarity condition is expressed for each point in terms of the two nearest neighbours. This way, a magnetic axis can be identified. Similarly to the continuous case dealt with in § 3, the neighbouring field lines have to satisfy

(4.8)\begin{equation} \boldsymbol{\nabla}_{{\boldsymbol{x}}_{i-1}} \boldsymbol{S}_2^{[i-1,i]} \delta {\boldsymbol{x}}_{i-1} + \boldsymbol{\nabla}_{{\boldsymbol{x}}_{i+1}}\boldsymbol{S}_1^{[i,i+1]} \delta {\boldsymbol{x}}_{i+1} + \boldsymbol{\nabla}_{{\boldsymbol{x}}_{i}}\left(\boldsymbol{S}_2^{[i-1,i]} + \boldsymbol{S}_1^{[i,i+1]}\right) \delta {\boldsymbol{x}}_{i} = \boldsymbol{0}, \end{equation}

which we rewrite in terms of the second-order derivatives of the action as

(4.9)\begin{equation} \boldsymbol{\mathsf{S}}_{12}^{[i-1,i]} \delta {\boldsymbol{x}}_{i-1} + \boldsymbol{\mathsf{S}}_{21}^{[i,i+1]} \delta {\boldsymbol{x}}_{i+1} + \left( \boldsymbol{\mathsf{S}}_{22}^{[i-1,i]} + \boldsymbol{\mathsf{S}}_{11}^{[i,i+1]}\right) \delta {\boldsymbol{x}}_{i} = \boldsymbol{0}, \quad 1\leq i\leq n. \end{equation}

From Mackay & Meiss (Reference Mackay and Meiss1983), we know that the multipliers $\lambda$ of an ($n$)-periodic orbit are defined by the existence of a tangent orbit satisfying

(4.10)\begin{equation} \delta {\boldsymbol{x}}_{i+n} = \lambda \delta {\boldsymbol{x}}_i, \end{equation}

so the following holds:

(4.11a,b)\begin{equation} \delta {\boldsymbol{x}}_{n+1} = \lambda \delta {\boldsymbol{x}}_{1}, \quad \delta {\boldsymbol{x}}_0 = \lambda^{{-}1}\delta {\boldsymbol{x}}_n. \end{equation}

They can be written in their Floquet form (Greene Reference Greene1979):

(4.12)\begin{equation} \lambda = {\rm e}^{{\rm i}\,\nu}, \end{equation}

where the Floquet exponent $\nu$ describes the average rotation angle per period of the ($n$)-orbit, therefore, the rotational transform $\iota$. Since (4.9) is valid for any $1\leq i\leq n$, together with (4.11a,b), it can be rewritten in tensor form:

(4.13)\begin{equation} \left(\begin{array}{cccc}\left( \boldsymbol{\mathsf{S}}_{22}^{[01]} + \boldsymbol{\mathsf{S}}_{11}^{[12]} \right) & \boldsymbol{\mathsf{S}}_{12}^{[12]} & & \lambda^{{-}1}\boldsymbol{\mathsf{S}}_{21}^{[01]}\\ \boldsymbol{\mathsf{S}}_{21}^{[12]} & \boldsymbol{\mathsf{S}}_{12}^{[23]} & \ddots & \\ & \ddots & \ddots & \boldsymbol{\mathsf{S}}_{12}^{[n-1,n]} \\ \lambda\boldsymbol{\mathsf{S}}_{12}^{[n,n+1]} & & \boldsymbol{\mathsf{S}}_{21}^{[n-1,n]} & \left( \boldsymbol{\mathsf{S}}_{22}^{[n-1,n]} + \boldsymbol{\mathsf{S}}_{11}^{[n,n+1]} \right)\end{array}\right) \left(\begin{array}{c} \delta {\boldsymbol{x}}_{1}\\ \vdots \\ \\ \vdots \\ \delta {\boldsymbol{x}}_{n} \end{array}\right) = \boldsymbol{0}, \end{equation}

where the block-tridiagonal form with corners arises naturally. Denoting by $\boldsymbol{\mathsf{M}}$ the matrix of second derivatives, we get an equation for the multipliers:

(4.14)\begin{equation} \boldsymbol{\mathsf{M}}(\lambda)\delta {\boldsymbol{x}} = \boldsymbol{0}. \end{equation}

The blank spaces in $\boldsymbol{\mathsf{M}}$ are blocks of $0$. Note that each block in $\boldsymbol{\mathsf{M}}$ is of size $3\times 3$ since $\boldsymbol{\mathsf{S}}_{lm}$ contain the second derivatives of a line integral embedded in three-dimensional space. Thus, for an $(n)$-periodic curve, $\boldsymbol{\mathsf{M}}\in \mathcal {M}_{3n\times 3n}$, and $\delta {\boldsymbol {x}} \in \mathbb {R}^{3n}$. For (4.14) to hold, the determinant of $\boldsymbol{\mathsf{M}}$ ought to be zero to avoid the trivial solution $\delta {\boldsymbol {x}} = \boldsymbol {0}$.

For a block-tridiagonal matrix defined as in (4.13) with $\lambda \in \mathbb {C}$, the determinant can be expressed analytically (Molinari Reference Molinari2008):

(4.15)\begin{equation} \left. \begin{aligned} \det \boldsymbol{\mathsf{M}}(\lambda) & = \frac{({-}1)^{3n}}{(-\lambda)^3}\det\left(\boldsymbol{\mathsf{T}}_S-\lambda \boldsymbol{\mathsf{I}}_{6} \right)\det\left( \prod_{i=1}^{n}\boldsymbol{\mathsf{S}}_{12}[i,i+1]\right),\\ \boldsymbol{\mathsf{T}}_S & = \prod_{i=1}^{n}\left(\begin{array}{cc} -{\boldsymbol{\mathsf{S}}^{{-}1}_{12}}^{[i,i+1]}(\boldsymbol{\mathsf{S}}_{22}^{[i-1,i]}+\boldsymbol{\mathsf{S}}_{11}^{[i,i+1]}) & -{\boldsymbol{\mathsf{S}}_{12}^{{-}1}}^{[i,i+1]}\boldsymbol{\mathsf{S}}_{12}^{[i-1,i]} \\ \boldsymbol{\mathsf{I}}_{3} & \boldsymbol{\mathsf{0}}\end{array}\right), \end{aligned} \right\} \end{equation}

which requires one to define the so-called transfer matrix $\boldsymbol{\mathsf{T}}_S$. Our generalized convexity condition (4.6) that ensures the existence of minimizing orbits guarantees that the determinant of the transfer matrix can be computed as $\boldsymbol{\mathsf{S}}_{12}^{[i,i+1]}$ is invertible $\forall i$. The multipliers $\lambda$ are then given by the solutions of

(4.16)\begin{equation} \lambda^{{-}3} \det\left(\boldsymbol{\mathsf{T}}_S-\lambda \boldsymbol{\mathsf{I}} \right) = 0, \end{equation}

so they are the non-zero eigenvalues of $\boldsymbol{\mathsf{T}}_S$. Once the multipliers have been computed, the Floquet exponent can be easily determined from (3.31).

Alternatively, note that (4.13) gives a recursive relation for $\delta {\boldsymbol {x}}_i$:

(4.17)\begin{equation} \left. \begin{gathered} \left(\boldsymbol{\mathsf{S}}_{22}^{[01]} + \boldsymbol{\mathsf{S}}_{11}^{[12]} \right) \delta {\boldsymbol{x}}_1 + \boldsymbol{\mathsf{S}}_{12}^{[12]}\delta{\boldsymbol{x}}_2 + \lambda^{{-}1}\boldsymbol{\mathsf{S}}_{21}^{[01]}\delta{\boldsymbol{x}}_n =0,\\ \lambda\boldsymbol{\mathsf{S}}_{12}^{[n,n+1]}\delta{\boldsymbol{x}}_1 + \boldsymbol{\mathsf{S}}_{21}^{[n-1,n]}\delta{\boldsymbol{x}}_{n-1} + \left(\boldsymbol{\mathsf{S}}_{22}^{[n-1,n]} + \boldsymbol{\mathsf{S}}_{11}^{[n,n+1]} \right) \delta {\boldsymbol{x}}_n = 0,\\ \boldsymbol{\mathsf{S}}_{21}^{[k-1,k]}\delta{\boldsymbol{x}}_{k-1} + \left(\boldsymbol{\mathsf{S}}_{22}^{[k-1,k]} + \boldsymbol{\mathsf{S}}_{11}^{[k,k+1]} \right) \delta {\boldsymbol{x}}_k + \boldsymbol{\mathsf{S}}_{12}^{[k, k+1]}{\boldsymbol{x}}_{k+1} =0; \quad 2\leq k\leq n-1. \end{gathered} \right\} \end{equation}

Equation (4.17) can be rewritten as

(4.18)\begin{equation} \left(\begin{array}{c} \delta {\boldsymbol{x}}_{n+1} \\ \delta {\boldsymbol{x}}_{n} \end{array}\right) = \boldsymbol{\mathsf{J}} \left(\begin{array}{c} \delta {\boldsymbol{x}}_{1} \\ \delta {\boldsymbol{x}}_{0}\end{array}\right), \end{equation}

with

(4.19)\begin{equation} \boldsymbol{\mathsf{J}} := \prod_{k=1}^{n}\left(\begin{array}{cc} -{\boldsymbol{\mathsf{S}}^{{-}1}_{12}}^{[k,k+1]}(\boldsymbol{\mathsf{S}}_{22}^{[k-1,k]}+\boldsymbol{\mathsf{S}}_{11}^{[k,k+1]}) & -{\boldsymbol{\mathsf{S}}_{12}^{{-}1}}^{[k,k+1]}\boldsymbol{\mathsf{S}}_{12}^{[k-1,k]} \\ \boldsymbol{\mathsf{I}} & \boldsymbol{\mathsf{0}}\end{array}\right). \end{equation}

The multipliers of a periodic orbit are the eigenvalues of the derivatives of the return map around the orbit, $\boldsymbol{\mathsf{J}}$ (Mackay & Meiss Reference Mackay and Meiss1983), so upon comparison with (4.15), $\boldsymbol{\mathsf{J}}=\boldsymbol{\mathsf{T}}_S$ confirming the result from (4.16).

The strength of this method resides in the fact that the problem is reduced to finding the eigenvalues of a $6\times 6$ matrix. In fact, the size of the operator matrix $\boldsymbol{\mathsf{M}}$ increases linearly with the number of discretization points, but of interest are solely the eigenvalues of the matrix $\boldsymbol{\mathsf{T}}_S$, whose size is $6\times 6$, no matter how many discretization points have been used. Although the number of matrices in the product $\boldsymbol{\mathsf{J}}$ also increases linearly with the number of discretization points, the most commonly used algorithms to determine the eigenvalues of a matrix have a cost that is generally cubic (Francis Reference Francis1961, Reference Francis1962; Kublanovskaya Reference Kublanovskaya1962). Since an $m\times m$ matrix has at most $m$ distinct eigenvalues, reducing its size is crucial for computational efficiency. The derivation of the aforementioned results in the case where the curve of interest is discretized by piecewise linears is given in Appendix E.

5 Conclusion

In this paper, after having introduced the rotational transform, the Hamiltonian behaviour of toroidal magnetic fields was used to motivate the definition of a magnetic action. The latter action served as starting point to express the on-axis rotational transform from a novel approach. The focus has been on elliptical magnetic axes, but this method applies to hyperbolic axes as well, and more generally to any periodic field line.

The action and resulting properties were studied through the lens of the calculus of variations, where variations of the curves’ geometry were performed. The first variation led to the result that extremizing curves are magnetic field lines, enabling one to identify a magnetic axis. The second variation sheds light on the geometry of the neighbouring field lines. Studying the null eigenspace of the second variation enabled the derivation of the on-axis rotational transform, with a focus on elliptical axes. The second variation, seen as an operator, was expanded in the near-axis formalism developed by Mercier, Solov'ev and Shafranov, to yield a system of periodic differential equations. The periodicity of the system allowed the use of Floquet theory to solve for the eigenspace. The key result is that the Floquet exponents of the axis were shown to match Mercier's expression (1.3) of the rotational transform. Additionally, solving for the null eigenspace was shown to lead to a Hill equation, from which the Floquet exponents were expressed as solutions of an infinite determinant equation.

Following this continuous derivation of the on-axis rotational transform through the Floquet exponents of the null eigenspace of the second variation, a discrete approach was introduced. It consists of discretizing the field line of interest and by linearity, to define the action as a sum of piecewise actions. This approach was described by Mackay & Meiss (Reference Mackay and Meiss1983) for one-dimensional Lagrangian systems. We provide a generalization of this method as our action is based on field lines which are in essence three-dimensional. Solving for the Floquet exponents was shown to be closely related to solving for the multipliers of the curve, described by Mackay & Meiss (Reference Mackay and Meiss1983) and Greene (Reference Greene1979), and the parallel between the two approaches was made as a consistency check. The on-axis Floquet multiplier is computed by finding the eigenvalues of a $6\times 6$ matrix, made of second derivatives of the action. Its efficiency in comparison with field-line tracing will be studied in future work.

Finally, this paper applies the near-axis formalism to recovering the rotational transform from a Lagrangian approach. Although the method has been introduced for elliptical axes, also called $O$-points, it can be applied to hyperbolic axes, or $X$-points, with the only difference being that the Floquet exponents are real, not purely imaginary; and, more generally, to all periodic orbits. The discrete approach stands as an alternative to compute the rotational transform instead of following field lines. We aim at using this method in stellarator optimization.

Acknowledgements

We dedicate this paper to Bob Dewar. The authors would like to thank A. Bhattacharjee and E.J. Paul for stimulating discussions and helpful suggestions.

Editor P. Helander thanks the referees for their advice in evaluating this article.

Funding

W.S. was supported by a grant from the Simons Foundation/SFARI (560651, AB), and the Department of Energy Award No. DE-SC0024548. This paper is based upon work supported by the US Department of Energy, Office of Science, Office of Fusion Energy Sciences, and has been authored by Princeton University under Contract Number DE-AC02-09CH11466 with the US Department of Energy.

Declaration of interests

The authors report no conflict of interest.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Appendix A. Floquet theory

Floquet theory was formulated by Gaston Floquet towards the end of the 19th century, in his attempt to solve linear differential equations with periodic coefficients (Floquet Reference Floquet1883). Suppose one needs to solve the linear system

(A1a,b)\begin{equation} \dot{{\boldsymbol{x}}} = \boldsymbol{\mathsf{A}}(t){\boldsymbol{x}} , \quad {\boldsymbol{x}}(t_0) = {\boldsymbol{x}}_0, \end{equation}

where $\boldsymbol{\mathsf{A}}\in \mathcal {M}_{n\times n}$ is periodic in $t$ with period $T$. Considering $n$ linearly independent solutions $\{{\boldsymbol {x}}_{1},\dots {\boldsymbol {x}}_{n}\}$ of (A1a,b), it is useful to introduce the so-called fundamental matrix $\boldsymbol{\mathsf{X}}$ by grouping ${\boldsymbol {x}}_i$ together:

(A2)\begin{equation} \boldsymbol{\mathsf{X}}(t,t_0) := \left( {\boldsymbol{x}}_1;{\boldsymbol{x}}_2;\ldots ;{\boldsymbol{x}}_n\right), \end{equation}

where ${\boldsymbol {x}}_i$ are column vectors, and the second argument has been added to specify that the initial condition occurs at time $t_0$. Thus, (A1a,b) can be rewritten as

(A3)\begin{equation} \frac{{\rm d}}{{\rm d}t}\boldsymbol{\mathsf{X}}(t,t_0) = \boldsymbol{\mathsf{A}}(t)\boldsymbol{\mathsf{X}}(t,t_0). \end{equation}

If $\boldsymbol{\mathsf{X}}(t_0,t_0) = \boldsymbol{\mathsf{I}}$, $\boldsymbol{\mathsf{X}}$ is called the principal fundamental matrix. We now state some well-known theorems of Floquet theory, the proofs of which can be found in Meiss (Reference Meiss2007).

Theorem A.1 (Abel)

The determinant of the fundamental matrix $\boldsymbol{\mathsf{X}}$ is

(A4)\begin{equation} \det \boldsymbol{\mathsf{X}}(t,t_0) = \exp\int_{t_0}^{t}\text{tr}{\boldsymbol{\mathsf{A}}(s)}\,{\rm d}s. \end{equation}

Moreover, when $(t_0,t)=(0,T)$, it can be rewritten as the product of the so-called Floquet multipliers:

(A5)\begin{equation} \det \boldsymbol{\mathsf{X}}(T,0) = \prod_{i=1}^{n}\,{\rm e}^{\nu_{i}T}, \end{equation}

where $\nu _i$ are the Floquet exponents.

This second property simply states that the Floquet multipliers are the eigenvalues of the monodromy matrix $\boldsymbol{\mathsf{m}}:= \boldsymbol{\mathsf{X}}(T,0)$ of the linear system (A3).

Theorem A.2 (Floquet–Lyapunov)

The fundamental matrix $\boldsymbol{\mathsf{X}}$ solution of the system (A3) is of the form

(A6)\begin{equation} \boldsymbol{\mathsf{X}}(t,t_0) = \boldsymbol{\mathsf{P}}(t){\rm e}^{(t-t_0)\boldsymbol{\mathsf{B}}},\end{equation}

where the matrix $\boldsymbol{\mathsf{P}}$ is symplectic and $T$-periodic, with $\boldsymbol{\mathsf{P}}(t_0)=\boldsymbol{\mathsf{I}}$, and $\boldsymbol{\mathsf{B}}$ is a constant Hamiltonian matrix, that is,

(A7a,b)\begin{equation} \boldsymbol{\mathsf{J}}\boldsymbol{\mathsf{B}} = \boldsymbol{\mathsf{B}}^{\mathsf{T}}\boldsymbol{\mathsf{J}}^{\mathsf{T}}, \quad \boldsymbol{\mathsf{J}} = \left(\begin{array}{cc} \boldsymbol{\mathsf{0}} & - \boldsymbol{\mathsf{I}} \\ \boldsymbol{\mathsf{I}} & \boldsymbol{\mathsf{0}} \end{array}\right). \end{equation}

Appendix B. Frenet–Serret frame

The Frenet–Serret frame is a local basis that spans the three-dimensional space $\mathbb {R}^3$. The terminology ‘local’ arises from the fact that this frame is defined locally along a curve ${\mathcal {C}}\subset \mathbb {R}^3$. Let us define the basis vectors and some essential properties of the frame. We refer to Duignan & Meiss (Reference Duignan and Meiss2021) as they provide the reader with a brief yet efficient description of this frame. The original mathematical developments can be found in Frenet (Reference Frenet1852).

The frame is composed of the three well-known tangent, normal and binormal vectors, defined and related to each other as follows. Provided that the curve ${\mathcal {C}}$ is described by the vector-valued function ${\boldsymbol {r}}_0\in \mathbb {R}^3$ and parametrized by the arc length $\ell$, the tangent vector ${\boldsymbol {t}}$ is defined as

(B1)\begin{equation} {\boldsymbol{t}} := \frac{{\rm d}{\boldsymbol{r}}_0}{{\rm d}\ell} = {\boldsymbol{r}}_0'. \end{equation}

The normal vector accounts for the normalized rate of change of the tangent along the curve:

(B2)\begin{equation} {\boldsymbol{n}} := {\boldsymbol{t}}'\left| {\boldsymbol{t}}'\right|^{{-}1}. \end{equation}

Finally, the binormal vector is the cross product of the tangent and the normal vector:

(B3)\begin{equation} {\boldsymbol{b}} := {\boldsymbol{t}} \times {\boldsymbol{n}}. \end{equation}

The curvature $\kappa$ and the torsion $\tau$ of the curve ${\mathcal {C}}$ can be expressed from the derivatives of ${\boldsymbol {r}}_0$:

(B4a,b)\begin{equation} \kappa := |{\boldsymbol{r}}_0''|=|{\boldsymbol{t}}'|, \quad \tau := \frac{\left( {\boldsymbol{r}}_0'\times {\boldsymbol{r}}_0''\right)\boldsymbol{\cdot} {\boldsymbol{r}}_0'''}{\kappa^2}, \end{equation}

provided that the curvature is non-vanishing. The rate of change of the Frenet–Serret frame along the curve ${\mathcal {C}}$ is written in terms of the curvature and the torsion. The resulting expressions are the so-called Frenet–Serret formulae:

(B5)\begin{equation} \frac{{\rm d}}{{\rm d}\ell}\left(\begin{array}{c}{\boldsymbol{n}} \\ {\boldsymbol{b}} \\ {\boldsymbol{t}}\end{array}\right) = \left(\begin{array}{ccc} 0 & \tau(\ell) & - \kappa(\ell) \\ - \tau(\ell) & 0 & 0\\ \kappa(\ell) & 0 & 0\end{array}\right)\left(\begin{array}{c}{\boldsymbol{n}} \\ {\boldsymbol{b}} \\ {\boldsymbol{t}}\end{array}\right). \end{equation}

With the previous relations, the reader is equipped with what is necessary to delve into near-axis expansion coordinates, based on the Frenet–Serret frame.

Appendix C. Near-axis expansion coordinates

This appendix is dedicated to the description of two sets of coordinates that have been proven to be powerful in the expansion of operators or physical quantities in the vicinity of field lines. The terminology near-axis expansion is used here, as in this paper expansions are carried out around magnetic axes, but those coordinates remain suitable for any near-field-line expansion. Appendix C.1 is dedicated to the description of Mercier coordinates, introduced by Mercier (Reference Mercier1964), and Appendix C.2 deals with the Solov'ev–Shafranov coordinates, as described in Solov'ev & Shafranov (Reference Solov'ev and Shafranov1970). They are both closely related, and a correspondence can easily be established between the two. In addition, they both are based on the Frenet–Serret frame, as described in Appendix B. In what follows, the coordinate systems are introduced in the context of magnetic fields, and each curve that is dealt with is assumed to be a field line. In addition, we emphasize that the following two subsections do not describe new material, introduced in the above references. The reader might want to refer to Jorge, Sengupta & Landreman (Reference Jorge, Sengupta and Landreman2020c) and Sengupta et al. (Reference Sengupta, Rodriguez, Jorge, Bhattacharjee and Landreman2024) for more insight on how near-axis expansion has been used in the context of plasma physics.

C.1 Mercier coordinates

Mercier's frame $\{\boldsymbol {\rho },\boldsymbol {\omega }, {\boldsymbol {t}}\}$ is based on the Frenet–Serret frame $\{{\boldsymbol {n}},{\boldsymbol {b}}, {\boldsymbol {t}}\}$, and related to the latter by a rotation of the normal and binormal vectors by a polar angle $\theta$, which is a purely geometric quantity, as shown in figure 1. The field line (that can be considered to be a magnetic axis here) ${\mathcal {C}}\subset \mathbb {R}^3$ and described by ${\boldsymbol {r}}_0$ is assumed closed and parametrized by the arc length $\ell$ with a total length $L$. Following Mercier et al. (Reference Mercier1974), the near-axis expansion is based on the construction of a tube of radius $\rho$ around the axis, such that any neighbouring point can be described by $\rho (\ell ) \boldsymbol {\rho }(\ell )$:

(C1a,b)\begin{equation} {\boldsymbol{r}}(\ell)= {\boldsymbol{r}}_0(\ell) + \rho(\ell) \boldsymbol{\rho}(\ell), \quad {\boldsymbol{r}}_0'(\ell) ={\boldsymbol{t}}(\ell), \end{equation}

where the dependence on $\ell$ has been made explicit, but is omitted in what follows for the sake of readability. However, it is made clear whenever the dependence on quantities is not obvious. Therefore $\{\boldsymbol {\rho },\boldsymbol {\omega }, {\boldsymbol {t}}\}$ is a right-handed triad related to $\{{\boldsymbol {n}},{\boldsymbol {b}}, {\boldsymbol {t}}\}$ by

(C2a,b)\begin{equation} \boldsymbol{\rho} = {\boldsymbol{n}} \cos{\theta} + {\boldsymbol{b}} \sin{\theta} , \quad \boldsymbol{\omega} = {\boldsymbol{b}} \cos{\theta} -{\boldsymbol{n}} \sin{\theta}. \end{equation}

Figure 1. Mercier's triad $\{\boldsymbol {\rho }, \boldsymbol {\omega }, {\boldsymbol {t}} \}$ related to the usual Frenet–Serret $\{ \boldsymbol {n}, \boldsymbol {b}, {\boldsymbol {t}}\}$. The solid black line is a field line.

It can be shown by direct computation that $(\rho,\omega = \theta +\int \tau \,\textrm {d}\ell ',\ell )$, with $\tau$ denoting the torsion of the axis ${\boldsymbol {r}}_0$ (see Appendix B), forms an orthogonal coordinate system with metric

(C3a,b)\begin{equation} {\rm d}s^2 = {\rm d}\rho^2 + \rho^2 \,{\rm d}\omega^2 +h^2 \,{\rm d}\ell^2, \quad h= 1-\kappa \rho \cos{\theta}, \end{equation}

where $\kappa$ denotes the curvature of ${\mathcal {C}}$. The product $\kappa \rho$ can be seen as a measure of the ratio between the radius of curvature of ${\boldsymbol {r}}_0$ and the radius of expansion around the latter. The metric tensor reads

(C4)\begin{equation} \mathsf{g}_{ij}= \begin{pmatrix} 1 & 0 & 0\\ 0 & \rho^2 & 0\\ 0 & 0 & \quad h^2 \end{pmatrix}. \end{equation}

Therefore, from the previous characterization of the Mercier coordinates, a few differential identities can be derived. Since the basis considered for Mercier coordinates is the triad $\{ \boldsymbol {\rho }, \boldsymbol {\omega }, {\boldsymbol {t}}\}$, the identity tensor is trivially written as

(C5)\begin{equation} \boldsymbol{\mathsf{I}} = \boldsymbol{\rho} \boldsymbol{\rho}+ \boldsymbol{\omega} \boldsymbol{\omega}+ {\boldsymbol{t}} {\boldsymbol{t}}. \end{equation}

By means of the metric expression (C1a,b), the gradient can be derived:

(C6)\begin{equation} \boldsymbol{\nabla} = \boldsymbol{\rho} \partial_\rho + \frac{\boldsymbol{\omega} }{\rho}\partial_\omega +\frac{{\boldsymbol{t}}}{h}\partial_\ell,\end{equation}

where the subscripts denote with respect to which coordinate the partial derivative is taken. Moreover, some pseudo-Poisson formulae can be derived for the derivatives of $\boldsymbol {\rho }$ and $\boldsymbol {\omega }$, to read

(C7a-d)\begin{equation} \boldsymbol{\rho}_{,\omega} = \boldsymbol{\omega}, \quad \boldsymbol{\rho}_{,\ell}={-}{\boldsymbol{t}}\; \kappa \cos{\theta} , \quad \boldsymbol{\omega}_{,\omega} ={-}\boldsymbol{\rho}, \quad \boldsymbol{\omega}_{,\ell}= {\boldsymbol{t}}\; \kappa \sin{\theta}, \end{equation}

where the subscript comma represents partial differentiation. Finally, using (C6), the gradient of each basis vector can be easily expressed:

(C8a-c)\begin{equation} \boldsymbol{\nabla} {\boldsymbol{t}} = \frac{\kappa}{h}{\boldsymbol{t}} {\boldsymbol{n}}, \quad \boldsymbol{\nabla} \boldsymbol{\rho}= \frac{1}{\rho}\boldsymbol{\omega}\boldsymbol{\omega} - \frac{\kappa \cos{\theta}}{h}{\boldsymbol{t}} {\boldsymbol{t}} , \quad \boldsymbol{\nabla} \boldsymbol{\omega} = \frac{\kappa \sin{\theta}}{h}{\boldsymbol{t}} {\boldsymbol{t}} -\frac{1}{\rho}\boldsymbol{\omega}\boldsymbol{\rho}. \end{equation}

In the formalism developed by Mercier, when expanding quantities in powers of $\rho$, it can be useful to have an additional ‘phase’ term $\delta$ that may simplify expressions. The latter phase is a function of the arc length $\ell$ and enables one to define the so-called Mercier angle $u$:

(C9)\begin{equation} u:=\theta + \delta = \omega-\int \tau \,{\rm d}\ell' + \delta. \end{equation}

In fact, the $\delta$ term describes the rotation of trajectories around the axis ${\boldsymbol {r}}_0$. One notes that, by periodicity, $\delta$ has to satisfy $\delta (\ell + L) = \delta (\ell )+ 2{\rm \pi} n$, with $n\in \mathbb {Z}$. The rotation function $\delta$ is important for the Solov'ev–Shafranov near-axis expansion as the latter is based on the construction of ellipses around ${\boldsymbol {r}}_0$, whose rotation is naturally important.

C.2 Solov'ev–Shafranov coordinates

The Solov'ev–Shafranov coordinate system is closely related to the Mercier triad, but differs in that the expansion is not carried out by constructing a tube of radius $\rho$ around the axis, but an ellipse of semi-axes varying along ${\boldsymbol {r}}_0$ as depicted in figure 2. Let $\{\boldsymbol {\mathcal {N}},\boldsymbol {\mathcal {B}}, {\boldsymbol {t}}\}$ be the orthogonal triad related to the Frenet–Serret frame through a rotation by function $\delta$ introduced in the Mercier formalism such that

(C10a,b)\begin{equation} \boldsymbol{\mathcal{N}}= \boldsymbol{n}\cos\delta -\boldsymbol{b}\sin\delta = \boldsymbol{\rho}\cos u -\boldsymbol{\omega}\sin u, \quad \boldsymbol{\mathcal{B}}=\boldsymbol{n}\sin\delta +\boldsymbol{b}\cos\delta = \boldsymbol{\rho} \sin u +\boldsymbol{\omega} \cos u, \end{equation}

where $\{ \boldsymbol {\rho }, \boldsymbol {\omega }, {\boldsymbol {t}} \}$ form the Mercier basis, with Mercier angle $u$ as defined in Appendix C.1. Equation (C10a,b) shows how intrinsically related are the two frames. Again, the field line considered, $\boldsymbol {r}_0$, forms a closed curve $\mathcal {C}$ parametrized by the arc length $\ell$ and a total length $L$. It is obvious that $\boldsymbol {\mathcal {N}}$ and $\boldsymbol {\mathcal {B}}$ depend on $\ell$, enabling one to write

(C11a,b)\begin{equation} \boldsymbol{\mathcal{N}}'={-}\kappa \cos\delta{\boldsymbol{t}} - u'\boldsymbol{\mathcal{B}}, \quad \boldsymbol{\mathcal{B}}'={-}\kappa \sin\delta {\boldsymbol{t}}+ u' \boldsymbol{\mathcal{N}}, \end{equation}

where $\kappa$ denotes the curvature of ${\boldsymbol {r}}_0$, which is a local property. Following Solov'ev & Shafranov (Reference Solov'ev and Shafranov1970), the position vector $\boldsymbol {r}$ of any point in the vicinity of the axis can be expressed as

(C12)\begin{equation} \boldsymbol{r}=\boldsymbol{r}_0 + x \boldsymbol{\mathcal{N}}+y \boldsymbol{\mathcal{B}}, \end{equation}

where $x$ and $y$ are expansion parameters depending on the position of the frame $\{ \boldsymbol {\mathcal {N}}, \boldsymbol {\mathcal {B}}, {\boldsymbol {t}} \}$ through $\ell$. In Mercier coordinates, $x=\rho \cos u$ and $y=\rho \sin u$. Differentiating ${\boldsymbol {r}}$ with respect to the length parameter $\ell$:

(C13)\begin{equation} \frac{{\rm d}\boldsymbol{r}}{{\rm d}\ell}= (x'+u' y) \boldsymbol{\mathcal{N}} + (y'-u' x) \boldsymbol{\mathcal{B}} + h {\boldsymbol{t}} , \end{equation}

where the prime denotes total differentiation with respect to $\ell$, and ${\boldsymbol {r}}_0'=\boldsymbol {t}$. Additionally, $h=1-\kappa (x\cos \delta - y \sin \delta ) = 1-\kappa \rho \cos \theta$ and $u'=\delta '-\tau$. It induces the following metric:

(C14)\begin{equation} {\rm d}s^2={{\rm d} x}^2+{{\rm d} y}^2+2 u' (y \,{{\rm d} x}-x \,{{\rm d} y})\,{\rm d}\ell + \left[h^2+{u'}^2(x^2+y^2) \right]\,{\rm d}\ell^2. \end{equation}

The covariant and contravariant forms $\mathsf{g}_{ij}$ and $\mathsf{g}^{ij}$ of the metric tensor for $\textrm {d}s^2$ read

(C15)\begin{equation} \mathsf{g}_{ij}= \begin{pmatrix} 1 & 0 & u' y\\ 0 & 1 & -u' x\\ u'y & -u'x & h^2+{u'}^2(x^2+y^2) \end{pmatrix} \end{equation}

and

(C16)\begin{equation} \mathsf{g}^{ij}=\frac{1}{h^2} \begin{pmatrix} h^2+{u'}^2 y^2 & -{u'}^2 x y & -u' y\\ -{u'}^2 x y & 1 & u' x\\ -u'y & u'x & 1 \end{pmatrix} \end{equation}

with $\sqrt {|g|}=h$. A direct computation shows that the following basis vectors lead to the same metric tensors:

(C17)\begin{equation} \left. \begin{array}{lll} \boldsymbol{e}^1 = \boldsymbol{\nabla} x=\boldsymbol{\mathcal{N}}-\dfrac{u' y}{h}\boldsymbol{t}, & \boldsymbol{e}^2= \boldsymbol{\nabla} y=\boldsymbol{\mathcal{B}}+\dfrac{u' x}{h}\boldsymbol{t}, & \boldsymbol{e}^3= \boldsymbol{\nabla} \ell = \dfrac{1}{h}\boldsymbol{t},\\ \boldsymbol{e}_1 = h \left(\boldsymbol{\nabla} y \times \boldsymbol{\nabla} \ell\right) = \boldsymbol{\mathcal{N}}, & \boldsymbol{e}_2= h\left( \boldsymbol{\nabla} \ell\times \boldsymbol{\nabla} x\right)= \boldsymbol{\mathcal{B}}, & \boldsymbol{e}_3= h\left(\boldsymbol{\nabla} x \times \boldsymbol{\nabla} y\right)\\ & & \ \ = h \boldsymbol{t}+u'({-}x\boldsymbol{\mathcal{B}} + y \boldsymbol{\mathcal{N}}),\end{array} \right\} \end{equation}

which will be useful and, in fact, more convenient in the near-axis expansion of the quantities of interest in this paper. The gradient in the basis $\{ \boldsymbol {e}_j\}_j$ is expressed as

(C18)\begin{equation} \boldsymbol{\nabla} \equiv \boldsymbol{e}_j\; g^{ij}\frac{\partial}{\partial \alpha^{i}}\end{equation}

with $\alpha ^1=x$, $\alpha ^2=y$, $\alpha ^3=\ell$ and the Einstein summation convention.

Figure 2. Solov'ev–Shafranov triad ${\{ \boldsymbol {\mathcal {N}}, \boldsymbol {\mathcal {B}},{\boldsymbol {t}}\}}$ related to the usual Frenet–Serret $\{ {\boldsymbol {n}}, {\boldsymbol {b}}, {\boldsymbol {t}}\}$. The solid black line is a field line.

Appendix D. Derivation of the rotational transform from the action in Mercier coordinates

D.1 Near-axis expansion of the second variation tensor

As we did in the Solov'ev–Shafranov geometry in the body of this paper, in order to expand the operator

(D1)\begin{equation} \boldsymbol{\mathsf{M}} := \frac{\delta^2 \mathcal{S}}{\delta {\boldsymbol{x}} \delta {\boldsymbol{x}}}={-}\left(\boldsymbol{\mathsf{I}}\times {\boldsymbol{B}} \right) \frac{{\rm d}}{{\rm d}\ell}+ {\boldsymbol{x}}' \times (\boldsymbol{\nabla} {\boldsymbol{B}})^\mathsf{T} \end{equation}

in Mercier coordinates, ${\boldsymbol {B}}$ is expanded in the parameter $\rho$ such that $\kappa \rho \ll 1$. To evaluate $\boldsymbol{\mathsf{M}}$ to lowest order in $\rho$, the magnetic field needs to be expanded up to first order. Therefore, we set

(D2a,b)\begin{equation} {\boldsymbol{B}} = B_0(\ell) {\boldsymbol{t}} + \rho {\boldsymbol{B}}_1 , \quad {\boldsymbol{B}}_1= \left( B_{1}^{\rho} \boldsymbol{\rho} + B_{1}^{\omega}\boldsymbol{\omega} + B_{1}^{t}{\boldsymbol{t}} \right). \end{equation}

The next step in the expansion of $\boldsymbol{\mathsf{M}}$ is to compute $\boldsymbol {\nabla } {\boldsymbol {B}}$. It is straightforward from the differential identity (C6):

(D3)\begin{equation} \boldsymbol{\nabla} {\boldsymbol{B}} = \frac{1}{h} \left({\boldsymbol{t}} {\boldsymbol{t}} \; B_0'(\ell) + {\boldsymbol{t}} {\boldsymbol{n}}\; \kappa B_0 \right) + \boldsymbol{\rho} \; {\boldsymbol{B}}_1 + \rho \boldsymbol{\nabla} {\boldsymbol{B}}, \end{equation}

such that, to leading order

(D4)\begin{equation} \left. \begin{aligned} {\boldsymbol{x}}'\times \boldsymbol{\nabla} {\boldsymbol{B}}^\mathsf{T} & =\kappa B_0 {\boldsymbol{b}} {\boldsymbol{t}} + \left(B_{1}^{\rho} \boldsymbol{\omega} -B_{1}^{\omega} \boldsymbol{\rho} \right)\boldsymbol{\rho} + \boldsymbol{\omega}\boldsymbol{\omega}\;(\partial_\omega B_{1}^{\rho}-B_{1}^{\omega}) -\boldsymbol{\omega}\boldsymbol{\rho}\, (\partial_\omega B_{1}^{\omega}+B_{1}^{\rho}),\\ -(\boldsymbol{\mathsf{I}}\times {\boldsymbol{B}}) & =(\boldsymbol{\rho}\boldsymbol{\omega} -\boldsymbol{\omega}\boldsymbol{\rho})B_0. \end{aligned} \right\} \end{equation}

Finally, $\boldsymbol{\mathsf{M}}$ as given by (3.4) simplifies to

(D5)\begin{equation} \left. \begin{aligned} \boldsymbol{\mathsf{M}} & := \boldsymbol{\mathsf{M}}_1 \frac{{\rm d}}{{\rm d}\ell}+\boldsymbol{\mathsf{M}}_2, \\ \boldsymbol{\mathsf{M}}_1 & = (\boldsymbol{\rho}\boldsymbol{\omega} -\boldsymbol{\omega}\boldsymbol{\rho})B_0 ,\\ \boldsymbol{\mathsf{M}}_2 & = \kappa B_0 {\boldsymbol{b}} {\boldsymbol{t}} + \left(B_{1}^{\rho} \boldsymbol{\omega} -B_{1}^{\omega} \boldsymbol{\rho} \right)\boldsymbol{\rho} + \boldsymbol{\omega}\boldsymbol{\omega}\,(\partial_\omega B_{1}^{\rho}-B_{1}^{\omega}) -\boldsymbol{\omega}\boldsymbol{\rho}\, (\partial_\omega B_{1}^{\omega}+B_{1}^{\rho}). \end{aligned} \right\} \end{equation}

So far, the power expansion of the gradient of the magnetic field relied on a generic expression of ${\boldsymbol {B}}$ in terms of $B_{1}^{\rho }$, $B_{1}^{\omega }$ and $B_{1}^{t}$. The latter functions can be determined enforcing additional constraints such as $\boldsymbol {\nabla }\boldsymbol {\cdot } {\boldsymbol {B}}=0$ and $\boldsymbol {\nabla } \times {\boldsymbol {B}} = \boldsymbol {J}_0=J_0(\ell ){\boldsymbol {t}}$. The divergence and curl of ${\boldsymbol {B}}$ can be evaluated directly from the $\boldsymbol {\nabla } {\boldsymbol {B}}$ tensor, using dyadic algebra. From

(D6)\begin{equation} \left. \begin{aligned} & \boldsymbol{\nabla} \boldsymbol{\cdot} {\boldsymbol{B}} = \boldsymbol{\mathsf{I}}\boldsymbol{:}\boldsymbol{\nabla} {\boldsymbol{B}} = B_0' + 2 B_{1}^{\rho} + \partial_\omega B_{1}^{\omega},\\ & \boldsymbol{\nabla} \times {\boldsymbol{B}} ={-}\boldsymbol{\mathsf{I}}\boldsymbol{\overset{\times}{.}}\boldsymbol{\nabla} {\boldsymbol{B}} = {\boldsymbol{t}}\; (2 B_{1}^{\omega}-\partial_\omega B_{1}^{\rho}) + \left(\boldsymbol{\omega} -\boldsymbol{\rho}\partial_\omega\right)(B_0 \kappa \cos\theta -B_{1}^{t}), \end{aligned} \right\} \end{equation}

where $\partial _{\omega }:=\partial /\partial \omega$, it can be easily shown that

(D7a-c)\begin{equation} B_{1}^{t}= \kappa B_0 \cos\theta=B_{1t}, \quad B_{1}^{\rho}={-}\frac{1}{2}(B_0' + \partial_\omega b_{1}) = B_{1\rho}, \quad B_{1}^{\omega}= \frac{1}{2}J_0+b_1=B_{1\omega}, \end{equation}

where $b_1$ satisfies the harmonic equation

(D8)\begin{equation} \left(\partial_\omega^2 +4\right)b_1=0. \end{equation}

Following Mercier et al. (Reference Mercier1974), the solution of (D8) can be expressed as

(D9)\begin{equation} \frac{b_1}{B_0}=b_{c2}\cos{(2u)} + b_{s2}\sin{(2u)} = \tanh{\eta(\ell)}\left(\delta'-\tau +\frac{J_0}{2B_0}\right)\cos{(2u)}+\frac{\eta'}{2}\sin{(2u)},\end{equation}

where $u$ is the Mercier angle as introduced in Appendix C.1. The functions $\eta (\ell )$ and $\delta (\ell )$ represent, respectively, the eccentricity and the rotation of the elliptic flux surfaces winding around the axis, given by the flux function

(D10)\begin{equation} \psi=\rho^2 B_0\left[ \cosh({\eta})+\sinh({\eta})\sin{(2u)} \right]. \end{equation}

D.2 Floquet exponents and rotational transform

Now that the magnetohydrodynamic constraints have been enforced, let us obtain the null eigenvector ${\boldsymbol {v}}= v^t {\boldsymbol {t}} + v^{\rho } \boldsymbol {\rho }+ v^{\omega } \boldsymbol {\omega }$ of $\boldsymbol{\mathsf{M}}$ such that $\boldsymbol{\mathsf{M}}{\boldsymbol {v}} = \boldsymbol {0}$, in order to determine the Floquet exponents. We take advantage of the orthogonality of Mercier coordinates to work with the covariant representation of ${\boldsymbol {v}}$, so the notation is less heavy. The metric tensor for this set of coordinates is given by (C4). We state that the components $v_t,v_\rho, v_\omega$ are functions of $\omega$ and $\ell$. The $\boldsymbol{\mathsf{M}}_1$ terms can be shown to be

(D11)\begin{equation} \boldsymbol{\mathsf{M}}_1 \frac{{\rm d}}{{\rm d}\ell} {\boldsymbol{v}} = B_0 \left[ \boldsymbol{\rho}\,(v_\omega'-\kappa \sin\theta\, v_t)-\boldsymbol{\omega} (v_\rho' +\kappa \cos\theta\, v_t) \right]. \end{equation}

Similarly,

(D12)\begin{align} \boldsymbol{\mathsf{M}}_2 {\boldsymbol{v}} & = \boldsymbol{\rho}\;\left[B_0 v_\omega' +v_\omega (B_0'+B_{1\rho})-B_{1\omega}v_\rho\right]\nonumber\\ & \quad +\boldsymbol{\omega}\left[{-}B_0 v_\rho' +v_\omega (B_{1\omega}-J_0)+B_{1\rho}v_\rho\right]. \end{align}

Thus, we observe that $\boldsymbol{\mathsf{M}}{\boldsymbol {v}} =\boldsymbol {0}$ only has $\boldsymbol {\rho },\boldsymbol {\omega }$ components with $\ell$ derivatives of $v_\rho,v_\omega$. The tangential component $v_t$ is thus an arbitrary constant, which can be absorbed by redefining $v_\rho,v_\omega$. We therefore set $v_t=0$. The condition $\boldsymbol{\mathsf{M}}{\boldsymbol {v}} =\boldsymbol {0}$ leads to the system

(D13)\begin{equation} B_0 \begin{pmatrix} 0 & +1 \\-1 & 0 \end{pmatrix} \dfrac{{\rm d}}{{\rm d} \ell} \begin{pmatrix} v_\rho\\ v_\omega \end{pmatrix} + \begin{pmatrix} B_{1\omega} & & [B_0'+B_{1\rho}]\\ B_{1\rho} & & [B_{1\omega}-J_0] \end{pmatrix} \begin{pmatrix} v_\rho\\ v_\omega \end{pmatrix} =0. \end{equation}

Substituting the constraints (D7a–c) into (D13), we obtain the equivalent system of coupled linear partial differential equations:

(D14)\begin{equation} \left. \begin{gathered} v_\omega' +\left( \frac{B_0'}{2B_0}-\frac{\partial_\omega b_1}{2B_0} \right) v_\omega -\left(\frac{J_0/2}{B_0}+\frac{b_1}{B_0}\right) v_\rho = 0,\\ v_\rho' +\left( \frac{B_0'}{2B_0}+\frac{\partial_\omega b_1}{2B_0} \right) v_\omega +\left(\frac{J_0/2}{B_0}-\frac{b_1}{B_0}\right) v_\omega = 0. \end{gathered} \right\} \end{equation}

Therefore, the the system (D14) can be represented in the form

(D15)\begin{equation} \frac{{\rm d}\tilde{\boldsymbol{v}}}{{\rm d}\ell}=\boldsymbol{\mathsf{A}}(\ell) \tilde{\boldsymbol{v}}, \quad \tilde{\boldsymbol{v}}= (v_\rho, v_\omega)^{\mathsf{T}}. \end{equation}

Here, $\boldsymbol{\mathsf{A}}$ is a matrix with components periodic in $\ell$ with period $L$. Note that this representation is possible since all the quantities between brackets are depending on $\ell$ along the field line from which the expansion is carried out, $\omega$ included (Solov'ev & Shafranov Reference Solov'ev and Shafranov1970; Duignan & Meiss Reference Duignan and Meiss2021). We can directly use the Floquet theorem on (D15) to conclude that the solution must be of the form

(D16)\begin{equation} \tilde{{\boldsymbol{v}}}=\boldsymbol{\mathsf{U}}(\ell){\rm e}^{\boldsymbol{\mathsf{C}} \ell/L}, \end{equation}

where $\boldsymbol{\mathsf{U}}$ is a symplectic periodic matrix and $\boldsymbol{\mathsf{C}}$ is a constant Hamiltonian matrix. The eigenvalues of $\boldsymbol{\mathsf{C}}$ are the Floquet exponents, which must be of the form $\pm \textrm {i}\, \nu$ ($\nu \in \mathbb {R}$) near an elliptic axis.

An equivalent approach to solve the system from (D14) and hence identify the exponents $\nu$ is to use the fact that $b_1$ only has second harmonics in $u$. This allows us to seek a solution where $\tilde {{\boldsymbol {v}}}$ only has first harmonics in $u$. From now on, we follow this approach. We show that there exist solutions to $\boldsymbol{\mathsf{M}}{\boldsymbol {v}} =\boldsymbol {0}$ with

(D17)\begin{equation} \begin{pmatrix} v_\rho(\ell) \\ v_\omega(\ell) \end{pmatrix} = \begin{pmatrix} v_{\rho c}(\ell) \\ v_{\omega c}(\ell) \end{pmatrix} \cos u + \begin{pmatrix} v_{\rho s}(\ell) \\ v_{\omega s}(\ell) \end{pmatrix}\sin u. \end{equation}

Substituting the above into (D14), together with the expressions for $B_{1\rho }, B_{1\omega }$ from (D7a–c), yields first- and third-order harmonics in $u$. Equating the terms with first harmonics in $u$, the $v_\omega '$ equation yields

(D18)\begin{equation} \left. \begin{gathered} {}v_{\omega c}' -(\tau -\delta')v_{\omega s} -\left(\frac{J_0/2}{B_0}\right) v_{\rho c} + \frac{B_0'}{2 B_0} v_{\omega c} + \frac{1}{2}(\mathfrak{a}_{c2c} + \mathfrak{a}_{s2s} ) =0,\\ {}v_{\omega s}' +(\tau -\delta')v_{\omega c} -\left(\frac{J_0/2}{B_0}\right) v_{\rho s} + \frac{B_0'}{2 B_0} v_{\omega s} + \frac{1}{2}(\mathfrak{a}_{s2c} - \mathfrak{a}_{c2s} ) =0. \end{gathered} \right\} \end{equation}

Equating the third harmonic terms leads to the following constraints:

(D19a,b)\begin{equation} \mathfrak{a}_{c2c}= \mathfrak{a}_{s2s} , \quad \mathfrak{a}_{s2c}={-} \mathfrak{a}_{c2s}, \end{equation}

where the $\mathfrak {a}$ terms have the following definitions:

(D20)\begin{equation} \left. \begin{aligned} \mathfrak{a}_{c2c} & ={-}\frac{b_{c2}}{B_0}v_{\rho c}-\frac{b_{s2}}{B_0}v_{\omega c}, \quad \mathfrak{a}_{s2s}={-}\frac{b_{s2}}{B_0}v_{\rho s}+\frac{b_{c2}}{B_0}v_{\omega s},\\ \mathfrak{a}_{s2c} & ={-}\frac{b_{s2}}{B_0}v_{\rho c}+\frac{b_{c2}}{B_0}v_{\omega c}, \quad \mathfrak{a}_{c2s}={-}\frac{b_{c2}}{B_0}v_{\rho s}-\frac{b_{s2}}{B_0}v_{\omega s}. \end{aligned} \right\} \end{equation}

The constraint (D19a,b) and the definitions (D20) imply, together with $b_{s2}^2+b_{c2}^2\neq 0$, that

(D21a,b)\begin{equation} v_{\rho s}= v_{\omega c}, \quad v_{\rho c}={-} v_{\omega s}, \end{equation}

which allows us to rewrite (D18) solely in terms of $v_{\omega c}, v_{\omega s}$. Simplification leads to

(D22)\begin{equation} \left. \begin{gathered} v_{\omega c}' + \left(\frac{B_0'}{2 B_0} -\frac{b_{s2}}{B_0} \right) v_{\omega c} +\left( \frac{J_0/2}{B_0}- \tau +\delta' +\frac{b_{c2}}{B_0}\right) v_{\omega s} =0,\\ v_{\omega s}' + \left(\frac{B_0'}{2 B_0} +\frac{b_{s2}}{B_0} \right) v_{\omega s} -\left( \frac{J_0/2}{B_0}- \tau +\delta' -\frac{b_{c2}}{B_0}\right) v_{\omega c} =0. \end{gathered} \right\} \end{equation}

Finally, substituting (D9), we get

(D23)\begin{equation} \left. \begin{gathered} v_{\omega c}' + \left(\frac{B_0'}{2 B_0} -\frac{\eta'}{2} \right) v_{\omega c} +\frac{{\rm e}^{+\eta}}{\cosh{(\eta)}}\left( \frac{J_0/2}{B_0}- \tau +\delta' \right) v_{\omega s} =0,\\ v_{\omega s}' + \left(\frac{B_0'}{2 B_0} +\frac{\eta'}{2} \right) v_{\omega s} -\frac{{\rm e}^{-\eta}}{\cosh{(\eta)}}\left( \frac{J_0/2}{B_0}- \tau +\delta'\right) v_{\omega c} =0. \end{gathered} \right\} \end{equation}

It is possible to further simplify the system by introducing new variables $X,Y$:

(D24a,b)\begin{equation} v_{\omega c}= \frac{1}{\sqrt{B_0'}}{\rm e}^{+\eta/2}X(\ell), \quad v_{\omega s}= \frac{1}{\sqrt{B_0'}} {\rm e}^{-\eta/2}Y(\ell) \end{equation}

such that (D23) reduces to

(D25a-c)\begin{equation} X' + \varOmega_0(\ell) Y =0, \quad Y'-\varOmega_0(\ell) X =0, \quad \varOmega_0(\ell) = \frac{\dfrac{J_0/2}{B_0}- \tau +\delta'}{ \cosh{\eta}}. \end{equation}

One notes that (D24a,b) has the exact same form as (3.27a,b), which describe a harmonic oscillator system with a ‘time’-dependent frequency $\varOmega _0$. Once again, using the complex variable $Z=X+\textrm {i} Y$, the system can be rewritten as a single complex ODE:

(D26)\begin{equation} Z'-{\rm i}\, \varOmega_0 Z =0 \quad \Rightarrow \quad Z(\ell)= Z_0 \exp{\int_0^\ell {\rm i}\,\varOmega_0(s)\, {\rm d}s}. \end{equation}

Separating the periodic and non-periodic parts of the exponential we get

(D27)\begin{equation} \left. \begin{gathered} Z(\ell) = Z_p(\ell) {\rm e}^{{\rm i}\, \nu \ell/L}, \quad Z_p(\ell)= Z_0 \exp{\int_0^\ell {\rm i}\,\tilde{\varOmega}_0(s) \,{\rm d}s},\\ \bar{\varOmega} \equiv \frac{1}{L}\int_0^L \varOmega_0(s) \,{\rm d}s, \quad \tilde{\varOmega_0}= \varOmega_0(\ell) - \bar{\varOmega}. \end{gathered} \right\} \end{equation}

Comparing (D27) with (D16), we find that $\nu$, given by

(D28)\begin{equation} \nu = \int_0^L \varOmega_0(s)\,{\rm d}s \pmod{2{\rm \pi}} = \oint \frac{\dfrac{J_0(s)/2}{B_0(s)}- \tau(s) +\delta'(s)}{\cosh{\eta(s)}}\,{\rm d}s \pmod{2{\rm \pi}}, \end{equation}

is the Floquet exponent for the system. It confirms the result obtained in the Solov'ev–Shafranov coordinates, and it is a second confirmation that the second variation of the magnetic field action, when expanded by means of a near-axis formalism and combined with Floquet theory, yields the correct on-axis rotational transform. This was expected since the result is independent of coordinates.

Appendix E. Discrete formalism: the piecewise linear discretization

A discrete method to compute the on-axis Floquet exponent has been introduced in § 4. Here, we give the computations in the particular case of the curve being broken down to a concatenation of segments. Recall that the discrete action was introduced as

(E1)\begin{equation} \mathcal{S} = \sum_{i=1}^{n-1} \int_{\mathcal{C}_i} {\boldsymbol{A}} \boldsymbol{\cdot} {\rm d}\boldsymbol{\ell}. \end{equation}

Thus, taking $\mathcal {C}_i$ to be segments, the action sums up to a sum of integrals along piecewise linears $\mathcal {C}_i:=\{ \mathbb {R}^3\ni {\boldsymbol {x}} = \zeta ({\boldsymbol {x}}_{i+1}-{\boldsymbol {x}}_{i}) + {\boldsymbol {x}}_i \mid \zeta \in [0,1] \}$:

(E2)\begin{equation} \left. \begin{aligned} \mathcal{S} & = \sum_{i=1}^{n-1} S({\boldsymbol{x}}_{i},{\boldsymbol{x}}_{i+1}),\\ S({\boldsymbol{x}}_{i},{\boldsymbol{x}}_{i+1}) & = \int_0^{1}\, {\rm d}\zeta{\boldsymbol{A}}\left(\zeta({\boldsymbol{x}}_{i+1}-{\boldsymbol{x}}_i)+{\boldsymbol{x}}_i\right)\boldsymbol{\cdot} ({\boldsymbol{x}}_{i+1}-{\boldsymbol{x}}_i)\\ & =\int_0^{1}\,{\rm d}\zeta \boldsymbol{A}\left({\boldsymbol{v}}({\boldsymbol{x}}(\zeta))\right) \boldsymbol{\cdot} \boldsymbol{u}, \end{aligned} \right\} \end{equation}

with ${\boldsymbol {x}}(0) = {\boldsymbol {x}}_i$ and ${\boldsymbol {x}}(1)={\boldsymbol {x}}_{i+1}$, and the vector functions $\boldsymbol {u}$ and ${\boldsymbol {v}}$ defined as follows:

(E3)\begin{equation} \left. \begin{aligned} \boldsymbol{u}({\boldsymbol{x}}_i,{\boldsymbol{x}}_{i+1}) & = {\boldsymbol{x}}_{i+1}-{\boldsymbol{x}}_i,\\ {\boldsymbol{v}}(\zeta,{\boldsymbol{x}}_i, {\boldsymbol{x}}_{i+1}) & = \zeta({\boldsymbol{x}}_{i+1}-{\boldsymbol{x}}_i)+{\boldsymbol{x}}_i, \end{aligned} \right\} \end{equation}

such that $\boldsymbol {\nabla }_{{\boldsymbol {x}}_i} \boldsymbol {u}({\boldsymbol {x}}_i,{\boldsymbol {x}}_{i+1}) = -\boldsymbol{\mathsf{I}}$, $\boldsymbol {\nabla }_{{\boldsymbol {x}}_{i+1}} \boldsymbol {u}({\boldsymbol {x}}_i,{\boldsymbol {x}}_{i+1}) = \boldsymbol{\mathsf{I}}$, $\boldsymbol {\nabla }_{{\boldsymbol {x}}_i}{\boldsymbol {v}}(\zeta,{\boldsymbol {x}}_i, {\boldsymbol {x}}_{i+1}) = (1-\zeta ) \boldsymbol{\mathsf{I}}$, $\boldsymbol {\nabla }_{{\boldsymbol {x}}_{i+1}}{\boldsymbol {v}}(\zeta, {\boldsymbol {x}}_i, {\boldsymbol {x}}_{i+1}) = \zeta \boldsymbol{\mathsf{I}}$. Since

(E4)\begin{align} \boldsymbol{\nabla}_{{\boldsymbol{x}}_i} \left[ \boldsymbol{A}({\boldsymbol{v}}({\boldsymbol{x}}_i)) \boldsymbol{\cdot}\boldsymbol{u}({\boldsymbol{x}}_i)\right] & = \boldsymbol{\mathsf{J}}^{\mathsf{T}}_{\boldsymbol{A} \circ {\boldsymbol{v}}}({\boldsymbol{x}}_i) \boldsymbol{u}({\boldsymbol{x}}_i) + \boldsymbol{\mathsf{J}}^{\mathsf{T}}_{\boldsymbol{u}}({\boldsymbol{x}}_i) \boldsymbol{A}({\boldsymbol{v}}({\boldsymbol{x}}_i))\nonumber\\ & = \left[ \boldsymbol{\mathsf{J}}^{\mathsf{T}}_{{\boldsymbol{v}}}({\boldsymbol{x}}_i) \boldsymbol{\mathsf{J}}^{\mathsf{T}}_{\boldsymbol{A}}({\boldsymbol{v}}({\boldsymbol{x}}_i))\right] \boldsymbol{u}({\boldsymbol{x}}_i)+ \boldsymbol{\mathsf{J}}^{\mathsf{T}}_{\boldsymbol{u}}({\boldsymbol{x}}_i) \boldsymbol{A}({\boldsymbol{v}}({\boldsymbol{x}}_i)), \end{align}

where $\boldsymbol{\mathsf{J}}_{\boldsymbol {a}}^{\mathsf {T}}$ stands for the transpose of the Jacobian matrix of the vector field $\boldsymbol {a}$, one gets the following first-order derivatives of the action:

(E5)\begin{equation} \left. \begin{aligned} \boldsymbol{S}_1^{[i,i+1]} & = \int_0^{1}(1-\zeta)\boldsymbol{\mathsf{J}}_{{\boldsymbol{A}}}^{\mathsf{T}}\left(\zeta({\boldsymbol{x}}_{i+1}-{\boldsymbol{x}}_i)+ {\boldsymbol{x}}_i\right)({\boldsymbol{x}}_{i+1}-{\boldsymbol{x}}_i) - {\boldsymbol{A}}\left(\zeta({\boldsymbol{x}}_{i+1}-{\boldsymbol{x}}_i)+ {\boldsymbol{x}}_i\right) \,{\rm d}\zeta,\\ \boldsymbol{S}_2^{[i-1,i]} & = \int_0^{1}\zeta \boldsymbol{\mathsf{J}}_{{\boldsymbol{A}}}^{\mathsf{T}}\left(\zeta({\boldsymbol{x}}_{i}-{\boldsymbol{x}}_{i-1})+ {\boldsymbol{x}}_{i-1}\right)({\boldsymbol{x}}_{i}-{\boldsymbol{x}}_{i-1}) + {\boldsymbol{A}}\left(\zeta({\boldsymbol{x}}_{i}-{\boldsymbol{x}}_{i-1})+ {\boldsymbol{x}}_{i-1}\right) \,{\rm d}\zeta. \end{aligned} \right\} \end{equation}

A direct computation of the second derivatives reads

(E6)\begin{equation} \left. \begin{aligned} \boldsymbol{\mathsf{S}}_{12}^{[i-1,i]} & =\int_0^{1}\, {\rm d}\zeta \zeta\left[ (1-\zeta) \boldsymbol{\nabla}_{{\boldsymbol{v}}} \boldsymbol{\mathsf{J}}^{\mathsf{T}}_{{\boldsymbol{A}}}({\boldsymbol{v}}) ({\boldsymbol{x}}_i - {\boldsymbol{x}}_{i-1}) - \boldsymbol{\mathsf{J}}^{\mathsf{T}}_{{\boldsymbol{A}}}({\boldsymbol{v}})\right]+ (1-\zeta)\boldsymbol{\mathsf{J}}_{{\boldsymbol{A}}}({\boldsymbol{v}}),\\ \boldsymbol{\mathsf{S}}_{21}^{[i,i+1]} & =\int_0^{1}\, {\rm d}\zeta (1-\zeta)\left[ \zeta \boldsymbol{\nabla}_{{\boldsymbol{v}}} \boldsymbol{\mathsf{J}}^T_{{\boldsymbol{A}}}({\boldsymbol{v}}) ({\boldsymbol{x}}_{i+1} - {\boldsymbol{x}}_{i}) + \boldsymbol{\mathsf{J}}^T_{{\boldsymbol{A}}}({\boldsymbol{v}})\right]+ \zeta \boldsymbol{\mathsf{J}}_{{\boldsymbol{A}}}({\boldsymbol{v}}),\\ \boldsymbol{\mathsf{S}}_{22}^{[i-1,i]} & =\int_0^{1}\, {\rm d}\zeta \zeta\left[ \zeta \boldsymbol{\nabla}_{{\boldsymbol{v}}} \boldsymbol{\mathsf{J}}^{\mathsf{T}}_{{\boldsymbol{A}}}({\boldsymbol{v}}) ({\boldsymbol{x}}_i - {\boldsymbol{x}}_{i-1}) + \left(\boldsymbol{\mathsf{J}}_{{\boldsymbol{A}}}({\boldsymbol{v}}) +\boldsymbol{\mathsf{J}}^{\mathsf{T}}_{{\boldsymbol{A}}}({\boldsymbol{v}})\right)\right],\\ \boldsymbol{\mathsf{S}}_{11}^{[i,i+1]} & =\int_0^{1}\, {\rm d}\zeta (1-\zeta)\left[ (1-\zeta) \boldsymbol{\nabla}_{{\boldsymbol{v}}} \boldsymbol{\mathsf{J}}^{\mathsf{T}}_{{\boldsymbol{A}}}({\boldsymbol{v}}) ({\boldsymbol{x}}_{i+1} - {\boldsymbol{x}}_{i}) - \left(\boldsymbol{\mathsf{J}}_{{\boldsymbol{A}}}({\boldsymbol{v}})+\boldsymbol{\mathsf{J}}_{{\boldsymbol{A}}}^{\mathsf{T}}({\boldsymbol{v}})\right)\right]. \end{aligned} \right\} \end{equation}

The second-order derivatives above can be easily computed numerically. We recall that the multipliers $\lambda$ of the curve are given by solutions of

(E7)\begin{equation} \left. \begin{gathered} \det \boldsymbol{\mathsf{M}}(\lambda) = ({-}1)^n \lambda ^{{-}3}\det\left(\boldsymbol{\mathsf{T}}_S-\lambda \boldsymbol{\mathsf{I}}_{6} \right)\det\left( \prod_{i=1}^{n}\boldsymbol{\mathsf{S}}_{12}[i,i+1]\right),\\ \boldsymbol{\mathsf{T}}_S = \prod_{i=1}^{n}\left(\begin{array}{cc} -{\boldsymbol{\mathsf{S}}^{{-}1}_{12}}^{[i,i+1]}(\boldsymbol{\mathsf{S}}_{22}^{[i-1,i]}+\boldsymbol{\mathsf{S}}_{11}^{[i,i+1]}) & -{\boldsymbol{\mathsf{S}}_{12}^{{-}1}}^{[i,i+1]}\boldsymbol{\mathsf{S}}_{12}^{[i-1,i]} \\ \boldsymbol{\mathsf{I}}_{3} & 0\end{array}\right), \end{gathered} \right\} \end{equation}

and that they are linked to the Floquet exponent by $\lambda = \textrm {e}^{\textrm {i}\,\nu }$. From the set of expressions (E6), the determinant of $\boldsymbol{\mathsf{M}}$ can be computed, leading to the multipliers. The above can be implemented numerically as a novel tool to compute the on-axis Floquet exponent.

References

Arnol'd, V.I. 1963 Proof of a theorem of A. N. Kolmogorov on the invariance of quasi-periodic motions under small perturbations of the Hamiltonian. Russ. Math. Surv. 18 (5), 9.CrossRef Google Scholar

Arnold, V.I. 1978 Mathematical methods of Classical Mechanics. Springer.CrossRef Google Scholar

Arnold, V.I. 2014 The Asymptotic Hopf Invariant and Its Applications, pp. 357–375. Springer.Google Scholar

Birkhoff, G.D. 1913 Proof of Poincaré's geometric theorem. Trans. Am. Math. Soc. 14 (1), 14–22.Google Scholar

Brouwer, L.E.J. 1910 Über eineindeutige, stetige transformationen von flächen in sich. Math. Ann. 69 (2), 176–180.CrossRef Google Scholar

Cary, J.R. & Littlejohn, R.G. 1983 Noncanonical Hamiltonian mechanics and its application of magnetic field line flow. Ann. Phys. 151(1), 1–34.CrossRef Google Scholar

Duignan, N. & Meiss, J.D. 2021 Normal forms and near-axis expansions for Beltrami magnetic fields. Phys. Plasmas 28 (12), 122501.CrossRef Google Scholar

Floquet, G. 1883 Sur les équations différentielles linéaires à coefficients périodiques. Ann. Sci. École Norm. Sup. 12, 47–88.CrossRef Google Scholar

Francis, J.G.F. 1961 The QR transformation a unitary analogue to the LR transformation. Part 1. Comput. J. 4 (3), 265–271.CrossRef Google Scholar

Francis, J.G.F. 1962 The QR transformation. Part 2. Comput. J. 4 (4), 332–345.CrossRef Google Scholar

Frenet, F. 1852 Sur les courbes à double courbure. J. Math. Pures Appl. 17, 437–447.Google Scholar

Garren, D.A. & Boozer, A.H. 1991 Existence of quasihelically symmetric stellarators. Phys. Fluids B 3 (10), 2822–2834.CrossRef Google Scholar

Greene, J.M. 1979 A method for determining a stochastic transition. J. Math. Phys. 20 (6), 1183–1201.CrossRef Google Scholar

Hanson, J.D. & Cary, J.R. 1984 Elimination of stochasticity in stellarators. Phys. Fluids 27 (4), 767.CrossRef Google Scholar

Helander, P. 2014 Theory of plasma confinement in non-axisymmetric magnetic fields. Rep. Prog. Phys. 77 (8), 087001.CrossRef Google Scholar PubMed

Hudson, S.R. 2004 Destruction of invariant surfaces and magnetic coordinates for perturbed magnetic fields. Phys. Plasmas 11 (2), 677–685.CrossRef Google Scholar

Hudson, S.R. & Dewar, R.L. 2009 Are ghost surfaces quadratic-flux-minimizing? Phys. Lett. A 373 (48), 4409–4415.CrossRef Google Scholar

Hudson, S.R. & Suzuki, Y. 2014 Chaotic coordinates for the Large Helical Device. Phys. Plasmas 21 (10), 102505.CrossRef Google Scholar

Jorge, R., Sengupta, W. & Landreman, M. 2020 a Construction of quasisymmetric stellarators using a direct coordinate approach. Nucl. Fusion 60 (7), 076021.CrossRef Google Scholar

Jorge, R., Sengupta, W. & Landreman, M. 2020 b Near-axis expansion of stellarator equilibrium at arbitrary order in the distance to the axis. J. Plasma Phys. 86 (1), 905860106.CrossRef Google Scholar

Jorge, R., Sengupta, W. & Landreman, M. 2020 c Near-axis expansion of stellarator equilibrium at arbitrary order in the distance to the axis. J. Plasma Phys. 86 (1), 905860106.CrossRef Google Scholar

Kolmogorov, A.N. 1954 On conservation of conditionally periodic motions for a small change in Hamilton's function. Dokl. Akad. Nauk SSSR 98, 527–530.Google Scholar

Kruskal, M.D. & Kulsrud, R.M. 1958 Equilibrium of a magnetically confined plasma in a toroid. Phys. Fluids 1 (4), 265–274.CrossRef Google Scholar

Kublanovskaya, V.N. 1962 On some algorithms for the solution of the complete eigenvalue problem. USSR Comput. Maths Math. Phys. 1 (3), 637–657.CrossRef Google Scholar

Landreman, M. & Sengupta, W. 2018 Direct construction of optimized stellarator shapes. Part 1. Theory in cylindrical coordinates. J. Plasma Phys. 84 (6), 905840616.CrossRef Google Scholar

Landreman, M. & Sengupta, W. 2019 Constructing stellarators with quasisymmetry to high order. J. Plasma Phys. 85 (6), 815850601.CrossRef Google Scholar

Lichtenberg, A.J. & Lieberman, M.A. 1992 Regular and Chaotic Dynamics, 2nd edn. Springer.CrossRef Google Scholar

Mackay, R.S. & Meiss, J.D. 1983 Linear stability of periodic orbits in lagrangian systems. Phys. Lett. A 98 (3), 92–94.CrossRef Google Scholar

Magnus, W. 1953 Infinite Determinants in the Theory of Mathieu's and Hill's Equations, vol. 1. Mathematics Research Group, Washington Square College of Arts and Science.Google Scholar

Meiss, J.D. 1992 Symplectic maps, variational principles, and transport. Rev. Mod. Phys. 64, 795–848.CrossRef Google Scholar

Meiss, J.D. 2007 Differential Dynamical Systems. SIAM.CrossRef Google Scholar

Mercier, C. 1964 Equilibrium and stability of a toroidal magnetohydrodynamic system in the neighbourhood of a magnetic axis. Nucl. Fusion 4 (3), 213.CrossRef Google Scholar

Mercier, C., Euratom & Commission of the European Communities 1974 Lectures in Plasma Physics: The Magnetohydrodynamic Approach to Plasma Confinement in Closed Magnetic Configurations. EUR 5127 EN/1987. Commission of the European Communities.Google Scholar

Molinari, L.G. 2008 Determinants of block tridiagonal matrices. Linear Algebra Appl. 429 (8–9), 2221–2226.CrossRef Google Scholar

Moser, J. 1962 On invariant curves of area-preserving mappings of an annulus. Nachr. Akad. Wiss. Göttingen, Math. Phys. Kl. II 1,1 Kl (1), 1.Google Scholar

Moser, J. 1973 Stable and Random Motions. Princeton Univ. Press.Google Scholar

Pfefferlé, D., Gunderson, L., Hudson, S.R. & Noakes, L. 2018 Nonplanar elasticae as optimal curves for the magnetic axis of stellarators. Phys. Plasmas 25 (9), 092508.CrossRef Google Scholar

Rodríguez, E., Sengupta, W. & Bhattacharjee, A. 2023 Constructing the space of quasisymmetric stellarators through near-axis expansion. Plasma Phys. and Control. Fusion 65 (9), 095004.CrossRef Google Scholar

Sengupta, W., Rodriguez, E., Jorge, R., Bhattacharjee, M. & Landreman, A. 2024 Stellarator equilibrium axis-expansion to all orders in distance from the axis for arbitrary plasma beta.. J. Plasma Phys. 90 (4). doi:10.1017/S002237782400076XCrossRef Google Scholar

Solov'ev, L.S. & Shafranov, V.D. 1970 Reviews of Plasma Physics, vol. 5. ed. M. A. Leontovich. Consultants Bureau.Google Scholar

Spitzer, L. Jr. 1958 The stellarator concept. Phys. Fluids 1 (4), 253–264.CrossRef Google Scholar

Todoroki, J. 2003 Calculating rotational transform following field lines. J. Plasma Fusion Res. 79 (4), 321–322.CrossRef Google Scholar

Wang, Z.X., Guo, D.R. & Xia, X.J. 1989 Special Functions. World Scientific Pub Co Inc.CrossRef Google Scholar

Article contents

Application of Lagrangian techniques for calculating the on-axis rotational transform

Abstract

Keywords

1 The need for and the origin of the rotational transform

2 Description of the magnetic field-line action

3 Verification of Mercier's formula for the Floquet exponent

3.1 The second variation as an operator

3.2 Derivation of the Floquet exponent from the second variation

3.2.1 Magnetic field expansion near an elliptic magnetic axis

3.2.2 The second variation tensor and its null eigenspace

3.2.3 Derivation of the Floquet exponent from the Hill's determinant equation

4 Discrete formalism: piecewise action

5 Conclusion

Acknowledgements

Funding

Declaration of interests

Data availability

Appendix A. Floquet theory

Appendix B. Frenet–Serret frame

Appendix C. Near-axis expansion coordinates

C.1 Mercier coordinates

C.2 Solov'ev–Shafranov coordinates

Appendix D. Derivation of the rotational transform from the action in Mercier coordinates

D.1 Near-axis expansion of the second variation tensor

D.2 Floquet exponents and rotational transform

Appendix E. Discrete formalism: the piecewise linear discretization

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests