Hostname: page-component-cd9895bd7-jn8rn Total loading time: 0 Render date: 2024-12-28T16:35:11.220Z Has data issue: false hasContentIssue false

Low-degree Hurwitz stacks in the Grothendieck ring

Published online by Cambridge University Press:  11 September 2024

Aaron Landesman
Affiliation:
Department of Mathematics, Harvard University, 1 Oxford Street, Cambridge, MA 02138, USA [email protected]
Ravi Vakil
Affiliation:
Department of Mathematics, Stanford University, 450 Jane Stanford Way, Stanford, CA 94305, USA [email protected]
Melanie Matchett Wood
Affiliation:
Department of Mathematics, Harvard University, 1 Oxford Street, Cambridge, MA 02138, USA [email protected]
Rights & Permissions [Opens in a new window]

Abstract

For $2 \leq d \leq 5$, we show that the class of the Hurwitz space of smooth degree $d$, genus $g$ covers of $\mathbb {P}^1$ stabilizes in the Grothendieck ring of stacks as $g \to \infty$, and we give a formula for the limit. We also verify this stabilization when one imposes ramification conditions on the covers, and obtain a particularly simple answer for this limit when one restricts to simply branched covers.

Type
Research Article
Copyright
© 2024 The Author(s). The publishing rights in this article are licensed to Foundation Compositio Mathematica under an exclusive licence

1. Introduction

The main results of this paper are Grothendieck ring analogs of classical theorems on the density of discriminants of number fields of degree at most $5$ (see [Reference Davenport and HeilbronnDH69, Reference BhargavaBha05, Reference BhargavaBha10]). Let $\mathrm {Hur}_{d,g,k}$ be the moduli stack of degree $d$ covers of $\mathbb {P}^1$ with Galois group $S_d$ by smooth geometrically connected genus $g$ curves over a field $k$, see Definition 5.3. Let $\mathrm {Hur}_{d,g,k}^{s}$ be the open substack of $\mathrm {Hur}_{d,g,k}$ corresponding to simply branched covers, i.e. the open subset where the map to $\mathbb {P}^1$ has geometric fibers with at least $d-1$ points. The main results of this paper are that for each $d \leq 5$, the classes of these moduli spaces converge in the Grothendieck ring as $g \rightarrow \infty$, to particularly nice limits. More precisely, we work in a suitably defined Grothendieck ring of stacks $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$, see Definition 2.6, where as usual $\mathbb {L} := \{\mathbb {A}^1\}$ is the class of the affine line.

Theorem 1.1 (Theorem A)

Suppose $2 \leq d \leq 5$ and $k$ is a field of characteristic not dividing $d!$. In $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$,

\[ \lim_{g \rightarrow \infty} \frac {\{\mathrm{Hur}_{d,g,k}^{s}\}} {\mathbb{L}^{ \dim \mathrm{Hur}_{d,g,k}}} = 1 - \mathbb{L}^{-2}. \]

Theorem 1.2 (Theorem B)

Suppose $2 \leq d \leq 5$ and $k$ is a field of characteristic not dividing $d!$. In $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$,

\[ \lim_{g \rightarrow \infty} \frac { \{\mathrm{Hur}_{d,g,k}\}} {\mathbb{L}^{ \dim \mathrm{Hur}_{d,g,k}}} = \begin{cases} 1-\mathbb{L}^{-2} & \text{if } d = 2, \\ (1+\mathbb{L}^{-1}) (1 - \mathbb{L}^{-3}) & \text{if } d = 3 ,\\ \displaystyle\frac 1 {(1-\mathbb{L}^{-1})} \prod_{x \in \mathbb{P}^1_k} (1+\mathbb{L}^{-2}- \mathbb{L}^{-3} - \mathbb{L}^{-4}) & \text{if } d= 4, \\ \displaystyle\frac 1 {(1-\mathbb{L}^{-1})} \prod_{x \in \mathbb{P}^1_k} (1+\mathbb{L}^{-2}- \mathbb{L}^{-4} - \mathbb{L}^{-5}) & \text{if } d = 5. \end{cases} \]

The products on the right in the cases $d=4$ and $d=5$ are motivic Euler products in the sense of Bilu [Reference BiluBil17, Reference Bilu and HoweBH21]. Alternatively, these can be viewed as powers in the sense of power structures, as introduced by Gusein-Zade, Luengo, and Melle-Hernàndez [Reference Gusein-Zade, Luengo and Melle-HernàndezGLM04], see § 2.10.

Theorem 1.1 is a special case of Corollary 10.6 whereas Theorem 1.2 is a special case of Corollary 10.7. Both are consequences of Theorem 10.5, describing the limits of branched covers with specified ramification, along with rates of convergence. These results lead to conjectures in higher degree, see § 1.5.

Remark 1.3 The results Theorems 1.1 and 1.2 of this paper are stated above with the restriction that the Galois group of the cover is $S_d$. These results continue to hold when one removes this restriction, except that when $d = 4$, covers with Galois group $D_4$ must be removed. One can deduce these claims from Lemma 9.6.

1.4 Motivation

Motivations for Theorems 1.1 and 1.2 come from number theory, topology, and algebraic geometry.

1.4.1 Arithmetic motivation

One can also view results relating to counting number fields of bounded discriminant as ‘point counting analogs’ of the stabilization of Hurwitz spaces. To spell this out, our main results on stabilization of the classes of Hurwitz spaces suggest the number of $\mathbb {F}_q$ points of these Hurwitz spaces also stabilize in $g$. (This is not actually implied by our results, because we work in the dimension filtration of the Grothendieck ring, and so it is possible that high codimension substacks of these Hurwitz spaces contain many $\mathbb {F}_q$ points which could potentially alter the $g \to \infty$ limiting behavior of the $\mathbb {F}_q$ point counts.) In the degree $3$ case, stabilization of the number of $\mathbb {F}_q$ points was shown by Datskovsky and Wright in [Reference Datskovsky and WrightDW88]. Their results actually count $S_3$ covers of any global field using Shintani zeta functions. However, a more geometric proof counting $S_3$ covers of any curve over a finite field has also been given by J. Gunther (‘Counting cubic curve covers over finite fields’, private communication). These results have also been generalized to work in degrees $4$ and $5$ by Bhargava, Shankar, and Wang in [Reference Bhargava, Shankar and WangBSW15]. Analogs over $\mathbb {Q}$ were known much earlier than these results over global function fields. That is, instead of counting $\mathbb {F}_q$ points of Hurwitz spaces, corresponding to $S_d$ covers of $\mathbb {P}^1_{\mathbb {F}_q}$, the arithmetic analog is to count $S_d$ extensions of $\mathbb {Q}$. When $d = 3$, these counts were carried out by Davenport and Heilbronn [Reference Davenport and HeilbronnDH69, Reference Davenport and HeilbronnDH71]. When $d =4$ and $d = 5$, the number field counting was done by Bhargava in [Reference BhargavaBha05, Reference BhargavaBha10, Reference BhargavaBha14]. Our theorems can thus be viewed as Grothendieck ring analogs of these number field counting results. Indeed, the ‘Euler products’ occurring in Theorem 1.2 with $\mathbb {L}$ replaced by $p$ are exactly those that occur in the densities of discriminants of $S_d$-number fields of degree $d\leq 5$ (see [Reference Davenport and HeilbronnDH69, Reference BhargavaBha05, Reference BhargavaBha10]), which demonstrates, in particular, the great success of the notion of motivic Euler products. Similarly to our methods, the methods behind the number field counting results only apply when $d\leq 5$ because they rely on specific parametrizations [Reference Delone and FaddeevDF64, Reference BhargavaBha04, Reference BhargavaBha08] of low-degree covers of $\operatorname {Spec} \mathbb {Z}$.

1.4.2 Topological motivation

We now describe topological results demonstrating stabilization of Hurwitz spaces. One striking result is due to Ellenberg, Venkatesh, and Westerland [Reference Ellenberg, Venkatesh and WesterlandEVW16], which has deep applications to number theory. Their result [Reference Ellenberg, Venkatesh and WesterlandEVW16, Theorem p. 732] implies that the dimension of the $i$th homology $h_i(\mathrm {Hur}_{3,g,\mathbb {C}}^{s}, \mathbb {Q})$ stabilizes as $g \to \infty$. Unfortunately, although their methods apply in the case of degree $3$ covers, they already fail to apply when $d= 4$, see the remarks in [Reference Ellenberg, Venkatesh and WesterlandEVW16, p. 732].

If, instead of working with covers of $\mathbb {P}^1$, one works with the full moduli stack of curves with marked points, $\mathcal {M}_{g,n}$, then these stacks satisfy certain homological stabilities, due to Harer, Madsen-Weiss, and others. See, for example, [Reference Madsen and WeissMW07] and the survey article [Reference HatcherHat11].

1.4.3 Algebrogeometric motivation

Finally, from an algebraic geometric viewpoint, there are some further related unirationality results on objects of low degree and genus. For degrees $d \leq 5$ a simple parametrization of degree $d$ covers was originally given in [Reference MirandaMir85, Theorem 1.1], [Reference Casnati and EkedahlCE96, Theorem 4.4], and [Reference CasnatiCas96, Theorem 3.8], see also Theorems 3.13, 3.14, and 3.16 (as well as [Reference PoonenPoo08, Proposition 5.1] and [Reference WoodWoo11, Theorem 1.1]).

There have also been results proving stabilization of algebraic data relating to $\mathrm {Hur}_{d,g,k}$. The rational Picard groups stabilize when $d \leq 5$, due to Deopurkar and Patel [Reference Deopurkar and PatelDP15, Theorem A]. Also, the rational Chow groups were fully determined for $d=3$, and the rational Chow groups were shown to stabilize for $d=4$ and $d=5$ (removing $D_4$ covers when $d=4$), in [Reference Canning and LarsonCL22, Theorem 1.1].

There have also been some related stabilization results working in the Grothendieck ring. For example, the class of smooth hypersurfaces of degree $d$ in $\mathbb {P}^n$ stabilizes as $d \to \infty$ in the Grothendieck ring. This, and various related results are shown by the second and third authors in [Reference Vakil and WoodVW15]. Building on this, Bilu and Howe prove more general stabilization results for sections of vector bundles in the Grothendieck ring [Reference Bilu and HoweBH21, Theorem A]. The use of these results will be crucial in the present paper.

1.5 Conjectures and questions motivated by Theorems A and B

The most natural question following Theorem 1.1 is whether the pattern continues for higher $d$. The continuation of analogies of this pattern have been conjectured in several different domains.

1.6 Arithmetic conjectures

In the context of counting degree $d$ number fields whose Galois closure has Galois group $S_d$, Bhargava [Reference BhargavaBha07, Conjecture 1.2] has conjectured that an analog of Theorem 1.2 holds for all $d$ (which, as mentioned above, is known for $d\leq 5$). Bhargava has given a specific conjectural expression for the Euler factors. It is natural to ask whether Theorem 1.2 holds for $d\geq 6$ using the analogous Euler factors. That is, one may ask whether Theorem 10.5 holds for $d\geq 6$ when all types of ramification are allowed. Further, the heuristics of [Reference BhargavaBha07] also predict the analog of Theorem 1.1 in the number field counting setting for all $d$ (which again is a theorem for $d\leq 5$; see [Reference BhargavaBha14, Theorem 1.1]). Bhargava's heuristics more generally apply to give a conjecture for counting $S_d$ degree $d$ fields with various ramification restrictions, and the analogy in the Grothendieck ring setting would be to conjecture that Theorem 10.5 holds for $d\geq 6$.

The heuristics above are based on a mass formula proven by Bhargava [Reference BhargavaBha07, Theorem 1.1]. We prove an analogous mass formula in the Grothendieck ring in Theorem 8.3, which we now state a consequence of. To make a precise statement, let $\mathscr X_d$ denote the stack over $k$ whose $T$ points are finite locally free degree $d$ Gorenstein covers $Z$ of $T \times _{\operatorname {\operatorname {Spec}} k} \operatorname {\operatorname {Spec}} k[\varepsilon ]/(\varepsilon ^2)$ so that for each geometric point $\operatorname {\operatorname {Spec}} \kappa \to T$, $Z \times _{T \times \operatorname {\operatorname {Spec}} k[\varepsilon ]/(\varepsilon ^2)} (\operatorname {\operatorname {Spec}} \kappa \times \operatorname {\operatorname {Spec}} k[\varepsilon ]/(\varepsilon ^2))$ has one-dimensional Zariski tangent space at each point. We write $R \vdash d$ to mean that $R$ is a partition of $d$. Given $R \vdash d$ comprised of $t_i$ copies of $r_i$ for $i=1,\ldots n$, we define $r(R) := \sum _{i=1}^n (r_i- 1)t_i$ to be its ramification order. We can then deduce the following corollary of Theorem 8.3, also see Remark 8.8, by summing over partitions of $d$ in the same way that [Reference BhargavaBha07, Proposition 2.3] was deduced from [Reference BhargavaBha07, Proposition 2.2].

Corollary 1.7 For $d \geq 1$ and $k$ a field of characteristic not dividing $d!$, in $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$,

\[ \{\mathscr X_d\} =\sum_{R \vdash d} \mathbb{L}^{-r(R)} =\sum_{j=0}^{d-1} q(d,d-j) \mathbb{L}^{-j}, \]

where $q(d,d-j)$ is the number of partitions of $d$ into at exactly $d-j$ parts.

The above heuristics can be expanded to make predictions when other finite groups replace $S_d$. These expanded heuristics are often called the Malle–Bhargava principle (see [Reference WoodWoo16]), though in complete generality the predictions are not always correct. This principle, as long as one is imposing only geometric local conditions (i.e. only local conditions on ramification) naturally extends to the Grothendieck ring setting. Then, one can ask in what generality the predictions of the principle hold. Moreover, in the field counting setting, one naturally counts extensions of global fields other than $\mathbb {Q}$ or $\mathbb {F}_q(t)$, and the analog here would be replacing $\mathbb {P}^1$ with another fixed curve, which is another interesting direction to try to understand.

In addition to the above conjectures on $S_d$ extensions, there are also many open questions about Grothendieck ring versions of other extension counting problems. One particularly accessible problem may be that of counting $D_4$ extensions. In [Reference Cohen, Diaz y Diaz and OlivierCDO02, Corollary 1.4], the number of $D_4$ extensions of $\mathbb {Q}$ was computed when counted by discriminant, though the answer does not appear to have a simple closed form, and was expressed in terms of a sum over quadratic extensions of $\mathbb {Q}$. However, in [Reference Ali Altug, Shankar, Varma and WilsonASVW21, Theorem 1] these extensions were counted by conductor, and there was a closed-form answer, expressed in terms of an Euler product.

Question 1.8 What is the asymptotic class of the locus of $D_4$ covers of $\mathbb {P}^1$ in the Grothendieck ring $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$ when counted by discriminant or conductor?

Similarly, it would be interesting to compute the class of abelian covers of $\mathbb {P}^1$.

Question 1.9 Fix an abelian group $G$. What is the asymptotic class of the locus of $G$ covers of $\mathbb {P}^1$ in the Grothendieck ring $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$ when counted by discriminant or conductor?

One way to approach this question could be to use that the moduli spaces of abelian covers of $\mathbb {P}^1$ can be described in terms of certain configuration spaces of (colored) points on $\mathbb {P}^1$. The classes of such configuration spaces can be extracted from [Reference Vakil and WoodVW15, § 5].

1.10 Error terms and second-order terms

It would be interesting to understand the error terms in Theorem 10.5. More precisely, in Theorem 10.5, we show the equalities of Theorems 1.1 and 1.2 hold not just in the limit, but even hold for any fixed $g$ up to codimension

\[ r_{d,g := \min}\biggl (\frac{g + c_d}{\kappa_d}, \frac{g + d-1}{d} - 4^{d-3}\biggr),\]

for $c_3 = 0$, $c_4=-2$, $c_5 = -23$, $\kappa _3 = 4$, $\kappa _4 = 12$, and $\kappa _5 = 40$. We say two classes of dimension $d$ are equal modulo codimension $r$ in $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$ if their difference lies in filtered part of $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$ of dimension at most $d - r$. Concretely, in degree $3$, a special case of Theorem 10.5 may be stated as follows.

Corollary 1.11 Suppose $k$ is a field of characteristic not dividing $6$. Then

\[ \frac{\{\mathrm{Hur}_{3,g,k}\}} {\mathbb{L}^{ \dim \mathrm{Hur}_{3,g,k}}} \equiv(1+\mathbb{L}^{-1})(1 - \mathbb{L}^{-3}) \]

modulo codimension $ {g}/{4}$ in $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$.

Focusing on the degree $3$ case, Roberts’ conjecture [Reference RobertsRob01] states that the number of degree $3$ field extensions of $\mathbb {Q}$ of discriminant at most $X$ is $\alpha X + \beta X^{5/6} + o(X^{5/6})$, for appropriate constants $\alpha, \beta$. This was proved in [Reference Bhargava, Shankar and TsimermanBST13] and [Reference Taniguchi and ThorneTT13] independently. Moreover, the error term was further improved to $\alpha X + \beta X^{5/6} + O(X^{2/3+\varepsilon })$ in [Reference Bhargava, Taniguchi and ThorneBTT21].

In the context of function fields, one might similarly expect $\alpha _q q^{\dim \mathrm {Hur}_{3,g,k}} + \beta _q q^{5/6 \dim \mathrm {Hur}_{3,g,k}} + o(q^{5/6 \dim \mathrm {Hur}_{3,g,k}})$ to count the number of degree 3 extensions of $\mathbb {F}_q(t)$ of genus $g$, for some constants $\alpha _q,\beta _q$. Progress towards this was made in [Reference ZhaoZha13]. In the context of the Grothendieck ring, as mentioned above, we were able to compute the class of the Hurwitz stack up to codimension

\[ r_{ 3,g := \min}\biggl(\frac{g}{4}, \frac{g + 2}{3}-1\biggr) = \min \biggl(\frac{g}{4}, \frac{g-1}{3}\biggr).\]

Hence, once $g \geq 4$, $r_{3,g} = {g}/{4}$. Since $\dim \mathrm {Hur}_{3,g,k} = 2g + 4$, we find $\frac {5}{6} \dim \mathrm {Hur}_{3,g,k} = \dim \mathrm {Hur}_{3,g,k} - ({g + 2})/{3}$, and so a weakened form of Roberts’ conjecture is the following.

Conjecture 1.12 Suppose $k$ is a field of characteristic not dividing $6$. Then

\[ \frac{\{\mathrm{Hur}_{3,g,k}\}} {\mathbb{L}^{ \dim \mathrm{Hur}_{3,g,k}}} \equiv (1+\mathbb{L}^{-1})(1 - \mathbb{L}^{-3}) \]

modulo codimension $ ({g-1})/{3}$ in $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$.

Remark 1.13 Note that $ ({g-1})/{3}$ is, in fact, the second term in the minimum defining $r_{3,g}$. There is only one step in our proof where the error term we introduce has codimension less than $ ({g-1})/{3}$, namely when we apply the sieve of [Reference Bilu and HoweBH21] in Proposition 9.10 and Lemma 9.11. Thus, if the sieving machinery could be improved, it may lead to a proof of Conjecture 1.12.

Remark 1.14 In the degree $3$ case, it would be quite interesting to actually find the second-order term, instead of just predicting the codimension of the error. The paper [Reference Bhargava, Taniguchi and ThorneBTT21] improves the error term in Davenport–Heilbronn to $O(X^{2/3+\epsilon })$, where $X$ is the discriminant of the relevant cubic extension. Since $X^{2/3} = X \cdot X^{-1/3}$, when one translates this to a codimension bound in the Hurwitz stack, it suggests one might hope to determine an asymptotic expression for $\{\mathrm {Hur}_{3,g,k}\}$ up to codimension $\frac {1}{3} \dim \mathrm {Hur}_{3,g,k}$. Such a computation would be extremely interesting to us, and we expect it would require tools far beyond those of the current paper.

In addition, it would be interesting, though likely more difficult, to determine the codimension of the error and the second-order terms in degrees $4$ and $5$.

1.15 Topological conjectures

If $\operatorname {Conf}_n$ denotes the configuration space of points on $\mathbb {P}^1$, i.e. the open subscheme of $\operatorname {Sym}^n_{\mathbb {P}^1}$ parameterizing reduced degree $n$ subschemes of $\mathbb {P}^1$, then, for $n\geq 3$, we have $ {\{\operatorname {Conf}_n\}}/{\mathbb {L}^{\dim \operatorname {Conf}_n}} = 1 - \mathbb {L}^{-2}$ in the Grothendieck ring of varieties. This follows from [Reference Vakil and WoodVW15, Lemma 5.9(a)] as we explain further toward the end of § 11.3. There is a map $\mathrm {Hur}_{d,g,k}^{s} \to \operatorname {Conf}_{2g-2 + 2d}$ sending a curve to its branch locus, see [Reference Fantechi and PandharipandeFP02]. Using this, Theorem 1.1 and the explicit formula for $\{\operatorname {Conf}_{2g-2 + 2d}\}$ implies that the source and target of this map have classes in $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$, defined in Definition 2.6, which differ only by a class of high codimension.

Corollary 1.16 For $2 \leq d \leq 5$ and $k$ a field of characteristic not dividing $d!$,

\[ \lim_{g \to \infty} \frac{\{\mathrm{Hur}_{d,g,k}^{s} \}}{\mathbb{L}^{\dim \mathrm{Hur}_{d,g,k}^{s}}} =\frac{\{\operatorname{Conf}_{2g-2 + 2d}\}}{\mathbb{L}^{\dim \mathrm{Hur}_{d,g,k}^{s}}} \]

in $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$.

It was also conjectured in [Reference Ellenberg, Venkatesh and WesterlandEVW16, Conjecture 1.5] that this map $\mathrm {Hur}_{d,g,{\mathbb {C}}}^{s} \to \operatorname {Conf}_{2g-2 + 2d}$ induces an isomorphism on $i$th homology for fixed $d$ and sufficiently large $g$. (Technically a slight variant was conjectured in [Reference Ellenberg, Venkatesh and WesterlandEVW16, Conjecture 1.5], with $\mathbb {A}^1$ base in place of $\mathbb {P}^1$.) This is, in fact, open for $d\geq 3$, though recent work of Zheng [Reference ZhengZhe24, Theorem 1.2] proves a closely related result in the $d=3$ case, by finding the stable cohomology of $\mathrm {Hur}_{3,g,{\mathbb {C}}}$. Theorem 1.1 could be seen as an additional motivation for this conjecture, especially for $d\leq 5$.

1.17 Spelling out some questions

Despite the numerous parametrizations mentioned above, the question of whether there exist simple parametrizations of covers of degree $6$, or even whether the Hurwitz stack of genus-$g$ degree $6$ covers (for large $g$) is unirational, remains wide open.

Returning to the simply branched case for simplicity, we have now seen several ways in which we could ask whether the spaces $\mathrm {Hur}_{d,g,k}^{s}$ and $\operatorname {Conf}_{2g-2 + 2d}$ are similar as $g\rightarrow \infty$. The following questions have been raised.

  1. (1) Do they have the same points counts (asymptotically) over $\mathbb {F}_q$?

  2. (2) Do they have the same cohomology, in some stable limit?

  3. (3) Do they have the same normalized limit in the Grothendieck ring?

We also include the following.

  1. (4) Are they piecewise isomorphic up to pieces of codimension going to $\infty$?

Even though it is not technically about these spaces, in this sequence of questions one should also include the following.

  1. (1′) Are the asymptotic counts of $S_d$ number fields as predicted by Bhargava in [Reference BhargavaBha07]?

For $d\geq 6$, it seems progressively harder to believe the questions (1) and ($1'$), (2), (3), and (4) could have positive answers, though for $d\leq 5$ the same parametrizations lead to positive answers to questions (1), ($1'$), (3), and (4) (and nearly to question (2) for $d=3$).

1.18 Idea of the proof

The idea of the proof of Theorems 1.1 and 1.2 is simplest to understand in the degree $3$ case, so we describe this first. Miranda [Reference MirandaMir85] gave a parametrization of degree $3$ covers of a base scheme, and we explain here how we can apply it for degree $3$ covers of $\mathbb {P}^1$. Any degree $3$ cover of $\mathbb {P}^1$ has a canonical embedding into a $\mathbb {P}^1$-bundle $\mathbb {P} \mathscr E$ over $\mathbb {P}^1$. We can write $\mathscr E \simeq \mathscr O(a) \oplus \mathscr O(b)$ where $a + b = g + 2$ and $a \leq b$. We can therefore stratify the Hurwitz space by the isomorphism type of the bundle $\mathscr E$. The degree $3$ curves lie in a particular linear series on $\mathbb {P} \mathscr E$. The idea is now to compute the locus of smooth curves in this linear system with particular ramification conditions, and then sum over all splitting types of bundles $\mathscr E$. The condition for a degree $3$ cover of $\mathbb {P}^1$ to be smooth in a fiber over $p$ can be checked over the preimage of the second-order neighborhood of $p$ in $\mathbb {P} \mathscr E$. We directly compute the classes of such curves in such an infinitesimal neighborhood. Using the notion of motivic Euler products, we can ‘multiply’ these local classes to obtain the global class of smooth curves in $\mathbb {P} \mathscr E$ in the relevant linear system, at least up to high codimension. We then sum these resulting classes over allowed splitting types of $\mathscr E$. It turns out that we must have $\mathscr E \simeq \mathscr O(a) \oplus \mathscr O(b)$ with $a \leq b, 2a \geq b$, and a general member of the relevant linear system on any such bundle gives a smooth trigonal curve. Miraculously, in the simply branched case, this motivic Euler product exactly cancels out with the sum over splitting types of $\mathbb {P} \mathscr E$, weighted by their automorphisms. This follows from a motivic Tamagawa number formula for $\mathrm {SL}_2$.

To generalize this idea to the cases of degrees $4$ and $5$ requires substantial additional work. First, it is no longer the case that curves of degrees $4$ and $5$ are elements of linear systems on a surface. Rather, there are parametrizations due to Casnati and Ekedahl [Reference Casnati and EkedahlCE96, Reference CasnatiCas96] describing covers of degree $d$ in terms of pairs of vector bundles $\mathscr E$ and $\mathscr F$, where $\mathscr E$ has rank $d - 1$ and $\mathscr F \subset \operatorname {Sym}^2 \mathscr E$ corresponds to a certain family of quadrics determined by the curve. In the $d = 4$ case, $\mathscr F$ has rank $2$, corresponding to $4$ points in $\mathbb {P}^2$ being a complete intersection of two quadratics, whereas in the case $d = 5$, $\mathscr F$ has rank $5$, corresponding to $5$ points in $\mathbb {P}^3$ being the vanishing locus of the five $4 \times 4$ Pfaffians of a certain $5 \times 5$ matrix of linear forms. As in the degree $3$ case, we can then stratify the Hurwitz stack in terms of the splitting types of $\mathscr E$ and $\mathscr F$, and compute the classes yielding curves of degree $d$ as sections of a certain vector bundle $\mathscr H(\mathscr E, \mathscr F)$ on $\mathbb {P}^1$, depending on $\mathscr E$ and $\mathscr F$. It is significantly more difficult to calculate the relevant local classes giving the smoothness conditions in fibers in degrees $4$ and $5$ than it is in degree $3$. Nevertheless, we are able to do so by reformulating the question in terms of computing classes of certain classifying stacks for positive dimensional disconnected algebraic groups, and applying a number of results of Ekedahl. The result is Theorem 8.3, which can be viewed as a motivic analog of Bhargava's mass formula [Reference BhargavaBha07] counting extensions of local fields in arbitrary degree. The specific splitting types of $\mathscr E$ and $\mathscr F$ which appear are not nearly so simple as in the degree $3$ case, but it turns out that the expressions work out modulo high codimension. For this it is important not to count $D_4$ covers, i.e. degree $4$ covers which factor through a hyperelliptic curve. As in the degree $3$ case, it turns out that, at least in the simply branched case, the sum over splitting types of $\mathscr E$ and $\mathscr F$ cancel out with the local conditions we impose, again by the Tamagawa number formula.

1.19 Outline of the paper

The structure of the remainder of the paper is as follows. In § 2, we give background on the Grothendieck ring of stacks, set up the precise variant we will work in, and recall the notion of motivic Euler products. Then, in § 3 we prove generalizations of parametrizations due to Miranda, Casnati–Ekedahl, and Casnati regarding Gorenstein covers of degree $d\leq 5$. In degrees $3$ and $4$, generalizations to arbitrary covers of an arbitrary base have been previous shown by Poonen [Reference PoonenPoo08, Proposition 5.1] and the third author [Reference WoodWoo11, Theorem 1.1], but in degree $5$ we require new arguments, and here we present a (mostly) uniform argument for degrees $3$, $4$, and $5$. In § 4 we upgrade the above-mentioned parametrizations for $d \leq 5$ to describe simple presentations of the stack of degree $d$ Gorenstein covers as a global quotient stack. Having settled the above preliminaries, we define the Hurwitz stacks we will work with in § 5 and prove they are algebraic. We then describe natural stratifications of these Hurwitz stacks that arise from the structure of the parametrizations in § 6. Using these parametrizations, we give descriptions of these strata as quotient stacks in § 7. We next begin our proof of the main theorem by computing the local conditions in the Grothendieck ring corresponding to a cover being smooth with specified ramification conditions in § 8. In § 9 we establish bounds on the codimension of the contributions to the Hurwitz stack from various strata, which will enable us to prove our main result in § 10. The proof for the case of degree $2$ is slightly different from that in degrees $3 \leq d \leq 5$, and we complete this in § 11.

1.20 Notation

Let $X_Z$ denote the fibered product $X \times _Y Z$ of schemes, when $Y$ is clear from context. Similarly define $X_R := X \times _Y \operatorname {\operatorname {Spec}} R$. For $X$ an integral variety, we use $K(X)$ to denote its function field.

Recall that for $G$ a group, the wreath product $G \wr S_n$ is the semidirect product $G^n \rtimes S_n$ where $S_n$ acts on $G^n$ by the permutation action on the $n$ copies of $G$. More generally, for $\mathscr E$ a category, let $\mathscr E \wr BS_j$ denote the corresponding wreath product of categories (see [Reference EkedahlEke09b, p. 5]) so that, in particular, $BG \wr BS_j = B(G \wr S_j)$.

For $\mathscr X$ a stack, and $G$ a group scheme acting on $\mathscr X$, we use $[\mathscr X/G]$ to denote the quotient stack. To avoid confusion with this notation, for $\mathscr X$ a stack, we use $\{\mathscr X\}$ to denote its class in the Grothendieck ring of stacks, see Definition 2.6.

We call an algebraic group $G$ over a field $k$ special if every $G$-torsor over a $k$-scheme $X$ is trivial Zariski locally on $X$.

When working in $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$, defined in Definition 2.6, we say two classes $A, B \in \widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$ of dimension $d$ are equal modulo codimension $n$ to mean $A - B$ lies in the dimension $d - n$ filtered part of $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$.

Let $D := \operatorname {\operatorname {Spec}} k[\varepsilon ]/(\varepsilon ^2)$ be the dual numbers. For $X$ a projective scheme over $Y$, let $\operatorname {Hilb}_{X/Y}^d$ denote the Hilbert scheme parameterizing degree $d$ dimension $0$ subschemes of $X$ over $Y$.

For $X \rightarrow Y$ a finite locally free map, and $Z$ an $X$-scheme, let $\operatorname {Res}_{X/Y}(Z)\rightarrow Y$ denote the Weil restriction. Recall (e.g. [Reference Bosch, Lütkebohmert and RaynaudBLR90, § 7.6]) that the Weil restriction is the functor defined on $T$ points by $\operatorname {Res}_{X/Y}(Z)(T) = Z(T\times _Y X)$. For $Z$ quasi-projective over $X$, $\operatorname {Res}_{X/Y}(Z)$ is representable [Reference Bosch, Lütkebohmert and RaynaudBLR90, § 7.6, Theorem 4].

2. Background: the Grothendieck ring of stacks and motivic Euler products

In this section, we begin by defining useful variants of the Grothendieck ring. Ultimately, we will compute the classes of Hurwitz stacks in a ring we call $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$, obtained from the usual Grothendieck ring of varieties by quotienting by universally bijective (i.e. bijective on topological spaces after any base change or, equivalently, radicial surjective) morphisms, inverting $\mathbb {L} = \{\mathbb {A}^1\}$, and then completing with respect to the dimension filtration. Following this, we recall basic definitions associated to motivic Euler products, following [Reference BiluBil17] and [Reference Bilu and HoweBH21]. We also prove these Euler products satisfy a multiplicativity property (Lemma 2.14).

2.1 Variations of the Grothendieck ring

Recall that we are working over a fixed field $k$. We begin by introducing the Grothendieck ring of algebraic spaces.

Definition 2.2 Let $K_0(\mathrm {Spaces}_k)$ denote the Grothendieck ring of algebraic spaces over $k$. This is the ring generated by classes $\{X\}$ of algebraic spaces $X$ of finite type over $k$ with relations given by $\{X\} = \{Y\}$ if there is an isomorphism $X \simeq Y$ over $k$ and $\{X\} = \{Z\} + \{X-Z\}$ for any closed sub-algebraic space $Z \subset X$. Let $X^{\rm red}$ denote the reduction of X. Applying this in the case $Z = X^{\rm red}$, we have $\{X\} = \{X^{\rm red}\}$. Multiplication is given by $\{X\} \cdot \{Y\} = \{X \times _k Y\}$.

More generally, if $S$ is a finite-type algebraic space over $k$ we can define $K_0(\operatorname {Spaces}_k/S)$ as the free abelian group generated by classes of morphisms $X \to S$ with relations $\{X/S\} = \{Z/S\} + \{X - Z/S\}$ for any closed sub-algebraic space $Z \subset X$, where the implicit maps $f|_Z: Z \to S, f|_{X-Z} : X - Z \to S$ are obtained by restricting the map $f: X \to S$. Multiplication is given by $\{X/S\} \cdot \{Y/S\} = \{X \times _S Y\}$.

We use a $k$-variety to mean a reduced, separated, finite-type $k$-scheme. We let $\operatorname {Var}_k$ denote the category of $k$-varieties. One can similarly define $K_0(\operatorname {Var}_k)$, see [Reference Bilu and HoweBH21, § 2]. Similarly, for $S$ a $k$-variety, one can analogously define $K_0(\operatorname {Var}_k/S)$, see [Reference Bilu and HoweBH21, § 2].

Proposition 2.3 Let $S$ be $k$-variety. The natural map $\phi : K_0(\operatorname {Var}_k/S) \to K_0(\operatorname {Spaces}_k/S)$, sending $\{X/S\}$ viewed as a $k$-variety to the same $\{X/S\}$ viewed as a finite type $k$-space, is an isomorphism.

Proof. We first show that for any finite-type $k$-space $X$, there is a finite collection $X_1, \ldots, X_n$ of locally closed $k$-subspaces isomorphic to schemes, forming a stratification of $X$. Here, a stratification means that a $\overline {k}$ point of $X$ factors through some $X_i$. The key input we will need is that finite-type spaces are quasi-separated, and so they contain a dense open isomorphic to a scheme [Reference OlssonOls16, Theorem 6.4.1]. This, together with the facts that $\{X/S\} = \{X^{\rm red}/S\}$ and that any finite-type $k$-scheme has a stratification by separated finite-type $k$-schemes shows that any finite type $k$-space $X$ has a stratification by locally closed subschemes.

Next, let us show $\phi$ is surjective. For this, if $\{X/S\}$ is any finite-type algebraic space, we can use the above stratification to write $\{X/S\} = \sum _{i=1}^n \{X_i/S\}$, for $X_i$ varieties over $S$, implying $\phi$ is surjective.

We conclude the proof by showing that $\phi$ is injective. In order to show this, it is enough to show that any single relation in $K_0(\operatorname {Spaces}_k/S)$ is expressible in terms of relations from $K_0(\operatorname {Var}_k/S)$. More precisely, if $\{X/S\} \in K_0(\operatorname {Spaces}_k/S)$ and $Z$ is a closed subspace $Z \subset X$, it suffices to show that the relation $\{X/S\} = \{Z/S\} + \{X - Z/S\}$ can be expressed as the image under $\phi$ of a sum of relations from $K_0(\operatorname {Var}_k)$. To see this is the case, write $\{X/S\} = \sum _{i=1}^n \{X_i/S\}$ and where $X_1, \ldots, X_n$ are $k$-varieties. Then, let $Z_i$ be the reduction of $X_i \times _X Z$. Note that $Z_i$ is a scheme from the definition of algebraic space because $X_i$ and $Z$ are both schemes. In addition, $Z_i$ is separated since it is a closed subscheme of the separated scheme $X_i$. Hence, $Z_i$ is a variety. We can also write $\{Z/S\} + \{X - Z/S\} = \sum _{i=1}^n \{Z_i/S\} +\sum _{i=1}^n \{X_i-Z_i/S\}$. Therefore, it suffices to verify that

\[ \sum_{i=1}^n \{X_i/S\} = \sum_{i=1}^n \{Z_i/S\} +\sum_{i=1}^n \{X_i-Z_i/S\} \]

is the image under $\phi$ of a sum of relations in the Grothendieck ring of varieties. Indeed, it is the sum over $i$ of the relations $\{X_i/S\} = \{Z_i/S\} + \{X_i-Z_i/S\}$.

We next introduce the Grothendieck ring of algebraic stacks.

Definition 2.4 The Grothendieck ring of algebraic stacks (over $k$) is the ring $K_0(\mathrm {Stacks}_k)$ generated by classes of algebraic stacks $\{\mathscr X\}$ of finite type over $k$ with affine diagonal, with the three relations:

  1. (1) $\{\mathscr X\} = \{\mathscr Y\}$ if there is an isomorphism $\mathscr X \simeq \mathscr Y$ over $k$;

  2. (2) $\{\mathscr X\} = \{\mathscr Z\} + \{\mathscr U\}$ for any closed substack $\mathscr Z \subset \mathscr X$ with open complement $\mathscr U \subset \mathscr X$;

  3. (3) $\{\operatorname {\operatorname {Spec}}_\mathscr X (\operatorname {Sym}^\bullet _\mathscr X \mathscr E)\} = \{\mathscr X \times _k \mathbb {A}^n\}$ for $\mathscr E$ any locally free sheaf on $\mathscr X$ of rank $n$.

Multiplication in this ring is given by $\{\mathscr X\} \cdot \{\mathscr Y\} = \{\mathscr X \times _k \mathscr Y\}$.

Note that condition $(3)$ above follows from the first two in the case of schemes, because vector bundles on schemes are Zariski locally trivial. However, vector bundles over stacks may fail to be Zariski locally trivial, as is the case for nontrivial vector bundles on $BG$.

Remark 2.5 Let $\mathbb {L} := \{\mathbb {A}^1_k\}$ denote the class of the affine line. The natural map $K_0(\mathrm {Spaces}_k) \to K_0(\mathrm {Stacks}_k)$ induces an isomorphism

\[ K_0(\mathrm{Spaces}_k)[\mathbb{L}^{-1}, (\mathbb{L}^n-1)^{-1}_{n \geq 1}] \overset \sim \longrightarrow K_0(\mathrm{Stacks}_k) \]

[Reference EkedahlEke09a, Theorem 1.2]. Here $K_0(\mathrm {Spaces}_k)[\mathbb {L}^{-1}, (\mathbb {L}^n-1)^{-1}]$ denotes the ring obtained from $K_0(\mathrm {Spaces}_k)$ by inverting $\mathbb {L}$, as well as $\mathbb {L}^n - 1$ for all positive integers $n$. This isomorphism is motivated by Definition 2.4(3) and the fact that inverting the classes of $\mathbb {L}$ and $\mathbb {L}^n -1$ for all $n$ is equivalent to inverting the classes of $\operatorname {GL}_n$ for all $n$.

In order to apply the results of [Reference Bilu and HoweBH21] to sieve out smooth covers from all covers, we will need to work in a slight modification of the Grothendieck ring of stacks where we invert universally bijective (i.e. radicial surjective) morphisms and then complete along the dimension filtration.

Definition 2.6 Let $k$ be a field and let $K_0(\mathrm {Spaces}_k)$ denote the Grothendieck ring of algebraic spaces over $k$ from Definition 2.2. From $K_0(\mathrm {Spaces}_k)$, we will construct another ring, $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$, in three steps.

  1. (1) For any universally bijective map $f: X \to Y$ of finite-type algebraic spaces over $k$, we impose the additional relation that $\{X\} = \{Y\}$. Call the result (only for the next paragraph) $K_0(\mathrm {Spaces}_k)_{\operatorname {RS}}$.

  2. (2) Define $\widetilde {K}_0(\mathrm {Spaces}_k) := K_0(\mathrm {Spaces}_k)_{\operatorname {RS}}[\mathbb {L}^{-1}]$. Like $K_0(\mathrm {Stacks}_k)$, the ring $\widetilde {K}_0(\mathrm {Spaces}_k)$ has a filtration given by dimension with the $i$th filtered part $F^i \widetilde {K}_0(\mathrm {Spaces}_k) \subset \widetilde {K}_0(\mathrm {Spaces}_k)$ denoting the subset of $\widetilde {K}_0(\mathrm {Spaces}_k)$ spanned by classes of dimension at most $-i$.

  3. (3) Finally, let

    \[\widehat{\widetilde{K_0}}(\mathrm{Spaces}_k) := \varprojlim_{i \geq 0} \widetilde{K}_0(\mathrm{Spaces}_k)/ F^i \widetilde{K}_0(\mathrm{Spaces}_k)\]
    be the completion along the dimension filtration.

Similarly, for $K_0(\mathrm {Stacks}_k)$ the Grothendieck ring of algebraic stacks over $k$ of Definition 2.4, we analogously define $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$ in the same three steps, replacing the word ‘spaces’ above by ‘stacks’.

  1. (1) We first impose the relation $\{X\} = \{Y\}$ for every universally bijective map of algebraic stacks $f: X \to Y$ of finite type with affine diagonal, and denote the result $K_0(\mathrm {Stacks}_k)_{\operatorname {RS}}$.

  2. (2) Define $\widetilde {K}_0(\mathrm {Stacks}_k) := K_0(\mathrm {Stacks}_k)_{\operatorname {RS}}[\mathbb {L}^{-1}]$. Like $K_0(\mathrm {Stacks}_k)$, the ring $\widetilde {K}_0(\mathrm {Stacks}_k)$ has a filtration given by dimension with the $i$th filtered part $F^i \widetilde {K}_0(\mathrm {Stacks}_k) \subset \widetilde {K}_0(\mathrm {Stacks}_k)$ denoting the subset of $\widetilde {K}_0(\mathrm {Stacks}_k)$ spanned by classes of dimension at most $-i$.

  3. (3) Finally, let

    \[ \widehat{\widetilde{K_0}}(\mathrm{Stacks}_k) := \varprojlim_{i \geq 0} \widetilde{K}_0(\mathrm{Stacks}_k)/ F^i \widetilde{K}_0(\mathrm{Stacks}_k) \]
    be the completion along the dimension filtration.

Remark 2.7 In characteristic $0$, identifying classes along universally bijective morphisms does not alter the Grothendieck ring. See [Reference Bilu and HoweBH21, Remarks 2.0.2 and 7.3.2] for some justification of why we are inverting universally bijective morphisms.

But we do not know if inverting universally bijective morphisms alters the Grothendieck ring of spaces or stacks in positive characteristic.

Since Hurwitz stacks are not, in general, algebraic spaces, but the results of [Reference Bilu and HoweBH21] apply to the completed Grothendieck ring of algebraic spaces $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$, it will be useful to know that one can also obtain $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$ from $K_0(\mathrm {Stacks}_k)$ by inverting universally bijective maps and completing along the dimension filtration, as we next verify.

Lemma 2.8 The natural map $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k) \to \widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$ is an isomorphism.

Proof. First note that although we constructed $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$ from $K_0(\mathrm {Spaces}_k)$ by first quotienting by universally bijective morphisms and then inverting $\mathbb {L}$, we could have equally well first inverted $\mathbb {L}$ and then inverted universally bijective morphisms. Since localization commutes with taking quotients, we obtain the same result by doing these steps in either order.

Since we can localize and take quotients in any order, using Remark 2.5, we can equivalently obtain $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$ by identifying universally bijective morphisms of spaces and then inverting $\mathbb {L}, \mathbb {L}^n - 1$ and completing along the dimension filtration. To show $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k) \to \widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$ is an isomorphism, we wish to show that beginning with $\widetilde {K}_0(\mathrm {Spaces}_k)$ and completing along the dimension filtration is equivalent to first inverting $\mathbb {L}^n -1$ for all $n \geq 1$ and then completing along the dimension filtration. Indeed, one may define a map $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k) \to \widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$ induced by the map $\widetilde {K}_0(\mathrm {Spaces}_k)[(\mathbb {L}^n-1)^{-1}_{n \geq 0}] \to \widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$ extended by sending the class of $(\mathbb {L}^n-1)^{-1} \mapsto \sum _{i \geq 1} \mathbb {L}^{-in}$. Upon completing along the dimension filtration this defines the desired isomorphism $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k) \to \widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$ inverse to the natural map $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k) \to \widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$ given above.

Remark 2.9 Due to the equivalence of Lemma 2.8, in what follows, we will work in $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$. In particular, it makes sense to speak of classes of stacks with affine diagonal in $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$ by Lemma 2.8.

2.10 Motivic Euler products

We recall the notion of motivic Euler products in the Grothendieck ring, which is crucial in our proof. See [Reference BiluBil17] for an introduction to motivic Euler products, and [Reference Bilu and HoweBH21, § 6] for more details.

We begin by introducing notation to give the definition of motivic Euler products in the setting we will need. For a finite multiset $\mu$, with underlying set $I$, we write $\mu =(m_i)_{i \in I}$, where $m_i$ is the number of copies of $i$ in $\mu$. Let $X$ be a reduced, quasi-projective scheme over a field $k$. For any finite multiset $\mu = (m_i)_{i \in I}$, there is a finite surjective map $p: \prod _{i \in I} X^{m_i} \to \prod _{i \in I} \operatorname {Sym}^{m_i} X$. Let $U$ denote the open subscheme of $\prod _{i \in I} X^{m_i}$ where no two coordinates agree and let $C^\mu (X)$ denote the open subscheme $p(U) \subset \prod _{i \in I} \operatorname {Sym}^{m_i} X$. Informally speaking, $C^\mu (X)$ parameterizes configurations of $\mu$-labeled points on $X$.

More generally, for $\mathcal {X} = (X_i)_{i \in I}$ a collection of reduced, quasi-projective schemes $X_i$ with morphisms to $X$, and $\mu = (m_i)_{i \in I}$ a multiset, define $C^\mu _{X}(\mathcal {X})$ as the preimage of $C^{\mu }(X) \subset \prod _{i \in I} \operatorname {Sym}^{m_i}X$ under the projection $\prod _{i \in I} \operatorname {Sym}^{m_i} X_i \to \prod _{i \in I} \operatorname {Sym}^{m_i}(X)$. As in [Reference Bilu and HoweBH21, Definition 6.1.8], one can extend this definition to make sense of $C_{X}^\mu (\mathcal {A})$ as an element of $K_0(\mathrm {Spaces}_k)$ where $\mathcal {A} = (a_i)_{i \in I}$ with $a_i$ in $K_0(\operatorname {Spaces}_k/X)$.

Let $\mathbb {N}$ denote the positive integers. Let $\mathcal {P}$ be the set of non-empty finite multisets of positive integers, and for such a multiset $\mu =(m_i)_{i\in \mathbb {N}}$, let $|\mu |:=\sum _i i \cdot m_i$. Following [Reference Bilu, Das and HoweBDH22, § 2.2.2], for $\mathcal {A} = (a_i)_{i \in \mathbb {N}}$ a collection of classes in $K_0(\operatorname {Spaces}_k/X)$, define the motivic Euler product

(2.1)\begin{equation} \prod_{x \in X} \bigg(1 +\sum_{i =1}^{\infty} a_{i,x} t^i\bigg) := 1+ \sum_{\mu \in \mathcal{P}} C^\mu_{X}((a_i)_{i\in\mathbb{N}}) t^{|\mu|} \in K_0(\mathrm{Spaces}_k) [\kern-1pt[ t ]\kern-1pt]. \end{equation}

Here, $a_{i,x}$ is formal notation to indicate the $a_i$ on which the definition depends. When we write a class $b_i\in K_0(\operatorname {Spaces}_k)$ in place of $a_{i,x}$, it indicates that $a_i=[Y_i \times X]-[Z_i\times X]$, where $Y_i,Z_i$ are algebraic spaces of finite type over $k$ such that $b_i=[Y_i]-[Z_i]$ and $Y_i \times X,Z_i\times X$ have the natural projection map to $X$.

Let $r\in \mathbb {N}$, and let $I$ be the set of $r$-tuples of non-negative integers, not all $0$. Note that $I$ is a semigroup under coordinate-wise addition. Let $\mathcal {P}_r$ denote the set of non-empty finite multisets of elements of $I$, and for $\mu \in \mathcal {P}_r$, let $|\mu |\in I$ denote the sum of the elements of $\mu$. More generally, for indeterminates $t_1,\ldots, t_r$ one can define, for $\mathcal {A} = (a_{\underline {i}})_{\underline {i} \in I}$, a collection of classes in $K_0(\operatorname {Spaces}_k/X)$,

(2.2)\begin{equation} \prod_{x \in X} \bigg(1 + \sum_{\underline{i} \in I} a_{\underline{i},x} \underline{t}^{\underline{i}}\bigg) := 1+ \sum_{\mu \in \mathcal{P}_r} C^\mu_{X}((a_{\underline{i}})_{\underline{i} \in I}) \underline{t}^{|\mu|}\in K_0(\mathrm{Spaces}_k) [\kern-1pt[ t_1,\ldots,t_r ]\kern-1pt], \end{equation}

where for $\underline {i}=(i_1,\ldots,i_r)\in I$, we write $\underline {t}^{\underline {i}}$ for $t_1^{i_1}\cdots t_r^{i_r}$.

Warning 2.11 The left-hand side of (2.1) is merely (evocative) notation, and has no intrinsic meaning beyond the right-hand side.

In the special cases that we will use them in, motivic Euler products are the same as the power structures of Gusein-Zade, Luengo, and Melle-Hernàndez [Reference Gusein-Zade, Luengo and Melle-HernàndezGLM04]. We now specialize to the one variable case.

In good circumstances, there is an evaluation map at $t = 1$ sending a motivic Euler product, viewed as an element of $K_0(\mathrm {Spaces}_k) [\kern-1pt[ t ]\kern-1pt]$ to an element of $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$, as in [Reference Bilu and HoweBH21, Definition 6.4.1 and Notation 6.4.2]. This makes sense whenever the motivic Euler product ‘converges at $t =1$’, meaning there are only finitely many terms $\mu$ so that $C^\mu _{X/S}(a)$ is outside any given piece of the dimension filtration.

Notation 2.12 For a motivic Euler product $\prod _{x \in X} (1 + a_x t)$ which converges at $t = 1$, we use

\[ \prod_{x \in X} (1 + a_x t)|_{t = 1} \]

to denote the evaluation at $t=1$ in $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$.

We will often also write $\prod _{x \in X} (1 + a_{x})$ to also denote the evaluation of the motivic Euler product $\prod _{x \in X} (1 + a_x t)$ at $t = 1$ in $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$, in order to shorten notation, but see Warning 2.13.

Warning 2.13 Due to the extreme care with which one must handle motivic Euler products, we acknowledge that Notation 2.12 is not very good notation. It is likely best to think of motivic Euler products as power series in $t$ which are being evaluated at values of $t$, rather then actual elements in $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$, as the manipulations one wants to make have only primarily been established in terms of the power series, and not in terms of their evaluations in $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$. We choose to use this convention so as to shorten unwieldy formulas.

In particular, one must be careful that the two expressions $\prod _{x \in X/S} (1 + \sum _{i \in I} a_{i,x} p_i( (s_j)_{j \in J} ))$ and $\prod _{x \in X/S} (1 + \sum _{i \in I} a_{i,x} t_i)|_{t_i = p_i(\underline {s})}$ do not necessarily agree. However, when these sets indexing the variables $t_i$ and $s_j$ are finite, and all $p_i((s_j)_{j \in J})$ are monomials, these two expressions do agree by [Reference Bilu and HoweBH21, Lemma 6.5.1].

An important lemma will be that these Euler products in $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$ are multiplicative. We now verify this, the key input being multiplicativity of motivic Euler products in $K_0(\mathrm {Spaces}_k) [\kern-1pt[ t_1,t_2 ]\kern-1pt]$.

Lemma 2.14 Suppose $a$ and $b$ are two classes in $K_0(\operatorname {Spaces}_k)$ such that the Euler products $\prod _{x \in X} ( 1 + a_x t)$ and $\prod _{x \in X} ( 1 + b_x t)$ converge at $t = 1$ in $K_0(\mathrm {Spaces}_k)$. Then,

(2.3)\begin{equation} \prod_{x \in X} (1 + a_x) \cdot \prod_{x\in X} ( 1 + b_x)=\prod_{x \in X} (( 1 + a_x) (1 + b_x)) \end{equation}

in $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$.

Proof. We would like to say this follows from multiplicativity of Euler products [Reference BiluBil17, Proposition 3.9.2.4], but the issue is that when we apply [Reference BiluBil17, Proposition 3.9.2.4] the left-hand side of (2.3) is equal to

(2.4)\begin{align} \bigg(\prod_{x \in X} (1 + a_x t) \cdot \prod_{x \in X} (1+ b_x t)\bigg)\bigg\rvert_{t = 1} &= \bigg(\prod_{x \in X} (1 + a_x t) \cdot (1 + b_x t)\bigg)\bigg\rvert_{t = 1} \nonumber\\ &= \bigg(\prod_{x \in X} (1 + a_x t + b_x t + a_x b_x t^2)\bigg)\bigg\rvert_{t = 1} . \end{align}

On the other hand, the right-hand side of (2.3) is, by definition,

(2.5)\begin{equation} \bigg(\prod_{x \in X} (1 + a_x t + b_x t + a_x b_x t)\bigg)\bigg\rvert_{t = 1}. \end{equation}

The lemma follows because

\[ \prod_{x \in X} (1 + a_x t + b_x t + a_x b_x s)|_{s=t=1} =\prod_{x \in X} (1 + a_x t + b_x t + a_x b_x t)|_{t=1} \]

and also

\[ \prod_{x \in X} (1 + a_x t + b_x t+ a_x b_x s)|_{s=t=1} =\prod_{x \in X} (1 + a_x t + b_x t+ a_x b_x t^2)|_{t=1} \]

by [Reference Bilu and HoweBH21, Lemma 6.5.1].

3. Parametrizations of low-degree covers

The key to computing the class of Hurwitz stacks of low-degree covers of $\mathbb {P}^1$ is the parametrization of covers of degree $d \leq 5$ of a general base scheme. In the case $d=3$, the first such parametrization was given by Miranda [Reference MirandaMir85, Theorem 1.1], for arbitrary degree 3 covers of an irreducible scheme over an algebraically closed field of characteristic not equal to 2 or 3. Pardini [Reference PardiniPar89] later generalized Miranda's result to characteristic 3, and Casnati and Ekedahl [Reference Casnati and EkedahlCE96, Theorem 3.4] generalized the result to Gorenstein degree 3 covers of an integral noetherian scheme. Poonen [Reference PoonenPoo08, Proposition 5.1] gave a complete parametrization of degree $3$ covers of an arbitrary base scheme (see also [Reference WoodWoo11, Theorem 2.1]). When $d=4$, Casnati and Ekedahl [Reference Casnati and EkedahlCE96, Theorem 4.4] gave a parametrization of Gorenstein degree $4$ covers of an integral noetherian scheme. The third author [Reference WoodWoo11, Theorem 1.1] generalized this to a parametrization of arbitrary degree $4$ covers along with the data of a cubic resolvent cover (which is unique in the Gorenstein case) over an arbitrary base scheme. When $d=5$, Casnati [Reference CasnatiCas96, Theorem 3.8] gave a parametrization of degree $5$ covers, satisfying a certainly ‘regularity’ condition (see Remark 3.7), of an integral noetherian scheme. (We also note that Wright and Yukie [Reference Wright and YukieWY92] gave these parametrizations for a covers of a field, and Delone and Faddeev [Reference Delone and FaddeevDF64] and Bhargava [Reference BhargavaBha04, Reference BhargavaBha08] gave these parametrizations for covers of $\operatorname {Spec} \mathbb {Z}$. Bhargava's parametrizations require additional resolvent data for non-Gorenstein covers. Bhargava, Shankar, and Wang [Reference Bhargava, Shankar and WangBSW15, § 3] have refined Wright and Yukie's work for covers of global fields.)

In this section, we will prove similar parametrizations, but suited for our particular application. For our purposes, we would like to parametrize only Gorenstein covers, but over an arbitrary base. For $d=3,4$, such a result could be deduced directly from [Reference PoonenPoo08, Proposition 5.1] and [Reference WoodWoo11, Theorem 1.1] by specializing to Gorenstein covers. However, for the case $d=5$ some new arguments are required both to obtain all Gorenstein covers and to generalize to an arbitrary base. For uniformity of exposition, we show how all of the parametrizations of Gorenstein covers can be obtained from the approach of Casnati and Ekedahl.

Casnati and Ekedahl [Reference Casnati and EkedahlCE96] prove a structure theorem [Reference Casnati and EkedahlCE96, Theorem 2.1] (a reformulation of [Reference Casnati and EkedahlCE96, Theorem 1.3]), which describes a minimal resolution of covers of arbitrary degree of an integral scheme. We will need to extend this structure theorem from integral schemes to arbitrary (including non-reduced) bases. Essentially the same proof given in [Reference Casnati and EkedahlCE96, Theorem 2.1] applies, suitably replacing Grauert's theorem with cohomology and base change. We thank Gianfranco Casnati for helpful conversations confirming this. We will then apply this structure theorem to obtain our desired parametrizations of covers in degrees $3$, $4$, and $5$, analogously to how it was done by Casnati and Ekedahl in [Reference Casnati and EkedahlCE96, Theorems 3.4, 4.4] and [Reference CasnatiCas96, Theorem 3.8].

We also upgrade Casnati's result in degree $5$ in an additional way to deal with all Gorenstein covers, see Remark 3.7.

3.1 The main structure theorem from Casnati and Ekedahl

We next recall the main structure theorem and give its proof in the more general setting. In essence, it says that degree $d$ Gorenstein covers are classified by linear-algebraic data. It is convenient to describe this as saying that a number of moduli stacks are isomorphic.

We first recall some terminology. We will consider degree $d$ covers which are finite locally free. A finite locally free degree $d$ cover is Gorenstein if the scheme-theoretic fiber $X_y$ over $\kappa (y)$ is Gorenstein for every $y \in Y$. For $k$ a field, a subscheme $X \subset \mathbb {P}^n_k$ is arithmetically Gorenstein if the affine cone over $X$, viewed as a subscheme of $\mathbb {A}^{n+1}_k$, is Gorenstein. For $\mathscr E$ a rank $d-1$ locally free sheaf of $\mathscr O_Y$-modules on $Y$, let $\pi : \mathbb {P} \mathscr E \rightarrow Y$ denote the corresponding projective bundle $\mathbb {P} \mathscr E := \operatorname {\operatorname {Proj}} \operatorname {Sym}^\bullet \mathscr E$. We use the term projective bundle to describe the projectivization of a vector bundle. For $\mathscr G$ a sheaf of $\mathscr O_Z$-modules on a scheme or stack $Z$, we use $\mathscr G^\vee := \mathrm {Hom}_{\mathscr O_Z}(\mathscr G, \mathscr O_Z)$ to denote its dual. Finally, for $\kappa$ a field, a subscheme of $\mathbb {P}^n_\kappa$ is nondegenerate if it is not contained in any hyperplane $H \subset \mathbb {P}^n_\kappa$.

Theorem 3.2 (Generalization of [Reference Casnati and EkedahlCE96, Theorem 2.1], see also [Reference Casnati and NotariCN07, Theorem 2.2])

Let $X$ and $Y$ be schemes and let $\rho : X \rightarrow Y$ be a finite locally free Gorenstein map of degree $d$, for $d \geq 3$. Fix a vector bundle $\mathscr E'$ of rank $d - 1$ on $Y$ with corresponding projective bundle $\pi : \mathbb {P} := \mathbb {P} \mathscr E' \to Y$, and fix an embedding $i: X \rightarrow \mathbb {P}$ such that $\rho = \pi \circ i$. We further require that $\rho ^{-1}(y) \subset \pi ^{-1}(y) \simeq \mathbb {P}^{d-2}_{\kappa (y)}$ is a nondegenerate subscheme for each point $y \in Y$. A bundle $\mathscr E'$ and map $i$ satisfying the above properties exists. Any two such triples $(\mathbb {P}, \pi, i)$ and $(\mathbb {P}_2, \pi _2, i_2)$ are uniquely isomorphic, meaning there is a unique isomorphism $\psi : \mathbb {P} \simeq \mathbb {P}_2$ such that $\pi _2 \circ \psi = \pi$ and $\psi \circ i = i_2$. Moreover, for any such triple $(\mathbb {P}, \pi, i)$ with $\rho = \pi \circ i$, the following properties hold.

  1. (i) Let $\rho ^\# : \mathscr O_Y \rightarrow \rho _* \mathscr O_X$ denote the map defining $\rho : X \rightarrow Y$ and let $\mathscr E := (\operatorname {coker} \rho ^\#)^\vee$. Then, $\mathbb {P} \simeq \mathbb {P} \mathscr E$.

  2. (ii) The composition $\phi : \rho ^* \mathscr E \rightarrow \rho ^* \rho _* \omega _{X/Y} \rightarrow \omega _{X/Y}$ is surjective, and so induces a map $j: X \to \mathbb {P} \mathscr E$, and $(\mathbb {P} \mathscr E, \sigma : \mathbb {P} \mathscr E \to Y, j)$ is a triple satisfying the properties above. The ramification divisor $R \subset X$ of $\rho$ satisfies $\mathscr O_X(R) \simeq \omega _{X/Y} \simeq j^* \mathscr O_{\mathbb {P} \mathscr E}(1)$.

  3. (iii) There is a sequence $\mathscr N_0, \mathscr N_1, \ldots, \mathscr N_{d-2}$ of finite locally free $\mathscr O_{\mathbb {P} \mathscr E'}$ sheaves on $\mathbb {P} \mathscr E'$ with $\mathscr N_0 :=\mathscr O_{\mathbb {P} \mathscr E'}$ and an exact sequence

    (3.1)
    such that the restriction of (3.1) to the fiber $(\mathbb {P} \mathscr E')_y := \pi ^{-1}(y)$ over $y$ is a minimal free resolution of the structure sheaf of $X_y := \rho ^{-1}(y)$ for every point $y \in Y$. Given $\rho, \mathscr E', i$ as above, the exact sequence (3.1) is unique up to unique isomorphism, such that the isomorphism restricts to the identity map on the final nonzero term $\mathscr O_X$, among all sequences with the above-listed properties. The locally free sheaves $\mathscr F_i := \pi _* \mathscr N_i$ on $Y$ satisfy $\pi ^* \mathscr F_i \simeq \mathscr N_i$. Further $\mathscr N_{d-2}$ is invertible, and, for $i = 1, \ldots, d-3$, one has
    (3.2)\begin{equation} \beta_i := \operatorname{rk} \mathscr N_i = \operatorname{rk} \mathscr F_i = \frac{i(d-2-i)}{d-1}\binom{d}{i+1}. \end{equation}
    Moreover, $X_y \subset \mathbb {P}_y$ is a nondegenerate arithmetically Gorenstein subscheme, $\pi ^* \pi _* \mathscr N_i \simeq \mathscr N_i$ for $0 \leq i \leq d-2$, and $\mathscr {H}\kern -.5pt om_{\mathscr O_{\mathbb {P} \mathscr E'}} (\mathscr N_\bullet, \mathscr N_{d-2}(-d)) \simeq \mathscr N_\bullet$. In addition, the formation of $\pi _* \mathscr N_\bullet$ commutes with base change on $Y$.
  4. (iv) For $\mathscr N_{d-2}$ as in (3.1), we have $\mathscr E'\simeq \mathscr E$ if and only if $\mathscr N_{d-2} \simeq \pi ^* \det \mathscr E'$.

  5. (v) The pushforward of the map $\alpha _1: \mathscr N_1(-2) \to \mathscr O_{\mathbb {P}}$ along $\pi$ induces an injection $\mathscr F_1 \to \operatorname {Sym}^2 \mathscr E$ and for $d-3 \geq i \geq 2$, the pushforward $\alpha _i: \mathscr N_i(-i-1) \to \mathscr N_{i-1}(-i)$ along $\pi$ induces an injection $\mathscr F_i \to \mathscr F_{i-1} \otimes \mathscr E$.

  6. (vi) For any point $y \in Y$, no subscheme $X_y' \subset X_y$ of degree $d-1$ is sent under $\rho$ to a hyperplane of $\pi ^{-1}(y)$.

Remark 3.3 The statement of Theorem 3.2 differs in several ways from the original statement [Reference Casnati and EkedahlCE96, Theorem 2.1].

  1. (1) As pointed out in [Reference Casnati and NotariCN07, Theorem 2.2], it is necessary to add a nondegenerate hypothesis to the statement (which was an oversight in the original result).

  2. (2) We do not require our base $Y$ to be noetherian.

  3. (3) We do not require our base $Y$ to be integral.

  4. (4) We show that given any two triples $(\mathbb {P}, \pi, i)$ there is a unique isomorphism between them, as in the sense of the statement of Theorem 3.2. In [Reference Casnati and EkedahlCE96, Theorem 2.1], it is only shown that the bundle $\mathbb {P}$ is unique.

  5. (5) In property (ii), we additionally show that $(\mathbb {P} \mathscr E, \sigma : \mathbb {P} \mathscr E \to Y, j)$ is one of the unique above-mentioned triples.

  6. (6) In property (iii), we show the formation of $\pi _* \mathscr N_\bullet$ commutes with base change.

  7. (7) In property (iii), we include the requirement that the isomorphism is unique among isomorphisms restricting to the identity on $\mathscr O_X$. This assumption was also needed in [Reference Casnati and EkedahlCE96], but not explicitly stated there.

  8. (8) We have also added property (v).

  9. (9) We have added property (vi).

Proof. As a first step, we reduce the proof to the case $X$ and $Y$ are noetherian.

3.1 Removing noetherian hypotheses

In view of the asserted uniqueness, by Zariski descent, we may reduce to the case that $Y$ is affine. Because $\rho : X \to Y$ is locally finitely presented as it is finite locally free, we will next show we can spread out all of the above data to a finite-type scheme $Y_0$. More precisely, as a first step, by [Reference GrothendieckGro66, Proposition 8.9.1], we can find some finite-type schemes $Y_0$ and $X_0$, a map $\rho _0: X_0 \to Y_0$ and a map $Y \to Y_0$ so that $\rho$ is the base change of $\rho _0$ along $Y \to Y_0$. By the various spreading out results in [Reference GrothendieckGro66, § 8] after possibly replacing $Y_0$ with another finite-type scheme, we may additionally assume $\rho _0$ is Gorenstein, $\mathscr E'$ is the pullback of a vector bundle $\mathscr E'_0$ on $Y_0$, and the triples $(\mathbb {P}, \pi, i)$ and $(\mathbb {P}_2, \pi _2, i_2)$ are base changes of corresponding triples on $Y_0$. Nearly all parts of the theorem, except the unique isomorphism of two triples $(\mathbb {P}, \pi, i)$ and $(\mathbb {P}_2, \pi _2, i_2)$ and the unique isomorphism in property (iii), follow from the corresponding statement over $Y_0$. However, if these isomorphisms are not unique, there will be some noetherian scheme to which two different such isomorphisms descend and, hence, this claim can be verified after replacing $Y_0$ with another noetherian scheme. In particular, it suffices to prove the theorem in the case $Y$ and $X$ are noetherian, and even finite type over $\operatorname {\operatorname {Spec}} \mathbb {Z}$. This removes the noetherian hypothesis, addressing Remark 3.3(2).

For the remainder of the proof, we assume $X$ and $Y$ are noetherian. The proof given in [Reference Casnati and EkedahlCE96, Theorem 2.1] is broken up into steps A, B, C, and D. Step A has a minor inaccuracy which we next address. The only generalization needed occurs in step B, while steps C and D go through without change.

Addressing step A

We next explain the proof of step A, though we make the additional assumption that the field $k$ is infinite. Before explaining this proof, we remark on an error in the proof of step A from [Reference Casnati and EkedahlCE96, Step A, p. 443] when $k$ is finite.

Remark 3.4 Let $A$ be a finite $k$-algebra $A$ with maximal ideals $\mathfrak m_1, \ldots, \mathfrak m_p$. Let $\eta : A \to k$, be a generalized trace map, i.e. a surjection of $k$-vector spaces such that the only ideal contained in the kernel is the $0$ ideal. It is then claimed that there exists $a \in \ker \eta - \bigcup _{i=1}^p \mathfrak m_i$.

This is not always true over finite fields, such as when $k= \operatorname {\operatorname {Spec}} \mathbb {F}_2$ and $A = \mathbb {F}_2^5$ and $\eta : A \to k$ is the map given by summing the five coordinates. Indeed, the oversight in [Reference Casnati and EkedahlCE96, Step A, p. 443] is that while over infinite fields, $\ker \eta \subset \bigcup _{i=1}^p \mathfrak m_i$ implies $\ker \eta \subset \mathfrak m_i$ for a single $i$, this does not always hold over finite fields. It is straightforward to check that this claim holds over an infinite field. Since we cannot have $\ker \eta \subset \mathfrak m_i$ by the definition of a generalized trace map, over infinite fields we conclude that $\ker \eta \subsetneq \bigcup _{i=1}^p \mathfrak m_i$.

Having explained the error when $k$ is finite, we now conclude our commentary on the proof of step A. As mentioned above, the proof still works correctly in the case $k$ is infinite. We also note that in the statement of [Reference SchreyerSch86, Lemma, p. 119] which is cited in [Reference Casnati and EkedahlCE96, Step A, p. 443], the subscheme $D$ there should have degree $d$ and lie in $\mathbb {P}^{d-2}$, as opposed to degree $d-2$ in $\mathbb {P}^{d-1}$. Note that in order to apply [Reference SchreyerSch86, Lemma, p. 119], it is necessary to use the hypothesis that $X \subset \mathbb {P}\mathscr E$ is nondegenerate, a hypothesis which was omitted in [Reference Casnati and EkedahlCE96, Theorem 2.1], addressing Remark 3.3(1). At this point, property (vi) follows from [Reference SchreyerSch86, Lemma, p. 119], addressing Remark 3.3(8).

Addressing step B

Having established the result when $Y = \operatorname {\operatorname {Spec}} \overline {k}$, it remains to carry out the proof for general bases following [Reference Casnati and EkedahlCE96, Steps B, C, and D, p. 445–447]. In what follows, we next recapitulate the argument for step B [Reference Casnati and EkedahlCE96, p. 445], modifying the application of Grauert's theorem to one of cohomology and base change, which allows us to remove the integrality hypothesis on $Y$, as in Remark 3.3(3).

Recall the statement of step B.

  1. Step B: Suppose there is a factorization $\rho = \pi \circ i$, for $\pi : \mathbb {P} \rightarrow Y$ a projective $\mathbb {P}^{d-2}$ bundle and $i: X \rightarrow \mathbb {P}$ an embedding with $X_y$ a nondegenerate arithmetically Gorenstein subscheme of $\mathbb {P}_y$ for each $y \in Y$. Then, (3.1) exists, is unique up to unique isomorphisms, restricts to a minimal free resolution of $\mathscr O_{X_y}$ over each point $y \in Y$, and $\pi ^* \pi _* \mathscr N_\bullet \simeq \mathscr N_\bullet$.

Note that when it is written the resolution is unique up to unique isomorphism in step B, the statement implicitly means such an isomorphism is unique up to those restricting to the identity on $\mathscr O_X$, as if such a specification were not given, we could compose with multiplication by a unit. This is the reason for the modification from Remark 3.3(7).

We next observe that it suffices to prove a version of step B where we replace $Y$ with a geometric point over $y$. To be more precise, in order to verify step B, it suffices to verify step B$'$, given as follows.

  1. Step B′: Suppose there is a factorization $\rho = \pi \circ i$, for $\pi : \mathbb {P} \rightarrow Y$ a projective $\mathbb {P}^{d-2}$ bundle and $i: X \rightarrow \mathbb {P}$ an embedding with $X_y$ a nondegenerate arithmetically Gorenstein subscheme of $\mathbb {P}_y$ for each geometric point $y$ of $Y$. Then, (3.1) exists, is unique up to unique isomorphisms, restricts to a minimal free resolution of $\mathscr O_{X_y}$ over each geometric point $y$ of $Y$ and $\pi ^* \pi _* \mathscr N_\bullet \simeq \mathscr N_\bullet$.

We now explain why step B$'$ implies step B. Indeed, the conditions of $X_y$ being a nondegenerate arithmetically Gorenstein subscheme and for a resolution of $\mathscr O_{X_y}$ being a minimal free resolution may be verified after replacing $y$ with a geometric point $\overline {y}$ mapping to $y$. Therefore, step B$'$ implies step B.

We next verify step B$'$. In what follows, we therefore use $y$ to denote a geometric point of $Y$, as opposed to a point of the underlying topological space whose with scheme structure given as the spectrum of the residue field at that point.

For the remainder of the verification of step B$'$, we only handle the case $d \geq 4$. The case $d = 3$ is quite analogous to the case $d \geq 4$, though significantly easier as the resolution has length $2$.

Define maps $j_y, i_y$ as in the following diagram.

(3.3)

Letting $\mathscr I$ denote the ideal sheaf of $X$ in $\mathbb {P}$, we claim that $j_y^*\mathscr I$ is the ideal sheaf of $X_y$ in $\mathbb {P}_y$. To see this, we only need to verify that $j_y^* \mathscr I \rightarrow j_y^* \mathscr O_{\mathbb {P}} \rightarrow j_y^* \mathscr O_X$ is exact. Since $\mathscr O_X$ is flat over $Y$, we will verify more generally that for $\mathscr H, \mathscr G, \mathscr F$ three sheaves on $X$ with $\mathscr F$ flat over $Y$, and an exact sequence $0 \rightarrow \mathscr H \rightarrow \mathscr G \rightarrow \mathscr F \rightarrow 0$, the pullback sequence $0 \rightarrow j_y^* \mathscr H \rightarrow j_y^* \mathscr G \rightarrow j_y^* \mathscr F \rightarrow 0$ is exact. Indeed, this holds because $R^1 j_y^* \mathscr F = \mathscr {T}\kern -.5pt or_1^{\mathscr O_\mathbb {P}}(\mathscr F, \mathscr O_{\mathbb {P}_y}) = \mathscr {T}\kern -.5pt or_1^{\mathscr O_Y}(\mathscr F, \kappa (y)) = 0$. Here we are using that $\mathscr F$ is flat over $Y$ for the final vanishing and $\mathscr F \otimes _{\mathscr O_\mathbb {P}} \mathscr O_{\mathbb {P}_y} \simeq \mathscr F \otimes _{\mathscr O_Y} \kappa (y)$ for the equality of $\mathscr {T}\kern -.5pt or$ sheaves.

Next, [Reference Casnati and EkedahlCE96, Step A, p. 443] provides a resolution of $\mathscr I_{X_y/ \mathbb {P}_y} = j_y^*\mathscr I$ of the following form.

(3.4)

Note here that we have only verified step A in the case $y$ is the spectrum of an algebraically closed field, but at this point we are assuming that $y$ is a geometric point, as we are verifying step B$'$.

We claim $j_y^* \mathscr I$ is $3$-regular, in the sense of Castelnuovo–Mumford regularity, i.e. $H^i( \mathbb {P}_y, j_y^* \mathscr I(3-i)) = 0$ for $i \geq 1$. To verify this, it follows from the definition of regularity that for an exact sequence $0 \rightarrow \mathscr F' \rightarrow \mathscr F \rightarrow \mathscr F'' \rightarrow 0$ of sheaves with $\mathscr F'$ $m+1$-regular and $\mathscr F$ $m$-regular, $\mathscr F''$ is also $m$ regular. Using this and the fact that $\mathscr O_{\mathbb {P}_y}(-k)$ is $k$-regular (and, hence, it is also $k+1$ regular by [Reference Fantechi, Göttsche, Illusie, Kleiman, Nitsure and VistoliFGI+05, Lemma 5.1(b)]), it follows by induction that $\operatorname {im} \alpha _{d-i,y}$ is $d-i+2$ regular. Therefore, $j^*_y \mathscr I= \operatorname {im} \alpha _{1,y}$ is $3$-regular. By [Reference Fantechi, Göttsche, Illusie, Kleiman, Nitsure and VistoliFGI+05, Lemma 5.1(b)], we obtain $H^1(\mathbb {P}_y, j_y^* \mathscr I(n)) = 0$ for $n \geq 2$. Hence, by cohomology and base change, $R^1 \pi _* \mathscr I(n) = 0$ for $n \geq 2$. Note that, often, cohomology and base change is only stated in the case $y$ is a point (as opposed to a geometric point) but the case that $y$ is a point follows from the case that $y$ is a geometric point since the vanishing of cohomology groups can be verified after base change to an algebraic closure, using flat base change.

For our next step, we verify that $\pi _* \mathscr I(n)$ commutes with base change on $Y$ for $n \geq 2$. For $\mathscr F$ a sheaf, let us denote by $\phi ^i_y(\mathscr F) : R^i \pi _* \mathscr F \otimes \kappa (y) \rightarrow H^i(X_y, \mathscr F|_{X_y})$ the natural base change map. Then we have seen above that, for $n \geq 2$, $\phi ^1_y(\mathscr I(n))$ is an isomorphism at all $y$. Further, $R^1 \pi _* \mathscr I(n)$ is locally free (and, in fact, equal to $0$) which implies by cohomology and base change that $\phi ^0_y(\mathscr I(n))$ is an isomorphism for all $n \geq 2$. In other words, the formation of $\pi _* \mathscr I(n)$ then commutes with base change on $Y$. Further, again by cohomology and base change, $\pi _* \mathscr I(n)$ is a locally free sheaf when $n \geq 2$ (since the condition from the theorem on cohomology and base change that $\phi ^{-1}_y$ be an isomorphism is vacuously satisfied).

Set $\mathscr F_1 := \pi _* \mathscr I(2)$ and $\mathscr N_1 := \pi ^* \mathscr F_1$. Let $\alpha _1 : \mathscr N_1(-2) \rightarrow \mathscr I$ denote the evaluation map coming from the adjunction $\pi ^* \pi _* \mathscr I(2) \otimes \mathscr O_\mathbb {P}(-2) \rightarrow \mathscr I(2) \otimes \mathscr O_{\mathbb {P}}(-2) \rightarrow \mathscr I$. As we have shown above, the formation of $\mathscr F_1$, and hence $\mathscr N_1$, commutes with base change. Further, naturality of the map $\alpha _1$, coming from the adjunction, also implies $j_y^*(\alpha _1) = \alpha _{1,y}$. Therefore, $\alpha _1$ is surjective, as its cokernel has empty support.

We next construct sheaves $\mathscr F_i$ and $\mathscr N_i$ inductively, with $\mathscr N_i = \pi ^* \mathscr F_i$, for $2 \leq i \leq d-3$. Let $\mathscr A_1 := \mathscr I$. For $i \geq 2$, assume inductively we have constructed the map $\alpha _{i-1}$ and define $\mathscr A_i := \ker \alpha _{i-1}$. Analogously to the above verification that $j_y^* \mathscr I$ is $3$-regular, it follows that $j_y^* \mathscr A_i$ is $i+2$ regular. Therefore, by [Reference Fantechi, Göttsche, Illusie, Kleiman, Nitsure and VistoliFGI+05, Lemma 5.1(b)], $H^1(\mathbb {P}_y, j_y^* \mathscr A_i(k)) = 0$ for $k \geq (i + 2) - 1 = i+1$. Analogously to the above case when $i = 1$, it follows from cohomology and base change that $R^1 \pi _* \mathscr A_i(k) = 0$ for $k \geq i+1$, $\pi _* \mathscr A_i(k)$ is locally free for $k \geq i+1$, and the formation of $\pi _* \mathscr A_i(k)$ commutes with base change for $k \geq i+1$. Then, set $\mathscr F_i := \pi _* \mathscr A_i(i+1)$ and $\mathscr N_i := \pi ^* \mathscr F_i$.

We next construct the map $\alpha _i: \mathscr N_i \rightarrow \mathscr N_{i-1}$. Begin with the inclusion $\mathscr A_i(i+1) \rightarrow \mathscr N_{i-1}(1)$ (obtained by twisting the inclusion $\mathscr A_i \rightarrow \mathscr N_{i-1}(-i)$, coming from the definition of $\mathscr A_i$, by $i+1$). Apply $\pi ^* \pi _*$ to obtain a map $\pi ^* \pi _* \mathscr A_i(i+1) \rightarrow \pi _* \pi ^* \mathscr N_{i-1}(1)$. Twist by $-i-1$ which yields the composite map

(3.5)\begin{align} \mathscr N_i(-i-1) &= (\pi^* \pi_* \mathscr A_i(i+1))(-i-1) \nonumber\\ &\rightarrow (\pi^* \pi_* \mathscr N_{i-1}(1))(-i-1) \nonumber\\ &\simeq (\mathscr N_{i-1} \otimes \pi^* \pi_* \mathscr O(1))(-i-1) \nonumber\\ &\rightarrow \mathscr N_{i-1} (-i), \end{align}

which we call $\alpha _i$. Since $\mathscr N_i$ commutes with base change, and this map is obtained from adjunction, the formation of $\alpha _i$ also commutes with base change. Also, since pushforward is left exact, we obtain condition (v) in the theorem from the above construction of $\mathscr F_i$, provided we show the above construction is the unique such one as in the statement (which will be done later in the proof). This addresses Remark 3.3(9).

Finally, we similarly construct $\mathscr F_{d-2}$, $\mathscr N_{d-2}$, and $\alpha _{d-2}$, assuming we have constructed $\alpha _{d-3}$. Let $\mathscr A_{d-2} := \ker \alpha _{d-3}$. By cohomology and base change, we find $j_y^* \mathscr A_{d-2}$ is, in fact, $d$-regular (as opposed to only $d-1$ regular, as was the case for $\mathscr A_i$ with $i < d-2$). Therefore, by cohomology and base change, we find $R^1 \pi _* \mathscr A_{d-2}(-d) = 0$ and also that $\pi _* \mathscr A_{d-2}(-d)$ is locally free and commutes with base change. We set $\mathscr F_{d-2} := \pi _* \mathscr A_{d-2}(-d)$ and $\mathscr N_{d-2} := \pi ^* \mathscr F_{d-2}$. Analogously to (3.5), there is a canonical map $\alpha _{d-2} :\mathscr N_{d-2}(-d) \rightarrow \mathscr N_{d-3}(-d+2)$ coming from adjunction which commutes with base change. Altogether, we have constructed a complex as in (3.1) which commutes with base change on $Y$ and restricts to the minimal free resolution (3.4) on each fiber $y \in Y$. It follows from Nakayama's lemma that the complex (3.1) is exact, because it is exact when restricted to each fiber over $y \in Y$.

Further, because $\mathscr N_i = \pi ^* \mathscr F_i$, it follows from the projection formula that $\pi _* \mathscr N_i \simeq \pi _*(\mathscr O_\mathbb {P} \otimes \pi ^* \mathscr F_i) \simeq \pi _* \mathscr O_\mathbb {P} \otimes \mathscr F_i \simeq \mathscr F_i$, and so $\pi ^* \pi _* \mathscr N_i \simeq \mathscr N_i$.

We next verify uniqueness of our constructed resolution $\mathscr N_\bullet$, up to unique isomorphism, in the sense claimed in property (iii). Suppose $\mathscr M_\bullet$ is another such resolution which restricts to a minimal free resolution over each geometric fiber over $y \in Y$. Over any local scheme ${\operatorname {\operatorname {Spec}} \mathscr O_{y, Y} \subset Y}$, there is an isomorphism $\phi _U: \mathscr N_\bullet |_{\operatorname {\operatorname {Spec}} \mathscr O_{y,Y}} \simeq \mathscr M_\bullet |_{\operatorname {\operatorname {Spec}} \mathscr O_{y,Y}}$ by a sheafified version of [Reference EisenbudEis95, Theorem 20.2]. Such an isomorphism spreads out to an isomorphism over some affine open $U \subset Y$. Further, this isomorphism is unique up to homotopy by a sheafified version of [Reference EisenbudEis95, Lemma 20.3]. We claim there are no nonzero homotopies $s: \mathscr N_\bullet |_U \rightarrow \mathscr M_\bullet |_U$. Indeed, such an homotopy would yield a map $s_i: \mathscr N_i|_U \rightarrow \mathscr M_{i+1}|_U$. We wish to show this map is $0$. To check it is $0$, it suffices to show it is $0$ over each $y \in Y$. Over a point $y \in Y$, this corresponds to a map $\mathscr O_{\mathbb {P}_y}(a)^{\oplus b} \rightarrow \mathscr O_{\mathbb {P}_y}(c)^{\oplus d}$ with $c < a$. It follows that there are no nonzero such maps, so the isomorphism $\phi _U$ is unique. Hence, by this uniqueness, we obtain via Zariski descent an isomorphism $\phi : \mathscr N_\bullet \simeq \mathscr M_\bullet$. This isomorphism is unique because it is unique when restricted to each member of an open cover.

Addressing steps C and D

We have completed the verification of [Reference Casnati and EkedahlCE96, Theorem 2.1, Step B] and now note that steps C and D given in the proof of [Reference Casnati and EkedahlCE96, Theorem 2.1] go through without change. Recall that step D states that the factorization $\rho = \pi \circ i$ exists. However, the proof shows more: it shows that the triple $(\mathbb {P} \mathscr E, \sigma, j)$ gives such a triple, where $\sigma : \mathbb {P} \mathscr E \to Y$ is the structure map. This concludes the verification of part (ii), as mentioned in Remark 3.3(5).

Addressing uniqueness of the triples

At this point, we have proved everything except the uniqueness of the triple $(\mathbb {P}, \pi, i)$. We conclude the proof by verifying this statement, which will complete the verification of the modification noted in Remark 3.3(4). We have shown so far in part (i) that if $(\mathbb {P}_1, \pi _1, i_1)$ and $(\mathbb {P}_2, \pi _2, i_2)$ are two triples as in the statement of Theorem 3.2, then there is an isomorphism $\mu : \mathbb {P}_1 \simeq \mathbb {P}_2$. Since $\mu$ is an isomorphism of projective bundles over $Y$, we have $\pi _1 \circ \mu \simeq \pi _2$. Using this and property (ii), we can reduce to the case that $\mathbb {P}_1 \simeq \mathbb {P}_2 \simeq \mathbb {P} \mathscr E$: it suffices to find an automorphism $\psi : \mathbb {P} \to \mathbb {P}$ over $Y$ so that $\psi \circ i = i_2$ and, moreover, show this automorphism $\psi$ is the unique one with this property.

To verify existence and uniqueness of $\psi$, we first reduce to the case $Y$ is the spectrum of a local ring. We know that both $i_1^* \mathscr O_{\mathbb {P} (\mathscr E)}(1) \simeq \omega _{X/Y}$ and $i_2^* \mathscr O_{\mathbb {P} (\mathscr E)}(1) \simeq \omega _{X/Y}$, by Theorem 3.2(ii). Hence, we obtain that the automorphism $\psi$ is induced by some automorphism $\phi$ of $\pi _*\omega _{X/Y}$, determined up to unit. The maps $i_1$ and $i_2$ induce two surjections $q_1, q_2: \pi _* \omega _{X/Y} \to \mathscr O_Y$ with the maps $i_1$ and $i_2$ coming via the linear subsystems $\ker (q_1)$ and $\ker (q_2)$. To show we have an induced map between $\ker (q_1)$ and $\ker (q_2)$, which are both abstractly isomorphic to $\mathscr E$, it is enough to show that, up to unit, $q_1 = q_2 \circ \phi$. We may verify this locally and, hence, assume $Y$ is the spectrum of a local ring.

We conclude by verifying existence and uniqueness of $\psi$ in the case $Y$ is the spectrum of a local ring. Using Theorem 3.2(vi), in both of the maps $i_1$ and $i_2$, there is no subscheme of degree $d - 1$ on the closed fiber contained in a hyperplane, and hence the same holds over the whole local scheme $Y$. We may rephrase this as the condition that the two relative hyperplane sections of $\mathbb {P} \mathscr E$ associated to $q_1$ and $q_2$ do not meet $i_1(X)$ and $i_2(X)$. Equivalently, the two hyperplane sections associated to $q_1$ and $q_2$ are nowhere vanishing on $X$ and, therefore, related by a unit. By modifying $\phi$ by this unit, we may assume $q_1 = q_2 \circ \phi$. This verifies that $\phi$ is unique up to unit and, hence, that $\psi$ is unique. Under the above identifications, the image of $\mathscr E \to \pi _* \omega _{X/Y}$ is identified with the kernel of the natural map $\pi _* \omega _{X/Y} \to \mathscr O_X$ dual to $\rho ^\#$. Since this map is also fixed by the resulting automorphism $\phi$, the automorphism $\phi$ of $\pi _* \omega _{X/Y}$ restricts to an automorphism of $\mathscr E$ which induces the desired automorphism $\psi : \mathbb {P} \mathscr E \to \mathbb {P} \mathscr E$.

The following useful corollary tells us that any two ‘canonical embeddings’ of a Gorenstein cover are related by an automorphism of $\mathbb {P} \mathscr E$ coming from $\mathscr E$. A special case of this was stated in [Reference Casnati and NotariCN07, Corollary 2.3], though the proof there seems quite terse, as it omits the verification of uniqueness of the triple $(\mathbb {P} \mathscr E', \pi, i)$ which we carry out in Theorem 3.2.

Corollary 3.5 With notation as in Theorem 3.2, suppose we are given $\rho : X \to Y$ and two embeddings $i_1: X \to \mathbb {P} \mathscr E$ and $i_2: X \to \mathbb {P} \mathscr E$ so that $\rho = \pi \circ i_1 = \pi \circ i_2$ and $\rho ^{-1}(y)$ is arithmetically Gorenstein and nondegenerate under both embeddings $i_1$ and $i_2$. Then, the unique isomorphism $\psi : \mathbb {P} \mathscr E \to \mathbb {P} \mathscr E$ taking $i_1(X)$ to $i_2(X)$ is induced by an automorphism of $\mathscr E$.

Proof. This is a direct consequence of the uniqueness property for triples $(\mathbb {P}, \pi, i)$ as stated in Theorem 3.2, applied to two triples $(\mathbb {P} \mathscr E, \pi, i_1)$ and $(\mathbb {P} \mathscr E, \pi, i_2)$.

3.6 Low-degree parametrizations

We now apply Theorem 3.2, as in the work of Casnati and Ekedahl, to obtain parametrizations of Gorenstein covers of degrees $3$, $4$, and $5$.

Remark 3.7 Our parametrization in degree $5$, Theorem 3.16, is stronger than previous work in several ways. The similar result in degree $5$ proven in [Reference CasnatiCas96, Theorem 3.8] has certain additional restrictions on the covers and sections that Casnati refers to as being ‘regular’. This regularity condition amounts to the assumption that the map $\wedge ^2 \mathscr F^\vee \otimes \det \mathscr E \to \mathscr E$ associated to a section $\eta \in \mathscr H(\mathscr E, \mathscr F)$ is surjective. In addition, [Reference CasnatiCas96, Theorem 3.8] does not claim there is a bijection between covers and sections up to automorphisms of $\mathscr E$ and $\mathscr F$, but only gives constructions of maps in both directions. Further, [Reference CasnatiCas96, Theorem 3.8] is stated for degree $5$ finite flat surjective maps $X \rightarrow Y$ with $Y$ integral and noetherian, whereas ours hold for arbitrary schemes $Y$.

To introduce notation simultaneously in the cases of degrees $3$, $4$, and $5$, we use the following notation.

Notation 3.8 Let $d \in \{ 3, 4, 5\}$. Let $Y$ be a scheme. Fix a locally free sheaf $\mathscr E$ on $Y$ of rank $d-1$. If $d = 4$, let $\mathscr F$ be a locally free sheaf on $Y$ of rank $2$ and if $d = 5$, let $\mathscr F$ be a locally free sheaf on $Y$ of rank $5$. We use the tuple $(\mathscr E, \mathscr F_\bullet )$ to denote the pair $(\mathscr E,\mathscr F)$ when $d = 4$ or $d = 5$ and to denote $\mathscr E$ when $d = 3$. Define the associated sheaf

(3.6)\begin{equation} \mathscr H(\mathscr E, \mathscr F_\bullet) := \begin{cases} \operatorname{Sym}^3 \mathscr E \otimes \det \mathscr E^\vee & \text{if } d=3 ,\\ \mathscr F^\vee \otimes \operatorname{Sym}^2 \mathscr E & \text{if } d=4 ,\\ \wedge^2 \mathscr F \otimes \mathscr E \otimes \det \mathscr E^\vee & \text{if } d=5. \\ \end{cases} \end{equation}

We will often use $\mathscr H$ to denote $\mathscr H(\mathscr E, \mathscr F_\bullet )$ when the data $(\mathscr E,\mathscr F_\bullet )$ is clear from context. We will see that sections of the above sheaf $\mathscr H$ define subschemes of $\mathbb {P} \mathscr E$. When these subschemes have dimension $0$ in fibers, we will see they induce degree $d$ locally free covers. The parametrizations for degrees $3$, $4$, and $5$ essentially say that the resulting covers are in bijection with such sections, up to automorphisms of $(\mathscr E, \mathscr F_\bullet )$.

3.9 The resolutions in low degree

In order to state the parametrizations in degrees $3$, $4$, and $5$, we now want a way of associating a subscheme of $\mathbb {P} \mathscr E$ to a section. We will give a description of this association separately in the cases that $d = 3$, $4$, and $5$.

Renaming the sheaf $\mathscr E'$ appearing in (3.1) as $\mathscr E$ and renaming $\mathscr F_1$ as $\mathscr F$, in the cases $d =3$, $4$, and $5$, (3.1) becomes respectively

(3.7)
(3.8)
(3.9)

with the rank of the locally free sheaves $\mathscr E$ and $\mathscr F$ in the degree $3$, degree $4$, and degree $5$ cases given in Notation 

3.8

.

3.10 The maps $\Phi _d$ in low degree

In the above three cases, corresponding to degrees $3$, $4$, and $5$, respectively, we have isomorphisms

(3.10)\begin{gather} \Phi_3: H^0(Y, \operatorname{Sym}^3 \mathscr E \otimes \det \mathscr E^{\vee}) \overset \sim \longrightarrow H^0(\mathbb{P}\mathscr E, \pi^* \det \mathscr E^{\vee}(3)), \end{gather}
(3.11)\begin{gather} \Phi_4: H^0(Y, \operatorname{Sym}^2 \mathscr E \otimes \mathscr F^\vee) \overset\sim \longrightarrow H^0(\mathbb{P}\mathscr E, \pi^* \mathscr F^\vee(2)), \end{gather}
(3.12)\begin{gather} \Phi_5: H^0(Y, \wedge^2 \mathscr F \otimes \mathscr E \otimes \det \mathscr E^{\vee}) \overset \sim \longrightarrow H^0(\mathbb{P}\mathscr E, \wedge^2 \pi^* \mathscr F \otimes \pi^* \det \mathscr E^{\vee}(1)). \end{gather}

3.11 The maps $\Psi _d$ in low degree

For $\rho : X \to Y$ a finite locally free surjective Gorenstein map of degree $d$, we will use $\mathscr E^X$ to denote the Tschirnhausen bundle $\operatorname {coker}(\mathscr O_Y \to \rho _* \mathscr O_X\!)^\vee$ and $\mathscr F^X$ to denote the bundle $\mathscr F_1$ in the case we take $\mathscr E'$ in Theorem 3.2(iii) to be the Tschirnhausen bundle $\mathscr E^X$.

Next, for $3 \leq d \leq 5$, given a section $\eta \in H^0(Y, \mathscr H(\mathscr E,\mathscr F_\bullet ))$, we define an associated scheme $\Psi _d(\eta )$ over $Y$.

When $d = 3$, we begin with a section $\eta \in H^0(Y, \operatorname {Sym}^3 \mathscr E \otimes \det \mathscr E^{\vee })$, which, via $\Phi _3$, can be viewed as an element of $H^0(\mathbb {P}\mathscr E, \pi ^* \det \mathscr E^{\vee }(3))$. Such a section corresponds to a map $\mathscr O_{\mathbb {P}\mathscr E} \rightarrow \pi ^* \det \mathscr E^{\vee }(3)$ or, equivalently, a map $\pi ^* \det \mathscr E(-3) \rightarrow \mathscr O_{\mathbb {P}\mathscr E}$. We let $\Psi _3(\eta )$ denote the support of the cokernel of this map. That is, we define $\Psi _3(\eta ) \subset \mathbb {P}\mathscr E$ so that on $\mathbb {P} \mathscr E$ we have the following exact sequence.

(3.13)

When $d =4$, given $\eta \in H^0(Y, \mathscr F^\vee \otimes \operatorname {Sym}^2 \mathscr E)$, define $\Psi _4(\eta )$ to be the subscheme of $\mathbb {P}\mathscr E$, considered as the support of the cokernel of the map $\pi ^* \mathscr F(-2) \rightarrow \mathscr O_{\mathbb {P} (\mathscr E)}$ corresponding to $\Psi _4(\eta )$.

Finally, when $d = 5$, given $\eta \in H^0(Y, \wedge ^2\mathscr F \otimes \pi ^*\det \mathscr E^{\vee } \otimes \mathscr E)$, from $\Phi _5(\eta )$ we obtain a corresponding alternating map $\pi ^* \mathscr F^\vee \otimes \pi ^* \det \mathscr E(-3) \to \pi ^* \mathscr F(-2)$. The five $4 \times 4$ Pfaffians of this map determine a map of sheaves $\pi ^* \mathscr F(-2) \to \mathscr O_{\mathbb {P} (\mathscr E)}$, as may be computed locally. Define $\Psi _5(\eta )$ as the support of the cokernel of the map $\pi ^* \mathscr F(-2) \to \mathscr O_{\mathbb {P} (\mathscr E)}$ in $\mathbb {P}\mathscr E$.

Definition 3.12 Let $d \in \{3,4,5\}$, $Y$ be a scheme, and $(\mathscr E, \mathscr F_\bullet ), \mathscr H(\mathscr E, \mathscr F_\bullet )$ be sheaves on $Y$ as in Notation 3.8. We say $\eta \in H^0(Y, \mathscr H(\mathscr E, \mathscr F_\bullet ))$ has the right Hilbert polynomial at a point $y \in Y$ if the fiber of $\Psi _d(\eta )$ over $y$ has dimension $0$ and degree $d$. We say $\eta$ has the right Hilbert polynomial if it has the right Hilbert polynomial at every $y \in Y$.

Finally, we are ready to state the low-degree parametrizations. The parametrization in degree $3$ is as follows.

Theorem 3.13 (Generalization of [Reference Casnati and EkedahlCE96, Theorem 3.4], specialization of [Reference PoonenPoo08, Proposition 5.1])

Fix a scheme $Y$ and a rank-$2$ locally free sheaf $\mathscr E$ on $Y$. The map $\eta \mapsto \Psi _3(\eta )$ induces a bijection between:

  1. (1) sections $\eta \in H^0(Y, \operatorname {Sym}^3 \mathscr E \otimes \det \mathscr E^{\vee })$ having the right Hilbert polynomial at every $y \in Y$, up to automorphisms of $\mathscr E$; and

  2. (2) finite locally free Gorenstein covers $\rho : X \rightarrow Y$ of degree $3$ such that $\mathscr E^\vee \simeq \operatorname {coker} \rho ^\#$.

The following proof extends that given in [Reference Casnati and EkedahlCE96, Theorem 3.4]. We note that there the base is assumed to be reduced and noetherian, and the bijection is not stated explicitly. We outline the proof for the reader's convenience.

Proof. We start by constructing the map from (2) to (1). Given such a $\rho : X \rightarrow Y$, we obtain from Theorem 3.2, a resolution of $\mathscr O_{\mathbb {P}\mathscr E}$ as in (3.7), unique up to unique isomorphism. The map $\sigma$ in (3.7) can be viewed as a section $\sigma \in H^0(\mathbb {P} \mathscr E, \pi ^* \det \mathscr E^\vee (3))$. For $\Phi _3$ as defined in (3.10), we obtain a section $\eta := \Phi _3^{-1}(\sigma ) \in H^0(Y, \operatorname {Sym}^3 \mathscr E \otimes \det \mathscr E^{\vee })$. Note that the resulting $\eta$ has the right Hilbert polynomial at every $y \in Y$ because $X \rightarrow Y$ is finite by assumption.

We next show the map $\eta \mapsto \Psi _3(\eta )$ indeed defines a map from (1) to (2). Given $\eta$ of the right Hilbert polynomial at every $y \in Y$, we obtain a right exact sequence (3.13). The assumption that $\eta$ has the right Hilbert polynomial yields that the first map in this sequence is injective and, hence, $X \to \mathbb {P} \mathscr E$ has a resolution of the form (3.7). This resolution shows $X$ is locally finitely presented over $Y$. Further, $X$ is finite as it is locally of finite presentation, proper, and quasi-finite [Reference GrothendieckGro66, 8.11.1]. Flatness of $X \to Y$ may be verified locally, in which case it holds as $X$ is cut out of $\mathbb {P}^1_Y$ by a single equation of degree $3$ not vanishing on any fibers. Therefore, $X$ is a finite locally free degree $3$ cover of $Y$. Finally, exactness of (3.7) implies $\mathscr E^\vee \simeq \operatorname {coker} \rho ^\#$ from Theorem 3.2(iii) and (iv).

It remains to see that these two maps we have defined establish a bijection. For this, we show the compositions of these maps in both orders are equivalent to the identity map. If we begin with a cover $\rho : X \to Y$, (3.7) defines a resolution of $X \to \mathbb {P} \mathscr E$ giving $X$ as the vanishing locus $\Psi _3(\eta ) \subset \mathbb {P} \mathscr E$. To show the other composition is equivalent to the identity, begin with some $\eta \in H^0(Y, \operatorname {Sym}^3 \mathscr E \otimes \det \mathscr E^{\vee })$, and let $X$ denote the associated cover $\Psi _3(\eta )$. The Tschirnhausen bundle $\mathscr E^X$ as in § 3.11 associated to $X$ from Theorem 3.2 is then isomorphic to $\mathscr E$ using Theorem 3.2(iv), as we may view $\eta$ as a map $\pi ^* \det \mathscr E(-3) \to \mathscr O_{\mathbb {P} \mathscr E}$. Upon choosing such an isomorphism $\mathscr E \simeq \mathscr E^X$, we obtain a section $\eta ^X \in H^0(Y, \operatorname {Sym}^3 \mathscr E^X \otimes \det (\mathscr E^X)^{\vee }) \simeq H^0(Y, \operatorname {Sym}^3 \mathscr E \otimes \det \mathscr E^{\vee })$. Using Theorem 3.2(iv), there is an automorphism of $\mathbb {P} \mathscr E$ taking $\Psi _3(\eta )$ to $\Psi _3(\eta ^X)$. From Theorem 3.2(iv) and the fact that the leftmost term of the resolution (3.7) is $\pi ^* \det \mathscr E(-3)$, we find $\mathscr E$ is isomorphic to $\ker (\rho _* \omega _{X/Y} \to \mathscr O_Y\!)$. By Corollary 3.5, this automorphism of $\mathbb {P} \mathscr E$ is induced by an automorphism of $\mathscr E$. Hence, after composing with the automorphism of $\mathscr E$, we can assume $\eta$ and $\eta ^X$ define isomorphic subschemes of $\mathbb {P} \mathscr E$, and so are related via multiplication by a global section $s^{-1} \in \mathscr O_Y(Y)$. By composing with an automorphism of $\mathscr E$ multiplying by $s^{-1}$, $\eta$ and $\eta ^X$ are identified.

We next verify the parametrization in degree $4$.

Theorem 3.14 (Generalization of [Reference Casnati and EkedahlCE96, Theorem 4.4], specialization of [Reference WoodWoo11, Theorem 1.1])

Fix a scheme $Y$, a rank-$3$ locally free sheaf $\mathscr E$ on $Y$, and a rank-$2$ locally free sheaf $\mathscr F$ on $Y$ such that there exists an unspecified isomorphism $\det \mathscr E \simeq \det \mathscr F$. The map $\eta \mapsto \Psi _4(\eta )$ induces a bijection between:

  1. (1) sections $\eta \in H^0(Y, \mathscr F^\vee \otimes \operatorname {Sym}^2 \mathscr E)$ having the right Hilbert polynomial at every $y \in Y$, up to automorphisms of $\mathscr E$ and $\mathscr F$; and

  2. (2) finite locally free Gorenstein maps $\rho :X \rightarrow Y$ of degree $4$ with associated sheaves $\mathscr E^X, \mathscr F^X$ as in § 3.11 which are isomorphic to $\mathscr E$ and $\mathscr F$.

Proof. First we construct the map from (2) to (1). Beginning with a cover $X \to Y$, we obtain a resolution (3.8) and, upon choosing isomorphisms $\mathscr E^X \simeq \mathscr E$ and $\mathscr F^X \simeq \mathscr F$, we obtain a section $\eta \in H^0(Y, \mathscr F^\vee \otimes \operatorname {Sym}^2 \mathscr E)$ having the right Hilbert polynomial at every $y \in Y$.

To construct the map from (1) to (2), we must show $\Psi _4(\eta )$ satisfies the properties listed in (2). We first verify $\Psi _4(\eta )$ is a finitely presented Gorenstein cover of $Y$. On fibers, $\Psi _4(\eta )$ is described as a dimension-$0$ intersection of two quadrics. Since $\eta$ has the right Hilbert polynomial at $y \in Y$, it has degree $4$ over $y$. (We parenthetically note that by Bezout's theorem, having the right Hilbert polynomial is equivalent to having dimension $0$, which then matches with Casnati and Ekedahl's notion of having ‘the right codimension’ from [Reference Casnati and EkedahlCE96, Definition 4.2].) Gorensteinness follows because $\Psi _4(\eta )$ is a local complete intersection.

We next deduce flatness of $\Psi _4(\eta )$ over $Y$. We first explain how to reduce to the case that $Y$ is smooth. Let $Z$ denote the moduli space parameterizing pairs of quadrics in $\mathbb {P}^2$ which comes with a universal $\pi : U \to Z$ whose fiber over a pair $[(Q_1, Q_2)]$ is $Q_1 \cap Q_2$. There is an open locus $Z^\circ \subset Z$ where the intersection of these quadrics is zero-dimensional, and hence has constant degree $4$ by Bezout's theorem. Let $U^\circ := \pi ^{-1}(Z^\circ )$. Since $Z$ is a product of projective spaces, $Z^\circ$ is an open in a product of projective spaces, hence, in particular, smooth. Working fppf locally on $Y$, we can express $X \to Y$ as an intersection of relative quadrics in $\mathbb {P}^2$, in which case $X \to Y$ is pulled back from $U^\circ \to Z^\circ$ via a map $Y \to Z^\circ$. Hence, it suffices to show that $U^\circ \to Z^\circ$ itself is flat. In this case, since $Z^\circ$ is reduced, flatness follows from constancy of the degree.

To conclude the construction of the map from (1) to (2), we will show it is possible to choose identifications $\mathscr E^X \simeq \mathscr E, \mathscr F^X \simeq \mathscr F$ so that we obtain an associated section $\eta ^X \in H^0(Y,\mathscr F^\vee \otimes \operatorname {Sym}^2 \mathscr E) \simeq H^0(Y,(\mathscr F^X)^\vee \otimes \operatorname {Sym}^2 \mathscr E^X)$.

First we show $\mathscr E^X \simeq \mathscr E$. Indeed, there is the following Koszul complex.

(3.14)

It also follows from [Reference EisenbudEis95, Theorem 20.15] (using the comments on [Reference EisenbudEis95, p. 503] and the fact that Gorenstein schemes are Cohen–Macaulay) that (3.14) yields a minimal free resolution of $X_y$ in $\mathbb {P} \mathscr E_y$ for every $y \in Y$. Because $\det \mathscr F \simeq \det \mathscr E$ by assumption, Theorem 3.2(iv) implies $\mathscr E \simeq \mathscr E^X$.

Using the isomorphism $\mathscr E \simeq \mathscr E^X$, we also verify $\mathscr F \simeq \mathscr F^X$. Let $i: X \to \mathbb {P} \mathscr E$ and $i^X: X \to \mathbb {P} \mathscr E^X$ denote the two embeddings. By pushing forward the twist of (3.14) by $\mathscr O_{\mathbb {P} \mathscr E}(2)$ along $\pi$, we find $\mathscr F^X \simeq \ker (\operatorname {Sym}^2 \mathscr E^X \to \pi _* (i^X_* \mathscr O_X \otimes \mathscr O_{\mathbb {P} \mathscr E^X}(2)))$. Similarly, the analogous resolution from Theorem 3.2 for $X$ in terms of $\mathscr E^X$ and $\mathscr F^X$ yields $\mathscr F \simeq \ker (\operatorname {Sym}^2 \mathscr E \to \pi _* (i_* \mathscr O_X \otimes \mathscr O_{\mathbb {P} \mathscr E}(2)))$. Hence, the isomorphism $\mathscr E \simeq \mathscr E^X$ induces the desired isomorphism $\mathscr F \simeq \mathscr F^X$.

The isomorphism $\mathscr E \simeq \mathscr E^X$ is compatible with the above restriction map, and so induces an isomorphism $\mathscr F \simeq \mathscr F^X$. This concludes the verification that the map we have produced indeed goes from (1) to (2).

It remains to prove the compositions of the above maps in both directions are equivalent to the identity. As in the degree $3$ case, if we start with a cover, and produce the associated section $\eta ^X$, $\Psi _4(\eta ^X)$ is isomorphic to $X$ via the construction. For showing the reverse composition is equivalent to the identity, start with some section $\eta$. Let $X$ denote the resulting cover $\Psi _4(\eta )$.

Given the above identifications $\mathscr E^X \simeq \mathscr E, \mathscr F^X \simeq \mathscr F$, we wish to show $\eta ^X$ is related to $\eta$ by automorphisms of $\mathscr E$ and $\mathscr F$. Note also here that any automorphism of $\mathscr E$ and $\mathscr F$ sends $\eta$ to another section defining an isomorphic cover. Using Theorem 3.2, there is an automorphism of $\mathbb {P} \mathscr E$ taking the subscheme $\Psi _4(\eta ^X)$ to $\Psi _4(\eta )$. From Theorem 3.2(iv) and the fact that the leftmost term of the resolution (3.8) is $\pi ^* \det \mathscr E(-4)$, we find $\mathscr E$ is isomorphic to $\ker (\rho _* \omega _{X/Y} \to \mathscr O_Y)$. By Corollary 3.5, the above automorphism of $\mathbb {P} \mathscr E$ is induced by an automorphism of $\mathscr E$. By composing with the inverse of this automorphism, we may assume the resulting map is the identity on $\mathbb {P} \mathscr E$, and so the automorphism of $\mathbb {P} \mathscr E$ is then induced by some automorphism of $\mathscr E$ via multiplication by a section $s\in \mathscr O_Y(Y)$. After composing with multiplication by $s^{-1}$, we may reduce to the case $s$ is the identity. Since $\mathscr F$ is a subsheaf of $\operatorname {Sym}^2 \mathscr E$ by Theorem 3.2(v), the image of the induced map $\mathscr F \to \operatorname {Sym}^2 \mathscr E$ is uniquely determined by $X$, but the precise map is only determined up to automorphism of $\mathscr F$. Upon composing with such an automorphism, we may identify not just the images of $\mathscr F$ in $\operatorname {Sym}^2 \mathscr E$, but further we may identify the maps. Under these identifications, $\eta$ agrees with $\eta ^X$, when viewed as maps $\mathscr F \to \operatorname {Sym}^2 \mathscr E$.

We next state and prove the analogous parametrization in degree $5$. As preparation, we will need the following application of the structure theorem for codimension-$3$ Gorenstein algebras due to Buchsbaum and Eisenbud.

Lemma 3.15 Let $Y$ be a scheme, and let $\mathscr E$ and $\mathscr F$ be locally free sheaves on $Y$ of ranks $3$ and $5$. A finite locally free Gorenstein map $\rho : X \to Y$ of degree $5$, described as $\Psi _5(\eta )$ for $\eta \in H^0(Y, \wedge ^2 \mathscr F \otimes \mathscr E \otimes \det \mathscr E^{\vee })$, has a resolution of the form

(3.15)

which restricts to a minimal free resolution over each $y\in Y$, where $\beta _2$ is alternating and $\beta _3$ is identified with the dual of $\beta _1$ tensored with $\pi ^* \det \mathscr E^\vee \otimes \pi ^*\det \mathscr F(-5)$.

Proof. In (3.15), the map $\beta _2$ is obtained from $\eta$, interpreted as a section of $H^0(Y, \wedge ^2 \mathscr F \otimes \mathscr E \otimes \det \mathscr E^{\vee }) \simeq H^0(\mathbb {P} \mathscr E, \pi ^*(\wedge ^2 \mathscr F \det \mathscr E^{\vee })(1))$. The map $\beta _3$ is obtained by taking five $4 \times 4$ Pfaffians of $\beta _2$. To make sense of this, one may first construct $\beta _3$ locally upon choosing trivializations of $\mathscr F$ and $\mathscr E$. One then obtains a global map $\mathscr F(-2) \to \mathscr O_{\mathbb {P} \mathscr E}$ because the formation of the Pfaffians are compatible with restriction to an open subscheme of $Y$. Finally, $\beta _1$ is obtained as the dual to $\beta _3$, tensored with $\pi ^* \det \mathscr E^\vee \otimes \pi ^*\det \mathscr F(-5)$.

Since we have constructed the maps in (3.15) globally over $\mathbb {P} \mathscr E$, it is enough to verify they furnish a minimal free resolution on geometric fibers. To this end, we may work locally on $Y$ and choose a trivialization $u: \det \mathscr E \simeq \mathscr O_Y$. Upon choosing this trivialization and composing with the isomorphism $u$ for the two left nonzero sheaves in (3.15), we obtain a sequence

(3.16)

where $\beta _2'$ is still alternating, i.e. it corresponds to an element of $H^0(Y, \wedge ^2 \mathscr F \otimes \mathscr E)$, and $\beta _3'$ remains identified with the dual of $\beta _1'$, now tensored with $\pi ^*\det \mathscr F(-5)$. Since the sequence (3.16) commutes with base change on $Y$, we may further restrict to a geometric point $y \in Y$ and, hence, assume $Y$ is the spectrum of an algebraically closed field.

We wish to show (3.16) is a minimal locally free resolution. To do so, we wish to apply [Reference Buchsbaum and EisenbudBE77], and so we translate the above to the setting of commutative algebra. By Theorem 3.2(iii), $X \to \mathbb {P} \mathscr E$ is an arithmetically Gorenstein subscheme. Writing $\mathbb {P} \mathscr E = \operatorname {\operatorname {Proj}} \kappa (y)[x_0, x_1, x_2, x_3]$, the cone over $X$ defines a Gorenstein subscheme of $\operatorname {\operatorname {Spec}} \kappa (y)[x_0, x_1, x_2, x_3]_{(x_0, x_1, x_2, x_3)}$, the localization of $\mathbb {A}^4_{\kappa (y)}$ at the origin. Taking $R := \kappa (y)[x_0, x_1, x_2, x_3]_{(x_0, x_1, x_2, x_3)}$, we can identify $\pi ^* \mathscr F$ with a rank $5$ free $R$-module $F$. Let $J$ denote the ideal of the cone over $X$ in $R$. The resolution (3.16) can then be reexpressed in the form

(3.17)

with $\beta _2'' \in \wedge ^2 F$ alternating and $\beta _3''$ the dual of $\beta _1''$. By the definition of $\Psi _5(\eta )$ this sequence is exact at $R$, so $J$ is the image of $\beta _1''$. Since $X$ has codimension $3$ in $\mathbb {P} \mathscr E$ by assumption, $J$ is of grade $3$. Hence, (3.17) satisfies the hypotheses of [Reference Buchsbaum and EisenbudBE77, Theorem 2.1(1)]. It is stated that any such resolution satisfying these hypotheses is a minimal free resolution of $R/JR$ in the bottom paragraph of [Reference Buchsbaum and EisenbudBE77, p. 463] and the proof is given in [Reference Buchsbaum and EisenbudBE77, p. 464].

Theorem 3.16 (Generalization of [Reference CasnatiCas96, Theorem 3.8])

Fix a scheme $Y$, a rank-$4$ locally free sheaf $\mathscr E$ on $Y$, and a rank-$5$ locally free sheaf $\mathscr F$ on $Y$ such that there exists an unspecified isomorphism $\det \mathscr F \simeq (\det \mathscr E)^{\otimes 2}$. The map $\eta \mapsto \Psi _5(\eta )$ induces a bijection between:

  1. (1) sections $\eta \in H^0(Y, \wedge ^2 \mathscr F \otimes \mathscr E \otimes \det \mathscr E^{\vee })$ having the right Hilbert polynomial at every $y \in Y$, up to automorphisms of $\mathscr E$ and $\mathscr F$; and

  2. (2) finite locally free Gorenstein maps $\rho :X \rightarrow Y$ of degree $5$ with associated sheaves $\mathscr E^X, \mathscr F^X$ as in § 3.11 which are isomorphic to $\mathscr E$ and $\mathscr F$.

Proof. To start, we construct the map from (2) to (1). Beginning with a cover $X \to Y$, we obtain a resolution (3.9). Upon choosing isomorphisms $\mathscr E^X \simeq \mathscr E$ and $\mathscr F^X \simeq \mathscr F$ we obtain a section $\eta \in H^0(Y, \mathscr F^{\otimes 2} \otimes \mathscr E \otimes \det \mathscr E^\vee )$ having the right Hilbert polynomial at every $y \in Y$. We wish to check next that this section actually lies in $H^0(Y, \wedge ^2 \mathscr F \otimes \mathscr E \otimes \det \mathscr E^\vee )$. Viewing this as a map $\pi ^*\mathscr F^\vee \otimes \pi ^* \det \mathscr E \to \pi ^*\mathscr F(1)$ via (3.9), it is enough to verify the map is alternating locally on the base. Therefore, for this verification, we may assume $Y$ is the spectrum of a local ring and $\mathscr E$ is trivial. After this reduction, $X \subset \mathbb {P} \mathscr E$ is codimension $3$ and arithmetically Gorenstein, and so the Buchsbaum–Eisenbud parametrization for codimension-$3$ Gorenstein schemes [Reference Buchsbaum and EisenbudBE77, Theorem 2.1(2)] applies. This produces a resolution of $X \subset \mathbb {P} \mathscr E$ as in (3.15) which by Theorem 3.2 must agree with (3.9). Since the map corresponding to $\pi ^*\mathscr F^\vee \otimes \pi ^* \det \mathscr E \to \pi ^* \mathscr F(1)$ is alternating in the resolution of [Reference Buchsbaum and EisenbudBE77, Theorem 2.1(2)] it follows that $\pi ^* \mathscr F^\vee \otimes \pi ^*\det \mathscr E \to \pi ^* \mathscr F(1)$ is also alternating.

We next construct the map from (1) to (2). This map will send $\eta$ to $\Psi _5(\eta )$. To show this is indeed a well-defined map, we wish to verify $\Psi _5(\eta )$ is a finitely presented Gorenstein cover of $Y$. The finite presentation condition follows from the resolution given in (3.9). We may check the remaining conditions locally on $Y$, and hence assume $Y$ is the spectrum of a local ring. Observe that $X \to \mathbb {P} \mathscr E$ is arithmetically Gorenstein and of codimension $3$, using the assumption that $\eta$ has the right Hilbert polynomial at each $y \in Y$. Using [Reference Buchsbaum and EisenbudBE77, Theorem 2.1(1)], we find that $X$ is Gorenstein and is cut out scheme theoretically by the five $4 \times 4$ Pfaffians associated to $\eta$, thought of as a map $\pi ^*\mathscr F^\vee \otimes \pi ^*\det \mathscr E \to \pi ^* \mathscr F(1)$. On fibers, $\Psi _5(\eta )$ is described as the vanishing of the five $4 \times 4$ Pfaffians of an alternating linear map. The resolution [Reference Buchsbaum and EisenbudBE77, Theorem 2.1(1)], can be identified with one of the form (3.9), from which one may calculate that the Hilbert polynomial of every fiber is $5$. Therefore, the resulting scheme $\Phi _5(\eta )$ is finite and each fiber has degree $5$.

We next deduce flatness of $\Psi _5(\eta )$ over $Y$. The idea is to reduce to the universal case, where we can verify flatness using constancy of Hilbert polynomial. Let $Z \simeq \mathbb {A}^{40}$ denote the affine space parameterizing alternating $5 \times 5$ matrices of linear forms in $\mathbb {P}^3$. Let $\pi : U \to Z$ denote the universal family of intersections of the five $4 \times 4$ Pfaffians of the corresponding matrix, so that the fiber of $\pi$ over a point $[M] \in Z$ is the intersection of the five $4 \times 4$ Pfaffians of the alternating matrix of linear forms $M$. There is an open subset $Z^\circ \subset Z$ parameterizing the locus where the fiber of $\pi$ is zero-dimensional and degree at most $5$. One may verify that every fiber of $\pi$ has degree at least $5$, and so this open $Z^\circ$ parameterizes subschemes of degree exactly $5$. Let $U^\circ := \pi ^{-1}(Z^\circ )$. Since $Z$ is smooth $Z^\circ$ is as well. Working fppf locally on $Y$, we can assume $X \to Y$ is a pullback of $U^\circ \to Z^\circ$ along a map $Y \to Z$. Hence, it suffices to show that $U^\circ \to Z^\circ$ itself is flat. In this case, since $Z$ is reduced, flatness follows from constancy of the degree.

In order to show the above constructed map indeed takes (1) to (2), we must demonstrate identifications $\mathscr E^X \simeq \mathscr E$ and $\mathscr F^X \simeq \mathscr F$. To obtain the first identification, we use Lemma 3.15. Since $\det \mathscr E^{\otimes 2} \simeq \det \mathscr F$, the leftmost nonzero term of the resolution in Lemma 3.15 becomes $\det \pi ^* \mathscr E^\vee \otimes \pi ^*\det \mathscr F(-5) \simeq \pi ^* \det \mathscr E(-5)$. Hence, Theorem 3.2(iv) implies $\mathscr E \simeq \mathscr E^X$. Let $i : X \to \mathbb {P} \mathscr E, i^X : X \to \mathbb {P} \mathscr E^X$ denote the inclusions. By twisting (3.15) by $\mathscr O_{\mathbb {P} \mathscr E}(2)$ and pushing forward, we find $\mathscr F \simeq \ker (\operatorname {Sym}^2 \mathscr E \to \pi _* (i_* \mathscr O_X \otimes \mathscr O_{\mathbb {P} \mathscr E}(2)))$. The analogous resolution from Theorem 3.2 for $X$ in terms of $\mathscr E^X$ and $\mathscr F^X$ yields $\mathscr F^X \simeq \ker (\operatorname {Sym}^2 \mathscr E^X \to \pi _*(i^X_* \mathscr O_X \otimes \mathscr O_{\mathbb {P} \mathscr E^X}(2)))$. Hence, the isomorphism $\mathscr E \simeq \mathscr E^X$ induces the desired identification $\mathscr F \simeq \mathscr F^X$. This completes the construction of the map from (1) to (2).

It remains to prove the compositions of the above maps between (1) and (2) are equivalent to the identity. As in the degree $3$ case, if we start with a cover, produce the associated section $\eta ^X$, $\Psi _5(\eta ^X)$ is isomorphic to $X$ via the construction.

For the reverse composition, start with some section $\eta$ and let $X$ denote the resulting cover $\Psi _5(\eta )$. Now, choose identifications $\mathscr E^X \simeq \mathscr E, \mathscr F^X \simeq \mathscr F$ as above so that we obtain an associated section $\eta ^X \in H^0(Y,\wedge ^2 \mathscr F^\vee \otimes \det \mathscr E \to \mathscr E) \simeq H^0(Y, \wedge ^2 (\mathscr F^X)^\vee \otimes \det \mathscr E^X \to \mathscr E^X)$. We wish to show $\eta ^X$ is related to $\eta$ by automorphisms of $\mathscr E$ and $\mathscr F$. Note also here that any automorphism of $\mathscr E$ and $\mathscr F$ sends $\eta$ to another section defining an isomorphic cover. Using Theorem 3.2, there is an automorphism of $\mathbb {P} \mathscr E$ taking $\Psi _5(\eta ^X)$ to $\Psi _5(\eta )$. From Theorem 3.2(iv) and the fact that the leftmost term of the resolution (3.8) is $\pi ^* \det \mathscr E(-5)$, we find $\mathscr E$ is isomorphic to $\ker (\rho _* \omega _{X/Y} \to \mathscr O_Y)$. By Corollary 3.5, this automorphism of $\mathbb {P} \mathscr E$ is induced by an automorphism of $\mathscr E$. By composing with the inverse of this automorphism, we may assume $\eta$ and $\eta ^X$ define the same subscheme of $\mathbb {P} \mathscr E$. Hence we may assume the automorphism of $\mathbb {P} \mathscr E$ is then induced by multiplication by a section $s \in \mathscr O_Y(Y)$. After composing with multiplication by $s^{-1}$, we may therefore reduce to the case that $s$ is the identity section. By Theorem 3.2(iii), we obtain a unique isomorphism between the two resolutions of $X$ in $\mathbb {P} \mathscr E$ (3.9) determined by $\eta$ and $\eta ^X$. This isomorphism can be specified as a tuple of $5$ maps between the nonzero terms of (3.9).

We next show we can apply an automorphism of $\mathscr F$ so as to assume the map $\pi ^* \mathscr F(-2) \to \pi ^* \mathscr F(-2)$ is the identity. Since $\mathscr F$ is a subsheaf of $\operatorname {Sym}^2 \mathscr E$ by Theorem 3.2(v), the image of the induced map $\mathscr F \to \operatorname {Sym}^2 \mathscr E$ coming from the Pfaffians associated to $\eta$ is uniquely determined by $X$, but the precise map is only determined up to automorphism of $\mathscr F$. Upon composing with such an automorphism, we may identify not just the images of $\mathscr F$ in $\operatorname {Sym}^2 \mathscr E$, but further we may identify the maps. Under these identifications, $\eta$ agrees with $\eta ^X$, when viewed as maps $\mathscr F \to \operatorname {Sym}^2 \mathscr E$.

So far, we have constructed a map of the two resolutions (3.9) associated to $\eta$ and $\eta _X$. Upon choosing identifications $\mathscr E \simeq \mathscr E^X$ and $\mathscr F \simeq \mathscr F^X$ as above, we have enforced that the map of resolutions is given by the identity on the terms $\mathscr O_X \to \mathscr O_X, \mathscr O_{\mathbb {P}} \to \mathscr O_{\mathbb {P}}$, and $\pi ^* \mathscr F(-2) \to \pi ^* \mathscr F(-2)$. When we write the second nonzero term of (3.9) as $\pi ^* \mathscr F^\vee \otimes \pi ^* \det \mathscr E(-3)$, we have identified it via Grothendieck duality as pairing with the third nonzero term $\pi ^* \mathscr F(-3)$ into $\pi ^* \mathscr E(-5)$, and therefore the induced automorphism of $\pi ^* \mathscr F^\vee \otimes \pi ^* \det \mathscr E(-3)$ must respect this duality. In particular, since we have reduced to the case where the automorphism of $\pi ^* \mathscr F(-2)$ is the identity, we also obtain the induced automorphism of $\pi ^* \mathscr F^\vee \otimes \pi ^* \det \mathscr E(-3)$ is the identity. Using Theorem 3.2(v) to guarantee that the maps $\eta$ and $\eta ^X$ from $\mathscr F^\vee \otimes \det \mathscr E \to \mathscr F \otimes \mathscr E$ are injective, we obtain the desired identification of $\eta$ with $\eta ^X$.

Finally, we recall a rather elementary criterion for when $\Psi _d(\eta )$ is geometrically connected.

Theorem 3.17 (Part of [Reference Casnati and EkedahlCE96, Theorem 3.6], [Reference Casnati and EkedahlCE96, Theorem 4.5], and [Reference CasnatiCas96, Theorem 4.4])

Keeping notation as in Notation 3.8, assume that $Y$ is a geometrically connected and geometrically reduced projective scheme over a field $k$. If $h^0(Y, \mathscr E^\vee ) = 0$, then $\Psi _d(\eta )$ is geometrically connected.

Proof. The proof is essentially given in [Reference Casnati and EkedahlCE96, Theorem 3.6], and we repeat it for the reader's convenience. Let $X := \Psi _d(\eta )$. If $h^0(Y, \mathscr E^\vee ) = 0$ the exact sequence

(3.18)

induces an isomorphism $H^0(Y, \mathscr O_Y\!) \simeq H^0(Y, \rho _* \mathscr O_X\!) = H^0(X, \mathscr O_X\!)$. Since $Y$ is geometrically connected and geometrically reduced, we have $h^0(Y, \mathscr O_Y\!) = 1$. From this we find $H^0(X, \mathscr O_X\!) = 1$ as well, and therefore $X$ is necessarily geometrically connected.

4. Describing stacks of low-degree covers as quotients

In this section, we give a description of the stack of degree $d$ Gorenstein covers as a global quotient stack for $3 \leq d \leq 5$. We now introduce the groups we will be quotienting by. Since the Hurwitz stack is closely related to the Weil restriction of the stack of degree $d$ covers along $\mathbb {P}^1 \to \operatorname {\operatorname {Spec}} k$, we will simultaneously define these automorphism groups along Weil restrictions.

Remark 4.1 We are about to define an automorphism sheaf $\operatorname {Aut}_{\mathscr E, \mathscr F_\bullet }^{Y/B}$ for $Y \to B$ a morphism of schemes and $\mathscr E, \mathscr F_\bullet$ locally free sheaves on $Y$. Before giving the formal definition, we give an intuitive description.

Consider first the case that $d = 4$, or $d = 5$, $Y = B = \operatorname {\operatorname {Spec}} k$, and additionally assume there is an isomorphism $\det \mathscr E^{\otimes d-3} \simeq \det \mathscr F_1$. Then, the points of $\operatorname {Aut}_{\mathscr E,\mathscr F_\bullet }^{\operatorname {\operatorname {Spec}} k/\operatorname {\operatorname {Spec}} k}$ corresponds to automorphisms of $\mathscr E$ and $\mathscr F_1$ which preserve the above isomorphism. However, in what follows we do not require such an isomorphism $\det \mathscr E^{\otimes d-3} \simeq \det \mathscr F_1$ exists, and so the definition we give is somewhat more general. Namely, we instead work with automorphisms $(M, N) \in \operatorname {Aut}_\mathscr E \times \operatorname {Aut}_{\mathscr F_1}$ so that $\det M^{d-3} = \det N$.

Another important case is that where $Y = D, B = \operatorname {\operatorname {Spec}} k$, and again $d= 4$ or $5$. If, in addition, there is an isomorphism $\det (\mathscr E^{\otimes d-3})_D \simeq (\det \mathscr F_1)_D$, $\operatorname {Aut}_{\mathscr E,\mathscr F_\bullet }^{D/\operatorname {\operatorname {Spec}} k}$ can be thought of as parameterizing automorphisms of $\mathscr E$ and $\mathscr F_1$ over $D$ which preserve the isomorphism $\det (\mathscr E^{\otimes d-3})_D \simeq (\det \mathscr F_1)_D$. Again, we have the caveat that this is only correct when such an isomorphism exists.

Definition 4.2 Given a scheme $Y$ over a base $B$ and an integer $d$, let resolution data for $Y$ and $d$ denote a tuple of locally free sheaves $(\mathscr E, \mathscr F_\bullet )$ on $Y$, where $\mathscr E$ is a locally free sheaf of rank $d-1$ and $\mathscr F_\bullet$ denotes the sequence $\mathscr F_1, \ldots, \mathscr F_{\lfloor ({d-2})/{2} \rfloor }$ where $\operatorname {rk} \mathscr F_i = \beta _i$ as in (3.2). Let $3 \leq d \leq 5$, fix a scheme $Y$ over a field, and fix resolution data $(\mathscr E, \mathscr F_\bullet )$ for a degree $d$ cover of $Y$. For $\mathscr G$ a locally free sheaf on $Y$, let $\Delta _\mathscr G^{Y/B} := \mathbb {G}_m \to \operatorname {Res}_{Y/B}(\operatorname {Aut}_{\mathscr G/Y}\!)$ denote the map adjoint to the central inclusion $(\mathbb {G}_m \times _B Y) \to \operatorname {Aut}_{\mathscr G/Y}$ on $Y$. We denote by $\big (\Delta _\mathscr G^{Y/B} \big )^i$ the composition $\mathbb {G}_m \xrightarrow {x \mapsto x^i} \mathbb {G}_m \xrightarrow {\Delta _\mathscr G^{Y/B}} \operatorname {Res}_{Y/B}(\operatorname {Aut}_{\mathscr G/Y}\!)$ and define

\begin{align*} ((\Delta_\mathscr E^{Y/B})^i,(\Delta_{\mathscr F_1}^{Y/B})^{j}) : \mathbb{G}_m & \rightarrow \operatorname{Res}_{Y/B}(\operatorname{Aut}_{\mathscr E/Y}\!) \times\operatorname{Res}_{Y/B}(\operatorname{Aut}_{\mathscr F_1/Y}\!)\\ x &\mapsto (\Delta_{\mathscr E}^{Y/B}(x^i), \Delta_{\mathscr E}^{Y/B}(x^j)). \end{align*}

Finally, we use

\[ \operatorname{coker}((\Delta_\mathscr E^{Y/B})^i,(\Delta_{\mathscr F_1}^{Y/B})^{j}) := \frac{(\operatorname{Res}_{Y/B}(\operatorname{Aut}_{\mathscr E/Y}\!) \times\operatorname{Res}_{Y/B}(\operatorname{Aut}_{\mathscr F_1/Y}\!))}{((\Delta_\mathscr E^{Y/B})^i,(\Delta_{\mathscr F_1}^{Y/B})^{j})(\mathbb{G}_m)}. \]

Then, define the automorphism sheaf of this resolution data to be the $B$-scheme

(4.1)\begin{equation} \operatorname{Aut}_{\mathscr E, \mathscr F_\bullet}^{Y/B} := \begin{cases} \operatorname{Res}_{Y/B}(\operatorname{Aut}_{\mathscr E/Y}\!) & \text{if } d = 3 ,\\ \operatorname{coker}(\Delta_\mathscr E^{Y/B}, (\Delta_{\mathscr F_1}^{Y/B})^{2}) & \text{if } d= 4, \\ \operatorname{coker}((\Delta_\mathscr E^{Y/B})^2, (\Delta_{\mathscr F_1}^{Y/B})^3) & \text{if } d= 5. \end{cases} \end{equation}

In the case $Y = B$, we notate $\operatorname {Aut}_{\mathscr E, \mathscr F_\bullet }^{B/B}$ simply by $\operatorname {Aut}_{\mathscr E, \mathscr F_\bullet }$. When $d =4$ or $5$, we will often denote $\mathscr F_1$ by $\mathscr F$.

Throughout much of the remainder of the paper, we will typically work over the base $B = \operatorname {\operatorname {Spec}} k$ for $k$ a field. There are notable exceptions, such as Proposition 4.8, where we take $B = \operatorname {\operatorname {Spec}} \mathbb {Z}$.

Remark 4.3 Concretely, $\mathscr E$ and $\mathscr F_\bullet$ in Definition 4.2 are (sequences of) sheaves of the following ranks. For $d = 3$, $\mathscr E$ is locally free of rank $2$ and $\mathscr F_\bullet$ is trivial (i.e. the sequence of sheaves has length $0$). When $d = 4$, $\mathscr E$ is locally free of rank $3$ and $\mathscr F_\bullet = \mathscr F$ is locally free of rank $2$. When $d = 5$, $\mathscr E$ is locally free of rank $4$ and $\mathscr F_\bullet = \mathscr F$ is locally free of rank $5$.

In order be able to calculate the class of quotients by the groups of Definition 4.2 in the Grothendieck ring, it will be useful to know these groups are often special. The following description of these quotients will allow us later, in Lemma 7.12, to easily deduce these groups are special.

Lemma 4.4 Maintaining the notation of Definition 4.2, we have an isomorphism of functors

(4.2)\begin{align} \operatorname{Aut}_{\mathscr E, \mathscr F_\bullet}^{Y/B} \simeq \begin{cases} \ker(\det,\det^{-1}): \operatorname{Res}_{Y/B}(\operatorname{Aut}_{\mathscr E/Y}\!) \times \operatorname{Res}_{Y/B}(\operatorname{Aut}_{\mathscr F/Y}\!) \rightarrow \operatorname{Res}_{Y/B} (\mathbb{G}_m) & \text{if } d= 4 ,\\ \ker(\det^2,\det^{-1}): \operatorname{Res}_{Y/B}(\operatorname{Aut}_{\mathscr E/Y}\!) \times \operatorname{Res}_{Y/B}(\operatorname{Aut}_{\mathscr F/Y}\!) \rightarrow \operatorname{Res}_{Y/B}(\mathbb{G}_m) & \text{if } d= 5. \end{cases} \end{align}

Here, by determinant we mean the map adjoint to the corresponding determinant map on $Y$.

Proof. We produce the claimed isomorphisms by constructing a section to the quotient map $q : \operatorname {Res}_{Y/B}(\operatorname {Aut}_{\mathscr E/Y}\!) \times \operatorname {Res}_{Y/B}(\operatorname {Aut}_{\mathscr F/Y}\!) \to \operatorname {Aut}_{\mathscr E, \mathscr F_\bullet }^{Y/B}$ defining $\operatorname {Aut}_{\mathscr E, \mathscr F_\bullet }^{Y/B}$.

To start, we cover the case $d = 4$. Given $(M, N) \in \operatorname {Res}_{Y/B}(\operatorname {Aut}_{\mathscr E/Y}\!) \times \operatorname {Res}_{Y/B}(\operatorname {Aut}_{\mathscr F/Y}\!)$, for $\lambda \in \mathbb {G}_m$, we can identify $q(M,N) = q(\lambda M, \lambda ^2 N)$. For any such $(M,N)$ the key observation is that there is a unique $\lambda \in \mathbb {G}_m$ such that $\det (\lambda M) = \det (\lambda ^2 N)$. Indeed, $\det (\lambda M) = \lambda ^3 \det M$ while $\det ( \lambda ^2 N) = \lambda ^4 \det N$ and so the unique such $\lambda$ is $\lambda = \det M/\det N$. This gives the desired splitting realizing $\operatorname {Aut}_{\mathscr E, \mathscr F_\bullet }^{Y/B}$ as a subgroup of $\operatorname {Res}_{Y/B}(\operatorname {Aut}_{\mathscr E/Y}\!) \times \operatorname {Res}_{Y/B}(\operatorname {Aut}_{\mathscr F/Y}\!)$ because the composition $\ker (\det, \det ^{-1}) \to \operatorname {Res}_{Y/B}(\operatorname {Aut}_{\mathscr E/Y}\!) \times \operatorname {Res}_{Y/B}(\operatorname {Aut}_{\mathscr F/Y}\!) \to \operatorname {Aut}_{\mathscr E, \mathscr F_\bullet }$ is an isomorphism.

The $d = 5$ case is quite similar to the $d = 4$ case. Namely, in this case, for $(M, N) \in \operatorname {Res}_{Y/B}(\operatorname {Aut}_{\mathscr E/Y}\!) \times \operatorname {Res}_{Y/B}(\operatorname {Aut}_{\mathscr F/Y}\!)$, there is again a unique $\lambda \in \mathbb {G}_m$ so that $(\det (\lambda ^2 M))^2 = \det (\lambda ^3 N)$. Indeed, $(\det (\lambda ^2 M))^2 = \lambda ^{16} \det M^2$ and $\det (\lambda ^3 N) = \lambda ^{15} \det N$, so the unique desired $\lambda$ is $\det N/(\det M)^2$. As in the $d = 4$ case, this provides a section to the given quotient map realizing $\operatorname {Aut}_{\mathscr E, \mathscr F_\bullet }^{Y/B}$ as the subgroup of $\operatorname {Res}_{Y/B}(\operatorname {Aut}_{\mathscr E/Y}\!) \times \operatorname {Res}_{Y/B}(\operatorname {Aut}_{\mathscr F/Y}\!)$ given as those $(M,N)$ with $(\det M)^2 = \det N$.

We next describe a presentation of the stack parameterizing degree $d$ Gorenstein covers for $3 \leq d \leq 5$. To make our next definition, we will need to know the Gorenstein locus of a finite locally free map is open.

Lemma 4.5 Let $f: X \to Y$ be a finite locally free morphism of schemes. The locus of points of $Y$ on which the fiber of $f$ is Gorenstein is an open subscheme of $Y$.

Proof. First, by [Sta, Tag 00RH], the condition that the fiber be Cohen–Macaulay is an open condition. After restricting to such an open subscheme, by [Reference ConradCon00, Theorem 3.5.1], a dualizing sheaf exists, and the Gorenstein locus is then the locus where this dualizing sheaf is locally free, which again defines an open subscheme.

We are now ready to define the relevant Gorenstein loci. With notation as in Definition 4.2, we work over $B = \operatorname {\operatorname {Spec}} \mathbb {Z}$.

Definition 4.6 For each $3 \leq d \leq 5$, fix free sheaves on $Y = B = \operatorname {\operatorname {Spec}} \mathbb {Z}$, $\mathscr E$ and $\mathscr F_\bullet$ as in Definition 4.2 and Remark 4.3. Let $\mathrm {U}_{d} \subset \operatorname {\operatorname {Spec}} (\operatorname {Sym}^\bullet H^0(\operatorname {\operatorname {Spec}} \mathbb {Z}, \mathscr H(\mathscr E, \mathscr F_\bullet ))^\vee )$ denote the open subscheme (using Lemma 4.5) functorially parameterizing those sections $\eta$ so that $\Psi _d(\eta )$ defines a degree $d$ locally free Gorenstein cover, for $\Psi _d$ the maps (depending on $3 \leq d \leq 5$) defined in § 3.6.

In what follows, we use $\mathrm {Covers}_{d}$ to denote the fibered category whose $S$ points are finite locally free covers $X \to S$ of degree $d$ with Gorenstein fibers.

Definition 4.7 For $3 \leq d \leq 5$, the map $\Psi _d$ over $B = \operatorname {\operatorname {Spec}} \mathbb {Z}$ induces a map $\mu _d: \mathrm {U}_{d} \to \mathrm {Covers}_{d}$, with $\mathrm {Covers}_{d}$ as defined above. There is a natural action of $\operatorname {Aut}_{\mathscr E, \mathscr F_\bullet }$ on $\mathrm {U}_{d}$, induced by the action of $\operatorname {Aut}_{\mathscr E} \times \operatorname {Aut}_{\mathscr F_\bullet }$ on $\mathrm {U}_{d}$. The map $\mu _d$ is invariant under this action, since the resulting abstract degree $d$ cover is unchanged by such re-coordinatizations. We now define the induced map from the quotient stack $\phi _d : [\mathrm {U}_{d}/\operatorname {Aut}_{\mathscr E, \mathscr F_\bullet }] \to \mathrm {Covers}_{d}$.

Proposition 4.8 For $3 \leq d \leq 5$, the map $\phi _d$ defined in Definition 4.7 over $B = \operatorname {\operatorname {Spec}} \mathbb {Z}$ is an isomorphism.

When $d=3,4$, Proposition 4.8 is the specialization of the isomorphisms of moduli stacks given in [Reference PoonenPoo08, Proposition 5.1] and [Reference WoodWoo11, Theorem 1.1] to Gorenstein covers.

Proof. We will construct an inverse map using Theorem 3.2. Using Theorem 3.2, there is an $\operatorname {Aut}_{\mathscr E} \times \operatorname {Aut}_{\mathscr F_\bullet }$ torsor $\mathrm {T}_{d}$ over $\mathrm {Covers}_{d}$ whose $S$-points parameterize covers $X \to S$ together with specified trivializations $\mathscr E^X \simeq \mathscr E, \mathscr F_\bullet ^X \simeq \mathscr F_\bullet$ of the sheaves $\mathscr E^X$ and $\mathscr F^X_\bullet$ associated to $X$ coming from Theorem 3.2. Note here that $\mathrm {T}_{d}$ maps surjectively to $\mathrm {Covers}_{d}$ because for any $S$ point, there is an open cover of $S$ on which these vector bundles become isomorphic to trivial bundles. The parametrizations Theorems 3.13, 3.14, and 3.16 then give a section $\eta \in \mathscr H(\mathscr E, \mathscr F_\bullet )$. This induces a map $\mathrm {T}_{d} \to \mathrm {U}_{d}$.

We wish to show this induced map $\mathrm {T}_{d} \to \mathrm {U}_{d}$ is an isomorphism in degree $3$ and a $\mathbb {G}_m$ torsor in degrees $4$ and $5$, where $\mathbb {G}_m$ is the copy of $\mathbb {G}_m \subset \operatorname {Aut}_{\mathscr E} \times \operatorname {Aut}_{\mathscr F}$ as in Definition 4.2 whose quotient yields $\operatorname {Aut}_{\mathscr E, \mathscr F_\bullet }$. Once we verify this, the parametrizations Theorems 3.13, 3.14, and 3.16 imply that the composition $\mathrm {T}_{d} \to \mathrm {U}_{d} \to [\mathrm {U}_{d}/\operatorname {Aut}_{\mathscr E, \mathscr F_\bullet }] \xrightarrow {\phi _d} \mathrm {Covers}_{d}$ is the structure map for the torsor $\mathrm {T}_{d} \to \mathrm {Covers}_{d}$. From this, it follows that the resulting isomorphism $[\mathrm {T}_{d}/ \operatorname {Aut}_\mathscr E \times \operatorname {Aut}_{\mathscr F_\bullet }] \to \mathrm {Covers}_{d}$ factors through an isomorphism $[\mathrm {T}_{d}/ \operatorname {Aut}_\mathscr E \times \operatorname {Aut}_{\mathscr F_\bullet }] \to [\mathrm {U}_{d}/\operatorname {Aut}_{\mathscr E, \mathscr F_\bullet }]$ and, hence, $\phi _d: [\mathrm {U}_{d}/\operatorname {Aut}_{\mathscr E, \mathscr F_\bullet }] \to \mathrm {Covers}_{d}$ is an isomorphism.

First, we verify the map $\mathrm {T}_{d} \to \mathrm {U}_{d}$ is invariant under the above mentioned $\mathbb {G}_m$ action in the cases that $d =4$ and $5$. In the degree $4$ case, scaling $\mathscr E$ by $\lambda$ and $\mathscr F$ by $\lambda ^2$ scales $\mathscr F^\vee \otimes \operatorname {Sym}^2 \mathscr E$ by $\lambda ^{-2} \cdot \lambda ^2 = 1$. In the degree $5$ case, scaling $\mathscr E$ by $\lambda ^2$ and $\mathscr F$ by $\lambda ^3$ scales $\wedge ^2 \mathscr F \otimes \mathscr E \otimes \det \mathscr E^\vee$ by $\lambda ^6 \cdot \lambda ^2 \cdot \lambda ^{2 \cdot {(-4)}} = 1$.

Therefore, to conclude the verification, it is enough to show the only elements of $\operatorname {Aut}_{\mathscr E} \times \operatorname {Aut}_{\mathscr F_\bullet }$ fixing a given section are trivial when $d = 3$ and lie in $\mathbb {G}_m$ when $d = 4$ or $5$. To start, the map $X \to \mathbb {P} \mathscr E$ realizes $X$ as a nondegenerate subscheme of $\mathbb {P} \mathscr E$ and, therefore, only the trivial element of $\operatorname {PGL}_{\mathscr E}$ fixes $X$ as a subscheme of $\mathbb {P} \mathscr E$. In the degree $3$ case, scaling by $\lambda$ in the central $\mathbb {G}_m \subset \operatorname {Aut}_{\mathscr E}$ scales the resulting section by $\lambda$, and so only the identity element of $\operatorname {Aut}_{\mathscr E}$ preserves the section. This establishes the claim when $d = 3$.

We now consider the cases $d= 4$ and $d = 5$. We are seeking automorphisms of $\mathscr E$ and $\mathscr F$ preserving a given section $\eta \in \mathrm {U}_{d}$. We have seen above that any such automorphism must act on $\mathscr E$ by some element $\lambda$ in the central $\mathbb {G}_m \subset \operatorname {Aut}_{\mathscr E}$. Since we are quotienting by a copy of $\mathbb {G}_m \subset \operatorname {Aut}_{\mathscr E} \times \operatorname {Aut}_{\mathscr F}$ which maps surjectively to the central $\mathbb {G}_m$ in $\operatorname {Aut}_{\mathscr E}$, we may modify our given automorphism so as to assume it is trivial in $\operatorname {Aut}_{\mathscr E}$. Note that when $d = 5$, we may have to pass to an fppf cover so as to extract a square root of $\lambda$. We may now assume the automorphism is trivial on $\mathscr E$ and wish to show it is also trivial on $\mathscr F$. However, the given section $\eta$ induces an injective map $\mathscr F \to \operatorname {Sym}^2 \mathscr E$, realizing $\mathscr F$ as a subsheaf of $\operatorname {Sym}^2 \mathscr E$ by Theorem 3.2(v). Since we are assuming the automorphism acts as the identity on $\mathscr E$ and it preserves this inclusion, it must also act as the identity on $\mathscr F$.

5. Defining our Hurwitz stacks

In this section, we construct and define the Hurwitz spaces we will be working with. We will ultimately be interested in the Hurwitz space whose geometric points parameterize degree $d$ $S_d$ covers of $\mathbb {P}^1$ which are smooth and connected. When one restricts to simply branched covers, such a Hurwitz scheme was constructed by Fulton [Reference FultonFul69]. Another good reference is [Reference DeopurkarDeo14, Theorem A], though this reference assumes characteristic $0$. Another excellent reference is [Reference Bertin and RomagnyBR11, Theorem 6.6.6], which constructs the Hurwitz stacks in the case that the cover is not Galois, but has a fixed Galois closure $G$, which is invertible on the base. Although we are ultimately primarily interested in counting $S_d$ covers, we will do so by realizing them as a certain proportion of the space of all degree $d$ covers, so this reference again does not quite suffice for our purposes. We were unable to find a reference that allows arbitrary branching and non-Galois covers in arbitrary characteristic, and so we give the construction here. To begin, we define a certain Hurwitz stack parameterizing covers of $\mathbb {P}^1$ which are not necessarily $S_d$ covers.

Definition 5.1 For $S$ a base scheme, and $d \geq 0$ an integer, let $\overline {\mathrm {Hur}}_{d,S}$ denote the category fibered in groups over $S$-schemes whose $T$ points over a given map of schemes $T \to S$ consists of $(T, X, h: X \to T, f: X \to \mathbb {P}^1_T)$

(5.1)

where $X$ is a scheme, $f$ is a finite locally free map of degree $d$ and $h$ is a smooth proper relative curve. A map $(T, X, h, f) \to (T, X', h', f')$ consists of a $T$-isomorphism $\alpha : X \to X'$ such that the following commutes.

(5.2)

For $g \geq 0$ an integer, let $\overline {\mathrm {Hur}}_{d,g, S}$ denote the substack parameterizing those $T$-points of $\overline {\mathrm {Hur}}_{d,S}$ such that $X \to T$ has arithmetic genus $g$.

Lemma 5.2 For $S$ a scheme, $\overline {\mathrm {Hur}}_{d,S}$ and $\overline {\mathrm {Hur}}_{d,g, S}$ are algebraic stacks.

Proof. First, we show $\overline {\mathrm {Hur}}_{d,S}$ is an algebraic stack. It is enough to establish this in the universal case $S = \operatorname {\operatorname {Spec}} \mathbb {Z}$. Observe that $\overline {\mathrm {Hur}}_{d,\mathbb {Z}}$ is a stack because descent for finite degree $d$ locally free morphisms is effective. To see it is algebraic, we construct it as a hom stack. Let $\mathfrak A_d$ denote the stack parameterizing finite locally free degree $d$ covers, as constructed in [Reference PoonenPoo08, Definition 3.2].

Next, we claim the mapping stack $\mathrm {Hom}(\mathbb {P}^1, \mathfrak A_d)$ is algebraic. This would follow from [Reference AokiAok06a, Theorem 1.1], except the theorem there is not stated correctly, as mentioned in the erratum [Reference AokiAok06b]. This erratum asserts that we only need verify the additional condition that for any complete local noetherian ring $A$ with maximal ideal ${\mathfrak m}$ and $A_n := A/{\mathfrak m}^n$, a collection of compatible maps $\mathrm {Hom}(\mathbb {P}^1_{A_n}, (\mathfrak A_d)_{A_n})$ for each $n$ lifts to a map $\mathrm {Hom}(\mathbb {P}^1_{A}, (\mathfrak A_d)_{A})$. In our setting, this condition is indeed satisfied because specifying such maps over $A_n$ corresponds to specifying degree $d$ locally free covers $X_n \to \mathbb {P}^1_{A_n}$ over $A_n$ for each $n$. Then, by Grothendieck's algebraization theorem [Reference Fantechi, Göttsche, Illusie, Kleiman, Nitsure and VistoliFGI+05, Theorem 8.4.10] such a family algebraizes to a family $X \to \mathbb {P}^1_A$ over $\operatorname {\operatorname {Spec}} A$, using the pullback of $\mathscr O_{\mathbb {P}^1}(1)$ to $X$ as the relevant ample line bundle on $X$.

The stack $\overline {\mathrm {Hur}}_{d,S}$ is then the open substack of the mapping stack $\mathrm {Hom}(\mathbb {P}^1, \mathfrak A_d)$ corresponding to those finite locally free covers $X \to \mathbb {P}^1$ which are smooth over the base.

Finally, $\overline {\mathrm {Hur}}_{d,g, S}$ is an open and closed substack of $\overline {\mathrm {Hur}}_{d,S}$ because the genus is locally constant in flat families.

Having constructed the Hurwitz stack parameterizing all degree $d$ covers of $\mathbb {P}^1$, we next construct an open substack parameterizing $S_d$ covers, over geometric fibers. For the following definition, recall that $B_n$, the $n$th Bell number, is the number of ways to partition a set of $n$ elements into subsets. Thus, for example, $B_1 = 1$, $B_2 = 2$, $B_3 = 5$ and $B_4 = 15$.

Definition 5.3 Let $S$ be a scheme with $d!$ invertible on $S$. Let $\mathrm {Hur}_{d,g,S}$ denote the substack of $\overline {\mathrm {Hur}}_{d,g, S}$ parameterizing those $(T, X, h: X \to T, f: X \to \mathbb {P}^1_T)$ such that $X^d := \underbrace {X \times _{\mathbb {P}^1_T} X \times _{\mathbb {P}^1_T} \cdots \times _{\mathbb {P}^1_T} X}_{\text {d times}}$ has $B_d$ irreducible components in each geometric fiber over $T$, where $B_d$ is the $d$th Bell number.

The above definition is a bit opaque, but the point is that it parameterizes degree $d$ covers $X \to \mathbb {P}^1$ so that the Galois closure of $K(X) \leftarrow K(\mathbb {P}^1)$ is an $S_d$ Galois extension, as we now verify.

Lemma 5.4 The fiber product $X^d$ as in Definition 5.3 always has at least $B_d$ irreducible components in each geometric fiber over $T$.

Further, it has exactly $B_d$ components if and only if $X \to \mathbb {P}^1_T$ is a degree $d$ cover whose Galois closure has Galois group $S_d$ on geometric fibers over $T$.

Proof. We may reduce to the case $T$ is a geometric point. First, we check $X^d$ has at least $B_d$ irreducible components. To see this, for any partition $U = \{ S_1, \ldots, S_{\# U}\}$ of $\{1, \ldots, d\}$ into $\# U$ many subsets, let $X^U \subset X^d$ denote the subscheme of $X^d$ given as the image $X^{\# U} \to X^d$ sending the $i$th copy of $X$ via the identity to those copies of $X$ indexed by elements of $S_i$. For each partition $V$ of $\{1, \ldots,d\}$ such that $U$ refines $V$, the closure of $X^U - \bigcup _{V, U \text { refines } V} X^V$ defines a nonempty union of irreducible components of $X^d$. We have therefore produced $B_d$ irreducible components of $X_d$, showing there are always at least $B_d$ irreducible components.

Conversely, $X^d$ has exactly $B_d$ geometric components if and only if each of the $B_d$ subschemes described in the previous paragraph are irreducible. Let us focus on the subscheme $Y$ corresponding to the partition $U =\{ \{1\}, \{2\}, \ldots, \{d\}\}$ into singletons, which has degree $d!$ over $\mathbb {P}^1$ and is the closure of the complement of the ‘fat diagonal’ in $X^d$. Observe that $X \to \mathbb {P}^1$ is generically étale because $X$ is smooth and we are assuming the characteristic of $T$ does not divide $d!$. Therefore, $Y \to \mathbb {P}^1$ is also generically étale, and contains a component whose function field is the Galois closure of the extension of function fields $K(X) \leftarrow K(\mathbb {P}^1)$. Therefore, $Y$ is irreducible if and only if $K(Y)$ is the Galois closure of $K(X) \leftarrow K(\mathbb {P}^1)$. As $Y \to \mathbb {P}^1$ has degree $d!$, this, in turn, is equivalent to $X \to \mathbb {P}^1$ having Galois closure with Galois group $S_d$. In particular, for any cover $X \to \mathbb {P}^1$ whose Galois closure is smaller than $S_d$ $X^d$ has strictly more than $B_d$ irreducible components.

Finally, we check that for any $S_d$ cover, each of the $B_d$ components described above are irreducible. As we have shown, even the component $Y$ of degree $d!$ over $\mathbb {P}^1$ is irreducible. Because all the other components correspond to intermediate extensions between $Y$ and $\mathbb {P}^1$, they are also irreducible.

We next carry out the surprisingly tricky verification that $\mathrm {Hur}_{d,g,S}$ is an open substack of $\overline {\mathrm {Hur}}_{d,g, S}$.

Proposition 5.5 For any integers $d, g \geq 0$, and $d!$ invertible on $S$, $\mathrm {Hur}_{d,g,S}$ is an open substack of $\overline {\mathrm {Hur}}_{d,g, S}$, hence an algebraic stack. Further, if we have a family of curves $X \to \mathbb {P}^1_T \to T$ corresponding to a $T$-point of $\mathrm {Hur}_{d,g,S}$, all fibers of $X$ over $T$ are geometrically irreducible.

Proof. It is enough to demonstrate $\mathrm {Hur}_{d,g,S}$ is an open substack of $\overline {\mathrm {Hur}}_{d,g, S}$. Let $X \to \mathbb {P}^1_T \to T$ be a family of smooth curves, corresponding to a point of $\overline {\mathrm {Hur}}_{d,g, S}$. Let $X^d$ denote the $d$-fold fiber product of $X$ over $\mathbb {P}^1_T$. By Lemma 5.4, any such point corresponds to an $S_d$ cover of $\mathbb {P}^1$ on geometric fibers, and therefore these geometric fibers are irreducible, verifying the final statement.

It remains to show that the locus where $X^d$ has $B_d$ irreducible fibers in geometric fibers is open on $T$. First, we will see in Lemma 5.7 that the geometric fibers of $X^d$ over $T$ have no embedded points.

Because the fibers have no embedded points, we may apply [Reference GrothendieckGro66, 12.2.1(xi)], which says that the total multiplicity (in the sense defined in [Reference GrothendieckGro65, p. 77], following [Reference GrothendieckGro65, 4.7.4], where total multiplicity is defined for integral schemes) is upper semicontinuous. From this, we conclude that the locus of geometric points in $T$ where the total multiplicity of $X^d$ is at most $B_d$ is open. By Lemma 5.4, the total multiplicity of any geometric fiber is always at least $B_d$ and, hence, the locus where the total multiplicity is exactly $B_d$ is also open. To conclude, it remains to verify the total multiplicity of any geometric fiber is equal to the number of its irreducible components. Note that the radicial multiplicity of any fiber is $1$ because $X^d$ is generically reduced, since it has a generically separable map to $\mathbb {P}^1$ by assumption that $d! \nmid \operatorname {\operatorname {char}}(k)$. It follows that the total multiplicity is equal to the separable multiplicity. By definition, the separable multiplicity of a finite-type scheme over a field is equal to $1$ if and only if the scheme is geometrically irreducible, as desired.

Remark 5.6 Later, in Lemma 9.6, we will appeal to [Reference WewersWew98] to construct substacks of $\overline {\mathrm {Hur}}_{d,g, S}$ parameterizing covers with specified Galois group $G \subset S_d$. One can also see using the method of proof of Proposition 5.5 that these form locally closed substacks, with partial ordering given by the partial ordering along inclusion of subgroups in $S_d$.

Lemma 5.7 Let $X \to \mathbb {P}^1$ be a degree $d$ map of smooth proper curves over an algebraically closed field $k$. If the characteristic of $k$ does not divide $d!$, then $X^d := \underbrace {X \times _{\mathbb {P}^1} X \times _{\mathbb {P}^1} \cdots \times _{\mathbb {P}^1} X}_{\text {d times}}$ is Cohen–Macaulay and, hence, has no embedded points.

Proof. It is enough to show $X^d$ is Cohen–Macaulay, as one-dimensional Cohen–Macaulay schemes have no embedded points. To verify $X^d$ is Cohen–Macaulay, we may do so étale locally on $\mathbb {P}^1$ and, hence, we may freely base change to the strict henselization of $\mathbb {P}^1$ at any given closed point. Using the assumption on the characteristic of $k$ and the classification of prime to $\operatorname {\operatorname {char}}(k)$ covers of the strict henselization of $k[t]$, we may assume our cover is given by extracting roots of the uniformizer. Equivalently, it is enough to verify Cohen–Macaulayness in the case $X^d$ is locally described as a localization of $k[x_1] \otimes _{k[t]} k[x_2] \otimes _{k[t]} \cdots \otimes _{k[t]} k[x_m]$ where the maps $k[t] \to k[x_i]$ are given by $t \mapsto x_i^{s_i}$, for $s_i \leq d$. We can equivalently write this tensor product as $k[x_1] \otimes _{k[t]} k[x_2] \otimes _{k[t]} \cdots \otimes _{k[t]} k[x_m] \simeq k[x_1,x_2,$ $\ldots, x_m]/(x_1^{s_1} - x_2^{s_2}, \ldots, x_1^{s_1} - x_m^{s_m}) =: R$. We wish to verify $R$ is Cohen–Macaulay. Observe that $R$ is a one-dimensional scheme, being a finite cover of $k[t]$. Since it is defined by $m-1$ equations in $\mathbb {A}^m$, it is a complete intersection and, therefore, Cohen–Macaulay.

The following remark will not be used in the remainder of the paper, but may be nice for the reader to keep in mind.

Remark 5.8 For $d > 2$ and $g \geq 1$, $\mathrm {Hur}_{d,g,S}$ is a scheme when $d!$ is invertible on $S$. We have seen above it is an algebraic stack. In order to see it is a scheme, one may first verify it is an algebraic space by checking any degree $d$ cover of $\mathbb {P}^1$ with Galois group $S_d$ for $d > 2$ has no nontrivial automorphisms [Sta, Tag 04SZ]. Indeed, if such a cover did have automorphisms, it would factor through an intermediate cover obtained by quotienting by some such nontrivial automorphism, forcing the Galois group to be smaller than $S_d$.

Having established $\mathrm {Hur}_{d,g,S}$ is an algebraic space, we next wish to explain why it is a scheme. Observe this Hurwitz space has a map to the symmetric power $\operatorname {Sym}^{2g-2+2d}_{\mathbb {P}^1}$ of $2g-2 + 2d$ points on $\mathbb {P}^1$ given by ‘taking the branch locus’. This uses that $d!$ is invertible on $S$ and Riemann–Hurwitz. One may verify this map is separated (for example, using the valuative criterion) and quasi-finite (since the inertia data around the branch points determines the cover), hence quasi-affine [Sta, Tag 082J]. Therefore, it is quasi-affine over a scheme, and therefore a scheme.

6. Defining the Casnati–Ekedahl stratification of Hurwitz stacks

For this section, we now fix a positive integer $d$ and a base field $k$ with $d!$ invertible on $k$. We parenthetically note that much of the following can be generalized to work over arbitrary base schemes. For $T$ a $k$-scheme, given a Gorenstein finite locally free degree $d$ cover $X \rightarrow \mathbb {P}^1_T$, from Theorem 3.2, we obtain a canonical sequence of vector bundles $(\mathscr E^X, \mathscr F_1^X, \mathscr F_2^X, \ldots, \mathscr F_{d-2}^X)$ on $\mathbb {P}^1_T$. We next aim to define certain locally closed substacks of $\overline {\mathrm {Hur}}_{d,g, S}$ corresponding to those covers $X \to \mathbb {P}^1_T$ whose associated vector bundles are isomorphic to some specified sequence $(\mathscr E, \mathscr F_1, \mathscr F_2, \ldots, \mathscr F_{d-2})$. To define this substack, we first define the corresponding stack of these vector bundles.

Recall that the stack of locally free rank-$n$ sheaves on $\mathbb {P}^1_k$ is an algebraic stack, as is well known; see, for example, [Reference BehrendBeh91, Proposition 4.4.6].

Definition 6.1 Let $\mathrm {Vect}^{n}_{\mathbb {P}^1_k}$ denote the moduli stack of locally free rank-$n$ sheaves on $\mathbb {P}^1_k$. For $\vec {a} = (a_1, a_2, \ldots, a_n)$, with $a_i \in \mathbb {Z}$, let $\mathscr O_{\mathbb {P}^1_k}(\vec {a}) := \bigoplus _{i=1}^n \mathscr O_{\mathbb {P}^1_k}(a_i)$ and let $\mathrm {Vect}^{\vec {a}}_{\mathbb {P}^1_k}$ denote the residual gerbe at the point corresponding to the vector bundle $\mathscr O_{\mathbb {P}^1_k}(\vec {a})$.

Remark 6.2 Note that this residual gerbe is indeed a locally closed substack by [Reference RydhRyd11, Theorem B.2]. Alternatively, the residual gerbe is given concretely as the quotient stack $B(\operatorname {Res}_{\mathbb {P}^1_k/k}(\operatorname {Aut}_{\mathscr O_{\mathbb {P}^1_k}(\vec {a})}))$.

In order to relate the genus of a cover of $\mathbb {P}^1$ to the associated vector bundle $\mathscr E$ we need the following standard lemma.

Lemma 6.3 Suppose $\rho : X \rightarrow \mathbb {P}^1_k$ is a degree $d$ Gorenstein finite locally free cover and let $\mathscr E := \ker (\rho _* \omega _X \to \mathscr O_{\mathbb {P}^1_k})$. If $h^0(X, \mathscr O_X\!) = 1$, such as in the case that $X$ is smooth and geometrically connected, then $\deg (\det \mathscr E) = g + d - 1$.

Proof. First, we claim $\rho _* \mathscr O_X \simeq \mathscr O_{\mathbb {P}^1_k} \oplus \mathscr E^\vee$. Indeed, by duality, we have a short exact sequence $\mathscr O_{\mathbb {P}^1_k} \rightarrow \rho _* \mathscr O_X \rightarrow \mathscr E^\vee$. Because all vector bundles on $\mathbb {P}^1$ split, and $h^0(\mathbb {P}^1_k, \rho _* \mathscr O_X\!) = h^0(X, \mathscr O_X\!) = 1$, we find that $\mathscr E^\vee \simeq \bigoplus _{i=1}^{d-1} \mathscr O_{\mathbb {P}^1_k}(-a_i)$ for $a_i > 0$. Because there are no extensions of $\mathscr O_{\mathbb {P}^1_k}(-a_i)$ by $\mathscr O_{\mathbb {P}^1_k}$, the above exact sequence splits, yielding $\rho _* \mathscr O_X \simeq \mathscr O_{\mathbb {P}^1_k} \oplus \mathscr E^\vee \simeq \mathscr O_{\mathbb {P}^1_k} \oplus \bigoplus _{i=1}^{d-1} \mathscr O_{\mathbb {P}^1_k}(-a_i)$. Then, for $n$ sufficiently large and $\mathscr L$ a degree $n$ line bundle on $\mathbb {P}^1_k$, Riemann Roch on the curve $X$ implies $h^0(\mathbb {P}^1_k, \mathscr O_{\mathbb {P}^1_k}(n) \oplus \bigoplus _{i=1}^{d-1} \mathscr O_{\mathbb {P}^1_k}(-a_i + n)) = h^0(\mathbb {P}^1_k,$ $\rho _* \mathscr O_X \otimes \mathscr L) = h^0(X, \rho ^* \mathscr L) = dn -g + 1$. For $n$ larger than the maximum of the $a_i$, the left-hand side is equal to $dn + d -\sum _{i=1}^{d-1} a_i$, and so we obtain $-\sum _{i=1}^{d-1} a_i = -g -d + 1$. Therefore, $\deg (\det \mathscr E) = -\deg (\det \mathscr E^\vee ) = \sum _{i=1}^{d-1}a_i = g + d - 1$.

With the relation between $g$ and $\mathscr E$ of Lemma 6.3 established, we are ready to define the Casnati–Ekedahl strata. For the next definition, we will fix vectors $\vec {a}^\mathscr E, \vec {a}^{\mathscr F_1}, \ldots, \vec {a}^{\mathscr F_{d-2}}$ and vector bundles $\mathscr E, \mathscr F_1, \ldots, \mathscr F_{\lfloor ({d-2})/{2} \rfloor }$ on $\mathbb {P}^1$ given by $\mathscr E \simeq \mathscr O_{\mathbb {P}^1_k}(\vec {a}^\mathscr E)$ and $\mathscr F_i \simeq \mathscr O_{\mathbb {P}^1_k}(\vec {a}^{\mathscr F_i})$. Note that although $d-2$ vector bundles appear in Theorem 3.2, the isomorphism classes of vector bundles $\mathscr F_i$ for $i > \lfloor ({d-2})/{2} \rfloor$ are, in fact, determined by those with $i \leq \lfloor ({d-2})/{2} \rfloor$ because duality enforces the relation $\mathscr F_{d-2} \simeq \det \mathscr E$ and for $1 \leq i \leq d-3$, $\mathscr F_{d-2-i} \simeq \det \mathscr E \otimes \mathscr F_i^\vee$.

Definition 6.4 Let $k$ be a field with $d!$ invertible on $k$, and fix a tuple of vectors $(\vec a^\mathscr E, \vec a^{\mathscr F_1} , \ldots, \vec a^{\mathscr F_{\lfloor ({d-2})/{2} \rfloor }})$. Let $g := 1 - d + \sum _{i=1}^{d-1} a_i^{\mathscr E}$ and define the Casnati–Ekedahl stratum $\mathscr M(\vec a^\mathscr E, \vec a^{\mathscr F_1} , \ldots, \vec a^{\mathscr F_{\lfloor ({d-2})/{2} \rfloor }} )$ as the locally closed substack of $\overline {\mathrm {Hur}}_{d,g, k}$ given as the fiber product

\[ \overline{\mathrm{Hur}}_{d,g, k} \times_{\mathrm{Vect}^{d-1}_{\mathbb{P}^1_k} \times \prod_{i=1}^{\lfloor ({d-2})/{2} \rfloor} \mathrm{Vect}^{\beta_i}_{\mathbb{P}^1_k}} \mathrm{Vect}^{\vec a^\mathscr E }_{\mathbb{P}^1_k} \times \prod_{i=1}^{\lfloor ({d-2})/{2} \rfloor} \mathrm{Vect}^{\vec a^{\mathscr F_i}}_{\mathbb{P}^1_k}. \]

Here, $\beta _i$ are as in Theorem 3.2, and the map $\overline {\mathrm {Hur}}_{d,g, k} \to \mathrm {Vect}^{d-1}_{\mathbb {P}^1_k} \times \prod _{i=1}^{\lfloor ({d-2})/{2} \rfloor } \mathrm {Vect}^{\beta _i}_{\mathbb {P}^1_k}$ is induced by Theorem 3.2. In other words, $\mathscr M(\vec a^\mathscr E, \vec a^{\mathscr F_1}, \ldots, \vec a^{\mathscr F_{\lfloor ({d-2})/{2} \rfloor }} )$ is the locally closed substack of the Hurwitz stack such that the associated morphism $T \rightarrow \mathrm {Vect}^{d-1}_{\mathbb {P}^1_k} \times \prod _{i=1}^{\lfloor ({d-2})/{2} \rfloor } \mathrm {Vect}^{\beta _i}_{\mathbb {P}^1_k}$ factors through a map $T \rightarrow \mathrm {Vect}^{\vec a^\mathscr E }_{\mathbb {P}^1_k} \times \prod _{i=1}^{\lfloor ({d-2})/{2} \rfloor } \mathrm {Vect}^{\vec a^{\mathscr F_i}}_{\mathbb {P}^1_k}$.

Remark 6.5 There is a natural generalization of the construction of Casnati–Ekedahl strata of covers of $\mathbb {P}^1$ to a version for covers of genus-$g$ curves $C$ in place of the genus-$0$ curve $\mathbb {P}^1$. Namely, given a finite locally free cover $C' \to C$ over a base $T$, using Theorem 3.2, one can associate a sequence of vector bundles on the relative curve $C \rightarrow T$. A given Casnati–Ekedahl stratum would naturally be defined as the locus where these bundles have specific Harder–Narasimhan filtration, generalizing the notion of splitting type.

Remark 6.6 Since the substacks $\mathrm {Vect}^{\vec a}_{\mathbb {P}^1_k}$ form a stratification of $\mathrm {Vect}^{n}_{\mathbb {P}^1_k}$, it follows that the Casnati–Ekedahl strata, varying over all tuples $(\vec a^\mathscr E, \vec a^{\mathscr F_1}, \ldots, \vec a^{\mathscr F_{\lfloor ({d-2})/{2} \rfloor }} )$ form a stratification of $\overline {\mathrm {Hur}}_{d,g, k}$. This will enable us to write the class of $\overline {\mathrm {Hur}}_{d,g, k}$ in the Grothendieck ring as the sum of the classes of $\mathscr M(\vec a^\mathscr E, \vec a^{\mathscr F_1}, \ldots, \vec a^{\mathscr F_{\lfloor ({d-2})/{2} \rfloor }} )$ for $(\vec a^\mathscr E, \vec a^{\mathscr F_1}, \ldots, \vec a^{\mathscr F_{\lfloor ({d-2})/{2} \rfloor }} )$ varying over all integer tuples of vectors.

To conclude this section, we introduce some notation for objects we will associate with a Casnati–Ekedahl stratum of the Hurwitz stack.

Notation 6.7 For $3 \leq d \leq 5$, $\mathscr M(\vec a^\mathscr E, \vec a^{\mathscr F_1} , \ldots, \vec a^{\mathscr F_{\lfloor ({d-2})/{2} \rfloor }}) \subset \overline {\mathrm {Hur}}_{d,g, k}$ a Casnati–Ekedahl stratum, for $\mathscr E :=\bigoplus _j \mathscr O(\vec a^\mathscr E_i), \mathscr F_\bullet := \bigoplus _j \mathscr O(\vec a^{\mathscr F_\bullet }_j)$, define $\operatorname {Aut}_{\mathscr M} := \operatorname {Aut}^{\mathbb {P}^1_k/k}_{\mathscr E, \mathscr F_\bullet }$, as defined in Definition 4.2, depending on the value of $d$. In addition, for $f: T \rightarrow \mathbb {P}^1_k$ denote $\operatorname {Aut}_{f^* \mathscr M} := \operatorname {Aut}^{T/k}_{f^* \mathscr E, f^* \mathscr F_\bullet }$. When the map $f$ is understood, we also use $\operatorname {Aut}_{\mathscr M|_T}$ as notation for $\operatorname {Aut}_{f^* \mathscr M}$.

Remark 6.8 The construction $\operatorname {Aut}_{\mathscr M|_T}$ at the end of Notation 6.7 will primarily be used when $T = D$, the dual numbers, mapping to a point of $\mathbb {P}^1_k$. Note that, in this case $\mathscr E|_D$ and $\mathscr F_\bullet |_T$ are free vector bundles because all locally free bundles over $D$ are free.

We conclude this section with a general discussion about the moduli stack of vector bundles on $\mathbb {P}^1_k$. This will be useful in later sections, specifically in Lemma 9.11.

6.9 Discussion of the moduli stack of vector bundles on $\mathbb {P}^1_k$

Recall that, for $k$ a field, every vector bundle $\mathscr V$ on $\mathbb {P}^1_k$ of rank $r$ and degree $\delta$ can be written as $\mathscr V \simeq \bigoplus _{i=1}^r \mathscr O_{\mathbb {P}^1_k}(a_i)$ where $\sum _{i=1}^r a_i = \delta$. The moduli stack of vector bundles of rank $r$ and degree $\delta$ on $\mathbb {P}^1_k$ is smooth and connected. The generic point of this moduli stack is given by a balanced bundle. Formally, a vector bundle $\mathscr V$ on $\mathbb {P}^1$ is balanced if it can be written as $\mathscr V \simeq \bigoplus _{i=1}^r \mathscr O(a_i)$ with $|a_i - a_j| \leq 1$ for all $1 \leq i \leq j \leq r$.

We can now describe when one degree $\delta$, rank-$r$ vector bundle $\mathscr V$, viewed as a point on the moduli stack, lies in the closure of a point corresponding to another degree $\delta$, rank-$r$ vector bundle $\mathscr W$. See [Reference Eisenbud and HarrisEH16, Theorem 14.7(a)] for a proof of the following description. Suppose $\mathscr V \simeq \bigoplus _{i=1}^r \mathscr O_{\mathbb {P}^1_k}(a_i) = \mathscr O_{\mathbb {P}^1_k}(\vec {a})$, and there are some $a_i, a_j$ with $a_i \leq a_j-2$. Let $\sigma _{i,j}(\vec {a}):= (a_1, \ldots, a_i +1, \ldots, a_j - 1, \ldots, a_r)$ so that $\sigma _{i,j}(\vec {a})$ agrees with $\vec {a}$ except in positions $i$ and $j$. Then $\mathscr O_{\mathbb {P}^1_k}(\sigma _{i,j}(\vec {a}))$ lies in the closure of $\mathscr O_{\mathbb {P}^1_k}(\vec {a})$. Informally, one bundle lies in the closure of another if one can find a sequence of moves as above relating one to the other. More precisely, $\mathscr O_{\mathbb {P}^1_k}(\vec {b})$ lies in the closure of $\mathscr O_{\mathbb {P}^1_k}(\vec {a})$ if we can write $\vec {b} = \sigma _{i_1, j_1} \circ \sigma _{i_2, j_2} \cdots \circ \sigma _{i_m, j_m}(\vec {a})$ for some non-negative integer $m$ and integers $i_1, \ldots, i_m, j_1, \ldots, j_m$. In particular, if one starts with any vector bundle of rank $r$ and degree $\delta$, one can sequentially move the entries of $\vec {a}$ closer together, which shows that a balanced bundle correspond to the generic point of the moduli stack.

7. Presentations of the Casnati–Ekedahl strata

We next aim to use the parametrizations from § 3 in order to describe each of the $\mathscr M(\mathscr E, \mathscr F_\bullet )$ for $3 \leq d \leq 5$ as the quotient of an open in affine space by an appropriate group action. Because we will also want to parameterize simply branched covers, it will be useful to restrict the possible ramification types of these covers. We now introduce the notion of ramification profile, which describes the possible ramification types of a finite cover of $\mathbb {P}^1$ by a smooth curve.

Definition 7.1 (Ramification profile)

Fix a positive integer $d$ and let $R = (r_1^{t_1}, r_2^{t_2}, \ldots, r_n^{t_n})$ denote a partition of $d$, i.e. a collection of integers with $t_1, \ldots, t_n \geq 1$ so that $\sum _{i=1}^n t_i r_i = d$. Here, we think of $r_i$ as the part sizes appearing in the partition and $t_i$ as the corresponding multiplicity. A ramification profile of degree $d$ is a partition of $d$. For $X \rightarrow S$ a scheme, we say $X$ has ramification profile $R$ if for every geometric point $\operatorname {\operatorname {Spec}} k \in S$, the base change $X_k := X \times _S \operatorname {\operatorname {Spec}} k$ is isomorphic to $\coprod _{i=1}^n (\coprod _{j=1}^{t_i} \operatorname {\operatorname {Spec}} k[x]/(x^{r_i}))$. We let $r(R) := \sum _{i=1}^n (r_i-1)t_i$ denote the associated ramification order.

One way to think about ramification profiles as defined above is to think of each fiber $X_k$ of $X \to S$ having a partition into curvilinear schemes (i.e. schemes with cotangent spaces of dimension at most $1$ at every point) of degrees determined by the partition $R$.

We next introduce the notion of an allowable collection of ramification profiles. The point of allowable collections is that covers of $\mathbb {P}^1$ whose ramification profiles lie in an allowable collection define an open substack of the Hurwitz stack with closed complement of high codimension. We use the notation $\lambda \vdash n$ to indicate that $\lambda$ is a partition of $n$.

Definition 7.2 Fix an integer $d$. Let $\mathcal {R}$ denote a collection of ramification profiles of degree $d$. We say $\mathcal {R}$ is an allowable collection of ramification profiles of degree $d$ if:

  1. (1) $\mathcal {R}$ includes $(1^d)$ and $(2, 1^{d-2})$;

  2. (2) whenever $\lambda \vdash d$ lies in $\mathcal {R}$, and $\lambda ' \vdash d$ is a partition refining $\lambda$, then $\lambda '$ also lies in $\mathcal {R}$.

In the remainder of this section, we first define certain open substacks of Hurwitz stacks with restricted ramification, lying in an allowable collection $\mathcal {R}$. Following this, we define a certain space of sections of a vector bundle on $\mathbb {P}^1$ parameterizing smooth degree $d$ covers (for $3 \leq d \leq 5$) with specified ramification profiles in an allowable collection.

Definition 7.3 Suppose $k$ is a field with $d!$ invertible on $k$. For $\mathcal {R}$ an allowable collection of ramification profiles of degree $d$, let $\overline {\mathrm {Hur}}_{d,g, k}^{\mathcal {R}} \subset \overline {\mathrm {Hur}}_{d,g, k}$ denote the open substack of $\overline {\mathrm {Hur}}_{d,g, k}$ (we prove it is open in Lemma 7.5) whose $T$ points parameterize smooth curves $X \to \mathbb {P}^1_T$ over $T$ so that for each geometric point $\operatorname {\operatorname {Spec}} \kappa \to \mathbb {P}^1_T$, $X_{\kappa }$ has ramification profile in $\mathcal {R}$. Let $\mathrm {Hur}_{d,g,k}^{{\mathcal {R}}} \subset \mathrm {Hur}_{d,g,k}$ denote the restriction of $\overline {\mathrm {Hur}}_{d,g, k}^{\mathcal {R}}$ along $\mathrm {Hur}_{d,g,k} \subset \overline {\mathrm {Hur}}_{d,g, k}$.

Similarly, for $\mathscr M \subset \overline {\mathrm {Hur}}_{d,g, k}$ a Casnati–Ekedahl stratum, let $\mathscr M^{\mathcal {R}} \subset \mathscr M$ denote the open substack $\mathscr M \times _{\overline {\mathrm {Hur}}_{d,g, k}} \overline {\mathrm {Hur}}_{d,g, k}^{\mathcal {R}} \subset \mathscr M$. We use $\mathscr M^{\mathcal {R}, S_d}$ to denote the pullback of $\mathscr M^{\mathcal {R}}$ along $\mathrm {Hur}_{d,g,k}^{{\mathcal {R}}} \subset \overline {\mathrm {Hur}}_{d,g, k}^{\mathcal {R}}$.

Remark 7.4 In the case $d = 2$, the only allowable $\mathcal {R}$ is $\mathcal {R} = \{(1^2), (2)\}$ and in this case $\mathrm {Hur}_{2,g,k}^{{\mathcal {R}}} = \mathrm {Hur}_{2,g,k}$.

We now verify that the locus of $\overline {\mathrm {Hur}}_{d,g, k}^{\mathcal {R}} \subset \overline {\mathrm {Hur}}_{d,g, k}$ is an open substack.

Lemma 7.5 With notation as in Definition 7.3, $\overline {\mathrm {Hur}}_{d,g, k}^{\mathcal {R}} \subset \overline {\mathrm {Hur}}_{d,g, k}$ is an open substack.

Proof. As a first step, we will show this is a constructible subset. To do so, we can define certain substacks of $\overline {\mathrm {Hur}}_{d,g, k}$ parameterizing covers $f: C \to \mathbb {P}^1_k$ so that the multiset of ramification profiles over the geometric branch points of $f$ is equal to some fixed multiset $S$, which we call $(\overline {\mathrm {Hur}}_{d,g, k})^S$. One can show $(\overline {\mathrm {Hur}}_{d,g, k})^S$ is an algebraic stack. (See [Reference Bertin and RomagnyBR11, Theorem 6.6.6] for a very closely related construction.) Therefore, the image of any of these stacks in $\overline {\mathrm {Hur}}_{d,g, k}$ is constructible. Since the underlying set of $\overline {\mathrm {Hur}}_{d,g, k}^{\mathcal {R}}$ is a finite union $\coprod _S (\overline {\mathrm {Hur}}_{d,g, k})^S$ for all possible multisets $S$ producing genus-$g$ covers which only include ramification lying in $\mathcal {R}$, we obtain that $\overline {\mathrm {Hur}}_{d,g, k}^{\mathcal {R}}$ is a constructible subset of $\overline {\mathrm {Hur}}_{d,g, k}$.

To conclude, we wish to show this constructible subset is in fact an open subset. To do so, we only need show it is closed under generization. However, if a point of $(\overline {\mathrm {Hur}}_{d,g, k})^S$ has a generization which is a point of $(\overline {\mathrm {Hur}}_{d,g, k})^{S'}$ then all ramification profiles appearing in $S'$ must be refinements of those appearing in $S$. Therefore, condition $(2)$ from the definition of allowable collection of ramification profiles, Definition 7.2, shows that $\overline {\mathrm {Hur}}_{d,g, k}^{\mathcal {R}}$ is indeed closed under generization, and so defines an open substack of $\overline {\mathrm {Hur}}_{d,g, k}$.

We next give analogs of the restricted ramification loci above for spaces of sections.

Definition 7.6 For $3 \leq d \leq 5$, suppose $d!$ is invertible on $k$. Fix a choice of Casnati–Ekedahl stratum $\mathscr M := \mathscr M(\vec a^\mathscr E, \vec a^{\mathscr F_1}, \ldots, \vec a^{\mathscr F_{\lfloor ({d-2})/{2} \rfloor }})$, with associated locally free sheaves on $\mathbb {P}^1_k$ given by $\mathscr E_\mathscr M := \bigoplus \mathscr O(\vec a^\mathscr E_i)$, and, if $4 \leq d \leq 5$, $\mathscr F_\mathscr M := \bigoplus \mathscr O(\vec a^{\mathscr F_1}_j)$. Let $g := \deg \det \mathscr E_\mathscr M - d + 1$. Let $\mathscr H_\mathscr M$ denote the associated locally free sheaf on $\mathbb {P}^1$ defined in (3.6). Let $\mathcal {R}$ denote an allowable collection of ramification profiles. Then, define $\mathrm {U}_{ \mathscr M}^{\mathcal {R}}$ to be the open subscheme (we prove openness in Lemma 7.7) of $\operatorname {\operatorname {Spec}} \operatorname {Sym}^\bullet H^0(\mathbb {P}^1_k, \mathscr H_\mathscr M)^\vee$ parameterizing $T$-points $\eta$ so that $\Psi _d(\eta )$ defines a smooth proper curve over $T$ with geometrically connected fibers such that over each geometric point $\operatorname {\operatorname {Spec}} \kappa \to \mathbb {P}^1_T$, the pullback of $\Phi _d(\eta ) \subset \mathbb {P} \mathscr E \rightarrow \mathbb {P}^1_T$ along $\operatorname {\operatorname {Spec}} \kappa \rightarrow \mathbb {P}^1_T$ has ramification profile lying in $\mathcal {R}$.

In addition, define $\mathrm {U}_{\mathscr M}^{\mathcal {R}, S_d} \subset \mathrm {U}_{\mathscr M}^{\mathcal {R}}$ as the open subscheme parameterizing those sections $\eta$ for which $\Psi _d(\eta )$ is a smooth curve $X$ with geometrically connected fibers, such that over each fiber, the cover $X \to \mathbb {P}^1$ of degree $d$ has Galois closure which is an $S_d$ cover. In other words, $\mathrm {U}_{\mathscr M}^{\mathcal {R}, S_d}$ is the subset of $\mathrm {U}_{\mathscr M}^{\mathcal {R}}$ for which the map $\Psi _d$ defines a point of $\mathrm {Hur}_{d,g,k}$.

In the above definition, we claimed $\mathrm {U}_{ \mathscr M}^{\mathcal {R}} \subset \operatorname {\operatorname {Spec}} \operatorname {Sym}^\bullet H^0(\mathbb {P}^1_k, \mathscr H_\mathscr M)^\vee$ is an open subscheme. We now justify this.

Lemma 7.7 The subset $\mathrm {U}_{ \mathscr M}^{\mathcal {R}} \subset \operatorname {\operatorname {Spec}} \operatorname {Sym}^\bullet H^0(\mathbb {P}^1_k, \mathscr H_\mathscr M)^\vee$ naturally has the structure of an open subscheme.

Proof. Let $W_d \subset \operatorname {\operatorname {Spec}} \operatorname {Sym}^\bullet H^0(\mathbb {P}^1_k, \mathscr H_\mathscr M)^\vee$ denote the open subscheme parameterizing those sections $\eta$ for which $\Psi _d(\eta )$ has degree $d$ on all fibers. There is a map $W_d \to \overline {\mathrm {Hur}}_{d,g, k}$, induced by $\Psi _d$ sending $\eta \mapsto \Psi _d(\eta )$. Under this map, $\mathrm {U}_{\mathscr M}^{\mathcal {R}}$ is the preimage of $\overline {\mathrm {Hur}}_{d,g, k}^{\mathcal {R}}$, which is open by Lemma 7.5. Hence, $\mathrm {U}_{ \mathscr M}^{\mathcal {R}} \subset \operatorname {\operatorname {Spec}} \operatorname {Sym}^\bullet H^0(\mathbb {P}^1_k, \mathscr H_\mathscr M)^\vee$ is open.

Example 7.8 If we take $\mathcal {R}$ in Definition 7.6 to range over all possible ramification profiles (i.e. all partitions of $d$) then $\mathrm {U}_{ \mathscr M}^{\mathcal {R}}$ corresponds to all sections $\eta$ as in Definition 7.6 with $\Phi _d(\eta )$ a smooth geometrically connected degree $d$ cover of $\mathbb {P}^1_k$.

On the other hand, if we take $\mathcal {R}$ to be the union of two ramification profiles, the first given by $1^d$ and the second given by $(2, 1^{d-2})$, we obtain all sections $\eta$ with $\Phi _d(\eta )$ a smooth geometrically connected curve which is simply branched over $\mathbb {P}^1$.

7.9 Writing the class as a sum over Casnati–Ekedahl strata

Our goal for the remainder of the section is to express the class of the Hurwitz stack as a sum over the Casnati–Ekedahl strata, which will be somewhat more manageable due to their descriptions as quotients of opens in affine spaces by relatively simple algebraic groups.

Proposition 7.10 For $3 \leq d \leq 5$, and $\mathcal {R}$ an allowable collection of ramification profiles of degree $d$, we have an equality in $K_0(\mathrm {Stacks}_k)$

\[ \{\mathrm{Hur}_{d,g,k}^{{\mathcal{R}}}\} = \sum_{\text{Casnati--Ekedahl strata }\mathscr M} \frac{\{\mathrm{U}_{ \mathscr M}^{\mathcal{R}, S_d}\}}{\{\operatorname{Aut}_{\mathscr M}\}}. \]

Proof assuming Proposition 7.11 and Lemma 7.12 We claim

\begin{align*} \{\mathrm{Hur}_{d,g,k}^{{\mathcal{R}}}\} &= \sum_{\text{Casnati--Ekedahl strata }\mathscr M} \{\mathscr M^{\mathcal{R}, S_d}\}\\ &= \sum_{\text{Casnati--Ekedahl strata }\mathscr M} \bigg\{\bigg[\frac{\mathrm{U}_{ \mathscr M}^{\mathcal{R},S_d}}{\operatorname{Aut}_{\mathscr M}}\bigg]\bigg\} \\ &= \sum_{\text{Casnati--Ekedahl strata }\mathscr M} \frac{\{\mathrm{U}_{ \mathscr M}^{\mathcal{R}, S_d}\}}{\{\operatorname{Aut}_{\mathscr M}\}}. \end{align*}

The first equality holds because the Casnati–Ekedahl strata form a stratification of $\overline {\mathrm {Hur}}_{d,g, k}$ by locally closed substacks. The second holds by Proposition 7.11. The final equality holds by Lemma 7.12, using both that $\operatorname {Aut}_{\mathscr M}$ is special so $\{\operatorname {Aut}_{\mathscr M}\} \{[ {\mathrm {U}_{ \mathscr M}^{\mathcal {R}, S_d}}/{\operatorname {Aut}_{\mathscr M}}]\} = \{\mathrm {U}_{ \mathscr M}^{\mathcal {R}, S_d}\}$ by [Reference EkedahlEke09a, Proposition 1.4(i)], and that $\{\operatorname {Aut}_{\mathscr M}\}$ is invertible.

To conclude our proof of Proposition 7.10, we need to verify Proposition 7.11 and Lemma 7.12. We omit the proof of Proposition 7.11 since it is analogous to Proposition 4.8, where we additionally fix isomorphisms to fixed bundles $\mathscr E, \mathscr F_\bullet$ on $\mathbb {P}^1_k$ (as opposed to trivial bundles on $\operatorname {\operatorname {Spec}} \mathbb {Z}$) and add in conditions associated to the ramification profiles in $\mathcal {R}$ and lying in $\mathrm {Hur}_{d,g,k}$ appropriately.

Proposition 7.11 For $3 \leq d \leq 5$, fix a choice of Casnati–Ekedahl stratum $\mathscr M := \mathscr M(\vec a^\mathscr E, \vec a^{\mathscr F_1} , \ldots, \vec a^{\mathscr F_{\lfloor ({d-2})/{2} \rfloor }})$ with associated sheaves $\mathscr E_\mathscr M$ and, if $4 \leq d \leq 5$, $\mathscr F_\mathscr M$ as in Definition 7.6. There are isomorphisms $[\mathrm {U}_{ \mathscr M}^{\mathcal {R}} / \operatorname {Aut}_{\mathscr M}] \simeq \mathscr M^{\mathcal {R}}$ and $[\mathrm {U}_{ \mathscr M}^{\mathcal {R}, S_d} / \operatorname {Aut}_{\mathscr M}] \simeq \mathscr M^{\mathcal {R}, S_d}$.

We now verify the relevant automorphism groups are special. Because later we will have to deal with an analogous construction over the dual numbers $D$, we include that setting in the following lemma as well.

Lemma 7.12 For $\mathscr V$ any vector bundle on $Y$, for $Y = \mathbb {P}^1_k$ or $Y = D$, $\operatorname {Res}_{Y/k}(\operatorname {Aut}_{\mathscr V})$ and $\operatorname {Res}_{Y/k}(\ker (\det : \operatorname {Aut}_{\mathscr E} \to \mathbb {G}_m))$ are special and their classes are invertible in $K_0(\mathrm {Stacks}_k)$.

When $Y = \mathbb {P}^1$ or $Y = D$, the three group schemes appearing in (4.1) in the cases $d = 3$, $4$, and $5$ are special. Further, the classes of these groups are invertible in $K_0(\mathrm {Stacks}_k)$.

Proof. We only explicate the proof in the case $Y= \mathbb {P}^1_k$, since the proof when $Y = D$ is analogous but simpler (noting that all vector bundles are trivial over $D$).

First we show that for any vector bundle $\mathscr G$ on $\mathbb {P}^1_k$, $\operatorname {Res}_{\mathbb {P}^1_k/k}(\operatorname {Aut}_{\mathscr G})$ is special. The reason for this is as follows. Write $\mathscr G = \bigoplus _{i=1}^m \mathscr O_{\mathbb {P}^1_k}(a_i)^{n_i}$ with $a_1 \leq \cdots \leq a_m$. We can express $\operatorname {Res}_{\mathbb {P}^1_k/k}(\operatorname {Aut}_{\mathscr G}) \simeq \prod _i \operatorname {GL}_{n_i} \ltimes \prod _{i< j} V_{ij}$ where $V_{ij}$ is the vector group $V_{ij} = \operatorname {Res}_{\mathbb {P}^1_k/k}(\mathrm {Hom}(\mathscr O_{\mathbb {P}^1_k}(a_i)^{n_i}, \mathscr O_{\mathbb {P}^1_k}(a_j)^{n_j})) \simeq \mathbb {G}_a^{(a_j - a_i+1)n_i n_j}$. It will also be useful to note that $\ker (\det ) : \operatorname {Res}_{\mathbb {P}^1_k/k}(\operatorname {Aut}_{\mathscr G} )\to \operatorname {Res}_{\mathbb {P}^1_k/k}(\mathbb {G}_m)$ is special, since it can be expressed as an extension of a power of $\mathbb {G}_m$ by $\prod _i \mathrm {SL}_{n_i} \ltimes \prod _{i< j} V_{ij}$, both of which are special. These statements imply the first part of the lemma.

We now check the groups $\operatorname {Aut}^{\mathbb {P}^1_k/k}_{\mathscr E, \mathscr F_\bullet }$ are special when $d = 3$, $4$, and $5$. The above observations immediately implies the claim when $d = 3$. To deal with the cases $d =4$ and $d =5$, we use Lemma 4.4. In both cases, the composition coming from Lemma 4.4 $\operatorname {Aut}_{\mathscr E, \mathscr F}^{\mathbb {P}^1_k/k} \to \operatorname {Res}_{\mathbb {P}^1_k/k}(\operatorname {Aut}_{\mathscr E/\mathbb {P}^1_k}) \times \operatorname {Res}_{\mathbb {P}^1_k/k}(\operatorname {Aut}_{\mathscr F/\mathbb {P}^1_k}) \to \operatorname {Res}_{\mathbb {P}^1_k/k}(\operatorname {Aut}_{\mathscr E/\mathbb {P}^1_k})$ is surjective. From the description in Lemma 4.4, the kernel of this composition is identified with $\ker (\det ) : \operatorname {Res}_{\mathbb {P}^1_k/k}(\operatorname {Aut}_{\mathscr F/\mathbb {P}^1_k}) \to \mathbb {G}_m$. As mentioned above, this is special, and so $\operatorname {Aut}^{\mathbb {P}^1_k/k}_{\mathscr E, \mathscr F_\bullet }$ is an extension of special group schemes, hence special.

By the above explicit description of $\operatorname {Aut}^{\mathbb {P}^1_k/k}_{\mathscr E, \mathscr F_\bullet }$ in terms of classes of special linear groups, general linear groups, and vector groups, we conclude that $\operatorname {Aut}^{\mathbb {P}^1_k/k}_{\mathscr E, \mathscr F_\bullet }$ has class which is a product of powers of $\mathbb {L}$, and expressions of the form $\mathbb {L}^s - 1$ for varying $s$. Therefore, $\operatorname {Aut}^{\mathbb {P}^1_k/k}_{\mathscr E, \mathscr F_\bullet }$ is invertible in $K_0(\mathrm {Stacks}_k)$.

8. Computing the local classes

The goal of this section is to compute the classes of sections over the dual numbers in Theorem 8.9. These classes can be thought of as describing the ‘probability’ that a curve is smooth at a point and has a certain ramification profile. We will then use these classes to sieve for smoothness and ramification conditions by employing the work of Bilu and Howe [Reference Bilu and HoweBH21] in Proposition 9.10. The condition of smoothness can be rephrased as a local condition over an infinitesimal neighborhood of the point in $\mathbb {P}^1$. We will first prove Theorem 8.3 which computes this ‘probability’ for abstract covers, and from this deduce Theorem 8.9, which computes this ‘probability’ for sections of $\mathscr H(\mathscr E, \mathscr F_\bullet )$. Theorem 8.3 can be thought of as a motivic analog of Bhargava's mass formulas for counting local fields [Reference BhargavaBha07], though we note that the interesting part of [Reference BhargavaBha07] is when there is wild ramification, and our hypothesis eliminates that possibility. On the other hand, it is still interesting to upgrade even the (much easier) tame mass formula to a motivic statement.

The idea for computing these local classes seems one of the main new insights of this paper. In the arithmetic analogs of this work, one is able to directly count the number of sections over $\mathbb {Z}/p^2\mathbb {Z}$, see [Reference Bhargava, Shankar and TsimermanBST13, Lemma 18] for the degree 3 case, [Reference BhargavaBha04, Lemma 23] for the degree $4$ case, and [Reference BhargavaBha08, Lemma 20] for the degree $5$ case. In the Grothendieck ring, when working over infinite fields, there are infinitely many sections, and so to determine the relevant class, direct counting is no longer possible. We relate computing the classes of these sections to computing the classes of the classifying stacks of abstract automorphism groups of the corresponding schemes. These classes can, in turn, be computed using stacky symmetric powers $\operatorname {Symm}^n$ (see Definition 8.15) and the class of $BS_n$. An observation which is the key to the proof of Theorem 8.9 is that for $G$ a group scheme, we have an isomorphism of stacks $\operatorname {Symm}^n (BG) \simeq B(G \wr S_n)$.

Throughout this section, we fix $d \in \mathbb {Z}_{\geq 1}$ an integer and let $k$ be a field with $\operatorname {\operatorname {char}}(k) \nmid d!$. For later explicit calculations, it will be convenient to work with the following explicit scheme $X_{(R)}$ over $D$ which has ramification profile $R =(r_1^{t_1}, \ldots, r_n^{t_n})$, with $\sum _{i=1}^n r_i t_i = d$, over the closed point of $D$. Define

(8.1)\begin{equation} X_{(R)} := \coprod_{i=1}^n \bigg(\coprod_{j=1}^{t_i} \operatorname{\operatorname{Spec}} k[x, \varepsilon]/(x^{r_i}-\varepsilon, \varepsilon^2)\bigg). \end{equation}

Thus, $X_{(R)}$ is a disjoint union of curvilinear schemes flat over the dual numbers, which have degrees over the dual numbers corresponding to elements of the partition. In particular, the total degree of $X_{(R)}$ over the dual numbers is $d$. We use the parentheses around $R$ in $X_{(R)}$ to distinguish it from the base change of $X$ to $R$.

Recall we defined $\mathrm {Covers}_{d}$ prior to Definition 4.7 as the algebraic stack parameterizing degree $d$ finite locally free covers over a base field $k$.

Definition 8.1 We let $\mathscr X_{R,d} \subset \operatorname {Res}_{D/k}(\mathrm {Covers}_{d} \times _{\operatorname {\operatorname {Spec}} k} D)$ denote the residual gerbe at the $k$-point of $\operatorname {Res}_{D/k}(\mathrm {Covers}_{d} \times _{\operatorname {\operatorname {Spec}} k} D)$ corresponding to the $D$-point of $\mathrm {Covers}_{d}$ given by $X_{(R)}$.

Remark 8.2 Since we have an induced monomorphism $B(\operatorname {Res}_{D/k}(\operatorname {Aut}_{X_{(R)}/D})) \to \mathrm {Covers}_{d}$ and an epimorphism $\operatorname {\operatorname {Spec}} k \to B(\operatorname {Res}_{D/k}(\operatorname {Aut}_{X_{(R)}/D}))$, it follows that $\mathscr X_{R,d}$ is equivalent to $B(\operatorname {Res}_{D/k}(\operatorname {Aut}_{X_{(R)}/D}))$ from the universal property for residual gerbes.

Our main result of this section is to compute the class of $\mathscr X_{R,d}$ in $K_0(\mathrm {Stacks}_k)$, and we complete the proof at the end of the section in § 8.17.

Theorem 8.3 Let $R$ be a ramification profile which is a partition of $d$. Let $r(R)$ be the ramification order associated to the ramification profile $R$, as defined in Definition 7.1. Then, for $k$ a field with $\operatorname {\operatorname {char}}(k) \nmid d!$, we have

\[ \{\mathscr X_{R,d}\} = \mathbb{L}^{-r(R)} \]

in $K_0(\mathrm {Stacks}_k)$.

The plan for the rest of the section is to first use Theorem 8.3 to deduce the local condition for a section of $\mathscr H(\mathscr E,\mathscr F_\bullet )$ to be smooth in Theorem 8.9. Following this, we devote the remainder of the section to proving Theorem 8.3. The main idea is to directly compute the automorphism group of $X_{(R)}$ in terms of its combinatorial data starting in § 8.10 and culminating in Corollary 8.12. Using this, we will then be able to compute the class of the classifying stack of the resulting affine (but typically quite disconnected) group scheme in § 8.13. For this, we appeal to a result of Ekedahl on stacky symmetric powers and another result of Ekedahl showing $\{BS_d\} =1$. We complete the proof of Theorem 8.3 in § 8.17.

Remark 8.4 With some additional work, one can also prove a variant of Theorem 8.3 which computes the class of the locally closed subscheme $\mathscr Z_{R,d}$ of the Hilbert scheme $\operatorname {Res}_{D/k} (\operatorname {Hilb}_{\mathbb {P}^{d-2}_D/D}^d)$ parameterizing curvilinear nondegenerate subschemes with ramification profile $R$ so that on any geometric fiber, no degree-($d - 1$) subscheme is contained in a hyperplane. One can show, $\{\mathscr Z_{R,d}\} = \{\operatorname {PGL}_{d-1}\} \mathbb {L}^{\dim \operatorname {PGL}_{d-1}-r(R)}$. Note there is some subtlety in verifying this because this Hilbert scheme is naturally a $\operatorname {Res}_{D/k}(\operatorname {PGL}_{d-1})$ torsor over $\mathrm {Covers}_{d}$, and $\operatorname {PGL}_{d-1}$ is not a special group. Nevertheless, one may prove this by ‘linearizing the action’ so as to construct this as a quotient of a $\operatorname {Res}_{D/k}(\operatorname {GL}_{d-1})$ torsor by $\operatorname {Res}_{D/k}(\mathbb {G}_m)$, both of which are special.

8.5 Using Theorem 8.3 to compute smooth sections

Before proving Theorem 8.3, we will see how it can be used to determine local conditions for a section in a given Casnati–Ekedahl stratum to be smooth. In order to apply Theorem 8.3 to our problem of computing the classes of Hurwitz stacks we want to relate it to sections of the sheaf $\mathscr H(\mathscr E,\mathscr F_\bullet )$ on $D$ (for $\mathscr E$ and $\mathscr F_\bullet$ trivial sheaves on $D$ of appropriate ranks as in Notation 3.8, depending on $d$ with $3 \leq d \leq 5$). For this we need a generalization of Proposition 4.8 where we take a Weil restriction from the dual numbers. More precisely, for $3 \leq d \leq 5$, the map $\mu _d: \mathrm {U}_{d} \to \mathrm {Covers}_{d}$ defined in Definition 4.7 induces a map $\operatorname {Res}_{D/k}(\mu _d) : \operatorname {Res}_{D/k}((\mathrm {U}_{d})_D) \to \operatorname {Res}_{D/k}((\mathrm {Covers}_{d})_D)$. Since $\mu _d$ is invariant for the action of $\operatorname {Aut}_{\mathscr E, \mathscr F_\bullet }$ as in Definition 4.7, we obtain a map $\phi _d^{D/k}: [\operatorname {Res}_{D/k}((\mathrm {U}_{d})_D) / \operatorname {Res}_{D/k}(\operatorname {Aut}_{\mathscr E|_D, \mathscr F_\bullet |_D})] \to \operatorname {Res}_{D/k}((\mathrm {Covers}_{d})_D)$ induced by sending a section to its vanishing locus.

Lemma 8.6 For $3 \leq d \leq 5$, the map $\phi _d^{D/k}: [\operatorname {Res}_{D/k}((\mathrm {U}_{d})_D) / \operatorname {Res}_{D/k}(\operatorname {Aut}_{\mathscr E|_D, \mathscr F_\bullet |_D})] \to \operatorname {Res}_{D/k} ((\mathrm {Covers}_{d})_D)$ is an isomorphism.

This is proven via a nearly identical argument to Proposition 4.8 and we omit the proof. The one minor difference one must note is that, in order to show $\phi _d^{D/k}$ is surjective, for any $T \to \operatorname {\operatorname {Spec}} k$ and any vector bundle on $T \times _k D$, one may replace $T$ by an open cover which trivializes the bundle.

Definition 8.7 Let $\mathscr Y_{R,d} \subset \operatorname {Res}_{D/k}((\mathrm {U}_{d})_D)$ denote the preimage under the composition

\[ \operatorname{Res}_{D/k}((\mathrm{U}_{d})_D) \to [\operatorname{Res}_{D/k}((\mathrm{U}_{d})_D) / \operatorname{Res}_{D/k}(\operatorname{Aut}_{\mathscr E|_D, \mathscr F_\bullet|_D})]\xrightarrow{\phi_d^{D/k}} \operatorname{Res}_{D/k}((\mathrm{Covers}_{d})_D) \]

of $\mathscr X_{R,d} \subset \operatorname {Res}_{D/k}((\mathrm {Covers}_{d})_D)$.

Remark 8.8 We will implicitly use the following geometric description of the residual gerbe $\mathscr X_{R,d}$ and its preimage $\mathscr Y_{R,d}$ in $\operatorname {Res}_{D/k}((\mathrm {U}_{d})_D)$. As a fibered category, $\mathscr X_{R,d}$ has $T$ points given by finite locally free degree $d$ Gorenstein covers $Z \to T \times _k D$ satisfying the following properties:

  1. (1) $Z$ has ramification profile $R$ over each geometric point $\operatorname {\operatorname {Spec}} \kappa \rightarrow T_D$;

  2. (2) $Z$ is curvilinear in the sense that for each geometric point $\operatorname {\operatorname {Spec}} \kappa \rightarrow T$, the resulting scheme $Z \times _T \operatorname {\operatorname {Spec}} \kappa$ has $1$-dimensional Zariski tangent space at each point.

Similarly, when $3 \leq d \leq 5$, we can describe $\mathscr Y_{R,d}$ as those sections $\eta \in \operatorname {Res}_{D/k}((\mathrm {U}_{d})_D)(T)$ for which the associated degree $d$ cover of $T \times _k D$, $\Psi _d(\eta )$ (as defined in § 3.11) has the above properties. We note that $\mathscr Y_{R,d}$ is a locally closed subscheme of $\operatorname {Res}_{D/k}((\mathrm {U}_{d})_D)$ since the same holds for the residual gerbe $\mathscr X_{R,d}$ in $\operatorname {Res}_{D/k}(\mathrm {Covers}_{d})$ (see [Reference RydhRyd11, Theorem B.2]). One may also deduce this is locally closed directly from the above functorial description.

By combining Theorem 8.3 with Lemma 8.6, we can easily deduce the following.

Theorem 8.9 Let $R$ be a ramification profile which is a partition of $d$ and let $\mathscr Y_{R,d}$ be the scheme defined in Definition 8.7 (with associated free sheaves $\mathscr E, \mathscr F_\bullet$ on $D$). Let $r(R)$ denote the ramification order associated to the ramification profile $R$, as defined in Definition 7.1. Then,

\[ \{\mathscr Y_{R,d}\} = \{\operatorname{Aut}^{D/k}_{\mathscr E,\mathscr F_\bullet}\} \mathbb{L}^{-r(R)}. \]

Proof of Theorem 8.9 assuming Theorem 8.3 Using Lemma 8.6 and Remark 8.2,

\[ \{[\mathscr Y_{R,d}/\operatorname{Aut}^{D/k}_{\mathscr E, \mathscr F_\bullet}]\} = \{B(\operatorname{Res}_{D/k}(\operatorname{Aut}_{X_{(R)}/D}))\} = \mathscr X_{R,d}. \]

Since $\operatorname {Aut}^{D/k}_{\mathscr E, \mathscr F_\bullet }$ is special and has invertible class in $K_0(\mathrm {Stacks}_k)$ by Lemma 7.12, [Reference EkedahlEke09a, Propositions 1.4(i), 1.1(ix)] implies that

\[ \{[\mathscr Y_{R,d}/\operatorname{Aut}^{D/k}_{\mathscr E, \mathscr F_\bullet}]\} = \frac{\{\mathscr Y_{R,d}\}}{\{\operatorname{Aut}^{D/k}_{\mathscr E, \mathscr F_\bullet}\}}. \]

Then, by Theorem 8.3,

(8.2)\begin{equation} \{\mathscr Y_{R,d}\} =\{\operatorname{Aut}^{D/k}_{\mathscr E, \mathscr F_\bullet}\} \cdot \{\mathscr X_{R,d} \} = \{\operatorname{Aut}^{D/k}_{\mathscr E, \mathscr F_\bullet}\} \cdot \mathbb{L}^{-r(R)}. \end{equation}

8.10 Computing the algebraic group $\operatorname {Res}_{D/k}(\operatorname {Aut}_{X_{(R)}/D})$

Let $X_{(R)}$ denote the scheme as defined in (8.1). Our next goal is to compute the group scheme $\operatorname {Res}_{D/k}(\operatorname {Aut}_{X_{(R)}/D})$, which we will carry out in Corollary 8.12. In order to do so, we first deal with the case that $X_{(R)}$ is connected.

Lemma 8.11 Let $d \in \mathbb {Z}_{\geq 1}$, and let $k$ be a field with $\operatorname {\operatorname {char}}(k) \nmid d!$. Let $W := \operatorname {\operatorname {Spec}} k[y,\varepsilon ]/ (\varepsilon ^2, \varepsilon -y^d)$. For $\operatorname {Aut}_{W/D}$ the automorphism scheme of $W$ over $D$, we have $\operatorname {Res}_{D/k}(\operatorname {Aut}_{W/D}) \simeq \mathbb {G}_a^{d-1} \rtimes \mu _d$, explicitly given by $\alpha \in \mu _d$ sending $y \mapsto \alpha y$ and $(a_1, \ldots, a_{d-1}) \in \mathbb {G}_a^{d-1}$ sending $y \mapsto y + \sum _{i=1}^{d-1} a_iy^{d+i}$.

Proof. For $T$ a $k$ algebra, a functorial $T$ point of $\operatorname {Res}_{D/k}(\operatorname {Aut}_{W/D})$ corresponds (upon taking global sections) to an isomorphism of $T$ algebras

\[ \phi: T[y,\varepsilon]/(\varepsilon^2, \varepsilon-y^d) \simeq T[y, \varepsilon]/(\varepsilon^2, \varepsilon-y^d) \]

over $T[\varepsilon ]/(\varepsilon ^2)$. Such an automorphism is uniquely determined by where it sends $y$. To conclude the proof, it suffices to verify that any such $\phi$ is of the form $y \mapsto \alpha y + \sum _{i=1}^{d-1} a_i y^{d+i}$ for $\alpha \in \mu _d(T)$ and $a_i \in \mathbb {G}_a(T)$, and conversely that any map of this form determines an automorphism.

Let $\phi _{\alpha, a_1, \ldots, a_{d-1}}$ denote the map of $T$ algebras sending $y \mapsto \alpha y + \sum _{i=1}^{d-1} a_i y^{d+i}$ as above. Under the isomorphism $T[y,\varepsilon ]/(\varepsilon ^2, \varepsilon -y^d) \simeq T[y]/y^{2d}$, any automorphism $\phi$ must induce an isomorphism on cotangent spaces, and hence send $y$ to some polynomial $p_\phi (y) = b_1 y + b_2 y^2 + \cdots + b_{2d-1}y^{2d-1}$, with $b_1 \neq 0$ and $b_i \in T$. The condition that $\phi$ determines a map of $T[\varepsilon ]/(\varepsilon ^2)$ algebras precisely corresponds to $y^d = p_\phi (y)^d$. Comparing the coefficients of $y^d$ in this equation implies $b_1 \in \mu _d(T)$. Since $\operatorname {\operatorname {char}}(k) \nmid d!$, comparing the coefficients of $y^{d+1}, \ldots, y^{2d-1}$ in the equation $y^d = p_\phi (y)^d$ implies $b_2 = b_3 = \cdots = b_d = 0$. However, the coefficients $b_{d+1}, \ldots, b_{2d-1}$ can be arbitrary and $y^d = p_\phi (y)^d$ will be satisfied. So, any automorphism $\phi$ must be of the form $\phi _{\alpha, a_1, \ldots, a_{d-1}}$ (where we take $a_i = b_{d+i}$ in the above notation).

To see any map $\phi _{\alpha, a_1, \ldots, a_{d-1}}$ determines an automorphism of $T$ algebras, note first that it is well defined, because $(\alpha y + \sum _{i=1}^{d-1} a_i y^{d+i})^d = y^d$, using that $y^{2d} = 0$. It is an automorphism as its inverse is explicitly given by $\phi _{(\alpha ^{-1}, -\alpha ^{-2} a_1, -\alpha ^{-3} a_2, \ldots, - \alpha ^{-d} a_{d-1})}$

Corollary 8.12 Choose a partition $(r_1^{t_1}, \ldots, r_n^{t_n})$ of $d$, i.e. $d = \sum _{i=1}^n t_i \cdot r_i$. For $i = 1, \ldots, n$, let $W_i := \operatorname {\operatorname {Spec}} \prod _{j=1}^{t_i} k[y,\varepsilon ]/(\varepsilon ^2, \varepsilon -y^{r_i})$. Let $W := \coprod _{i=1}^n W_i$, so that $W \simeq X_{(R)}$ when $R$ is the ramification profile associated to the above partition. We have an isomorphism $\operatorname {Res}_{D/k}(\operatorname {Aut}_{W/D}) \simeq \prod _{i=1}^n (\mathbb {G}_a^{r_i - 1} \rtimes \mu _{r_i}) \wr S_{t_i}$, where each $\mathbb {G}_a^{r_i - 1} \rtimes \mu _{r_i}$ is explicitly realized acting on each component of $W_i$ as in Lemma 8.11, and the action of the wreath product with $S_{t_i}$ is obtained by permuting the $t_i$ components of $W_i$.

Proof. To compute the automorphism group of $W$, first observe that any automorphism must permute all connected components of a fixed degree, and therefore $\operatorname {Aut}_{W/D} = \prod _{i=1}^n \operatorname {Aut}_{W_i/D}$ and, consequently,

\[ \operatorname{Res}_{D/k} (\operatorname{Aut}_{W/D}) = \operatorname{Res}_{D/k} \bigg(\prod_{i=1}^n \operatorname{Aut}_{W_i/D}\bigg) = \prod_{i=1}^n \operatorname{Res}_{D/k} (\operatorname{Aut}_{W_i/D}). \]

It therefore suffices to show $\operatorname {Res}_{D/k}(\operatorname {Aut}_{W_i/D}) \simeq (\mathbb {G}_a^{r_i - 1} \rtimes \mu _{r_i}) \wr S_{t_i}$. As all connected components of $W_i$ are isomorphic, any automorphism is realized as the composition of an automorphism preserving each connected component, followed by some permutation of the connected components. Since there are $t_i$ connected components, the group of permutations of the components is the symmetric group $S_{t_i}$, while for $Z_i$ a connected component of $W_i$, we established $\operatorname {Res}_{D/k}\operatorname {Aut}_{Z_i/D} \simeq \mathbb {G}_a^{r_i - 1} \rtimes \mu _{r_i}$ in Lemma 8.11. It follows that

\[ \operatorname{Res}_{D/k}(\operatorname{Aut}_{W_i/D}) = (\operatorname{Res}_{D/k}(\operatorname{Aut}_{Z_i/D})) \wr S_{t_i} = (\mathbb{G}_a^{r_i - 1}\rtimes \mu_{r_i}) \wr S_{t_i}. \]

8.13 Computing $\{B \operatorname {Res}_{D/k}(\operatorname {Aut}_{X_{(R)}/D})\}$

Our next goal is to prove Theorem 8.3 by computing the class of $\{B\operatorname {Res}_{D/k}(\operatorname {Aut}_{X_{(R)}/D})\}$ in $K_0(\mathrm {Stacks}_k)$, which we carry out at the end of this section in § 8.17. Of course, we will use our computation of $\operatorname {Res}_{D/k}(\operatorname {Aut}_{X_{(R)}/D})$ from Corollary 8.12. In order to set up our computation we need the following lemma.

Lemma 8.14 For $x$ and $y$ positive integers, $\{B(\mathbb {G}_a^x \rtimes \mu _y)\}= \{B(\mathbb {G}_a^x)\} = \mathbb {L}^{-x}$, where $\mu _y$ acts on $\mathbb {G}_a^x$ by the scaling action $(\alpha, (a_1, \ldots, a_x)) \mapsto (\alpha a_1, \ldots, \alpha a_x)$.

Proof. Indeed, we have an inclusion

\[ (\mathbb{G}_a^x \rtimes \mu_{y}) \hookrightarrow (\mathbb{G}_a^{x} \rtimes \mathbb{G}_m), \]

where the semidirect product $\mathbb {G}_a^{x} \rtimes \mathbb {G}_m$ is defined similarly to that in Lemma 8.11 so that $\mathbb {G}_m$ acts on $\mathbb {G}_a^{x}$ by

\begin{align*} \mathbb{G}_m \times \mathbb{G}_a^x & \rightarrow \mathbb{G}_a^x\\ (\alpha, (a_1, \ldots, a_x)) & \mapsto (\alpha a_1, \ldots, \alpha a_x). \end{align*}

The natural inclusion $\mu _{y} \rightarrow \mathbb {G}_m$ then respects the constructed group structures. For simplicity of notation, temporarily define $K := \mathbb {G}_a^{x} \rtimes \mu _{y}$ and $L :=\mathbb {G}_a^{x} \rtimes \mathbb {G}_m$.

Since $L$ is special, and special groups are closed under extensions, it follows from [Reference EkedahlEke09a, Proposition 1.1(ix)] that $\{BK\} = \{L/K\}\{BL\}$. However, since $\mathbb {G}_a^{x}$ is a normal subgroup of both $L$ and $K$, the quotient $L/K$ is identified with

\[ L/K \simeq \frac{ L/ \mathbb{G}_a^{x}}{K/\mathbb{G}_a^{x}} \simeq \mathbb{G}_m/\mu_{y} \simeq \mathbb{G}_m. \]

Since $L$ is special, using [Reference EkedahlEke09a, Proposition 1.4(i)] and [Reference EkedahlEke09a, Proposition 1.1(v)], we obtain that $\{BL\} = {1}/{\{L\}} = \mathbb {L}^{-x} {1}/({\mathbb {L}-1})$. Therefore,

\[ \{BK\} = \{L/K\}\{BL\} = (\mathbb{L}-1)\mathbb{L}^{-x} \frac{1}{\mathbb{L}-1} = \mathbb{L}^{-x} = \{B(\mathbb{G}_a^x)\}, \]

using again that $L = \mathbb {G}_a^x \rtimes \mathbb {G}_m$ is special.

Using Lemma 8.14, we next compute the class of $B \operatorname {Res}_{D/k}(\operatorname {Aut}_{X_{(R)}/D})$ in the case that the partition $R$ has a single part. To continue our computation, we need the notion of stacky symmetric powers.

Definition 8.15 For $\mathscr X$ a stack, define the stacky symmetric power $\operatorname {Symm}^n \mathscr X := [\mathscr X^n/S_n]$ (where $[\mathscr X^n/S_n]$ denotes the stack quotient for $S_n$ acting on $\mathscr X^n$ by permuting the factors).

The key input to our next computation will be that taking the stacky symmetric powers is a well-defined operation on the Grothendieck ring of stacks by [Reference EkedahlEke09b, Proposition 2.5].

Lemma 8.16 For integers $s$ and $t$,

\[ B((\mathbb{G}_a^{s - 1} \rtimes \mu_{s}) \wr S_{t}) = \mathbb{L}^{-(s-1) \cdot t} \in K_0(\mathrm{Stacks}_k). \]

Proof. By definition,

\[ \{B((\mathbb{G}_a^{s - 1} \rtimes \mu_{s}) \wr S_{t})\} = \{B(\mathbb{G}_a^{s - 1} \rtimes \mu_{s}) \wr B(S_{t}) \} = \{\operatorname{Symm}^{t}(B(\mathbb{G}_a^{s - 1} \rtimes \mu_{s}))\}. \]

Having computed $\{B(\mathbb {G}_a^{s - 1} \rtimes \mu _{s})\} = \mathbb {L}^{-s+1}$ in Lemma 8.14, we therefore wish to next compute $\{\operatorname {Symm}^{t}(B(\mathbb {G}_a^{s - 1} \rtimes \mu _{s}))\}$. Since $\{\operatorname {Symm}^n \mathscr X\}$ only depends on $\{\mathscr X\}$ by [Reference EkedahlEke09b, Proposition 2.5], and we have shown $\{B(\mathbb {G}_a^{s - 1} \rtimes \mu _{s})\}= \{B(\mathbb {G}_a^{s - 1})\}$ in Lemma 8.14 it follows that

\[ \{\operatorname{Symm}^{t}(B(\mathbb{G}_a^{s - 1} \rtimes \mu_{s}))\} = \{\operatorname{Symm}^{t}(B\mathbb{G}_a^{s -1})\}. \]

Next, by [Reference EkedahlEke09b, Lemma 2.4], we have

\begin{align*} \{\operatorname{Symm}^{t}(\mathbb{A}^{s-1} \times B \mathbb{G}_a^{s-1})\} &= \{\operatorname{Symm}^{t}(B \mathbb{G}_a^{s-1}) \times \mathbb{A}^{(s-1)t}\} \\ &= \{\operatorname{Symm}^{t}(B \mathbb{G}_a^{s-1})\} \cdot \mathbb{L}^{(s-1)t}. \end{align*}

However, since $\{\mathbb {A}^{s-1} \times B \mathbb {G}_a^{s-1} \} = 1$, and $\{\operatorname {Symm}^n \mathscr X\}$ only depends on $\{\mathscr X\}$ by [Reference EkedahlEke09b, Proposition 2.5], we obtain

\begin{align*} \{\operatorname{Symm}^{t}(B \mathbb{G}_a^{s-1})\} &= \{\operatorname{Symm}^{t}(\mathbb{A}^{s-1} \times B \mathbb{G}_a^{s-1})\}\mathbb{L}^{-(s-1)t}\\ &= \{\operatorname{Symm}^{t}(1)\} \mathbb{L}^{-(s-1)t} \\ &= \{BS_{t}\}\mathbb{L}^{-(s-1)t} \\ &= \mathbb{L}^{-(s-1)t}. \end{align*}

For the last step, we used Theorem A.1, which says that $\{BS_t\} = 1$.

8.17 Completing the calculation of the local class

We now complete the proof of Theorem 8.3.

Proof of Theorem 8.3 By Remark 8.2 $\{\mathscr X_{R,d}\} = \{B(\operatorname {Res}_{D/k}(\operatorname {Aut}_{X_{(R)}/D}))\}$, and so we will now compute the latter. By Corollary 8.12 we equate

\[ \operatorname{Res}_{D/k}(\operatorname{Aut}_{X_{(R)}/D}) = \prod_{i=1}^n (\mathbb{G}_a^{r_i - 1} \rtimes\mu_{r_i}) \wr S_{t_i}. \]

Factoring this as a product, it suffices to compute the class of $B(\mathbb {G}_a^{r_i - 1} \rtimes \mu _{r_i}) \wr S_{t_i}$. Using that $\sum _{i=1}^n (r_i-1)t_i =r(R)$, the result follows from Lemma 8.16.

9. Codimension bounds for the main result

In this section, we establish various bounds on the codimension or certain bad loci we will want to weed out when computing the class of Hurwitz stacks in the Grothendieck ring.

9.1 Weeding out the strata of unexpected codimension

In order to compute the classes of Hurwitz stacks, we will stratify the Hurwitz stacks by Casnati–Ekedahl strata. The following lemma computes the codimension of these loci in the Hurwitz stack. For the following statement, recall our notation for $\mathscr H(\mathscr E, \mathscr F_\bullet )$ from (3.6).

Lemma 9.2 Fix some $d$, resolution data $(\mathscr E, \mathscr F_\bullet )$, and define $g$ by $\deg \det \mathscr E = g + d - 1$. Letting $\mathscr H := \mathscr H(\mathscr E, \mathscr F_\bullet )$, the codimension of $\mathscr M := \mathscr M(\mathscr E, \mathscr F_\bullet )$ in $\overline {\mathrm {Hur}}_{d,g, k}$, assuming it is nonempty, is

\[ \begin{cases} h^1(\mathbb{P}^1_k, \operatorname{End} \mathscr E) & \text{if } d = 3 ,\\ h^1(\mathbb{P}^1_k, \operatorname{End} \mathscr E)+ h^1(\mathbb{P}^1_k, \operatorname{End} \mathscr F) - h^1(\mathbb{P}^1_k, \mathscr H) & \text{if } d = 4 \text{ or } 5. \end{cases} \]

Proof. The case $d = 3$ is proven in [Reference PatelPat13, Proposition 1.4]. Therefore, for the remainder of the proof, we assume $d = 4$ or $d = 5$, in which case we simply write $(\mathscr E, \mathscr F)$ in place of $(\mathscr E, \mathscr F_\bullet )$. Let $\mathscr M^{\circ } := \mathscr M(\mathscr E^\circ, \mathscr F^\circ )$ denote the dense open stratum, corresponding to vector bundles $\mathscr E^\circ$ and $\mathscr F^\circ$ which are balanced (see § 6.9), subject to the conditions that $\det \mathscr F^\circ \simeq \det \mathscr E^\circ$ when $d = 4$ and $\det \mathscr F^\circ \simeq \det (\mathscr E^\circ )^{\otimes 2}$ when $d = 5$, coming from coming from Theorems 3.14 and 3.16. Similarly, let $\mathscr H^\circ := \mathscr H(\mathscr E^\circ, \mathscr F^\circ )$.

We are looking to compute the codimension of $\mathscr M$ in $\overline {\mathrm {Hur}}_{d,g, k}$, or equivalently the difference of dimensions $\dim \mathscr M^\circ - \dim \mathscr M$. Using the description of $\mathscr M$ from Proposition 7.11 as a quotient of an open in the affine space associated to $H^0(\mathbb {P}^1_k, \mathscr H)$ by $\operatorname {Aut}_{\mathscr M}$, it follows that the codimension of $\mathscr M$ in $\overline {\mathrm {Hur}}_{d,g, k}$ is $\dim \mathscr M^{\circ } - \dim \mathscr M$. Since $\dim \mathscr M = h^0(\mathbb {P}^1_k, \mathscr H) - \dim \operatorname {Aut}_{\mathscr M}$, we are looking to compute

(9.1)\begin{align} & (h^0(\mathbb{P}^1_k,\mathscr H^\circ) - \dim \operatorname{Aut}_{\mathscr M^\circ}) -(h^0(\mathbb{P}^1_k, \mathscr H) - \dim\operatorname{Aut}_{\mathscr M}) \nonumber\\ &\quad = (h^0(\mathbb{P}^1_k,\mathscr H^\circ) - h^0(\mathbb{P}^1_k, \mathscr H)) +(\dim \operatorname{Aut}_{\mathscr M} - \dim \operatorname{Aut}_{\mathscr M^\circ}). \end{align}

We will first identify $\dim \operatorname {Aut}_{\mathscr M} - \dim \operatorname {Aut}_{\mathscr M^\circ }$ with $h^1(\mathbb {P}^1_k, \operatorname {End} \mathscr E)+ h^1(\mathbb {P}^1_k, \operatorname {End} \mathscr F)$. Second, we will show $(h^0(\mathbb {P}^1_k, \mathscr H^\circ ) - h^0(\mathbb {P}^1_k, \mathscr H))$ agrees with $h^1(\mathbb {P}^1_k, \mathscr H)$. Combining these with (9.1) will complete the proof.

To identify $\dim \operatorname {Aut}_{\mathscr M} - \dim \operatorname {Aut}_{\mathscr M^\circ }$, we may identify $\dim \operatorname {Aut}_{\mathscr M}$ with the dimension of the tangent space to $\operatorname {Aut}_{\mathscr M}$ at the identity, which is given by $H^0(\mathbb {P}^1_k, \operatorname {End} \mathscr E) \times H^0(\mathbb {P}^1_k, \operatorname {End} \mathscr F)$. It is then enough to show that

\[ h^0(\mathbb{P}^1_k, \operatorname{End} \mathscr E^\circ) - h^0(\mathbb{P}^1_k, \operatorname{End} \mathscr E) = h^1(\mathbb{P}^1_k, \operatorname{End} \mathscr E) \]

and

\[ h^0(\mathbb{P}^1_k, \operatorname{End} \mathscr F^\circ) - h^0(\mathbb{P}^1_k, \operatorname{End} \mathscr F) = h^1(\mathbb{P}^1_k, \operatorname{End} \mathscr F). \]

We focus on the case of $\mathscr E$, as the case of $\mathscr F$ is completely analogous. By Riemann Roch, since the degrees and ranks of $\mathscr E$ and $\mathscr E^\circ$ are the same, we find

\[ h^0(\mathbb{P}^1_k, \operatorname{End} \mathscr E^\circ) - h^0(\mathbb{P}^1_k, \operatorname{End} \mathscr E) = h^1(\mathbb{P}^1_k, \operatorname{End} \mathscr E) - h^1(\mathbb{P}^1_k, \operatorname{End} \mathscr E^\circ). \]

Because $\mathscr E^\circ$ is balanced, we find $h^1(\mathbb {P}^1_k, \operatorname {End} \mathscr E^\circ ) = 0$.

To complete the proof, it only remains to show $(h^0(\mathbb {P}^1_k, \mathscr H^\circ ) - h^0(\mathbb {P}^1_k, \mathscr H))$ agrees with $h^1(\mathbb {P}^1_k, \mathscr H)$. Similarly to our computation above for $h^0(\mathbb {P}^1_k, \operatorname {End} \mathscr E^\circ ) - h^0(\mathbb {P}^1_k, \operatorname {End} \mathscr E)$, we find

\[ h^0(\mathbb{P}^1_k, \mathscr H^\circ) - h^0(\mathbb{P}^1_k,\mathscr H) = h^1(\mathbb{P}^1_k, \mathscr H) - h^1(\mathbb{P}^1_k,\mathscr H^\circ) \]

by Riemann Roch. To complete the proof, we only need verify $h^1(\mathbb {P}^1_k, \mathscr H^\circ ) = 0$. Indeed, by writing out $\mathscr E^\circ$ and $\mathscr F^\circ$ as sums of line bundles on $\mathbb {P}^1_k$, and using the relation between $\det \mathscr E$ and $\det \mathscr F$, the balancedness of $\mathscr E^\circ$ and $\mathscr F^\circ$ implies $h^1(\mathbb {P}^1_k, \mathscr H^\circ ) = 0$.

With the above lemma in hand, we may note that the codimension of the vector bundles $(\mathscr E, \mathscr F_\bullet )$ in the stack of vector bundles is $H^1(\mathbb {P}^1, \operatorname {End}(\mathscr E)) + H^1(\mathbb {P}^1, \operatorname {End}(\mathscr F))$ (the latter interpreted as $0$ when $d =3$).

Remark 9.3 We will think of a Casnati–Ekedahl stratum as having the ‘expected codimension’ when its codimension in the Hurwitz stack agrees with the corresponding codimension of $(\mathscr E, \mathscr F_\bullet )$ in the stack of tuples of vector bundles. Using Lemma 9.2 and its proof, a stratum is of the expected codimension precisely when $H^1(\mathbb {P}^1, \mathscr H(\mathscr E, \mathscr F_\bullet )) = 0$.

The next lemma bounds the codimension of strata not having the expected codimension.

Lemma 9.4 Suppose $3 \leq d \leq 5$ and $\mathscr M := \mathscr M(\mathscr E, \mathscr F_\bullet )$ is a Casnati–Ekedahl stratum containing a curve $C \to \mathbb {P}^1$ which does not factor through some intermediate cover $C' \to \mathbb {P}^1$ of positive degree. If $H^1(\mathbb {P}^1, \mathscr H(\mathscr E, \mathscr F_\bullet )) \neq 0$ or $H^0(\mathbb {P}^1, \mathscr E^\vee ) \neq 0$, $\operatorname {codim}_{\overline {\mathrm {Hur}}_{d,g, k}} \mathscr M \geq ({g + d -1})/{d} - 4^{d-3}$.

Proof. If $H^1(\mathbb {P}^1, \mathscr H(\mathscr E, \mathscr F_\bullet )) \neq 0$, then we have $\operatorname {codim}_{\overline {\mathrm {Hur}}_{d,g, k}} \mathscr M \geq ({g + d -1})/{d} - 4^{d-3}$ by [Reference Canning and LarsonCL24, Lemma 5.8] when $d = 4$, [Reference Canning and LarsonCL24, Lemma 5.12] when $d = 5$, and [Reference MirandaMir85, (6.2)] for the cases that $d=3$ (see also [Reference Bolognesi and VistoliBV12, Proposition 2.2]). It therefore remains to show that if $H^1(\mathbb {P}^1, \mathscr H(\mathscr E, \mathscr F_\bullet )) = 0$ but $H^0(\mathbb {P}^1, \mathscr E^\vee ) \neq 0$, we will also have $\operatorname {codim}_{\overline {\mathrm {Hur}}_{d,g, k}} \mathscr M \geq ({g + d -1})/{d} - 4^{d-3}$. In the case $H^1(\mathbb {P}^1, \mathscr H(\mathscr E, \mathscr F_\bullet )) = 0$, the codimension of $\mathscr M$ in $\overline {\mathrm {Hur}}_{d,g, k}$ is simply $h^1(\mathbb {P}^1, \operatorname {End}(\mathscr E)) + h^1(\mathbb {P}^1, \operatorname {End}(\mathscr F_\bullet ))$, by Lemma 9.2. We will only have $H^0(\mathbb {P}^1, \mathscr E^\vee ) \neq 0$ when some summand of $\mathscr E$ is non-positive. Recall from Lemma 6.3 $\deg \mathscr E = g + d- 1$. Therefore, if $\mathscr E = \bigoplus _{i=1}^{d-1} \mathscr O(e_i)$ with $e_1 \leq 0$, then $\sum _{i=2}^{d-1} (e_i - e_1) \geq g + d -1$ and, hence,

\begin{align*} h^1(\mathbb{P}^1, \operatorname{End}(\mathscr E)) &\geq h^1 \bigg (\mathbb{P}^1, \oplus \bigoplus _{i=2}^{d-1} \mathscr O_{\mathbb{P}^1}(e_1 - e_i)\bigg) \\ &\geq g + d -1 - (d-2) \\ &= g + 1 \\ &> \frac{g + d -1}{d} - 4^{d-3}. \end{align*}

9.5 Weeding out covers with smaller Galois groups

In the next few results, culminating in Lemma 9.8, we establish bounds on the codimension of degree $d$ covers of $\mathbb {P}^1$ whose Galois closure has Galois group $G$ strictly contained in $S_d$.

For a group $G$ and a base $S$, with $\# G$ invertible on $S$, we use $\mathrm {Hur}_{G,S}$ to denote the stack whose $T$-points are given by $(T, X, h: X \to T, f: X \to \mathbb {P}^1_T)$ where $X$ is a scheme, $h$ is a smooth proper relative curve, $f$ is a finite locally free map of degree $\# G$ so that $G$ acts on $X$ over $\mathbb {P}^1_T$, together with an isomorphism $G \simeq \operatorname {Aut} f$. Note that $\mathrm {Hur}_{G,S}$ is an algebraic stack with an étale map to the configuration space of points in $\mathbb {P}^1$ given by taking the branch divisor, as follows from [Reference WewersWew98, Theorem 4], (the key point of the construction being the algebraicity criterion in [Reference WewersWew98, Theorem 1.3.3]). Upon specifying an embedding $G \subset S_d$ for some $d$, there is a natural map $\mathrm {Hur}_{G,S} \to \overline {\mathrm {Hur}}_{d,S}$ sending a given cover $(T, X, h: X \to T, f: X \to \mathbb {P}^1_T)$ to an associated cover $\coprod _{h \in G \backslash S_d} (h X)/S_{d-1} \to \mathbb {P}^1_T$ where we take the disjoint union over cosets of $G \backslash S_d$ and then quotienting the resulting $S_d$ cover by $S_{d-1}$. The image of this map is a substack of $\overline {\mathrm {Hur}}_{d,S}$ whose geometric points parameterize degree $d$ covers whose Galois group is $G$ with the specified embedding $G \subset S_d$. We note that we could have alternatively constructed $\mathrm {Hur}_{G,S}$ directly, as mentioned in Remark 5.6, without appealing to [Reference WewersWew98].

Lemma 9.6 Suppose $G \subset S_d$ is a subgroup not containing a transposition. Then the closure of the image $(\mathrm {Hur}_{G,S} \to \overline {\mathrm {Hur}}_{d,S}) \cap \overline {\mathrm {Hur}}_{d,g, S}$ has dimension at most $g - 1 + d$.

Proof. By [Reference WewersWew98, Theorem 4], if the image $(\mathrm {Hur}_{G,S} \to \overline {\mathrm {Hur}}_{d,S}) \cap \overline {\mathrm {Hur}}_{d,g, S}$ parameterizes curves with $n$ branch points, it has dimension $n$. We therefore use $n$ for the number of branch points. It is possible this image has multiple components, but because the Galois closure of a genus $g$ degree $d$ cover of $\mathbb {P}^1_k$ is a curve of bounded genus, there can only be finitely many components. We now fix one of these components and wish to show $n \leq g - 1 + d$.

Let $X \to \mathbb {P}^1$ be a degree $d$ genus $g$ cover corresponding to a point on this component with $n$ branch points. If $G \subset S_d$ has no transpositions, the inertia at any point of $\mathbb {P}^1$, which is tame by assumption, does not act as a transposition. Therefore, the cover is not simply branched over that point, i.e. the ramification partition is not $(1^d)$ or $(2,1^{d-2})$. Hence, the fiber over that point has total ramification degree at least $2$. It follows from Riemann–Hurwitz that $2g - 2 \geq -2d + 2n$ so $n \leq g - 1 + d$.

Corollary 9.7 Suppose $2 \leq d \leq 5$ and $G \subset S_d$ acts transitively on $\{1, \ldots, d\}$ with $G$ not isomorphic to $D_4$. Then, the image $(\mathrm {Hur}_{G,S} \to \overline {\mathrm {Hur}}_{d,S}) \cap \overline {\mathrm {Hur}}_{d,g, S}$ has dimension at most $g - 1 + d$.

Proof. If $2 \leq d \leq 5$, we claim the only proper conjugacy class of subgroups $G \subset S_d$ acting transitively on $\{1, \ldots, d\}$ and containing a transposition is $D_4 \subset S_4$, the dihedral group of order $8$. Indeed, this claim follows by a straightforward check of all subgroups of $S_d$. The corollary then follows from Lemma 9.6

The next lemma shows that in any Casnati–Ekedahl stratum having the expected codimension (see Remark 9.3) the locus of non-$S_d$ covers has high codimension in the Hurwitz stack.

Lemma 9.8 Suppose $3 \leq d \leq 5$ and $\mathscr M(\mathscr E,\mathscr F_\bullet )$ is a Casnati–Ekedahl stratum with $H^1(\mathbb {P}^1, \mathscr H(\mathscr E, \mathscr F_\bullet )) = 0$. Suppose further that $[\mathrm {U}_{\mathscr M(\mathscr E, \mathscr F_\bullet )}^{}/\operatorname {Aut}_{\mathscr M(\mathscr E, \mathscr F_\bullet )}]$ contains some geometrically connected cover $X \to \mathbb {P}^1_k$ whose Galois closure is not $S_d$. Then the codimension of this locus of covers in $\overline {\mathrm {Hur}}_{d,g, k}$ is at least $ ({g+3})/{2}$.

Note that the space of $D_4$ covers is typically of codimension $2$ in $\overline {\mathrm {Hur}}_{d,g, k}$, but these covers will typically lie in a Casnati–Ekedahl stratum with $H^1(\mathbb {P}^1,\mathscr H(\mathscr E, \mathscr F_\bullet )) \neq 0$.

Proof. The most difficult case is when $d = 4$ and the Galois closure is $D_4$, the dihedral group of order $8$, this was verified in [Reference Canning and LarsonCL24, Lemma 5.5]. Note here we are using that whenever the Galois group of a degree $4$ cover is $D_4$, $C\to \mathbb {P}^1$ necessarily factors through an intermediate degree $2$ cover.

It remains to verify that if we have a smooth geometrically connected curve $C \to \mathbb {P}^1$ whose Galois closure is not $D_4$, the codimension of such curves is at least $ ({g + 3})/{2}$. The geometric connectedness condition guarantees that the action of $G$ on $\{1, \ldots, d\}$ is transitive. Note that the dimension of such a stratum is at most $g - 1 + d$ by Corollary 9.7, and hence also codimension $g - 1 + d$ in the ($2g+2d-2$)-dimensional stack $\mathrm {Hur}_{d,g,k}$. The lemma follows because $g - 1 + d > ({g + 3})/{2}$.

9.9 Weeding out the singular sections

Our next goal is to show that for any given Casnati–Ekedahl stratum, the sections defining smooth curves can be expressed in terms of a fairly simple motivic Euler product, away from high codimension. This is, in some sense, the key input to our approach, and draws heavily on the work of [Reference Bilu and HoweBH21] while also making use of our computations of classes associated to sections with given ramification profiles over the dual numbers from § 8. It will turn out that this codimension is the dominant term, in the sense that for large $g$, the codimension bound we obtain on these singular sections agrees with the codimension bound we find in our main result Theorem 10.5. At this point, it may be useful to recall notation for motivic Euler products from § 2.10.

Proposition 9.10 Let $3 \leq d \leq 5$, and let $\mathcal {R}$ be an allowable collection of ramification profiles of degree $d$. Suppose $s \geq 0$ and $\mathscr M := \mathscr M(\mathscr E, \mathscr F_\bullet )$ is a Casnati–Ekedahl stratum for which $\mathscr H(\mathscr E, \mathscr F_\bullet )(-s)$ is globally generated and each entry of $\vec a^\mathscr E$ is positive. Then,

(9.2)\begin{equation} \{\mathrm{U}_{ \mathscr M}^{\mathcal{R}}\} \equiv \mathbb{L}^{\dim \mathrm{U}_{ \mathscr M}^{\mathcal{R}}} \prod_{x \in \mathbb{P}^1_k} \bigg(1- \bigg( 1 - \frac{(\sum_{R \in \mathcal{R}} \mathbb{L}^{-r(R)}) \{\operatorname{Aut}_{\mathscr M|_D}\}}{\mathbb{L}^{h^0(D, \mathscr H(\mathscr E|_D, \mathscr F_\bullet|_D))}}\bigg) t \bigg)\bigg\rvert_{t =1} \end{equation}

modulo codimension $\lfloor ({s + 1})/{2} \rfloor$ in $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$.

In the above product, the restriction to $D$ is understood to take place at any subscheme $D \subset \mathbb {P}^1_k$, noting that $h^0(D, \mathscr H(\mathscr E|_D, \mathscr F_\bullet |_D))$ and $\{\operatorname {Aut}_{\mathscr M|_D}\}$ are independent of the choice of such $D$.

Proof. We first explain the final statement that $h^0(D, \mathscr H(\mathscr E|_D, \mathscr F_\bullet |_D))$ and $\{\operatorname {Aut}_{\mathscr M|_D}\}$ are independent of the choice of $D$. Indeed, $h^0(D, \mathscr H(\mathscr E|_D, \mathscr F_\bullet |_D))$ is independent of choice of $D \subset \mathbb {P}^1_k$ because it only depends on the rank of $\mathscr H(\mathscr E, \mathscr F_\bullet )$. Similarly, $\{\operatorname {Aut}_{\mathscr M|_D}\}$ only depends on $d$ and the ranks of the sheaves $\mathscr E, \mathscr F_\bullet$, and not the specific choice of $D \subset \mathbb {P}^1_k$. We therefore focus on proving (9.2).

We will deduce (9.2) by applying [Reference Bilu and HoweBH21, Theorem 9.3.1] with the local condition determined by the ramification profile $R$, as determined in Theorem 8.9, as we next explain. In particular, in applying [Reference Bilu and HoweBH21, Theorem 9.3.1], we will take $r = 1, m = \lfloor ({s + 1})/{2} \rfloor, M = 0$ in their notation.

In some more detail, we take $(f: X \to S, \mathcal {F}, \mathcal {L}, r, M)$ in [Reference Bilu and HoweBH21, Theorem 9.3.1] to be $(\mathbb {P}^1_k \to \operatorname {\operatorname {Spec}} k, \mathscr H(\mathscr E, \mathscr F_\bullet ),\mathscr O_{\mathbb {P}^1}(1),1,0)$ and the constructible Taylor conditions $T$ of [Reference Bilu and HoweBH21, Theorem 9.3.1] which we will define in the next paragraph.

For $\mathscr F$ a locally free sheaf on some scheme $X$, we use $\mathcal {P}^1(\mathscr F)$ to denote the first-order sheaf of principal parts. This is a locally free sheaf on $X$ whose fiber over $x \in X$ can be identified with $H^0(X, \mathscr F \otimes \mathscr O_{X,x}/\mathfrak m_{X,x}^2)$. Thus, in the case $X = \mathbb {P}^1_k$ is a curve and $D$ is the copy of the dual numbers whose closed point maps to $x$, the fiber of $\mathcal {P}^1(\mathscr F)$ at $x$ is $H^0(\mathbb {P}^1_k, \mathscr F|_D)$. For a definition and standard background on bundles of principal parts, see [Reference Eisenbud and HarrisEH16, § 7.2]. Let $T$ denote the constructible subset of $\operatorname {\operatorname {Spec}} (\operatorname {Sym}^\bullet \mathcal {P}^1(\mathscr H(\mathscr E, \mathscr F_\bullet )))^\vee$ defined as follows: upon identifying the fiber of $\operatorname {\operatorname {Spec}} (\operatorname {Sym}^\bullet \mathcal {P}^1(\mathscr H(\mathscr E, \mathscr F_\bullet )))^\vee$ at $x$ with $H^0(\mathbb {P}^1_k, \mathscr H(\mathscr E, \mathscr F_\bullet )|_D)$, we take the subset given by those sections $\eta \in H^0(\mathbb {P}^1_k, \mathscr H(\mathscr E, \mathscr F_\bullet )|_D)$ so that $\Psi _d(\eta )$ defines a curvilinear scheme over $D$ whose ramification profile lies in $\mathcal {R}$. Let $T^c$ denote the complement of $T$ in $\operatorname {\operatorname {Spec}} (\operatorname {Sym}^\bullet \mathcal {P}^1(\mathscr H(\mathscr E, \mathscr F_\bullet )))^\vee$.

In order to apply [Reference Bilu and HoweBH21, Theorem 9.3.1], we need to verify the above conditions are indeed admissible in the sense of [Reference Bilu and HoweBH21, Definition 9.2.6]. Indeed, to see this, we need to check the Taylor conditions imposed by being smooth with ramification profile lying in $\mathcal {R}$ are the complement of a codimension $2 = 1 + \dim \mathbb {P}^1_k$ subset of the fiber of the first sheaf of principal parts associated to $\mathscr H(\mathscr E, \mathscr F_\bullet )$ over a field valued point of $\mathbb {P}^1_k$. First, one can verify directly (for example, by using an incidence correspondence) that those sections $\eta$ for which $\Psi _d(\eta )$ are not curvilinear form a locus of codimension at least $2$ in $\operatorname {\operatorname {Spec}} (\operatorname {Sym}^\bullet H^0(D, \mathscr H(\mathscr E|_D, \mathscr F_\bullet |_D))^\vee )$. (Note that non-curvilinear sections also include sections with $\Psi _d(\eta )$ of positive dimension.) It remains to show those curvilinear sections having ramification profile not lying in $\mathcal {R}$ have codimension at least $2$. This follows from knowledge of their class in the Grothendieck ring Theorem 8.9, which shows, in particular, the codimension of those sections having ramification profile $R$ is $r(R)$. Since the only ramification profiles with $r(R) \leq 1$ are $(1^d)$ and $(2, 1^{d-2})$, the claim follows from the first constraint in the definition of allowable, Definition 7.2.

We next use [Reference Bilu and HoweBH21, Ex. 5.4.6] to determine the value of $m$ appearing in [Reference Bilu and HoweBH21, Theorem 9.3.1]. In place of the value $D$ used in [Reference Bilu and HoweBH21, Ex. 5.4.6], we use $-s$, since we are reserving $D$ for the dual numbers. Otherwise following the notation of [Reference Bilu and HoweBH21, Ex. 5.4.6], since $\mathscr O(1) = \mathcal {L}$ is very ample and $\mathcal {L}^{\otimes 0} \simeq \mathscr O_{\mathbb {P}^1}$ globally generated, we may take $A = 1$ and $B = 0$. It follows that (in their notation except that we use $\delta$ in place of $d$) there is a surjection $\mathscr O(s)^{N} \to \mathscr F$, and so $H^0(X, \mathscr F \otimes \mathscr O(\delta ))$ (so, again, we are taking $\mathscr F$ to be $\mathscr H(\mathscr E, \mathscr F_\bullet )$) is $1$-infinitesimally $m$-generating whenever $\delta \geq -s + 0 + 1(1 + (m-1)\cdot (1+1)) = -s + 1 + 2(m-1)$. Therefore, taking $\delta = 0$, we find $H^0(X, \mathscr F)$ is $1$-infinitesimally $m$-generating whenever $s \geq 1 + 2(m-1) = 2m - 1$. Therefore, we may take $m = \lfloor ({s + 1})/{2} \rfloor$.

Using that $\operatorname {rk} \mathcal {P}^1( \mathscr H(\mathscr E,\mathscr F_\bullet ))= h^0(D, \mathscr H(\mathscr E|_D,\mathscr F_\bullet |_D))$, we obtain from [Reference Bilu and HoweBH21, Theorem 9.3.1] the congruence

(9.3)\begin{align} \{\mathrm{U}_{ \mathscr M}^{\mathcal{R}}\} &\equiv \mathbb{L}^{\dim \mathrm{U}_{ \mathscr M}^{\mathcal{R}}} \prod_{x \in \mathbb{P}^1_k} \bigg(1- \bigg(\frac{ \{T^c\}_x}{\mathbb{L}^{\operatorname{rk} \mathcal{P}^1 (\mathscr H(\mathscr E,\mathscr F_\bullet))}}\bigg) t \bigg)\bigg\rvert_{t =1} \end{align}
(9.4)\begin{align} &\equiv \mathbb{L}^{\dim \mathrm{U}_{ \mathscr M}^{\mathcal{R}}} \prod_{x \in \mathbb{P}^1_k} \bigg(1- \bigg(1 - \frac{ \{T\}_x}{\mathbb{L}^{h^0(D, \mathscr H(\mathscr E|_D,\mathscr F_\bullet|_D))}}\bigg) t \bigg)\bigg\rvert_{t =1}. \end{align}

In order to obtain (9.2), we need to identify (9.4) and the right-hand side of (9.2). By working Zariski locally on $\mathbb {P}^1$ so as to trivialize the bundle $\mathscr H(\mathscr E, \mathscr F_\bullet )$, it is enough to identify the fiber of $T$ over $x \in \mathbb {P}^1$ with the class $\sum _{R \in \mathcal {R}} \mathbb {L}^{-r(R)} \{\operatorname {Aut}_{\mathscr M|_D}\}$. Indeed, this identification holds because we showed that the class of the subschemes of $\operatorname {\operatorname {Spec}} (\operatorname {Sym}^\bullet H^0(D, \mathscr H(\mathscr E|_D, \mathscr F_\bullet |_D))^\vee )$ having ramification profile $R$ is $\mathbb {L}^{-r(R)} \{\operatorname {Aut}_{\mathscr M|_D}\}$ when we computed the class of $\mathscr Y_{R,d}$ in Theorem 8.9.

In order to get a good bound on the codimension up to which Proposition 9.10 holds, we need to show that the value of $s$ defined there is high whenever the codimension of the stratum is low. We now establish this.

Lemma 9.11 For any Casnati–Ekedahl stratum $\mathscr M(\mathscr E, \mathscr F_\bullet )$ so that the minimum degree of a line bundle summand of $\mathscr H(\mathscr E, \mathscr F_\bullet )$ is $s$, and $H^1(\mathbb {P}^1, \mathscr H(\mathscr E, \mathscr F_\bullet )) = 0$, we have $\operatorname {codim}_{\overline {\mathrm {Hur}}_{d,g, k}} \mathscr M + ({s + 1})/{2} \geq ({g+c_d})/{\kappa _d}$, where $c_3 = 0$, $c_4= -2$, $c_5 = -23$, $\kappa _3 = 4$, $\kappa _4 = 12$, and $\kappa _5 = 40$.

Proof. We can verify this in the case that $d = 3$ directly. Write $\mathscr E = \mathscr O(s) \oplus \mathscr O(g + 2 - s)$ with $s \leq g + 2 -s$, so that $\operatorname {codim}_{\overline {\mathrm {Hur}}_{3,g, k}}\mathscr M = h^1(\mathbb {P}^1, \operatorname {End}(\mathscr E)) \geq (g + 2) - 2s -1$. Then, $\operatorname {codim}_{\overline {\mathrm {Hur}}_{3,g, k}}\mathscr M + ({s+1})/{2} \geq (g+2) - 2s - 1 + ({s+1})/{2} = ({2g+3-3s})/{2}$. This is minimized when $s$ is maximized. Since we must have $s \leq ({g + 2})/{2}$, when $s = ({g + 2})/{2}$, we find $ ({2g+3-3s})/{2} = {g}/{4}$.

We now concentrate on the cases $d =4$ and $d = 5$. First, in the case that $\mathscr E$ and $\mathscr F$ are balanced, so that $\operatorname {codim}_{\overline {\mathrm {Hur}}_{d,g, k}} \mathscr M = 0$, we claim that $ ({s + 1})/{2} \geq ({g+c_d})/{\kappa _d}$.

When $d = 4$, and $\mathscr E$ and $\mathscr F$ are balanced, the minimum line bundle summand of $\mathscr E$ has degree at least $ ({g + 1})/{3}$ while the maximum line bundle summand of $\mathscr F$ has degree at most $ ({g+4})/{2}$ using Lemma 6.3 and the isomorphism $\det \mathscr E \simeq \det \mathscr F$ from Theorem 3.14. Hence, the minimum line bundle summand of $\mathscr H$ has degree $s \geq 2 ({g + 1})/{3} - ({g + 4})/{2} = ({g-8})/{6}$. Therefore, $ ({s+1})/{2} \geq ({g-2})/{12}$.

When $d = 5$, and $\mathscr E$ and $\mathscr F$ are balanced, the minimum line bundle summand of $\mathscr E$ has degree at least $ ({g + 1})/{4}$ by Lemma 6.3 and the minimum line bundle summand of $\mathscr F$ has degree at least $ ({2(g+4)-4})/{5}$ as $\det \mathscr E^{\otimes 2} = \det \mathscr F$ by Theorem 3.16. Therefore, the minimum degree of a line bundle summand of $\mathscr H$ is $s \geq 2 (({2(g+4)-4})/{5}) - (g+4) + ({g+1})/{4} = ({g-43})/{20}$ and $ ({s + 1})/{2} \geq ({g -23})/{40}$.

In the case $d = 4$ or $5$, it remains to see that $\operatorname {codim}_{\overline {\mathrm {Hur}}_{d,g, k}} \mathscr M + ({s + 1})/{2} \geq ({g+c_d})/{\kappa _d}$ remains true for non-general strata, supposing still that $H^1(\mathbb {P}^1, \mathscr H(\mathscr E, \mathscr F)) = 0$. In this case, by Lemma 9.2, the codimension of $\mathscr M(\mathscr E, \mathscr F)$ is $h^1(\mathbb {P}^1, \operatorname {End}(\mathscr E)) + h^1(\mathbb {P}^1, \operatorname {End}(\mathscr F))$. This codimension is also the codimension of the point $[(\mathscr E,\mathscr F)]$ in the moduli stack of vector bundles $\mathrm {Vect}^{\operatorname {rk} \mathscr E}_{\mathbb {P}^1_k} \times \mathrm {Vect}^{\operatorname {rk} \mathscr F}_{\mathbb {P}^1_k}$, see [Reference LarsonLar21, (3.1)]. Hence, we wish to verify

\[ \operatorname{codim}_{\mathrm{Vect}^{\operatorname{rk} \mathscr E}_{\mathbb{P}^1_k} \times \mathrm{Vect}^{\operatorname{rk} \mathscr F}_{\mathbb{P}^1_k}} [(\mathscr E, \mathscr F)] + \frac{s + 1}{2} \geq \frac{g+c_d}{\kappa_d}, \]

granting that we have established this in the case that $\mathscr E, \mathscr F$ are both balanced, and so correspond to the generic point of $\mathrm {Vect}^{\operatorname {rk} \mathscr E}_{\mathbb {P}^1_k} \times \mathrm {Vect}^{\operatorname {rk} \mathscr F}_{\mathbb {P}^1_k}$, as explained in § 6.9. Following the discussion from § 6.9 where we describe when one vector bundle on $\mathbb {P}^1_k$, viewed as a point in the moduli stack of vector bundles lies in the closure of another, any $(\mathscr E, \mathscr F)$ may be connected to a balanced pair by a sequence of $(\mathscr E_i, \mathscr F_i)$, each contained in the closure of the next. Further, we can assume that for any two adjacent indices $i$ and $i+1$, one of the following two cases occurs:

  1. (1) $\mathscr E_i \simeq \mathscr E_{i+1}$ and $\mathscr F_i$ differs from $\mathscr F_{i+1}$ only in two line bundles summands by a single degree;

  2. (2) $\mathscr F_i \simeq \mathscr F_{i+1}$ and $\mathscr E_i$ differs from $\mathscr E_{i+1}$ only in two line bundle summands by a single degree.

In order to show the claimed inequality holds for arbitrary strata, it suffices to show it remains true under such specializations. Because each such stratum has codimension at least $1$ in the next, it suffices to show the value of $s$ under such specializations decreases by at most $2$. When $d = 4$ this is the case because $\mathscr H(\mathscr E, \mathscr F) = \operatorname {Sym}^2 \mathscr E \otimes \mathscr F^\vee$ and increasing a summand of $\mathscr F$ by $1$ only decreases all summands of $\mathscr H(\mathscr E, \mathscr F)$ by at most $1$, while decreasing a summand of $\mathscr E$ decreases all summands of $\mathscr H(\mathscr E, \mathscr F)$ by at most $2$. Similarly, when $d = 5$, so $\mathscr H(\mathscr E, \mathscr F) = \wedge ^2 \mathscr F \otimes \mathscr E \otimes \det \mathscr E^\vee$, and decreasing a summand of $\mathscr E$ by $1$ while maintaining $\det \mathscr E$ decreases all summands of $\mathscr H(\mathscr E, \mathscr F)$ by at most $1$, while decreasing a summand of $\mathscr F$ by $1$ decreases all summands of $\mathscr H(\mathscr E, \mathscr F)$ by at most $2$.

9.12 Putting the codimension bounds together

We now merge the bounds on codimension of various bad loci established earlier in this section to obtain the following result.

Proposition 9.13 For $3 \leq d \leq 5$, $k$ a field of characteristic not dividing $d!$, $\mathcal {R}$ an allowable collection of ramification profiles of degree $d$, let $c_d, \kappa _d$ be as in Lemma 9.11. Define $n_{d,g} := \chi (\mathscr H(\mathscr E, \mathscr F_\bullet ))$. Then, $\{\mathrm {Hur}_{d,g,k}^{{\mathcal {R}}}\}$ is equal to

(9.5)\begin{equation} \sum_{\text{Casnati--Ekedahl strata }\mathscr M} \frac{1}{\{\operatorname{Aut}_{\mathscr M}\}} \mathbb{L}^{n_{d,g}} \prod_{x \in \mathbb{P}^1_k} \frac{(\sum_{R \in \mathcal{R}} \mathbb{L}^{-r(R)})\{\operatorname{Aut}_{\mathscr M|_D}\}}{\mathbb{L}^{h^0(D, \mathscr H(\mathscr E|_D, \mathscr F_\bullet|_D))}} \end{equation}

modulo codimension $r_{d,g} := \min (({g + c_d})/{\kappa _d}, ({g + d-1})/{d} - 4^{d-3})$ in $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$.

Proof. First, by Proposition 7.10, it suffices to show

(9.6)\begin{equation} \sum_{\text{Casnati--Ekedahl strata }\mathscr M}\frac{\{\mathrm{U}_{ \mathscr M}^{\mathcal{R}, S_d}\}}{\{\operatorname{Aut}_{\mathscr M}\}} \end{equation}

agrees with (9.5).

We next check (9.6) agrees with

(9.7)\begin{equation} \sum_{\text{nonempty Casnati--Ekedahl strata }\mathscr M}\frac{\{\mathrm{U}_{ \mathscr M}^{\mathcal{R}}\}}{\{\operatorname{Aut}_{\mathscr M}\}} \end{equation}

modulo codimension $\min (({g + c_d})/{\kappa _d}, ({g + d-1})/{d} - 4^{d-3})$. Since we are working modulo codimension $ ({g + d-1})/{d} - 4^{d-3}$, we can assume $\mathscr M(\mathscr E, \mathscr F_\bullet )$ has $H^1(\mathbb {P}^1, \mathscr H(\mathscr E, \mathscr F_\bullet )) =0$ and $H^0(\mathbb {P}^1, \mathscr E^\vee ) = 0$, by Lemma 9.4. Note that the condition $H^0(\mathbb {P}^1, \mathscr E^\vee ) = 0$ ensures all curves defined by sections of $\mathrm {U}_{ \mathscr M}^{\mathcal {R}}$ are geometrically connected, by Theorem 3.17. Since we have now restricted ourselves to work with strata for which $H^1(\mathbb {P}^1, \mathscr H(\mathscr E, \mathscr F_\bullet )) = 0$, it follows from Lemma 9.11, with notation for $s$ as in Lemma 9.11, that $\operatorname {codim} \mathscr M(\mathscr E, \mathscr F_\bullet ) + ({s + 1})/{2} \geq ({g + c_d})/{\kappa _d}$. We also obtain from Lemma 9.8 that the smooth geometrically connected curves in $\mathrm {U}_{\mathscr M}^{\mathcal {R}}$ which do not lie in $\mathrm {Hur}_{d,g,k}$ (because they do not have Galois closure $S_d$) have codimension at least $ ({g + 3})/{2}$ in $\mathrm {Hur}_{d,g,k}$. Hence, as we are working modulo codimension $ ({g+3})/{2}$, we can freely ignore these, and so (9.6) agrees with (9.7).

We next claim (9.7) agrees with

(9.8)\begin{equation} \sum_{\text{nonempty Casnati--Ekedahl strata }\mathscr M} \frac{1}{\{\operatorname{Aut}_{\mathscr M}\}} \mathbb{L}^{\dim \mathrm{U}_{ \mathscr M}^{\mathcal{R}}} \prod_{x \in \mathbb{P}^1_k} \frac{(\sum_{R \in \mathcal{R}} \mathbb{L}^{-r(R)}) \{\operatorname{Aut}_{\mathscr M|_D}\}}{\mathbb{L}^{h^0(D, \mathscr H(\mathscr E|_D, \mathscr F_\bullet|_D))}}. \end{equation}

Indeed, this follows from Proposition 9.10 using the bounds on $s$ from Lemma 9.11. Next, we claim that for any $\mathscr M(\mathscr E, \mathscr F_\bullet )$ as above, $\dim \mathrm {U}_{ \mathscr M}^{\mathcal {R}}$ is independent of $\mathscr M$ whenever $\operatorname {codim}_{\overline {\mathrm {Hur}}_{d,g, k}} \mathscr M \leq \min (({g + c_d})/{\kappa _d}, ({g + d-1})/{d} - 4^{d-3})$. Indeed, in this case, because $H^1(\mathbb {P}^1, \mathscr H(\mathscr E, \mathscr F_\bullet )) =0$, we find $\dim \mathrm {U}_{ \mathscr M}^{\mathcal {R}} = h^0(\mathbb {P}^1, \mathscr H(\mathscr E, \mathscr F_\bullet )) = \chi (\mathscr H(\mathscr E, \mathscr F_\bullet ))$ and indeed this Euler characteristic only depends on the degrees and ranks of $\mathscr E$ and $\mathscr F$. For notational convenience, we let $n_{d,g}$ denote this dimension $\chi (\mathscr H(\mathscr E, \mathscr F_\bullet ))$. Then, up to codimension $\min (({g + c_d})/{\kappa _d}, ({g + d-1})/{d} - 4^{d-3})$, in $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$, we can rewrite (9.8) as (9.5).

To conclude the proof, we wish to remove the word ‘nonempty’ in (9.8). That is, there may be certain strata which contain no $S_d$ covers, and we wish to show they do not contribute to (9.5) in low codimension. The summand in (9.5) associated to such an empty stratum $\mathscr M(\mathscr E, \mathscr F_\bullet )$ has codimension equal to the codimension of $(\mathscr E, \mathscr F_\bullet )$, considered as a point in the moduli stack of tuples of vector bundles on $\mathbb {P}^1$. Using Corollary 9.7, this is only potentially an issue in the case $d = 4$, where we must deal with strata $\mathscr M(\mathscr E, \mathscr F_\bullet )$ so that the generic members of $H^0(\mathbb {P}^1, \mathscr H(\mathscr E, \mathscr F_\bullet ))$ define $D_4$ covers. In [Reference Canning and LarsonCL24, Lemma 5.5], it is shown that such strata are either codimension at least $({g+3})/{2}$ or else have $H^1(\mathbb {P}^1, \mathscr H(\mathscr E, \mathscr F_\bullet )) \neq 0$. In the latter case, by [Reference Canning and LarsonCL24, Lemma 5.4], such strata have codimension at least $({g + 3})/{4}-4$ in the stack of vector bundles on $\mathbb {P}^1$. In either case, we may ignore these contributions up to our codimension bounds, and so (9.8) agrees with (9.5).

10. Proving the main result

In this section, we prove our main result Theorem 10.5 by massaging the formula for $\{\mathrm {Hur}_{d,g,k}\}$ given in Proposition 9.13. We then deduce some corollaries.

In order to prove our main result we will need one of the simplest cases of the ‘motivic Tamagawa number conjecture’ [Reference Behrend and DhillonBD07, Conjecture 3.4]. To start this Tamagawa number formula, we employ the following notation.

Notation 10.1 For $\mathscr G$ a vector bundle on a scheme $X$, let $\operatorname {Aut}^{\mathrm {SL},X}_{\mathscr G}$ denote the $\mathrm {SL}$ bundle over $X$ associated to $\mathscr G$ (i.e. the kernel of the determinant map of group schemes $\operatorname {Aut}_{\mathscr G} \to \mathbb {G}_m$). We use $\operatorname {Aut}^{\mathrm {SL}}_{\mathscr G}$ as notation for the Weil restriction $\operatorname {Res}_{X/\operatorname {\operatorname {Spec}} k}(\operatorname {Aut}^{\mathrm {SL},X}_{\mathscr G})$. For $(\mathscr E, \mathscr F_\bullet )$ resolution data, we use $\operatorname {Aut}^{\mathrm {SL}}(\mathscr F_\bullet ) := \prod _{i=1}^{\lfloor ({d-2})/{2} \rfloor } \operatorname {Aut}^{\mathrm {SL}}(\mathscr F_i)$.

Lemma 10.2 For any positive integer $n$,

\[ \sum_{\substack{\text{rank-$n$ vector bundles $\mathscr V$ on $\mathbb{P}^1$} \\ \det \mathscr V = \mathscr O_{\mathbb{P}^1}}}\frac{1}{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr V}\}} \prod_{x \in \mathbb{P}^1_k}\frac{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr V|_D}\}}{\mathbb{L}^{\dim\operatorname{Aut}^{\mathrm{SL}}_{\mathscr V|_D}}} = \mathbb{L}^{-\dim \mathrm{SL}_n} \in \widehat{\widetilde{K_0}}(\mathrm{Spaces}_k). \]

Proof. We will deduce this from the motivic Tamagawa number conjecture for $\mathrm {SL}_n$ over $\mathbb {P}^1$ proven in [Reference Behrend and DhillonBD07, § 7]. Let $\mathfrak {Bun}_{G,\mathbb {P}^1}$ denote the moduli stack of $G$-bundles on $\mathbb {P}^1$. It is shown in [Reference Behrend and DhillonBD07, § 7], and also via a different argument in [Reference Behrend and DhillonBD07, § 6], that in $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$ (and even without inverting universally bijective morphisms) we have $\{\mathfrak {Bun}_{\mathrm {SL}_n,\mathbb {P}^1}\} = \mathbb {L}^{-\dim \mathrm {SL}_n} \prod _{i=2}^n Z(\mathbb {P}^1, \mathbb {L}^{-i})$, where $Z(\mathbb {P}^1, t) := \sum _{i=0}^\infty \{\operatorname {Sym}^i_{\mathbb {P}^1}\} t^i = ({1}/({1-t})) ({1}/({1-\mathbb {L} t}))$ is the motivic Zeta function of $\mathbb {P}^1$.

Note that $Z(\mathbb {P}^1, \mathbb {L}^{-i}) = ({1}/({1-\mathbb {L}^{-i+1}})) ({1}/({1-\mathbb {L}^{-i}}))$ is invertible in $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$, with inverse equal to $(1-\mathbb {L}^{-i+1})(1-\mathbb {L}^{-i})$. To complete the proof, it is therefore enough to demonstrate the two equalities

(10.1)\begin{align} \{\mathfrak{Bun}_{\mathrm{SL}_n,\mathbb{P}^1}\} &= \sum_{\substack{\text{rank-$n$ vector bundles $\mathscr V$ on $\mathbb{P}^1$} \\ \det \mathscr V = \mathscr O_{\mathbb{P}^1} }} \frac{1}{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr V}\}} \end{align}
(10.2)\begin{align} \bigg(\prod_{i=2}^n Z(\mathbb{P}^1, \mathbb{L}^{-i})\bigg)^{-1} &= \prod_{x \in \mathbb{P}^1_k}\frac{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr V|_D}\}}{\mathbb{L}^{\dim \operatorname{Aut}^{\mathrm{SL}}_{\mathscr V|_D}}}, \end{align}

where we note that the right-hand side of (10.2) turns out to be independent of $\mathscr V$.

We first verify (10.1). Taking cohomology on $\mathbb {P}^1$ associated to exact sequence $\mathrm {SL}_n \to \operatorname {GL}_n \to \mathbb {G}_m$ defining $\mathrm {SL}_n$ shows that $\mathrm {SL}_n$ torsors over $\mathbb {P}^1$ are in bijection with $\operatorname {GL}_n$ torsors of trivial determinant. We can then stratify $\mathfrak {Bun}_{\mathrm {SL}_n,\mathbb {P}^1} = \coprod B \operatorname {Aut}^{\mathrm {SL}}_{\mathscr V}$ as a disjoint union of locally closed substacks corresponding to residual gerbes, as is explained for general $G$ in place of $\mathrm {SL}_n$ (see [Reference Behrend and DhillonBD07, p. 636]). (Much of this argument can be verified more simply and directly in the case $G = \mathrm {SL}_n$.) Noting that $\operatorname {Aut}^{\mathrm {SL}}_\mathscr V$ is special with invertible class in the Grothendieck ring by Lemma 7.12, we find $\{B(\operatorname {Aut}^{\mathrm {SL}}_{\mathscr V})\} = {1}/{\{\operatorname {Aut}^{\mathrm {SL}}_{\mathscr V}\}}$ and (10.1) follows.

It remains only to prove (10.2). First, note that $\mathscr V$ is trivial Zariski locally and, hence, trivial over $D$, so $\operatorname {Aut}^{\mathrm {SL}}_{\mathscr V|_D}$ is simply $\operatorname {Res}_{D/\operatorname {\operatorname {Spec}} k}(\mathrm {SL}_n)$ which is an extension of $\mathrm {SL}_n$ by $\mathbb {G}_a^{\dim \mathrm {SL}_n}$. Therefore, for any vector bundle $\mathscr V$, we may re-express

\[ \frac{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr V|_D}\}}{\mathbb{L}^{\dim\operatorname{Aut}^{\mathrm{SL}}_{\mathscr V|_D}}} = \frac{\{\mathrm{SL}_n\}}{\mathbb{L}^{\dim \mathrm{SL}_n}} = \{\mathrm{SL}_n\} \mathbb{L}^{-\dim \mathrm{SL}_n} = \bigg(\prod_{i=2}^n (\mathbb{L}^i - 1 ) \bigg) \mathbb{L}^{-\dim \mathrm{SL}_n} = \prod_{i=2}^n (1-\mathbb{L}^{-i}). \]

Using multiplicativity of Euler products (Lemma 2.14),

\[ \prod_{x \in \mathbb{P}^1_k}\frac{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr V|_D}\}}{\mathbb{L}^{\dim \operatorname{Aut}^{\mathrm{SL}}_{\mathscr V|_D}}} = \prod_{x \in \mathbb{P}^1_k} \prod_{i=2}^n (1-\mathbb{L}^{-i}) = \prod_{i=2}^n \prod_{x \in \mathbb{P}^1_k} (1-\mathbb{L}^{-i}). \]

Hence, to prove (10.2), we only need check $Z(\mathbb {P}^1, \mathbb {L}^{-i})^{-1} = \prod _{x \in \mathbb {P}^1_k} (1-\mathbb {L}^{-i})$ for $2 \leq i \leq n$. The right-hand side is, by definition, $\prod _{x \in \mathbb {P}^1_k} (1-\mathbb {L}^{-i}t)|_{t = 1}$. By [Reference BiluBil17, § 3.8, Property 4], we have $\prod _{x \in \mathbb {P}^1_k} (1-\mathbb {L}^{-i}t)|_{t = 1} = \prod _{x \in \mathbb {P}^1_k} (1-t)|_{t = \mathbb {L}^{-i}}$. (As a word of warning, it is important that the substitution we made here was via replacing $t$ by its product with a power of $\mathbb {L}$, see [Reference Bilu and HoweBH21, Remarks 6.5.2 and 6.5.3].) Finally, by [Reference Bilu and HoweBH21, Ex. 6.1.12] and multiplicativity of Euler products [Reference BiluBil17, Proposition 3.9.2.4], $\prod _{x \in \mathbb {P}^1_k} (1-t)|_{t = \mathbb {L}^{-i}} = Z(\mathbb {P}^1, \mathbb {L}^{-i})^{-1}$.

In fact, we will need a slight generalization of the above formula from Lemma 10.2, where we replace bundles of degree $0$ with bundles of arbitrary fixed degree.

Lemma 10.3 For any positive integer $n$, and any fixed integer $\delta$,

\[ \sum_{\substack{\text{rank-$n$ vector bundles $\mathscr V$ on $\mathbb{P}^1$} \\ \deg \mathscr V = \delta}}\frac{1}{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr V}\}} \prod_{x \in\mathbb{P}^1_k} \frac{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr V|_D}\}}{\mathbb{L}^{\dim\operatorname{Aut}^{\mathrm{SL}}_{\mathscr V|_D}}} = \mathbb{L}^{-\dim \mathrm{SL}_n} \in\widehat{\widetilde{K_0}}(\mathrm{Spaces}_k). \]

Proof. The case that $\delta = 0$ was precisely covered in Lemma 10.2. Therefore, it remains to show that the left-hand side of the statement of the lemma is independent of $\delta$. The left hand side is unchanged upon replacing $\delta$ by $\delta \pm n$ because tensoring with the line bundle $\mathscr O_{\mathbb {P}^1_k}(1)$ defines a bijection from degree $\delta$ vector bundles of rank $n$ to degree $\delta + n$ vector bundles of rank $n$, which preserves automorphism groups. Therefore, it suffices to show that the left-hand side is independent of which congruence class $\delta$ lies in $\bmod n$.

Next, note that one can express

\[ \mathbb{L}^{\dim \operatorname{PGL}_n} \cdot \sum_{\substack{\text{rank-$n$ vector bundles $\mathscr V$ on $\mathbb{P}^1$} \\ \deg \mathscr V = \delta}}\frac{1}{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr V}\}} \prod_{x \in \mathbb{P}^1_k}\frac{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr V|_D}\}}{\mathbb{L}^{\dim\operatorname{Aut}^{\mathrm{SL}}_{\mathscr V|_D}}}, \]

as $f_\delta (\mathbb {L})$, for $f_\delta$ a rational function. Hence, for $\delta, \delta '$ two distinct residue classes $\bmod n$, it is enough to show $f_\delta (q) = f_{\delta '}(q)$ for infinitely many integers $q$, as then the two rational functions must agree. The reason for choosing the above expression is that $\sum _{\delta =1}^n f_\delta (q)$ can also be identified with the Tamagawa number of $\operatorname {PGL}_n$ over the function field $\mathbb {P}^1_{\mathbb {F}_q}$. Here we are using that if one starts with a vector bundle $\mathscr V$ on $\mathbb {P}^1_{F_q}$, $\# \operatorname {Aut} \mathbb {P}\mathscr V = ({1}/({q-1})) \operatorname {Aut} \mathscr V$, and the same expression calculates the number of automorphisms of $\mathscr V$ with trivial determinant.

We now use the above description in terms of Tamagawa numbers to show $f_\delta (q) = f_{\delta '}(q)$ for $1 \leq \delta \leq \delta ' \leq n$. For $x \in \mathbb {P}^1_{\mathbb {F}_q}$ a closed point, let $\widehat {\mathscr O}_{x,\mathbb {P}^1_{\mathbb {F}_q}}$ denote the complete local ring at $x$. We use $K(\mathbb {P}^1_{\mathbb {F}_q})$ to denote the function field of $\mathbb {P}^1_{\mathbb {F}_q}$, and $\mathbb {A} := \prod _{v \in \mathbb {P}^1_{\mathbb {F}_q} \text { closed points}} (K(\mathbb {P}^1_{\mathbb {F}_q})_v, \widehat {\mathscr O}_v)$ to denote the ring of adeles for this function field. Note that the Tamagawa number can be expressed as the Tamagawa measure of $\operatorname {PGL}_n(K(\mathbb {P}^1_{\mathbb {F}_q})) \backslash \operatorname {PGL}_n(\mathbb {A})$.

There is a projection map

(10.3)\begin{equation} \alpha: \operatorname{PGL}_n(K(\mathbb{P}^1_{\mathbb{F}_q})) \backslash \operatorname{PGL}_n(\mathbb{A}) \to \operatorname{PGL}_n(K(\mathbb{P}^1_{\mathbb{F}_q})) \backslash \operatorname{PGL}_n(\mathbb{A}) / \prod_{\operatorname{places } x \in \mathbb{P}^1_{\mathbb{F}_q}} \operatorname{PGL}_n(\widehat{\mathscr O}_{x,\mathbb{P}^1_{\mathbb{F}_q}}). \end{equation}

We claim one can identify the target with the set of isomorphism classes of $\operatorname {PGL}_n$ bundles on $\mathbb {P}^1_{\mathbb {F}_q}$, and moreover, if $X$ is a $\operatorname {PGL}_n$ bundle, the Tamagawa measure satisfies

(10.4)\begin{equation} \mu_{\operatorname{Tam}}(\alpha^{-1}([X])) =\frac{\mu_{\operatorname{Tam}}\big(\!\prod_{\operatorname{places } x \in \mathbb{P}^1_{\mathbb{F}_q}} \operatorname{PGL}_n(\widehat{\mathscr O}_{x,\mathbb{P}^1_{\mathbb{F}_q}})\big)}{ \# \operatorname{Aut} X}. \end{equation}

Our claim essentially follows from [Reference Gaitsgory and LurieGL19, Proposition 1.3.2.11], except the statement there assumes the group $G$ is simply connected, which is not the case for $\operatorname {PGL}_n$. However, the only place in the proof (see the proof of [Reference Gaitsgory and LurieGL19, Proposition 1.3.2.10]) that the simply connected hypothesis was used was to show there is some dense open of $\mathbb {P}^1_{\mathbb {F}_q}$ on which any $\operatorname {PGL}_n$ bundle is trivial. We can instead verify this directly as follows. Note first that the Brauer group of $\mathbb {P}^1_{\mathbb {F}_q}$ is trivial [Reference GrothendieckGro68, Remarques 2.5(b)]. Hence, any $\operatorname {PGL}_n$ bundle on $\mathbb {P}^1_{\mathbb {F}_q}$ is the projectivization of a $\operatorname {GL}_n$ bundle. Since any $\operatorname {GL}_n$ bundle is Zariski locally trivial, the same holds for any $\operatorname {PGL}_n$ bundle on $\mathbb {P}^1_{\mathbb {F}_q}$.

There is natural map $\pi : \operatorname {PGL}_n(K(\mathbb {P}^1_{\mathbb {F}_q})) \backslash \operatorname {PGL}_n(\mathbb {A}) \to \mathbb {Z}/n\mathbb {Z}$ which factors through the double quotient map (10.3) parameterizing projective bundles on $\mathbb {P}^1$, and sends a projective bundle to its degree $\bmod n$. Note that the degree of the projectivization of a vector bundle is not well defined as an integer, but it is well defined $\bmod n$. In this setup, we obtain $\mu _{\operatorname {Tam}}(\pi ^{-1}(\delta )) = f_\delta (q)$, where $\mu _{\operatorname {Tam}}$ denotes the right-invariant Tamagawa measure, by summing (10.4) over all bundles of degree $\delta \bmod n$.

Since the Tamagawa measure is translation invariant, we can right-translate by any element of $\operatorname {PGL}_n(\mathbb {A})$ in $\pi ^{-1}(\delta ' - \delta )$ and this sends $\pi ^{-1}(\delta )$ to $\pi ^{-1}(\delta ')$. Hence, the Tamagawa measures of $\pi ^{-1}(\delta )$ and $\pi ^{-1}(\delta ')$ agree, so $f_\delta (q) =f_{\delta '}(q)$, as desired.

For our main theorem, we will also need the following elementary dimension comparison.

Lemma 10.4 For $3 \leq d \leq 5$ and $n_{d,g}$ as in Proposition 9.13, $n_{d,g} - \dim \mathrm {SL}_{\operatorname {rk} \mathscr E} - \sum _{i=1}^{ {\lfloor d - 2 \rfloor }/{2}} \dim \mathrm {SL}_{\operatorname {rk} \mathscr F_i} = \dim \mathrm {Hur}_{d,g,k} + 1$.

Proof. Indeed, this can be checked separately in the cases $d = 3$, $4$, and $5$.

We now check the most difficult case that $d = 5$, leaving the other cases to the reader. In the case $d = 5$, one computes

\begin{align*} n_{d,g} &= \chi(\wedge^2 \mathscr F \otimes \mathscr E \otimes \det \mathscr E^\vee) \\ &= \operatorname{rk} (\wedge^2\mathscr F \otimes \mathscr E \otimes \det \mathscr E^\vee) + \deg \wedge^2 \mathscr F \otimes \mathscr E \otimes \det \mathscr E^\vee \\ &= \binom{5}{2} \cdot 4 + 16 \deg \mathscr F - 30 \deg \mathscr E = 40 + 32 \deg \mathscr E - 30 \deg \mathscr E \\ &= 40 + 2 \deg \mathscr E \\ &= 40 + 2g + 2d - 2. \end{align*}

Furthermore, still in the $d = 5$ case, $\dim \mathrm {SL}_{\operatorname {rk} \mathscr E} = 15$ and $\dim \mathrm {SL}_{\operatorname {rk} \mathscr F} = 24$. Therefore,

\begin{align*} n_{d,g} - \dim \mathrm{SL}_{\operatorname{rk} \mathscr E} - \dim \mathrm{SL}_{\operatorname{rk} \mathscr F} &= 40 + 2g + 2d - 2 - 15 - 24 \\ &= (2g+2d-2)+1 \\ &= \dim \mathrm{Hur}_{d,g,k} + 1 \end{align*}

as claimed.

We are finally prepared to prove our main theorem. For the statement of our main theorem, recall we defined $r_{d,g} = \min (({g + c_d})/{\kappa _d}, ({g + d-1})/{d} - 4^{d-3})$ in Proposition 9.13, with $c_3 = 0, c_4 = -2$, and $c_5 = -23$. Note that for $g \gg 0$, $r_{d,g}$ is more than $ {g}/{\kappa _d}-1$.

Theorem 10.5 Let $2 \leq d \leq 5$, $k$ a field of characteristic not dividing $d!$, $\mathcal {R}$ an allowable collection of ramification profiles of degree $d$. Then,

(10.5)\begin{equation} \{\mathrm{Hur}_{d,g,k}^{{\mathcal{R}}}\} \equiv \frac{\mathbb{L}^{\dim{\mathrm{Hur}_{d,g,k}}}}{1-\mathbb{L}^{-1}} \bigg(\prod_{x \in \mathbb{P}^1_k} \bigg(\sum_{R \in \mathcal{R}} \mathbb{L}^{-r(R)}\bigg)(1-\mathbb{L}^{-1})\bigg) \end{equation}

modulo codimension $r_{d,g}$ in $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$. In the case $d = 2$, the left-hand side and right-hand side of (10.5) are actually equal in $K_0(\mathrm {Stacks}_k)$ (and not just equivalent in $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$ modulo terms of a certain dimension).

Proof. The proof of the $d = 2$ case is of a different nature and we defer it to the end of § 11. We now concentrate on the case $3 \leq d \leq 5$. By Proposition 9.13, our goal reduces to showing (9.5) agrees with the right hand side of (10.5).

Recall our notation for $n_{d,g}$ from Proposition 9.13. First, we claim we can rewrite (9.5) as

(10.6)\begin{equation} \frac{1}{\mathbb{L} - 1} \sum_{\mathscr E,\mathscr F_\bullet} \frac{1}{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr E}\}} \frac{1}{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr F_\bullet}\}} \mathbb{L}^{n_{d,g}} \prod_{x \in\mathbb{P}^1_k} \frac{(\sum_{R \in \mathcal{R}} \mathbb{L}^{-r(R)}) (\mathbb{L}-1)\mathbb{L}\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr E|_D}\} \{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr F_\bullet|_D}\}}{\mathbb{L}^{h^0 (D,\mathscr H(\mathscr E|_D, \mathscr F_\bullet|_D))}}, \end{equation}

with the summation over $\mathscr E, \mathscr F_\bullet$ interpreted as follows: $\mathscr E$ ranges over all $\mathbb {P}^1$ bundles of rank $d - 1$ and degree $g + d -1$; when $d = 3$, $\mathscr F_\bullet$ is interpreted as being empty (so all classes associated to it are $1$); when $d =4$, $\mathscr F_\bullet = \mathscr F$ has rank $2$ and degree $g + d -1$; when $d = 5$, $\mathscr F_\bullet = \mathscr F$ has rank $5$ and degree $2(g + d - 1)$. To see this we proceed as follows. For $\mathscr M = \mathscr M(\mathscr E, \mathscr F_\bullet )$, using the formula for $\operatorname {Aut}^{\mathbb {P}^1/k}_{\mathscr E, \mathscr F_\bullet }$ from Lemma 4.4, we can rewrite

(10.7)\begin{equation} \frac{1}{\{\operatorname{Aut}_{\mathscr M}\}} = \{\operatorname{Res}_{\mathbb{P}^1_k/k}(\mathbb{G}_m)\} \frac{1}{\{\operatorname{Aut}^{\mathbb{P}^1/k}_{\mathscr E}\}} \frac{1}{\{\operatorname{Aut}^{\mathbb{P}^1/k}_{\mathscr F_\bullet}\}}= \frac{1}{\mathbb{L} -1} \frac{1}{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr E}\}} \frac{1}{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr F_\bullet}\}}, \end{equation}

where we interpret $\{\operatorname {Aut}^{\mathrm {SL}}_{\mathscr F_\bullet }\} = 1$ when $d = 3$. Similarly,

(10.8)\begin{equation} \{\operatorname{Aut}_{\mathscr M|_D}\} = \{\operatorname{Res}_{D/k}(\mathbb{G}_m)\}\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr E|_D}\} \{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr F_\bullet|_D}\} = (\mathbb{L} -1)\mathbb{L}\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr E|_D}\} \{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr F_\bullet|_D}\}. \end{equation}

Hence, using (10.7) and (10.8), we can rewrite (9.5) as (10.6).

We next make a sequence of simplifications of (10.6). Then, summing over the same pairs $(\mathscr E,\mathscr F_\bullet )$ as in (10.6), we can rewrite it as

(10.9)\begin{align} & \frac{1}{\mathbb{L} - 1} \bigg(\sum_{\mathscr E} \frac{1}{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr E}\}}\bigg) \bigg(\sum_{\mathscr F_\bullet} \frac{1}{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr F_\bullet}\}}\bigg) \mathbb{L}^{n_{d,g}} \nonumber\\ &\quad \cdot \prod_{x \in \mathbb{P}^1_k} \frac{(\sum_{R \in \mathcal{R}} \mathbb{L}^{-r(R)}) (\mathbb{L}-1)\mathbb{L}\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr E|_D}\} \{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr F_\bullet|_D}\}}{\mathbb{L}^{h^0(D, \mathscr H(\mathscr E|_D, \mathscr F_\bullet|_D))}}, \end{align}

where the parenthesized sum of $\mathscr F_\bullet$ is interpreted as $1$ in the case $d = 3$, in this line and in the remainder of the proof.

Next, observe that $2 + \dim \operatorname {Aut}^{\mathrm {SL}}_{\mathscr E|_D} + \dim \operatorname {Aut}^{\mathrm {SL}}_{\mathscr F_\bullet |_D} = h^0(D, \mathscr H(\mathscr E|_D, \mathscr F_\bullet |_D))$. Indeed, this can be checked separately in the cases $d = 3$, $4$, and $5$. When $d = 3$, both sides equal $8$, when $d = 4$, both sides equal $24$, and when $d = 5$, both sides equal $80$. Therefore, we can rewrite (10.6) as

(10.10)\begin{align} & \frac{1}{\mathbb{L} - 1} \bigg(\sum_{\mathscr E} \frac{1}{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr E}\}}\bigg) \bigg(\sum_{\mathscr F_\bullet} \frac{1}{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr F_\bullet}\}}\bigg)\mathbb{L}^{n_{d,g}} \nonumber\\ &\quad \cdot\prod_{x \in \mathbb{P}^1_k} \bigg(\sum_{R \in \mathcal{R}} \mathbb{L}^{-r(R)}\bigg) \frac{(\mathbb{L} -1)\mathbb{L}}{\mathbb{L}^2}\frac{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr E|_D}\}}{\mathbb{L}^{\dim \operatorname{Aut}^{\mathrm{SL}}_{\mathscr E|_D}}} \frac{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr F_\bullet|_D}\}}{\mathbb{L}^{\dim \operatorname{Aut}^{\mathrm{SL}}_{\mathscr F_\bullet|_D}}}. \end{align}

Using multiplicativity of Euler products, as proven in Lemma 2.14, this becomes

(10.11)\begin{align} & \frac{1}{\mathbb{L} - 1} \bigg(\sum_{\mathscr E}\frac{1}{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr E}\}}\bigg) \bigg(\sum_{\mathscr F_\bullet}\frac{1}{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr F_\bullet}\}}\bigg) \mathbb{L}^{n_{d,g}} \bigg(\prod_{x \in \mathbb{P}^1_k} \bigg(\sum_{R \in \mathcal{R}}\mathbb{L}^{-r(R)}\bigg) (1-\mathbb{L}^{-1}) \bigg) \nonumber\\ &\quad \cdot \bigg(\prod_{x \in \mathbb{P}^1_k}\frac{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr E|_D}\}}{\mathbb{L}^{\dim \operatorname{Aut}^{\mathrm{SL}}_{\mathscr E|_D}}}\bigg) \cdot \bigg(\prod_{x \in\mathbb{P}^1_k} \frac{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr F_\bullet|_D}\}}{\mathbb{L}^{\dim \operatorname{Aut}^{\mathrm{SL}}_{\mathscr F_\bullet|_D}}}\bigg). \end{align}

Then, by the Tamagawa number formula for $\mathrm {SL}_n$, and its slight generalization from Lemma 10.3,

(10.12)\begin{align} \sum_{\mathscr E} \frac{1}{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr E}\}}\prod_{x \in \mathbb{P}^1_k}\frac{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr E|_D}\}}{\mathbb{L}^{\dim \operatorname{Aut}^{\mathrm{SL}}_{\mathscr E|_D}}} &= \mathbb{L}^{-\dim \mathrm{SL}_{\operatorname{rk} \mathscr E}}, \end{align}
(10.13)\begin{align} \sum_{\mathscr F_\bullet}\frac{1}{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr F_\bullet}\}}\prod_{x \in \mathbb{P}^1_k}\frac{\{\operatorname{Aut}^{\mathrm{SL}}_{\mathscr F_\bullet|_D}\}}{\mathbb{L}^{\dim \operatorname{Aut}^{\mathrm{SL}}_{\mathscr F_\bullet|_D}}} &= \mathbb{L}^{-\dim \mathrm{SL}_{\operatorname{rk} \mathscr F_\bullet}}, \end{align}

where $-\dim \mathrm {SL}_{\operatorname {rk} \mathscr F_\bullet }$ is interpreted as $0$ in the case $d = 3$. Again, in (10.12) and (10.13), the bundles have degrees as described after (10.6). Therefore, (10.11) simplifies to

(10.14)\begin{equation} \frac{1}{\mathbb{L} - 1} \mathbb{L}^{n_{d,g} - \dim \mathrm{SL}_{\operatorname{rk} \mathscr E} - \dim \mathrm{SL}_{\mathscr F_\bullet}} \prod_{x \in \mathbb{P}^1_k} \bigg(\sum_{R \in \mathcal{R}} \mathbb{L}^{-r(R)}\bigg) (1-\mathbb{L}^{-1}). \end{equation}

Hence, using Lemma 10.4, (10.15) simplifies to

(10.15)\begin{equation} \frac{1}{\mathbb{L} - 1} \mathbb{L}^{\dim \mathrm{Hur}_{d,g,k} + 1} \prod_{x \in \mathbb{P}^1_k} \bigg(\sum_{R \in \mathcal{R}} \mathbb{L}^{-r(R)}\bigg) (1-\mathbb{L}^{-1}), \end{equation}

which equals the right-hand side of (10.5).

Specializing Theorem 10.5 to the simply branched case gives the following corollary.

Corollary 10.6 For $2 \leq d \leq 5$, and $k$ a field of characteristic not dividing $d!$, in the case $\mathcal {R} = \{(1^d), (2, 1^{d-2})\}$ corresponding to simply branched curves, we have

\[ \{\mathrm{Hur}_{d,g,k}^{{\mathcal{R}}}\} \equiv \mathbb{L}^{\dim{\mathrm{Hur}_{d,g,k}}}(1-\mathbb{L}^{-2}) \]

in $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$ modulo codimension $r_{d,g}$ if $d \neq 2$, and in $K_0(\mathrm {Stacks}_k)$ when $d =2$.

Note that in the case $d = 2$, this corollary is equivalent to the statement of Theorem 10.5 and is really proven in § 11.

Proof. Simply plug in $\mathcal {R} = \{(1^d), (2, 1^{d-2})\}$ into Theorem 10.5. Then, $\sum _{R \in \mathcal {R}} \mathbb {L}^{-r(R)} = 1 + \mathbb {L}^{-1}$ and so

\begin{align*} \prod_{x \in \mathbb{P}^1_k} (1-\mathbb{L}^{-1})\bigg(\sum_{R \in \mathcal{R}}\mathbb{L}^{-r(R)}\bigg) &= \prod_{x \in \mathbb{P}^1_k} (1-\mathbb{L}^{-2}) & \\ &= \prod_{x \in \mathbb{P}^1_k} (1-\mathbb{L}^{-2}t)|_{t = 1} \\ &= \prod_{x \in \mathbb{P}^1_k} (1-t)|_{t = \mathbb{L}^{-2}}\quad \text{by Bil17,}\, \S \, \text{3.8, Property 4]}\\ &= \frac{1}{Z_{\mathbb{P}^1_k}(\mathbb{L}^{-2})} \quad \text{by [BH21, Ex. 6.1.12]}\\ &= (1- \mathbb{L}^{-1})(1-\mathbb{L}^{-2}). \end{align*}

Therefore, modulo codimension $r_{d,g}$ in $\widehat {\widetilde {K_0}}(\mathrm {Spaces}_k)$ when $d \neq 2$ (and in $K_0(\mathrm {Stacks}_k)$ when $d = 2$)

\begin{align*} \{\mathrm{Hur}_{d,g,k}^{{\mathcal{R}}}\} &\equiv \frac{\mathbb{L}^{\dim{\mathrm{Hur}_{d,g,k}}}}{(1-\mathbb{L}^{-1})} \prod_{x \in \mathbb{P}^1_k} (1-\mathbb{L}^{-1})\bigg(\sum_{R \in \mathcal{R}} \mathbb{L}^{-r(R)}\bigg) \\ &= \frac{\mathbb{L}^{\dim{\mathrm{Hur}_{d,g,k}}}}{1-\mathbb{L}^{-1}} (1- \mathbb{L}^{-1})(1-\mathbb{L}^{-2})\\ &= \mathbb{L}^{\dim{\mathrm{Hur}_{d,g,k}}} (1-\mathbb{L}^{-2}). \end{align*}

When we allow the ramification profile to be arbitrary in Theorem 10.5 we obtain the following corollary counting all degree $d$ $S_d$ Galois covers of $\mathbb {P}^1$. In the cases $d = 4$ and $d = 5$, there does not seem to be any obvious simplification of the motivic Euler product.

Corollary 10.7 For $k$ a field of characteristic not dividing $d!$,

\[ \mathrm{Hur}_{d,g,k}\equiv \begin{cases} \mathbb{L}^{\dim{\mathrm{Hur}_{2,g,k}}}(1-\mathbb{L}^{-2}) & \text{if } d = 2, \\ \mathbb{L}^{\dim{\mathrm{Hur}_{3,g,k}}} (1+\mathbb{L}^{-1}) (1 - \mathbb{L}^{-3}) & \text{if } d = 3, \\ \displaystyle\frac{\mathbb{L}^{\dim{\mathrm{Hur}_{4,g,k} }}}{(1-\mathbb{L}^{-1})} \prod_{x \in \mathbb{P}^1_k} (1+\mathbb{L}^{-2} - \mathbb{L}^{-3} - \mathbb{L}^{-4}) & \text{if } d= 4, \\ \displaystyle\frac{\mathbb{L}^{\dim{\mathrm{Hur}_{5,g,k}}}}{(1-\mathbb{L}^{-1})} \prod_{x \in \mathbb{P}^1_k} (1+\mathbb{L}^{-2}- \mathbb{L}^{-4} - \mathbb{L}^{-5}) & \text{if } d = 5, \end{cases} \]

in $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$ modulo codimension $r_{d,g}$ if $d \neq 2$, and in $K_0(\mathrm {Stacks}_k)$ when $d =2$.

Proof. The case $d = 2$ is already covered in Corollary 10.6, since $\mathrm {Hur}_{2,g,k}^{\{(1^2), (2)\}} = \mathrm {Hur}_{2,g,k}$. Taking

\[ \mathcal{R} = \{(1^4), (2,1^2), (3, 1), (2^2), (4)\} \]

the $d = 4$ case follows from plugging $\mathcal {R}$ into Theorem 10.5 and using $\mathrm {Hur}_{4,g,k}^{\mathcal {R}} = \mathrm {Hur}_{4,g,k}$. Taking $\mathcal {R} = \{(1^5), (2, 1^3), (2^2, 1), (3, 1^2), (3, 2), (4, 1), (5)\}$ the $d = 5$ case follows from plugging $\mathcal {R}$ into Theorem 10.5 and using $\mathrm {Hur}_{5,g,k}^{\mathcal {R}} = \mathrm {Hur}_{5,g,k}$. Finally, let us check the $d = 3$ case. Here, for $\mathcal {R} = \{(1^3), (2,1), (3)\}$, we have $\mathrm {Hur}_{3,g,k}^{\mathcal {R}} = \mathrm {Hur}_{3,g,k}$. Thus, by Theorem 10.5, using [Reference BiluBil17, § 3.8, Property 4] and by [Reference Bilu and HoweBH21, Ex. 6.1.12] as in the proof of Corollary 10.6,

\begin{align*} \{\mathrm{Hur}_{d,g,k}^{\mathcal{R}}\} &\equiv \frac{\mathbb{L}^{\dim{\mathrm{Hur}_{d,g,k}}}}{1-\mathbb{L}^{-1}} \prod_{x \in \mathbb{P}^1_k} (1-\mathbb{L}^{-1})\bigg(\sum_{R \in \mathcal{R}}\mathbb{L}^{-r(R)}\bigg) \\ &= \frac{\mathbb{L}^{\dim{\mathrm{Hur}_{d,g,k}}}}{1-\mathbb{L}^{-1}} \prod_{x \in \mathbb{P}^1_k} (1-\mathbb{L}^{-1})( 1 + \mathbb{L}^{-1} + \mathbb{L}^{-2}) \\ &= \frac{\mathbb{L}^{\dim{\mathrm{Hur}_{d,g,k}}}}{1-\mathbb{L}^{-1}} \prod_{x \in \mathbb{P}^1_k} (1-\mathbb{L}^{-3})\\ &= \frac{\mathbb{L}^{\dim{\mathrm{Hur}_{d,g,k}}}}{1-\mathbb{L}^{-1}} \frac{1}{Z_{\mathbb{P}^1_k}(\mathbb{L}^{-3})}\\ &= \frac{\mathbb{L}^{\dim{\mathrm{Hur}_{d,g,k}}}}{1-\mathbb{L}^{-1}} (1-\mathbb{L}^{-2})(1-\mathbb{L}^{-3})\\ &= \mathbb{L}^{\dim{\mathrm{Hur}_{d,g,k}}} (1+\mathbb{L}^{-1}) (1-\mathbb{L}^{-3}), \end{align*}

where we work in $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$ modulo codimension $r_{d,g}$.

11. Degree 2

Following the notation introduced in [Reference Arsie and VistoliAV04], let $\mathbb {A}_{sm}(1,n) \subset \operatorname {\operatorname {Spec}} (\operatorname {Sym}^\bullet H^0(\mathbb {P}^1, \mathscr O(n)^\vee ))$ denote the open subscheme parameterizing those degree $n$ forms on $\mathbb {P}^1$ whose associated closed subscheme is reduced.

Lemma 11.1 For $k$ a field with $\operatorname {\operatorname {char}} k \neq 2$, there is an isomorphism of stacks $\mathrm {Hur}_{2,g,k} \simeq [\mathbb {A}_{sm}(1,2g+2)/\mathbb {G}_m]$, for an appropriate action of $\mathbb {G}_m$ on $\mathbb {A}_{sm}(1,2g+2)$.

Remark 11.2 This can be deduced from the proofs of [Reference Arsie and VistoliAV04, Theorem 4.1, Corollary 4.7], though there the authors work with a further quotient by the $\operatorname {PGL}_2$ action on the base $\mathbb {P}^1$. The $\mathbb {G}_m$ action on $\mathbb {A}_{sm}(1,n)$ in Lemma 11.1 is explicitly given by $\alpha \cdot f(x) = \alpha ^{-2} f(x)$, though we will not need this in what follows.

Proof. First, we verify that $\mathrm {Hur}_{2,g,k}$ is equivalent to the fibered category whose $S$-points parameterize pairs $(\mathscr L, i: \mathscr L^{\otimes 2} \rightarrow \mathscr O_{\mathbb {P}^1_S})$, for $\mathscr L$ a degree-($-g - 1$) invertible sheaf on $\mathbb {P}^1_S$, and $i$ an injective homomorphism of sheaves. Indeed, to connect this to our given definition of $\mathrm {Hur}_{2,g,k}$, we follow [Reference Arsie and VistoliAV04, Remark 3.3 and Proposition 3.1]: given a cover $\rho : H \rightarrow \mathbb {P}^1_S$, we have a natural action of $\mu _2$ on $H$ over $\mathbb {P}^1$. This comes from the isomorphism $\mu _2 \simeq \mathbb {Z}/2\mathbb {Z}$ as we are assuming $\operatorname {\operatorname {char}}(k) \neq 2$. From this action, we obtain an isomorphism $\rho _* \mathscr O_H \simeq \mathscr O_{\mathbb {P}^1_S} \oplus \mathscr L$, for $\mathscr L$ the subsheaf on which $\mu _2$ acts by $(t,s) \mapsto t \cdot s$, i.e. $\mathscr L$ is the weight-$1$ eigenspace of $\mu _2$, and $\mathscr O_{\mathbb {P}^1_S}$ is the weight-$0$ eigenspace. The description of $\mathscr L$ as the $1$ eigenspace for the $\mu _2$ action yields a map $i: \mathscr L \otimes \mathscr L \rightarrow \mathscr O$. In the other direction, given $(\mathscr L, i: \mathscr L^{\otimes 2} \rightarrow \mathscr O_{\mathbb {P}^1_S})$, we can recover $H = \operatorname {\operatorname {Spec}}_{\mathscr O_{\mathbb {P}^1_S}}(\mathscr O_{\mathbb {P}^1_S} \oplus \mathscr L)$. The given maps respect automorphisms over $\mathbb {P}^1$, as the only nontrivial automorphism in both cases is given by the hyperelliptic involution. Hence, they define an equivalence of algebraic stacks.

Next, consider the cover $\widetilde {\mathrm {Hur}_{2,g,k}}$ of $\mathrm {Hur}_{2,g,k}$ given as the stackification of the fibered category whose $S$ points parameterize triples $(\mathscr L, \phi : \mathscr L \simeq \mathscr O(-g-1), i: \mathscr L^{\otimes 2} \rightarrow \mathscr O)$, with $i$ injective. Note that $\widetilde {\mathrm {Hur}_{2,g,k}} \to \mathrm {Hur}_{2,g,k}$ is indeed surjective because $\mathscr L \simeq \mathscr E^\vee$ is a degree-($-g-1$) line bundle on $\mathbb {P}^1$ by Lemma 6.3. Observe that $\widetilde {\mathrm {Hur}_{2,g,k}}$ has a natural action of $\mathbb {G}_m$ acting by automorphisms of $\mathscr L$, so that $\mathrm {Hur}_{d,g,k} = [\widetilde {\mathrm {Hur}_{d,g,k}}/\mathbb {G}_m]$. Said another way, quotienting by $\mathbb {G}_m$ forgets the data of the isomorphism $\phi$.

It remains to identify $\widetilde {\mathrm {Hur}_{2,g,k}}$ with $\mathbb {A}_{sm}(1,2g+2)$. Indeed, this was done in the course of the proof of [Reference Arsie and VistoliAV04, Theorem 4.1]. Briefly, given an $S$-point $(\mathscr L, \phi, i)$, associate the map $i \circ (\phi ^{-1})^{\otimes 2} : \mathscr O_{\mathbb {P}^1_S}(-2g-2) \rightarrow \mathscr O_{\mathbb {P}^1_S}$ corresponding to a section of $H^0(\mathbb {P}^1_S, \mathscr O(2g+2))$. Conversely, given a section $f \in H^0(\mathbb {P}^1_S, \mathscr O(2g+2))$, associate the triple $(\mathscr O(-g-1), \operatorname {id}: \mathscr O(-g-1) \rightarrow \mathscr O(-g-1), f: \mathscr O(-g-1)^{\otimes 2} \rightarrow \mathscr O)$.

We are now ready to prove Theorem 10.5 in the case $d = 2$.

11.3 Proof of $d=2$ case of Theorem 10.5

Note that the only allowable collection of ramification profiles is $\mathcal {R} = \{ (2), (1,1)\}$. Since $[\mathrm {Hur}_{2,g,k} \simeq \mathbb {A}_{sm}(1,2g+2)/\mathbb {G}_m]$ by Lemma 11.1, and $\mathbb {G}_m$ is special, we have $\{\mathrm {Hur}_{2,g,k}\}\{\mathbb {G}_m\} = \mathbb {A}_{sm}(1,2g+2)$. Since

\begin{align*} \{\mathbb{G}_m\} \frac{\mathbb{L}^{\dim{\mathrm{Hur}_{2,g,k}}}}{1-\mathbb{L}^{-1}} \bigg(\prod_{x \in \mathbb{P}^1_k} \bigg( \sum_{R \in \mathcal{R}} \mathbb{L}^{-r(R)} \bigg) (1-\mathbb{L}^{-1}) \bigg) &= \frac{\mathbb{L}-1}{1-\mathbb{L}^{-1}} \cdot \mathbb{L}^{2g+2} \prod_{x \in \mathbb{P}^1_k} (1-\mathbb{L}^{-2}) \\ &= \mathbb{L}^{2g+3}\frac{1}{Z_{\mathbb{P}^1_k}(\mathbb{L}^{-2})} \\ &= \mathbb{L}^{2g+3} (1-\mathbb{L}^{-1})(1-\mathbb{L}^{-2}), \end{align*}

(by [Reference Bilu and HoweBH21, Ex. 6.1.12] and [Reference BiluBil17, § 3.8, Property 4], as in the proof of Corollary 10.6) it suffices to verify

\[ \mathbb{A}_{sm}(1,2g+2)= \mathbb{L}^{2g+3} (1-\mathbb{L}^{-1})(1-\mathbb{L}^{-2}). \]

Indeed, this follows from [Reference Vakil and WoodVW15, Lemma 5.9(a)]. In a bit more detail, taking $a =2$ in [Reference Vakil and WoodVW15, Lemma 5.9(a)], the expression $K_{<2}(t)$ there is the generating function for which the coefficient of $t^n$ is the class of $w_{1^n}$ in the notation of [Reference Vakil and WoodVW15, (5.1)]. Here, $w_{1^a}$ is the class of the space of degree $n$ reduced divisors on $\mathbb {P}^1$. Therefore, $w_{1^n} = \{[\mathbb {A}_{sm}(1, n)/\mathbb {G}_m]\}$, and so we only need check $\{w_{1^n}\} = \mathbb {L}^n - \mathbb {L}^{n-2}$. But, indeed, this is the coefficient of $t^n$ in the expansion of

\[ \frac{Z_{\mathbb{P}^1}(t)}{Z_{\mathbb{P}^1}(t^2)} = \frac{(1-t^2 \mathbb{L})(1-t^2)}{(1-t\mathbb{L})(1-t)} = (1-t^2 \mathbb{L})(1+t)\bigg(\sum_{i=0}^\infty (t\mathbb{L})^i\bigg).\]

Remark 11.4 The construction above used to compute the class of $\mathrm {Hur}_{2,g,k}$ is admittedly fairly ad hoc in the context of this paper. A similar construction, more in line with the themes of this paper could be obtained by realizing a given hyperelliptic curve $\rho : H \rightarrow \mathbb {P}^1$ as a subscheme of $\mathbb {P} ((\rho _* \mathscr O_H)^\vee )$. One can verify that $\mathbb {P} (\rho _* \mathscr O_H^\vee )$ is fppf locally isomorphic to $\mathbb {P}(\mathscr O_{\mathbb {P}^1} \oplus \mathscr O_{\mathbb {P}^1}(g+1))$, and use this to deduce that $\mathrm {Hur}_{2,g,k}$ is the quotient of the smooth members of a certain linear series on $\mathbb {P}\big ( \mathscr O_{\mathbb {P}^1} \oplus \mathscr O_{\mathbb {P}^1}(g+1) \big )$ by the automorphisms of $\mathbb {P}(\mathscr O_{\mathbb {P}^1_k} \oplus \mathscr O_{\mathbb {P}^1_k}(g+1))$ preserving the projection to $\mathbb {P}^1_k$, and then use this description to compute $\{\mathrm {Hur}_{2,g,k}\}$, obtaining a formula similar to that of Theorem 10.5. However, such a proof would only calculate the class in $\widehat {\widetilde {K_0}}(\mathrm {Stacks}_k)$ modulo a certain codimension, as opposed to the proof we give here, which actually calculates the class in $K_0(\mathrm {Stacks}_k)$.

Acknowledgements

We thank Hannah Larson as well as multiple tremendously meticulous and helpful referees for carefully reading the paper and offering especially detailed comments. We also thank Manjul Bhargava, Margaret Bilu, Samir Canning, Gianfranco Casnati, Jordan Ellenberg, Sean Howe, Nikolas Kuhn, Anand Patel, Bjorn Poonen, Will Sawin, Federico Scavia, Craig Westerland, Takehiko Yasuda, and Wei Zhang for helpful discussions related to this paper.

Conflicts of interest

None.

Financial support

AL was supported by the National Science Foundation under Award No. DMS 2102955. RV was partly supported by NSF grant DMS-1601211. MMW was partly supported by a Packard Fellowship for Science and Engineering, a NSF Waterman Award DMS-2140043, and NSF CAREER grant DMS-2052036.

Journal information

Compositio Mathematica is owned by the Foundation Compositio Mathematica and published by the London Mathematical Society in partnership with Cambridge University Press. All surplus income from the publication of Compositio Mathematica is returned to mathematics and higher education through the charitable activities of the Foundation, the London Mathematical Society and Cambridge University Press.

Appendix A. A proof of a theorem of Ekedahl

Aaron Landesman and Federico Scavia

The main result of this appendix is a proof of the following Theorem of T. Ekedahl. We retain the notation for the Grothendieck ring of stacks described in § 1.20.

Theorem A.1 (Ekedahl [Reference EkedahlEke09b, Theorem 4.3])

Let $k$ be a field. Then, for all integers $n \geq 1$, $\{BS_n\}=1$ in $K_0(\mathrm {Stacks}_k)$.

Unfortunately, Ekedahl passed away prior to publishing [Reference EkedahlEke09b], and so the article was never refereed. There are a number of typos and errors appearing in the proof of [Reference EkedahlEke09b, Theorem 4.3]. The objective of this appendix is to point out the fixes necessary.

Let $k$ be a field, let $G$ be a finite group, and let $V$ be a $G$-representation of dimension $d\geq 0$ over $k$. If $H$ is a subgroup of $G$, we denote by $V^H$ the subscheme of $V$ fixed by $H$, and by $V_H$ the locally closed subscheme parameterizing the locus whose stabilizer is exactly $H$. If there is a point of $V$ whose stabilizer is exactly $H$, we call $H$ a stabilizer subgroup of $G$. The normalizer $N_G(H)$ of $H$ acts on $V^H$ and $V_H$, and $V_H$ is an open subscheme of $V^H$. By definition, a stabilizer flag of length $n$ is a sequence

\[ f = (\{e\}=:H_0 \subset H_1 \subset \cdots \subset H_n) \]

of subgroups of $G$ such that, for all $0\leq i\leq n-1$, $H_{i+1}$ is a stabilizer subgroup of the $G$-action on $V$. We say that $f$ is strict if $H_i\subsetneq H_{i+1}$ for all $i$. We set $n_f:=n$, $H_f:=H_n$, $d_f:=\dim V^{H_f}$ and $N_G(f):=\cap _{0\leq i\leq n} N_G(H_i)$.

Remark A.2 Our definition of stabilizer flag differs from that used by Ekedahl [Reference EkedahlEke09b, p. 10], as he required that $H_{i+1}$ be a stabilizer subgroup of the action of $\cap _{j\leq i}N_G(H_i)$ on $V^{H_i}$. In particular, in our definition it is not necessarily true that $H_f\subset N_G(f)$.

The conjugation action of $G$ on itself induces a $G$-action on the collection of all stabilizer flags. We say that two stabilizer flags are conjugate to each other if they belong to the same orbit under this action.

Proposition A.3 Let $K\subset G$ be the kernel of the $G$-action on $V$. We have

(A.1)\begin{equation} \{BG\}\mathbb{L}^d=\{[V_K/G]\}-\sum_f (-1)^{n_f}\{BN_G(f)\}\mathbb{L}^{d_f}, \end{equation}

where $f$ runs over a set of representatives of conjugacy classes of strict stabilizer flags of length $n_f\geq 1$.

Proposition A.3 corrects [Reference EkedahlEke09b, Theorem 3.4]. The formula there looks the same as ours (up to signs), but it is wrong as it is claimed with a different definition of stabilizer flag. The error there stems from the falsity of [Reference EkedahlEke09b, Lemma 3.3(iv)], as illustrated by Example A.4 below.

We note that the proof of Proposition A.3 follows similar lines to that of [Reference EkedahlEke09b, Theorem 3.4]. In particular, it uses results from [Reference EkedahlEke09b, Lemma 3.3(i), (ii), and (iii)] even though [Reference EkedahlEke09b, Lemma 3.3(iv)] is incorrect. It may be helpful for the reader to consult these statements.

Example A.4 The result [Reference EkedahlEke09b, Lemma 3.3(iv)] claims that $V_H = (V^H)_H$, where $V^H$ is considered as an $N_G(H)$ representation. However, when $G = S_3$ and $H$ is the subgroup generated by $(12)$, and $G$ acts as the three-dimensional permutation representation, then $V_H = \{(a,a,b) : a \neq b\}$, while $N_G(H) = H$ and $V^H = \{(a,a,b)\}$. Thus, here, when $V^H$ is considered as an $N_G(H) = H$ representation, we have that $H$ acts trivially and $(V^H)_H = V^H \neq V_H$.

Since [Reference EkedahlEke09b, Lemma 3.3(iv)] is implicitly used in the proof of [Reference EkedahlEke09b, Theorem 3.4], [Reference EkedahlEke09b, Theorem 3.4] is also incorrect. To produce a counterexample to the statement of [Reference EkedahlEke09b, Theorem 3.4] (even after correcting the $+$ sign appearing in the statement there to the $-$ sign of (A.1)), we can again take $G = S_3$. Then, the only strict stabilizer flags in the sense of [Reference EkedahlEke09b, p. 10] (which are defined in a slightly different way than in this appendix) up to conjugacy are $\{e\},\{e\} \subset S_2, \{e\} \subset S_3$. In this case, with our knowledge that $\{B S_3\} = 1$, the formula of [Reference EkedahlEke09b, Theorem 3.4] claims $\mathbb {L}^3 = (\mathbb {L}^3 - \mathbb {L}^2) + (\mathbb {L}^2) + (\mathbb {L})$. Of course, what is missing from this formula is that we should subtract off a term $\mathbb {L}$ coming from the sequence of subgroups $\{e\} \subset S_2 \subset S_3$, which is a stabilizer flag in the sense of this appendix, but not in the sense of [Reference EkedahlEke09b, p. 10].

Proof of Proposition A.3 Let $f$ be a strict stabilizer flag. We have

\[ [V^{H_f}/N_G(f)] - [V_{H_f}/N_G(f)] =\bigg[\coprod_{\substack{H \subset G \\ g\in N_G(f)/(N_G(f)\cap N_G(H))}} V_{gHg^{-1}}/N_G(f)\bigg] \]

where, on the right-hand side, $H$ runs among a set of representatives of $N_G(f)$-conjugacy classes of subgroups of $G$ acting on $V$ and properly containing $H_f$.

For any fixed $H \subset G$, we have

\[ \bigg[\coprod_{g\in N_G(f)/(N_G(f)\cap N_G(H))}V_{gHg^{-1}}/N_G(f)\bigg]=[V_{H}/N_G(f)\cap N_G(H)]. \]

For any such $H$, construct a strict stabilizer flag $f'$ by appending $H$ at the end of $f$. Then

\[ N_G(f)\cap N_G(H)=N_G(f'). \]

We conclude

(A.2)\begin{equation} \{[V_{H_f}/N_G(f)]\}=\{BN_G(f)\}\mathbb{L}^{d_f}-\sum_{f'}\{[V_{H_{f'}}/N_G(f')]\}, \end{equation}

where $f'$ runs over a set of representatives of conjugacy classes of strict stabilizer flags of length $n_f+1$ and starting with $f$.

We now wish to prove by induction on $m\geq 1$ that

(A.3)\begin{equation} \{BG\}\mathbb{L}^d=\{[V_K/G]\}-\sum_{0< n_f< m}(-1)^{n_f}\{BN_G(f)\}\mathbb{L}^{d_f}-(-1)^m\sum_{n_f=m}\{[V_{H_f}/N_G(f)]\}, \end{equation}

where $f$ runs among a set of representatives of conjugacy classes of strict stabilizer flags. When $m=1$, (A.3) coincides with (A.2) for $f=(K)$. Assume now that (A.3) holds for some $m>1$. One obtains the formula for $m+1$, by starting from the formula for $m$ and applying (A.2) to every flag $f$ of length $m$.

Since $G$ is finite, there are only finitely many strict stabilizer flags. The conclusion follows by choosing $m$ to be larger than the length of every strict stabilizer flag.

Having replaced [Reference EkedahlEke09b, Theorem 3.4] by Proposition A.3, the proof of Theorem A.1 can be completed as in [Reference EkedahlEke09b]. From now on, let $G=S_n$ be the group of permutations of $\Sigma :=\{1,2,\ldots,n\}$, and let $V$ be the $n$-dimensional permutation representation of $S_n$.

A flag is a pair $(S,R)$, where $S$ is a finite set, and $R$ is a sequence $R_1\subset R_2\subset \cdots \subset R_n$ of equivalence relations $R_i\subset S\times S$ on $S$. An isomorphism of flags $(S',R')\to (S,R)$ is a bijection $S'\xrightarrow {\sim } S$ sending $R'_i$ to $R_i$ for all $i$. We denote by $N_R(S)$ the automorphism group of $(S,R)$.

Lemma A.5 Assume that $G=S_n$ and that $V$ is the standard $n$-dimensional representation of $S_n$. Let $f$ be a strict stabilizer flag, and denote by $H_i$ the stabilizer subgroups appearing in $f$. For every $i$, let $R_i$ be the equivalence relation determined by the orbit partition of the $H_i$-action on $\Sigma$, and let $R$ be the flag on $\Sigma$ given by the $R_i$.

  1. (a) We have $N_{S_n}(f)=N_R(\Sigma )$.

  2. (b) If $N_{S_n}(f)=S_n$, then either $f=(\{e\})$ or $f=(\{e\}\subset S_n)$.

  3. (c) Assume that $\{BS_m\}=1$ for all $m< n$ and that $N_{S_n}(f)\neq S_n$. Then $\{BN_{S_n}(f)\}=1$.

Proof. (a) This follows from the fact that, for every $i$, a bijection $\sigma$ of $\Sigma$ respects $R_i$ if and only if it normalizes $H_i$.

(b) If $N_{S_n}(f)=S_n$, then for every $i$, $R_i$ is respected by every bijection of $\Sigma$. It follows that either $R_i$ is the diagonal in $\Sigma \times \Sigma$ or $R_i=\Sigma \times \Sigma$. Now part (b) follows from part (a).

(c) We may assume that $\{BN_{S_n}(f')\}=1$ for all flags such that $n_{f'}< n_f$. By [Reference EkedahlEke09b, Proposition 4.2], $N_{S_n}(f)$ is a direct product of wreath products $N'\wr S_r:=(N')^r\rtimes S_r$, where $N'$ is the normalizer of a flag of smaller length, and $S_r$ acts by permutation of the $r$ factors $N'$.

In what follows, we use the symbol $\operatorname {Symm}$ for the stacky symmetric power as introduced in [Reference EkedahlEke09b, p. 5]. We also use the symbol $\wr$ for wreath product. This was introduced and notated $\int$ in [Reference EkedahlEke09b, p. 5], but we use $\wr$ instead of $\int$ in order to keep our notation consistent with the rest of the paper.

Because, for $G$ and $H$ finite groups, $B(G \times H) \simeq BG \times BH$, it suffices to show $\{ B(N' \wr S_r)\} = 1$. We have $B(N' \wr S_r)\simeq BN'\wr BS_r \simeq \operatorname {Symm}^r(BN')$, as explained in [Reference EkedahlEke09b, p. 5], By inductive assumption, $\{B(N'\wr S_r)\}=\sigma _s^t(\{BN'\})=\sigma _s^t(1)=1$. For the symbol $\sigma _s^t$, see [Reference EkedahlEke09b, Proposition 2.5].

Proof of Theorem A.1 Let $V$ be the $n$-dimensional permutation representation of $S_n$ over $k$, and let $U:=V_{\{e\}}\subset V$ be the free locus of the $S_n$-action. By Proposition A.3,

\[ \{BS_n\}\mathbb{L}^n=\{U/S_n\}-\sum_f(-1)^{n_f}\{BN_{S_n}(f)\}\mathbb{L}^{d_f}, \]

where $f$ runs among conjugacy classes of strict stabilizer flags. By Lemma A.5(b), we may rewrite this as

\[ \{BS_n\}(\mathbb{L}^n-\mathbb{L})=\{U/S_n\}-\sum_f(-1)^{n_f}\{BN_{S_n}(f)\}\mathbb{L}^{d_f}, \]

where now $f$ runs among conjugacy classes of strict stabilizer flags such that $N_{S_n}(f)\neq S_n$. By Lemma A.5(c), we have $\{BN_{S_n}(f)\}=1$ for all such $f$.

We claim that $\{U/S_n\}$ is a polynomial in $\mathbb {L}$ with integer coefficients. The stacks $V_H/N_{S_n}(H)$ are isomorphic to parts of a locally closed stratification of $V/S_n$. This is well known from general principles when $\operatorname {char}k=0$ or when $\operatorname {char}k>0$ does not divide $n$, but Ekedahl gave a proof in arbitrary characteristic in [Reference EkedahlEke09b, Proposition 1.1(ii)].

To show $\{U/S_n\}$ is a polynomial in $\mathbb {L}$, let $f$ be a strict stabilizer flag. Then, as in the proof of Proposition A.3, we have

\[ \{V_{H_f}/N_{S_n}(f)\}=\{V^{H_f}/N_{S_n}(f)\}-\sum_{f'}\{V_{H_{f'}}/N_{S_n}(f')\}, \]

where $f'$ runs among conjugacy classes of strict stabilizer flags starting with $f$ and of length $n_f+1$.

Applying the previous formula iteratively, we obtain

\[ \{V/S_n\}=\{U/S_n\}-\sum_f (-1)^{n_f}\{V^{H_f}/N_{S_n}(f)\}, \]

where $f$ runs among conjugacy classes of strict stabilizer flags of positive length. For every flag $f$, we claim that the quotient $W_f:=N_{S_n}(f)/(H_f\cap N_{S_n}(f))$ is a product of symmetric groups, and $V^{H_f}$ is a permutation representation of $W_f$. To see this, note that $N_{S_n}(f)$ can be identified with $N_{R_f}(\Sigma )$ via Lemma A.5 for a sequence of equivalence relations $R_f$ given as $R_1 \subset R_2 \subset \cdots \subset R_{n_f}$. Under this identification, $H_f$ is identified with the subgroup of permutations acting trivially on the equivalence classes defined by $R_{n_f}$. Therefore, the action of $W_f$ on $V^{H_f}$ is generated by permutations switching two equivalence classes of $R_{n_f}$ for which there exists an isomorphism of those two classes respecting $R$. Therefore, $W_f$ is a product of symmetric groups acting by a permutation representation on $V^{H_f}$. Hence, by the fundamental theorem for symmetric polynomials, $V^{H_f}/N_{S_n}(f)=V^{H_f}/W_f$ is an affine space over $k$. Since $V/S_n$ is also isomorphic to affine space, we deduce that $\{U/S_n\}$ is a polynomial in $\mathbb {L}$, as claimed. We conclude that $\{BS_n\}$ can be written as a rational function in $\mathbb {L}$ with integer coefficients, and with denominator $\mathbb {L}^n-\mathbb {L}$. By [Reference EkedahlEke09b, Lemma 3.5], this implies that $\{BS_n\}=1$.

Footnotes

With an appendix by Aaron Landesman and Federico Scavia

References

Ali Altug, S., Shankar, A., Varma, I. and Wilson, K. H., The number of $D_4$-fields ordered by conductor, J. Eur. Math. Soc. (JEMS) 23 (2021), 27332785.CrossRefGoogle Scholar
Aoki, M., Hom stacks, Manuscripta Math. 119 (2006), 3756.CrossRefGoogle Scholar
Aoki, M., Erratum: ‘Hom stacks’ [Manuscripta Math. 119 (2006), no. 1, 37–56; MR2194377], Manuscripta Math. 121 (2006), 135.CrossRefGoogle Scholar
Arsie, A. and Vistoli, A., Stacks of cyclic covers of projective spaces, Compos. Math. 140 (2004), 647666.CrossRefGoogle Scholar
Behrend, K. and Dhillon, A., On the motivic class of the stack of bundles, Adv. Math. 212 (2007), 617644.CrossRefGoogle Scholar
Behrend, K. A., The Lefschetz trace formula for the moduli stack of principal bundles, PhD thesis, University of California, Berkeley (ProQuest LLC, Ann Arbor, MI, 1991).Google Scholar
Bertin, J. and Romagny, M., Champs de Hurwitz, Mém. Soc. Math. Fr. (N.S.) 125–126 (2011), 219.Google Scholar
Bhargava, M., Higher composition laws III: The parametrization of quartic rings, Ann. Math. 159 (2004), 13291360.CrossRefGoogle Scholar
Bhargava, M., The density of discriminants of quartic rings and fields, Ann. Math. 162 (2005), 10311063.CrossRefGoogle Scholar
Bhargava, M., Mass formulae for extensions of local fields, and conjectures on the density of number field discriminants, Int. Math. Res. Not. IMRN 2007 (2007), rnm052.CrossRefGoogle Scholar
Bhargava, M., Higher composition laws. IV. The parametrization of quintic rings, Ann. of Math. (2) 167 (2008), 5394.CrossRefGoogle Scholar
Bhargava, M., The density of discriminants of quintic rings and fields, Ann. Math. 172 (2010), 15591591.CrossRefGoogle Scholar
Bhargava, M., The geometric sieve and the density of squarefree values of invariant polynomials, Preprint (2014), arXiv:1402.0031v1.Google Scholar
Bhargava, M., Shankar, A. and Tsimerman, J., On the Davenport–Heilbronn theorems and second order terms, Invent. Math. 193 (2013), 439499.CrossRefGoogle Scholar
Bhargava, M., Shankar, A. and Wang, X., Geometry-of-numbers methods over global fields I: Prehomogeneous vector spaces, Preprint (2015), arXiv:1512.03035v1.Google Scholar
Bhargava, M., Taniguchi, T. and Thorne, F., Improved error estimates for the Davenport–Heilbronn theorems, Preprint (2021), arXiv:2107.12819.Google Scholar
Bilu, M., Produits eulériens motiviques, PhD thesis, Université Paris-Saclay (ComUE) (2017).Google Scholar
Bilu, M., Das, R. and Howe, S., Zeta statistics and Hadamard functions, Adv. Math. 407 (2022), 108556.CrossRefGoogle Scholar
Bilu, M. and Howe, S., Motivic Euler products in motivic statistics, Algebra Number Theory 15 (2021), 21952259.CrossRefGoogle Scholar
Bolognesi, M. and Vistoli, A., Stacks of trigonal curves, Trans. Amer. Math. Soc. 364 (2012), 33653393.CrossRefGoogle Scholar
Bosch, S., Lütkebohmert, W. and Raynaud, M., Néron models, Ergebnisse der Mathematik und ihrer Grenzgebiete (3) [Results in Mathematics and Related Areas (3)], vol. 21 (Springer, Berlin, 1990).CrossRefGoogle Scholar
Buchsbaum, D. A. and Eisenbud, D., Algebra structures for finite free resolutions, and some structure theorems for ideals of codimension $3$, Amer. J. Math. 99 (1977), 447485.CrossRefGoogle Scholar
Canning, S. and Larson, H., Chow rings of low-degree Hurwitz spaces, J. Reine Angew. Math. 789 (2022), 103152.CrossRefGoogle Scholar
Canning, S. and Larson, H., Tautological classes on low-degree Hurwitz spaces, Int. Math. Res. Not. IMRN 2024 (2024), rnac360.CrossRefGoogle Scholar
Casnati, G., Covers of algebraic varieties II. Covers of degree 5 and construction of surfaces, J. Algebraic Geom. 5 (1996), 461478.Google Scholar
Casnati, G. and Ekedahl, T., Covers of algebraic varieties I. A general structure theorem, covers of degree 3, 4 and Enriques surfaces, J. Algebraic Geom. 5 (1996), 439460.Google Scholar
Casnati, G. and Notari, R., On some Gorenstein loci in $Hilb_6(\Bbb P^4_k)$, J. Algebra 308 (2007), 493523.CrossRefGoogle Scholar
Cohen, H., Diaz y Diaz, F. and Olivier, M., Enumerating quartic dihedral extensions of $\mathbb {Q}$, Compos. Math. 133 (2002), 6593.CrossRefGoogle Scholar
Conrad, B., Grothendieck duality and base change, Lecture Notes in Mathematics, vol. 1750 (Springer, Berlin, 2000).CrossRefGoogle Scholar
Datskovsky, B. and Wright, D. J., Density of discriminants of cubic extensions, J. Reine Angew. Math. 386 (1988), 116138.Google Scholar
Davenport, H. and Heilbronn, H., On the density of discriminants of cubic fields, Bull. Lond. Math. Soc. 1 (1969), 345348.CrossRefGoogle Scholar
Davenport, H. and Heilbronn, H., On the density of discriminants of cubic fields. II, Proc. Roy. Soc. Lond. Ser. A. Math. Phys. Eng. Sci. 322 (1971), 405420.Google Scholar
Delone, B. N. and Faddeev, D. K., The theory of irrationalities of the third degree, Translations of Mathematical Monographs, vol. 10 (American Mathematical Society, Providence, RI, 1964).Google Scholar
Deopurkar, A., Compactifications of Hurwitz spaces, Int. Math. Res. Not. IMRN 2014 (2014), 38633911.CrossRefGoogle Scholar
Deopurkar, A. and Patel, A., The Picard rank conjecture for the Hurwitz spaces of degree up to five, Algebra Number Theory 9 (2015), 459492.CrossRefGoogle Scholar
Eisenbud, D., Commutative algebra: With a view toward algebraic geometry, Graduate Texts in Mathematics, vol. 150 (Springer, New York, 1995).CrossRefGoogle Scholar
Eisenbud, D. and Harris, J., 3264 and all that—a second course in algebraic geometry (Cambridge University Press, Cambridge, 2016).CrossRefGoogle Scholar
Ekedahl, T., The Grothendieck group of algebraic stacks, Preprint (2009), arXiv:0903.3143v2.Google Scholar
Ekedahl, T., A geometric invariant of a finite group, Preprint (2009), arXiv:0903.3148v1.Google Scholar
Ellenberg, J. S., Venkatesh, A. and Westerland, C., Homological stability for Hurwitz spaces and the Cohen-Lenstra conjecture over function fields, Ann. of Math. (2) 183 (2016), 729786.CrossRefGoogle Scholar
Fantechi, B., Göttsche, L., Illusie, L., Kleiman, S. L., Nitsure, N. and Vistoli, A., Fundamental algebraic geometry: Grothendieck's FGA explained, Mathematical Surveys and Monographs, vol. 123 (American Mathematical Society, Providence, RI, 2005),Google Scholar
Fantechi, B. and Pandharipande, R., Stable maps and branch divisors, Compos. Math. 130 (2002), 345364.CrossRefGoogle Scholar
Fulton, W., Hurwitz schemes and irreducibility of moduli of algebraic curves, Ann. Math. 90 (1969), 542575.CrossRefGoogle Scholar
Gaitsgory, D. and Lurie, J., Weil's conjecture for function fields. Vol. 1, Annals of Mathematics Studies, vol. 199 (Princeton University Press, Princeton, NJ, 2019).Google Scholar
Grothendieck, A., Éléments de géométrie algébrique. IV. Étude locale des schémas et des morphismes de schémas. II, Publ. Math. Inst. Hautes Études Sci. 24 (231), 1965.Google Scholar
Grothendieck, A., Éléments de géométrie algébrique. IV. Étude locale des schémas et des morphismes de schémas. III, Publ. Math. Inst. Hautes Études Sci. 28 (255), 1966.Google Scholar
Grothendieck, A., Le groupe de Brauer. III. Exemples et compléments, in Dix exposés sur la cohomologie des schémas, Advanced Studies in Pure Mathematics, vol. 3 (North-Holland, Amsterdam, 1968), 88188.Google Scholar
Gusein-Zade, S. M., Luengo, I. and Melle-Hernàndez, A., A power structure over the Grothendieck ring of varieties, Math. Res. Lett. 11 (2004), 4957.CrossRefGoogle Scholar
Hatcher, A., A short exposition of the Madsen-Weiss theorem, Preprint (2011), arXiv:1103.5223.Google Scholar
Larson, H. K., The intersection theory of the moduli stack of vector bundles on $\mathbb {P}^1$, Canad. Math. Bull. (2021), 121.Google Scholar
Madsen, I. and Weiss, M., The stable moduli space of Riemann surfaces: Mumford's conjecture, Ann. of Math. (2) 165 (2007), 843941.CrossRefGoogle Scholar
Miranda, R., Triple covers in algebraic geometry, Amer. J. Math. 107 (1985), 11231158.CrossRefGoogle Scholar
Olsson, M., Algebraic spaces and stacks, AMS Colloquium Publications, vol. 62 (American Mathematical Society, Providence, RI, 2016).Google Scholar
Pardini, R., Triple covers in positive characteristic, Ark. Mat. 27 (1989), 319341.CrossRefGoogle Scholar
Patel, A. P., The geometry of Hurwitz space, PhD thesis, Harvard University (2013).Google Scholar
Roberts, D., Density of cubic field discriminants, Math. Comput. 70 (2001), 16991705.CrossRefGoogle Scholar
Poonen, B., The moduli space of commutative algebras of finite rank, J. Eur. Math. Soc. (JEMS) (2008), 817836.CrossRefGoogle Scholar
Rydh, D., Étale dévissage, descent and pushouts of stacks, J. Algebra 331 (2011), 194223.CrossRefGoogle Scholar
Schreyer, F.-O., Syzygies of canonical curves and special linear series, Math. Ann. 275 (1986), 105137.CrossRefGoogle Scholar
The Stacks Project Authors, Stacks Project, http://stacks.math.columbia.edu.Google Scholar
Taniguchi, T. and Thorne, F., Secondary terms in counting functions for cubic fields, Duke Math. J. 162 (2013), 24512508.CrossRefGoogle Scholar
Vakil, R. and Wood, M. M., Discriminants in the Grothendieck ring, Duke Math. J. 164 (2015), 11391185.CrossRefGoogle Scholar
Wewers, S., Construction of Hurwitz spaces, Dissertation, Universität Essen (IEM, 1998).Google Scholar
Wood, M. M., Parametrizing quartic algebras over an arbitrary base, Algebra Number Theory 5 (2011), 10691094.CrossRefGoogle Scholar
Wood, M. M., Asymptotics for number fields and class groups, in Directions in number theory, Association for Women in Mathematics Series, vol. 3 (Springer, Cham, 2016), 291339.CrossRefGoogle Scholar
Wright, D. J. and Yukie, A., Prehomogeneous vector spaces and field extensions, Invent. Math. 110 (1992), 283314.CrossRefGoogle Scholar
Zhao, Y., On sieve methods for varieties over finite fields, PhD thesis, The University of Wisconsin–Madison (ProQuest LLC, Ann Arbor, MI, 2013).Google Scholar
Zheng, A., Stable cohomology of the moduli space of trigonal curves, Int. Math. Res. Not. IMRN 2024 (2024), rnad011.CrossRefGoogle Scholar