Hostname: page-component-586b7cd67f-dsjbd Total loading time: 0 Render date: 2024-11-23T19:11:20.398Z Has data issue: false hasContentIssue false

The tamely ramified geometric quantitative minimal ramification problem

Published online by Cambridge University Press:  09 November 2023

Mark Shusterman*
Affiliation:
Department of Mathematics, Harvard University, 1 Oxford Street, Cambridge, MA 02138, USA [email protected]
Rights & Permissions [Opens in a new window]

Abstract

We prove a large finite field version of the Boston–Markin conjecture on counting Galois extensions of the rational function field with a given Galois group and the smallest possible number of ramified primes. Our proof involves a study of structure groups of (direct products of) racks.

Type
Research Article
Copyright
© 2023 The Author(s). The publishing rights in this article are licensed to Foundation Compositio Mathematica under an exclusive licence

1. Introduction

Let $G$ be a nontrivial finite group. We denote by $d(G)$ the least cardinality of a generating set of $G$, and by $d_\lhd (G)$ the least cardinality of a subset of $G$ generating $G$ normally. That is, $d_\lhd (G)$ is the smallest positive integer $d$ for which there exist $g_1, \ldots, g_d \in G$ such that $G$ has no proper normal subgroup containing $g_1, \ldots, g_d$. Equivalently, it is the least positive integer $d$ for which there exist $d$ conjugacy classes in $G$ that generate $G$. Let $G' = [G,G]$ be the commutator subgroup of $G$, and let $G^{\text {ab}} = G/[G,G]$ be the abelianization of $G$. It is a standard group-theoretic fact, see for instance [Reference Neukirch, Schmidt and WingbergNSW13, Theorem 10.2.6], that

\begin{align*} d_\lhd(G) = \max\{d(G^{\text{ab}}), 1\} = \begin{cases} d(G^{\text{ab}}) & G^{\text{ab}} \neq \{1\}, \\ 1 & G^{\text{ab}} = \{1\}. \end{cases} \end{align*}

Definition 1.1 A $G$-extension of a field $L$ is a pair $(K, \varphi )$ where $K$ is a Galois extension of $L$ and $\varphi \colon \operatorname {Gal}(K/L) \to G$ is an isomorphism. Abusing notation, we will at times denote a $G$-extension simply by $K$, tacitly identifying $\operatorname {Gal}(K/L)$ with $G$ via $\varphi$.

Since $\mathbb {Q}$ has no unramified extensions, for every tamely ramified $G$-extension $K/\mathbb {Q}$, the inertia subgroups of $G$ generate it and are cyclic. As the inertia subgroups of primes in $K$ lying over a given prime of $\mathbb {Q}$ are conjugate, it follows that the number of primes of $\mathbb {Q}$ ramified in $K$ is at least $d_\lhd (G)$.

Going beyond the inverse Galois problem, Boston and Markin [Reference Boston and MarkinBM09] conjectured that the lower bound $d_\lhd (G)$ on the number of ramified primes in a tamely ramified $G$-extension of $\mathbb {Q}$ is optimal.

Conjecture 1.2 There exists a (totally real) tamely ramified $G$-extension $K/\mathbb {Q}$ such that the number of (finite) primes of $\mathbb {Q}$ ramified in $K$ is $d_\lhd (G)$.

The assumption that $K$ is totally real can also be stated as $K/\mathbb {Q}$ being split completely at infinity, or as complex conjugation corresponding to the trivial element of $G$. Boston and Markin [Reference Boston and MarkinBM09] did not restrict to this case, allowing arbitrary ramification at infinity, but suggested that this case is perhaps more interesting (or more challenging). Boston and Markin [Reference Boston and MarkinBM09] also verified the conjecture for $G$ abelian.

Let $p$ be a prime number, and let $q$ be a power of $p$. To study a problem analogous to Conjecture 1.2 over the rational function field $\mathbb {F}_q(T)$ in place of $\mathbb {Q}$, we introduce two restrictions that can perhaps make the situation over $\mathbb {F}_q(T)$ more similar to that over $\mathbb {Q}$. First, we consider only regular extensions $K/\mathbb {F}_q(T)$, namely those for which every element of $K$ that is algebraic over $\mathbb {F}_q$ lies in $\mathbb {F}_q$. In particular, this rules out constant extensions of $\mathbb {F}_q(T)$: a family of everywhere unramified extensions that do not have an analog over $\mathbb {Q}$. Second, we assume that $p$ does not divide $|G|$, so that every $G$-extension of $\mathbb {F}_q(T)$ is tamely ramified. The following is a special case of conjectures made in [Reference De WittDeW14, Reference Bary-Soroker, Entin and FehmBEF23].

Conjecture 1.3 Let $q$ be a prime power coprime to $|G|$. Then there exists a regular $G$-extension $K/\mathbb {F}_q(T)$ (split completely at infinity), such that the number of primes of $\mathbb {F}_q(T)$ ramified in $K$ is $d_\lhd (G)$.

Giving such an extension of $\mathbb {F}_q(T)$ is equivalent to producing a morphism of smooth projective geometrically connected curves $g \colon Y \to \mathbb {P}^1$ over $\mathbb {F}_q$ having the following three properties.

  • There exist monic irreducible polynomials $P_j \in \mathbb {F}_q[T]$ with $1 \leq j \leq d_{\lhd }(G)$ such that $g$ is étale away from the points of $\mathbb {P}^1$ corresponding to the (zeros of the) polynomials $P_j$, and $g$ is ramified at these points. In other words, the set of roots of the polynomials $P_j$ is the branch locus of $g$.

  • We have $\operatorname {Aut}(g) \cong G$ and this group acts transitively on the geometric fibers of $g$.

  • The fiber of $g$ over $\infty$ contains an $\mathbb {F}_q$-point.

We refer to [Reference De WittDeW14, Reference Bary-Soroker, Entin and FehmBEF23, Reference Bary-Soroker and SchlankBS20] (and references therein) for some of the progress made on Conjectures 1.2 and 1.3. For example, some results have been obtained in case $G$ is the symmetric group, or a dihedral group, assuming Schinzel's Hypothesis H on prime values of integral polynomials.

Boston and Markin [Reference Boston and MarkinBM09] went on to propose, among other things, a quantitative version of their conjecture that we restate here. For that, we fix conjugacy classes $C_1, \ldots, C_{d_{\lhd }(G)}$ of $G$ that generate $G$, and put

\[ C = \bigcup_{j=1}^{d_\lhd(G)} C_j. \]

Furthermore, we assume that for each $1 \leq j \leq d_\lhd (G)$ and $g \in C_j$, every generator of the cyclic subgroup $\langle g \rangle$ lies in $C_j$.

For a number field $K$ put

\[ \operatorname{ram}(K) = \{ p : p\ \text{is a prime number ramified in}\ K\},\quad D_K = \prod_{p \in \operatorname{ram}(K)} p. \]

Recall that $\operatorname {ram}(K)$ is the set of prime numbers dividing the discriminant of $K$, so $D_K$ is the radical of this discriminant.

For a positive integer $d$ and a positive real number $X$, we denote by $\Pi _d(X)$ the probability that a uniformly random positive squarefree integer less than $X$ has exactly $d$ (distinct) prime factors.

Conjecture 1.4 Let $X$ be a positive real number, and let $\mathcal {E}^C(G;X)$ be the family of (isomorphism classes of) totally real tamely ramified $G$-extensions $K$ of $\mathbb {Q}$ for which $D_K < X$ and some (equivalently, every) generator of each nontrivial inertia subgroup of $\operatorname {Gal}(K/\mathbb {Q})$ lies in $C$. Then there exists a positive real number $\delta ^{G,C}$ such that as $X \to \infty$ we have

\[ \sum_{K \in \mathcal{E}^C(G;X)} \mathbf{1}_{|\operatorname{ram}(K)| = d_{\lhd}(G)} \sim \delta^{G,C} \cdot \Pi_{d_{\lhd}(G)}(X) \cdot |\mathcal{E}^C(G;X)|. \]

We note that the family $\mathcal {E}(G;X)$ of totally real tamely ramified $G$-extensions $K$ of $\mathbb {Q}$ with $D_K < X$ is finite by a classical result of Hermite.

Conjecture 1.4 suggests that the chances of $D_K$ having $d_\lhd (G)$ prime factors asymptotically equal the chances of a squarefree number having $d_\lhd (G)$ prime factors. It is also possible to restate this heuristic in the following equivalent way. For $K \in \mathcal {E}^C(G;X)$ and $1 \leq j \leq d_\lhd (G)$, let $D_K(j)$ be the product of all the primes $p \in \operatorname {ram}(K)$ for which the generators of the inertia subgroups of $\operatorname {Gal}(K/\mathbb {Q})$ for primes of $K$ lying over $p$ belong to $C_j$. Then $D_K(j) \neq 1$ for every $1 \leq j \leq d_\lhd (G)$, and

\[ D_K = \prod_{j=1}^{d_\lhd(G)} D_K(j). \]

Conjecture 1.4 suggests that for every $1 \leq j \leq d_\lhd (G)$, the chances that $D_K(j)$ is a prime asymptotically equal the chances that a squarefree number is a prime, and that these events are asymptotically independent over $j$.

Remark 1.5 To elaborate on the last point, it is possible to give a localized version of Conjecture 1.4 where we only sum over those $K$ in $\mathcal {E}^C(G;X)$ with $D_K(j)$ having order of magnitude $X_j$ for every $1 \leq j \leq d_\lhd (G)$ where $X_j \to \infty$ are real numbers whose product is $X$. In such a version we would replace $\Pi _{d_\lhd (G)}(X)$ with the product over $1 \leq j \leq d_\lhd (G)$ of the odds that a squarefree number with order of magnitude $X_j$ is a prime number, which is $ {\zeta (2)}/({\log X_j})$ where $\zeta$ is the Riemann zeta function. We insist that each $X_j \to \infty$ (and not just their product) because if some $X_j$ does not grow, then (at least) one of the others is substantially more likely to be a prime as it is coprime to $X_j$ by definition.

The reason we expect the appearance of an arithmetic correction factor $\delta ^{G,C}$ is that, for a given prime number $r$, the probability that $r$ divides $D_K$ for a uniformly random $K \in \mathcal {E}^C(G;X)$ may not quite converge to $1/r$ (but perhaps to a different value) as $X \to \infty$. Heuristics of analytic number theory would then suggest that $\delta ^{G,C}$ is a product over all the primes $r$ of certain expressions involving these limiting probabilities.

The odds that a uniformly random positive squarefree integer less than $X$ with exactly $d_\lhd (G)$ prime factors (which models $D_K$ for a random $K \in \mathcal {E}^C(G;X)$ with $|\!\operatorname {ram}(K)| = d_\lhd (G)$) is coprime to $|G|$ approach $1$ as $X \to \infty$. This justifies (to some extent) our avoidance of wild ramification throughout, since even if we were to include wildly ramified extension in our count, there would be no apparent reason to change the conjectured asymptotic.

Remark 1.6 It is also possible to state a version of Conjecture 1.4 for $\mathcal {E}(G;X)$ in place of $\mathcal {E}^C(G;X)$. Such a version (and its function field analog) may require more elaborate correction factors in place of $\delta ^{G,C}$. We do not pursue this direction in the current work.

Additional motivation for Conjecture 1.4 comes from [Reference Boston and EllenbergBE11] suggesting (roughly speaking) that the Galois group over $\mathbb {Q}$ of the maximal extension of $\mathbb {Q}$ unramified away from a random set of primes of a given finite cardinality follows a certain distribution. Conjecture 1.4 can then be viewed as predicting the asymptotics of certain moments of such a distribution.

Conjecture 1.4 remains open for every nonabelian $G$, see [Reference Boston and MarkinBM09, § 4] for some progress. Speaking to its difficulty we note that even obtaining an asymptotic for $|\mathcal {E}(G;X)|$ is an open problem for most $G$. We refer to [Reference Koymans and PaganoKP23, Introduction] for a discussion of results on counting number fields with various Galois groups. Most often one bounds by $X$ the discriminant of $K$ rather than its radical $D_K$ as we do here, see however [Reference WoodWoo10] for some advantages of working with $D_K$. The conjectures in [Reference MalleMal04] and the heuristic arguments of [Reference Ellenberg and VenkateshEV05] suggest that $|\mathcal {E}(G;X)| \sim \epsilon _G X \log ^{\beta _G} X$, where $\epsilon _G$ is a positive real number and $\beta _G$ is a positive integer.

It may also be interesting to consider summing over $K \in \mathcal {E}^C(G;X)$ some other functions of the number of ramified primes in place of the indicator function of this number being equal to $d_{\lhd }(G)$, as in Conjecture 1.4. For example, one may consider the Möbius function of $D_K$ namely

\[ \mu(D_K) = (-1)^{|\operatorname{ram}(K)|}. \]

Conjecture 1.7 As $X \to \infty$, we have

\[ \sum_{K \in \mathcal{E}^C(G;X)} (-1)^{|\operatorname{ram}(K)|} = o(|\mathcal{E}^C(G;X)|). \]

The conjecture predicts that the number of ramified primes, for number fields $K \in \mathcal {E}^C(G;X)$, is even approximately as often as it is odd.

The reason for us to emphasize specifically the Möbius function is its close relation with the aforementioned indicator function from Conjecture 1.4. Bary-Soroker and Schlank [Reference Bary-Soroker and SchlankBS20] showed that, for every integer $m \geq 2$, there exists an $S_m$-extension $K_m/\mathbb {Q}$ such that the number of primes of $\mathbb {Q}$ ramified in $K_m$ is at most $4$, falling just a little short of the conjecture from [Reference Boston and MarkinBM09] predicting the existence of an $S_m$-extension ramified at only $d_{\lhd }(S_m) = 1$ prime. The problem of finding such an extension, and the use of sieve theory in [Reference Bary-Soroker and SchlankBS20], is subject to the parity barrier making it challenging to produce many $S_m$-extensions ramified at an odd number of primes, let alone a single prime. A sieve is also employed in [Reference Taniguchi and ThorneTT20] to obtain a lower bound on the number of $S_3$-extensions of $\mathbb {Q}$ with no more than $3$ ramified primes, and $S_4$-extensions with no more than $8$ ramified primes. Here, parity is one of the barriers to obtaining a lower bound of the right order of magnitude. Another application of sieve theory in this context is [Reference Bhargava and GhateBG09, Theorem 7.11] obtaining an upper bound in Conjecture 1.4 for $G = S_4$. Parity is one of the barriers to improving this upper bound. Because of this, and because of the ability to express the indicator function of the prime numbers using the Möbius function, we view Conjecture 1.7 as a step toward Conjecture 1.4.

We would like to state analogs over $\mathbb {F}_q(T)$ of Conjectures 1.4 and 1.7. For that, we recall that the norm of a nonzero polynomial $D \in \mathbb {F}_q[T]$ is given by $|D| = |\mathbb {F}_q[T]/(D)| = q^{\deg D}.$ For a finite extension $K/\mathbb {F}_q(T)$ we put

\[ \operatorname{ram}(K) = \{ P \in \mathbb{F}_q[T] : P\ \text{is a monic irreducible polynomial ramified in}\ K\},\quad\! D_K = \prod_{P \in \operatorname{ram}(K)} P. \]

Our analogs over $\mathbb {F}_q(T)$ will be modeled on localized versions of Conjectures 1.4 and 1.7, as discussed in Remark 1.5. To state these analogs, we need further notation.

Definition 1.8 Let $p$ be a prime number not dividing $|G|$, let $q$ be a power of $p$, and let $n_1, \ldots, n_{d_\lhd (G)}$ be positive integers. Let $\mathcal {E}_q^C(G;n_1, \ldots, n_{d_\lhd (G)})$ be the family of regular $G$-extensions $K$ of $\mathbb {F}_q(T)$ split completely at $\infty$ and satisfying the following two conditions.

  • Every generator of each nontrivial inertia subgroup of $\operatorname {Gal}(K/\mathbb {F}_q(T))$ lies in $C$.

  • For $1 \leq j \leq d_\lhd (G)$ let $D_K(j)$ be the product of all the $P \in \operatorname {ram}(K)$ for which the generators of the inertia subgroups of $\operatorname {Gal}(K/\mathbb {F}_q(T))$ for primes of $K$ lying over $P$ belong to $C_j$. Then

    \[ \deg D_K(j) = n_j. \]

For every $K \in \mathcal {E}_q^C(G;n_1, \ldots, n_{d_\lhd (G)})$ we have

\[ D_K = \prod_{j=1}^{d_\lhd(G)} D_K(j). \]

As in the number field case, the family $\mathcal {E}_q^C(G;n_1, \ldots, n_{d_\lhd (G)})$ is finite. Ellenberg et al. [Reference Ellenberg, Tran and WesterlandETW17] provided upper bounds of the (conjecturally) right order of magnitude on $|\mathcal {E}_q^C(G;n_1, \ldots, n_{d_\lhd (G)})|$ for all $q$ larger than a certain quantity depending on $C$. In case $C \cap H$ is either empty or a single conjugacy class of $H$ for every subgroup $H$ of $G$, near-optimal upper and lower bounds on $|\mathcal {E}_q^C(G;n)|$ are obtained in [Reference Ellenberg, Venkatesh and WesterlandEVW16].

We also recall that the zeta function of $\mathbb {F}_q[T]$ is given for $s \in \mathbb {C}$ by

\[ \zeta_q(s) = \sum_{\substack{f \in \mathbb{F}_q[T] \\ f \text{ is monic}}} |f|^{-s} = \frac{1}{1 - q^{1-s}},\quad \text{Re}(s) > 1. \]

Conjecture 1.9 Fix a prime power $q$ coprime to $|G|$. Then there exists a positive real number $\delta ^{G,C}_q$ such that as $n_1, \ldots, n_{d_\lhd (G)} \to \infty$ we have

\[ \sum_{K \in \mathcal{E}_q^C(G;n_1, \ldots, n_{d_\lhd(G)})} \mathbf{1}_{|\operatorname{ram}(K)| = d_{\lhd}(G)} \sim \delta^{G,C}_q \cdot \prod_{j=1}^{d_\lhd(G)} \frac{\zeta_q(2)}{n_j} \cdot |\mathcal{E}_q^C(G;n_1, \ldots, n_{d_\lhd(G)})|. \]

Moreover, as soon as at least one of the $n_j$ tends to $\infty$ we have

\[ \sum_{K \in \mathcal{E}_q^C(G;n_1, \ldots, n_{d_\lhd(G)})} (-1)^{|\operatorname{ram}(K)|} = o(|\mathcal{E}_q^C(G;n_1, \ldots, n_{d_\lhd(G)})|). \]

The factor $ {\zeta _q(2)}/{n_j} = {q}/{(q-1)n_j}$ is (a good approximation for) the probability that a uniformly random monic squarefree polynomial of degree $n_j \geq 2$ over $\mathbb {F}_q$ is irreducible.

Using the (very) special cases of Schinzel's Hypothesis H and the Chowla conjecture established in [Reference Sawin and ShustermanSS22], it is perhaps possible to make partial progress on Conjecture 1.9 for certain groups. It is not, however, clear to us how to make additional (or significant) progress on Conjecture 1.9, going beyond what is known about Conjecture 1.4.

In this paper, we prove a large finite field version of Conjecture 1.9. In the large finite field regime, instead of fixing $q$ and taking the $n_j$ to $\infty$, we fix the $n_j$ and take $q$ to $\infty$. One indication that this regime is more tractable is given by [Reference Liu, Wood and Zureick-BrownLWZ19, Proof of Theorem 1.4] that obtains (among other things) the leading term of the asymptotic for $|\mathcal {E}_q^C (G;n_1, \ldots, n_{d_\lhd (G)})|$. Another indication of the tractability of the large finite field regime is given by the resolution (in this setting) of very difficult problems in analytic number theory, such as Schinzel's Hypothesis H; see [Reference EntinEnt21] and references therein.

We have $\lim _{q \to \infty } \zeta _q(2) = 1$, and we believe that the arithmetic correction constants $\delta _q^{G,C}$ also converge to $1$ as $q \to \infty$. This belief is based in part on the convergence to $1$ of various singular series constants over $\mathbb {F}_q[T]$ as $q \to \infty$, see also [Reference EntinEnt21]. Our main result provides further evidence for this belief.

Theorem 1.10 Fix $n_1, \ldots, n_{d_\lhd (G)}$ such that $n_j > |C_j|$ for some $1 \leq j \leq d_\lhd (G)$. Then as $q \to \infty$ along prime powers coprime to $|G|$ we have

\[ \sum_{K \in \mathcal{E}_q^C (G;n_1, \ldots, n_{d_\lhd(G)})} (-1)^{|\operatorname{ram}(K)|} = o(|\mathcal{E}_q^C (G;n_1, \ldots, n_{d_\lhd(G)})|). \]

Fix $n_1, \ldots, n_{d_\lhd (G)}$ large enough. Then as $q \to \infty$ along prime powers coprime to $|G|$ we have

\[ \sum_{K \in \mathcal{E}_q^C (G;n_1, \ldots, n_{d_\lhd(G)})} \mathbf{1}_{|\operatorname{ram}(K)| = d_{\lhd}(G)} \sim \frac{|\mathcal{E}_q^C (G;n_1, \ldots, n_{d_\lhd(G)})|}{n_1 \cdots n_{d_\lhd(G)}}. \]

By ‘large enough’ we mean larger than a certain function of $|G|$ (or, in fact, a function of $|C|$). This dependence on $|G|$ obtained from our argument (or rather from arguments of Conway, Parker, Fried, and Völklein) is ineffective, but it is likely possible to give a less elementary form of the argument that is effective. We do not pursue this direction in the current work.

The error term with which we obtain the asymptotics in Theorem 1.10 is

\[ O\bigg(\frac{|\mathcal{E}_q^C(G;n_1, \ldots, n_{d_\lhd(G)})|}{\sqrt q}\bigg), \]

where the implied constant depends $n_1, \ldots, n_{d_\lhd (G)}$. With the geometric setup in the proof of Theorem 1.10 we are well-positioned to explicate (and improve) this dependence by a study of Betti numbers, but we do not do it in this paper. We also do not attempt to improve the dependence of the error term on $q$ by studying cohomology beyond the top degree.

Our proof of Theorem 1.10 allows us to obtain the frequency with which $D_K$ attains any given factorization type (such as the product of $d_\lhd (G)$ irreducible polynomials). Put differently, we can obtain the asymptotic for the sum over $K \in \mathcal {E}_q^C (G;n_1, \ldots, n_{d_\lhd (G)})$ of any factorization function of $D_K$ (such as $\mu (D_K)$). For a more detailed discussion of factorization types and factorization functions see [Reference GorodetskyGor20]. We can then see that the factors $D_K(j)$ of $D_K$ indeed behave as random independent monic squarefree polynomials of degree $n_j$, as far as their factorizations into monic irreducible polynomials are concerned.

2. Sketch of a proof of Theorem 1.10

We describe a simplified variant of the argument we use to prove Theorem 1.10. In this sketch, for simplicity we will mostly restrict to the special case $d_\lhd (G) = 1$. Towards the very end of our sketch, we will comment on what the actual argument is, and mention some of the additional difficulties involved.

For a squarefree monic polynomial $f$ of degree $n \geq 1$ over $\mathbb {F}_q$ we denote by $\sigma _f$ the conjugacy class in $S_n$ of the permutation raising the roots of $f$ to $q$th power. The lengths of the cycles of $\sigma _f$ are the degrees of the irreducible factors of $f$. Let us restate this in a more geometric language.

Let $\text {Conf}^{n}$ be the configuration space of $n$ unordered distinct points on the affine line over $\overline {\mathbb {F}_q}$. Viewing these $n$ points as the roots of a monic squarefree polynomial of degree $n$ over $\overline {\mathbb {F}_q}$, one endows $\text {Conf}^{n}$ with the structure of a variety over $\mathbb {F}_q$, namely

\[ \text{Conf}^{n} = \{(a_0, \ldots, a_{n-1}) : \operatorname{Discriminant}_T(a_0 + a_1 T + \cdots + a_{n-1}T^{n-1} + T^n) \neq 0 \}. \]

There exists a continuous homomorphism $\lambda$ from the (profinite) étale fundamental group $\pi _1^{{\unicode{x00E9}}{\text t}}(\text {Conf}^{n})$ to $S_n$ such that for every $f \in \text {Conf}^{n}(\mathbb {F}_q)$ (viewed as a monic squarefree polynomial of degree $n$ over $\mathbb {F}_q$) the conjugacy class in $S_n$ of the value of $\lambda$ at the element $\operatorname {Frob}_{f} \in \pi _1^{{\unicode{x00E9}}{\text t}}(\text {Conf}^{n})$ is $\sigma _f$.

In fact, this definition of $\text {Conf}^n$ works over an arbitrary field, and slightly abusing notation we denote by $\text {Conf}^n(\mathbb {C})$ the configurations space of $n$ complex numbers (equivalently, monic polynomials of degree $n$ over $\mathbb {C}$ with no repeated roots). This space is a manifold, and its fundamental group is the braid group on $n$ strands, namely

\begin{align*} \pi_1(\text{Conf}^n(\mathbb{C})) = B_n &= \langle \sigma_1, \ldots, \sigma_{n-1}: \sigma_i \sigma_j = \sigma_j \sigma_i \text{ for } i > j+1, \\ &\quad\quad \sigma_i \sigma_{i+1} \sigma_i = \sigma_{i+1} \sigma_i \sigma_{i+1} \text{ for } i < n-1 \rangle. \end{align*}

The counterpart of the homomorphism $\lambda \colon \pi _1^{{\unicode{x00E9}}{\text t}}(\text {Conf}^{n}) \to S_n$ in this setting is the homomorphism from $B_n$ to $S_n$ that maps the generator $\sigma _i$ to the transposition $(i\ i+1) \in S_n$ for every $1 \leq i \leq n-1$.

For every prime power $q$ coprime to $|G|$ there exists an $\mathbb {F}_q$-variety $\mathsf {Hur}_{G,C}^n$ parametrizing $G$-covers of $\mathbb {P}^1$ branched at $n$ points with monodromy of type $C$ and a choice of a point over $\infty$. To simplify matters in this sketch, we assume that $\mathsf {Hur}_{G,C}^n$ is geometrically connected (this assumption is not satisfied for many choices of $G$, $C$, and $n$, and when it fails to hold, the connected components of $\mathsf {Hur}_{G,C}^n$ are not necessarily defined over $\mathbb {F}_q$). We make the identification

\[ \mathsf{Hur}_{G,C}^n(\mathbb{F}_q) = \mathcal{E}_q^C(G;n). \]

There is a finite étale map $\mathsf {Hur}_{G,C}^n \to \text {Conf}^{n}$ that on the level of $\mathbb {F}_q$-points sends every $K \in \mathcal {E}_q^C(G;n)$ to $D_K$. Therefore, the distribution of the factorization type of $D_K$ as $K$ ranges over $\mathcal {E}_q^C(G;n)$, or rather the distribution of $\sigma _{D_K}$ among the conjugacy classes of $S_n$, is the distribution of $\lambda (\operatorname {Frob}_{D_K}\!)$ among the conjugacy classes of $S_n$ as $K$ ranges over $\mathsf {Hur}_{G,C}^n(\mathbb {F}_q)$.

In view of our simplifying assumption, the variety $\mathsf {Hur}_{G,C}^n$ is a connected finite étale cover of $\text {Conf}^{n}$, so we can view $\pi _1^{{\unicode{x00E9}}{\text t}}(\mathsf {Hur}_{G,C}^n)$ as an open subgroup of $\pi _1^{{\unicode{x00E9}}{\text t}}(\text {Conf}^{n})$. By a version of Chebotarev's density theorem, the aforementioned distributions are governed by the image $H$ of $\pi _1^{{\unicode{x00E9}}{\text t}}(\mathsf {Hur}_{G,C}^n)$ in $S_n$ under the homomorphism $\lambda \colon \pi _1^{{\unicode{x00E9}}{\text t}}(\text {Conf}^{n}) \to S_n$. That is, for every conjugacy class $\Delta$ of $S_n$, as $q \to \infty$ along prime powers coprime to $|G|$ we have

\[ |\{K \in \mathcal{E}_q^C(G;n) : \sigma_{D_K} = \Delta \}| = |\{K \in \mathsf{Hur}_{G,C}^n(\mathbb{F}_q) : \lambda(\operatorname{Frob}_{D_K}) \in \Delta \}| \sim \frac{|\Delta \cap H|}{|H|} \cdot |\mathcal{E}_q^C(G;n)|. \]

For our purposes, it would be sufficient to show that $H = S_n$ for $n$ large enough. The subgroup $H$ does not change if we work with étale fundamental groups over $\overline {\mathbb {F}_q}$ rather than over $\mathbb {F}_q$ as we did so far.

The group $B_n$ acts on $C^n$ (the set of $n$-tuples of elements from $C$) from the right by

(2.1)\begin{align} (c_1, \ldots, c_{i-1}, c_i, c_{i+1}, c_{i+2} \ldots, c_n)^{\sigma_i} = (c_1, \ldots, c_{i-1}, c_{i+1}, c_i^{c_{i+1}}, c_{i+2} \ldots, c_n),\quad 1 \leq i \leq n-1, \end{align}

where $c_1, \ldots, c_n \in C$ and for group elements $g,h$ we use the notation $g^h$ for $h^{-1} g h$. For (an appropriately chosen) $s \in C^n$ whose entries generate $G$, the stabilizer of $s$ in $B_n$ is the counterpart of the subgroup $\pi _1^{{\unicode{x00E9}}{\text t}}(\mathsf {Hur}_{G,C}^n)$ of $\pi _1^{{\unicode{x00E9}}{\text t}}(\text {Conf}^{n})$. In particular, one can show that $H$ is the image in $S_n$ of the stabilizer of $s$ in $B_n$. At this point, we have reduced the arithmetic problem we wanted to solve to a group-theoretic question. This kind of reduction is by now standard, being employed frequently in the proofs of results in the large finite field limit; see, for instance, [Reference EntinEnt21] and [Reference Liu, Wood and Zureick-BrownLWZ19].

The technical heart of this paper lies in showing that for $n$ large enough we indeed have $H = S_n$. Recalling that the pure braid group $PB_n$ is the kernel of the homomorphism from $B_n$ to $S_n$, we see that the equality $H = S_n$ is equivalent to the transitivity of the action of $PB_n$ on the orbit of $s$ under the action of $B_n$. As a result, our work gives an additional justification for [Reference Boston and EllenbergBE11, Heuristic 4.7] suggesting the transitivity of a very similar action of a (profinite) group closely related to $PB_n$.

To state our technical result in greater generality, we need to recall a few additional notions.

Definition 2.1 A rack is a set $X$ with a binary operation $x^y$ for $x,y \in X$ such that for every $y \in X$ the function $x \mapsto x^y$ is a bijection from $X$ to $X$, and for every $x,y,z \in X$ we have $(z^x)^y = (z^{y})^{x^y}$. If, in addition, $x^x = x$ for all $x \in X$, then we say that $X$ is a quandle.

As an example of a quandle we can take $C$ with the binary operation of conjugation. For a rack $X$ we have a right action of $B_n$ on $X^n$ using the formula in (2.1).

Definition 2.2 Let $X$ be a finite rack. Define the (directed unlabeled) Schreier graph of $X$ to be the graph whose set of vertices is $X$ and whose set of edges is $\{(x, x^y) : (x,y) \in X \times X\}$. We say that $X$ is connected if its Schreier graph is connected, and we call the connected components of the Schreier graph of $X$ simply ‘the connected components of $X$’.

The Schreier graph may contain loops. Two vertices in the Schreier graph are weakly connected if and only if they are strongly connected. The quandle $C$ is an example of a connected rack, and the Schreier graph of $C$ is the Schreier graph of the right action of $G$ on $C$ by conjugation, where $C$ also plays the role of a generating set of $G$.

Definition 2.3 Let $X$ be a finite rack. We say that a subset $X_0$ of $X$ is a subrack if for all $x,y \in X_0$ we have $x^y \in X_0$. We say that elements $x_1, \ldots, x_n \in X$ generate $X$ if there is no proper subrack of $X$ containing $x_1, \ldots, x_n$.

Our technical result is ‘an $H = S_n$ theorem’, but in the generality of racks.

Theorem 2.4 Let $X$ be a connected finite rack. Let $n$ be a sufficiently large positive integer, and let $(x_1, \ldots, x_n) \in X^n$ be an $n$-tuple of elements from $X$ such that $x_1, \ldots, x_n$ generate $X$. Denote by $\operatorname {Stab}(x_1, \ldots, x_n)$ the stabilizer in $B_n$ of $(x_1, \ldots, x_n)$. Then the image $H$ of $\operatorname {Stab}(x_1, \ldots, x_n)$ under the homomorphism from $B_n$ to $S_n$ is either $A_n$ or $S_n$. If, moreover, $X$ is a quandle, then $H = S_n$.

Example 4.38 shows that it is necessary to distinguish between racks and quandles for that matter. For the possibility of extending Theorem 2.4 to biracks and to more general algebraic structures, see Remark 4.33.

We shall now briefly describe a proof strategy for Theorem 2.4. At first we prove in Proposition 3.3 a criterion for a subgroup of $S_n$ to be either $A_n$ or $S_n$: the subgroup has to be homogeneous (in the sense of Definition 3.2) of every possible degree. The proof of this criterion rests on a theorem of Jordan about permutation groups, and somewhat unexpectedly, on (a slightly strengthened form of) Bertrand's postulate: the existence of a prime number between $n/2$ and $n$.

Let $\mathcal {T}_2$ be the trivial rack on two elements, and consider the rack $Z = X \times \mathcal {T}_2$. As we show in the proof of Theorem 4.32, in order to show that the subgroup $H$ from Theorem 2.4 has the required homogeneity property, it suffices to show that certain elements in $Z^n$ lie in the same orbit under the action of $B_n$ in case their projections to $X^n$, and their projections to $\mathcal {T}_2^n$, lie in the same orbit. In Theorem 4.30 we show that this is indeed the case, even for a more general rack $Y$ in place of $\mathcal {T}_2$, assuming that $Y$ satisfies certain natural (yet somewhat technical) conditions.

The problem of understanding the orbits of the action of $B_n$ on $Z^n$ in case $Z$ is a union of conjugacy classes in a group has been addressed by several authors including Conway, Parker, Fried, and Völklein, see [Reference WoodWoo21, Theorem 3.1]. In § 4.1 we recast their arguments in the generality of racks, and this leads us to a proof of Theorem 4.30. What remains is to check that the conditions of Theorem 4.30 are satisfied in case $Y = \mathcal {T}_2$. This is done in Proposition 4.26 that deals with commutators in the structure group of a rack, see Definition 4.14.

The proof of Theorem 2.4 just sketched, in particular Theorem 4.30, is potentially applicable to statistics in the large finite field limit of arithmetic functions arising from other racks, not necessarily directly related to factorization types or to counting covers of $\mathbb {P}^1$ with certain properties. More precisely, Theorem 4.30 could be helpful for studying correlation sums of arithmetic functions associated to $X$ and to $Y$. We do not elaborate here on the possibility of associating an arithmetic function to a rack, or to more general algebraic structures.

A downside of this proof of Theorem 2.4, is that even if the arguments of Conway–Parker–Fried–Völklein will be made effective, the resulting dependence of $n$ on $|X|$ will be suboptimal. We have therefore included in this paper also a more direct proof of the quandle case of Theorem 2.4 which is likely to give a better dependence of $n$ on $|X|$. In this proof, a different criterion for a permutation group to be $S_n$ is used, based on invariable generation; see Proposition 3.5.

We also include two effective forms of Theorem 2.4 in two special cases. In the first special case $X$ is a (certain conjugacy class in a) finite simple group $G$. In this case we are only able to show that $H$ contains an $n$-cycle, and even that under an additional simplification: we consider the action of $B_n$ on $G \backslash X^n$ rather than on $X^n$; see Corollary 4.46. This additional simplification means that as an application, we can count $G$-extensions of $\mathbb {F}_q(T)$ unramified at $\infty$ rather than split completely at $\infty$. Since we only know that $H$ contains an $n$-cycle (and not that $H = S_n$), we do not know exactly how often $D_K$ is irreducible, but we should be able to show that this happens with positive probability as $q \to \infty$ along prime powers coprime to $|G|$. For the second special case, see Proposition 4.40. These results are in the spirit of [Reference ChenChe20] counting connected components of generalized Hurwitz spaces where we do not fix $G$ and let $n$ grow, but rather vary $G$ in a certain family of finite groups while $n$ remains reasonably small.

Our proof of Theorem 1.10 is given in § 5 and it does not quite proceed by invoking Chebotarev's theorem as in this sketch, rather it adapts a proof of Chebotarev's theorem to the special case at hand by expressing the indicator functions of conjugacy classes as linear combinations of complex characters. Put differently, we interpret certain factorization functions (such as $\mu (D_K)$) as trace functions of sheaves on $\mathsf {Hur}_{G,C}^n$, and apply the Grothendieck–Lefschetz fixed-point formula in conjunction with Deligne's Riemann hypothesis to estimate the sums of these trace functions. The advantage of repeating a proof of Chebotarev's theorem (over a use of the theorem as a black box) is that further progress on some of the function field problems mentioned in the introduction is thus reduced to questions on the cohomology of local systems on $\text {Conf}^n$.

In order to remove the assumption $d_\lhd (G) = 1$ and deal with an arbitrary nontrivial finite group $G$, we prove in Theorem 4.32 and Corollary 4.37 a generalization of Theorem 2.4 to finite racks with $k \geq 1$ connected components. For this generalization, instead of showing that a certain subgroup of a symmetric group is large, we need to consider subgroups of a direct product of symmetric (and alternating) groups. Building on [Reference Bary-Soroker, Gorodetsky, Karidi and SawinBGKS20] and on Goursat's lemma, we provide in Lemma 3.6 a criterion for a subgroup of a direct product of groups to be the whole group, valid under certain assumptions on the groups in the product.

3. Groups and their actions

3.1 Braid group and its action on presentations

Let $F_n$ be the free group on the letters $x_1, \ldots, x_n$. We can view $B_n$ as the subgroup of $\mathrm {Aut}(F_n)$ generated by the automorphisms

\[ \sigma_i(x_j) = \begin{cases} x_j & j \notin \{i,i+1\} \\ x_{i+1} & j = i \\ x_i^{x_{i+1}} & j = i+1 \end{cases}\quad 1 \leq i \leq n-1. \]

Let $C \subseteq G$ be a generating set of $G$ which is a disjoint union of some conjugacy classes $C_1, \ldots, C_k$ of $G$. Let $\operatorname {Mor}^C(F_n,G)$ be the collection of all homomorphisms $\theta \colon F_n \to G$ with $\theta (x_1), \ldots, \theta (x_n) \in C$, and put $\operatorname {Mor}(F_n,G) = \operatorname {Mor}^G(F_n,G)$. The group $\mathrm {Aut}(F_n)$ acts on $\operatorname {Mor}(F_n,G)$ from the right by precomposition, namely

\[ \theta^\phi = \theta \circ \phi,\quad \theta \in \operatorname{Mor}(F_n,G),\enspace \phi \in \mathrm{Aut}(F_n). \]

Restricting this action to $B_n$, we also get a right action of $B_n$ on the subset $\operatorname {Mor}^C(F_n,G)$ of $\operatorname {Mor}(F_n,G)$.

Let $n_1, \ldots, n_k$ be nonnegative integers such that $n_1 + \cdots + n_k = n$. We further get an action of $B_n$ on the subsets

\begin{align*} & \operatorname{Sur}^C(F_n,G) = \{\theta \in \operatorname{Mor}^C(F_n,G) : \theta\ \text{is surjective}\},\\ & \operatorname{Sur}^C_1(F_n,G) = \{ \theta \in \operatorname{Sur}^C(F_n,G) : \theta(x_1) \cdots \theta(x_n) = 1 \},\\ & \operatorname{Sur}^C(F_n,G;n_1, \ldots, n_k) = \{ \theta \in \operatorname{Sur}^C(F_n,G) : |\{1 \leq i \leq n : \theta(x_i) \in C_j\}| = n_j \text{ for all } 1 \leq j \leq k \},\\ & \operatorname{Sur}^C_1(F_n,G;n_1, \ldots, n_k) = \{ \theta \in \operatorname{Sur}^C_1(F_n,G) : |\{1 \leq i \leq n : \theta(x_i) \in C_j\}| = n_j \text{ for all } 1 \leq j \leq k \}. \end{align*}

We will simply write $\operatorname {Sur}(F_n,G)$ for $\operatorname {Sur}^G(F_n,G)$ and $\operatorname {Sur}_1(F_n,G)$ for $\operatorname {Sur}^G_1(F_n,G)$.

We identify $\operatorname {Mor}(F_n,G)$ with $G^n$ by sending $\theta \in \operatorname {Mor}(F_n,G)$ to $(\theta (x_1), \ldots, \theta (x_n)) \in G^n$, so that $\operatorname {Mor}^C(F_n,G)$ is identified with $C^n$. Every $\phi \in \mathrm {Aut}(F_n)$ gives us the $n$ words $w_i(x_1, \ldots, x_n) = \phi (x_i) \in F_n$ in the letters $x_1, \ldots, x_n$. Given an $n$-tuple $(g_1, \ldots, g_n) \in G^n$, the action of $\phi$ on it is

\[ (g_1, \ldots, g_n)^\phi = (w_1(g_1, \ldots, g_n), \ldots, w_n(g_1, \ldots, g_n)). \]

In case $\phi = \sigma _i$ for some $1 \leq i \leq n-1$ is one of our generators of $B_n$, we recover (2.1).

With this identification we also have $\operatorname {Sur}^C(F_n, G) = \{(g_1, \ldots, g_n) \in C^n : \langle g_1, \ldots, g_n \rangle = G \}$ and

\[ \operatorname{Sur}^C_1(F_n, G) = \{(g_1, \ldots, g_n) \in C^n : \langle g_1, \ldots, g_n \rangle = G,\ g_1 \cdots g_n = 1 \}. \]

Moreover, we identify $\operatorname {Sur}^C_1(F_n, G; n_1, \ldots, n_k)$ with

\begin{align*} &\big \{(g_1, \ldots, g_n) \in C^n : \langle g_1, \ldots, g_n \rangle = G, g_1 \cdots g_n = 1,\\ &\quad |\{1 \leq i \leq n : g_i \in C_j\}| = n_j \text{ for all } 1 \leq j \leq k \big \}. \end{align*}

For disjoint subsets $D_j \subseteq \{1, \ldots, n\}$ with $|D_j| = n_j$ for $1 \leq j \leq k$, we make the identification

\[ \{\sigma \in S_n : \sigma(D_j) = D_j \text{ for every } 1 \leq j \leq k \} = S_{n_1} \times \cdots \times S_{n_k}. \]

Often we take $D_j = \{n_1+ \cdots + n_{j-1}+1, \ldots, n_1 + \cdots + n_j\}$. We denote by $B_{n_1, \ldots, n_k}$ the inverse image of $S_{n_1} \times \cdots \times S_{n_k}$ under the homomorphism from $B_n$ to $S_n$. Sometimes $B_{n_1, \ldots, n_k}$ is called a colored braid group (with $k$ colors).

For every $1 \leq m < n$, we identify the subgroup of $B_n$ generated by $\sigma _1, \ldots, \sigma _{m-1}, \sigma _{m+1}, \ldots, \sigma _{n-1}$ with $B_m \times B_{n-m}$.

3.2 Permutation groups

Let $\Gamma$ be a group acting on a set $S$ from the right. We say that $B \subseteq S$ is a block of $\Gamma$ if, for every $g \in \Gamma$, either $B^g = B$ or $B^g \cap B = \emptyset$. We call $H = \{g \in \Gamma : B^g = B\}$ the stabilizer of the block, and note that it acts on $B$.

Proposition 3.1 The natural map $B/H \to S/\Gamma$ is injective, so in case $B$ meets every orbit of $\Gamma$ this map is bijective.

Proof. Let $b_1,b_2 \in B$ be representatives for orbits under the action of $H$ whose images in $S/\Gamma$ coincide. This means that there exists $g \in \Gamma$ with $b_1^g = b_2$. In particular, $B^g \cap B \neq \emptyset$ so $g \in H$ since $B$ is a block of $\Gamma$. It follows that $b_1$ and $b_2$ are in the same orbit under the action of $H$ so the required injectivity is established.

Definition 3.2 Let $k \leq n$ be nonnegative integers. We say that a subgroup $H$ of the symmetric group $S_n$ is $k$-homogeneous if the action of $H$ on the set of all $k$-element subsets of $\{1, \ldots, n\}$ is transitive.

The following is a consequence of [Reference Beaumont and PetersonBP55, Theorem 10]. For the reader's convenience we include a proof here.

Proposition 3.3 Let $n \geq 14$ be an integer, and let $H$ be an $\lfloor n/2 \rfloor$-homogenous subgroup of $S_n$. Then either $H = A_n$ or $H = S_n$.

Proof. We can find a prime number $p$ satisfying $(n+1)/2 < p \leq n-3$. The stabilizer of $\{1, \ldots, \lfloor n/2 \rfloor \}$ in the action of $S_n$ on the set of all $\lfloor n/2 \rfloor$-element subsets of $\{1, \ldots, n\}$ is the subgroup $S_{\lfloor n/2 \rfloor } \times S_{\lceil n/2 \rceil }$. The $\lfloor n/2 \rfloor$-homogeneity of $H$ is tantamount to the equality $H \cdot (S_{\lfloor n/2 \rfloor } \times S_{\lceil n/2 \rceil }) = S_n$ of subsets of $S_n$. Therefore,

\[ n! = |S_n| = |H \cdot (S_{\lfloor n/2 \rfloor} \times S_{\lceil n/2 \rceil})| = \frac{|H| \cdot |S_{\lfloor n/2 \rfloor} \times S_{\lceil n/2 \rceil}|}{|H \cap (S_{\lfloor n/2 \rfloor} \times S_{\lceil n/2 \rceil})|} = \frac{|H| \cdot \lfloor n/2 \rfloor! \cdot \lceil n/2 \rceil!}{|H \cap (S_{\lfloor n/2 \rfloor} \times S_{\lceil n/2 \rceil})|}, \]

so $n!$ divides $|H| \cdot \lfloor n/2 \rfloor ! \cdot \lceil n/2 \rceil !$, hence $p$ divides this number as well.

Our choice of $p$ guarantees that $p$ divides $|H|$. By Cauchy's theorem, $H$ contains an element of order $p$. An element of order $p$ in $S_n$ is a product of $p$-cycles, but $p> n/2$ so in our case this element is necessarily a $p$-cycle. It is readily checked that an $\lfloor n/2 \rfloor$-homogeneous subgroup of $S_n$ is transitive (equivalently, $1$-homogeneous) and, moreover, primitive (or even $2$-homogeneous). By a theorem of Jordan, the only primitive subgroups of $S_n$ that contain a $p$-cycle for a prime number $p \leq n-3$ are $A_n$ and $S_n$. We conclude that either $H = A_n$ or $H = S_n$ as required.

Definition 3.4 Let $I$ be an indexing set, and let $\{H_i\}_{i \in I}$ be subgroups of a group $H$. We say that these subgroups invariably generate $H$ if for every choice of elements $\{\sigma _i\}_{i \in I}$ from $H$, the conjugate subgroups $\{H_i^{\sigma _i}\}_{i \in I}$ generate the group $H$.

For any fixed $i \in I$ in the above definition we can assume, without loss of generality, that $\sigma _i = 1$.

Proposition 3.5 Let $n, k$ be positive integers with $n > 2k$, and let $\sigma \in S_n$ be a permutation all of whose cycles are of lengths exceeding $k$. Then the subgroups $S_{n-k}$ and $\langle \sigma \rangle$ invariably generate $S_n$.

Proof. We view $S_{n-k}$ as the group of permutations of $\{1, \ldots, n-k\}$ in $S_{n}$ fixing each element of $\{n-k+1, \ldots, n\}$. Denote by $H$ the subgroup of $S_n$ generated by $S_{n-k}$ and a conjugate $\rho$ of $\sigma$. We need to show that $H = S_n$. First we claim that $H$ acts transitively on $\{1, \ldots, n\}$. We take $j \in \{1, \ldots, n\}$ and our task is to show that it lies in the orbit of $1$ under the action of $H$. If $j \in \{1, \ldots, n-k\}$, this is clear since $H$ contains $S_{n-k}$, so we assume that $j \in \{n-k+1, \ldots, n\}$. Since the cycle of $\rho$ in which $j$ lies is of length more than $k$, it contains an element from $\{1, \ldots, n-k\}$. As all the powers of $\rho$ lie in $H$, we conclude that $j$ is indeed in the orbit of $1$ under the action of $H$, as required for transitivity.

Next, we claim that $H$ is primitive. Toward a contradiction, suppose that $\{1, \ldots, n\}$ can be partitioned into disjoint blocks for $H$ with at least two distinct blocks, each block of size at least $2$. As $n-k > n/2$ by assumption, and the size of every block is a proper divisor of $n$, we conclude that $\{1, \ldots, n-k\}$ is not contained in a single block, and some block $B$ meets $\{1, \ldots, n-k\}$ at two distinct elements at least, say $i$ and $j$. Since $\{1, \ldots, n-k\} \nsubseteq B$, we can find $\tau \in S_{n-k}$ with $\tau (i) = i \in B$ and $\tau (j) \notin B$. We see that $\tau \in H$, that $\tau B \cap B \neq \emptyset$, and that $\tau (B) \neq B$. This contradicts our assumption that $B$ is a block for $H$, and concludes the proof of primitivity.

As $n - k > k \geq 1$, there exists a transposition in $S_{n-k}$, so $H$ contains a transposition. By a theorem of Jordan, the only primitive subgroup of $S_n$ that contains a transposition is $S_n$ itself so $H = S_n$ as required.

3.3 Direct products of groups

Lemma 3.6 Let $G_1, \ldots, G_n$ be groups such that for every $1 \leq i < j \leq n$ either $G_i \cong G_j$ or $G_i$ and $G_j$ do not have nonabelian simple isomorphic quotients. Put $G = G_1 \times \cdots \times G_n$ and let $H$ be a subgroup of $G$ satisfying the following three conditions.

  • The restriction of the natural homomorphism $G \to G^{\text {ab}}$ to $H$ is surjective.

  • The subgroup $H$ projects onto $G_i$ for every $1 \leq i \leq n$.

  • The subgroup $H$ projects onto $G_i \times G_j$ for every $1 \leq i < j \leq n$ with $G_i \cong G_j$.

Then $H = G$.

Proof. Let $\mathcal {G}_1, \ldots, \mathcal {G}_r$ be all the distinct isomorphism types appearing among the groups $G_1, \ldots, G_n$. Then $G \cong \mathcal {G}_1^{m_1} \times \cdots \times \mathcal {G}_r^{m_r}$ where $m_i = |\{1 \leq j \leq n : G_j \cong \mathcal {G}_i\}|$ for $1 \leq i \leq r$. We claim that $H$ projects onto $\mathcal {G}_i^{m_i}$ for every $1 \leq i \leq r$. Indeed, in case $m_i = 1$ this follows from the second condition that $H$ satisfies by assumption. In case $m_i \geq 2$ this follows from [Reference Bary-Soroker, Gorodetsky, Karidi and SawinBGKS20, Lemma 6.6] applied to the projection of $H$ to $\mathcal {G}_i^{m_i}$, using the first and the third condition that $H$ satisfies. The claim is thus established.

We prove by induction on $1 \leq i \leq r$ that $H$ projects onto $\mathcal {G}_1^{m_1} \times \cdots \times \mathcal {G}_i^{m_i}$. The base case $i=1$ follows from the claim established above. For $i>1$ we apply Goursat's lemma to the projection $H_i$ of $H$ in $(\mathcal {G}_1^{m_1} \times \cdots \times \mathcal {G}_{i-1}^{m_{i-1}}) \times \mathcal {G}_i^{m_i}$. Since $H_i$ projects onto both $\mathcal {G}_1^{m_1} \times \cdots \times \mathcal {G}_{i-1}^{m_{i-1}}$ by induction, and onto $\mathcal {G}_i^{m_i}$ by the aforementioned claim, there exists a group $K$ and surjective homomorphisms $\varphi \colon \mathcal {G}_1^{m_1} \times \cdots \times \mathcal {G}_{i-1}^{m_{i-1}} \to K$, $\psi \colon \mathcal {G}_i^{m_i} \to K$ such that $H_i = (\mathcal {G}_1^{m_1} \times \cdots \times \mathcal {G}_{i-1}^{m_{i-1}}) \times _K \mathcal {G}_i^{m_i}$. We claim that $K$ is trivial.

Toward a contradiction, suppose that $K$ admits a simple quotient $S$. Then we have surjections $\overline \varphi \colon \mathcal {G}_1^{m_1} \times \cdots \times \mathcal {G}_{i-1}^{m_{i-1}} \to S$ and $\overline \psi \colon \mathcal {G}_i^{m_i} \to S$. Since $S$ is simple, and $\overline \varphi, \overline \psi$ map normal subgroups to normal subgroups, we see that $S$ is a quotient of $\mathcal {G}_t$ for some $1 \leq t \leq i-1$ and a quotient of $\mathcal {G}_i$ because the direct factors are normal subgroups that generate the direct product. As $\mathcal {G}_t \ncong \mathcal {G}_i$, our initial assumption implies that $S$ is abelian. We note that $H_i$ is contained in the proper subgroup $\overline H_i = (\mathcal {G}_1^{m_1} \times \cdots \times \mathcal {G}_{i-1}^{m_{i-1}}) \times _S \mathcal {G}_i^{m_i}$ of $\mathcal {G}_1^{m_1} \times \cdots \times \mathcal {G}_i^{m_i}$. Since $S$ is abelian, this subgroup $\overline H_i$ contains the commutator subgroup of $\mathcal {G}_1^{m_1} \times \cdots \times \mathcal {G}_i^{m_i}$. It follows that the restriction of the natural homomorphism $\mathcal {G}_1^{m_1} \times \cdots \times \mathcal {G}_i^{m_i} \to (\mathcal {G}_1^{m_1} \times \cdots \times \mathcal {G}_i^{m_i})^{\text {ab}}$ to $\overline H_i$, and therefore also to $H_i$, is not surjective. This contradicts the first condition that $H$ satisfies, and thus proves the claim that $K = \{1\}$.

We conclude that $H_i = (\mathcal {G}_1^{m_1} \times \cdots \times \mathcal {G}_{i-1}^{m_{i-1}}) \times \mathcal {G}_i^{m_i}$ so our induction is complete. Plugging $i=r$, we get that $H = G$, as required.

Corollary 3.7 For every $1 \leq i \leq r$ let $n_i$ be an integer for which

\[ n_i \geq 14,\quad {n_i \choose \lfloor n_i/2 \rfloor} > 2^r. \]

Let $H$ be a subgroup of $S_{n_1} \times \cdots \times S_{n_r}$ such that for every choice of pairs $(X_i,Y_i)$ of subsets

\[ X_i, Y_i \subseteq \{n_1+ \cdots + n_{i-1}+1, \ldots, n_1 + \cdots + n_i\},\quad |X_i| = |Y_i| = \lfloor n_i/2 \rfloor, \]

there exists an $h \in H$ for which $h(X_i) = Y_i$ for every $1 \leq i \leq r$. Then $H$ contains $A_{n_1} \times \cdots \times A_{n_r}$.

Remark 3.8 The assumption that $n_1, \ldots, n_r$ are large enough is possibly unnecessary, but is satisfied in our applications, so we include it because it facilitates obtaining Corollary 3.7 as a consequence of Proposition 3.3 and Lemma 3.6.

Proof. Put $K = H \cap (A_{n_1} \times \cdots \times A_{n_r})$. We need to show that $K = A_{n_1} \times \cdots \times A_{n_r}$ and we will do this by invoking Lemma 3.6 whose assumptions we shall verify now. The first assumption is met because the groups $A_n$ for $n \geq 5$ are nonabelian pairwise nonisomorphic simple groups (and the groups $A_n$ for $n < 5$ do not have nonabelian simple quotients). We check next that $K$ satisfies the three conditions in Lemma 3.6.

The first condition is satisfied because the abelianization of $A_{n_1} \times \cdots \times A_{n_r}$ is trivial as $n_1, \ldots, n_r \geq 5$ by assumption. To check the second condition we start by fixing $1 \leq i \leq n$ and noting that in view of our assumptions on $n_i$ and on $H$, we get from Proposition 3.3 that the projection $H_i$ of $H$ to $S_{n_i}$ contains $A_{n_i}$. Since

\[ H/K = H/ (H \cap (A_{n_1} \times \cdots \times A_{n_r})) \cong H \cdot (A_{n_1} \times \cdots \times A_{n_r})/ A_{n_1} \times \cdots \times A_{n_r} \leq (\mathbb{Z}/2 \mathbb{Z})^r, \]

denoting by $K_i$ the projection of $K$ to $S_{n_i}$, we see that $H_{i}/K_i$ is an elementary abelian $2$-group. We conclude that $K_i$ contains $A_{n_i}$ so the second condition is indeed satisfied.

To check the third condition, we take $1 \leq i < j \leq r$ with $n_i = n_j$, and denote by $K_{i,j}$ the projection of $K$ to $A_{n_i} \times A_{n_j}$. As $n_i = n_j$ the function

\[ f \colon \{n_1+ \cdots + n_{i-1}+1, \ldots, n_1 + \cdots + n_i\} \to \{n_1+ \cdots + n_{j-1}+1, \ldots, n_1 + \cdots + n_j\}, \]

given by

\[ f(x) = x + n_{i+1} + \cdots + n_j = x + n_i + \cdots + n_{j-1} \]

is a bijection. This bijection will be silently used in what follows to identify the group $S_{n_i}$ with the group $S_{n_j}$ (and the group $A_{n_i}$ with the group $A_{n_j}$).

Suppose toward a contradiction that $K_{i,j}$ is a proper subgroup of $A_{n_i} \times A_{n_j}$. In view of the second condition verified above, the subgroup $K_{i,j}$ projects onto both $A_{n_i}$ and $A_{n_j}$ so since these two groups are simple, Goursat's lemma tells us that $K_{i,j} = \{(\sigma, \psi (\sigma )) : \sigma \in A_{n_i}\}$ for some automorphism $\psi \colon A_{n_i} \to A_{n_j}$. Our assumption that $n_i \geq 7$ implies that $\operatorname {Aut}(A_{n_i}) = S_{n_i}$, namely there exists $\tau \in S_{n_i}$ for which $K_{i,j} = \{(\sigma, \tau \sigma \tau ^{-1}) : \sigma \in A_{n_i}\}$.

We pick an $\lfloor n_i/2 \rfloor$-element subset $X$ of $\{n_1+ \cdots + n_{i-1}+1, \ldots, n_1 + \cdots + n_i\}$, for instance

\[ X = \{n_1+ \cdots + n_{i-1}+1, \ldots, n_1 + \cdots + n_{i-1} + \lfloor n_i/2 \rfloor\}, \]

and consider the orbit of $(X, \tau (f(X)))$ under the action of $K_{i,j}$. On the one hand, this orbit is $\{(\sigma (X), \tau (\sigma (f(X)))) : \sigma \in A_{n_i}\}$ so its length is at most

\[ |\{\sigma(X) : \sigma \in A_{n_i}\}| \leq {n_i \choose \lfloor n_i/2 \rfloor}. \]

On the other hand, in view of our initial assumption on $H$, the orbit of $(X, \tau (f(X)))$ under the action of the projection $H_{i,j}$ of $H$ to $S_{n_i} \times S_{n_j}$ has length ${n_i \choose \lfloor n_i/2 \rfloor }^2$. Since $K_{i,j}$ is a normal subgroup of $H_{i,j}$ with $[H_{i,j} : K_{i,j}] \leq [H : K] \leq 2^r$, the length of the orbit of $(X, \tau (f(X)))$ under the action of $K_{i,j}$ is at least $2^{-r} {n_i \choose \lfloor n_i/2 \rfloor }^2$. We conclude that

\[ 2^{-r}{n_i \choose \lfloor n_i/2 \rfloor}^2 \leq {n_i \choose \lfloor n_i/2 \rfloor}, \]

so ${n_i \choose \lfloor n_i/2 \rfloor } \leq 2^r$ in contrast to our initial assumption. We have thus shown that $K_{i,j} = A_{n_i} \times A_{n_j}$. We can therefore invoke Lemma 3.6 and conclude that $K = A_{n_1} \times \cdots \times A_{n_r}$ as required.

Corollary 3.9 Let $n_1, \ldots, n_r, k$ be integers with $\min \{n_1, \ldots, n_r\} > 2k > 0$, and let $g \in S_{n_1} \times \cdots \times S_{n_r}$ such that the lengths of the cycles of the projection of $g$ to $S_{n_i}$ all exceed $k$ for every $1 \leq i \leq r$. Then the subgroups $S_{n_1-k} \times \cdots \times S_{n_r-k}$ and $\langle g \rangle$ invariably generate $S_{n_1} \times \cdots \times S_{n_r}$.

Proof. Denote by $H$ the subgroup of $S_{n_1} \times \cdots \times S_{n_r}$ generated by $S_{n_1-k} \times \cdots \times S_{n_r-k}$ and a conjugate $\rho$ of $g$. In order to show that $H = S_{n_1} \times \cdots \times S_{n_r}$, we will invoke Lemma 3.6 whose assumptions we verify next. The first assumption is satisfied because for every positive integer $n$, the group $S_n$ does not have a nonabelian simple quotient. We shall now check that $H$ satisfies the three conditions in Lemma 3.6.

For the first condition, we need to check the surjectivity of the restriction of the sign homomorphism $S_{n_1} \times \cdots \times S_{n_r} \to (\mathbb {Z}/ 2\mathbb {Z})^r$ to $H$. This follows at once from the surjectivity of the restriction of this homomorphism to $S_{n_1-k} \times \cdots \times S_{n_r-k}$, a consequence of the fact that $n_i - k > k \geq 1$ for every $1 \leq i \leq r$. The second condition is an immediate consequence of Proposition 3.5.

To check the third condition, we take $1 \leq i < j \leq r$ (with $n_i = n_j$), and denote by $H_{i,j}$ the projection of $H$ to $S_{n_i} \times S_{n_j}$. Suppose toward a contradiction that $H_{i,j}$ is a proper subgroup of $S_{n_i} \times S_{n_j}$. In view of the second condition verified above, the subgroup $H_{i,j}$ projects onto both $S_{n_i}$ and $S_{n_j}$ so Goursat's lemma provides us with a nontrivial group $K$ and surjective homomorphisms $\varphi \colon S_{n_i} \to K$, $\psi \colon S_{n_j} \to K$ such that $H_{i,j} = S_{n_i} \times _{K} S_{n_j}$. Since $K$ is (isomorphic to) a nontrivial quotient of a symmetric group, its abelianization is necessarily nontrivial, so $S_{n_i} \times _{K^\text {ab}} S_{n_j}$ is a proper subgroup of $S_{n_i} \times S_{n_j}$ that contains both $H_{i,j}$ and the commutator subgroup of $S_{n_i} \times S_{n_j}$. It follows that the restriction to $H_{i,j}$ of the natural homomorphism $S_{n_i} \times S_{n_j} \to (S_{n_i} \times S_{n_j})^{\text {ab}}$ is not surjective. This contradicts the first condition verified above, namely the surjectivity of the map $H \to (\mathbb {Z} / 2 \mathbb {Z})^r$. We have thus shown that $H_{i,j} = S_{n_i} \times S_{n_j}$ so the third condition is verified.

We can therefore invoke Lemma 3.6 and conclude that $H = S_{n_1} \times \cdots \times S_{n_r}$ as required.

4. Braided sets

Definition 4.1 A nonempty set $X$ equipped with a bijection $R \colon X \times X \to X \times X$ is said to be a braided set if

\[ (R \times \mathrm{id}_X) \circ (\mathrm{id}_X \times R) \circ (R \times \mathrm{id}_X) = (\mathrm{id}_X \times R) \circ (R \times \mathrm{id}_X) \circ (\mathrm{id}_X \times R) \]

as maps from $X \times X \times X$ to $X \times X \times X$.

This relation is sometimes called the set-theoretic Yang–Baxter equation, and the braided set $X$ is sometimes said to be a solution of this equation.

Definition 4.2 We denote by $\pi _1 \colon X \times X \to X$ and $\pi _2 \colon X \times X \to X$ the projections. In case the function $\pi _2 \circ R$ is a bijection on each fiber of $\pi _2$, and the function $\pi _1 \circ R$ is a bijection on each fiber of $\pi _1$, we say that $X$ is nondegenerate.

A nondegenerate braided set is sometimes also called a birack.

Definition 4.3 Given braided sets $(X, R)$ and $(X', R')$, we say that a function $f \colon X \to X'$ is a morphism (of braided sets) if $(f \times f)(R(x,y)) = R'(f(x), f(y))$.

We obtain the category of braided sets. Products exist in this category.

Definition 4.4 We say that a braided set $X$ is trivial if $R(x,y) = (y,x)$ for all $(x,y) \in X \times X$. A braided set $X$ will be called squarefree if $R(x,x) = (x,x)$ for all $x \in X$.

For a positive integer $k$, we denote the trivial braided set of cardinality $k$ by $\mathcal {T}_k$. In what follows, the trivial braided set $\mathcal {T}_2 = \{0,1\}$ will play an important role.

Definition 4.5 A braided set $(X,R)$ is said to be self-distributive if for all $x,y \in X$ we have $\pi _1(R(x,y)) = y$. In this case, we use the notation $x^y = \pi _2(R(x,y))$.

For a self-distributive braided set $X$, and every $y \in X$, the function $x \mapsto x^y$ is a bijection from $X$ to $X$, so a self-distributive braided set is necessarily nondegenerate. The category of self-distributive braided sets is therefore equivalent to the category of racks, and the subcategory of squarefree self-distributive braided sets is equivalent to the subcategory of quandles.

Definition 4.6 We say that a subset $X_0 \subseteq X$ of a braided set $X$ is a braided subset if $R$ restricts to a bijection from $X_0 \times X_0$ to $X_0 \times X_0$ or, equivalently, if $X_0$ is a braided set and the inclusion $X_0 \to X$ is a morphism of braided sets. We write $X_0 \leq X$ to indicate that $X_0$ is a braided subset of $X$. Given an indexing set $I$, and elements $x_i \in X$ for every $i \in I$, we denote by

\[ \langle x_i \rangle_{i \in I} = \bigcap_{\substack{X_0 \leq X \\ x_i \in X_0 \text{ for every } i \in I }} X_0 \]

the braided subset of $X$ generated by all the $x_i$ for $i \in I$.

In case $X$ is a finite rack, this definition agrees with Definition 2.3.

Example 4.7 Let $G$ be a group, and let $C$ be a union of conjugacy classes of $G$. We endow $C$ with the structure of a braided set by

\[ R(x,y) = (y,x^y) = (y, y^{-1} x y),\quad x,y \in C. \]

This braided set is squarefree, self-distributive, and is trivial if and only if the subgroup of $G$ generated by $C$ is abelian.

Example 4.8 We denote by $\mathcal {N} = \{\eta, \xi \}$ the unique nontrivial self-distributive braided set on two elements. We have $\eta ^\eta = \eta ^\xi = \xi$ and $\xi ^\xi = \xi ^\eta = \eta$. This braided set is not squarefree.

The category of braided sets admits a (unique) final object: the trivial braided set $\mathcal {T}_1$.

Definition 4.9 For a braided set $(X, R)$ we denote by $\tau \colon X \to X_{\text {triv}}$ the morphism of braided sets characterized by the following universal property. The braided set $X_{\text {triv}}$ is trivial, and for every trivial braided set $Y$, and every morphism $\gamma \colon X \to Y$, there exists a unique morphism $\eta \colon X_{\text {triv}} \to Y$ such that $\gamma = \eta \circ \tau$. We say that $X_{\text {triv}}$, or rather $\tau$, is the trivialization of $X$.

Proposition 4.10 A trivialization of a braided set $X$ exists and is unique up to an isomorphism.

Proof. Let $\sim$ be the smallest equivalence relation on $X$ such that for all $x,y,z,w \in X$ with

\[ R(x,y) = (z,w) \]

we have $x \sim w$ and $y \sim z$. We let $X_{\text {triv}}$ be the set of equivalence classes in $X$ for $\sim$, and denote by $\tau \colon X \to X_{\text {triv}}$ the map taking an element to its equivalence class. One readily checks that $\tau$ is a morphism.

To check the universal property let $Y$ be a trivial braided set, and let $\gamma \colon X \to Y$ be a morphism. Since the braided sets $X_{\text {triv}}, Y$ are trivial, and $\tau$ is surjective, we just need to find a function $\eta \colon X_{\text {triv}} \to Y$ with $\gamma = \eta \circ \tau$. For that, it suffices to check that for every $x,y \in X$ with $\tau (x) = \tau (y)$ we have $\gamma (x) = \gamma (y)$. This follows at once from the definition of $\sim$ and the fact that $\gamma$ is a morphism.

Uniqueness up to an isomorphism follows (as usual) from the universal property.

We call the equivalence classes appearing in the proof the connected components of $X$. These connected components are the fibers of the map $\tau \colon X \to X_{\text {triv}}$. In case $X$ is a finite rack, this definition agrees with Definition 2.2.

Example 4.11 For a disjoint union $C$ of conjugacy classes $C_1, \ldots, C_k$ of a group $G$, such that $C$ generates $G$, the trivialization of $C$ is the map $C \to C_{\text {triv}} = \{C_1, \ldots, C_k\}$ sending each element to the conjugacy class in which it lies. The connected components of $C$ as a braided set are $C_1, \ldots, C_k$.

Definition 4.12 For a braided set $(X,R)$, a positive integer $n$, and an integer $1 \leq i < n$ we let $\sigma _i \in B_n$ act from the right on the $n$-fold Cartesian product $X^n$ of $X$ by $\mathrm {id}_{X^{i-1}} \times R \times \mathrm {id}_{X^{n-i-1}}$ namely

\[ (x_1, \ldots, x_{i-1}, x_i, x_{i+1}, x_{i+2} \ldots, x_n)^{\sigma_i} = (x_1, \ldots, x_{i-1}, R(x_i, x_{i+1}), x_{i+2} \ldots, x_n), \]

so we get a right action of $B_n$ on $X^n$.

The association of $X^n$ to $X$ is a functor from the category of braided sets to the category of right actions of $B_n$. In case $X$ is trivial, this action factors through the permutation action of $S_n$ on $X^n$. In the special case where $X$ is a group $G$, we recover the action of $B_n$ on $\operatorname {Mor}(F_n, G)$, and in case $X = C$ is a generating set for $G$ that is stable under conjugation by the elements of $G$, we recover the action of $B_n$ on $\operatorname {Mor}^C(F_n, G)$.

Definition 4.13 We denote the coinvariants of the action of $B_n$ on $X^n$, namely the collection of orbits, by $X^n/{B_n}$ and consider the disjoint union

\[ S_X = \bigcup_{n=1}^\infty X^n/B_n. \]

Since the action of $B_m \times B_{n-m}$ on $X^m \times X^{n-m}$ is compatible with the natural inclusion $B_m \times B_{n-m} \hookrightarrow B_n$ and the identification $X^m \times X^{n-m} = X^n$, the set $S_X$ endowed with the binary operation of concatenation of representatives of orbits is a semigroup. We call $S_X$ the semigroup of coinvariants (or the structure semigroup) of the braided set $X$.

4.1 The structure (semi)group

In case our braided set $X$ is self-distributive, the structure semigroup is given by the presentation

\[ S_X = \langle X : xy = yx^{y}\ \text{for all}\ (x,y) \in X^2 \rangle. \]

Definition 4.14 To a self-distributive braided set $X$ we also functorially associate the group given by the same presentation

\[ \Gamma_X = \langle X: y^{-1}xy = x^y\quad \text{for all}\ (x,y) \in X^2 \rangle. \]

This group is sometimes called the structure group of $X$.

The morphism of braided sets $\gamma _X \colon X \to \Gamma _X$ enjoys the following universal property. For every group $G$ and a morphism of braided sets $f \colon X \to G$ there exists a unique homomorphism of groups $\varphi \colon \Gamma _X \to G$ such that $f = \varphi \circ \gamma _X$. At times we will abuse notation viewing elements of $X$ as sitting inside $\Gamma _X$ via $\gamma _X$ (or even inside $S_X$).

We note that the natural homomorphism $\Gamma _X^{\text {ab}} \to \Gamma _{X_{\text {triv}}}$ is an isomorphism. For a trivial braided set, the structure group is a free abelian group on the braided set.

Definition 4.15 We view the group of permutations of $X$ as acting on $X$ from the right. We then have a homomorphism from $\Gamma _X$ to the group of permutations of $X$ sending $y \in X$ to the permutation $x \mapsto x^y$. The image of this homomorphism will be denoted by $\operatorname {Inn}(X)$.

One sometimes calls $\operatorname {Inn}(X)$ the group of inner automorphisms of $X$. We get a right action of $\Gamma _X$ (or, equivalently, of $\operatorname {Inn}(X)$) on $X$ whose orbits are the connected components of $X$. The kernel of the homomorphism $\Gamma _X \to \operatorname {Inn}(X)$ is contained in the center of $\Gamma _X$. In particular, in case $X$ is finite, the center of $\Gamma _X$ is of finite index in $\Gamma _X$. In this case for $x \in X$ we denote by $m_x$ the order of the image of $x$ in $\operatorname {Inn}(X)$. Then $x^{m_x}$ lies in the center of $S_X$ (and of $\Gamma _X\!$) and we put

\[ z = \prod_{x \in X} x^{m_x} \in Z(S_X\!). \]

Proposition 4.16 Let $S$ be a semigroup and let $z \in Z(S)$. Then there exists a monoid $S[z^{-1}]$ and a morphism of semigroups $\varphi \colon S \to S[z^{-1}]$ with $\varphi (z)$ invertible in $S[z^{-1}]$ such that for every monoid $M$ and a semigroup homomorphism $\psi \colon S \to M$ with $\psi (z)$ invertible, there exists a unique homomorphism of monoids $\theta \colon S[z^{-1}] \to M$ satisfying $\psi = \theta \circ \varphi$.

Proof. Consider first the semigroup $\mathbb {N} \times S$ whose elements we write as $z^{-m}w$ for a nonnegative integer $m$, and $w \in S$. The product is given by

\[ z^{-m}w \cdot z^{-n} v = z^{-(m+n)}wv,\quad m,n \in \mathbb{N},\ v,w \in S_X. \]

There is a natural homomorphism of semigroups $S \to \mathbb {N} \times S$ sending $w \in S_X$ to $z^{-0}w$. We define an equivalence relation $\sim$ on $\mathbb {N} \times S$ by $z^{-m}w \sim z^{-n}v$ if there exists a positive integer $r$ such that

\[ z^{n+r}w = z^{m+r}v \]

in $S$. Since multiplication in $\mathbb {N} \times S$ descends to multiplication on the set of equivalence classes for $\sim$ we can let $S[z^{-1}]$ be the quotient semigroup, and take $\varphi$ to be the composition $S \to \mathbb {N} \times S \to S[z^{-1}]$. We note that $z^{-1}z$ is the identity element of $S[z^{-1}]$ so $S[z^{-1}]$ is indeed a monoid. The inverse of $\varphi (z)$ in $S[z^{-1}]$ is given by $z^{-2} z$ so $\varphi (z)$ is indeed invertible. A routine check establishes the required universal property of $S[z^{-1}]$.

Proposition 4.17 The natural homomorphism of monoids $S_X[z^{-1}] \to \Gamma _X$ is an isomorphism.

Proof. We first show that the monoid $S_X[z^{-1}]$ is a group. Since $S_X[z^{-1}]$ is generated by $X$ and $z^{-2}z$ it suffices to show that (the image in $S_X[z^{-1}]$ of) every $x \in X$ is invertible in $S_X[z^{-1}]$. Fix an $x \in X$. As $z^{-2}z$ and $y^{m_y}$ lie in the center of $S_X[z^{-1}]$ for every $y \in X$, we have

\[ x \cdot x^{m_x-1} \prod_{y \in X \setminus \{x\}} y^{m_y} \cdot z^{-2}z = 1 = x^{m_x-1} \prod_{y \in X \setminus \{x\}} y^{m_y} \cdot z^{-2}z \cdot x, \]

so $x$ is indeed invertible in $S_X[z^{-1}]$. We have thus shown that $S_X[z^{-1}]$ is a group.

Since $S_X[z^{-1}]$ is a group, the natural map $X \to S_X[z^{-1}]$ is a morphism of braided sets. The universal property of $\Gamma _X$ therefore provides us with a group homomorphism $\Gamma _X \to S_X[z^{-1}]$ which is seen to be the inverse of $S_X[z^{-1}] \to \Gamma _X$ by checking that this is the case on the image of $X$.

Definition 4.18 Let $X$ be a finite self-distributive braided set, and let $C_1, \ldots, C_k$ be the connected components of $X$. For nonnegative integers $n_1, \ldots, n_k$, and $n = n_1 + \cdots + n_k$, we put

\[ X(n_1, \ldots, n_k) = \{(x_1, \ldots, x_n) \in X^n : |\{1 \leq i \leq n : x_i \in C_j \}| = n_j \text{ for every } 1 \leq j \leq k \}, \]

and consider those tuples that generate $X$ as a braided set, namely we define

\[ X^*(n_1, \ldots, n_k) = \{(x_1, \ldots, x_n) \in X(n_1, \ldots, n_k) : \langle x_1, \ldots, x_n \rangle = X \}. \]

We also define $(C_1^{n_1} \times \cdots \times C_k^{n_k})^* = (C_1^{n_1} \times \cdots \times C_k^{n_k}) \cap X^*(n_1, \ldots, n_k)$.

In case $n_1, \ldots, n_k$ are positive, the elements of a tuple in $X(n_1, \ldots, n_k)$ generate $X$ as a braided set if and only if they generate the group $\Gamma _X$ or, equivalently, the group $\operatorname {Inn}(X)$. We note that $X^*(n_1, \ldots, n_k)$ is stable under the action of $B_n$, and that for a nonnegative integer $N$ the set

\[ S_X^*(N) = \bigcup_{n_1, \ldots, n_k \geq N} X^*(n_1, \ldots, n_k)/B_n \]

is an ideal (thus, also a subsemigroup) of $S_X$.

Example 4.19 In case $X$ is the disjoint union $C$ of conjugacy classes $C_1, \ldots, C_k$ of a finite group $G$ such that $C$ generates $G$, and $n_1, \ldots, n_k$ are positive, we have $X^*(n_1, \ldots, n_k) = \operatorname {Sur}^C(F_n,G;n_1, \ldots, n_k)$.

Proposition 4.20 The subset $C_1^{n_1} \times \cdots \times C_k^{n_k}$ of $X(n_1, \ldots, n_k)$ is a block for $B_n$ and its stabilizer is $B_{n_1, \ldots, n_k}$.

Proof. This follows from the fact that $C_1^{n_1} \times \cdots \times C_k^{n_k}$ is a fiber of the map

\[ X(n_1, \ldots, n_k) \to X_{\text{triv}}(n_1, \ldots, n_k), \]

from the fact that this map is a morphism of sets with a right action of $B_n$, and from the fact that the action of $B_n$ on $X_{\text {triv}}(n_1, \ldots, n_k)$ factors through the permutation action of $S_n$ on $X_{\text {triv}}(n_1, \ldots, n_k)$.

The set $(C_1^{n_1} \times \cdots \times C_k^{n_k})_1^* = \{ (x_1, \ldots, x_n) \in (C_1^{n_1} \times \cdots \times C_k^{n_k})^* : x_1 \cdots x_n =1\}$ is therefore stable under the action of $B_{n_1, \ldots, n_k}$.

Corollary 4.21 The natural map $C_1^{n_1} \times \cdots \times C_k^{n_k} / B_{n_1, \ldots, n_k} \to X(n_1, \ldots, n_k)/B_n$ is a bijection.

Proof. This follows from Propositions 3.1 and 4.20 because the orbit under the action of $B_n$ of every element from $X(n_1, \ldots, n_k)$ contains an element from $C_1^{n_1} \times \cdots \times C_k^{n_k}$.

In what follows, every time we say that a positive integer is large enough, we mean large enough compared with $|X|$ (and not with any additional quantity).

Proposition 4.22 For positive integers $n_1, \ldots, n_k$ large enough, for nonnegative integers $m_1, \ldots, m_k$, the integer $m = m_1 + \cdots + m_k$, and $w \in X(m_1, \ldots, m_k)/B_m$, the function

\[ M_w \colon X^*(n_1, \ldots,n_k)/B_n \to X^*(n_1+m_1, \ldots, n_k + m_k)/B_{n+m},\quad M_w(v) = wv \]

of multiplication by $w$ from the left is surjective.

Proof. Since composition of surjective functions is surjective, it suffices to treat the case $m=1$, namely we assume that $w \in X$, so $w \in C_j$ for some $1 \leq j \leq k$. Take a representative

\[ s \in X^*(n_1, \ldots, n_{j-1}, n_j+1, n_{j+1}, \ldots, n_k) \]

for an orbit under the action of $B_{n+1}$. As $n_j$ is large enough, it follows from the pigeonhole principle that there exists an $x \in C_j$ that appears in $s$ more than $m_x$ times. We can therefore assume, without changing the orbit of $s$ under the action of $B_{n+1}$, that the first $m_x + 1$ entries of $s$ are all equal to $x$. In particular, we can write $s = x^{m_x} s'$ (the equality taking place in $S_X$) for some

\[ s' \in X^*(n_1, \ldots, n_{j-1}, n_j+1-m_x, n_{j+1}, \ldots, n_k)/B_{n+1-m_x}. \]

Since $x$ and $w$ lie in the same connected component of $X$, there exists $g \in \operatorname {Inn}(X)$ such that $x^g = w$. The entries of (a representative of) $s'$ generate $X$, so these entries generate $\operatorname {Inn}(X)$ as a group. In particular, we can write $g$ as a product some of these entries and their inverses (allowing repetition). Therefore, in order to show that $s = w^{m_x}s'$ in $S_X$ and deduce the required surjectivity, it suffices to observe that for every $y \in X$ that appears in (a representative of) $s'$, and every $x_1, \ldots, x_\ell \in X$ for which $x_1 \cdots x_\ell \in Z(S_X)$, we have the equality $x_1 \cdots x_\ell s' = x_1^y \cdots x_\ell ^y s'$ in $S_X$.

Corollary 4.23 For every $N$ large enough, and every $w \in S_X$, multiplication by $w$ from the left is an injective function from $S_X^*(N)$ to itself.

Proof. Since composition of injective functions is injective, it suffices to consider the case $w \in C_j$ for some $1 \leq j \leq k$. In this case, it is enough to show that for integers $n_1, \ldots, n_k \geq N$, the function

\[ M_w \colon X^*(n_1, \ldots, n_{j-1},n_j,n_{j+1}, \ldots, n_k)/B_n \to X^*(n_1, \ldots, n_{j-1},n_j+1,n_{j+1}, \ldots, n_k)/B_{n+1} \]

of multiplication by $w$ from the left is injective. It follows from Proposition 4.22 that for large enough $N$, the function $(n_1, \ldots, n_k) \mapsto |X^*(n_1, \ldots, n_k)/B_n|$ is nonincreasing in each coordinate, hence eventually constant because its values are positive integers. Therefore, for large enough $N$, the function $M_w$ maps one finite set onto another finite set of the same cardinality. The required injectivity follows.

Theorem 4.24 For every $N$ large enough, the natural homomorphism of semigroups $S_X^*(N) \to \Gamma _X$ is injective.

Proof. In view of Proposition 4.17, it suffices to show that the natural homomorphism of semigroups $S_X^*(N) \to S_X[z^{-1}]$ is injective. For that, let $u,v \in S_X^*(N)$ for which $z^r u = z^r v$ for some positive integer $r$. Invoking Corollary 4.23 with $w = z^r$, we get that $u = v$ so injectivity follows.

4.2 Products of braided sets

Definition 4.25 Let $X,Y$ be self-distributive braided sets. Then the group homomorphism

\[ \Gamma_{X \times Y} \to \Gamma_X \times \Gamma_Y \]

(arising from the functoriality of the structure group) induces a homomorphism

\[ \Gamma'_{X \times Y} \to \Gamma_X' \times \Gamma_Y' \]

on the commutator subgroups. In case the latter homomorphism is injective, we say that $(X,Y)$ is a productive pair.

Proposition 4.26 Let $X$ be a self-distributive braided set. Then the pair $(X,\mathcal {T}_2)$ is productive.

Proof. Since $\Gamma _{\mathcal {T}_2}$ is abelian, we need to show that the natural surjection $\pi \colon \Gamma '_{X \times \mathcal {T}_2} \to \Gamma _X'$ is an isomorphism. We will do so by constructing an inverse. The inclusion of the braided subset $X \times \{0\}$ into $X \times \mathcal {T}_2$ induces a group homomorphism $\psi _0 \colon \Gamma _{X} \stackrel {\sim }{\to } \Gamma _{X \times \{0\}} \to \Gamma _{X \times \mathcal {T}_2}$. We then get a homomorphism $\psi _0' \colon \Gamma _X' \to \Gamma _{X \times \mathcal {T}_2}'$. Arguing in the same way with $X \times \{1\}$ in place of $X \times \{0\}$, we get a homomorphism $\psi _1' \colon \Gamma _X' \to \Gamma _{X \times \mathcal {T}_2}'$. We claim that these homomorphisms coincide, namely $\psi _0' = \psi _1'$.

The commutator subgroup is generated by conjugates of commutators of elements in a generating set, so it suffices to check that $\psi _0'([x,y]^g) = \psi _1'([x,y]^g)$ for every $x,y \in X$ and $g \in \Gamma _X$. Equivalently, it is enough to show that $\psi _0([x,y])^{\psi _0(g)} = \psi _1([x,y])^{\psi _1(g)}$. Since conjugation by $\psi _0(g)$ coincides with conjugation by $\psi _1(g)$, our task is to show that $[\psi _0(x), \psi _0(y)] = [\psi _1(x), \psi _1(y)]$. Indeed, we have

(4.1)\begin{equation} (x,0)(y,0)(x,0)^{-1}(y,0)^{-1} = (x,1)(y,0)(x,1)^{-1}(y,0)^{-1} = (x,1)(y,1)(x,1)^{-1}(y,1)^{-1}. \end{equation}

We have thus proven our claim that $\psi _0' = \psi _1'$. We henceforth denote this map by $\psi '$.

Now we claim that $\psi '$ is an inverse of the natural surjection $\pi$. Since the inclusion of the braided set $X \times \{0\}$ into $X \times \mathcal {T}_2$ is a section of the projection $X \times \mathcal {T}_2 \to X$, it follows that $\pi \circ \psi '$ is the identity on $\Gamma _X'$. It remains to check that the homomorphism $\psi ' \circ \pi$ is the identity on $\Gamma _{X \times \mathcal {T}_2}'$. It suffices to show that $\psi ' \circ \pi$ acts as the identity on some generating set of $\Gamma _{X \times \mathcal {T}_2}'$. Since $X \times \mathcal {T}_2$ is a generating set for $\Gamma _{X \times \mathcal {T}_2}$ which is stable under conjugation, the subgroup $\Gamma _{X \times \mathcal {T}_2}'$ is generated by commutators of elements in $X \times \mathcal {T}_2$. Therefore, it is enough to check that $\psi ' \circ \pi$ acts as the identity on such commutators. This follows at once from (4.1). We have thus shown that $\psi '$ and $\pi$ are inverse to each other.

Corollary 4.27 For a productive pair $(X,Y)$ of self-distributive braided sets, the natural group homomorphism $\Gamma _{X \times Y} \to \Gamma _X \times \Gamma _Y \times \Gamma _{X \times Y}^{\text {ab}}$ is injective.

Proof. Let $a$ be an element in the kernel of our homomorphism. Since $a$ maps to the identity in $\Gamma _{X \times Y}^{\text {ab}}$ we get that $a \in \Gamma _{X \times Y}'$. It follows from the definition of a productive pair that $a = 1$. We have thus shown that the kernel of our homomorphism is trivial, so our homomorphism is indeed injective.

Definition 4.28 We say that a pair of braided sets $(X,Y)$ is synchronized if the natural surjection

\[ (X \times Y)_{\text{triv}} \to X_{\text{triv}} \times Y_{\text{triv}} \]

is a bijection.

For every braided set $X$ and every squarefree braided set $Y$, the pair $(X,Y)$ is synchronized. In particular, for every braided set $X$, the pair $(X, \mathcal {T}_2)$ is synchronized.

Corollary 4.29 For a productive synchronized pair $(X,Y)$ of self-distributive braided sets, the natural group homomorphism $\Gamma _{X \times Y} \to \Gamma _X \times \Gamma _Y \times \Gamma _{X_{\text {triv}} \times Y_{\text {triv}}}$ is injective.

Proof. In view of Corollary 4.27, it suffices to check that the natural homomorphism from $\Gamma _{X \times Y}^{\text {ab}}$, which we identify with $\Gamma _{(X\times Y)_{\text {triv}}}$, to $\Gamma _{X_{\text {triv}} \times Y_{\text {triv}}}$ is injective. This follows at once from our assumption that the pair $(X,Y)$ is synchronized.

Theorem 4.30 Let $(X,Y)$ be a productive synchronized pair of self-distributive braided sets. Let $N$ be sufficiently large, and let $v,w \in S^*_{X \times Y}(N)$ satisfying the following three conditions.

  • The projection of $v$ to $S_X$ coincides with the projection of $w$ to $S_X$.

  • The projection of $v$ to $S_Y$ coincides with the projection of $w$ to $S_Y$.

  • For every $t \in X_{\text {triv}} \times Y_{\text {triv}}$ the number of entries of $v$ that project to $t$ coincides with the number of entries of $w$ that project to $t$.

Then $v=w$.

Proof. By Theorem 4.24, it suffices to show that $v = w$ in $\Gamma _{X \times Y}$. In view of Corollary 4.29, it is enough to show that the images of $v$ and $w$ in each of the groups $\Gamma _X$, $\Gamma _Y$, $\Gamma _{X_{\text {triv}} \times Y_{\text {triv}}}$ agree. This is guaranteed by the three conditions that $v$ and $w$ satisfy.

4.3 Image of Stabilizer in $S_n$

As in Definition 4.18, let $X$ be a finite self-distributive braided set, let $C_1, \ldots, C_k$ be the connected components of $X$, let $n$ be a positive integer, and let $s = (x_1, \ldots, x_n) \in X^n$. For $1 \leq j \leq k$ put $D_j = \{1 \leq i \leq n : x_i \in C_j\}$ and $n_j = |D_j|$. We will make the identification

\[ \{\sigma \in S_n : \sigma(D_j) = D_j \text{ for every } 1 \leq j \leq k \} = S_{n_1} \times \cdots \times S_{n_k}. \]

Recall that $B_{n_1, \ldots, n_k}$ is the inverse image of $S_{n_1} \times \cdots \times S_{n_k}$ under the homomorphism from $B_n$ to $S_n$.

Corollary 4.31 The image in $S_n$ of the stabilizer in $B_n$ of $s$ is contained in $S_{n_1} \times \cdots \times S_{n_k}$. In other words, the stabilizer of $s$ in $B_n$ is the stabilizer of $s$ in $B_{n_1, \ldots, n_k}$.

Proof. This is an immediate consequence of Proposition 4.20.

Theorem 4.32 Suppose that $n_1, \ldots, n_k$ are sufficiently large, and that $\langle x_1, \ldots, x_n \rangle = X$. Then the image in $S_{n_1} \times \cdots \times S_{n_k}$ of the stabilizer in $B_n$ (equivalently, in $B_{n_1, \ldots, n_k}$) of $s$ contains $A_{n_1} \times \cdots \times A_{n_k}$.

Proof. We invoke Corollary 3.7 so given $X_j,Y_j \subseteq D_j$ with $|X_j| = |Y_j| = \lfloor n_j/2 \rfloor$, we need to show that there exists $g \in B_n$ such that $s^g = s$ and $X_j^g = Y_j$ for all $1 \leq j \leq k$. For $1 \leq i \leq n$ put

\[ u_i = \begin{cases} 1 & i \in X_j \text{ for some } 1 \leq j \leq k ,\\ 0 & \text{otherwise,} \end{cases}\quad v_i = \begin{cases} 1 & i \in Y_j \text{ for some } 1 \leq j \leq k ,\\ 0 & \text{otherwise}. \end{cases} \]

Viewing $((x_1, u_1), \ldots, (x_n, u_n))$ and $((x_1, v_1), \ldots, (x_n, v_n))$ as elements of $(X \times \mathcal {T}_2)^n$, our task is to show that $((x_1, u_1), \ldots, (x_n, u_n))^g = ((x_1, v_1), \ldots, (x_n, v_n))$ for some $g \in B_n$.

Since the entries of $s$ generate $X$, they also generate $\operatorname {Inn}(X)$. As $\mathcal {T}_2$ is a trivial braided set, it follows that the entries of $((x_1, u_1), \ldots, (x_n, u_n))$ generate $\operatorname {Inn}(X \times \mathcal {T}_2)$, so these entries also generate $X \times \mathcal {T}_2$ because they map surjectively onto $(X \times \mathcal {T}_2)_{\text {triv}} = X_{\text {triv}} \times \mathcal {T}_2$. Similarly, the entries of $((x_1, v_1), \ldots, (x_n, v_n))$ generate $X \times \mathcal {T}_2$. Therefore, we need to show that $((x_1, u_1), \ldots, (x_n, u_n)) = ((x_1, v_1), \ldots, (x_n, v_n))$ as elements in $S^*_{X \times \mathcal {T}_2}(\lfloor \min \{n_1, \ldots, n_k\}/2 \rfloor )$.

By Proposition 4.26, the pair $(X, \mathcal {T}_2)$ is productive, so we can resort to Theorem 4.30 once we check the three conditions therein. The first condition is satisfied because both $((x_1, u_1), \ldots, (x_n, u_n))$ and $((x_1, v_1), \ldots, (x_n, v_n))$ project to $s$ in $S_X$. The second condition is satisfied because, by assumption,

\[ |X_1| + \cdots + |X_k| = |Y_1| + \cdots + |Y_k|, \]

so there exists $g \in B_n$ such that $(u_1, \ldots, u_n)^g = (v_1, \ldots, v_n)$ as elements in $\mathcal {T}_2^n$. To see that the third condition is satisfied, we fix $1 \leq j \leq k$ and $\lambda \in \mathcal {T}_2$. We have

\[ |\{1 \leq i \leq n : x_i \in C_j,\ u_i = \lambda\}| = \begin{cases} |X_j| & \lambda = 1,\\ n_j - |X_j| & \lambda = 0, \end{cases} \]

and, similarly,

\[ |\{1 \leq i \leq n : x_i \in C_j,\ v_i = \lambda\}| = \begin{cases} |Y_j| & \lambda = 1,\\ n_j - |Y_j| & \lambda = 0. \end{cases} \]

Since $|X_j| = |Y_j|$ by assumption, the third condition in Theorem 4.30 is indeed satisfied.

Remark 4.33 It is likely possible to extend Theorem 4.32 to nondegenerate (but not necessarily self-distributive) finite braided sets. To do this, one associates to a nondegenerate finite braided set $Y$ its derived self-distributive braided set $X$, as in [Reference SolovievSol00]. The key point is that this association gives an isomorphism $Y^n \to X^n$ of $B_n$-sets. We do not know whether it is possible to extend Theorem 4.32 to (some family of) degenerate braided sets. Perhaps a first step would be to obtain a version of the results in § 4.1 for more general braided sets.

Lemma 4.34 Suppose that $X$ is squarefree, and let $1 \leq \alpha < \beta \leq n$ with $x_\alpha = x_\beta$. Then the image in $S_n$ of the stabilizer of $s$ in $B_n$ contains the transposition $(\alpha\ \beta )$.

Proof. Since $X$ is squarefree and self-distributive, we see that the element

\[ R_{\alpha, \beta} = \sigma_{\beta-1} \sigma_{\beta-2} \cdots \sigma_{\alpha+1} \sigma_{\alpha} \sigma_{\alpha+1}^{-1} \cdots \sigma_{\beta-2}^{-1} \sigma_{\beta-1}^{-1} \in B_n \]

lies in the stabilizer of $(x_1, \ldots, x_n)$, and that its image in $S_{n}$ is the transposition $(\alpha \ \beta )$.

Proposition 4.35 Suppose that $X$ is squarefree, set $N = \max _{1 \leq j \leq k} |C_j|$, and assume that $n_j > N$ for every $1 \leq j \leq k$. Then the image in $S_{n_1} \times \cdots \times S_{n_k}$ of the stabilizer in $B_{n_1, \ldots, n_k}$ of $s$ surjects onto $(\mathbb {Z} / 2 \mathbb {Z})^k$ under the sign homomorphisms.

Proof. Fix $1 \leq j \leq k$. Our task is to find an element in the stabilizer of $(x_1, \ldots, x_n)$ whose image in $S_{n_j}$ is an odd permutation, for instance a transposition, and whose image in $S_{n_r}$ for every $1 \leq r \leq k$ with $r \neq j$ is trivial. It follows from our definition of $N$ and the assumption on $n_j$ that there exist two indices $\alpha < \beta$ in $D_j$ for which $x_\alpha = x_\beta$. The required element is supplied to us by Lemma 4.34.

The argument in the proof above also gives the following.

Corollary 4.36 Suppose that $X$ is squarefree, and that $n_j > |C_j|$ for some $1 \leq j \leq k$. Then the image in $S_n$ of the stabilizer in $B_n$ (equivalently, in $B_{n_1, \ldots, n_k}$) of $s$ is not contained in $A_n$.

Corollary 4.37 Suppose that $n_1, \ldots, n_k$ are sufficiently large, that $\langle x_1, \ldots, x_n \rangle = X$, and that $X$ is squarefree. Then the image in $S_{n}$ of the stabilizer in $B_n$ (equivalently, in $B_{n_1, \ldots, n_k}$) of $s$ is $S_{n_1} \times \cdots \times S_{n_k}$.

Proof. Denote this image by $H$. By Theorem 4.32, $H$ contains $A_{n_1} \times \cdots \times A_{n_k}$. It follows from Proposition 4.35 that the restriction of the quotient map $S_{n_1} \times \cdots \times S_{n_k} \to S_{n_1} \times \cdots \times S_{n_k}/A_{n_1} \times \cdots \times A_{n_k}$ to $H$ is surjective. We conclude that $H = S_{n_1} \times \cdots \times S_{n_k}$ as required.

From Theorem 4.32 and Corollary 4.37 we get Theorem 2.4.

Example 4.38 By considering the case of the nontrivial self-distributive braided set $X = \mathcal {N} = \{\eta, \xi \}$ from Example 4.8, we show that the squarefreeness assumption in Corollary 4.37 is necessary in order to obtain a result stronger than in Theorem 4.32.

First note that in this case $X$ is connected. Now take any $s \in X^n$, and observe that its entries necessarily generate $X$. It is readily checked that for each of the generators $\sigma _i$ for $1 \leq i \leq n-1$, the parity of the number of appearances of $\eta$ in $s$ differs from the parity of the number of appearances of $\eta$ in $s^{\sigma _i}$. As a result, for every $g \in B_n$, the number of times $\eta$ appears in $s$ is congruent mod $2$ to the number of times $\eta$ appears in $s^g$ if and only if the image of $g$ in $S_n$ lies in $A_n$. We conclude that the image in $S_n$ of the stabilizer of $s$ in $B_n$ is contained in $A_n$.

We sketch an additional proof of Corollary 4.37.

Proposition 4.39 There exists a nonnegative integer $r$ for which the following holds. If $n_1, \ldots, n_k$ are large enough, and $\langle x_1, \ldots, x_n \rangle = X$, then the $B_{n_1, \ldots, n_k}$-orbit of $s$ contains both:

  • an $n$-tuple whose entries indexed by the first $n_j-r$ indices in $D_j$ coincide for every $1 \leq j \leq k$; and

  • an $n$-tuple $(y_1, \ldots, y_n)$ with $|\{1 \leq i \leq n : y_i = x\}| > r$ for every $x \in X$.

Proof. This follows from Proposition 4.22 and Corollary 4.21.

Proof of Corollary 4.37 It follows from the first item in Proposition 4.39 in conjunction with Lemma 4.34 that there exists a nonnegative integer $r$ such that the image in $S_{n_1} \times \cdots \times S_{n_k}$ of the stabilizer in $B_n$ of $s$ contains the subgroup $S_{n_1 - r} \times \cdots \times S_{n_k - r}$. From the second item in Proposition 4.39 and Lemma 4.34 we conclude that this image also contains a conjugate of an element whose projections to $S_{n_1}, \ldots, S_{n_r}$ have all of their cycles of lengths exceeding $r$. We conclude by invoking Corollary 3.9.

4.3.1 Results for small $n$

Proposition 4.40 Let $m \geq 2$ be an integer, and let $X$ be the conjugacy class of all transpositions in the group $S_m$. Then for a positive integer $n$ and every $s \in X^n$ whose entries generate $X$, the restriction to the stabilizer of $s$ in $B_n$ of the homomorphism to $S_n$ is a surjection.

Remark 4.41 If there exists an $n$-tuple $s \in X^n$ whose entries generate $X$, then $n \geq m-1$.

Proof. Let $s = ((i_1 \ j_1), \ldots, (i_n \ j_n))$ be an $n$-tuple of transpositions in $S_m$ that generates $X$. Consider the simple graph $\Lambda$ whose set of vertices is $\{1, \ldots, n\}$ with $1 \leq \alpha < \beta \leq n$ adjacent in case

\[ \{i_\alpha, j_\alpha \} \cap \{i_\beta, j_\beta \} \neq \emptyset. \]

We claim that $\Lambda$ is connected.

To prove the claim, suppose toward a contradiction that there exist nonempty disjoint subsets $I,J$ of $\{1, \ldots, n\}$ with $I \cup J = \{1, \ldots, n\}$ such that there is no edge between any index in $I$ and any index in $J$. From our definition of $\Lambda$ it follows that the nonempty subsets

\[ \mathcal{I} = \bigcup_{\alpha \in I} \{i_\alpha, j_\alpha\},\quad \mathcal{J} = \bigcup_{\beta \in J} \{i_\beta, j_\beta \} \]

of $\{1, \ldots, m\}$ are disjoint. We conclude that

\[ X = \langle (i_1 \ j_1), \ldots, (i_n \ j_n) \rangle \leq \{(i \ j) : i\neq j,\ i,j \in \mathcal{I} \text{ or } i,j \in \mathcal{J} \} \lneq X. \]

This contradiction concludes the proof of our claim that $\Lambda$ is connected.

Denote by $H$ the image in $S_n$ of the stabilizer in $B_n$ of $s$. We note that for every $1\leq \alpha < \beta \leq n$ that are adjacent in $\Lambda$, the stabilizer of $s$ in $B_n$ contains the element

\[ R_{\alpha, \beta}^3 = \sigma_{\beta-1} \sigma_{\beta-2} \cdots \sigma_{\alpha+1} \sigma_{\alpha}^3 \sigma_{\alpha+1}^{-1} \cdots \sigma_{\beta-2}^{-1} \sigma_{\beta-1}^{-1}. \]

This element maps to the transposition $(\alpha \ \beta )$ in $S_n$, so $(\alpha \ \beta ) \in H$. We consider the subgroup

\[ H_0 = \langle \{ (\alpha \ \beta) : 1\leq \alpha < \beta \leq n \text{ are adjacent in } \Lambda \}\rangle \]

of $H$ generated by all such transpositions. Since $\Lambda$ is connected, it follows that $H_0$ is a transitive subgroup of $S_n$. The only transitive subgroup of $S_n$ that is generated by transpositions is $S_n$ itself, so $H_0 = S_n$ and, thus, $H = S_n$ as required.

Let $C \subseteq G$ be a generating set of a group $G$ which is a disjoint union of conjugacy classes of $G$. We have a left action of $G$ on the sets $\operatorname {Mor}(F_n,G)$, $\operatorname {Mor}^C(F_n,G)$, $\operatorname {Sur}^C(F_n,G)$, $\operatorname {Sur}^C_1(F_n,G)$ by postcomposition with conjugation. This action commutes with the right action of $B_n$, so we get a right action of $B_n$ on the sets of orbits $G \backslash \operatorname {Mor}(F_n,G)$, $G \backslash \operatorname {Mor}^C(F_n,G)$, $G \backslash \operatorname {Sur}^C(F_n,G)$, $G \backslash \operatorname {Sur}^C_1(F_n,G)$. With our identification of $\operatorname {Mor}(F_n, G)$ with $G^n$, this left action of $G$ is given by

\[ {}^g(g_1, \ldots, g_n) = (gg_1g^{-1}, \ldots, g g_n g^{-1}), \quad (g_1, \ldots, g_n) \in G^n,\, g \in G. \]

Proposition 4.42 For a positive integer $n$, and every $(g_1, g_2, \ldots, g_{n-1}, g_n) \in G \backslash \operatorname {Mor}(F_n, G)$ we have

\[ (g_1, g_2, \ldots, g_{n-1}, g_n)^{\sigma_{n-1} \sigma_{n-2} \cdots \sigma_2 \sigma_1} = (g_n, g_1, g_2, \ldots, g_{n-1}) \]

as classes in $G \backslash \operatorname {Mor}(F_n,G)$.

Proof. Viewing our representatives in $G \backslash \operatorname {Mor}(F_n,G)$ as elements in $\operatorname {Mor}(F_n,G)$, we see that the action of $\sigma _{n-1} \sigma _{n-2} \cdots \sigma _2 \sigma _1 \in B_n$ is given by $(g_1, g_2, \ldots, g_{n-1}, g_n)^{\sigma _{n-1} \sigma _{n-2} \cdots \sigma _2 \sigma _1} = (g_n, g_1^{g_n}, g_2^{g_n}, \ldots, g_{n-1}^{g_n})$. The required equality of classes in $G \backslash \operatorname {Mor}(F_n,G)$ is then seen by conjugating the right-hand side by $g_n$.

Definition 4.43 Let $G$ be a group and let $C \subset G$ be a conjugacy class. We say that $C$ is abundant if for some (equivalently, every) $x \in C$, there exists $y \in G$ such that the set $\{x^{y^r} : r \in \mathbb {Z}\}$ (of conjugates of $x$ by elements of the cyclic subgroup of $G$ generated by $y$) generates $G$.

Corollary 4.44 Let $G$ be a finite simple group. Then there exists an abundant conjugacy class $C \subset G$.

Proof. This is a special case of [Reference Burness, Guralnick and HarperBGH21, Corollary 4 (ii)].

Proposition 4.45 Let $G$ be a finite group, and let $C \subset G$ be an abundant conjugacy class. Then for every positive integer $n$ that is divisible by $|G|^2$, there exists $s \in G \backslash \operatorname {Sur}^C_1(F_n,G)$ such that the image of the stabilizer of $s$ under the homomorphism from $B_n$ to $S_n$ contains an $n$-cycle.

Proof. Since $G$ is a finite group, and $C$ is an abundant conjugacy class, there exist $x \in C$ and $y \in G$ such that $\langle x^{y^r} : 0 \leq r \leq |G| - 1 \rangle = G$. We set

\[ g = x \cdot x^y \cdot x^{y^2} \cdots x^{y^{|G|-1}} \in G,\quad s_0 = (x, x^y, x^{y^2}, x^{y^{|G|-1}}) \in G^{|G|} \]

and denote by $s$ the $ ({n}/{|G|})$-fold concatenation of $s_0$ with itself. Since $n$ is divisible by $|G|^2$, we see that $ {n}/{|G|}$ is a multiple of $|G|$, so multiplying the entries of $s$ (in order) gives $1 \in G$ because $g^{|G|} = 1$. We conclude that $s$ represents an element of $\operatorname {Sur}^C_1(F_n,G)$.

It follows from Proposition 4.42 that the class of $s$ in $G \backslash \operatorname {Sur}^C_1(F_n,G)$ is mapped under $\sigma _{n-1} \cdots \sigma _1$ to a class represented by a one-step cyclic right shift of $s$. Since conjugating this representative by $y^{-1} \in G$ gives $s$, we conclude that $\sigma _{n-1} \cdots \sigma _1$ lies in the stabilizer of the class of $s$ in $G \backslash \operatorname {Sur}^C_1(F_n,G)$. As $\sigma _{n-1} \cdots \sigma _1$ maps to an $n$-cycle in $S_n$, the image in $S_n$ of the stabilizer of $s$ contains an $n$-cycle.

Corollary 4.46 Let $G$ be a finite simple group. Then for every positive integer $n$ that is divisible by $|G|^2$ there exists $s \in G \backslash \operatorname {Sur}_1(F_n,G)$ such that the image of the stabilizer of $s$ under the homomorphism from $B_n$ to $S_n$ contains an $n$-cycle.

Proof. This is an immediate consequence of Corollary 4.44 and Proposition 4.45.

5. Proof of Theorem 1.10

For brevity of notation, we set $k = d_\lhd (G)$.

5.1 The Möbius function

Our task here is to prove the first part of Theorem 1.10, namely that

\[ \sum_{K \in \mathcal{E}_q^C(G;n_1, \ldots, n_{k})} (-1)^{|\operatorname{ram}(K)|} = o(|\mathcal{E}_q^C(G;n_1, \ldots, n_{k})|),\quad q \to \infty,\quad \gcd(q, |G|) = 1, \]

assuming $n_j > |C_j|$ for some $1 \leq j \leq k$.

We recall from [Reference Liu, Wood and Zureick-BrownLWZ19, § 11.4, Theorem 11.1, Lemma 11.2, Proposition 11.4] that there exists a smooth separated scheme $\mathsf {Hur}_{G,C}^{n_1, \ldots, n_k}$ of finite type over $\mathbb {Z}[|G|^{-1}]$ with

\[ \mathsf{Hur}_{G,C}^{n_1, \ldots, n_k}(\mathbb{F}_q) = \mathcal{E}_q^C(G;n_1, \ldots, n_k). \]

The scheme $\mathsf {Hur}_{G,C}^{n_1, \ldots, n_k}$ is (pure) of relative dimension $n$ over $\mathbb {Z}[|G|^{-1}]$.

Let $\text {Conf}^{n_1, \ldots, n_k}$ be the $k$-colored configuration space of $n_j$ unordered points of color $j$, for every $1 \leq j \leq k$, on the affine line such that all points are distinct (whether they have the same color or not). We can view a point of this space over a field as a $k$-tuple of pairwise coprime monic squarefree polynomials of degrees $n_1, \ldots, n_k$ over that field. The space $\text {Conf}^{n_1, \ldots, n_k}$ is a smooth separated scheme of finite type over $\mathbb {Z}$, and thus remains so after restriction to $\mathbb {Z}[|G|^{-1}]$. [Reference Liu, Wood and Zureick-BrownLWZ19, Proposition 11.4] provides us with a finite étale map

\[ \pi \colon \mathsf{Hur}_{G,C}^{n_1, \ldots, n_k} \to \text{Conf}^{n_1, \ldots, n_k}, \]

such that $\pi (K) = (D_K(1), \ldots, D_K(k))$ on the level of $\mathbb {F}_q$-points.

Let $\text {PConf}^n$ be the locus in $\mathbb {A}^n_{\mathbb {Z}}$ where all coordinates are pairwise distinct. We can view a point of this space over a field as an (ordered) $n$-tuple of distinct scalars from that field. We consider the finite étale map $\rho \colon \text {PConf}^n \to \text {Conf}^{n_1, \ldots, n_k}$ which given an $n$-tuple $(\lambda _1, \ldots, \lambda _n)$ of distinct scalars from a field, assigns the color $j$ to the scalars $\lambda _{n_1 + \cdots + n_{j-1} + 1}, \ldots, \lambda _{n_1 + \cdots + n_j}$ for every $1 \leq j \leq k$. We can also write

\[ \rho(\lambda_1, \ldots, \lambda_n) = \bigg(\prod_{r=1}^{n_j} (T-\lambda_{n_1 + \cdots + n_{j-1} + r})\bigg)_{j = 1, \ldots, k}, \]

where the right-hand side is a $k$-tuple of pairwise coprime monic squarefree polynomials. The map $\rho$ is a Galois cover with Galois group $S_{n_1} \times \cdots \times S_{n_k}$ that acts by permuting the roots of each polynomial.

Take an auxiliary prime number $\ell > n$ with $\ell$ not dividing $q$. We fix an isomorphism of fields $\iota \colon \overline {\mathbb {Q}_\ell } \to \mathbb {C}$, and will at times (silently) identify these fields via $\iota$.

Using $\rho$ we can view every finite-dimensional representation $W$ of $S_{n_1} \times \cdots \times S_{n_k}$ over $\overline {\mathbb {Q}_\ell }$ as a lisse (étale) $\overline {\mathbb {Q}_\ell }$-sheaf on $\text {Conf}^{n_1, \ldots, n_k}$ punctually pure of weight $0$. We write $\chi _W$ for the character of $W$, and for $(f_1, \ldots, f_k) \in \text {Conf}^{n_1, \ldots, n_k}(\mathbb {F}_q)$ we denote by $(\sigma _{f_1}, \ldots, \sigma _{f_k})$ the conjugacy class in $S_{n_1} \times \cdots \times S_{n_k}$ corresponding to the permutation induced by $\operatorname {Frob}_q$ on (the roots of) $f_j$ for every $1 \leq j \leq k$. The cycle structure of $\sigma _{f_j}$ is the multiset of degrees of the monic irreducible factors of $f_j$. We therefore have

\[ \operatorname{tr}(\operatorname{Frob}_q, W_{(f_1, \ldots, f_k)}) = \chi_W(\sigma_{f_1}, \ldots, \sigma_{f_k}), \]

an expression for the trace of Frobenius on the stalk of (the sheaf corresponding to) $W$ at a geometric point of $\text {Conf}^{n_1, \ldots, n_k}$ lying over $(f_1, \ldots, f_k)$.

Let $\pi ^* W$ be the lisse sheaf (punctually pure of weight $0$) on $\mathsf {Hur}_{G,C}^{n_1, \ldots, n_k}$ obtained by pulling back $W$. For every $K \in \mathcal {E}_q^C(G;n_1, \ldots, n_k)$ we have

\[ \operatorname{tr}(\operatorname{Frob}_q, (\pi^* W)_{K}) = \operatorname{tr}(\operatorname{Frob}_q, W_{\pi(K)}) = \operatorname{tr}(\operatorname{Frob}_q, W_{(D_K(1), \ldots, D_K(k))}) = \chi_W(\sigma_{D_K(1)}, \ldots, \sigma_{D_K(k)}). \]

In the special case $W = \text {sgn}_1 \boxtimes \cdots \boxtimes \text {sgn}_k$, the sign representation of $S_{n_1} \times \cdots \times S_{n_k}$, for $K$ in $\mathcal {E}_q^C(G;n_1, \ldots, n_k)$ we have

\[ \chi_{\text{sgn}_1 \boxtimes \cdots \boxtimes \text{sgn}_k}(\sigma_{D_K(1)}, \ldots, \sigma_{D_K(k)}) = (-1)^n \cdot (-1)^{|\operatorname{ram}(K)|}, \]

so

\[ \sum_{K \in \mathcal{E}_q^C(G;n_1, \ldots, n_k)} (-1)^{|\operatorname{ram}(K)|} = (-1)^n \sum_{K \in \mathsf{Hur}_{G,C}^{n_1, \ldots, n_k}(\mathbb{F}_q)} \operatorname{tr}(\operatorname{Frob}_q, (\pi^* \text{sgn}_1 \boxtimes \cdots \boxtimes \text{sgn}_k)_{K}\!). \]

We can assume that $\mathsf {Hur}_{G,C}^{n_1, \ldots, n_k}$ has an $\mathbb {F}_q$-point since otherwise the sum above is over the empty set, so the statement to be proven holds trivially. It follows that $\mathsf {Hur}_{G,C}^{n_1, \ldots, n_k}$ has an $\mathbb {F}_q$-rational component, so from the Lang–Weil bound applied to that component we get that

\[ \liminf_{\substack{q \to \infty \\ \gcd(q, |G|) = 1}} \frac{|\mathcal{E}_q^C(G;n_1, \ldots, n_k)|}{q^n} > 0. \]

Our task is therefore to show that

\[ \lim_{\substack{q \to \infty \\ \gcd(q, |G|) = 1}} \frac{1}{q^n} \sum_{K \in \mathsf{Hur}_{G,C}^{n_1, \ldots, n_k}(\mathbb{F}_q) } \operatorname{tr}(\operatorname{Frob}_q, (\pi^* W)_{K}\!) = 0,\quad W = \text{sgn}_1 \boxtimes \cdots \boxtimes \text{sgn}_k. \]

From now until almost the end of the proof, we will task ourselves with computing (under suitable assumptions on $n_1, \ldots, n_k$) the limit above for an arbitrary finite-dimensional representation $W$ of $S_{n_1} \times \cdots \times S_{n_k}$ over $\overline {\mathbb {Q}_\ell }$. It is only at the very end that we will specialize again to $W = \text {sgn}_1 \boxtimes \cdots \boxtimes \text {sgn}_k$ and deduce that the limit is indeed $0$ in case $n_j > |C_j|$ for some $1 \leq j \leq k$.

By the Grothendieck–Lefschetz trace formula, we have

\begin{align*} & \lim_{\substack{q \to \infty \\ \gcd(q, |G|) = 1}} \frac{1}{q^n} \sum_{K \in \mathsf{Hur}_{G,C}^{n_1, \ldots, n_k}(\mathbb{F}_q) } \operatorname{tr}(\operatorname{Frob}_q, (\pi^* W)_{K}) \\ &\quad =\lim_{\substack{q \to \infty \\ \gcd(q, |G|) = 1}} \frac{1}{q^n} \sum_{i=0}^{2n} (-1)^i \operatorname{tr}(\operatorname{Frob}_q, H_c^i( \mathsf{Hur}_{G,C}^{n_1, \ldots, n_k} \times_{\mathbb{Z}[|G|^{-1}]} \overline{\mathbb{F}_q} , \pi^*W)), \end{align*}

where we use the notation $\pi ^* W$ also for the pullback of this sheaf to $\overline {\mathbb {F}_q}$. We claim first that there is no contribution to the limit from $0 \leq i \leq 2n-1$. Indeed since $\pi ^* W$ is punctually pure of weight $0$, Deligne's Riemann hypothesis gives an upper bound of $q^{i/2}$ on the absolute value of each eigenvalue of $\operatorname {Frob}_q$ on $H_c^i(\mathsf {Hur}_{G,C}^{n_1, \ldots, n_k} \times _{\mathbb {Z}[|G|^{-1}]} \overline {\mathbb {F}_q} , \pi ^*W)$, and the dimension over $\overline {\mathbb {Q}_\ell }$ of these cohomology groups is bounded independently of $q$, so dividing the trace of $\operatorname {Frob}_q$ by $q^n$ and taking $q \to \infty$ gives $0$ in the limit. We therefore have

\begin{align*} & \lim_{\substack{q \to \infty \\ \gcd(q, |G|) = 1}} \frac{1}{q^n} \sum_{i=0}^{2n} (-1)^i \operatorname{tr}(\operatorname{Frob}_q, H_c^i( \mathsf{Hur}_{G,C}^{n_1, \ldots, n_k} \times_{\mathbb{Z}[|G|^{-1}]} \overline{\mathbb{F}_q} , \pi^*W)) \\ &\quad =\lim_{\substack{q \to \infty \\ \gcd(q, |G|) = 1}} \frac{1}{q^n} \operatorname{tr}(\operatorname{Frob}_q, H_c^{2n}( \mathsf{Hur}_{G,C}^{n_1, \ldots, n_k} \times_{\mathbb{Z}[|G|^{-1}]} \overline{\mathbb{F}_q} , \pi^*W)). \end{align*}

Since representations of finite groups in characteristic $0$ are semisimple, we can find a subrepresentation $U$ of $W$ for which

\[ W = U \oplus W^{S_{n_1} \times \cdots \times S_{n_k}}. \]

We then have $U^{S_{n_1} \times \cdots \times S_{n_k}} = 0$, and

\begin{align*} & \lim_{\substack{q \to \infty \\ \gcd(q, |G|) = 1}} \frac{1}{q^n} \operatorname{tr}(\operatorname{Frob}_q, H_c^{2n}( \mathsf{Hur}_{G,C}^{n_1, \ldots, n_k} \times_{\mathbb{Z}[|G|^{-1}]} \overline{\mathbb{F}_q} , \pi^*W)) \\ &\quad =\lim_{\substack{q \to \infty \\ \gcd(q, |G|) = 1}} \frac{1}{q^n} \operatorname{tr}(\operatorname{Frob}_q, H_c^{2n}( \mathsf{Hur}_{G,C}^{n_1, \ldots, n_k} \times_{\mathbb{Z}[|G|^{-1}]} \overline{\mathbb{F}_q} , \pi^*U)) \\ &\qquad +\dim_{\overline{\mathbb{Q}_\ell}} W^{S_{n_1} \times \cdots \times S_{n_k}} \lim_{\substack{q \to \infty \\ \gcd(q, |G|) = 1}} \frac{1}{q^n} \operatorname{tr}(\operatorname{Frob}_q, H_c^{2n} (\mathsf{Hur}_{G,C}^{n_1, \ldots, n_k} \times_{\mathbb{Z}[|G|^{-1}]} \overline{\mathbb{F}_q}, \overline{\mathbb{Q}_\ell})). \end{align*}

Since $\mathsf {Hur}_{G,C}^{n_1, \ldots, n_k} \times _{\mathbb {Z}[|G|^{-1}]} \overline {\mathbb {F}_q}$ is of dimension $n$, the action of $\operatorname {Frob}_q$ on its topmost compactly supported étale cohomology (with constant coefficients, namely $\overline {\mathbb {Q}_\ell }$) is via multiplication by $q^n$, so the above equals

\begin{align*} & \lim_{\substack{q \to \infty \\ \gcd(q, |G|) = 1}} \frac{1}{q^n} \operatorname{tr}(\operatorname{Frob}_q, H_c^{2n}( \mathsf{Hur}_{G,C}^{n_1, \ldots, n_k} \times_{\mathbb{Z}[|G|^{-1}]} \overline{\mathbb{F}_q} , \pi^*U)) \\ &\quad +\dim_{\overline{\mathbb{Q}_\ell}} W^{S_{n_1} \times \cdots \times S_{n_k}} \cdot \dim_{\overline{\mathbb{Q}_\ell}} H_c^{2n}( \mathsf{Hur}_{G,C}^{n_1, \ldots, n_k} \times_{\mathbb{Z}[|G|^{-1}]} \overline{\mathbb{F}_q} , \overline{\mathbb{Q}_\ell}). \end{align*}

Since $\pi$ is finite, it follows from the Leray spectral sequence with compact supports that

\[ H_c^{2n}( \mathsf{Hur}_{G,C}^{n_1, \ldots, n_k} \times_{\mathbb{Z}[|G|^{-1}]} \overline{\mathbb{F}_q} , \pi^*U) \cong H_c^{2n}( \text{Conf}^{n_1, \ldots, n_k} \times_{\mathbb{Z}} \overline{\mathbb{F}_q} , \pi_* \pi^*U), \]

where $\pi _*$ is the pushforward of lisse sheaves by $\pi$. One readily checks that $\pi _* \pi ^* U \cong \pi _* \overline {\mathbb {Q}_\ell } \otimes U$ so

\[ H_c^{2n}( \text{Conf}^{n_1, \ldots, n_k} \times_{\mathbb{Z}} \overline{\mathbb{F}_q} , \pi_* \pi^*U) \cong H_c^{2n}( \text{Conf}^{n_1, \ldots, n_k} \times_{\mathbb{Z}} \overline{\mathbb{F}_q} , \pi_* \overline{\mathbb{Q}_\ell} \otimes U). \]

Since the sheaves $\pi _* \overline {\mathbb {Q}_\ell }$ and $U$ are self-dual, from Poincaré duality we get that

\[ H_c^{2n}( \text{Conf}^{n_1, \ldots, n_k} \times_{\mathbb{Z}} \overline{\mathbb{F}_q} , \pi_* \overline{\mathbb{Q}_\ell} \otimes U) \cong H^{0}( \text{Conf}^{n_1, \ldots, n_k} \times_{\mathbb{Z}} \overline{\mathbb{F}_q} , \pi_* \overline{\mathbb{Q}_\ell} \otimes U). \]

It follows from the proof of [Reference Liu, Wood and Zureick-BrownLWZ19, Lemma 10.3] that with our choice of $\ell$ we have

\[ H^{0}( \text{Conf}^{n_1, \ldots, n_k} \times_{\mathbb{Z}} \overline{\mathbb{F}_q} , \pi_* \overline{\mathbb{Q}_\ell} \otimes U) = H^{0}( \text{Conf}^{n_1, \ldots, n_k}(\mathbb{C}), \pi_* \overline{\mathbb{Q}_\ell} \otimes U), \]

where on the right-hand side we take singular cohomology, viewing $\pi _* \overline {\mathbb {Q}_\ell } \otimes U$ as a local system on $\text {Conf}^{n_1, \ldots, n_k}(\mathbb {C})$, or rather as a representation of the fundamental group $B_{n_1, \ldots, n_k}$ of $\text {Conf}^{n_1, \ldots, n_k}(\mathbb {C})$. Therefore, we have

\[ H^{0}( \text{Conf}^{n_1, \ldots, n_k}(\mathbb{C}) , \pi_* \overline{\mathbb{Q}_\ell} \otimes U) \cong (\pi_* \overline{\mathbb{Q}_\ell} \otimes U)^{B_{n_1, \ldots, n_k}}. \]

Denote by $\overline {\mathbb {Q}_\ell }(C_1^{n_1} \times \cdots \times C_k^{n_k})_1^*$ the permutation representation over $\overline {\mathbb {Q}_\ell }$ associated to the action of $B_{n_1, \ldots, n_k}$ on $(C_1^{n_1} \times \cdots \times C_k^{n_k})_1^*$. It follows from the proof of [Reference Liu, Wood and Zureick-BrownLWZ19, Theorem 12.4] that

\[ \pi_*\overline{\mathbb{Q}_\ell} \cong \overline{\mathbb{Q}_\ell} (C_1^{n_1} \times \cdots \times C_k^{n_k})_1^*, \]

so

\[ (\pi_* \overline{\mathbb{Q}_\ell} \otimes U)^{B_{n_1, \ldots, n_k}} \cong (\overline{\mathbb{Q}_\ell}(C_1^{n_1} \times \cdots \times C_k^{n_k})_1^* \otimes U)^{B_{n_1, \ldots, n_k}}. \]

Since permutation representations are self-dual, we have

\[ (\overline{\mathbb{Q}_\ell}(C_1^{n_1} \times \cdots \times C_k^{n_k})_1^* \otimes U)^{B_{n_1, \ldots, n_k}} \cong \operatorname{Hom}_{B_{n_1, \ldots, n_k}}(\overline{\mathbb{Q}_\ell} (C_1^{n_1} \times \cdots \times C_k^{n_k})_1^*, U). \]

Let $S$ be a set of representatives for the orbits of the action of $B_{n_1, \ldots, n_k}$ on $(C_1^{n_1} \times \cdots \times C_k^{n_k})_1^*$. That is, for every $t \in (C_1^{n_1} \times \cdots \times C_k^{n_k})_1^*$ there exists a unique $s \in S$ that lies in the orbit of $t$ under the action of $B_{n_1, \ldots, n_k}$. For $s \in S$, denoting by $B_{n_1, \ldots, n_k,s}$ the stabilizer of $s$ in $B_{n_1, \ldots, n_k}$, we see that

\[ \overline{\mathbb{Q}_\ell}(C_1^{n_1} \times \cdots \times C_k^{n_k})_1^* \cong \bigoplus_{s \in S} \operatorname{Ind}_{B_{n_1, \ldots, n_k, s}}^{B_{n_1, \ldots, n_k}} \overline{\mathbb{Q}_\ell} \]

and, as a result,

\[ \operatorname{Hom}_{B_{n_1, \ldots, n_k}}(\overline{\mathbb{Q}_\ell}(C_1^{n_1} \times \cdots \times C_k^{n_k})_1^*, U) \cong \bigoplus_{s \in S} \operatorname{Hom}_{B_{n_1, \ldots, n_k}} (\operatorname{Ind}_{B_{n_1, \ldots, n_k, s}}^{B_{n_1, \ldots, n_k}}\overline{\mathbb{Q}_\ell}, U). \]

It follows from Frobenius reciprocity that

\[ \bigoplus_{s \in S} \operatorname{Hom}_{B_{n_1, \ldots, n_k}} (\operatorname{Ind}_{B_{n_1, \ldots, n_k, s}}^{B_{n_1, \ldots, n_k}}\overline{\mathbb{Q}_\ell}, U) \cong \bigoplus_{s \in S} \operatorname{Hom}_{B_{n_1, \ldots, n_k, s}}(\overline{\mathbb{Q}_\ell},U). \]

The homomorphism from $B_{n_1, \ldots, n_k}$ to $S_{n_1} \times \cdots \times S_{n_k}$ arising from the cover $\text {PConf}^n(\mathbb {C}) \to \text {Conf}^{n_1, \ldots, n_k}(\mathbb {C})$ is the one we have considered in previous sections. In case $n_1, \ldots, n_k$ are large enough, Corollary 4.37 tells us that for every $s \in S$ the restriction to $B_{n_1, \ldots, n_k, s}$ of the homomorphism from $B_{n_1, \ldots, n_k}$ to $S_{n_1} \times \cdots \times S_{n_k}$ is surjective, so

\[ \bigoplus_{s \in S} \operatorname{Hom}_{B_{n_1, \ldots, n_k, s}}(\overline{\mathbb{Q}_\ell},U) \cong \bigoplus_{s \in S} \operatorname{Hom}_{S_{n_1} \times \cdots \times S_{n_k}} (\overline{\mathbb{Q}_\ell},U) \cong \bigoplus_{s \in S} U^{S_{n_1} \times \cdots \times S_{n_k}} = 0. \]

Therefore, in case $n_1, \ldots, n_k$ are large enough, the limit we wanted to compute is

\[ \dim_{\overline{\mathbb{Q}_\ell}} W^{S_{n_1} \times \cdots \times S_{n_k}} \cdot \dim_{\overline{\mathbb{Q}_\ell}} H^{0}( \mathsf{Hur}_{G,C}^{n_1, \ldots, n_k}(\mathbb{C}), \overline{\mathbb{Q}_\ell}). \]

As we stated earlier, at last we specialize to the case $W = \text {sgn}_1 \boxtimes \cdots \boxtimes \text {sgn}_k$. In this case we have

\[ U = W = \text{sgn}_1 \boxtimes \cdots \boxtimes \text{sgn}_k,\quad W^{S_{n_1} \times \cdots \times S_{n_k}} = 0. \]

By assumption $n_j > |C_j|$ for some $1 \leq j \leq k$ so it follows from Corollary 4.36 that

\[ \bigoplus_{s \in S} \operatorname{Hom}_{B_{n_1, \ldots, n_k, s}} (\overline{\mathbb{Q}_\ell},U) \cong \bigoplus_{s \in S} U^{B_{n_1, \ldots, n_k, s}} = 0. \]

Hence, in this case the limit we wanted to compute is indeed $0$.

5.2 The von Mangoldt function

Here we prove the second part of Theorem 1.10, namely that

\[ \sum_{K \in \mathcal{E}_q^C(G;n_1, \ldots, n_{k})} \mathbf{1}_{|\operatorname{ram}(K)| = k} \sim \frac{|\mathcal{E}_q^C(G;n_1, \ldots, n_{k})|}{n_1 \cdots n_{k}},\quad q \to \infty, \quad \gcd(q, |G|) = 1, \]

assuming that $n_1, \ldots, n_k$ are sufficiently large. We will freely use arguments and conclusions from the previous subsection.

We denote by $\Lambda _j$ the von Mangoldt function, which for our purposes is defined on monic squarefree polynomials of degree $n_j$ over $\mathbb {F}_q$, taking the value $n_j$ on irreducible polynomials and the value $0$ on reducible polynomials. For $K \in \mathcal {E}_q^C(G;n_1, \ldots, n_{k})$ we therefore have

\[ \mathbf{1}_{|\operatorname{ram}(K)| = k} = \frac{1}{n_1 \cdots n_k} \cdot \prod_{j=1}^k \Lambda_j(D_K(j)). \]

We denote by $\operatorname {std}_j$ the standard representation of $S_{n_j}$ (of dimension $n_j-1$) over $\overline {\mathbb {Q}_\ell }$. By [Reference SawinSaw21, Lemma 3.6], for every monic squarefree polynomial $f$ of degree $n_j$ over $\mathbb {F}_q$, we have

\[ \Lambda_j(f) = \sum_{i=0}^{n_j-1} (-1)^i \chi_{\wedge^i(\operatorname{std}_j)}(\sigma_f), \]

where $\sigma _f$ is the conjugacy class in $S_{n_j}$ of the permutation induced by the map $z \mapsto z^q$ on the (necessarily distinct) roots of $f$. We conclude that for every $K \in \mathcal {E}_q^C(G;n_1, \ldots, n_{k})$ we have

\[ \mathbf{1}_{|\operatorname{ram}(K)| = k} = \frac{1}{n_1 \cdots n_k} \cdot \prod_{j=1}^k \sum_{i=0}^{n_j-1} (-1)^i \chi_{\wedge^i(\operatorname{std}_j)}(\sigma_{D_K(j)}). \]

The contribution to the right-hand side above from taking the $i=0$ term for every $1 \leq j \leq k$ is $ {1}/({n_1 \cdots n_k})$, so summing this over all $K \in \mathcal {E}_q^C(G;n_1, \ldots, n_{k})$ gives the required main term $ {| \mathcal {E}_q^C(G;n_1, \ldots, n_{k})|}/({n_1 \cdots n_k})$. Our task is therefore to show that the contribution of any other term is $o(|\mathcal {E}_q^C(G;n_1, \ldots, n_{k})|)$. That is, taking $0 \leq i_1 < n_1, \ldots, 0 \leq i_k < n_k$ not all zero, it suffices to show that

\[ \sum_{K \in \mathcal{E}_q^C(G;n_1, \ldots, n_{k})} \prod_{j=1}^k \chi_{\wedge^{i_j}(\operatorname{std}_j)}(\sigma_{D_K(j)}) = o(|\mathcal{E}_q^C(G;n_1, \ldots, n_{k})|),\quad q \to \infty,\quad \gcd(q, |G|) = 1. \]

We consider the representation

\[ W = \wedge^{i_1}(\operatorname{std}_1) \boxtimes \cdots \boxtimes \wedge^{i_k}(\operatorname{std}_k) \]

of $S_{n_1} \times \cdots \times S_{n_k}$ over $\overline {\mathbb {Q}_\ell }$. Since $\wedge ^{i_j}(\operatorname {std}_j)$ is an irreducible finite-dimensional representation of $S_{n_j}$ over $\overline {\mathbb {Q}_\ell }$ for every $1 \leq j \leq k$, it follows that $W$ is an irreducible finite-dimensional representation of $S_{n_1} \times \cdots \times S_{n_k}$. Our assumption that $i_j > 0$ for some $1 \leq j \leq k$ implies that

\[ \dim_{\overline{\mathbb{Q}_\ell}} W = \prod_{j=1}^{k} \dim_{\overline{\mathbb{Q}_\ell}} \wedge^{i_j}(\operatorname{std}_j) > 1 \]

so $W^{S_{n_1} \times \cdots \times S_{n_k}} = 0$ in view of irreducibility. In the notation of the previous subsection, we therefore have $U = W$.

The sum in which we need to obtain cancelation can be rewritten as

\[ \sum_{K \in \mathcal{E}_q^C(G;n_1, \ldots, n_{k})} \prod_{j=1}^k \chi_{\wedge^{i_j}(\operatorname{std}_j)}(\sigma_{D_K(j)}) = \sum_{K \in \mathcal{E}_q^C(G;n_1, \ldots, n_{k})} \chi_W(\sigma_{D_K(1)}, \ldots, \sigma_{D_K(k)}). \]

As in the previous subsection, we have

\[ \sum_{K \in \mathcal{E}_q^C(G;n_1, \ldots, n_{k})} \chi_W(\sigma_{D_K(1)}, \ldots, \sigma_{D_K(k)}) = \sum_{K \in \mathcal{E}_q^C(G;n_1, \ldots, n_{k})} \operatorname{tr}(\operatorname{Frob}_q, (\pi^* W)_{K}\!). \]

By assumption $n_1, \ldots, n_k$ are large enough, so from the previous subsection we see that the sum above is indeed $o(| \mathcal {E}_q^C(G;n_1, \ldots, n_{k})|)$.

Acknowledgements

I am deeply indebted to Be'eri Greenfeld for his support, interest, and input. In particular, I am thankful to him for his guidance on racks, for helpful discussion on the second proof of Corollary 4.37, for informing me of [Reference Burness, Guralnick and HarperBGH21], and for proving extensions of Proposition 4.40 to other cycle structures. I would like to thank the referees for finding typos and errors, for pointing me to [Reference Beaumont and PetersonBP55], and for making helpful suggestions.

Conflicts of Interest

None.

References

Bary-Soroker, L., Entin, A. and Fehm, A., The minimal ramification problem for rational function fields over finite fields, Int. Math. Res. Not. (IMRN) (2023), rnac370, https://doi.org/10.1093/imrn/rnac370.CrossRefGoogle Scholar
Bary-Soroker, L., Gorodetsky, O., Karidi, T. and Sawin, W., Chebotarev density theorem in short intervals for extensions of $\mathbb {F}_q(T)$, Trans. Amer. Math. Soc. 373 (2020), 597628.CrossRefGoogle Scholar
Bary-Soroker, L. and Schlank, T., Sieves and the minimal ramification problem, J. Inst. Math. Jussieu 19 (2020), 919945.10.1017/S1474748018000257CrossRefGoogle Scholar
Beaumont, R. A. and Peterson, R. P., Set-transitive permutation groups, Canad. J. Math. 7 (1955), 3542.CrossRefGoogle Scholar
Bhargava, M. and Ghate, E., On the average number of octahedral newforms of prime level, Math. Ann. 344 (2009), 749768.10.1007/s00208-008-0322-4CrossRefGoogle Scholar
Boston, N. and Ellenberg, J., Random pro-$p$ groups, braid groups, and random tame Galois groups, Groups Geom. Dyn. 5 (2011), 265280.10.4171/ggd/127CrossRefGoogle Scholar
Boston, N. and Markin, N., The fewest primes ramified in a $G$-extension of $\mathbb {Q}$, Ann. Sci. Math. Qué. 33 (2009), 145154.Google Scholar
Burness, T., Guralnick, R. and Harper, S., The spread of a finite group, Ann. of Math. (2) 193 (2021), 619687.CrossRefGoogle Scholar
Chen, W., Nonabelian level structures, Nielsen equivalence, and Markoff triples, Ann. of Math. (2), to appear. Preprint (2020), arXiv:2011.12940.Google Scholar
De Witt, M., Minimal ramification and the inverse Galois problem over the rational function field $\mathbb {F}_p(t)$, J. Number Theory 143 (2014), 6281.CrossRefGoogle Scholar
Ellenberg, J. and Venkatesh, A., Counting extensions of function fields with bounded discriminant and specified Galois group, in Geometric methods in algebra and number theory, Progress in Mathematics, vol. 235 (Birkhäuser, Boston, MA, 2005), 151168.CrossRefGoogle Scholar
Ellenberg, J. S., Tran, T. and Westerland, C., Fox-Neuwirth-Fuks cells, quantum shuffle algebras, and Malle's conjecture for function fields, Preprint (2017).Google Scholar
Ellenberg, J. S., Venkatesh, A. and Westerland, C., Homological stability for Hurwitz spaces and the Cohen-Lenstra conjecture over function fields, Ann. of Math. (2) 183 (2016), 729786.10.4007/annals.2016.183.3.1CrossRefGoogle Scholar
Entin, A., Monodromy of hyperplane sections of curves and decomposition statistics over finite fields, Int. Math. Res. Not. IMRN 2021 (2021), 1040910441.CrossRefGoogle Scholar
Gorodetsky, O., Mean values of arithmetic functions in short intervals and in arithmetic progressions in the large-degree limit, Mathematika 66 (2020), 373394.CrossRefGoogle Scholar
Koymans, P. and Pagano, C., On Malle's conjecture for nilpotent groups, I, Trans. Amer. Math. Soc. Ser. B 10 (2023), 310–354.CrossRefGoogle Scholar
Liu, Y., Wood, M. M. and Zureick-Brown, D., A predicted distribution for Galois groups of maximal unramified extensions, Preprint (2019), arXiv:1907.05002.Google Scholar
Malle, G., On the distribution of Galois groups, II, Exp. Math. 13 (2004), 129135.10.1080/10586458.2004.10504527CrossRefGoogle Scholar
Neukirch, J., Schmidt, A. and Wingberg, K., Cohomology of number fields, vol. 323 (Springer, 2013).Google Scholar
Sawin, W., Square-root cancellation for sums of factorization functions over short intervals in function fields, Duke Math. J. 170 (2021), 9971026.CrossRefGoogle Scholar
Sawin, W. and Shusterman, M., Möbius cancellation on polynomial sequences and the quadratic Bateman–Horn conjecture over function fields, Invent. Math. 229 (2022), 751927.CrossRefGoogle Scholar
Soloviev, A., Non-unitary set-theoretical solutions to the quantum Yang-Baxter equation, Math. Res. Lett. 7 (2000), 577596.CrossRefGoogle Scholar
Taniguchi, T. and Thorne, F., Levels of distribution for sieve problems in prehomogeneous vector spaces, Math. Ann. 376 (2020), 15371559.10.1007/s00208-019-01933-1CrossRefGoogle Scholar
Wood, M. M., On the probabilities of local behaviors in abelian field extensions, Compos. Math. 146 (2010), 102128.CrossRefGoogle Scholar
Wood, M. M., An algebraic lifting invariant of Ellenberg, Venkatesh, and Westerland, Res. Math. Sci. 8 (2021), 21.CrossRefGoogle Scholar