1. Introduction
The conjugacy growth function, $c_{G,S}(n)$ , of a group G with respect to a finite generating set S counts the number of conjugacy classes that intersect the n-ball in the Cayley graph, just as the standard growth function counts the number of elements contained in the n-ball. The study of conjugacy growth was originally motivated by counting geodesics on Riemannian manifolds, for example in work of Margulis [Reference Margulis26], and since then has received significant attention in its own right, for example [Reference Breuillard, Cornulier, Lubotzky and Meiri6, Reference Breuillard and de Cornulier7, Reference Hull and Osin24].
It is conjectured by Guba and Sapir [Reference Guba and Sapir22] that for ‘ordinary’ groups, exponential standard growth implies exponential conjugacy growth. This has been verified in the case of hyperbolic, soluble and linear groups amongst others. On the other hand, Osin [Reference Osin28] has constructed groups of exponential standard growth where every non-identity element is conjugate, and so the conjecture emphatically fails in these cases. Furthermore, Osin and Hull [Reference Hull and Osin24] have shown that conjugacy growth fails to be a commensurability invariant in general. It is worth noting however that these counter-examples are not finitely presented.
In contrast, this paper investigates the polynomial end of the growth spectrum, namely virtually nilpotent groups (as suggested by a question of Guba and Sapir [Reference Guba and Sapir22]). We generalise work of Babenko [Reference Babenko2] to derive estimates for the conjugacy growth of a class of virtually nilpotent groups whose standard growth was studied by Stoll [Reference Stoll33] and which contains the so-called higher Heisenberg groups. Some of this work appears in the author’s PhD thesis [Reference Evetts15].
Section 2 introduces the necessary notions and proves some basic results. Section 3 summarises Stoll’s classification of class 2 nilpotent groups with infinite cyclic derived subgroup, in which it is shown that such groups can be constructed from finitely many copies of the first Heisenberg group (the free nilpotent group of class 2 on two generators). The number of copies is called the Heisenberg rank of the group. We also derive an explicit description of the automorphisms of such groups. In Section 4, we investigate the conjugacy growth of this class of groups. We prove the following theorem, which follows Theorem 4.1 of [Reference Babenko2], but for a more general class of groups, and a more restricted range of metrics. Our proof uses more elementary arguments.
Theorem 4.5. Let G be a finitely generated class 2 nilpotent group with infinite cyclic derived subgroup, with Heisenberg rank r. Then there exists $s\in{\mathbb N}$ such that
In Section 5, we prove the following necessary and sufficient condition for conjugacy growth to be preserved in finite extensions, which depends on understanding the twisted conjugacy growth, $c^{\phi}_H$ , of Definition 5.5.
Proposition 5.6. Let H be any finitely generated group. Then $c^{\phi}_H(n)\preccurlyeq c_H(n)$ for every finite-order automorphism $\phi\in{\textrm{Aut}}(H)$ if and only if every finite extension G of H satisfies $c_G(n)\sim c_H(n)$ .
This allows us to use the description of automorphisms to demonstrate that passing to a finite index supergroup does not alter the conjugacy growth function (up to the usual equivalence).
Theorem 5.3. Suppose that H is a class 2 nilpotent group with infinite cyclic derived subgroup. If a group G contains H as a finite index subgroup, then G and H have equivalent conjugacy growth functions.
This result naturally suggests the question of quasi-isometric invariance.
Question 5.8. In which classes of finitely generated groups is conjugacy growth a quasi-isometry invariant?
As noted above, there are certainly groups which do not have this property. On the other hand, it is not hard to see that conjugacy growth is a quasi-isometry invariant of virtually abelian groups (see Proposition 2.12). We conjecture that this behaviour extends to virtually nilpotent groups in general. More specifically, we make the following conjecture, in the spirit of Theorem 5.3.
Conjecture 5.10. The conjugacy growth function of a finitely generated nilpotent group depends only on the structure of its lower central series.
A closely related measure of conjugacy growth is the formal power series whose coefficients are the values of the conjugacy growth function. Ciobanu, Hermiller, Holt and Rees [Reference Ciobanu, Hermiller, Holt and Rees9] and Antolín and Ciobanu [Reference Antolín and Ciobanu1] confirmed a conjecture of Rivin [Reference Rivin31], that the conjugacy growth series of a hyperbolic group is a rational function if and only if the group is virtually cyclic (or finite). Continuing in a similar vein, the author proved the rationality of the series for all virtually abelian groups [Reference Evetts14], and Gekhtman and Yang proved transcendence for relatively hyperbolic groups and certain acylindrically hyperbolic groups [Reference Gekhtman and Yang19]. All of the above results apply to any choice of finite generating set and lead to the following conjecture.
Conjecture 1.1 (Conjecture 7.2 of [Reference Ciobanu, Evetts and Ho8]). The conjugacy growth series of any finitely presented group that is not virtually abelian is transcendental.
This is further supported by generating set specific calculations for, amongst others, the soluble Baumslag–Solitar groups [Reference Ciobanu, Evetts and Ho8], certain wreath products [Reference Mercier27] and graph products [Reference Ciobanu, Hermiller and Mercier10]. The asymptotic estimates derived in this paper have implications for some conjugacy growth series (see Corollary 5.4), providing further evidence for this conjecture.
2. Preliminaries
We begin with the basic definitions and results. In this paper, ${\mathbb N}$ will denote the non-negative integers and ${\mathbb N}_+$ the positive integers. We occasionally use the notation $f(n)={\mathcal O}\left(g(n)\right)$ to indicate that there exists a constant $C\geq1$ with $f(n)\leq Cg(n)$ for large enough n, and the notation $f(n)=o\left(g(n)\right)$ to indicate that $\lim_{n\to\infty}\frac{f(n)}{g(n)}=0$ .
2.1. Growth and conjugacy growth
Definition 2.1. Let G be a finitely generated group, and S a choice of finite generating set. We write $\{S\cup S^{-1}\}^*$ to denote the language of all words over the alphabet consisting of the elements of S and their inverses (i.e. the free monoid on $S\cup S^{-1}$ ).
(1) The word length of $g\in G$ with respect to S is
\begin{equation*}|g|_S=\min\{|w| \mid w\in \{S\cup S^{-1}\}^*,\;w=_G g\}.\end{equation*}(2) The (cumulative) standard growth function of G with respect to S is
\begin{equation*}\beta_{G,S}(n)=\#\{g\in G \mid |g|_S\leq n\}.\end{equation*}
Definition 2.2. Let G be a finitely generated group and S be a choice of finite generating set. Denote by $\mathcal{C}_G$ the set of conjugacy classes of G.
(1) For $\kappa\in\mathcal{C}_G$ , define the length of $\kappa$ with respect to S as:
\begin{equation*}|\kappa|_S=\min\{|g|_S\mid g\in\kappa\}.\end{equation*}(2) The (cumulative) conjugacy growth function of G with respect to S is
\begin{equation*}c_{G,S}(n)=\#\{\kappa\in\mathcal{C}_G \mid |\kappa|_S\leq n\}\end{equation*}
Geometrically, the standard growth function counts the elements in the ball of radius n in the Cayley graph of G with respect to S, and the conjugacy growth function counts the number of distinct conjugacy classes intersecting the same ball. One could also study the ‘strict’ standard and conjugacy growth functions, by considering the sphere instead of the ball, but for most purposes their qualitative behaviour is the same.
We will use the usual notion of equivalence of growth functions.
Definition 2.3. Let $f,g\colon{\mathbb N}\rightarrow{\mathbb N}$ be two functions. We write $f\preccurlyeq g$ if there exists $\lambda>1$ such that
for all $n\in{\mathbb N}$ . If $f\preccurlyeq g$ and $g\preccurlyeq f$ , then we write $f\sim g$ and say that the functions are equivalent. Note that this defines an equivalence relation. We write $f\prec g$ if $f\preccurlyeq g$ but it is not the case that $f\sim g$ .
It is a classical result that the equivalence class of the standard growth function of a finitely generated group does not depend on the choice of generating set. It is less well known that the same is true of the conjugacy growth function.
Proposition 2.4. Let S and T be two finite generating sets for a group G. Then
(1) $\beta_{G,S}\sim\beta_{G,T}$ , and
(2) $c_{G,S}\sim c_{G,T}$ .
Proof. The proof of the first part is standard, see for example [Reference Mann25]. The second statement is proved similarly.
Another key fact that we will use is that standard growth is a quasi-isometry invariant.
Proposition 2.5. Let G and H be quasi-isometric groups. Then $\beta_G\sim \beta_H$ .
This is not the case for conjugacy growth.
Theorem 7.2 of [Reference Hull and Osin24]. There exists a finitely generated group G and a finite index subgroup $H\leq G$ such that H has two conjugacy classes, while G is of exponential conjugacy growth.
We can study the growth of a subset of G with respect to the same metric. This is referred to as relative growth. Formally, we have the following definition.
Definition 2.6. The relative growth function of a subset U of a group G with respect to a finite generating set S is the following function:
In the following lemma, we see that the relative growth of a finite index subgroup is equivalent to that of its cosets.
Lemma 2.7. Let H be a finite index subgroup of G and consider the coset tH for some $t\in G$ . Then the relative growth of tH is equivalent to the growth of G.
Proof. Since tH is simply a translation of H, H and tH are quasi-isometric. Since G and H are also quasi-isometric, Proposition 2.5 implies that $\beta_G\sim\beta_H\sim\beta_{tH}$ .
We will need the following result of Breuillard and Cornulier.
Lemma 2.8 (Lemma 3.1 (2) of [Reference Breuillard and de Cornulier7]). Let H be a finite index subgroup of G. Then $c_H\preccurlyeq c_G$ .
The next Lemma shows that conjugacy growth behaves well with respect to direct products. Here, and in the rest of the paper, we use the standard notation [g] to denote the conjugacy class of a group element g.
Lemma 2.9. Let G and H be groups generated by finite sets S and T, respectively. Then $c_{G\times H,S\cup T}\sim c_{G,S}\cdot c_{H,T}$ .
Proof. First, note that for $(g,h)\in G\times H$ we have
Now if $|[(g,h)]|_{S\cup T}\leq n$ then there exists $(x,y)\in G\times H$ with $(x,y)(g,h)(x,y)^{-1}=u_1u_2\cdots u_l$ for some $u_i\in S\cup T$ , $l\leq n$ . Rearranging, we find elements $s_1,\ldots,s_{l_1}\in S$ and $t_1,\ldots,t_{l_2}\in T$ , with $l_1+l_2=l$ , so that
Therefore, $|xgx^{-1}|_S\leq l_1\leq n$ and $|yhy^{-1}|_T\leq l_2\leq n$ , and so $|[g]|_S\leq n$ and $|[h]|_T\leq n$ . Thus, $c_{G\times H}(n)\leq c_G(n)\cdot c_H(n)$ .
Conversely, suppose $|[g]|_S\leq n$ and $|[h]|_S\leq n$ . Then, there are elements $\gamma\in G$ and $\delta\in H$ such that $|\gamma g\gamma^{-1}|_S\leq n$ and $|\delta h\delta^{-1}|_T\leq n$ , so $|(\gamma g\gamma^{-1},\delta h\delta^{-1})|_{S\cup T}\leq 2n$ . But $(\gamma g\gamma^{-1},\delta h\delta^{-1})=(\gamma,\delta)(g,h)(\gamma,\delta)^{-1}$ and so $|[(g,h)]|_{S\cup T}\leq 2n$ . Thus, $c_G(n)\cdot c_H(n)\leq c_{G\times H}(2n)$ , giving the required result.
2.2. Nilpotent groups
We recall the definition of a nilpotent group in order to fix some notation. For elements g, h of some group G, denote their commutator $[g,h]=ghg^{-1}h^{-1}$ . For a pair of subgroups U, V of G, let $[U,V]=\left\langle [u,v]\mid u\in U, v\in V\right\rangle$ . For any group G, let $G^{(0)}=G$ and inductively define the i-fold commutator subgroup $G^{(i)}=[G^{(i-1)},G]$ . Recall that a group G is nilpotent of class c if and only if $G^{(c)}=\{1\}$ and $G^{(c-1)}\neq\{1\}$ . In particular, the nilpotent groups of class 1 are precisely the abelian groups. We write $\textrm{Ab}(G)=G/G^{(1)}$ for the abelianisation of G.
Definition 2.10. Let G be a finitely generated group. Then, each quotient $G^{(i)}/G^{(i+1)}$ is a finitely generated abelian group. Denote the torsion-free rank of $G^{i}/G^{i+1}$ by $r_i\in{\mathbb N}$ , so that $G^{(i)}/G^{(i+1)}\cong {\mathbb Z}^{r_i}\times T$ for some finite abelian group T.
Theorem 2.11 (Bass-Guivarc’h [Reference Bass3]). Let G be a finitely generated nilpotent group. Then the standard growth function $\beta_G(n)$ is equivalent to the polynomial $n^d$ where
Gromov [Reference Gromov21] famously proved the converse that any group of polynomial (standard) growth is virtually nilpotent.
The asymptotic behaviour of conjugacy growth in virtually abelian groups is well understood, as we see in the following Proposition.
Proposition 2.12. The (cumulative) standard and conjugacy growth functions of a virtually abelian group G are equivalent.
Proof. Let H be an abelian subgroup of G of finite index. Since H is nilpotent, Theorem 2.11 gives $d\in{\mathbb N}$ with $\beta_H(n)\sim n^d$ , and hence $\beta_G(n)\sim n^d$ by Proposition 2.5. Thus, $c_G(n)\preccurlyeq n^d$ (since conjugacy growth is clearly bounded above by standard growth). On the other hand, Lemma 2.8 gives $c_G(n)\succcurlyeq c_H(n)\sim \beta_H(n)\sim n^d$ (since a conjugacy class in an abelian group is simply an element). Therefore, $c_G(n)\sim n^d\sim\beta_G(n)$ .
As an immediate corollary, we see that conjugacy growth is a quasi-isometry invariant within the class of virtually abelian groups.
2.3. Generating functions
We will also be interested in the formal power series associated with the standard and conjugacy growth functions. We write ${\mathbb Q}[[z]]$ for the ring of formal power series over a variable z with coefficients in ${\mathbb Q}$ , and ${\mathbb Q}[z]$ for the ring of polynomials over z with rational coefficients.
Definition 2.13. Let G be a group with a finite generating set S. Then the standard growth series is
Similarly, the conjugacy growth series is
Here, z is a complex variable.
When referring to growth functions and series, we will often suppress the subscripts when the groups and/or generating sets are clear from context.
Definition 2.14. A series $\Gamma(z)\in{\mathbb Q}[[z]]$ is said to be
(1) rational if it is an element of the field of fractions of ${\mathbb Q}[z]$ , denoted ${\mathbb Q}(z)$ – in other words, there are polynomials $p,q\in{\mathbb Q}[z]$ such that $\Gamma=\frac{p}{q}$ ;
(2) algebraic if it is algebraic over ${\mathbb Q}[z]$ – in other words, it is the root of some polynomial expression with coefficient from the ring of polynomials ${\mathbb Q}[z]$ ;
(3) transcendental if it is not algebraic;
(4) holonomic if it is the solution to a linear finite-order differential equation with coefficients from the ring of polynomials ${\mathbb Q}[z]$ ; therefore, the non-holonomic series form a proper subset of the transcendental series.
We will refer to this classification as the algebraic complexity of $\Gamma(z)$ .
Generating functions are a well-studied topic. Duchin has written a very readable introduction [Reference Duchin12] to generating functions and growth in groups. For a more rigorous treatment, we use [Reference Graham, Knuth and Patashnik20] or [Reference Stanley32]. There are some slightly subtle connections between the algebraic complexity of a power series and the asymptotics of the coefficients. For example, we will use the following result of Stoll.
Proposition 2.15 (Proposition 3.3 of [Reference Stoll33]). Let $\Gamma(z)=\sum_{n\geq0}\gamma(n)z^n\in{\mathbb Q}[[z]]$ , and suppose that $\lim_{n\to\infty}\frac{\gamma(n)}{n^d}=a$ . Then if a is an irrational (resp. transcendental) number then $\Gamma(z)$ is irrational (resp. transcendental) as a series.
The next result relates to generating functions whose coefficients are in the polynomial range.
Theorem 2.16. Let $\gamma\colon{\mathbb N}\to{\mathbb N}$ be strictly between polynomials, that is $n^d\prec\gamma(n)\prec n^{d+1}$ for some $d\in{\mathbb N}$ . Then the series $\sum_{n\geq0}\gamma(n)z^n$ is not holonomic.
To prove this, we will need another definition.
Definition 2.17. A function $f\;:\;{\mathbb N}\to{{\mathbb C}}$ is called eventually quasi-polynomial if there exist some positive integer period N, threshold $T\geq0$ , and polynomials $f_0,f_1,\ldots,f_{N-1}$ so that for all $n\geq T$ , $f(n)=f_i(n)$ whenever $n\equiv i\!\!\mod N$ .
The following is an immediate consequence of Proposition 4.4.1 of [Reference Stanley32].
Proposition 2.18. Let $\gamma\colon{\mathbb N}\to{\mathbb N}$ be in the polynomial range, that is, $\gamma(n)\leq Cn^d$ for some $C>1$ , $d\in{\mathbb N}$ . Then, $\sum_{n\geq0} \gamma(n)z^n$ is rational if and only if $\gamma(n)$ is eventually quasi-polynomial.
Corollary 2.19. Suppose $\gamma\;:\;{\mathbb N}\to{\mathbb N}$ is non-decreasing and in the polynomial range as above. If $\sum_{n\geq0} \gamma(n)z^n$ is rational then $\gamma\sim n^d$ for some $d\in{\mathbb N}$ .
Proof. Proposition 2.18 implies that $\gamma$ is eventually quasi-polynomial, say with polynomials $\gamma_0,\ldots,\gamma_N$ as in the definition. The degree of each $\gamma_i$ is at least the degree of $\gamma_{i-1}$ (since large enough n would otherwise violate the non-decreasing assumption). But since these polynomials cycle, the degree of $\gamma_0$ must be at least the degree of $\gamma_N$ . So they all have the same degree, say $d\in{\mathbb N}$ . Thus, $\gamma$ cycles between finitely many polynomials, all equivalent to $n^d$ , and so $\gamma(n)\sim n^d$ .
We also need the following two results (see [Reference Flajolet, Gerhold and Salvy18]).
Lemma 2.20 (Pólya-Carlson). If $\Gamma(z)$ is a power series with integer coefficients that converges on the open unit disc, then $\Gamma(z)$ is either rational or admits the unit circle as a natural boundary.
Lemma 2.21. Holonomic functions necessarily have only finitely many singularities.
Proof of Theorem 2.16. By the Cauchy–Hadamard theorem, the series $\sum\gamma(n)z^n$ converges inside the unit disc, and so Lemma 2.20 applies. Thus, if the series were holonomic, and hence had only finitely many singularities, it would be rational. But by Corollary 2.19, this contradicts the hypothesis. Thus, the series cannot be holonomic.
We finish our discussion of generating functions by noting the following results from the literature.
Theorem 2.22 ([Reference Benson4]). Let G be a finitely generated virtually abelian group. Then the standard growth series of G is rational, with respect to any choice of finite generating set.
Theorem 2.23 ([Reference Evetts14]). Let G be a finitely generated virtually abelian group. Then the conjugacy growth series of G is rational, with respect to any choice of finite generating set.
2.4. GCD sums
To count conjugacy classes, we will need various facts about greatest common divisors of tuples of integers, starting with the following lemma of Fernández and Fernández.
Lemma 2.24 (Section 3 of [Reference Fernández and Fernández17]). For $n\geq1$ , let $X^{(n)}_1, X^{(n)}_2,\ldots$ be a sequence of independent random variables, uniformly distributed in $\{1,2,\ldots,n\}$ . Then the expected value of the greatest common divisor of the first s of these random variables behaves as follows:
where $C\geq0$ is a constant.
For our purposes, we will phrase this in terms of the sum of the greatest common divisors of tuples of integers whose absolute values are at most n.
Definition 2.25. We define two different n-balls in the free abelian group ${\mathbb Z}^s$ .
(1) Let $B^{(s)}_\square(n)=\{(x_1,\ldots,x_s)\in{\mathbb Z}^s\mid |x_i|\leq n\text{ for each }1\leq i\leq s\}$ . That is, the n-ball in ${\mathbb Z}^s$ with respect to the ‘cubical’ generating set:
\begin{equation*}\{(\varepsilon_1,\ldots,\varepsilon_s)\mid \varepsilon_i\in\{0,1,-1\}\}.\end{equation*}(2) Let $B^{(s)}_{\ell_1}(n)=\{(x_1,\ldots,x_s)\in{\mathbb Z}^s\mid \sum |x_i|\leq n\}$ . That is, the n-ball in ${\mathbb Z}^s$ with respect to the generating set consisting of standard basis vectors.
We will omit the superscript s when it is clear which dimension we are working with.
Then, Lemma 2.24 can be reinterpreted as follows.
Corollary 2.26. Let $\textbf{x}=(x_1,\ldots,x_s)\in{\mathbb Z}^s$ . Then,
where $R_2\in{\mathbb Q}$ , and
where $R_s\in{\mathbb Q}$ depends on the dimension s.
Proof. The sum of the values of a function over some fixed finite domain is equal to the expected value of the function over the domain, multiplied by the cardinality of the domain. The standard growth function of ${\mathbb Z}^s$ is equivalent to $Dn^s$ (by Theorem 2.11), where $D\in{\mathbb R}$ depends on the choice of generating set, but is always rational since otherwise Proposition 2.15 would imply that the standard growth series was irrational, contradicting Theorem 2.22.
We will also need the following generalisation of Corollary 2.26, showing that offsetting $\textbf{x}$ by a constant does not affect the asymptotics of the GCD sum.
Corollary 2.27. Fix an element $\textbf{a}=(a_1,\ldots,a_s)\in{\mathbb Z}^s$ . Then,
and
where $R_2, R_s\in{\mathbb Q}$ are the same as in Corollary 2.26.
Proof. Let $a_{\max}=\max\left(|a_1|,\ldots,|a_s|\right)$ . Then, we have
for all n and thus
Applying Corollary 2.26 gives the result, since adding a constant to n does not alter the asymptotic expressions on the right-hand sides.
3. A family of class 2 nilpotent groups
In this section, we discuss the nilpotent groups of class 2 whose derived subgroup is infinite cyclic. This includes the following family.
Definition 3.1. The (higher) Heisenberg groups are class 2 nilpotent groups, with a parameter $r\in{\mathbb N}_+$ , given by the following presentation:
The commutator subgroup $H_r^{(1)}$ is infinite cyclic, generated by the commutator $c=[a_i,b_i]$ . These groups play an important role in the story of standard growth, as they provide essentially the only known examples of growth series behaviour which depends on the choice of generating set, as we will see in Stoll’s result, Theorem 3.9. Furthermore, Duchin and Shapiro have shown [Reference Duchin and Shapiro13] that the first Heisenberg group (also known as the integer or discrete Heisenberg group) $H_1$ has rational standard growth series with respect to any choice of finite generating set.
3.1. Stoll’s classification
In [Reference Stoll33], Stoll classifies all finitely generated class 2 nilpotent groups with infinite cyclic derived subgroup in terms of the first Heisenberg group $H_1$ . We summarise this classification below.
Let $G_1$ and $G_2$ be groups with central subgroups $Z_1$ and $Z_2$ , respectively. Suppose that there exists an abelian group Z and homomorphisms $\varphi_1\colon Z_1\to Z$ and $\varphi_2\colon Z_2\to Z$ , and consider the product homomorphism $\varphi\colon Z_1\times Z_2\rightarrow Z$ defined by:
Furthermore, suppose that $\varphi$ is surjective.
Definition 3.2. The group
is called the centrally amalgamated direct product of $G_1$ and $G_2$ with respect to $\varphi$ . We will write central product for brevity.
Example 3.3. Let $G_1\cong G_2\cong{\mathbb Z}^2$ be given by the presentations:
and let $Z=\langle z\mid z^2\rangle$ . Define homomorphisms from the second direct factor of each $G_i$ to Z as $\varphi_i\colon \langle y_i\rangle\to Z$ given by $\varphi\colon y_i\mapsto z$ . Then the centrally amalgamated direct product, G, of $G_1$ and $G_2$ with respect to $\varphi$ is given by the presentation:
From now on, we will deal exclusively with the case where each $Z_i$ is infinite cyclic, generated by an element $z_i$ , and similarly Z is infinite cyclic, generated by an element z. Hence, each $\varphi_i$ is determined by an integer $d_i$ as follows $\varphi_i\colon z_i\mapsto z^{d_i}$ . Given pairs $(G_1,z_1)$ and $(G_2,z_2)$ such that each $\langle z_i\rangle$ is an infinite cyclic central subgroup of $G_i$ , we will write $(G_1,z_1)\otimes_d(G_2,z_2)$ for the central product of $G_1$ and $G_2$ , amalgamated over the subgroups $\langle z_1\rangle$ and $\langle z_2\rangle$ with $\varphi_1(z_1)=z$ and $\varphi_1(z_2)=z^d$ . If $d=1$ , we simply write $(G_1,z_1)\otimes(G_2,z_2)$ .
Lemma 3.4 (Lemma 7.1 of [Reference Stoll33]). Let G be a finitely generated 2-step nilpotent group with $G^{(1)}\cong{\mathbb Z}$ . Then there exists a finitely generated infinite abelian group $G_0$ , and a tuple $D=(\delta_1,\ldots,\delta_{r-1})\in({\mathbb N}_+)^{r-1}$ , with $\delta_i|\delta_{i+1}$ for each i, such that
where each $H_1=\left\langle a_i,b_i\mid [[a_i,b_i],a_i], [[a_i,b_i],b_i]\right\rangle$ is a copy of the first Heisenberg group, $c_i$ denotes the commutator $[a_i,b_i]$ , and z generates an infinite cyclic subgroup of $G_0$ .
Remark 3.5. Note that due to the amalgamation in G, $c_1=z$ and for each i, we have $c_i=c_1^{\delta_{i-1}}$ .
Definition 3.6. Let $D=(\delta_1,\ldots,\delta_{r-1})\in({\mathbb N}_+)^{r-1}$ , with $\delta_i|\delta_{i+1}$ for each i. Then we will write
Note that if $I=(1,1,\ldots,1)\in({\mathbb N}_+)^{r-1}$ , we can express the rth higher Heisenberg group as $H_r=H_I$ . Since the abelian subgroup $\Gamma\;:\!=\;G_0/\langle z\rangle$ is central, we have the following immediate corollary of Lemma 3.4.
Corollary 3.7. Let G be a finitely generated 2-step nilpotent group with $[G,G]\cong{\mathbb Z}$ . Then there exists a finitely generated abelian group $\Gamma$ (possibly finite or trivial) and a tuple $D=(\delta_1,\ldots,\delta_{r-1})\in({\mathbb N}_+)^{r-1}$ , with $\delta_i|\delta_{i+1}$ for each i, such that
Definition 3.8. As in [Reference Stoll33], the Heisenberg rank of $G\cong\Gamma\times H_D$ will refer to the number, r, of copies of $H_1$ appearing in the construction. Furthermore, we will write s for the torsion-free rank of the finitely generated abelian factor $\Gamma$ .
To emphasise the importance of this class of groups, we record the following, which is the main result of [Reference Stoll33].
Theorem 3.9. If G is a class 2 nilpotent group with infinite cyclic derived subgroup and Heisenberg rank at least 2, then it possesses a generating set yielding rational standard growth series, and a generating set yielding transcendental standard growth series.
We fix some notation that we will use throughout the paper.
Definition 3.10. Suppose that $\Gamma\times H_D$ is torsion-free and choose a basis $\{z_1,\ldots,z_s\}$ for $\Gamma$ . Then the elements of $\Gamma\times H_D$ are in bijection with words of the form $z_1^{l_1}\cdots z_s^{l_s}a_1^{i_1}b_1^{j_1}\cdots a_r^{i_r}b_r^{j_r}c^k$ where $l_1,\ldots,l_s,i_1,j_1,\ldots,i_r,j_r,k\in{\mathbb Z}$ . This is known as the Mal’cev normal form, or Mal’cev coordinates, and a variation of it exists for all finitely generated nilpotent groups (see [Reference Clement, Majewicz and Zyman11], or the basic commutators of [Reference Hall23]). We will frequently represent elements of $\Gamma\times H_D$ as $\alpha c^k$ where $\alpha$ is an element of the form $z_1^{l_1}\cdots z_s^{l_s}a_1^{i_1}b_1^{j_1}\cdots a_r^{i_r}b_r^{j_r}$ and $c=c_1=[a_1,b_1]$ .
For the remainder of the paper, we use the generating set $\{z_1,\ldots,z_s,a_1,\ldots,b_r\}$ for $\Gamma\times H_D$ and lengths of elements will be taken with respect to this generating set.
Definition 3.11. For $g\in\Gamma\times H_D$ , we write $\bar{g}$ for the image of g in the abelianisation ${\textrm{Ab}}(\Gamma\times H_D)$ . With this notation, we have
The preimage of $x\in{\textrm{Ab}}(\Gamma\times H_D)$ under the abelianisation map is a coset of the commutator subgroup $\langle c\rangle$ . We define the canonical lift of x to be the unique element of the preimage whose c-coordinate is zero (when expressed in Mal’cev normal form) and denote it ${x}^{\wedge}$ . Therefore, if $x=\bar{z}_1^{l_1}\cdots\bar{z}_s^{l_s}\bar{a}_1^{i_1}\bar{b}_1^{j_1}\cdots\bar{a}_r^{i_r}\bar{b}_r^{j_r}$ then ${x}^{\wedge}=z_1^{l_1}\cdots z_s^{l_s}a_1^{i_1}b_1^{j_1}\cdots a_r^{i_r}b_r^{j_r}\in\Gamma\times H_D$ .
We will also write elements of ${\textrm{Ab}}(\Gamma\times H_D)$ as vectors in ${\mathbb Z}^{2r+s}$ with respect to the basis $\{\bar{z}_1,\ldots,\bar{z}_s,\bar{a}_1,\bar{b}_1,\ldots,\bar{a}_r,\bar{b}_r\}$ so that $z_1^{l_1}\cdots z_r^{l_r}a_1^{i_1}\cdots b_r^{j_r}=(l_1,\ldots,l_s,i_1,\ldots,j_r)$ .
3.2. Automorphisms of class 2 nilpotent groups
We will need to understand the automorphisms of our family of nilpotent groups. First, note the following easy lemma.
Lemma 3.12. Let G be a group with a characteristic subgroup H. Then the natural homomorphism $G\rightarrow G/H$ induces a homomorphism $\theta\colon{\textrm{Aut}}(G)\rightarrow{\textrm{Aut}}(G/H)$ .
Definition 3.13. Let $\Omega_r$ denote the $2r\times 2r$ block matrix consisting of blocks of the form $\begin{pmatrix} 0\;\;\;\; & 1\\[5pt] -1\;\;\;\; & 0 \end{pmatrix}$ on the diagonal with zeroes elsewhere. Let $\Omega_{r,s}$ be the $(2r+s)\times(2r+s)$ matrix with $\Omega_r$ in the bottom right corner and zeroes elsewhere.
For example,
Note that the skew-symmetric bilinear form $\Omega_{r,s}$ is precisely that given by taking commutators of pairs of elements of $\textrm{Ab}\left(\Gamma\times H_r\right)$ with respect to the basis $\{\bar{z}_1,\ldots,\bar{z}_s,\bar{a}_1,\bar{b}_1,\ldots,\bar{a}_r,\bar{b}_r\}$ , that is, $[\cdot,\cdot]\colon {\mathbb Z}^{2r+s}\times{\mathbb Z}^{2r+s}\to{\mathbb Z}$ . Let $\alpha c^{k_1}, \beta c^{k_2}\in\Gamma\times H_r$ . Then, $\overline{\alpha c^{k_1}}=\bar{\alpha}$ and $\overline{\beta c^{k_2}}=\bar{\beta}$ are elements of ${\mathbb Z}^{2r+s}$ and we have
Definition 3.14. Let $\mathbf{M}<{\textrm{GL}}_{2r+s}{\mathbb Z}$ be the group of matrices that either preserve or reverse the bilinear form given by $\Omega_{r,s}$ (i.e. the group of matrices M such that $M\Omega_{r,s}M^T=\varepsilon\Omega_{r,s}$ for $\varepsilon\in\{1,-1\}$ ). Note that if $s=0$ , then $\mathbf{M}\cong\textrm{Sp}(2r,{\mathbb Z})\rtimes{\mathbb Z}/2{\mathbb Z}$ .
Definition 3.15. Suppose that $N=\Gamma\times H_D$ is torsion-free and let $M\in\mathbf{M}$ . Define a map $\phi_M\colon N\to N$ as follows. For each $1\leq i\leq s$ , set $\phi_M(z_i)={\left(\bar{z}_iM\right)}^{\wedge}$ , that is, the canonical lift of the image of $\overline{z}_i$ under M. Similarly, set $\phi_M(a_i)={\left(\bar{a}_iM\right)}^{\wedge}$ and $\phi_M(b_i)={\left(\bar{b}_iM\right)}^{\wedge}$ for each $1\leq i\leq r$ . Then extend this map to an endomorphism of N in the usual way by setting $\phi_M(x_1\cdots x_n) = \phi_M(x_1)\cdots\phi_M(x_n)$ for any word $x_1\cdots x_n$ in the generators.
Lemma 3.16. The map $\phi_M$ is a well-defined automorphism of N.
Proof. First note that for any $x\in {\mathbb Z}^{2r+s}$ , we have $\overline{{x}^{\wedge}}=x$ . It is easily checked that the relators of N are mapped to the identity and therefore $\phi_M$ defines an endomorphism of N. For example, for the relator $[z_i,a_j]$ , we have
Furthermore, for any M, $\phi_{M^{-1}}$ is the inverse of $\phi_M$ , which is therefore an automorphism. It is sufficient to check this on the generators, for example,
Proposition 3.17. Suppose that $N=\Gamma\times H_D$ is torsion-free. Then the natural homomorphism $N\to N/N^{(1)}$ induces (via Lemma 3.12) an epimorphism $\theta\colon{\textrm{Aut}}(N)\to\mathbf{M}$ .
Proof. We first show that the image of the induced homomorphism $\theta$ is contained in $\mathbf{M}$ . Let $f\in{\textrm{Aut}}(N)$ and write $\theta(f)=M\in{\textrm{GL}}_{2r+s}({\mathbb Z})$ . Since the commutator subgroup $N^{(1)}$ is characteristic, f restricts to an automorphism of $N^{(1)}=\langle c\rangle\cong{\mathbb Z}$ , so we have $f\colon c\mapsto c^\varepsilon$ where $\varepsilon\in\{-1,1\}$ .
Then, for any $\alpha c^{k_1},\beta c^{k_2}\in N$ , we have
and hence $\left[f(\alpha),f(\beta)\right] = [\alpha,\beta]^{\varepsilon}$ . Therefore,
for all $\bar{\alpha},\bar{\beta}\in {\textrm{Ab}}(N)$ and hence $M\Omega_{2r+s}M^T = \varepsilon\Omega_{2r+s}$ , that is, $M\in\mathbf{M}$ as claimed.
To see that $\mathbf{M}$ is contained in the image of ${\textrm{Aut}}(N)$ , we note that for each $M\in\mathbf{M}$ , $\theta(\phi_M)=M$ .
Remark 3.18. Although we do not need it for the arguments that follow, we note that in the special case of $N=H_r$ , we can use a straightforward generalisation of an argument of Osipov [Reference Osipov29] to extend Proposition 3.17 to give the following short exact sequence, where ${\textrm{Inn}}(H_r)={\mathbb Z}^{2r}$ :
In Section 5, we will need a more explicit description of the automorphisms of N.
Proposition 3.19. Suppose $N=\Gamma\times H_D$ is torsion-free and fix $f\in{\textrm{Aut}}(N)$ . Write $\theta(f)=M\in\mathbf{M}$ for the image of f as above. Then there exists $\varepsilon\in\{1,-1\}$ and a polynomial function $\gamma\colon{\mathbb R}^{2r+s}\to{\mathbb R}$ of degree 2, with $\gamma(0,\ldots,0)=0$ , restricting to a function ${\mathbb Z}^{2r+s}\to{\mathbb Z}$ , such that for each element $z_1^{l_1}\cdots z_s^{l_s}a_1^{i_1}b_1^{j_1}\cdots a_r^{i_r}b_r^{j_r}c^k\in N$ we have
with $\varepsilon$ and $\gamma$ depending only on f.
Proof. Proposition 3.17 gives us a short exact sequence $1\to K\to {\textrm{Aut}}(N) \xrightarrow{\theta} \mathbf{M}\to1$ , for some normal subgroup $K\lhd {\textrm{Aut}}(N)$ (which contains but may not be equal to ${\textrm{Inn}}(N)$ ). The automorphisms $ \{\phi_M\in{\textrm{Aut}}(N) \mid M\in\mathbf{M}\}$ defined above form a transversal for ${\textrm{Aut}}(N)/K$ , and hence $f=\kappa\circ \phi_M$ for some $\kappa\in K$ and $M\in\mathbf{M}$ .
Since $\kappa\in K$ , $\kappa$ maps to the identity in $\mathbf{M}$ . So there exist integers $k_i$ such that
Therefore, applying $\kappa$ to some $z_1^{l_1}\cdots z_s^{l_s}a_1^{i_1}b_1^{j_1}\cdots a_r^{i_r}b_r^{j_r}c^k\in N$ fixes the powers of the generators $z_1,\ldots,b_r$ and adds ${p(l_1,\ldots,j_r)}$ to the power of c, where p is a linear function determined by the integers $k_1,\ldots,k_{s+2r}$ .
On the other hand, $\phi_M\in{\textrm{Aut}}(N)$ takes each generator to a linear combination of the generators $z_1,\ldots,b_r$ , determined by the matrix M. We also have $\phi_M(c)=c^{\varepsilon}$ where $\epsilon\in\{1,-1\}$ . Rearranging into the normal form then results in an adjustment to the power of c consisting of a sum of terms of the form $i_\lambda j_\lambda$ with coefficients determined by M. Thus, the image of $z_1^{l_1}\cdots z_s^{l_s}a_1^{i_1}b_1^{j_1}\cdots a_r^{i_r}b_r^{j_r}c^k$ under $\phi_M$ may be expressed as:
where q is a degree 2 polynomial.
Letting $x=z_1^{l_1}\cdots z_s^{l_s}a_1^{i_1}b_1^{j_1}\cdots a_r^{i_r}b_r^{j_r}c^k$ , we have
where $\gamma(l_1,\ldots,j_r)=q(l_1,\ldots,j_r)+p\left( (l_1,\ldots,j_r)M\right)$ is a degree 2 polynomial with coefficients determined by f.
4. Conjugacy growth of higher Heisenberg groups
This section is dedicated to proving Theorem 4.5. First, we show that the parameter D does not affect the conjugacy growth of the group $H_D$ .
Proposition 4.1. Let $D=(\delta_1,\delta_2,\ldots,\delta_{r-1})$ with each $\delta_i\mid\delta_{i+1}$ . Then the conjugacy growth of $H_D$ is equivalent to that of $H_r$ , that is, $c_{H_D}\sim c_{H_r}$ .
Proof. We will show that there exist groups $\Gamma_1$ and $\Gamma_2$ , both isomorphic to $H_r$ , such that $\Gamma_1\leq H_D\leq\Gamma_2$ , with $[H_D\colon\Gamma_1]$ and $[\Gamma_2\colon H_D]$ both finite. Then, Lemma 2.8 will give the result.
From the definition, $H_D$ is generated by elements $a_1,b_2,\ldots,a_r,b_r$ with $[a_i,b_i]=c_i=c^{\delta_{i-1}}$ for each i (and $[a_1,b_1]=c$ ). Let $\gamma_i=\frac{\delta_{r-1}}{\delta_{i-1}}$ for $i>1$ , and $\gamma_1=\delta_{r-1}$ , and define the subgroup:
For any i, we have $[a_i^{\gamma_i},b_i]=c_i^{\gamma_i}=c^{\delta_{i-1}\cdot\gamma_i}=c^{\delta_{r-1}}$ , and so $\Gamma_1$ is isomorphic to $H_r$ . For an element $a_1^{i_1}b_1^{j_1}a_2^{i_2}b_2^{j_2}\cdots a_r^{i_r}b_r^{j_r}c^k\in H_D$ , we have
where each $m_\lambda=i_\lambda\bmod\gamma_\lambda$ , and $n=k\bmod\delta_{r-1}$ . Therefore, the following is a set of representatives for the cosets $H_D/\Gamma_1$ :
and so $[H_D\;\colon\Gamma_1]=\gamma_1\gamma_2\cdots\gamma_r\delta_{r-1}<\infty$ as required.
Now let $\Gamma_2\cong H_r$ be generated by $\{d_1,e_1,d_2,e_2,\ldots,d_r,e_r\}$ and denote the commutator by $f=[d_i,e_i]$ . Define a map $\phi\colon H_D\to\Gamma_2$ by its action on the generators:
It is easily checked that the relators of $H_D$ are sent to the identity by $\phi$ , and thus it is a well-defined homo- morphism. Furthermore, if $\phi(a_1^{I_1}b_1^{J_1}a_2^{I_2}b_2^{J_2}\cdots a_r^{I_r}b_r^{J_r}c^K)=1$ , then $d_1^{I_1}e_1^{J_1}d_2^{\delta_1I_2}e_2^{\delta_1J_2}\cdots d_r^{\delta_{r-1}I_r}e_r^{\delta_{r-1}J_r}f^K=1$ and so $I_1=J_1=\cdots=J_r=K=0$ . Thus, $\phi$ is a monomorphism. Similarly to the previous argument, a set of representatives for the cosets $\Gamma_2/\phi(H_D)$ is
and so $[\Gamma_2\;\colon\phi(H_D)]=\delta_1\delta_2\cdots\delta_{r-1}$ .
Definition 4.2. If $\textbf{x}=(x_1,x_2,\ldots,x_n)$ is any tuple of integers, we will write
for the greatest common divisor of the entries of $\textbf{x}$ .
The next two results describe the structure of the conjugacy classes of $H_r$ .
Lemma 4.3. Let $\alpha c^k$ be an element of $H_r$ in Mal’cev normal form so that $\alpha=a_1^{i_1}b_1^{j_1}a_2^{i_2}b_2^{j_2}\cdots a_r^{i_r}b_r^{j_r}$ and $\overline{\alpha c^k}=\bar{\alpha}\in{\mathbb Z}^{2r}$ . Then the conjugacy class represented by $\alpha c^k\in H_r$ is either a singleton or a coset of a cyclic subgroup:
Proof. First note that $c^k$ is central, so its conjugacy class is $[c^k]=\{c^k\}$ . Now consider non-identity $\alpha$ . Since $[a_t,b_t]=c$ for each $1\leq t\leq r$ , we have
So we can express the conjugacy class of $\alpha c^k$ as follows:
Lemma 4.4. Let $\alpha=a_1^{i_1}b_1^{j_1}\cdots a_r^{i_r}b_r^{j_r}\in H_r$ be a non-trivial element in Mal’cev normal form. Then the length of the conjugacy class $[\alpha c^k]$ lies in the range $[|\alpha|,|\alpha|+2]$ , where $|\alpha|=\sum_\lambda(|i_\lambda|+|j_\lambda|)$ is the word length of $\alpha$ .
Proof. We claim that any element $\alpha c^k$ has length at least $|\alpha|$ , so $|\alpha|\leq|[\alpha c^k]|$ . To see this, consider any word over the generators $a_1,b_1,\ldots,a_r,b_r$ that represents $\alpha c^k$ . We can put this into normal form by collecting the powers of generators into the given order, at the cost of powers of c, using the identity $[a_t,b_t]=c$ for each t. Note that the exponent sum of each generator can never increase. So any word representing $\alpha c^k$ has at least $|i_1|$ instances of $a_1^{\pm1}$ , and so on.
From the structure of conjugacy classes given in Lemma 4.3, each conjugacy class $[\alpha c^k]$ has a representative of the form $\alpha c^{-l}$ where $0\leq l< g(\bar{\alpha})$ and a representative of the form $\alpha c^{m}$ where $0<m\leq g(\bar{\alpha})$ .
Assume that each power $i_t, j_t$ is non-negative. Let $I=\{m\in{\mathbb N}\mid i_m\neq0\}$ and $J=\{n\in{\mathbb N}\mid j_n\neq0\}$ . Then, by definition, $g(\bar{\alpha})\leq\min\{i_m,j_n\mid m\in I, n\in J\}$ .
Suppose that there is some $t\in I\cap J$ , that is, both $i_t$ and $j_t$ are non-zero. We have
and so
and thus we can represent the element $\alpha c^{-l}$ with a word of length $|\alpha|$ .
Now suppose that $I\cap J$ is empty. Since $\alpha\neq 1$ , there exists $t\in{\mathbb N}$ with either $i_t\neq0$ or $j_t\neq 0$ . If $i_t\neq 0$ , we have
and if $j_t\neq0$ we have
Now, similarly to equation (4.2), we can represent the element $\alpha c^{-l}$ with a word of length $|\alpha|+2$ , since in equation (4.3) we have inserted an extra $b_t$ and $b_t^{-1}$ , and in equation (4.4) we have inserted an extra $a_t$ and $a_t^{-1}$ . Therefore, in general, we have the bound $|\alpha c^{-l}|\leq |\alpha|+2$ . This in turn implies that $|\alpha|\leq|[\alpha c^k]|=|[\alpha c^{-l}]|\leq|\alpha|+2$ .
If some powers $i_t, j_t$ are negative, we can find words analogous to equation (4.2) that represent either $\alpha c^{-l}$ or $\alpha c^m$ , depending on the combination of signs. Thus, the result holds for all values of $\alpha$ .
Theorem 4.5. Let $G=\Gamma\times H_D$ be a finitely generated class 2 nilpotent group with infinite cyclic derived subgroup, with Heisenberg rank r. Let s be the torsion free rank of $\Gamma$ . Then,
Proof. By Lemma 2.9 and Theorem 2.11, we have $c_G(n)\sim n^s\cdot c_{H_D}(n)\sim n^s\cdot c_{H_r}(n)$ and so it suffices to show that the conjugacy growth of $H_r$ is equivalent to $n^2\log n$ in the case $r=1$ and $n^{2r}$ otherwise. This follows from [Reference Babenko2] but we provide a new argument here using more elementary methods.
For a fixed non-identity $\alpha=a_1^{i_1}b_1^{j_1}\cdots a_r^{i_r}b_r^{j_r}\in H_r$ , Lemma 4.3 implies that there are exactly $g(\bar{\alpha})$ conjugacy classes in the coset $\alpha\langle c\rangle$ , and Lemma 4.4 implies that they have length in the range $[|\alpha|,|\alpha|+2]$ . The conjugacy classes of elements where $\alpha$ is the identity are simply the elements of $\langle c\rangle$ . Thus, the conjugacy growth function satisfies the bounds:
where $B_{\ell_1}(n)$ denotes the n-ball in ${\mathbb Z}^{2r}\cong{\textrm{Ab}}(H_r)$ with respect to the $\ell_1$ norm. It is standard (and not hard to see) that $\beta_{\langle c\rangle}(n)\sim n^2$ for any $H_r$ , noting for example that $[a_i^n,b_i^n]$ is a geodesic spelling of $c^{n^2}$ , with length 4n. Thus, from Corollary 2.26 we have
which finishes the proof.
Remark 4.6. Comparing Theorem 4.5 to Corollary 4.2 of [Reference Babenko2], we see that Babenko proves a stronger result, covering a more general class of metrics and providing the leading coefficient of the growth function, but in a more restricted class of groups. In the case of $r\geq 2$ , we have
where $R_r$ is a rational number depending on the metric, and $\zeta$ is the Riemann zeta function. Although it would contradict Conjecture 1.1, if the conjugacy growth series of $H_r$ turns out to be rational (respectively algebraic), then Proposition 2.15 would imply that $\frac{\zeta(2r-1)}{\zeta(2r)}$ is a rational (respectively algebraic) number. As far as the author is aware, it is not known whether such a fraction is algebraic, although it seems unlikely.
5. Conjugacy growth of virtually higher Heisenberg groups
In this section, we show that if G is commensurable to a higher Heisenberg group then they have equivalent conjugacy growth functions. We will need the following lemma.
Lemma 5.1. If U is a subgroup of $H_1$ , then the Heisenberg rank of U is at most 1.
Proof. First, we claim that a pair of elements of $H_1$ commute if and only if their images in ${\textrm{Ab}}(H_1)$ are colinear as vectors in ${\mathbb Z}^2$ . To see this, let $g=a^ib^jc^k$ and $h=a^Ib^Jc^K$ be a elements of $H_1$ . We have $gh=a^{i+I}b^{j+J}c^{k+K-jI}$ and $hg=a^{I+i}b^{J+j}c^{K+k-Ji}$ and so they commute if and only if $jI=Ji$ . This is equivalent to the vectors $\bar{g}=(i,j)$ and $\bar{h}=(I,J)$ being colinear (including the possibility that one or both is the zero vector).
To prove the Lemma, suppose, on the contrary, that U has Heisenberg rank at least 2. This implies that there exist four elements $x_1,y_1,x_2,y_2\in U\leq H_1$ such that $[x_1,y_1]\neq 1$ , $[x_2,y_2]\neq1$ , and
Now the claim above along with (5.1) implies that $\bar{x}_1$ and $\bar{x}_2$ are colinear, and $\bar{y}_1$ and $\bar{x}_2$ are colinear. Since being colinear is a transitive relation, $\bar{x}_1$ and $\bar{y_1}$ are colinear, and hence $x_1$ and $y_1$ commute, which is a contradiction.
Next, we show that conjugacy growth is preserved when passing to finite index subgroups.
Lemma 5.2. Let G be a class 2 nilpotent group with infinite cyclic derived subgroup. If H is a finite index subgroup of G, then H is also class 2 nilpotent with infinite cyclic derived subgroup. Furthermore, G and H have equivalent conjugacy growth functions.
Proof. Since H is a subgroup of G it is nilpotent of class at most 2 and since it has finite index, [H, H] has finite index in [G, G] and is therefore infinite cyclic. We have $G=\Gamma_1\times H_{D_1}$ and $H=\Gamma_2\times H_{D_2}$ , where $\Gamma_1$ and $\Gamma_2$ are abelian and the groups $H_{D_1}$ and $H_{D_2}$ are as in Definition 3.6. Let $r_1$ and $r_2$ denote the corresponding Heisenberg ranks, and let $s_i$ denote the torsion-free rank of $\Gamma_i$ .
By Proposition 2.5, G and H have equivalent standard growth functions. Therefore by Theorem 2.11, we have $s_1+2r_1+2=s_2+2r_2+2$ , that is,
This also follows from Theorem 5.9. If $r_1,r_2\geq2$ , then Theorem 4.5 implies that $c_G(n)\sim n^{s_1+2r_1}$ and $c_H(n)\sim n^{s_2+2r}$ and hence $c_G\sim c_H$ by (5.2). If $r_1=r_2=1$ , then $s_1=s_2$ and Theorem 4.5 again implies that $c_G\sim c_H$ .
Suppose $r_1>1$ and $r_2=1$ . Applying Theorem 4.5, we have $c_G(n)\sim n^{s_1+2r_1}$ and $c_H(n)\sim n^{s_2+2}\log n$ . From (5.2), $n^{s_1+2r_1}=n^{s_2+2}$ , and then $c_G(n)\sim n^{s_2+2}$ and $c_H(n)\sim n^{s_2+2}\log n$ together violate Lemma 2.8.
Finally, suppose that $r_1=1$ and $r_2>1$ . So there exist four elements $x_1,y_1,x_2,y_2\in H<G=\Gamma_1\times H_1$ such that $\langle x_1,y_1,x_2,y_2\rangle\leq G$ has Heisenberg rank 2. Since G is a direct product and $\Gamma_1$ is abelian, two elements of G commute if and only if their $H_1$ components commute. So by passing to just the $H_1$ components of each element, we find elements $x'_1,y'_1,x'_2,y'_2$ contained in the $H_1$ factor of G, that generate a subgroup of Heisenberg rank 2. This contradicts Lemma 5.1.
Now we show that conjugacy growth is also preserved when passing to finite index supergroups.
Theorem 5.3. Suppose H is a class 2 nilpotent group with infinite cyclic derived subgroup. If a group G contains H as a finite index subgroup, then G and H have equivalent conjugacy growth functions.
Before we prove this Theorem, we note the following consequence.
Corollary 5.4. If G is (virtually) nilpotent with Heisenberg rank equal to 1, then its conjugacy growth series is non-holonomic, with respect to any finite generating set.
Proof. Theorems 5.3 and 4.5 show that the conjugacy growth of such a group is equivalent to $n^d\log n$ for some positive integer d. But by Theorem 2.16, no such function can have a holonomic power series.
To prove Theorem 5.3, we will use a general necessary and sufficient condition for conjugacy growth to be preserved under finite extensions. First, we define twisted conjugacy growth.
Definition 5.5. Let $\phi$ be a fixed automorphism of a group G. Define the $\phi$ -twisted conjugacy class of $g\in G$ by
and write $\mathcal{C}^{\phi}_G$ for the set of $\phi$ -twisted conjugacy classes of G (it is easy to check that $\phi$ -twisted conjugacy defines an equivalence relation). As in Definitions 2.1 and 2.2, the length of a twisted conjugacy class is defined as:
and the corresponding $\phi$ -twisted conjugacy growth function is
It is not hard to prove that the analogue of Proposition 2.4 holds in this case. In other words, the equivalence class of $\phi$ -twisted conjugacy growth does not depend on the choice of generating set and we can therefore drop the S from the notation. The following result is reminiscent of Theorem 3.1 of [Reference Bogopolski, Martino and Ventura5] in that conjugacy in an extension is understood via twisted conjugacy of the base group.
Proposition 5.6. Let H be any finitely generated group. Then $c^{\phi}_H(n)\preccurlyeq c_H(n)$ for every finite-order automorphism $\phi\in{\textrm{Aut}}(H)$ , if and only if every finite extension G of H satisfies $c_G(n)\sim c_H(n)$ .
Proof. First, suppose that $c^{\phi}_H(n)\preccurlyeq c_H(n)$ for every finite order $\phi\in{\textrm{Aut}}(H)$ . For any finite extension G, Lemma 2.8 states that $c_H(n)\preccurlyeq c_G(n)$ , so it remains to show the opposite bound. Fix a choice of transversal, $T\subset G$ , for $G/H$ . We consider the cosets tH separately, for each $t\in T$ (since the conjugacy class of any element of tH is contained within tH by normality). If a pair of elements of tH are conjugate by an element of H, then they are in the same conjugacy class (in G). Therefore, the number of H-conjugacy classes in the n-ball is at least the number of G-conjugacy classes. Therefore, only considering conjugation by elements of H will give an upper bound for the G-conjugacy growth of tH. Fixing $t\in T$ , and choosing an element $h\in H$ , we have the H-conjugacy class:
where $\phi_t\in{\textrm{Aut}} (H)$ is the finite-order automorphism defined by conjugation: $\phi_t\colon\gamma\mapsto t^{-1}\gamma t$ . So we can understand H-conjugation in tH as $\phi_t$ -twisted conjugation in H itself.
Since $c^{\phi_t}_H(n)\preccurlyeq c_H(n)$ by hypothesis, and the growth of tH is equivalent to the growth of H (as per Lemma 2.7), the contribution to the conjugacy growth of G from the coset tH is at most the conjugacy growth of H. Since there are only finitely many such cosets, we have $c_G(n)\preccurlyeq c_H(n)$ as claimed.
For the converse part of the statement, we prove its contrapositive, namely that if there exists some finite-order automorphism giving twisted conjugacy growth strictly greater than untwisted conjugacy growth, then there is a finite extension of H with inequivalent conjugacy growth. Suppose $\phi\in{\textrm{Aut}}(H)$ is such an automorphism. So there exists $k\in{\mathbb N}$ such that $\phi^k$ is the identity automorphism, and we have $c^{\phi}_H(n)\succ c_H(n)$ . Form the semidirect product $G\;:\!=\;H\rtimes_{\phi}{\mathbb Z}/k{\mathbb Z}$ , where the finite cyclic group is generated by t, which acts on H via $t^{-1}ht=\phi(h)$ for all $h\in H$ . We have $c_H\preccurlyeq c_G$ from Lemma 2.8, and we need to show this is a strict inequality to prove our result. It is enough to do this for one of the k cosets of H in G (which are again closed under conjugation since $H\lhd G$ ). We consider conjugating an element of the coset tH by elements of H. Let $h\in H$ . As above, we have
and so, again, H-conjugacy classes of the coset tH are t-translates of $\phi$ -twisted conjugacy classes of H. Therefore, the number of H-conjugacy classes in tH which intersect the n-ball is equivalent to the twisted conjugacy growth function $c^{\phi}_H(n)$ . We have
and so passing from H-conjugacy classes to full conjugacy can only increase the number of conjugacy classes in the n-ball by at most a constant factor of k, which does not change the equivalence class. Thus, the contribution to conjugacy growth from tH is equivalent to $c^\phi_H(n)$ , which is strictly greater than $c_H(n)$ by hypothesis, and we have $c_G(n)\succ c_H(n)$ as required.
Now we apply this general criterion to the specific case at hand. We will need the following simple observation.
Lemma 5.7. Let U be a subgroup of ${\mathbb Z}^d$ , and therefore a free abelian group of rank $u\leq d$ . Then the n-ball in ${\mathbb Z}^d$ (with respect to the standard generating set of unit vectors) intersects ${\mathcal O}(n^{d-u})$ distinct cosets of U.
Proof of Theorem 5.3. To apply Proposition 5.6, we need a normal subgroup. Any torsion in H is contained in the abelian factor and is therefore a finite subgroup, so we initially pass to the torsion-free direct factor. The conjugacy growth of a finite group is clearly eventually constant, so Lemma 2.9 shows that removing a finite direct factor preserves (the equivalence class of) the conjugacy growth function. Then, a standard argument allows us to pass to a normal subgroup of G, contained in H, also of finite index. By Lemma 5.2, this subgroup has equivalent conjugacy growth to the original subgroup. Therefore, we may assume without loss of generality that H is normal and torsion-free. Now to prove the Theorem, it is enough to show that $c^{\phi}_H(n)\preccurlyeq c_H(n)$ for any $\phi\in{\textrm{Aut}}(H)$ .
We explicitly describe $\phi$ -twisted conjugacy in H, for any fixed $\phi\in{\textrm{Aut}}(H)$ . With $h=z_1^{x_1}\cdots z_s^{x_s}a_1^{u_1}b_1^{v_1}\cdots a_r^{u_r}b_r^{v_r}c^w\in H$ and $x=z_1^{l_1}\cdots z_s^{l_s}a_1^{i_1}b_1^{j_1}\cdots a_r^{i_r}b_r^{j_r}c^k\in H$ , Proposition 3.19 gives
where $\gamma$ is a polynomial of degree 2 (and therefore so is f), $\varepsilon\in\{-1,1\}$ , and $M\in\mathbf{M}$ , and these all depend only on $\phi$ . We analyse various cases depending on the value of $\varepsilon$ and the nature of the matrix $M-I$ (which defines an endomorphism of $\textrm{Ab}(H)\cong{\mathbb Z}^{2r+s}$ ).
(1) If $\phi$ is the identity automorphism, then $\phi$ -twisted conjugacy is nothing more than standard conjugacy, and so $c^{\phi}_H(n)=c_H(n)$ in this case. In the remaining cases, we assume that $\phi$ is not the identity.
(2) Suppose $\varepsilon=-1$ , that is, $\phi$ inverts the commutator. If $x=c^k$ , then $\phi_t(x)hx^{-1}=hc^{-2k}$ . So just considering conjugators of the form $c^k$ for varying k, we can see that each fixed $(x_1,\ldots,v_r)\in{\textrm{Ab}}(H)$ corresponds to at most two $\phi$ -twisted conjugacy classes in H, which both have representatives of length $|(x_1,\ldots,v_r)|$ , as in Lemma 4.4. Considering all conjugators will not increase the number of conjugacy classes of a given length, so we have $c_H^{\phi}(n)\preccurlyeq \beta_{\textrm{Ab}(H)}(n) \sim n^{2r+s} \preccurlyeq c_H(n)$ .
(3) Now let $\varepsilon=1$ , and so equation (5.3) becomes
\begin{equation*} \phi(x)hx^{-1} = {\Big( (l_1,\ldots,j_r)(M-I) + (x_1,\ldots,v_r) \Big)}^{\wedge} c^{w +f(l_1,\ldots,j_r)}. \end{equation*}We think of (the Cayley graph of) H embedded in ${\mathbb R}^{2r+s+1}$ via Mal’cev coordinates (see Definition 3.10) and see that the projection of a $\phi$ -twisted conjugacy class $[h]^{\phi}$ onto the first $2r+s$ coordinates (i.e. the image under the abelianisation map) is a coset of the image of $M-I$ . That is,\begin{equation*}\overline{[h]^{\phi}} = \bar{h} + \textrm{Im}(M-I)\subset {\mathbb Z}^{2r+s}.\end{equation*}Write $\textrm{rk}(M-I)$ for the dimension of the image of $M-I$ . Lemma 5.7 implies that the number of distinct images of $\phi$ -twisted conjugacy classes in the abelianisation ${\mathbb Z}^{2r+s}$ that have length at most n is ${\mathcal O}(n^{2r+s-\textrm{rk}(M-I)})$ . Note that if $h\in H$ has length at most n, then its image in the abelianisation will also have length at most n in ${\mathbb Z}^{2r+s}$ . So the number of distinct $\phi$ -twisted conjugacy classes in the ball of radius n in H is ${\mathcal O}(n^{2r+s-\textrm{rk}(M-I)+2})$ , with the extra $n^2$ coming from the growth of the c axis. We consider subcases depending on the dimension of the image of $M-I$ .(a) For $2\leq\textrm{rk}(M-I)\leq 2r+s$ , we have
\begin{equation*}c_H^{\phi}(n)={\mathcal O}(n^{2r+s-\textrm{rk}(M-I)+2}) \preccurlyeq n^{2r+s}\preccurlyeq c_H(n)\end{equation*}as required.(b) Consider the case $\textrm{rk}(M-I)=1$ . Since the kernel of $M-I$ must be non-trivial in this case, we can fix a non-trivial element $\mathbf{b}\in\ker(M-I)$ . Then we have
\begin{equation*}\left\{\bar{h}c^{w+f(\eta\mathbf{b})}\mid \eta\in{\mathbb Z}\right\}\subset[\bar{h}c^w]^\phi=[h]^{\phi}.\end{equation*}Now $f(\eta\mathbf{b})$ is a quadratic polynomial in the single variable $\eta$ with zero constant term. Considering just those $\eta$ in the range $[\!-\!n,n]$ yields at least n distinct elements of $\bar{h}\langle c\rangle \cap [h]^{\phi}$ , all with c-coordinate of length ${\mathcal O}(n^2)$ (since $w={\mathcal O}(n^2)$ , f has degree 2, and $\mathbf{b}$ is constant), and hence contained in the n-ball in H (or more precisely the Cn-ball for some constant C). So amongst the ${\mathcal O}(n^2)$ elements of any $\langle c\rangle$ -coset in the n-ball, there can be at most ${\mathcal O}(n)$ distinct $\phi$ -twisted conjugacy classes. So in this case, the contribution from the c-axis is reduced from $n^2$ to n, and the number of distinct $\phi$ -twisted conjugacy classes in the n-ball is reduced from ${\mathcal O}(n^{2r+s-\textrm{rk}(M-I)+2})$ to ${\mathcal O}(n^{2r+s-\textrm{rk}(M-I)+1})$ . Since the rank is 1 in this case, we have the claimed upper bound:\begin{equation*}c_H^{\phi}(n)={\mathcal O}(n^{2r+s-1+1}) \preccurlyeq n^{2r+s} \preccurlyeq c_H(n).\end{equation*}(c) The final case is when $\textrm{rk}(M-I)=0$ , so $M=I$ and $\phi\in\ker\theta$ . In this specific case, we can be more explicit about the function f in the calculation at (5.3). We have
\begin{align*} \phi(x)hx^{-1} &= z_1^{l_1}\cdots b_r^{j_r}c^{k+p(l_1,\ldots,j_r)}hc^{-k}b_r^{-j_r}\cdots z_1^{-l_1} \\[5pt] &= hc^{p(l_1,\ldots,j_r)+x_1l_1+\cdots+v_rj_r} \\[5pt] &= {(x_1,\ldots, v_r)}^{\wedge} c^{w + (x_1+\lambda_1)l_1 + \cdots + (v_r+\lambda_{s+2r})j_r} \end{align*}where, as in the proof of Proposition 3.19, p is a linear function with coefficients $\lambda_i$ determined by $\phi$ .At each point $(x_1,\ldots,v_r)$ in the abelianisation, there are $\gcd(x_1+\lambda_1,\ldots,v_r+\lambda_{s+2r})$ $\phi$ -twisted conjugacy classes, all of length (asymptotically) equal to the length of $z_1^{x_1}\cdots b_r^{v_r}$ . Thus, by Corollary 2.27, we have $c_H^{\phi}(n)\sim c_H(n)$ .
5.1. A conjecture
As mentioned above, Hull and Osin [Reference Hull and Osin24] exhibit a finitely generated group with exponential conjugacy growth function, possessing a finite index subgroup with only two conjugacy classes, and so conjugacy growth fails to be a commensurability invariant in general. On the other hand, the conjugacy growth of a virtually abelian group is equivalent to its standard growth (see Proposition 2.12), and so conjugacy growth is a quasi-isometry invariant in this very restricted context. Theorem 5.3 tells us that conjugacy growth is a commensurability invariant amongst class 2 nilpotent groups with infinite cyclic derived subgroup.
We ask the following natural question.
Question 5.8. In which other classes of finitely generated groups is conjugacy growth a quasi-isometry invariant?
In light of Proposition 5.6, a detailed understanding of (finite-order) automorphisms would be valuable in understanding how conjugacy growth behaves under finite extensions. The following Theorem of Pansu gives insight into the nature of quasi-isometry in nilpotent groups.
Theorem 5.9 ([Reference Farb and Mosher16, Reference Pansu30]). For a finitely generated nilpotent group G, the numbers $r_i$ , the torsion-free ranks of the quotients $G^{(i+1)}/G^{(i)}$ , are quasi-isometry invariants.
This Theorem suggests that ‘geometrically speaking’, the numbers $r_i$ define the structure of a finitely generated nilpotent group. In light of this, and Theorem 5.3, we venture the following conjecture, which would imply that conjugacy growth was a quasi-isometry invariant amongst (virtually) nilpotent groups.
Conjecture 5.10. The conjugacy growth of a finitely generated nilpotent group G depends only on the numbers $r_i$ .
A natural next step to approach this conjecture would be to extend Theorems 4.5 and 5.3 to include groups whose derived subgroup is virtually cyclic, which would imply that conjugacy growth is a quasi-isometry invariant amongst this class of nilpotent groups.
Acknowledgments
The author was partially supported by EPSRC grant EP/R035814/1 and would like to thank Alex Bishop, Turbo Ho, Andrei Jaikin and Matthew Tointon for very early discussions on various aspects of this work. Thanks are also due to the anonymous referee for many useful comments and in particular for suggesting the upgrading of Proposition 5.6 from a sufficient condition to an equivalence.
Conflicts of interest
The author declares none.