Hostname: page-component-cd9895bd7-dzt6s Total loading time: 0 Render date: 2024-12-24T18:16:01.849Z Has data issue: false hasContentIssue false

Counting points by height in semigroup orbits

Published online by Cambridge University Press:  02 December 2024

Jason P. Bell
Affiliation:
Department of Mathematics, University of Waterloo, Waterloo, ON, N2L 3G1, Canada e-mail: [email protected]
Wade Hindes*
Affiliation:
Department of Mathematics, Texas State University, 601 University Dr., San Marcos, TX 78666 USA
Xiao Zhong
Affiliation:
Department of Mathematics, University of Waterloo, Waterloo, ON, N2L 3G1, Canada e-mail: [email protected]
Rights & Permissions [Opens in a new window]

Abstract

We improve known estimates for the number of points of bounded height in semigroup orbits of polarized dynamical systems. In particular, we give exact asymptotics for generic semigroups acting on the projective line. The main new ingredient is the Wiener-Ikehara Tauberian theorem, which we use to count functions in semigroups of bounded degree.

Type
Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of Canadian Mathematical Society

1 Introduction

A general principle in arithmetic dynamics is that the arithmetic behavior of orbits of rational points of algebraic varieties under self-maps can often be understood in terms of dynamical properties of the associated dynamical systems. In particular, dynamical properties of algebraic dynamical systems often put strong constraints on the shapes of orbits of rational points. It is then natural to study the extent to which measures of complexity of orbits of rational points are reflected by the corresponding measures for complexity of dynamical systems. Typically, one uses some notion of asymptotic behavior of heights in orbits as a measure of the complexity of an orbit and uses some notion of degree for iterates to measure the complexity of a dynamical system. The Kawaguchi-Silverman conjecture [Reference Kawaguchi and Silverman9], which asserts that the dynamical degree should determine the height growth for Zariski dense orbits of $\overline {\mathbb {Q}}$ -points, is perhaps the most striking prediction concerning the connection between these seemingly unrelated notions of complexity. Their conjecture has since been verified in a number of cases [Reference Lesieutre and Satriano11, Reference Lin12, Reference Matsuzawa13, Reference Matsuzawa and Sano14, Reference Matsuzawa and Yoshikawa15] but remains open.

In this paper, we build on earlier work of the second-named author [Reference Hindes6] and investigate the growth of heights under semigroup orbits. More precisely, we consider the setting when one has a projective variety V and a finite set $S=\{\phi _1,\ldots ,\phi _r\}$ of regular self-maps defined over a number field K that are polarizable with respect to a common ample invertible sheaf; that is, there is an ample invertible sheaf $\mathcal {L}$ and integers $d_1,\ldots ,d_r\ge 2$ such that $\phi _i^*(\mathcal {L}) \cong \mathcal {L}^{\otimes d_i}$ for $i=1,\ldots ,r$ . We note that this polarizable condition holds for every set of regular maps that are not automorphisms when $V=\mathbb {P}^N$ with $\mathcal {L}=\mathcal {O}(1)$ . Now let $M_S$ denote the semigroup of endomorphisms of V generated by S under composition. In particular, when S is polarized, one has a natural notion of degree for each $f\in M_S$ by declaring that the degree of f, denoted by $\mathrm {deg}_{\mathcal {L}}(f)$ , is the unique integer d such that $f^*\mathcal {L}\cong \mathcal {L}^{\otimes d}$ . Likewise, we may choose a height function $h_{\mathcal {L}}:V\rightarrow \mathbb {R}$ associated to $\mathcal {L}$ , and to any point $P\in V(\overline {\mathbb {Q}})$ study the orbit growth of P, which is the function

$$ \begin{align*}\#\{f\in M_S \colon h_{\mathcal{L}}(f(P))\le X\}.\end{align*} $$

Then the general principle stated above suggests that the asymptotics of the orbit growth should be heavily constrained by the asymptotic behavior of the corresponding degree growth function

$$ \begin{align*}\#\{f\in M_S \colon \mathrm{deg}_{\mathcal{L}}(f)\le X\}.\end{align*} $$

It is relatively straightforward to give upper bounds for the orbit growth in terms of the degree growth. Indeed, Kawaguchi and Silverman [Reference Kawaguchi and Silverman9] showed that the analogous phenomenon holds for dominant rational self-maps. However, it is very difficult in general to give strong lower bounds for the orbit growth in terms of the degree growth. The reason for this is that distinct maps in $M_S$ can send a point P to a common point Q, and this sort of “collapse” can, in some cases, be extreme. For example, if P is a common fixed point of all elements of S, then the orbit growth function is $O(1)$ , while the degree growth function can be exponential in X.

In an earlier work, the second-named author [Reference Hindes6] considered orbit growth and degree growth for semigroups of polarizable morphisms. In this paper, we extend those results by getting optimal asymptotics for degree growth and getting optimal asymptotics for orbit growth for certain classes of semigroups of morphisms of the projective line. However, similar to the case of iterating a single function, we must avoid preperiodic points, defined below.

Definition 1.1 Let $S=\{ \phi _1,\ldots ,\phi _r\}$ be a set of endomorphisms on a variety V. We say that a point $P\in V$ is preperiodic for $M_S$ if there exist $f,g\in M_S$ , with g not the identity, such that $g\circ f(P)=f(P)$ .

Our first main result is the following. In what follows, given real functions F and G, we write $F\ll G$ if there is a constant C such that $F(x)\leq C\cdot G(x)$ holds for all x sufficiently large. Likewise, we write $F\asymp G$ to mean both $F\ll G$ and $G\ll F$ . Finally, we write $F\sim G$ to mean $\displaystyle {\lim _{x\rightarrow \infty } F(x)/G(x)=1}$ .

Theorem 1.2 Let V be a projective variety, let $S:=\{\phi _1,\dots ,\phi _r\}$ be a polarized set of endomorphisms on V, and say $\phi _i^*\mathcal {L}\cong \mathcal {L}^{\otimes d_i}$ for some $d_i>1$ . Moreover, assume that $r\geq 2$ , and let $\rho $ be the unique real number satisfying

$$\begin{align*}\frac{1}{d_1^\rho}+\dots+\frac{1}{d_r^\rho}=1.\end{align*}$$

Then, the following statements hold for all non-preperiodic $P\in V$ :

  1. (1) In all cases, $\#\{f\in M_S\,:\, h_{\mathcal {L}}(f(P))\leq X\}\ll X^{\rho }$ .

  2. (2) If $M_S$ contains a free, finitely generated sub-semigroup, then there is a positive constant $\rho '\leq \rho $ such that

    $$\begin{align*}X^{\rho'}\ll \;\#\{f\in M_S\,:\, h_{\mathcal{L}}(f(P))\leq X\}\ll X^{\rho}.\end{align*}$$
  3. (3) If $M_S$ is free, then $\#\{f\in M_S\,:\, h_{\mathcal {L}}(f(P))\leq X\}\asymp X^{\rho }$ .

  4. (4) If $M_S$ is free and the $d_1,\ldots ,d_r$ are not all pairwise multiplicatively dependent, then there is a positive constant c such that

    $$\begin{align*}\#\{f\in M_S\,:\, h_{\mathcal{L}}(f(P))\leq X\}\sim cX^{\rho}.\end{align*}$$

Here in all cases, the implicit constants depend on P and S.

Remark 1.3 In particular, if the individual sets of preperiodic points $\operatorname {\mathrm {Prep}}(\phi )$ of each $\phi \in S$ are not all identical, then we obtain bounds of the form

$$\begin{align*}X^{\rho'}\ll \;\#\{f\in M_S\,:\, h_{\mathcal{L}}(f(P))\leq X\}\ll X^{\rho}\end{align*}$$

for all non-preperiodic $P \in V$ by [Reference Bell, Huang, Peng and Tucker1, Theorem 1.3].

Remark 1.4 In fact, an exact asymptotic as in statement (4) of Theorem 1.2 is not possible when the degrees of the maps in S are all powers of a single integer. For a concrete example, consider the set of polynomials $S=\{2x^8, 3x^8\}$ and the point $P=1$ . Note that $\rho =1/3$ in this case. Now for $X>0$ , define the function

$$\begin{align*}\Theta(X)=\#\{f\in M_S\,:\, h(f(P))\leq X\}/X^{1/3},\end{align*}$$

where $h(\cdot )$ is the usual Weil height on $\mathbb {P}^1$ . Then, one can show that

$$\begin{align*}0<\liminf \Theta(X) < \limsup \Theta(X) <\infty\end{align*}$$

in this case.

However, since every point in the total semigroup orbit of P,

$$\begin{align*}\operatorname{\mathrm{Orb}}_S(P):=\{f(P)\,:\,f\in M_S\},\end{align*}$$

of height at most X is determined by at least one function $f\in M_S$ satisfying $h_{\mathcal {L}}(f(P))\leq X$ , we easily obtain a general upper bound for the number of points of bounded height in non-preperiodic orbits. However, with some additional work, we can prove that such an upper bound holds for all orbits.

Theorem 1.5 Let V be a projective variety, and let $S:=\{\phi _1,\dots ,\phi _r\}$ be a polarized set of endomorphisms on V. Moreover, assume that $r\geq 2$ and that $\rho $ and $\mathcal {L}$ are as in Theorem 1.2. Then,

$$\begin{align*}\#\{Q\in\operatorname{\mathrm{Orb}}_S(P)\,:\, h_{\mathcal{L}}(Q)\leq X\}\ll X^{\rho}\end{align*}$$

holds for all points $P\in V$ .

However, we prove an even stronger result for generic sets of maps on $V=\mathbb {P}^1$ and establish an asymptotic for the number of points in an orbit of bounded height, instead of just an asymptotic for the number of functions yielding a bounded height relation. In what follows, given a positive integer d, we let $\operatorname {\mathrm {Rat}}_d$ be the space of rational functions in one variable of degree d; see [Reference Silverman21, §4.3] for details.

Theorem 1.6 Let $d_1,\dots , d_r$ be integers all at least four. Moreover, assume that $r\geq 2$ , and let $\rho $ be the unique positive real number satisfying $\sum d_i^{-\rho }=1$ . Then, there is a proper Zariski open subset $U\subseteq \operatorname {\mathrm {Rat}}_{d_1}\times \dots \times \operatorname {\mathrm {Rat}}_{d_r}$ such that

$$\begin{align*}\#\{Q\in \operatorname{\mathrm{Orb}}_S(P)\,:\, h(Q)\leq X\}\asymp X^{\rho}\end{align*}$$

for all $S\in U$ and all P such that $\operatorname {\mathrm {Orb}}_S(P)$ is infinite. Moreover, if in addition the $d_1,\ldots ,d_r$ are not all pairwise multiplicatively dependent, then

$$\begin{align*}\#\{Q\in \operatorname{\mathrm{Orb}}_S(P)\,:\, h(Q)\leq X\}\sim c X^{\rho}\end{align*}$$

for some positive constant c, which depends on P and S.

Remark 1.7 The constants in Theorem 1.6 are not explicit since they incorporate height bounds for rational points on curves. However, the subset U is explicit and consists of all critically separate and critically simple sets of maps (cf. [Reference Hindes6, Theorem 1.5]); see section 3 for details.

Remark 1.8 Similar hypotheses were recently used in [Reference Hindes and Silverman7] to give strong lower bounds for the number of points in semigroup orbits modulo primes. Moreover, these results are analogous to known results for abelian varieties [Reference Ram Murty, Rosen and Silverman17]. With this analogy in mind, the number of points of height at most X on an abelian variety A over a number field K is asymptotic to $X^{\rho /2}$ , where $\rho $ is the rank of the Mordell-Weil group $A(K)$ ; see [Reference Serre, Brown and Waldschmidt19, p. 54]. Thus, the present paper, as well as the papers [Reference Hindes6, Reference Hindes and Silverman7], suggest that the analogy

$$\begin{align*}\left(\begin{array}{@{}l@{}} \text{arithmetic of points}\\ \text{of an abelian variety}\\ \end{array}\right) \quad\Longleftrightarrow\quad \left(\begin{array}{@{}l@{}} \text{arithmetic of points in orbits}\\ \text{of a dynamical system}\\ \end{array}\right) \end{align*}$$

described in [Reference Benedetto, Ingram, Jones, Manes, Silverman and Tucker2] and [Reference Silverman22, §6.5] may be more accurate when the dynamical system on the right-hand side is generated by at least two noncommuting maps, rather than using orbits coming from iteration of a single map.

1.1 Notation

Throughout this paper, we use the following notation. If S is a set of self-maps, we let $M_S$ denote the semigroup generated by S under composition. If $M_S$ is free on the set S, we let $S_n$ (respectively, $S_{\le n}$ ) denote the set of n-fold compositions of elements of S (respectively, elements that are compositions of at most n elements of S), and for $f\in M_S$ , we let $|f|$ denote the unique natural number n such that f is an n-fold composition of elements of S.

2 Counting functions in polarized systems

2.1 Functions of bounded degree

Let V be a projective variety, and let $S=\{\phi _1,\dots ,\phi _r\}$ be a set of endomorphisms on V. Moreover, we assume that S is polarized with respect to some ample line bundle $\mathcal {L}$ on V, meaning that there are $d_1,\dots , d_r>1$ satisfying $\phi _i^*\mathcal {L}\cong \mathcal {L}^{\otimes d_i}$ . In particular, we may define the $\mathcal {L}$ -degree (or simply degree when $\mathcal {L}$ is understood) of a function f in the semigroup generated by S as follows:

$$\begin{align*}\qquad\qquad\deg_{\mathcal{L}}(f):=\prod_{j=1}^n d_{i_j}\qquad \text{for }f=\phi_{i_1}\circ\dots\circ \phi_{i_n}\in M_S.\end{align*}$$

In particular, $f^*\mathcal {L}\cong \mathcal {L}^{\otimes \deg _{\mathcal {L}}(f)}$ follows from standard results in algebraic geometry. However, if we fix a height function associated to $\mathcal {L}$ on V and a point $P\in V$ , then the height of $f(P)$ tends to grow according to $\deg _{\mathcal {L}}(f)$ (a fact used by Call and Silverman to define canonical heights). Therefore, as a first step in counting points of bounded height in orbits, we count functions of bounded degree. In fact, we will prove in this section that if $M_S$ is free, then there are positive constants $\rho $ , $c_1$ , and $c_2$ (depending only on the initial degrees $d_1,\dots d_r$ ) such that

(2.0.1) $$ \begin{align} c_1 X^{\rho}\leq\#\{f\in M_S\,:\, \deg_{\mathcal{L}}(f)\leq X\}\leq c_2 X^{\rho} \end{align} $$

for all X sufficiently large. Here, the constant $\rho $ is the unique positive real number satisfying

$$\begin{align*}\frac{1}{d_1^\rho}+\dots+\frac{1}{d_r^\rho}=1.\end{align*}$$

In fact, we can give more exact asymptotics. However, the form of these asymptotics turns out to depend on whether or not the degrees of the maps in S are all powers of a single integer, and so we put off stating these more precise results until later.

To begin, we collect some basic facts about counting words in free semigroups of bounded, multiplicative weight. This problem is very similar to a classical arithmetic problem of counting bounded multiplicative compositions of integers [Reference Erdös3, Reference Knopfmacher and Mays10], though not exactly the same, since we want to allow factors of equal size (precisely to allow maps of equal degree in our main dynamical theorems). In particular, due to the lack of an exact reference, we include the relevant statements and proofs here. Let us fix some notation. Let M be a multiplicative semigroup generated by a finite set $S=\{\alpha _1,\dots ,\alpha _r\}$ . We say that M is a weighted semigroup with respect to a vector $\pmb {d}=(d_1,\dots , d_r)$ of positive real weights if there is a well-defined multiplicative weight function $|\cdot |_{\pmb {d}}: M\rightarrow \mathbb {R}$ given by

(2.0.2) $$ \begin{align} \quad\quad\qquad |\omega|_{\pmb{d}}:=\prod_{j=1}^md_{i_j},\qquad \omega=\alpha_{i_1}\cdots\alpha_{i_m}\in M. \end{align} $$

In the case when M is the free semigroup on S, then every set of positive weights endows M with the structure of a weighted semigroup. In general, however, to be weighted, one requires that all representations of an element of M as a composition of elements of S have the same weight.

Then, given a real number X, we would like to obtain an asymptotic formula for the number of words of weight at most X,

(2.0.3) $$ \begin{align} \#\{\omega\in M\,:\, |\omega|_{\pmb{d}}\leq X\}. \end{align} $$

In particular, we have the following general estimate, which assumes only that $r\geq 2$ and that $\min \{d_i\}>1$ .

Remark 2.1 Strictly speaking, the result below is not necessary for proving our main theorems in this paper; it is superseded by the asymptotics given in Theorem 2.6. Nevertheless, we include this result since it may be useful for future work. For one, the more precise asymptotics to come assume integral weights, and non-integral weights do appear elsewhere in dynamics [Reference Kawaguchi8, Reference Silverman20]. Hence, the result below may still be useful in a more general setting.

Proposition 2.2 Let M be a semigroup generated by a set $S=\{\alpha _1,\dots ,\alpha _r\}$ , and suppose that M is weighted with respect to the vector $\pmb {d}=(d_1,\dots , d_r)$ of positive real numbers. Assume that $r\geq 2$ and that $\min \{d_1,\dots ,d_r\}>1$ , and let $\rho $ be the unique real number satisfying $d_1^{-\rho }+\dots +d_r^{-\rho }=1$ . Then, there is a positive constant $c_2$ such that

$$\begin{align*}\#\{\omega\in M\,:\, |\omega|_{\pmb{d}}\leq X\}\leq c_2\, X^{\rho} \end{align*}$$

holds for all X sufficiently large. Moreover, if M is free on the set S, then there is a positive constant $c_1$ such that

$$\begin{align*}c_1\, X^{\rho}\leq \#\{\omega\in M\,:\, |\omega|_{\pmb{d}}\leq X\}.\end{align*}$$

Proof We first consider the case when M is free, and we adjoin an identity to make M a monoid with identity element of weight $1$ . Let $w(x)$ be the number of elements of M of weight at most x. Then,

$$\begin{align*}w(x)=\begin{cases} 0 & \text{if}\;0\leq x<1, \\ 1+\displaystyle{\sum_{i=1}^rw\Big(\frac{x}{d_i}\Big)}&\text{if}\;1\leq x<\infty. \end{cases} \end{align*}$$

Indeed, there are no words of weight less than $1$ , and any nonempty word $\omega $ of weight at most x is uniquely of the form $\omega =\alpha _{i_1}\omega '$ for some word $\omega '$ of weight at most $x/d_{i_1}$ (and there are exactly $w(x/d_{i_1})$ such $\omega '$ ). Now let $W(x):=w(x)+\frac {1}{r-1}$ , and note that it suffices to show that $W(x)\asymp x^{\rho }$ , to conclude the same for $w(x)$ ; the reason we consider W instead of w is that it satisfies a slightly simpler recursion:

(2.2.1) $$ \begin{align} W(x)=\begin{cases} 1/(r-1) & \text{if}\;0\leq x<1, \\ \displaystyle{\sum_{i=1}^rW\Big(\frac{x}{d_i}\Big)}&\text{if}\;1\leq x<\infty, \end{cases} \end{align} $$

which follows easily from w’s functional equation. Now assume that the d’s are arranged in nondecreasing order, $d_1\leq \dots \leq d_r$ , and let $c_1=W(1)/d_r^\rho $ and $c_2=W(d_r)$ . We will show that

(2.2.2) $$ \begin{align} c_1x^{\rho}\leq W(x)\leq c_2x^{\rho}\qquad \text{for all }1\leq x\leq d_r\cdot d_1^n \end{align} $$

and all $n\geq 0$ by induction on n. In particular, since $d_1>1$ , it follows that $c_1x^{\rho }\leq W(x)\leq c_2x^{\rho }$ for all $x\geq 1$ . Note first that if $1\leq x\leq d_r$ , then

$$ \begin{align*} \begin{aligned} c_1x^{\rho}=\frac{W(1)}{d_r^\rho}x^{\rho}\leq \frac{W(1)}{d_r^\rho}d_r^{\rho}=W(1)&\leq W(x)\\[2pt] &\leq W(d_r)\leq W(d_r)\cdot x^\rho=c_2x^\rho \end{aligned} \end{align*} $$

since $W(x)$ and $x^{\rho }$ are an increasing function. Hence, (2.2.2) is true for $n=0$ . Now to the induction step: suppose that (2.2.2) holds and that $1\leq x\leq d_r\cdot d_1^{n+1}$ . In particular, since we have established (2.2.2) for all $1\leq x\leq d_r$ , we may assume without loss that $d_r\leq x\leq d_r\cdot d_1^{n+1}$ . But then,

$$\begin{align*}1\leq \frac{d_r}{d_i}\leq\frac{x}{d_i}\leq d_r\cdot\frac{d_1}{d_i}\cdot d_1^{n}\leq d_r\cdot d_1^n \end{align*}$$

for all $1\leq i\leq r$ . In particular, $1\leq x/d_i\leq d_r\cdot d_1^n$ , and so we have that $c_1(x/d_i)^\rho \leq W(x/d_i)\leq c_2(x/d_i)^\rho $ by the induction hypothesis. Hence,

$$ \begin{align*} \begin{aligned} c_1x^{\rho}=c_1x^{\rho}\Big(\sum_{i=1}^r\frac{1}{d_i^\rho}\Big)=\sum_{i=1}^r c_1\Big(\frac{x}{d_i}\Big)^\rho&\leq\sum_{i=1}^rW\Big(\frac{x}{d_i}\Big)\\[3pt] &\leq \sum_{i=1}^r c_2\Big(\frac{x}{d_i}\Big)^\rho=c_2x^{\rho}\Big(\sum_{i=1}^r\frac{1}{d_i^\rho}\Big)=c_2x^{\rho}. \end{aligned} \end{align*} $$

However, (2.2.1) implies that $W(x)=\sum _{i=1}^r W(x/d_i)$ , and we deduce that $c_1x^{\rho }\leq W(x)\leq c_2x^{\rho }$ for all $x\geq 1$ , as claimed.

We note that when M is not free, the estimates giving the upper bound still hold since for general semigroups, we still have the inequality

$$ \begin{align*}W(x) \le \sum_{i=1}^r W(x/d_i),\end{align*} $$

and so we can use the same proof verbatim to obtain the upper bound in the non-free case.

In particular, we immediately deduce the bounds in (2.0.1) for polarized sets of endomorphisms from Proposition 2.2 and the fact that the degrees of endomorphisms are multiplicative with respect to composition. However, it is possible to obtain more exact asymptotics for the quantity in (2.0.3) in the case of integer weights by analyzing the analytic properties of an associated generating function. With this in mind, we assume for the remainder of this section that the vector $\pmb {d}$ consists of positive integer weights. Then for any $n\in \mathbb {Z}$ , we let

(2.2.3) $$ \begin{align} a_{\pmb{d},n}:=\#\{\omega\in F_S\,:\, |\omega|_{\pmb{d}}=n\} \end{align} $$

be the number of words of weight equal to n and let

(2.2.4) $$ \begin{align} \mathcal{D}_{\pmb{d}}(s):=\sum_{n\geq0} a_{\pmb{d},n}n^{-s} \end{align} $$

be the associated Dirichlet series generating function. In particular, we will improve upon Proposition 2.2 in this case by applying known Tauberian theorems to (2.2.4). However, there is a catch: we must make two separate cases depending on whether the generating weights are all powers of a single integer or not. With this in mind, we make the following definition.

Definition 2.3 A vector $\pmb {d}=(d_1,\dots , d_r)$ of positive integers is called cyclic if there exists a single positive integer d and a collection of positive integers $n_1,\ldots ,n_r$ such that $d_j=d^{n_j}$ for all $j\geq 1$ . Moreover, if this is not the case, then we call $\pmb {d}$ acyclic. If $(d_1,\ldots ,d_r)$ is cyclic, then there is a unique positive integer d such that $d_i=d^{a_i}$ for some positive integers $a_1,\ldots ,a_r$ with $\gcd (a_1,\ldots ,a_r)=1$ , and we call this integer d the base of $\pmb {d}$ .

We use the following elementary fact from arithmetic.

Lemma 2.4 Let $d_1,\dots , d_r$ be positive integers all at least $2$ . If the ratio $\log (d_i)/\log (d_j)\in \mathbb {Q}$ for all $i,j$ , then the vector $\pmb {d}=(d_1,\dots ,d_r)$ is cyclic.

Remark 2.5 When $d,e$ are positive integers with $\log (d)/\log (e)\in \mathbb {Q}$ , we say that d and e are multiplicatively dependent, and when this does not hold, we say that d and e are multiplicatively independent. Then, Lemma 2.4 gives that $\pmb {d}=(d_1,\dots , d_r)$ is cyclic if and only if the d’s are all pairwise multiplicatively dependent.

With these notions in place, we now establish an exact asymptotic for the number of words of bounded multiplicative weight in free semigroups.

Theorem 2.6 Let $S=\{\alpha _1,\dots ,\alpha _r\}$ be an alphabet, let $F_S$ be the free semigroup generated by S, and let $\pmb {d}=(d_1,\dots , d_r)$ be a vector of positive integers all at least $2$ . Moreover, assume that $r\geq 2$ , and let $\rho _{\pmb {d}}>0$ be the unique real number satisfying $G_{\pmb {d}}(\rho _{\pmb {d}})=1$ , where

$$\begin{align*}G_{\pmb{d}}(s)=d_1^{-s}+d_2^{-s}+\dots +d_r^{-s}. \end{align*}$$

Then, the following statements hold:

  1. (1) If $\pmb {d}$ is acyclic, then

    $$\begin{align*}\mathcal{D}_{\pmb{d}}(s)=\frac{1}{1-G_{\pmb{d}}(s)} \end{align*}$$
    converges for all $\Re (s)\geq \rho _{\pmb {d}}$ except for a simple pole at $s=\rho _{\pmb {d}}$ . Hence, the Wiener-Ikehara Tauberian theorem [Reference Montgomery and Vaughan16, §8.3] implies that
    $$\begin{align*}\#\{\omega\in F_S\,:\, |\omega|_{\pmb{d}}\leq X\}=\sum_{n\leq X}a_{\pmb{d},n}=\frac{-1}{\rho_{\pmb{d}}\, G'(\rho_{\pmb{d}})}\,X^{\rho_{\pmb{d}}}+o(X^{\rho_{\pmb{d}}})\end{align*}$$
    as X tends to infinity.
  2. (2) If $\pmb {d}$ is cyclic of base d, then there is a positive constant C such that

    $$\begin{align*}\#\{\omega\in F_S\,:\, |\omega|_{\pmb{d}}\leq X\}=C\cdot d^{\lfloor \log_d(X)\rfloor \cdot \rho_{\pmb{d}}}+o(d^{\lfloor \log_d(X)\rfloor \cdot \rho_{\pmb{d}}}) \end{align*}$$
    as X tends to infinity.

Before we can establish Theorem 2.6, we need an elementary fact, which is a simple consequence of the strong triangle inequality.

Lemma 2.7 Let $z_1,\dots ,z_r\in \mathbb {C}$ not all zero be such that

$$\begin{align*}z_1+\dots +z_n=|z_1|+\dots+|z_n|.\end{align*}$$

Then, $z_i=|z_i|$ for all i (i.e., $\arg (z_i)=2k_i\pi $ for some $k_i\in \mathbb {Z}$ ).

Proof of Theorem 2.6

For all $m\geq 0$ , let $F_S(m)$ be the set of sequences of length m in $F_S$ . Then, it follows from the definition of weight in (2.0.2) that

$$\begin{align*}G_{\pmb{d}}(s)^m=\sum_{\omega\in F_S(m)}\hspace{-.3cm}|\omega|_{\pmb{d}}^{-s}=\sum_{n\geq1}\hspace{-.1cm}\sum_{\substack{\;|\omega|_{\pmb{d}}=n\\[2pt] \;\;\omega\in F_S(m)\\}}\hspace{-.4cm}n^{-s}=\sum_{n\geq1}a_{\pmb{d},n,m}\; n^{-s},\end{align*}$$

where $a_{\pmb {d},n,m}=\#\{\omega \in F_S(m)\,:\,|\omega |_{\pmb {d}}=n\}$ . In particular, we see that

$$ \begin{align*} \begin{aligned} \frac{1}{1-G_{\pmb{d}}(s)}=\sum_{m\geq0} G_{\pmb{d}}(s)^m&=\sum_{m\geq0}\sum_{n\geq1}(a_{\pmb{d},n,m})\; n^{-s} \\[4pt] &=\sum_{n\geq1}\sum_{m\geq0}(a_{\pmb{d},n,m})\; n^{-s}=\sum_{n\geq1} a_{\pmb{d},n}n^{-s}=\mathcal{D}_{\pmb{d}}(s) \end{aligned} \end{align*} $$

holds formally. As for the convergence of this expression, note first that since $d_i\ge 2$ for all i, $G_{\pmb {d}}(s)$ is a strictly decreasing continuous function on the real line. Moreover, $G_{\pmb {d}}(0)=r\geq 2$ and $\displaystyle {\lim _{x\rightarrow \infty } G_{\pmb {d}}(x)=0}$ , and so there is a unique, simple, real solution $x=\rho _{\pmb {d}}>0$ to the equation $G_{\pmb {d}}(x)=1$ . However, for all $\Re (s)>\rho _{\pmb {d}}$ , we have that

$$ \begin{align*} \begin{aligned} |G_{\pmb{d}}(s)|&=|d_1^{-s}+d_2^{-s}+\dots +d_r^{-s}|\\[3pt] &\leq |d_1^{-s}|+|d_2^{-s}|+\dots+|d_r^{-s}|\\[3pt] &=d_1^{-\Re(s)}+d_2^{-\Re(s)}+\dots+d_r^{-\Re(s)}\\[3pt] &<d_1^{-\rho_{\pmb{d}}}+d_2^{-\rho_{\pmb{d}}}+\dots+d_r^{-\rho_{\pmb{d}}}\\[3pt] &=1. \end{aligned} \end{align*} $$

In particular, the meromorphic function $\mathcal {D}_{\pmb {d}}(s)=(1-G_{\pmb {d}}(s))^{-1}$ converges in $\Re (s)>\rho _{\pmb {d}}$ and has a simple pole at $s=\rho _{\pmb {d}}$ . However, this is, in fact, the only pole on the line $\Re (s)=\rho _{\pmb {d}}$ . To see this, suppose that $G_{\pmb {d}}(s)=1$ for some $\Re (s)=\rho _{\pmb {d}}$ , and write $s=\rho _{\pmb {d}}+ci$ for some $c\in \mathbb {R}$ . Then setting $z_i=d_i^{-s}$ , we deduce from Lemma 2.7 that

$$\begin{align*}\log(d_i)c=2\pi k_i\end{align*}$$

for some $k_i\in \mathbb {Z}$ . In particular, if $c\neq 0$ , then $\log (d_i)/\log (d_j)\in \mathbb {Q}$ for all $i,j$ , and Lemma 2.4 implies that $\pmb {d}$ is cyclic. Therefore, if $\pmb {d}$ is acyclic, then $\mathcal {D}_{\pmb {d}}(s)= (1-G_{\pmb {d}}(s))^{-1}$ converges whenever $\Re (s)$ is at least $\rho _{\pmb {d}}$ , except for the simple pole at $s=\rho _{\pmb {d}}$ . This completes the proof of statement (1).

However, the cyclic case (say with base d) can be handled with somewhat simpler analysis. Write $d_i=d^{a_i}$ for some positive integers $a_i$ with $\gcd (a_1,\ldots ,a_r)=1$ . Since $F_S$ is free on $S=\{\alpha _1,\dots ,\alpha _r\}$ , there are exactly ${n_1+\cdots + n_r \choose n_1,\ldots ,n_r}$ words with $n_i$ occurrences of $\alpha _i$ for all $i=1,\ldots ,r$ . Moreover, the weight of each such element is exactly $d^{\sum a_i n_i}$ . In particular, the number of elements in $F_S$ of weight $\le d^L$ is the sum of the coefficients of $x^i$ for $i=0,\ldots ,L$ in the polynomial

$$\begin{align*}\sum_{n\ge 0} (x^{a_1}+\cdots + x^{a_r})^n.\end{align*}$$

In particular, we see that for a positive integer L, the number of elements in $F_S$ of weight $\le d^L$ is the coefficient of $x^L$ in the formal power series

$$\begin{align*}F(x):=(1-x)^{-1} \cdot (1-x^{a_1}-\cdots - x^{a_r})^{-1}.\end{align*}$$

Observe that $G(x):=1-x^{a_1}-\cdots - x^{a_r}$ is strictly decreasing on $(0,\infty )$ , and since the function is $0$ when $x=0$ and is negative when $x=1$ , it has a unique zero $\theta $ in $(0,1)$ . Since the derivative of G is negative on $(0,\infty )$ , this is a simple zero. Observe further that $x=\theta $ is the unique pole of $F(x)$ in the closed disc of radius $\theta $ . To see this, observe that if $\beta $ is another pole in this disc, then we must have $G(\beta )=0$ . Then,

$$\begin{align*}|G(\beta)| \ge 1-|\beta|^{a_1}-\cdots -|\beta|^{a_r} \ge 1-\theta^{a_1}-\cdots -\theta^{a_r}=0,\end{align*}$$

with equality only if $|\beta |=\theta $ . Thus, it suffices to consider $\beta $ of the form $\theta e^{2\pi i\mu }$ . Then, $G(\beta ) = 1- \theta ^{a_1} \exp (2\pi i a_1 \mu ) -\cdots -\theta ^{a_r} \exp (2\pi i a_r \mu )$ , and since $\sum \theta ^{a_j} = 1$ , this can only be zero if $\exp (2\pi i a_1 \mu ) =\cdots = \exp (2\pi i a_r \mu )=1$ . Since the $a_j$ have gcd 1, we then see this forces $\mu $ to be an integer, and so $\beta =\theta $ as required. Using partial fractions and the fact that $F(x)$ has a simple pole at $x=\theta $ and no other poles in the closed disc of radius $\theta $ , we see that

$$ \begin{align*}F(x) = C/(1-x/\theta) + F_0(x)\end{align*} $$

for some positive constant C and some rational function $F_0(x)$ whose radius of convergence is strictly larger than $\theta $ . In particular, the coefficient of $x^L$ in $F(x)$ is asymptotic to $C\theta ^L$ , and so we have that

$$\begin{align*}\#\{\omega\in F_S\,:\, |\omega|_{\pmb{d}}\leq d^L\}\sim C\theta^L.\end{align*}$$

Since $\theta $ is the unique positive solution to $\sum \theta ^{a_i} = 1$ , we see that if $\rho $ is such that $d^{\rho }=\theta $ , then $\rho $ is the unique positive solution to $\sum 1/d_i^{\rho }=1$ , and so $\rho =\rho _{\pmb {d}}$ , and we have

$$\begin{align*}\#\{\omega\in F_S\,:\, |\omega|_{\pmb{d}}\leq d^L\}\sim C (d^L)^{\rho_{\pmb{d}}}.\end{align*}$$

Now for X large, there is a unique L such that $d^L \le X < d^{L+1}$ . Hence, in general, we have that

$$\begin{align*}\#\{\omega\in F_S\,:\, |\omega|_{\pmb{d}}\leq X\} \sim C d^{\lfloor \log_d(X)\rfloor \cdot \rho_{\pmb{d}}},\end{align*}$$

a fact equivalent to statement (2).

In particular, we obtain an improved asymptotic for the number of functions of bounded degree in free semigroups generated by polarized sets.

Corollary 2.8 Let V be a projective variety, let $S:=\{\phi _1,\dots ,\phi _r\}$ be a polarized set of endomorphisms on V, and say $\phi _i^*\mathcal {L}\cong \mathcal {L}^{\otimes d_i}$ for some integers $d_i\geq 2$ . Moreover, assume that $r\geq 2$ , that $M_S$ is free, and that $\rho $ is the real number satisfying $d_1^{-\rho }+\dots +d_r^{-\rho }=1$ . Then the following statements hold.

  1. (1) If $\pmb {d}=(d_1,\dots , d_r)$ is acyclic, there is a positive constant c such that

    $$\begin{align*}\#\{f\in M_S\,:\, \deg_{\mathcal{L}}(f)\leq X\}\sim cX^{\rho}. \end{align*}$$
  2. (2) If $\pmb {d}=(d_1,\dots , d_r)$ is cyclic of base d, there is a positive constant C such that

    $$\begin{align*}\#\{f\in M_S\,:\, \deg_{\mathcal{L}}(f)\leq X\}\sim C\cdot d^{\lfloor \log_d(X)\rfloor \cdot \rho}.\end{align*}$$

Remark 2.9 We note that our results on degrees in this section work over an arbitrary ground field. However, for the remainder of this paper (when we instead count by heights), we will restrict ourselves to number fields.

2.2 From degrees to heights

Now that we have a handle on the growth of degrees in polarized semigroups, we use this to study the growth of heights in orbits. To do this, we fix some notation. As before, let $S=\{\phi _1,\dots ,\phi _r\}$ be a polarized set of endomorphisms on a projective variety V with $\phi _i^*\mathcal {L}\cong \mathcal {L}^{\otimes d_i}$ for some $d_i>1$ and some ample line bundle $\mathcal {L}$ . Moreover, we may choose a height function $h_{\mathcal {L}}: V\rightarrow \mathbb {R}_{\geq 0}$ associated to $\mathcal {L}$ , well-defined up to $O(1)$ . Then the basic tool that makes the conversion between degrees and heights possible is the following fact: for each $\phi _i$ there is a constant $C_i$ such that

(2.9.1) $$ \begin{align} \Big|h_{\mathcal{L}}(\phi_i(P))-d_i\, h_{\mathcal{L}}(P)\Big|\leq C_i \end{align} $$

holds for all $P\in V$ ; see, for instance, [Reference Silverman21, Theorem 7.29]. In particular, one may iterate the bound above and use a telescoping argument (due to Tate) to obtain a similar statement for semigroups; see [Reference Healey and Hindes5, Lemma 2.1].

Lemma 2.10 Let $d_S=\min \{d_i\}$ , and let $C_S=\max \{C_{i}\}$ be as in (2.9.1). Then,

$$\begin{align*}\bigg|\frac{h_{\mathcal{L}}(f(P))}{\deg_{\mathcal{L}}(f)}-h_{\mathcal{L}}(P)\bigg|\leq \frac{C_S}{d_S-1} \end{align*}$$

holds for all $f\in M_S$ and all $P\in V$ .

In particular, we may deduce the following result, which says first, that the set of preperiodic points of $M_S$ is a set of bounded height, and second, that if P is not a preperiodic point, then given a positive constant B, there are at most finitely many elements $f\in M_S$ for which $h_{\mathcal {L}}(f(P))<B$ . We note that this does not require freeness of the semigroup $M_S$ but heavily uses the fact that the endomorphisms in S are polarizable.

Lemma 2.11 There is a positive constant C such that

  1. (1) if P is a preperiodic point for the semigroup $M_S$ , then $h_{\mathcal {L}}(P)\le C$ ;

  2. (2) if $h_{\mathcal {L}}(P)> C$ , then $h_{\mathcal {L}}(f(P))> h_{\mathcal {L}}(P)$ for all $f\in M_S$ .

In particular, if P is not preperiodic for $M_S$ and if B is a positive constant, then there is an $N:=N(V,S,P,B)$ such that $h_{\mathcal {L}}(f(P))>B$ whenever there is some $m>N$ such that $f\in M_S$ can be expressed as a length-m composition of elements of S.

Proof Notice that if P is a preperiodic point for $M_S$ and $Q:=f(P)=g\circ f(P)$ , then

$$\begin{align*}\bigg|\frac{h_{\mathcal{L}}(g(Q))}{\deg_{\mathcal{L}}(g)}-h_{\mathcal{L}}(Q)\bigg|\leq \frac{C_S}{d_S-1}, \end{align*}$$

and since $g(Q)=Q$ and $\deg _{\mathcal {L}}(g)\ge 2$ , we then see that

$$ \begin{align*}h_{\mathcal{L}}(Q)\le 2C_S\cdot (d_S-1)^{-1}.\end{align*} $$

Similarly, we have

$$\begin{align*}\bigg|\frac{h_{\mathcal{L}}(f(P))}{\deg_{\mathcal{L}}(f)}-h_{\mathcal{L}}(P)\bigg|\leq \frac{C_S}{d_S-1}, \end{align*}$$

and since $f(P)=Q$ , we use the inequality on the height of Q to get that $h_{\mathcal {L}}(P)\le 2\frac {C_S}{d_S-1}$ . Also, observe that if $h_{\mathcal {L}}(P)\ge 2C_S$ , then if $f\in M_S$ , then Lemma 2.10 gives that $h_{\mathcal {L}}(f(P))/\deg _{\mathcal {L}}(f)> h_{\mathcal {L}}(P)/2$ , and so $h_{\mathcal {L}}(f(P))>h_{\mathcal {L}}(P)$ for all $f\in M_S$ . In particular, setting $C=2C_S$ , we deduce both claims (1) and (2).

For the final claim, assume that P is not preperiodic for $M_S$ and that $B>0$ . Note that without loss of generality, we may assume that $B\geq C$ . Now let K be a number field over which P and every map in S is defined. Then, $f(P)\in V(K)$ for all $f\in M_S$ , and

$$\begin{align*}N:=\#\{Q\in V(K)\,:\, h_{\mathcal{L}}(P)\leq B\}\end{align*}$$

is finite by Northcott’s theorem. We claim that we can take $N(V,S,P,B)$ equal to this N. To see this, suppose that $f=\theta _m\circ \theta _{m-1}\circ \dots \circ \theta _1$ for some $\theta _i\in S$ with $m>N$ and that $h_{\mathcal {L}}(f(P))\le B$ . For $1\leq i\leq m$ , we let $f_i:=\theta _i\circ \theta _{i-1}\circ \dots \circ \theta _1$ . Note first that if $h_{\mathcal {L}}(f_i(P))>B\geq C$ for some i, then $h_{\mathcal {L}}(f(P))>B$ also by property (2). Therefore, we may assume that $h_{\mathcal {L}}(f_i(P))\leq B$ for all $i\leq m$ . However, $m>N$ , so that $f_i(P)=f_j(P)$ for some $i>j$ by the pigeon-hole principle and the definition of N. But then,

$$\begin{align*}g_{ij}\circ f_j(P)=f_i(P)=f_j(P)\end{align*}$$

for $g_{ij}=\theta _i\circ \dots \circ \theta _{j+1}$ . Since $\theta _1,\ldots ,\theta _m$ are polarizable, we see that $g_{ij}$ is not the identity map, and this now contradicts our assumption that P is not preperiodic. Therefore, $h_{\mathcal {L}}(f(P))>B$ , as claimed.

Lastly, before proving Theorem 1.2, we need the following result, which provides a key new ingredient for improving the estimates in [Reference Hindes6].

Lemma 2.12 Suppose that $M_S$ is free and that P is a not a preperiodic point of V. Then, there is a positive constant $\beta =\beta (S,\mathcal {L},P)$ such that

$$ \begin{align*}\lim_{n\rightarrow\infty}\sum_{g\in S_n} h_{\mathcal{L}}(g(P)))^{-\rho}=\beta.\end{align*} $$

Proof For n, let $\beta _n=\beta _n(P)=\sum _{g\in S_n} h_{\mathcal {L}}(g(P))^{-\rho }$ . Since $M_S$ is free, $S_{n+1}$ is the disjoint union of $\phi _1\circ S_n,\ldots , \phi _r\circ S_n$ . Then,

$$ \begin{align*}\left| h_{\mathcal{L}}(\phi_i(f(P))) - d_i h_{\mathcal{L}}(f(P))\right| \le C_S,\end{align*} $$

and so

$$ \begin{align*}d_i h_{\mathcal{L}}(f(P))-C_S\le h_{\mathcal{L}}(\phi_i(f(P))) \le d_i h_{\mathcal{L}}(f(P))+C_S.\end{align*} $$

In particular, the mean value theorem gives the inequality

$$ \begin{align*}\left| h_{\mathcal{L}}(\phi_i(f(P)))^{-\rho} - (d_i h_{\mathcal{L}}(f(P))^{-\rho}\right| \le C_S \rho\, \big| d_i\cdot h_{\mathcal{L}}(f(P))-C_S \big|^{-\rho-1}.\end{align*} $$

Assume first that $h_{\mathcal {L}}(P)>C:=2C_S$ . (We note that this C is the constant used in the proof of Lemma 2.11.) Then, since $\sum _{i=1}^r 1/d_i^{\rho }=1$ , we have

(2.12.1) $$ \begin{align} |\beta_{n+1}-\beta_n| &= \left|\sum_{i=1}^r \sum_{f\in S_n} h_{\mathcal{L}}(\phi_i\circ f(P))^{-\rho} - (d_ih_{\mathcal{L}}(f(P)))^{-\rho} \right|\nonumber \\[4pt] &\le \sum_{i=1}^r \sum_{f\in S_n} C_S \rho\, \big|d_i\cdot h_{\mathcal{L}}(f(P))-C_S\big|^{-\rho-1}. \end{align} $$

By Lemma 2.10, we have

$$ \begin{align*}h_{\mathcal{L}}(f(P))\ge \deg_{\mathcal{L}}(f) h_{\mathcal{L}}(P)- \deg_{\mathcal{L}}(f)\frac{C_S}{d_S-1}> \deg_{\mathcal{L}}(f) \left(2C_S-\frac{C_S}{d_S-1}\right),\end{align*} $$

and so $h_{\mathcal {L}}(f(P))>C_S \deg _{\mathcal {L}}(f)$ . It follows that

$$ \begin{align*}d_i h_{\mathcal{L}}(P)-C_S>C_S( d_i \mathrm{deg}_{\mathcal{L}}(f)-1),\end{align*} $$

and so using Equation (2.12.1), we see

$$ \begin{align*} |\beta_{n+1}-\beta_n| &\le \sum_{i=1}^r \sum_{f\in S_n} \frac{C_S\rho}{(C_S( d_i \mathrm{deg}_{\mathcal{L}}(f)-1))^{\rho+1}}\\[4pt] &\le C_S^{-\rho} 2^{\rho+1}\rho \sum_{i=1}^r \sum_{f\in S_n} 1/(d_i \mathrm{deg}_{\mathcal{L}}(f))^{\rho+1}\\[4pt] &= C_S^{-\rho} 2^{\rho+1}\rho \sum_{j_1+\cdots +j_r=n+1} {n+1\choose j_1,\ldots ,j_r} d_1^{-j_1(\rho+1)} \cdots d_r^{-j_r(\rho+1)} \\[4pt] &= C_S^{-\rho} 2^{\rho+1}\rho\cdot (1/d_1^{\rho+1} + \cdots + 1/d_r^{\rho+1})^{n+1}. \end{align*} $$

In particular, since $C':=\sum 1/d_i^{\rho +1} < 1$ and since $\sum _j (C')^j$ converges, we see that $\{\beta _n(P)\}$ is a Cauchy sequence converging to some $\beta (P)$ , as claimed. This completes the proof when $h_{\mathcal {L}}(P)\geq 2C_S$ . For the general case, assume only that P is not preperiodic for S. Then Lemma 2.11 implies that there exists N such that $h_{\mathcal {L}}(f(P))>2C_S$ for all $|f|\geq N$ . However, for $n>N$ , we have that

$$\begin{align*}\sum_{g\in S_n} h_{\mathcal{L}}(g(P))^{-\rho}=\sum_{f\in S_{N}}\sum_{F\in S_{n-N}}h_{\mathcal{L}}(F(f(P))^{-\rho}=\sum_{f\in S_N}\beta_{n-N}(f(P))\end{align*}$$

since $M_S$ is free. Moreover, for each $f\in M_S$ , the sequence $\{\beta _{n-N}(f(P))\}$ converges by our argument above. Therefore, $\{\beta _n(P)\}$ , a finite sum of convergent sequences, must converge also.

In particular, we may use Lemma 2.10, Lemma 2.12, and the degree growth estimates in Section 2 to bound the number of functions in a polarized semigroup yielding a bounded height relation; see Theorem 1.2 from the Introduction.

Proof of Theorem 1.2

Assume that $r=\#S\geq 2$ , let $\rho $ be the unique positive number satisfying ${d_1}^{-\rho }+\dots +{d_r}^{-\rho }=1$ , and let $b_S=C_S/(d_S-1)$ ; here, $d_S$ and $C_S$ are the constants in Lemma 2.10. We first prove (1) in the case when $P\in V$ satisfies $h_{\mathcal {L}}(P)>b_S$ and then show how the general non-preperiodic case follows from this one. In particular, if $h_{\mathcal {L}}(P)>b_S$ , then Lemma 2.10 implies

$$ \begin{align*} \begin{aligned} \bigg\{f\in M_S:\, \deg_{\mathcal{L}}(f)\leq \frac{X}{h_{\mathcal{L}}(P)+b_S}\bigg\}&\subseteq \bigg\{f\in M_S:\, h_{\mathcal{L}}(f(P))\leq X\bigg\}\\ &\subseteq \bigg\{f\in M_S:\, \deg_{\mathcal{L}}(f)\leq \frac{X}{h_{\mathcal{L}}(P)-b_S}\bigg\}. \end{aligned} \end{align*} $$

In particular, Proposition 2.2, using our degree function to assign weights to elements of our semigroup, gives that there is a constant c such that

$$ \begin{align*} \begin{aligned} \#\{f\in M_S\,:\, h_{\mathcal{L}}(f(P))\leq X\}&\leq\#\bigg\{f\in M_S\,:\, \deg_{\mathcal{L}}(f)\leq \frac{X}{h_{\mathcal{L}}(P)-b_S}\bigg\}\\ &\leq c\Big(\frac{X}{h_{\mathcal{L}}(P)-b_S}\Big)^\rho \\[5pt] \end{aligned} \end{align*} $$

holds for all X sufficiently large.

Hence, we obtain statement (1) in the case when $h_{\mathcal {L}}(P)>b_S$ , as claimed. In general, if P is not preperiodic, then by Lemma 2.11, there are only finitely many elements f of our semigroup for which $h_{\mathcal {L}}(f(P))\le b_S$ . In particular, there is some N such that if $g=\phi _{i_1}\circ \cdots \circ \phi _{i_N}\in M_S$ , then either g is equivalent in $M_S$ to a composition of elements of S of length $<N$ or $h_{\mathcal {L}}(g(P))>b_S$ . Let $g_1,\ldots ,g_r$ denote the length-N compositions of elements of S that are not equivalent in $M_S$ to a shorter composition of elements of S.

Then, since all but finitely many elements of $M_S$ have a right compositional factor from $\{g_1,\ldots ,g_r\}$ , we have

$$\begin{align*}\#\{f\in M_S\,:\, h_{\mathcal{L}}(f(P))\leq X\} \le \sum_{i=1}^r \#\{f\in M_S\,:\, h_{\mathcal{L}}(f(g_i(P)))\leq X\} + O(1).\end{align*}$$

Since $h_{\mathcal {L}}(g(P))>b_S$ , we see that each of the summands on the right side is $\ll X^{\rho }$ , and so we deduce (1) in general.

We next prove (3) and (4) in the case when $h_{\mathcal {L}}(P)>b_S$ , and so we now assume that $M_S$ is free. Then, Proposition 2.2 and Lemma 2.10 imply that there is a positive constant $c'$ such that

$$ \begin{align*} \begin{aligned} \#\{f\in M_S\,:\, h_{\mathcal{L}}(f(P))\leq X\}&\geq\#\bigg\{f\in M_S\,:\, \deg_{\mathcal{L}}(f)\leq \frac{X}{h_{\mathcal{L}}(P)+b_S}\bigg\}\\[5pt] &\geq c'\,\Big(\frac{X}{h_{\mathcal{L}}(P)+b_S}\Big)^{\rho}\\[5pt] \end{aligned} \end{align*} $$

holds for all X sufficiently large. Combining this inequality with (1), we see that $\#\{f\in M_S\,:\, h_{\mathcal {L}}(f(P))\leq X\}\asymp X^\rho $ for all sufficiently large X, as claimed in statement (3).

To prove (4) in the case when $h_{\mathcal {L}}(P)>b_S$ , suppose that $M_S$ is free and that the vector of degrees $\pmb {d}=(d_1,\dots ,d_r)$ is acyclic. Moreover, let $\epsilon>0$ , and let

$$ \begin{align*}\beta:=\lim_{n\to\infty} \sum_{g\in S_n} h_{\mathcal{L}}(g(P))^{-\rho}\end{align*} $$

be the constant from Lemma 2.12. Then, Lemma 2.11 gives that there are only finitely many elements of $M_S$ with $h_{\mathcal {L}}(g(P)) \le b_S/\epsilon $ . Thus, there is a natural number n such that

$$\begin{align*}\left| \sum_{g\in S_n} h_{\mathcal{L}}(g(P))^{-\rho}-\beta\right| < \epsilon\end{align*}$$

and

$$\begin{align*}h_{\mathcal{L}}(g(P))-b_S> (1-\epsilon) h_{\mathcal{L}}(g(P)) \end{align*}$$

for all $g\in S_n$ . Then, statement (1) of Corollary 2.8 implies that for all $g\in S_n$ and all large X, we have that

$$ \begin{align*} &~ \#\{f\in M_S\,:\, h_{\mathcal{L}}(f\circ g(P))\leq X\}\\[3pt]& \quad \leq \#\bigg\{f\in M_S\,:\, \deg_{\mathcal{L}}(f)\leq \frac{X}{h_{\mathcal{L}}(g(P))-b_S}\bigg\}\\[3pt]& \quad \leq c(1+\epsilon)\Big(\frac{X}{h_{\mathcal{L}}(g(P))-b_S}\Big)^{\rho} \\[3pt]& \quad \le c(1+\epsilon)(1-\epsilon)^{-\rho} X^{\rho}/h_{\mathcal{L}}(g(P))^{\rho}. \end{align*} $$

However, for large X, we have that

$$\begin{align*}\#\{f\in M_S\,:\, h_{\mathcal{L}}(f(P))\leq X\} = \#S_{<n} + \sum_{g\in S_n} \#\{f\in M_S\,:\, h_{\mathcal{L}}(f\circ g(P))\leq X\},\end{align*}$$

and so we get that

$$ \begin{align*}\#\{f\in M_S\,:\, h_{\mathcal{L}}(f(P))\leq X\} \le \#S_{<n} + c(1+\epsilon)(1-\epsilon)^{\rho} X^{\rho}(\beta+\epsilon)\end{align*} $$

for X sufficiently large. Moreover, by a completely analogous argument, we get a lower bound of

$$ \begin{align*}\#\{f\in M_S\,:\, h_{\mathcal{L}}(f(P))\leq X\} \ge \#S_{<n} + c(1-\epsilon)(1+\epsilon)^{\rho} X^{\rho}(\beta-\epsilon)\end{align*} $$

for all large X. Then, since n is fixed and $\epsilon $ is arbitrarily small, we see that

$$ \begin{align*}\#\{f\in M_S\,:\, h_{\mathcal{L}}(f(P))\leq X\}= c\beta X^{\rho}(1+o(1)),\end{align*} $$

as claimed in statement (4).

Next, we assume only that P is a non-preperiodic for $M_S$ and prove (3) and (4) in this case. In particular, we still assume that $M_S$ is free. Then, Lemma 2.11 implies that there exists N such that $h_{\mathcal {L}}(g(P))>b_S$ for all $|g|\geq N$ . In particular, for all large X, we have that

However, by replacing P with $g(P)$ in our arguments above, we see that each of the finitely many terms $\#\{f\in M_S\,:\, h_{\mathcal {L}}(f(g(P)))\leq X \}$ for $g\in S_N$ exhibit growth according to statements (3) and (4) of Theorem 1.2, assuming the corresponding hypotheses. In particular, the total count $\#\{f\in M_S\,:\, h_{\mathcal {L}}(f(P))\leq X\}$ exhibits the desired growth also.

Finally, we observe that (2) quickly follows from (3). To see this, if $M_S$ contains a free semigroup with finite basis $S'\subset M_S$ , let $\rho '$ be the unique positive number satisfying $\sum _{g\in S'}1/\deg (g)^{\rho '}=1$ . Then, (3) gives that there is a positive constant $c"$ such that

$$ \begin{align*} \begin{aligned} \#\{f\in M_{S'}\,:\, h_{\mathcal{L}}(f(P))\leq X\}&\geq c"\,\Big(\frac{X}{h_{\mathcal{L}}(P)+b_S}\Big)^{\rho'}\\[5pt] \end{aligned} \end{align*} $$

holds for all X sufficiently large. In particular, we deduce statement (2) as claimed; moreover, clearly $\rho '\leq \rho $ since $X^{\rho '}\ll X^{\rho }$ by combining the bounds in statements (1) and (2) with the fact that $M_{S'}\subseteq M_S$ .

Lastly, we note that our upper bound on the number of functions $f\in M_S$ such that $h_{\mathcal {L}}(f(P))\leq X$ for non-preperiodic basepoints P leads to a general upper bound for the number of points of bounded height in orbits; see Theorem 1.5 from the Introduction. The last step is the following decomposition lemma.

Lemma 2.13 Let $S=\{\phi _1,\dots ,\phi _r\}$ be a polarizable set of endomorphisms on a variety V, let $P\in V$ be such that $\operatorname {\mathrm {Orb}}_S(P)$ is infinite, and let $B>0$ . Then, there is a finite subset $F\subseteq \operatorname {\mathrm {Orb}}_S(P)$ and distinct points $Q_1,\dots , Q_n$ in $\operatorname {\mathrm {Orb}}_S(P)$ such that the following statements hold:

  1. (1) Each $Q_i$ satisfies $h_{\mathcal {L}}(Q_i)>B$ .

  2. (2) $\operatorname {\mathrm {Orb}}_S(P)=F\cup \operatorname {\mathrm {Orb}}_S(Q_1)\cup \dots \cup \operatorname {\mathrm {Orb}}_S(Q_{n})$ .

In particular, we may assume that $Q_1,\dots ,Q_n$ are not preperiodic for S.

Proof Let C be the constant in Lemma 2.11. As a reminder, $C=2C_S$ , where $C_S$ is the constant from Lemma 2.10. Moreover, let $d=\max \{d_i\}$ , where $d_i$ is as in (2.9.1). Now let $P\in V$ have infinite orbit and $B>0$ . Without loss, we may assume $B\geq C$ . Note that the desired conclusion follows easily if $h_{\mathcal {L}}(P)>B$ by taking $n=1$ , $P=Q_1$ , and $F=\varnothing $ . Therefore, we may assume that $h_{\mathcal {L}}(P)\leq B$ . Now define the set

$$\begin{align*}T:=\big\{Q\in\operatorname{\mathrm{Orb}}_S(P)\,:\, B< h_{\mathcal{L}}(Q)\leq2dB\big\}.\end{align*}$$

In particular, $T=Q_1,\dots , Q_n$ is a finite (possibly empty) set by Northcott’s theorem (seen by working over a fixed number field K over which P and every map in S is defined). Likewise, $F:=\{Q\in \operatorname {\mathrm {Orb}}_S(P)\,:\, h_{\mathcal {L}}(Q)\leq B\}$ is a finite set, and we will show that F and the points $Q_i$ have the desired properties. To see this, suppose that $Q=\theta _m\circ \dots \circ \theta _1(P)$ for some $\theta _i\in S$ satisfies $h_{\mathcal {L}}(Q)>B$ (such Q must exist since $\operatorname {\mathrm {Orb}}_S(P)$ is infinite), and define

$$\begin{align*}m_0=\min\big\{s\,:\, h_{\mathcal{L}}(\theta_s\circ\dots\circ\theta_1(P))>B\big\}.\end{align*}$$

Note that $m_0$ exists since m is in the defining set above. Moreover, $m_0\geq 1$ since $h_{\mathcal {L}}(P)\leq B$ by assumption. We claim that $Q':=\theta _{m_0}\circ \dots \circ \theta _1(P)\in T$ (in particular, T is nonempty): if not, then necessarily $h_{\mathcal {L}}(Q')>2d B$ . However, we then see that

$$ \begin{align*} \begin{aligned} 2dB&<h_{\mathcal{L}}(Q')=h_{\mathcal{L}}(\theta_{m_0}(\theta_{m_0-1}\circ\dots\circ\theta_1(P)))\\[3pt] &\leq\deg(\theta_{m_0})(h_{\mathcal{L}}(\theta_{m_0-1}\circ\dots\circ\theta_1(P))+C_S)\leq2dB, \end{aligned} \end{align*} $$

a contradiction; here, we use Lemma 2.10 and the minimality of $m_0$ . In particular, we have shown that for any $Q\in \operatorname {\mathrm {Orb}}_S(P)$ of height bigger than B, there is a point $Q_i\in T$ such that $Q\in \operatorname {\mathrm {Orb}}_S(Q_i)$ . Hence, we deduce that $\operatorname {\mathrm {Orb}}_S(P)=F\cup \operatorname {\mathrm {Orb}}_S(Q_1)\cup \dots \cup \operatorname {\mathrm {Orb}}_S(Q_{n})$ , as claimed. Finally, we note that since each $h_{\mathcal {L}}(Q_i))>B$ and we have taken $B\ge 2C_S$ , which is the constant C in the proof of Lemma 2.11, we have that the $Q_i$ are not preperiodic by Lemma 2.11.

In particular, since every point in $\operatorname {\mathrm {Orb}}_S(P)$ of height at most X is determined by at least one function $f\in M_S$ satisfying $h_{\mathcal {L}}(f(P))\leq X$ and (outside of a finite set) all orbits are covered by non-preperiodic orbits, we see that

$$\begin{align*}\#\{Q\in\operatorname{\mathrm{Orb}}_S(P)\,:\, h_{\mathcal{L}}(Q)\leq X\}\ll X^{\rho}\end{align*}$$

holds for all $P\in V$ by combining Theorem 1.2 part (1) with Lemma 2.13. Hence, we have Theorem 1.5 from the Introduction.

3 Orbit counts in $\mathbb {P}^1$

3.1 Generic bounds

Now that we have some handle on the growth of functions with various boundedness properties in a semigroup, we turn to the asymptotic growth of points in orbits of bounded height. To make this transition, we need some control on the functions that agree at a particular value. This is possible when $V=\mathbb {P}^1$ precisely because we have some control on the set of rational (or integral) points on curves. To state this refinement in dimension $1$ , it will be useful to have the following general definition.

Definition 3.1 Let S be a set of endomorphisms of a variety V that are polarizable with respect to a common ample invertible sheaf $\mathcal {L}$ . Then, we say that $M_S$ has the cancellation property if for each $P\in V$ , there is a positive constant $B=B(P)$ such that if $f(Q_1)=g(Q_2)$ for some $f,g\in M_S $ and some $Q_1, Q_2 \in \operatorname {\mathrm {Orb}}_S(P)$ with $h_{\mathcal {L}}(Q_i)> B$ , then we have compositional factorizations $f=\phi \circ f_0$ and $g=\phi \circ g_0$ for some $\phi \in S$ and $f_0,g_0\in M_S\cup \{ \mathrm {id}\}$ with $f_0(Q_1)=g_0(Q_2)$ .

In particular, if $M_S$ has the cancellation property above, then an asymptotic on the number of functions yielding a bounded height relation leads to an asymptotic for the number of points of bounded height in orbits.

Proposition 3.2 Let $S=\{\phi _1,\dots ,\phi _r\}$ be a polarizable set of endomorphisms of a variety V generating a free semigroup with the cancellation property, and suppose that $P\in V$ is such that $\operatorname {\mathrm {Orb}}_S(P)$ is infinite. Then, there exists a $B>0$ and points $Q_1,\ldots ,Q_{t}\in \operatorname {\mathrm {Orb}}_S(P)$ such that $h_{\mathcal {L}}(Q_i)>B$ and

(3.2.1)

In particular, we may choose non-preperiodic points $Q_1,\ldots ,Q_{t}\in \operatorname {\mathrm {Orb}}_S(P)$ such that (3.2.1) holds.

Proof Let $B_0$ be the constant from Lemma 2.11, let $B_1$ be the constant from Definition 3.1, and let $B = \max (B_1, B_0)$ . Then, Lemma 2.13 implies that there exists a finite set F and points $Q_1, \dots , Q_n$ satisfying $h_{\mathcal {L}}(Q_i)> B$ such that

(3.2.2) $$ \begin{align} \operatorname{\mathrm{Orb}}_S(P) = F \cup \bigcup^n_{i = 1} \operatorname{\mathrm{Orb}}_S(Q_i). \end{align} $$

We will show that the union over the $\operatorname {\mathrm {Orb}}_S(Q_i)$ may be written as a disjoint union. To see this, suppose that $f(Q_i) = g(Q_j)$ for some $f, g \in M_S$ . If $|f|=|g|$ , then repeated application of the cancellation property implies that $Q_i=Q_j$ . However, if $|f| \neq |g|$ , say without loss $|f|> |g|$ , then repeated application of the cancellation property implies that $f_0(Q_i) = Q_j$ for some $f_0 \in M_S$ . Hence, $\operatorname {\mathrm {Orb}}_S(Q_j) \subseteq \operatorname {\mathrm {Orb}}_S(Q_i)$ in this case. Thus, after reordering if necessary, we may choose $t\leq n$ such that

(3.2.3) $$ \begin{align} \operatorname{\mathrm{Orb}}_S(P) = F \cup \bigsqcup^t_{i = 1} \operatorname{\mathrm{Orb}}_S(Q_i). \end{align} $$

However, for each $Q_i$ , a similar argument implies that the evaluation map $f\rightarrow f(Q_i)$ is injective: if $f(Q_i) = g(Q_i)$ for some $|f| = |g|$ , then repeated application of the cancellation property implies that $f = g$ . Likewise, if $|f|>|g|$ , then repeated application of the cancellation property implies that $f_0(Q_i) = Q_i$ for some non-identity map $f_0\in M_S.$ But then $Q_i$ is preperiodic, contradicting Lemma 2.11 and the fact that $h_{\mathcal {L}}(Q_i)> B$ . Therefore, for each $Q_i$ , the evaluation map $f\rightarrow f(Q_i)$ is injective, and thus,

(3.2.4) $$ \begin{align} \# \{Q \in \operatorname{\mathrm{Orb}}_S(Q_i): h_{\mathcal{L}}(Q) \leq X\} = \#\{f \in M_S : h_{\mathcal{L}}(f(Q_i)) \leq X\}. \end{align} $$

Hence, combining (3.2.3) and (3.2.4), we obtain (3.2.1) as claimed.

We next note that many sets of rational maps on $V=\mathbb {P}^1$ generate semigroups with the cancellation property above. To make this precise, recall that $w\in \mathbb {P}^1(\overline {\mathbb {Q}})$ is called a critical value of $\phi \in \overline {\mathbb {Q}}(x)$ if $\phi ^{-1}(w)$ contains fewer than $\deg (\phi )$ elements. Likewise, we call a critical value w of $\phi $ simple if $\phi ^{-1}(w)$ contains exactly $\deg (\phi )-1$ points. In particular, we have the corresponding notions for sets:

Definition 3.3 Let $S=\{\phi _1,\dots ,\phi _r\}$ be a set of rational maps on $\mathbb {P}^1$ , and let $\mathcal {C}_{\phi _i}$ denote the set of critical values of $\phi _i$ . Then, S is called critically separate if $\mathcal {C}_{\phi _i}\cap \mathcal {C}_{\phi _j}=\varnothing $ for all $i\neq j$ . Moreover, S is called critically simple if every critical value of every $\phi \in S$ is simple.

In particular, critically separate and critically simple sets of maps have the cancellation property; see also Proposition 4.1 and Lemma 4.8 in [Reference Hindes6].

Proposition 3.4 Let $S=\{\phi _1,\dots ,\phi _r\}$ be a critically separate and critically simple set of rational maps on $\mathbb {P}^1$ all of degree at least four. The following statements hold:

  1. (1) $M_S$ is a free semigroup.

  2. (2) For all non-preperiodic P, there is a constant $t_{S,P}$ such that

    $$\begin{align*}\#\{f\in M_S\,:\, f(P)=Q\}\leq t_{S,P}\end{align*}$$
    holds for all $Q\in \operatorname {\mathrm {Orb}}_S(P)$ .
  3. (3) The semigroup $M_S$ has the cancellation property.

Proof The first two statements are proved in [Reference Hindes6, §4]. Therefore, it only remains to prove (3). Let K be a number field over which P and all maps in S are defined. By the main results in [Reference Pakovich18] (see also [Reference Hindes6, Proposition 4.6]), we have that the curves $C_k:= \left \{(x,y)\colon \frac {\phi _k(x)-\phi _k(y)}{x-y}=0\right \}$ for $k=1,\ldots ,r$ and $C_{i,j}:=\{(x,y)\colon \phi _i(x)=\phi _j(y)\}$ for $i\neq j$ have only finitely many K-points, and we let $\Sigma $ denote the finite set of K-points of the union of these curves.

Let

$$\begin{align*}B_0 = \max_{(P_1,P_2) \in \Sigma}\big\{\max\{h_{\mathcal{L}}(P_1),h_{\mathcal{L}}(P_2)\}\big\},\end{align*}$$

let $B_1$ be the constant from Lemma 2.11, and let $B = \max \{B_0, B_1\}$ . Now suppose that $f,g\in M_S$ and $Q_1, Q_2 \in {\mathbb {P}}^1 $ satisfy

$$\begin{align*}\min(h_{\mathcal{L}}(Q_1), h_{\mathcal{L}}(Q_2))> B\end{align*}$$

and $f(Q_1)=g(Q_2)$ . We will show that B satisfies the conditions needed in Definition 3.1. To see this, write $f=\phi _i\circ f_0$ and $g=\phi _j\circ g_0$ for $f_0,g_0\in M_S\cup \{ \mathrm {id} \}$ and $1\leq i,j\leq r$ . There are now two cases to consider. The first is when $i\neq j$ . In this case, $(f_0(Q_1),g_0(Q_2))\in C_{i,j}$ and hence must be in $\Sigma $ , and so

$$\begin{align*}\max(h_{\mathcal{L}}(f_0(Q_1)),h_{\mathcal{L}}(g_0(Q_2)) )\leq B_0\leq B.\end{align*}$$

However, since $\min (h_{\mathcal {L}}(Q_1), h_{\mathcal {L}}(Q_2))> B \geq B_1$ , it follows from Lemma 2.11 that

$$\begin{align*}\min(h_{\mathcal{L}}(f_0(Q_1)),h_{\mathcal{L}}(g_0(Q_2)) )> B,\end{align*}$$

a contradiction.

The remaining case is when $i=j$ and $f_0(Q_1)\neq g_0(Q_2)$ . In this case, we have that $(f_0(Q_1),g_0(Q_2))\in C_i$ and hence is in $\Sigma $ , and the result again follows as in the preceding case.

We can now improve the main result in [Reference Hindes6].

Theorem 3.5 Let $S=\{\phi _1,\dots ,\phi _r\}$ be a critically separate and critically simple set of rational maps on $\mathbb {P}^1$ all of degree at least four. Moreover, assume that $r\geq 2$ , and let $\rho $ be the positive real number satisfying $\sum \deg (\phi _i)^{-\rho }=1$ . Then,

$$\begin{align*}\#\{Q\in\operatorname{\mathrm{Orb}}_S(P)\,:\, h(Q)\leq X\}\asymp X^{\rho}\end{align*}$$

for all $P\in \mathbb {P}^1(\overline {\mathbb {Q}})$ such that $\operatorname {\mathrm {Orb}}_S(P)$ is infinite. Moreover, if the degrees of the maps in S are acyclic, then

$$ \begin{align*} \#\{Q\in\operatorname{\mathrm{Orb}}_S(P)\,:\, h(Q)\leq X\}\sim cX^{\rho} \end{align*} $$

for some positive constant c, depending on P and S.

Proof Assume that P is a point in ${\mathbb {P}}^1$ that satisfies that $\operatorname {\mathrm {Orb}}_S(P)$ is infinite. Then, Proposition 3.4 implies that $M_S$ is free and has the cancellation property. Hence, Proposition 3.2 implies that there exists non-preperiodic points $Q_1,\ldots ,Q_t\in \operatorname {\mathrm {Orb}}_S(P)$ such that

(3.5.1)

From here, the result now follows directly from Theorem 1.2. For instance, if the degrees of the maps in S are acyclic, then statement (4) of Theorem 1.2 implies that for each $Q_i$ there are positive constants $c_{S,Q_i}$ such that

(3.5.2) $$ \begin{align} \#\{f\in M_S\,:\, h(f(Q_i))\leq X\}\sim c_{S,Q_i}X^{\rho}. \end{align} $$

Here, we use that the $Q_i$ are not preperiodic. In particular, combining (3.5.1) and (3.5.2) yields the desired conclusion. Likewise, the general case follows from (3.5.1) and statement (3) of Theorem 1.2.

In particular, if we fix integers $d_1,\dots d_r$ all at least four, then the bound for orbits in Theorem 3.5 holds on a Zariski open subset of $\operatorname {\mathrm {Rat}}_{d_1}\times \dots \times \operatorname {\mathrm {Rat}}_{d_r}$ .

Proof of Theorem 1.6

For each $d_i\geq 4$ , the proof of [Reference Pakovich18, Theorem 1.4] on page 211 implies that there is a proper open subset $U_i\subset \operatorname {\mathrm {Rat}}_{d_i}$ such that each $\phi \in U_i$ is critically simple. Likewise, the proof of [Reference Pakovich18, Theorem 1.2] on pages 208–209 implies that for each pair $(d_i, d_j)$ , there are proper open subsets $V_{ij}\subset \operatorname {\mathrm {Rat}}_{d_i}$ and $W_{ij}\subset \operatorname {\mathrm {Rat}}_{d_j}$ such that $\{\phi ,\psi \}$ is a critically separate set of maps for all $\phi \in V_{ij}$ and $\psi \in W_{ij}$ . Now define the open set

In particular, if $(\phi _1,\dots ,\phi _r)\in U$ , then it follows that $S=\{\phi _1,\dots ,\phi _r\}$ is a critically simple and critically separate set of maps whose orbits satisfy the desired height bounds by Theorem 3.5.

Although Theorems 3.5 and 1.6 do not apply directly to polynomials, similar statements can be made in this case assuming the disjointness and simplicity of affine critical values and that the degrees of the polynomials in S are at least $5$ ; see, for instance, [Reference Ha and Yang4, §3]. However, such conditions are stronger than those in the following result; compare to [Reference Hindes6, Theorem 1.6].

Theorem 3.6 Let $S=\{\phi _1,\dots ,\phi _r\}$ be a set of polynomials defined over a number field K with distinct and acyclic degrees all at least $2$ . Moreover, let $\rho $ be the real number satisfying $\sum _{\phi \in S} \deg (\phi )^{-\rho }=1$ , and assume that the following conditions hold:

  1. (1) Each $\phi _i \in S$ is not of the form $R \circ E \circ L$ for some polynomial $R \in \overline {{\mathbb {Q}}}[x]$ , some cyclic or Chebyshev polynomial E with $\deg (E)\geq 2$ , and some linear polynomial $L \in \overline {{\mathbb {Q}}}[x]$ ;

  2. (2) For all $i \neq j$ , we have that $\phi _j \neq \phi _i \circ F$ for any $F \in \overline {{\mathbb {Q}}}[x]$ .

Then, $M_S$ is a free semigroup, and for all points $P\in {\mathbb {P}}^1(K)$ such that $\operatorname {\mathrm {Orb}}_S(P)$ is infinite, we have that

$$ \begin{align*}\# \{Q \in \operatorname{\mathrm{Orb}}_S(P) : h(P) \leq X\} \sim cX^\rho \end{align*} $$

for some positive constant c depending on S and P.

Proof The semigroup $M_S$ is free by [Reference Hindes6, Theorem 1.6]. For the rest, take $\mathcal {O} \subset K$ to be a ring of $\mathcal {S}$ -integers in K containing P and all the coefficients of polynomials in S. Then, the curves $C_{\phi }:=\left \{ (x,y)\colon \frac {\phi (x)-\phi (y)}{(x-y)}=0\right \}$ and $C_{\phi ,\psi }:=\left \{(x,y)\colon \phi (x)=\psi (y)\right \}$ have finitely many $\mathcal {O}$ -points by [Reference Hindes6, Proposition 4.5] for all distinct $\phi ,\psi \in S$ ; that is, the set

$$\begin{align*}\Sigma:=\{(x,y)\in \mathcal{O}\times\mathcal{O}\,: (x,y)\in C_{\phi}\;\text{or}\; (x,y)\in C_{\phi,\psi}\;\text{for some }\phi,\psi\in S\} \end{align*}$$

is finite. From here, an identical proof to that of statement (3) of Proposition 3.4 implies that $M_S$ has the cancellation property; specifically, if

$$\begin{align*}B_0 = \max_{(x,y) \in \Sigma}\big\{\max\{h(x),h(y)\}\big\},\end{align*}$$

if $B_1$ is the constant from Lemma 2.11, and if $B = \max \{B_0, B_1\}$ , then $B=B(P)$ satisfies the conditions of Definition 3.1. In particular, the desired asymptotics now follow directly from Proposition 3.2 and Theorem 1.2.

Acknowledgements

We thank the anonymous referee for many helpful comments and suggestions that greatly improved this paper.

References

Bell, J. P., Huang, K., Peng, W. and Tucker, T. J., A tits alternative for rational functions. J. Eur. Math. Soc., to appear. Preprint, arXiv:2103.09994.Google Scholar
Benedetto, R., Ingram, P., Jones, R., Manes, M., Silverman, J. H. and Tucker, T. J., Current trends and open problems in arithmetic dynamics . Bull. Amer. Math. Soc. (N.S.) 56(2019), no. 4, 611685.CrossRefGoogle Scholar
Erdös, P., On some asymptotic formulas in the theory of the “factorisation numerorum.” Ann. of Math. (2) 42(1941), 989993,CrossRefGoogle Scholar
Ha, H. K. and Yang, C. C., On the functional equation $P(f)=Q(g)$ . In Value distribution theory and related topics, Adv. Complex Anal. Appl., Vol. 3, Kluwer Acad. Publ., Boston, MA, 2004, 201207.Google Scholar
Healey, V. Olsiewski and Hindes, W., Stochastic canonical heights . J. Number Theory 201(2019), 228256.CrossRefGoogle Scholar
Hindes, W., Counting points of bounded height in monoid orbits . Math. Z. 301(2022), no. 4, 33953416.CrossRefGoogle Scholar
Hindes, W. and Silverman, J. H., The size of semigroup orbits modulo primes . Pacific J. Math. 325(2023), no. 2, 281297.CrossRefGoogle Scholar
Kawaguchi, S., Canonical heights, invariant currents, and dynamical eigensystems of morphisms for line bundles . J. Reine Angew. Math. 597(2006), 135173.Google Scholar
Kawaguchi, S. and Silverman, J. H., On the dynamical and arithmetic degrees of rational self-maps of algebraic varieties . J. Reine Angew. Math. 713(2016), 2148.CrossRefGoogle Scholar
Knopfmacher, A. and Mays, M. E., A survey of factorization counting functions . Int. J. Number Theory 1(2005), no. 4, 563581.CrossRefGoogle Scholar
Lesieutre, J. and Satriano, M., Canonical heights on hyper-Kähler varieties and the Kawaguchi-Silverman conjecture . Int. Math. Res. Not. IMRN 10(2021), 76777714.CrossRefGoogle Scholar
Lin, J.-L., On the arithmetic dynamics of monomial maps . Ergodic Theory Dynam. Systems 39(2019), no. 12, 33883406.CrossRefGoogle Scholar
Matsuzawa, Y., Kawaguchi-Silverman conjecture for endomorphisms on several classes of varieties . Adv. Math. 366(2020), 107086, 26.CrossRefGoogle Scholar
Matsuzawa, Y. and Sano, K., Arithmetic and dynamical degrees of self-morphisms of semi-abelian varieties . Ergodic Theory Dynam. Systems 40(2020), no. (6), 16551672.CrossRefGoogle Scholar
Matsuzawa, Y. and Yoshikawa, S., Kawaguchi-Silverman conjecture for endomorphisms on rationally connected varieties admitting an int-amplified endomorphism . Math. Ann. 382(2022), no. 3–4, 16811704.CrossRefGoogle Scholar
Montgomery, H. L. and Vaughan, R. C., Multiplicative number theory. I. Classical theory , Cambridge Studies in Advanced Mathematics, Vol. 97, Cambridge University Press, Cambridge, 2007.Google Scholar
Ram Murty, M., Rosen, M. and Silverman, J. H., Variations on a theme of Romanoff . Internat. J. Math. 7(1996), no. 3, 373391.CrossRefGoogle Scholar
Pakovich, F., Algebraic curves $P(x)-Q(y)=0$ and functional equations. Complex Var. Elliptic Equ. 56(2011), no. 1–4, 199213.CrossRefGoogle Scholar
Serre, J.-P., Brown, M., and Waldschmidt, M., Lectures on the Mordell-Weil theorem, Springer, 1989.CrossRefGoogle Scholar
Silverman, J. H., Rational points on $K3$ surfaces: A new canonical height. Invent. Math. 105(1991), no. 2, 347373.CrossRefGoogle Scholar
Silverman, J. H., The arithmetic of dynamical systems, Graduate Texts in Mathematics, Vol. 241, Springer, New York, 2007.CrossRefGoogle Scholar
Silverman, J. H., Moduli spaces and arithmetic dynamics, CRM Monograph Series, Vol. 30, American Mathematical Society, Providence, RI, 2012.CrossRefGoogle Scholar