Theta functions, fourth moments of eigenforms, and the sup-norm problem I

Ilya Khayutin; Raphael S. Steiner

doi:10.1112/S0010437X24007437

Theta functions, fourth moments of eigenforms, and the sup-norm problem I

Part of: Partial differential equations on manifolds; differential operators Discontinuous groups and automorphic forms

Published online by Cambridge University Press: 03 April 2025

Ilya Khayutin and

Raphael S. Steiner

Show author details

Ilya Khayutin: Affiliation:
Department of Mathematics, Northwestern University, Evanston IL 60203, USA
Raphael S. Steiner: Affiliation:
Computing Systems Lab, Huawei Zurich Research Center, Thurgauerstrasse 80, CH-8050 Zurich, Switzerland [email protected]

Article contents

Abstract
Introduction
General method
The Weil representation and theta series
Theta kernels for Eichler orders
The theta lift
The Bergman kernel
The spectral expansion
The geometric expansion
Second-moment count of quaternions by norm
Proof of the main theorem
Conflicts of interest
Financial support
Journal information
Footnotes
References

Rights & Permissions

Abstract

We give sharp point-wise bounds in the weight-aspect on fourth moments of modular forms on arithmetic hyperbolic surfaces associated to Eichler orders. Thereby, we strengthen a result of Xia and extend it to co-compact lattices. We realize this fourth moment by constructing a holomorphic theta kernel on $\mathbf {G} \times \mathbf {G} \times \mathbf {SL}_{2}$, for $\mathbf {G}$ an indefinite inner form of $\mathbf {SL}_2$ over $\mathbb {Q}$, based on the Bergman kernel, and considering its $L^2$-norm in the Weil variable. The constructed theta kernel further gives rise to new elementary theta series for integral quadratic forms of signature $(2,2)$.

Keywords

sup-norm holomorphic forms theta function theta correspondence fourth moment

MSC classification

Primary: 11F72: Spectral theory; Selberg trace formula

Secondary: 11F11: Holomorphic modular forms of integral weight 11F27: Theta series; Weil representation; theta correspondences 11F70: Representation-theoretic methods; automorphic representations over local and global fields 58J50: Spectral problems; spectral geometry; scattering theory

Type: Research Article
Information: Compositio Mathematica , Volume 160 , Issue 12 , December 2024 , pp. 2916 - 2969

DOI: https://doi.org/10.1112/S0010437X24007437 [Opens in a new window]
Copyright: © The Author(s), 2025. The publishing rights in this article are licensed to Foundation Compositio Mathematica under an exclusive licence

1. Introduction

The study of distributional aspects of automorphic forms has enjoyed ample consideration in the past couple of decades, in particular questions related to the quantum unique ergodicity conjecture, various bounds for $L^p$-norms, and restriction problems. In this paper, we are mainly concerned with the $L^{\infty }$-norm of holomorphic Hecke eigenforms on arithmetic hyperbolic surfaces in the large weight limit, though our method also gives essentially sharp results for moments of $L^4$-norms.

The sup-norm problem asks for the best possible bound on the sup-norm of a Hecke eigenform in terms of the analytic conductor. Specifically, one often seeks a non-trivial bound on the sup-norm separately with respect to the weight, Laplace eigenvalue, or level aspect. It is analogous and closely related to the Lindelöf Hypothesis for automorphic $L$-functions. The go-to method for the majority of previous work on this problem is amplification. It was first used in this context by Iwaniec and Sarnak in the pioneering paper [Reference Iwaniec and SarnakIS95], though the idea of an amplifier goes back to Selberg [Reference SelbergSel42]. Iwaniec and Sarnak showed the bound

(1)

\begin{equation} \|\varphi\|_{\infty} \ll_{\Gamma, \varepsilon} (1+|\lambda_{\varphi}|)^{{5/24}+\varepsilon} \|\varphi\|_2 \end{equation}

for a Hecke–Maass form $\varphi \colon {\Gamma \backslash \mathbb {H}}\to \mathbb {C}$, where the lattice $\Gamma <\mathbf {SL}_2(\mathbb {R})$ is the unit norm elements of an Eichler order in a quadratic division algebra. Here and henceforth, we have adopted Vinogradov's notation. Their result marked the first time a power of $1+|\lambda _{\varphi }|$ was saved over what holds for a general Riemannian surface. Indeed, (1) has been known to hold with exponent $\frac {1}{4}$ for a general compact Riemannian surface, without any further assumptions of arithmetic nature (cf. [Reference SoggeSog88]). The amplifying technique has been used heavily due to its versatility. In the context of automorphic forms on arithmetic hyperbolic surfaces, Blomer and Holowinsky [Reference Blomer and HolowinskyBH10], Templier [Reference TemplierTem10, Reference TemplierTem15], Harcos and Templier [Reference Harcos and TemplierHT12, Reference Harcos and TemplierHT13], Saha [Reference SahaSah17a, Reference SahaSah17b, Reference SahaSah20], Hu and Saha [Reference Hu and SahaHS20], and Kıral [Reference KıralKır14] have used it to show subconvex bounds in various level aspects; Das and Sengupta [Reference Das and SenguptaDS15] and Steiner [Reference SteinerSte17] have used it to show subconvex bounds in the weight aspect. Blomer, Harcos, and Milićević [Reference Blomer, Harcos and MilićevićBHM16] and Blomer, Harcos, Maga, and Milićević [Reference Blomer, Harcos, Maga and MilićevićBHMM20] applied it to a more general setting over number fields, which corresponds to products of hyperbolic 2- and 3-spaces. The most general $\mathbf {PGL}_2$ result is due to Assing [Reference AssingAss17]. Moreover, the technique has also been adopted to arithmetic 2-spheres by Vanderkam [Reference VanderKamVan97] and products of 2- and 3-spheres by Blomer and Michel [Reference Blomer and MichelBM11, Reference Blomer and MichelBM13], and generalized to higher rank, e.g., by Blomer and Pohl [Reference Blomer and PohlBP16] for $\mathbf {Sp}_4$, Blomer and Maga [Reference Blomer and MagaBM15, Reference Blomer and MagaBM16] for $\mathbf {PGL}_n$ ($n\ge 4$), and Marshall [Reference MarshallMar14] for semisimple split Lie groups over totally real fields and their totally imaginary quadratic extensions, to name a few examples.

In this paper, we employ a different tool, namely the theta correspondence. The theta correspondence was first used by the second named author [Reference SteinerSte20] to tackle sup-norm problems. It has been used previously by Nelson to answer questions regarding quantum unique ergodicity and quantum variance [Reference NelsonNel16, Reference NelsonNel17, Reference NelsonNel19, Reference NelsonNel20], and give Fourier-like expansions for forms living on compact spaces [Reference NelsonNel15]. The main advantage of this approach is that instead of looking at an amplified second moment, we are able to bound a fourth moment sharply. Another advantage is that it works for co-compact lattices equally well as it does for non-co-compact lattices. Our main theorem and its corollary read as follows.

Theorem 1.1 Let the arithmetic lattice $\Gamma < \mathbf {SL}_{2}(\mathbb {R})$ be the unit norm elements of an Eichler order in an indefinite quaternion algebra over $\mathbb {Q}$ and $\{f_j\}_j \subset S_m^{\mathrm {new}}(\Gamma )$ be an orthonormalFootnote ¹ basis of Hecke newforms of weight $m > 4$. Then, there is a constant $A \ge 1$, such that for any $\varepsilon >0$, there is a constant $C_{\varepsilon }$ for which we have

(2)

\begin{equation} \sum_j y^{2m}|f_j(z)|^4 \le C_{\varepsilon} \operatorname{covol}(\Gamma)^A m^{1+\varepsilon} \big(1+m^{-{1/2}} \mathrm{ht}_{\Gamma}(z)^2 \big), \end{equation}

where $\mathrm {ht}_{\Gamma }(z)=1$ if $\Gamma$ is co-compact and

\[ \mathrm{ht}_{\Gamma}(z)=\max_{\gamma \in \mathbf{SL}_2(\mathbb{Z})} \ \Im(\gamma z) \]

if $\Gamma < \mathbf {SL}_2(\mathbb {Z})$. Furthermore, we have

(3)

\begin{equation} \sum_j \|f_j\|_4^4 \le C_{\epsilon} \operatorname{covol}(\Gamma)^A m^{1+\epsilon}. \end{equation}

Corollary 1.2 Let $\Gamma < \mathbf {SL}_{2}(\mathbb {R})$ be as above with the additional assumption of being co-compact ($\Leftrightarrow B(\mathbb {Q})$ is non-split) and $f \in S_m^{\mathrm {new}}(\Gamma )$ a Hecke newform of weight $m > 4$. Then, there is a constant $A \ge 1$, such that for any $\varepsilon >0$, there is a constant $C_{\varepsilon }$ for which we have

(4)

\begin{equation} \sup_{z \in \mathbb{H}} y^{{m/2}}|f(z)| \le C_{\varepsilon} \operatorname{covol}(\Gamma)^{A} m^{{1/4}+\varepsilon} \|f\|_2. \end{equation}

The first half of Theorem 1.1 marks a significant improvement over what has been known previously. It shows that the $L^{\infty }$-norm of the fourth moment of holomorphic newforms of weight $m$ is, essentially, as small as it can be, meaning that they enjoy a stronger ‘orthogonality’ relation than what was known previously. Remarkably, our proof does not rely on any deep results from arithmetic geometry such as Deligne's bound for the Hecke eigenvalues, but rather a sharp bound for a second-moment matrix count as we shall explain in further detail in § 2. The second half of Theorem 1.1 is a simple consequence of the first half if $\Gamma$ is co-compact and otherwise it follows in conjunction with [Reference Blomer, Khan and YoungBKY13, Theorem 1.8], which says that the mass of the fourth norm is concentrated in the domain $\{z \in {\Gamma \backslash \mathbb {H}} : \mathrm {ht}_{\Gamma }(z) \le m^{{1/4}} \}$. Following Sarnak and Watson [Reference SarnakSar03], Inequality (3), through the use of Watson's formula [Reference WatsonWaT08, Theorem 3] or, more generally, Ichino's formula [Reference IchinoIch08] (cf. [Reference NelsonNel11, § 4]), may be reformulated as a Lindelöf on average statement about degree-eight $L$-functions. In particular, assuming that the product of the reduced discriminant $D_B$ of $B$ and the level $q$ of $\Gamma$ is square-free, one deduces

(5)

\begin{equation} \frac{1}{m}\sum_f \frac{1}{2m}\sum_g L\bigg(f \times f \times g, \frac{1}{2}\bigg) \le C_{\varepsilon} \operatorname{covol}(\Gamma)^A m^{\varepsilon}, \end{equation}

where $f \in S_m^{\mathrm {new}}(\Gamma )$ runs through a basis of newforms of weight $m$ for $\Gamma$ and $g \in S_{2m}(\Gamma )$ runs through an orthonormal set of newforms of weight $2m$ for $\Gamma$ with Hecke eigenvalues equal to $1$ for all primes $p \mid D_B$ and Atkin–Lehner eigenvalues equal to $-1$ for all primes $p \mid q$. This should be compared with the result of Sun and Ye [Reference Sun and YeSY19] who considered the double average of the degree-six $L$-function $L(\operatorname {Sym}^2 f \times g, \frac {1}{2})$, where $f,g$ are Hecke eigenforms of weight $m$, respectively $2m$, for $\mathbf {SL}_2(\mathbb {Z})$. Note that $L(f \times f \times g, \tfrac {1}{2})=L(\operatorname {Sym}^2 f \times g, \tfrac {1}{2}) L(g,\frac {1}{2})$. One should also mention a result of Khan [Reference KhanKha14], who managed to show an asymptotic formula for the left-hand side of (3) for $\Gamma =\mathbf {SL}_2(\mathbb {Z})$ with an extra (smooth) average over the weight $m$. Khan's result matches up with conjectures concerning the asymptotics of the $L^4$-norm in the large weight aspect. We refer to [Reference Blomer, Khan and YoungBKY13] for details regarding these conjectures. In the future, we plan to address the question whether one can upgrade the second half of Theorem 1.1 to an asymptotic without any extra average over the weight. We shall also mention the strongest individual bound for the $L^4$-norm of a Hecke eigenform $f$ of weight $m$ on $\mathbf {SL}_2(\mathbb {Z})$ which is due to Blomer, Khan, and Young [Reference Blomer, Khan and YoungBKY13]. They managed to show $\|f\|_4 \ll _{\varepsilon } m^{{1/12}+\varepsilon } \|f\|_2$.

The convex or trivial bound in the context of Corollary 1.2 is $\ll \operatorname {covol}(\Gamma )^{{1/2}}m^{{1/2}}$ and the first non-trivial bound in the weight aspect $\ll _\varepsilon m^{{1/2}-\delta +\varepsilon }$ for a small $\delta >0$ was achieved by Das–SenguptaFootnote ² [Reference Das and SenguptaDS15] through the use of an amplifier. The previous best bound in the weight aspect is due to Ramacher and Wakatsuki [Reference Ramacher and WakatsukiRW21] who established a subconvex bound for the sup-norm in significant generality.

The analogue of Corollary 1.2 for non-uniform lattices is much easier to establish, because one can use the Fourier expansion at a cusp and then apply Deligne's bound for the Fourier coefficients. This was observed by Xia [Reference XiaXia07], who worked out the case $\Gamma =\mathbf {SL}_2(\mathbb {Z})$. In the same fashion, a sharp hybrid bound for holomorphic forms of minimal type was derived by Hu, Nelson, and Saha [Reference Hu, Nelson and SahaHNS19]. We would also like to thank Paul Nelson for pointing out to us the relation between our technique and [Reference NelsonNel15, Theorem 3.1]. Nelson uses an explicit (non-holomorphic) version of Shimizu's theta kernel [Reference ShimizuShi72] to construct an expansion of $y^m|f(z)|^2$, where $f$ is an arithmetically normalized newform on a compact arithmetic surface, that resembles a Fourier expansion.

Finally, we shall mention that we did not attempt to optimize the dependence on the co-volume or level in Theorem 1.1 in this first paper. Due to our method requiring sharp bounds for a second-moment matrix count of length comparable to the conductor, any such undertaking must necessarily address the inability of pre-existing matrix counting techniques in the non-split case, such as [Reference TemplierTem10], to deal with large determinants. Furthermore, a strategy needs to be devised to incorporate the dependence on the reduced discriminant of the indefinite quaternion algebra. All of this shall be addressed in a sequel joint with Nelson [Reference Khayutin, Nelson and SteinerKNS22].

As far as the structure of this paper goes, in the following section, we shall briefly explain the main concept of the proof as well as mentioning an alternative approach using $L$-functions instead of a theta kernel. Sections 3 and 4 deal with local and global properties of the Weil representation and their consequences to the associated theta series. The action of the Hecke algebra on the theta kernel is computed in § 5. In §§ 6 and 7, we show that the Bergman kernel satisfies the required assumptions in the construction of the theta kernel and compute its spectral expansion. In § 8, we reduce a bound on the $L^2$-norm of the theta kernel to matrix counts. In § 9, we prove the essentially sharp second-moment matrix count. The main theorem is then established in § 10.

2. General method

In this section, we shall briefly explain two essentially equivalent strategies that lead to Theorem 1.1. We shall first lay out the approach which is conceptually closer to that of an amplifier. For simplicity, we shall assume everything is unramified, i.e. $\Gamma = \mathbf {SL}_2(\mathbb {Z})$, which is the set of determinant one elements of the maximal order $\mathcal {R}=\operatorname {Mat}_{2x2}(\mathbb {Z})$ inside the quaternion algebra $\operatorname {Mat}_{2x2}(\mathbb {Q})$. Let $\mathcal {R}_n$ denote the elements of $\mathcal {R}$ of norm $n$, such that $\Gamma =\mathcal {R}_1$. We begin with a Bergman kernel (also known as a reproducing kernel) on $S_m(\Gamma )$, the space of weight $m$ holomorphic cusp forms on $\Gamma$,

(6)

\begin{equation} B(z,w) = \sum_j \Im(z)^{{m/2}} f_j(z) \Im(w)^{{m/2}} \overline{f_j(w)}, \end{equation}

where $\{f_j\}_j$ is an orthonormal basis of Hecke eigenforms of the space $S_m(\Gamma )$. The amplified counterparts to the Bergman kernel are

(7)

\begin{equation} B_n(z,w) = \sum_j \lambda_j(n) \Im(z)^{{m/2}} f_j(z) \Im(w)^{{m/2}} \overline{f_j(w)}, \end{equation}

where $\lambda _j(n)$ is the $n$-th Hecke eigenvalue of the newform $f_j$. We normalize the Hecke operators so that Deligne's bound reads $|\lambda _j(n)| \le d(n)$, $d(n)$ is the divisor function. The kernels $B_n$ are roughly of the shape

(8)

\begin{equation} B_n(z,w) \approx \frac{m}{\sqrt{n}} \sum_{\substack{\alpha \in \mathcal{R}_n \\ u(\alpha z, w) \le {1/m}}} 1, \end{equation}

where $u(z,w)=|z-w|^2/(4 \Im (z) \Im (w))$. Instead of taking a suitable linear combination of (7) as one would do for an amplifier, we consider

(9)

\begin{equation} \int_0^1 \bigg| \sum_{n \le m} B_n(z,z) e(nt) \bigg|^2 \,dt = \sum_{i,j} \Im(z)^{2m} |f_j(z)|^2|f_i(z)|^2 \sum_{n \le m} \lambda_j(n) \overline{\lambda_i(n)}. \end{equation}

To the latter, or more precisely a smooth version thereof, one may apply Voronoï summation. If we set aside any intricacies stemming from Riemann zeta factors and smoothing, we pick up main terms for $i=j$ corresponding to the poles of $L(f_i \times f_j,s)$ at $s=1$ for $i=j$ and a dual sum of length $m^2/m$. Thus, we find that (9) is approximately

(10)

\begin{equation} m \sum_{j} \Im(z)^{2m} |f_j(z)|^4 + \sum_{\substack{i,j \\ i \neq j}} \Im(z)^{2m} |f_j(z)|^2|f_i(z)|^2 \sum_{n \leq m^2/m} \lambda_j(n) \overline{\lambda_i(n)}. \end{equation}

We see that the new dual sum is once again of the shape (9) and we may replace it with its geometric counterpart. Through rearranging and the use of the approximation of the Bergman kernel (8), one arrives at

(11)

\begin{equation} \sum_{j} \Im(z)^{2m} |f_j(z)|^4 \ll m \sum_{n \le m} \frac{1}{n} \sum_{\substack{\alpha_1,\alpha_2 \in \mathcal{R}_n \\ u(\alpha_i z,z) \le {1/m},\ i=1,2}} 1. \end{equation}

We see that we end up with a second-moment matrix count. Before we discuss the latter further, we shall describe how to arrive at the same inequality in an alternate fashion by using a theta kernel.

At its core, one wishes to find a kernelFootnote ³ $\vartheta : {\Gamma \backslash \mathbf {SL}_2(\mathbb {R})} \times {\Gamma \backslash \mathbf {SL}_2(\mathbb {R})} \times {\Lambda \backslash \mathbf {SL}_2(\mathbb {R})}\to \mathbb {C}$, such that

(12)

\begin{equation} \big \langle \vartheta(z,w;\bullet), (\Im\bullet)^{m/2}\tilde{f} \big\rangle = \Im(z)^{{m/2}} f(z) \Im(w)^{{m/2}} \overline{f(w)} \cdot \|\tilde{f}\|_2^2, \end{equation}

for an $L^2$-normalized newform $f$ and $\tilde {f}$ an arithmetically normalized newform in the Jacquet–Langlands transfer to $\mathbf {GL}_2$ of the automorphic representation generated by $f$. It immediately follows that

(13)

\begin{equation} \sum_{f} \|\tilde{f} \|_2^2 \cdot \Im(z)^{2m} |f(z)|^4 \le \|\vartheta(z,z;\bullet)\|_2^2 \end{equation}

by Bessel's inequality. For $\Gamma = \mathbf {SL}_2(\mathbb {Z})$, such a kernel may be given by

(14)

\begin{equation} \vartheta(z,w;\zeta) = \Im(\zeta)^{{m/2}} \sum_{n=1}^{\infty} B_n(z,w) n^{(m-1)/{2}} e(n\zeta). \end{equation}

This may be used to recover (11) upon using the Hoffstein–Lockhart bound for $\|\tilde {f}\|_2$ (see [Reference Hoffstein and LockhartHL94]) and standard bounds for the incomplete Gamma function.

We prefer to employ the latter approach as it avoids translating spectral data back into geometric terms. Specifically, in (10), we have been able to replace the dual sum by the integral of the same amplified Bergman kernel on the left-hand side of (9). This step cannot be reproduced verbatim in the ramified cases. Instead, one would need to express the dual sum in terms of Fourier expansions of amplified Bergman kernels associated to various levels and different cusps. The approach using the theta correspondence avoids these issues altogether.

Whilst the constructions of theta kernels in great generality have been known for a while, see [Reference ShimizuShi72] or [Reference NelsonNel15, § 5 & Appendix B] for an explicit example, they are unfortunately generally not in $L^2$. An attempt to rectify this, would be to project such a theta kernel to $S_m(\Gamma )$. Formulae for such projections are given by Gross and Zagier [Reference Gross and ZagierGZ86, § IV.5]. However, we follow a different path. Motivated by the simplicity of the kernel $\vartheta$ in the case $\Gamma =\mathbf {SL}_2(\mathbb {Z})$ (14), we modify the general construction of a theta kernel to mirror a classical Bergman kernel of weight $m$. In order to show that the novel theta kernel behaves in the prescribed fashion, we use a method of Vignéras [Reference VignérasVig77] at the infinite place and compute the Fourier–Whittaker expansion in the $\zeta$-variable. We compare the latter with Shimizu's explicit form of the Jacquet–Langlands correspondence [Reference ShimizuShi72]. As a corollary, we derive a new elementary theta series for indefinite quadratic forms of signature $(2,2)$.

Theorem 2.1 Let $\mathcal {R}$ be an Eichler order of level $q$ in an indefinite division quaternion algebra over $\mathbb {Q}$ of reduced discriminant $D_B$. Denote by $\mathcal {R}^{+}$ the subset of elements of positive norm and by $\Gamma$ the subset of elements of norm equal to one. Furthermore, let $f \in S_m(\Gamma )$ be a cusp form of weight $m>2$. Then, for each $z \in \mathbb {H}$, the function $\mathcal {F}_{f}(z;\bullet )$, given by

(15)

\begin{equation} \mathcal{F}_{f}(z;\zeta) = \sum_{\alpha \in {\Gamma \backslash \mathcal{R}^+}} \operatorname{Nr}(\alpha)^{{m/2}-1} (f|_m\alpha)(z) e(\operatorname{Nr}(\alpha)\zeta), \end{equation}

is a cusp form of weight $m$ for $\Gamma _0(qD_B)$. Moreover, we have $\mathcal {F}_{T_n f}(z;\bullet ) = (T_n\mathcal {F}_{f})(z;\bullet )$ for $(n,qD_B)=1$.

Returning to the second-moment matrix count, we see that upon using partial summation we need to bound the number of solutions to

(16)

\begin{equation} \alpha_1,\alpha_2 \in \mathcal{R}: 1 \le \operatorname{Nr}(\alpha_1)=\operatorname{Nr}(\alpha_2) \le N, \quad u(\alpha_i z,z) \le \frac{1}{m}, \quad i=1,2. \end{equation}

Consider $z$ fixed for the moment. Then, we are given a quadratic equation in eight variables all of size $N^{{1/2}}$ with four additional linear inequalities of density $m^{-{1/2}}$. Heuristics suggest that we should have on the order of $(N^{{1/2}})^8\cdot N^{-1} \cdot (m^{-{1/2}})^4=N^3m^{-2}$ solutions for $N$ large. We see that for $N \le m$, $N^3m^{-2} \le N$, which is the bound we are aiming for. Moreover, by considering the order $\operatorname {Mat}_{2 \times 2}(\mathbb {Z})$ and the special point $z=i$, we see that the matrices of the shape $\left (\begin{smallmatrix} a & -b \\ b & a \end{smallmatrix}\right )$ with $1 \le a^2+b^2 \le N$ satisfy the conditions in (16) and give rise to a lower bound of size $N$. Likewise, we should expect that such subvarieties with exceptionally many solutions exist also for other special points and orders under consideration. Hence, the general estimate we seek is at the edge of what is achievable. This is in stark contrast to the classical approach of an amplifier, where one may consider matrices of reduced norm up to only a small power of $m$ in order to get a non-trivial result. However, the difficulty of the task at hand is rewarded with a sharp fourth moment estimate. In order to achieve the required bound, we rely on geometry of numbers arguments, which have been successful in the past for first moments (cf. [Reference Harcos and TemplierHT13]), in particular with regards to uniformity in the varying point $z$. To account for the additional quadratic equation, we decompose each matrix $\alpha _i$ into two parts: a multiple of the identity and a traceless part $\alpha _i^0$. To the traceless parts $\alpha _i^0$ we apply the geometry of numbers arguments. The quadratic equation now reads

\[ \operatorname{Nr} \alpha_1 = \frac{\operatorname{Tr}(\alpha_1)^2}{4}+\operatorname{Nr} \alpha_1^0 = \operatorname{Nr} \alpha_2 = \frac{\operatorname{Tr}( \alpha_2)^2}{4} + \operatorname{Nr} \alpha_2^0 \]

and we may use the divisor bound to bound the number of possibilities for the traces. This gives the required bound if and only if the traces are not equal in absolute value. The latter case needs to be dealt with separately. We do so by showing that there are essentially only a constant number of matrices $\alpha \in \mathcal {R}$ satisfying $u(\alpha z,z) \le {1/m}$ of a given trace and reduced norm $\le m$.

As a final remark, we address the natural question, whether the method lends itself to further amplification. Albeit it being straightforward to produce amplified versions of (11), the problem lies within the matrix count, where there is no further space for savings as all of the savings stemming from $u(\alpha z,z)\le {1/m}$ are used up by the fact that we already have to consider matrices of determinant $\le m$. Any additional increase in the size of the determinant will thus automatically increase the bound on the matrix count and subsequently the geometric side of (11) by a considerable amount.

3. The Weil representation and theta series

3.1 Inner forms of SL2

Let $B$ be a quadratic central simple algebra over $\mathbb {Q}$ and for each place $v$ denote $B_v\colon= B\otimes \mathbb {Q}_v$. We define the affine algebraic group $\mathbf {G}$ over $\mathbb {Q}$ as representing the group functor

\[ \mathbf{G}(L)=\big\{x\in B\otimes L \mid \operatorname{Nr}(x)=1\big\} \]

for all $\mathbb {Q}$-algebras $L$. The group $\mathbf {G}$ is an inner form of $\mathbf {SL}_2$, and all inner forms of $\mathbf {SL}_2$ over $\mathbb {Q}$ arise this way.

Fix a maximal order $R\subset B$ and define $R_v$ to be the completion of $R$ in $B_v$. For each finite place $v$ the order $R_v$ is maximal in $B_v$. For $v<\infty$ set $K_v\colon= R_v^1 < \mathbf {G}(\mathbb {Q}_v)$ to be the group of norm $1$ elements in $R_v$. If $B$ splits over $\mathbb {R}$ we fix once and for all an isomorphism $B_\infty \simeq \operatorname {Mat}_{2\times 2}(\mathbb {R})$ and use it to identify the two spaces. We then set $K_\infty =\mathbf {SO}_2(\mathbb {R})$ if $B$ splits over $\mathbb {R}$ and $K_\infty =\mathbf {G}(\mathbb {R})$ otherwise. For almost all $v$ the group $K_v$ is a hyperspecial maximal compact subgroup of $\mathbf {G}(\mathbb {Q}_v)$. We define $\mathbf {G}(\mathbb {A})$ as the restricted direct product

\[ \mathbf{G}(\mathbb{A})\colon- \bigg\{ (g_v)_v\in \prod_{v\leq \infty} \mathbf{G}(\mathbb{Q}_v) \mid g_v\in K_v \textrm{ for almost all } v \bigg\}. \]

3.2 Normalization of Haar measures

For a linear algebraic group $\mathbf {L}$ defined over $\mathbb {Q}$ we denote $[\mathbf {L}(\mathbb {A})]\colon= {\mathbf {L}(\mathbb {Q})\backslash \mathbf {L}(\mathbb {A})}$. Assume $[\mathbf {L}(\mathbb {A})]$ is of finite volume. We shall always integrate with respect to the probability Haar measure on $[\mathbf {L}(\mathbb {A})]$. Let $U<\mathbf {L}(\mathbb {A}_f)$ be a compact open subgroup. Then, $\mathbf {L}(\mathbb {R})$ acts on $[\mathbf {L}(\mathbb {A}))]_U\colon= \mathbf {L}(\mathbb {Q})\backslash \mathbf {L}(\mathbb {A})/{U}$ with finitely many orbits [Reference BorelBor63], and $[\mathbf {L}(\mathbb {A})]_U\simeq \bigsqcup _i {\Gamma _i}\backslash{\mathbf {L}(\mathbb {R})}$ with $\Gamma _i<\mathbf {L}(\mathbb {R})$ finitely many lattices. On $[\mathbf {L}(\mathbb {A})]_U$ we integrate with respect to the push-forward of the probability Haar measure on $[\mathbf {L}(\mathbb {A})]$. This measure is evidently an $\mathbf {L}(\mathbb {R})$-invariant probability measure. If $[\mathbf {L}(\mathbb {A})]_U\simeq {\Gamma \backslash \mathbf {L}(\mathbb {R})}$ is a single $\mathbf {L}(\mathbb {R})$-orbit, then this measure is the probability Haar measure on $ {\Gamma \backslash \mathbf {L}(\mathbb {R})}$.

On ${\mathbf {SL}_2(\mathbb {R})/\mathbf {SO_2(\mathbb {R})}}$ and ${\mathbf {Spin}_3(\mathbb {R})/ \mathbf {SO}_2(\mathbb {R})}$ we fix the standard Haar measures corresponding to the volume form of Gaussian curvature $\pm 1$ on the hyperbolic plane and the $2$-sphere. We fix the unique Haar measures on $\mathbf {SL}_2(\mathbb {R})$ and $\mathbf {Spin}_3(\mathbb {R})$ whose push-forward to the symmetric space coincides with the measure above.

On $\mathbf {SL}_2(\mathbb {Q}_p)$ and $\mathbf {PGL}_2(\mathbb {Q}_p)$ we fix the Haar measure giving volume $1$ to $\mathbf {SL}_2(\mathbb {Z}_p)$ and $\mathbf {PGL}_2(\mathbb {Z}_p)$, respectively. Let $\mathbb {D}_p$ be the unique ramified quaternion algebra over $\mathbb {Q}_p$ with ring of integers $\mathcal {O}(\mathbb {D}_p)$. Denote by $\mathbb {D}_p^{(1)}$ the group of norm $1$ element in $\mathbb {D}_p$. We fix the Haar measures on $\mathbb {D}_p^{(1)}$, $ {\mathbb {Q}_p^\times \backslash \mathbb {D}_p^\times }$ that give volume $1$ to the compact open subgroups $\mathcal {O}(\mathbb {D}_p)\cap \mathbb {D}_p^{(1)}$, ${\mathbb {Z}_p^\times \backslash \mathcal {O}(\mathbb {D}_p)^\times }$ respectively. These choices fix a Haar measure $m_{\mathbf {G}(\mathbb {Q}_p)}$ on $\mathbf {G}(\mathbb {Q}_p)$ for all primes $p$.

The product of the local Haar measures $m_{\mathbf {G}(\mathbb {Q}_p)}$ at all primes $p$ induce a Haar measure on $\mathbf {G}(\mathbb {A}_f)=\prod '_p \mathbf {G}(\mathbb {Q}_p)$, which we call the unnormalized Haar measure on $\mathbf {G}(\mathbb {A}_f)$. Similarly, we call the product of the fixed Haar measure on $\mathbf {G}(\mathbb {R})$ with the unnormalized Haar measure on $\mathbf {G}(\mathbb {A}_f)$, the unnormalized Haar measure on $\mathbf {G}(\mathbb {A})$. The unnormalized Haar measure on $\mathbf {G}(\mathbb {A})$ is necessarily proportional to the covolume $1$ measure, but they are not equal. Our local measure normalization forces $m_{\mathbf {G}(\mathbb {Q}_p)}(K_p)=1$ for all primes $p$, hence the volume of $[\mathbf {G}(\mathbb {A})]$ with respect to the unnormalized measure is not $1$, but rather the sum of the volumes $m_{\mathbf {G}(\mathbb {R})}({R_i^{(1)}\backslash \mathbf {G}(\mathbb {R})})$ for orders $R_i\subset B$ representing all the classes in the class setFootnote ⁴ of $R$, where $R$ is the maximal order from above. Denote by $\varrho _{\mathbf {G}}$ the volume of $[\mathbf {G}(\mathbb {A})]$ with respect to the unnormalized measure. In the indefinite case, the class number is $1$ and the volume is $\varrho _{\mathbf {G}}=(\pi/3) \varphi (D_B)$ (see [Reference VoightVoi21, Theorem 39.1.2]). Exactly the same formula holds in the definite case, due to the Eichler mass formula [Reference VoightVoi21, Theorem 25.1.1]. We henceforth fix the Haar measure $m_{\mathbf {G}(\mathbb {A}_f)}$ on $\mathbf {G}(\mathbb {A}_f)$ to be the measure induced by $\varrho _{\mathbf {G}}^{-1} \prod _p m_{\mathbf {G}(\mathbb {Q}_p)}$. The product $m_{\mathbf {G}(\mathbb {R})}\times m_{\mathbf {G}(\mathbb {A}_f)}$ is the co-volume $1$ Haar measure. The same discussion applies mutatis mutandi to $\mathbf {SL}_2$.

Note that we have several normalizations of the Haar measure on $\mathbf {G}(\mathbb {R})$. When integrating over a quotient by a lattice ${\Gamma \backslash \mathbf {G}(\mathbb {R})}$ we always use the co-volume $1$ Haar measure. When integrating over $\mathbf {G}(\mathbb {R})$ we use the standard measure $m_{\mathbf {G}(\mathbb {R})}$ which is not a co-volume $1$ measure in general. The discrepancy is accounted for by the factor $\varrho _{\mathbf {G}} ^{-1}$ in the Haar measure of $\mathbf {G}(\mathbb {A}_f)$. The same discussion applies to $\mathbf {SL}_2$.

3.3 Local Weil representation

In this section, the field $F=\mathbb {Q}_v$ is a completion of $\mathbb {Q}$ at a place $v$, then $B_v$ is a quadratic central simple algebra over $F$, i.e. $B_v=\operatorname {Mat}_{2\times 2}(F)$ or $B_v$ is the unique quadratic division algebra over $F$. Denote by $x\mapsto ^\iota {x}{}$ the canonical involution on $B_v$. In the split case, the involution sends a matrix to its adjugate. Denote the reduced norm on $B_v$ by $\operatorname {Nr}$ and the reduced trace by $\operatorname {Tr}$. We shall also fix a unitary additive character $\psi _v\colon F\to \mathbb {C}^\times$. In this section, we recall the construction and elementary properties of the Weil representation.

The vector space $B_v$ is endowed with an additive Haar measure. For an integrable function $M\colon B_v\to \mathbb {C}$, we define the Fourier transform by

\[ \mathcal{F}{M}(x)=\int M(y)\psi_v(\langle x,y \rangle) \,{d} y, \]

where the bilinear form $\langle \, ,\rangle$ is defined by

\[ \langle x,y \rangle\colon= \operatorname{Tr}(x ^\iota{y}{}). \]

Note that this is twice the polarization of the norm quadratic form, i.e. $\langle x,x\rangle =2 \operatorname {Nr} x$. We normalize the measure on $B_v$ so that it is Fourier self-dual, i.e. $\mathcal {F}^2 M (x)=M(-x)$ for a Schwartz function $M$.

If $v$ is non-archimedean denote by $\Omega _v$ the space of Schwartz–Bruhat functions on $B_v$, i.e. locally constant functions of compact support. At the archimedean place, we need to consider a space that differs from the space of Schwartz functions because the Bergman kernel does not arise from a Schwartz function. To construct $\Omega _\infty$, we will start first with a larger space $L^2(B_\infty )$ and then restrict the Weil representation to a subspace $\Omega _\infty$ to be defined later.

The Weil representation of $\mathbf {SL}_2(F)$ on $\Omega _v$, $L^2(B_\infty )$ satisfies

\begin{align*} \rho\bigg(\!\!\begin{pmatrix} 1 & \sigma \\ 0 & 1 \end{pmatrix}\!\!\bigg) M (x) &= \psi_v\big(\sigma \operatorname{Nr}(x)\big) M(x),\\ \rho\bigg(\!\!\begin{pmatrix} \lambda & 0 \\ 0 & \lambda^{-1} \end{pmatrix}\!\!\bigg) M (x) &= |\lambda|_v^{2} M(\lambda x),\\ \rho\bigg(\!\!\begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}\!\!\bigg) M (x) &= \gamma \mathcal{F}{M}(x), \end{align*}

where $\gamma =1$ if $B_v$ is split and $\gamma =-1$ otherwise. For a proof that this defines a representation, see [Reference Jacquet and LanglandsJL70, § 1.1].

Note that the representation depends on the choice of an additive character $\psi _v$. We will usually suppress this dependence in the notation, but when we need to keep track of the character we shall write $\rho _{\psi _v}$. Because $\mathbb {Q}_v$ is Fourier self-dual, all non-trivial additive characters are of the form $^\varpi{\psi }{_v}(a)=\psi _v(a\varpi )$ for some $\varpi \in \mathbb {Q}_v^\times$. We see that

(17)

\begin{equation} \rho_{^\varpi{\psi}{_v}}(g)=\rho_{\psi_v}(\operatorname{diag}(\varpi,1) g \operatorname{diag}(\varpi,1)^{-1}). \end{equation}

Lemma 3.1 Let $\operatorname {O}(B_v,\operatorname {Nr})$ be the group of linear transformations preserving the norm form, this group acts on functions by $u.M(x)=M(u^{-1}x)$. The action of the orthogonal group $\operatorname {O}(B_v,\operatorname {Nr})$ commutes with the action of $\mathbf {SL}_2(F)$ via $\rho$.

Proof. It is sufficient to verify the claim for each of the formulae above. The action of the upper triangular matrices commutes with the action of any linear transformation that preserves the norm. The Fourier transform intertwines the action of $L\in \operatorname {GL}(B_v)$ with the action of $|\operatorname {det} L|_v^{-1t}L^{-1}$. Hence, it commutes with orthogonal transformations.

Corollary 3.2 The $\rho$ action of $\mathbf {SL}_2(F)$ commutes with the right and left actions of $\mathbf {G}(F)$ by multiplication. Moreover, the $\rho$ action commutes with the $B^\times$-action by conjugation.

Proof. The actions of $B^\times$ and $\mathbf {G}(F)$ preserve the norm form, hence they factor through the orthogonal group.

Lemma 3.3 The Weil representation is a continuous unitary representation of $\mathbf {SL}_2(F)$ on $\Omega _v$, $L^2(B_\infty )$.

Proof. This is established by Weil [Reference WeilWei64] for the space of Schwartz or Schwartz–Bruhat functions. The same proof works for $L^2(B_\infty )$.

3.4 The Archimedean Weil representation

To construct the appropriate subspace $\Omega _\infty \subset L^2(B_\infty )$, we will use a method based on the work of Vignéras [Reference VignérasVig77]. We define the Laplacian $\Delta$ on $B_\infty$ as the Fourier multiplier operator with symbol $-4\pi ^2 \operatorname {Nr}$. Write the archimedean additive character as $\psi _\infty (a)=\exp (2\pi i a \varpi )$ and consider the partial differential equation (PDE)

(18)

\begin{equation} -\Delta M(x)+\omega^2 \operatorname{Nr}(x)M(x)= \frac{\omega m }{\varpi}M(x), \end{equation}

where $m\in \mathbb {Z}$ and $\omega >0$. This is nothing but the PDE for energy eigenstates of four independent quantum harmonic oscillators with total energy $(\omega \varpi ) m$ and angular frequency $\omega \varpi$. We call $m$ the quantum number of the equation and we denote by $V_{m,\omega }$ the $L^2$-closure of the space of Schwartz solutions to (18). Note that unlike the standard harmonic oscillator, the individual oscillators may have either positive or negative energy depending on the signature of the quadratic form $\operatorname {Nr}$.

We fix henceforth $\psi _\infty (a)=\exp (2\pi ia)$, i.e. $\varpi =1$. Consider the densely defined linear operator $L_\omega \colon L^2(B_\infty )\to L^2(B_\infty )$ given by $L_\omega [M]=-\Delta M +\omega ^2 \operatorname {Nr}(x)\cdot M$ with the domain of Schwartz functions $D(L_\omega )=\mathcal {S}(B_\infty )$. Then, $L_\omega$ is real, i.e. $\langle L_\omega [M],M \rangle \in \mathbb {R}$ for all $M\in \mathcal {S}(B_\infty )$. Hence, $L_\omega$ is symmetric. For explicitness, we state the following classical linear algebra lemma.

Lemma 3.4 The spaces $\big \{V_{m,\omega }\big \}_{m\in \mathbb {Z}}$ are mutually orthogonal.

Proof. It is enough to show that if $M,M'$ are Schwartz solutions to (18) with quantum numbers $m\neq m'$, then $\langle M, M' \rangle =0$. Because the operator $L_\omega$ is symmetric, we have $\langle {\omega m} M,M' \rangle = \langle L[M],M' \rangle =\langle M, L[M'] \rangle =\langle M, {\omega m'} M' \rangle$. We deduce that $\langle M,M' \rangle =0$ in the usual fashion.

Lemma 3.5 Let $k_{\theta }\colon= \left (\begin{smallmatrix} \cos \theta & \sin \theta \\ -\sin \theta & \cos \theta \end{smallmatrix}\right )\in \mathbf {SO}_2(\mathbb {R})$ and set $\psi _\infty (a)=\exp (2\pi i a)$. Then, for every $M\in V_{m,2\pi }$ we have

\[ \big(\rho\big(k_{\theta}\big).M\big) (x)= e^{im\theta} M(x). \]

Moreover, $L^2(B_\infty )=\bigoplus _{m\in \mathbb {Z}} V_{m,2\pi }$. Therefore, $V_{m,2\pi }$ is the $\big (\rho (\mathbf {SO}_2(\mathbb {R})),e^{im\theta }\big )$-isotypic subspace of $L^2(B_\infty )$.

The idea to use the one-dimensional Hermite functions in the proof of the lemma has been suggested to us by J. Wunsch.

Proof. We establish first the direct sum decomposition. Recall that we need the Laplacian in (18) to be defined consistently as having Fourier symbol $-4\pi ^2 \operatorname {Nr}$. Choose a coordinate system $x_1,\ldots,x_4$ for $B_\infty$ such that $\operatorname {Nr}(x)=\sum _{i=1}^4 \epsilon _i x_i^2$ with $\epsilon _i\in \{\pm 1\}$. The Laplacian for our fixed character can be written in this coordinate system as

\[ \frac{1}{4}\sum_{i=1}^{4} \epsilon_i \frac{\partial^2}{\partial x_i^2}. \]

The $1/4$ factor appears because the Fourier transform is defined with respect to the bilinear form $\sum _{i=1}^4 2\epsilon _i x_i y_i$. The space of solutions to the one-dimensional quantum harmonic oscillator with angular frequency $4\pi$, $n\in \mathbb {Z}_{\geq 0}$,

\[ -\tfrac{1}{4}f''(x)+4\pi^2 x^2 f(x)=(2n+1)\pi f(x) \]

is one-dimensional and the $L^2$-normalized solution is

\[ f_n(x)\colon=\frac{1}{\sqrt{2^{n-1} n!}}\exp\big({-}2\pi x^2\big) H_n(2\sqrt{\pi} x), \]

where $H_n$ are the physicist's Hermite polynomials. Moreover, these solutions form an orthonormal basis of the Hilbert space $L^2(\mathbb {R})$. Define for every $\underline {n}=(n_1,n_2,n_3,n_4)\in \mathbb {Z}_{\geq 0}^4$ the function $M_{\underline {n}}\colon B_\infty \to \mathbb {C}$ by

\[ M_{\underline{n}}(x) = \prod_{j=1}^4 f_{n_j}(x_j) = \prod_{j=1}^4 \frac{1}{\sqrt{2^{n_j-1}n_j!}} \exp\big({-}2\pi x_j^2\big) H_{n_j}(2\sqrt{\pi} {x_j}), \]

where $x_1,\ldots,x_4$ are the normal form coordinates for the quadratic form $\operatorname {Nr}(x)$. Because $L^2(B_\infty )\simeq L^2(\mathbb {R})^{\bigotimes 4}$, we deduce that the functions $M_{\underline {n}}$ form an orthonormal basis of $L^2(B_\infty )$. These are Schwartz functions, and a separation of variables computation shows that $M_{\underline {n}}$ solves (18) withFootnote ⁵ $2m=\sum _{j=1}^4 \epsilon _j (2n_j+1)$. This and Lemma 3.4 establish that $L^2(B_\infty )=\bigoplus _{m\in \mathbb {Z}} V_{m,2\pi }$ as claimed.

We need to prove that if $M\in V_{m,2\pi }$ then $\rho (k_\theta ).M=e^{i m \theta }M$ for all $\theta \in [0,2\pi )$. By continuity of the Weil representation it is enough to establish this for Schwartz functions. Because Schwartz functions are smooth vectors for the Weil representation it is sufficient to show $({{d/d} \theta } )\rho (k_\theta ).M=i m \big (\rho (k_\theta ).M\big )$. Because the group $\mathbf {SO}_2(\mathbb {R})$ is abelian it is enough to verify this ordinary differential equation (ODE) at $\theta =0$. The formula $k_\theta =\exp ( \theta w)$ for $w=\left (\begin{smallmatrix} 0 & 1 \\ -1 & 0 \end{smallmatrix}\right )$ implies that the ODE at $\theta =0$ is equivalent to

(19)

\begin{equation} {d} \rho(w). M= im M, \end{equation}

where ${d} \rho$ is the Lie algebra representation of $\mathfrak {sl}_2(\mathbb {R})$ on $\mathcal {S}(B_\infty )$ differentiated from the Weil representation of $\mathbf {SL}_2(\mathbb {R})$. Using the definition of the Weil action for upper diagonal unipotent matrices one easily computes that $\big ({d}\rho \left (\begin{smallmatrix} 0 & 1 \\ 0 & 0 \end{smallmatrix}\right ).M\big )(x)=2\pi i \operatorname {Nr} (x) M(x)$. The formula $\left (\begin{smallmatrix} 0 & 0 \\ -1 & 0 \end{smallmatrix}\right )=w^{-1} \left (\begin{smallmatrix} 0 & 1 \\ 0 & 0 \end{smallmatrix}\right ) w$, then implies that ${d}\rho \left (\begin{smallmatrix} 0 & 0 \\ -1 & 0 \end{smallmatrix}\right ).M=({1/2\pi i})\Delta M$, and $({d} \rho (w).M)(x)=({1/2\pi i} )\Delta M(x)+2\pi i \operatorname {Nr}(x) M(x)$. Thus, (19) is equivalent to (18).

Corollary 3.6 Let $M\in V_{m, \omega }$ for arbitrary $\omega >0$ and fix $g=\left (\begin{smallmatrix} a & b\\ c & d \end{smallmatrix}\right )\in \mathbf {SL}_2(\mathbb {R})$. Then,

\[ (\rho(g).M)(x)= \frac{2 \pi}{\omega} \frac{1}{D^2} \psi_\infty\bigg(\frac{bd \frac{2\pi}{\omega} +ac \frac{\omega}{2\pi}}{D^2} \operatorname{Nr} x \bigg)\bigg(\sqrt{\frac{2\pi}{\omega}}\frac{d}{D}-i\sqrt{\frac{\omega}{2\pi}}\frac{c}{D}\bigg)^m M\bigg(\sqrt{\frac{2 \pi }{ \omega}}\frac{x}{D}\bigg), \]

where $D=\sqrt {c^2 ({\omega/2 \pi }) + d^2({2 \pi / \omega }})$.

Proof. We use the notation $a\colon= \operatorname {diag}\big (\sqrt { {2\pi / \omega }},\sqrt {{\omega/2\pi }}\big )$ and write $\rho (g).M=\rho (g a^{-1}) \rho (a).M$. The Iwasawa decomposition of $g a^{-1}$ is

\[ g a^{-1}=\begin{pmatrix} 1 & \frac{bd \frac{2 \pi}{\omega}+ac \frac{\omega}{2 \pi}}{D^2} \\ 0 & 1 \end{pmatrix} \begin{pmatrix} \frac{1}{D} & 0 \\ 0 & D \end{pmatrix} \begin{pmatrix} \sqrt{\frac{2\pi}{\omega}}\frac{d}{D} & -\sqrt{\frac{\omega}{2\pi}}\frac{c}{D}\\ \sqrt{\frac{\omega}{2\pi}}\frac{c}{D} & \sqrt{\frac{2\pi}{\omega}}\frac{d}{D} \end{pmatrix}. \]

The function

\[ (\rho(a).M)(x) =\frac{2 \pi}{\omega}M\bigg(\sqrt{\frac{2 \pi}{\omega}}x\bigg) \]

is a solution of (18) with angular frequency $2\pi$. Hence, we can apply Lemma 3.5 to $\rho (g a^{-1}) \rho (a).M$ and the Iwasawa decomposition of $g a^{-1}$.

Definition 3.7 Fix $m\in \mathbb {Z}$. We are now ready to define the subspace $\Omega _\infty < L^2(B_\infty )$. This space will depend on a choice of $m$. Recall that $V_{m,\omega }$ is the $L^2$-completion of the space of solutions of the quantum harmonic oscillator, (18), for a fixed $m\in \mathbb {Z}$ and $\omega >0$. Define

\begin{align*} \Omega_\infty\colon= \operatorname{Span}_{\mathbb{C}}\big\{&\psi_\infty(\sigma \operatorname{Nr} (x)) M(x) \mid \sigma\in\mathbb{R}, \exists \omega>0 \colon M\in V_{m,\omega}\\ &\quad \textrm{ and } \exists \delta>0\colon |M(x)| \ll (1+\|x\|)^{-4-\delta} \big\}. \end{align*}

The span allows only for finite linear combinations. In other words, $\Omega _\infty$ is the space generated by orbits of functions in $\bigcup _{\omega >0} V_{m,\omega }$ satisfying a decay condition at infinity under the Weil action of unipotent matrices. The decay condition implies that any function in $\Omega _\infty$ is in $L^p(B_\infty )$ for all $p\geq 1$.

Proposition 3.8 The space $\Omega _\infty$ is invariant under the Weil representation and the action of $\operatorname {O}(B_\infty,\operatorname {Nr})$.

Proof. The space $V_{m,\omega }$ is invariant under $\operatorname {O}(B_\infty,\operatorname {Nr})$ because (18) commutes with orthogonal transformations. Also, the decay condition is invariant under orthogonal transformations. Invariance under the Weil action follows from Corollary 3.6.

Remark We note that we may assume the functions in $\Omega _{\infty }$ to be continuous, since we may replace them with the Fourier inverse of its Fourier transform as it converges absolutely uniformly on compacta due to the decay condition.

3.5 The non-Archimedean Weil representation

We now describe the interaction between the Weil representation and Eichler orders in $B_v$ for $v<\infty$. In this section, we fix a prime $p$ and write $F=\mathbb {Q}_p$ and set $v$ to be the associated place. For clarity of notation, we will write $B_p\colon= B_v$. We assume that $\psi _p=\psi _v$ is an unramified character.

Definition 3.9 Let $\mathcal {R}\subset B_p$ be an order. Then, the dual lattice $\widehat {\mathcal {R}}$ is defined as

\[ \mathcal{\widehat{\mathcal{R}}}=\big\{x \in B_p \mid \forall x\in \mathcal{R}\colon \operatorname{Tr}(x^\iota{y}{})\in \mathbb{Z}_p \big\}. \]

We begin by discussing maximal orders.

Definition 3.10 Set $U_0(p^n)<\mathbf {SL}_2(\mathbb {Z}_p)$ to be the congruence subgroup defined by

\[ U_0(p^n)\colon= \begin{pmatrix} \mathbb{Z}_p & \mathbb{Z}_p \\ p^n\mathbb{Z}_p & \mathbb{Z}_p \end{pmatrix} \cap \mathbf{SL}_2(\mathbb{Z}_p). \]

Lemma 3.11 Let $\mathcal {R}\subset B_p$ be a maximal order. If $B_p$ is split, then $\rho \big (\mathbf {SL}_2(\mathbb {Z}_p)\big ).\mathbb {1}_{\mathcal {R}}=\mathbb {1}_{\mathcal {R}}$. If $B_p$ is ramified, then $\rho \big (U_0(p)\big ).\mathbb {1}_{\mathcal {R}}=\mathbb {1}_{\mathcal {R}}$.

Remark This lemma is closely related to Lemmata 7 and 10 of [Reference ShimizuShi72].

Proof. All maximal orders in $B_p$ are conjugate to each other by an element of $B_p^\times$. Because the Weil action commutes with conjugation, it is enough to prove the claim for a specific maximal order. Moreover, the group $\mathbf {SL}_2(\mathbb {Z}_p)$ is generated by the subgroup $\mathcal {P}<\mathbf {SL}_2(\mathbb {Z}_p)$ of upper triangular integral matrices and the involution $w$. The fact that $\mathbb {1}_\mathcal {R}$ is invariant under $\mathcal {P}$ follows because we have assumed $\psi _v$ is unramified. If $B_p$ is split, fix an isomorphism $B_p\simeq \operatorname {Mat}_{2\times 2}(\mathbb {Q}_p)$ and we need only show that $\mathbb {1}_{\operatorname {Mat}_{2\times 2}(\mathbb {Z}_p)}$ is invariant under the Fourier transform. This follows from the fact that $\mathbb {1}_{\mathbb {Z}_p}$ is invariant under the Fourier transform on $\mathbb {Q}_p$ with an unramified character.

If $B_p$ is a division algebra, we need only show invariance under the element $\left (\begin{smallmatrix} 1 & 0 \\ p & 1 \end{smallmatrix}\right ) =-w \left (\begin{smallmatrix} 1 & -p\\ 0 & 1 \end{smallmatrix}\right ) w$. This element and the upper triangular integral matrices generate $U_0(p)$. Because of the duality of the Fourier transform, this is equivalent to showing that $\rho \big (\!\left (\begin{smallmatrix} 1 & -p\\ 0 & 1 \end{smallmatrix}\right )\!\big ).\mathcal {F}\mathbb {1}_{\mathcal {R}}=\mathcal {F}\mathbb {1}_{\mathcal {R}}$.

Let $E=F(\sqrt {a})/F$ be the unique unramified quadratic extension and write $B_p$ as the cyclic algebra $\big (\frac {a,p}{\mathbb {Q}_p}\bigr )$ with the standard generators $i,j,k$ and $i^2=a$, $j^2=p$ and $ij=-ji=k$. As usual, we identify $E$ with the sub-ring $\mathbb {Q}_p+i\mathbb {Q}_p< B_p$. Denote by $\mathcal {O}_E$ the maximal order of $E$. Then, the unique maximal order of $B_p$ is $\mathcal {R}\simeq \mathcal {O}_E+j\mathcal {O}_E$. The Fourier transform on $B_p$ descends to the standard Fourier transform on $E$ with an unramified character. Identifying $B_p\simeq E\times E$ via $a+jb\mapsto (a,b)$, we can write the Fourier self-dual measure on $B_p$ in these coordinates as $p^{-1}m_E\times m_E$. The $p^{-1}$ factor normalizes the measure to be self-dual.

The Fourier transform on $E$ satisfies $\mathcal {F} \mathbb {1}_{\mathcal {O}_E}=\mathbb {1}_{\mathcal {O}_E}$. An explicit computation with the Fourier self-dual measure implies

\[ \mathcal{F} \mathbb{1}_{\mathcal{O}_E+j\mathcal{O}_E}=p^{-1}\mathbb{1}_{\mathcal{O}_E+j^{-1}\mathcal{O}_E}. \]

Hence, $\operatorname {Nr} x\in p^{-1}\mathbb {Z}_p$ for all $x\in \operatorname {supp} \mathcal {F} \mathbb {1}_{\mathcal {R}}$, from which we deduce $\big (\rho \big (\!\!\left (\begin{smallmatrix} 1 & -p \\ 0 & 1 \end{smallmatrix}\right )\!\!\big ). \mathcal {F} \mathbb {1}_{\mathcal {R}}\big )(x)=\psi _v(-p\operatorname {Nr} x) \big (\!\mathcal {F} \mathbb {1}_{\mathcal {R}}\big )(x)=\big (\!\mathcal {F} \mathbb {1}_{\mathcal {R}}\big )(x)$ and the claim follows.

Lemma 3.12 Assume $B_p$ is ramified and let $\mathcal {R}\subset B_p$ be the unique maximal order. Then, there is an isomorphism of finite abelian additive groups

\[ \jmath_v\colon \widehat{\mathcal{R}}/\mathcal{R}\to \mathbb{F}_{p^2} \]

such that $-p\operatorname {Nr} x \bmod p\mathbb {Z}_p \equiv \operatorname {Nr} \jmath _v(x)$ for all $x\in \widehat {\mathcal {R}}$. The norm on the right-hand side is the field norm $\mathbb {F}_{p^2}\to \mathbb {F}_p$.

Moreover, if $j$ is a uniformizer of $\mathcal {R}$, then we can choose $\jmath _v$ so that the composite map

\[ \mathcal{R}/j\mathcal{R}\xrightarrow{x\mapsto j^{-1}x}\widehat{\mathcal{R}}/\mathcal{R}\xrightarrow{\jmath_v} \mathbb{F}_{p^2} \]

is a field isomorphism.

Note that there are exactly two field isomorphisms $\mathcal {R}/j\mathcal {R}\to \mathbb {F}_{p^2}$ and they differ by post-composition with the Frobenius, i.e. by the action of the Galois group. If $f\colon \mathcal {R}/j\mathcal {R}\to \mathbb {F}_{p^2}$ is such an isomorphism, then $f(j x j^{-1})=f(x)^p$, i.e. conjugation by $j$ is intertwined with the Frobenius. Hence, the composition of $\mathcal {R}/j\mathcal {R}\xrightarrow {x\mapsto xj^{-1}}\widehat {\mathcal {R}}/\mathcal {R}\xrightarrow {\jmath _v} \mathbb {F}_{p^2}$ is necessarily also a field isomorphism differing from the original one by post-composition with the Frobenius.

Proof. Let $j$ be a uniformizer in $\mathcal {R}$. The field norm on $\mathcal {R}/j\mathcal {R}\simeq \mathbb {F}_{p^2}$ coincides with the reduction $\bmod p$ of the reduced norm in $\mathcal {R}$. This can be seen by taking a subfield $E\subset B_p$, such that $E$ is an unramified quadratic extension of $F$ that splits $B_p$. In these coordinates, we can write

(20)

\begin{equation} B_p = E+jE, \quad \mathcal{R}=\mathcal{O}_E+j\mathcal{O}_E, \quad \widehat{\mathcal{R}}=\mathcal{O}_E+j^{-1}\mathcal{O}_E. \end{equation}

Now, the restriction of the reduced norm in $B_p$ to $E$ coincides with the field norm on $E$ and $\mathcal {R}/j\mathcal {R}=\mathcal {O}_E/p\mathcal {O}_E\simeq \mathbb {F}_{p^2}$.

Observe that $\widehat {\mathcal {R}}= j^{-1} \mathcal {R}$ and $\widehat {\mathcal {R}}/\mathcal {R}\simeq \mathcal {R}/j\mathcal {R}\simeq \mathbb {F}_{p^2}$. The last isomorphism is a field isomorphism and, thus, it commutes with taking norms. The first isomorphism is via the map $x\mapsto j x$ and $\operatorname {Nr} (j x)=-p\operatorname {Nr} x$. This establishes the claimed formula for norms. To summarize, the map $\jmath _v$ may thus be given explicitly as

(21)

\begin{equation} \jmath_v(a+j^{-1}b)=b \mod p\mathcal{O}_E, \end{equation}

for $a+j^{-1}b \in \widehat {\mathcal {R}}$ with $a,b \in \mathcal {O}_E$.

Proposition 3.13 Assume $B_p$ is ramified and let $\mathcal {R}\subset B_p$ be the unique maximal order. Then,

\[ \rho(\mathbf{SL}_2(\mathbb{Z}_p)).\mathbb{1}_{\mathcal{R}} =\bigg\{{-}p^{-1}\psi\bigg(\frac{\operatorname{Nr} x}{t}\bigg)\mathbb{1}_{\widehat{\mathcal{R}}} \mid 0< t< p \bigg\} \cup \big\{\mathbb{1}_{\mathcal{R}},-p^{-1}\mathbb{1}_{\widehat{\mathcal{R}}} \big\}. \]

Moreover, each of the functions above corresponds to a single coset in ${\mathbf {SL}_2(\mathbb {Z}_p)/ U_0(p)}$.

Remark 3.14 Because $\operatorname {Nr} x \in p^{-1}\mathbb {Z}_p$ and $\psi$ is unramified, we can rewrite the result above as

\[ \rho(\mathbf{SL}_2(\mathbb{Z}_p)).\mathbb{1}_{\mathcal{R}} =\big\{\mathbb{1}_{\mathcal{R}} \big\} \cup \Big\{{-}p^{-1}\psi\big(t \operatorname{Nr} x\big)\mathbb{1}_{\widehat{\mathcal{R}}} \mid t\in^{\mathbb{Z}}/{p} \mathbb{Z}\Big\}. \]

Proof. As in the previous lemma, we put coordinates on $B_p$ corresponding to the cyclic algebra $\big (\frac {a,p}{\mathbb {Q}_p}\big )$ where $E=F(\sqrt {a})/F$ is the unique unramified quadratic extension, see (20). We shall make use of the explicit map $\jmath _v$ given by (21). It is an isomorphism of abelian groups $\widehat {\mathcal {R}}/\mathcal {R}\to \mathbb {F}_{p^2}$ and $-p \operatorname {Nr} x \bmod p\mathbb {Z}_p \equiv \operatorname {Nr} \jmath _v(x)$ for all $x\in \widehat {\mathcal {R}}$, where the norm on the right-hand side is the field norm $\mathbb {F}_{p^2}\to \mathbb {F}_p$.

For each $\alpha \in \mathbb {F}_{p^2}$, fix a representative $x_\alpha \in \jmath _v^{-1}(\alpha )$. Then,

(22)

\begin{equation} \widehat{\mathcal{R}}=\bigsqcup_{\alpha\in \mathbb{F}_{p^2}} \big(x_\alpha+\mathcal{R}\big). \end{equation}

Explicitly, for each $\alpha \in \mathbb {F}_{p^2}$, we take $x_\alpha =j^{-1}\check {\alpha }$, where $\check {\alpha }\in \mathcal {O}_E$ satisfies $\check {\alpha } \bmod p\mathcal {O}_E=\alpha$. The duality between $\mathcal {R}$ and $\widehat {\mathcal {R}}$ implies that $\operatorname {Nr}(x_\alpha +\mathcal {R})\subset \operatorname {Nr} x_\alpha + \mathbb {Z}_p$.

Recall that $\mathbb {1}_{\mathcal {R}}$ is $\rho (U_0(p))$-invariant. Hence, we need only calculate the action of each representative of $\mathbf {SL}_2(\mathbb {Z}_p)/U_0(p)$ on $\mathbb {1}_{\mathcal {R}}$. A set of representatives is given by $w$, $\left (\begin{smallmatrix} 1 & 0 \\ t & 1\end{smallmatrix}\right )$, $0\leq t< p$. The action of $\rho (w)$ is the Fourier transform (multiplied by $\gamma =-1$) and we have already seen in the proof of Lemma 3.11 that

\[ \rho(w).\mathbb{1}_{\mathcal{R}}=-p^{-1}\mathbb{1}_{\widehat{\mathcal{R}}}. \]

Write $x=a+j^{-1}b$ for $x\in \widehat {\mathcal {R}}$. Now, we compute the action of $\left (\begin{smallmatrix} 1 & 0 \\ t & 1\end{smallmatrix}\right )=-w\left (\begin{smallmatrix} 1 & -t \\ 0 & 1 \end{smallmatrix}\right ) w$ usingFootnote ⁶ (22)

\begin{align*} \bigg(\rho\bigg(\!\!\begin{pmatrix}1 & 0 \\ t & 1\end{pmatrix}\!\!\bigg).\mathbb{1}_{\mathcal{R}}\bigg)(x)&=-p^{-1}\rho(w). \big(\psi(-t\operatorname{Nr} y)\mathbb{1}_{\widehat{\mathcal{R}}}(y)\big)(x)\\ &=-p^{-1}\rho(w).\bigg(\sum_{\alpha\in\mathbb{F}_{p^2}} \psi(-t\operatorname{Nr} x_\alpha) \mathbb{1}_{x_\alpha+\mathcal{R}}\bigg)(x)\\ &=p^{-2}\mathbb{1}_{\widehat{\mathcal{R}}}(x) \sum_{\alpha\in\mathbb{F}_{p^2}} \psi(-t\operatorname{Nr} x_\alpha+\langle x, x_\alpha \rangle). \end{align*}

If $t\neq 0$, then the sum above becomes

\begin{align*} \sum_{\alpha\in\mathbb{F}_{p^2}} \psi\bigg(\frac{\operatorname{Nr} x- \operatorname{Nr}(x-tx_\alpha)}{t} \bigg)&= \psi\bigg(\frac{\operatorname{Nr} x}{t}\bigg)\sum_{\alpha\in\mathbb{F}_{p^2}}\psi\bigg(\frac{\operatorname{Nr}(b-t\alpha)}{tp}\bigg) =\psi\bigg(\frac{\operatorname{Nr} x}{t}\bigg)\sum_{\alpha_0\in\mathbb{F}_{p^2}}\psi\bigg(\frac{\operatorname{Nr} \alpha_0}{tp}\bigg)\\ &=\psi\bigg(\frac{\operatorname{Nr} x}{t}\bigg)\bigg((p+1)\sum_{\beta\in\mathbb{F}_p^\times} \psi\bigg(\frac{\beta}{p}\bigg)+1\bigg) =-p\psi\bigg(\frac{\operatorname{Nr} x}{t}\bigg). \end{align*}

We have used the fact that every element of $\mathbb {F}_p^\times$ is the norm of exactly $p+1$ elements in $\mathbb {F}_{p^2}^\times$. This establishes the claim for $0< t< p$. For $t=0$, the sum becomes

\[ \sum_{\alpha\in\mathbb{F}_{p^2}} \psi\bigg(-\frac{\operatorname{Tr}(\alpha b)}{p}\bigg)= \begin{cases} p^2, & b\equiv 0 \bmod p\mathcal{O}_E,\\ 0, & \textrm{otherwise.} \end{cases} \]

In addition, $\mathbb {1}_{\widehat {\mathcal {R}}}(x)\delta _{b\equiv 0 \bmod p\mathcal {O}_E}(x)=\mathbb {1}_{\mathcal {R}}(x)$. Of course, the case of $t=0$ is actually trivial to compute because it corresponds to the identity representative.

Lemma 3.15 Let $M\colon B_p\to \mathbb {C}$ be a Schwartz–Bruhat function. Then, there is an open subgroup $U_M<\mathbf {SL}_2(\mathbb {Z}_p)$ such that $\rho (U_M).M=M$. In particular, $\rho \big (\mathbf {SL}_2(\mathbb {Z}_p)\big ).M$ is a finite set.

Proof. Fix a maximal order $\mathcal {R}\subset B_p$. We first claim that for every Schwartz–Bruhat function $M\colon B_p\to \mathbb {C}$ there is some diagonal matrix $a\in \mathbf {SL}_2(\mathbb {Q}_p)$ such that $\rho (a).M$ is a linear combination of translates of $\mathbb {1}_{\mathcal {R}}$. Equivalently $\rho (a).M(x+\mathcal {R})=\rho (a).M(x)$. Because $M$ is Schwartz–Bruhat, there is a small neighborhood of the origin $\mathcal {V}\subset B_p$, such that $M(x+\mathcal {V})=M(x)$. There is $k\geq 1$ such that $p^k \mathcal {R}\subset \mathcal {V}$. The function $x\mapsto M(p^{k}x)$ is invariant under translations by $\mathcal {R}$. Set $a=\operatorname {diag}(p^k,p^{-k})$, then $\rho (a).M$ is a linear combination of translates of $\mathbb {1}_\mathcal {R}$ as claimed.

Fix $b\in B_p$ and consider the group

\[ \mathcal{A}(b)=\big\{\!\operatorname{diag}(u, u^{-1})\colon u\in\mathbb{Z}_p^\times \textrm{ and } bu-b\in\mathcal{R} \big\}. \]

Then, $\mathcal {A}(b)$ is an open subgroup of the diagonal group in $\mathbf {SL}_2(\mathbb {Z}_p)$ and $\rho (\mathcal {A}(b)).\mathbb {1}_{b+\mathcal {R}}=\mathbb {1}_{b+\mathcal {R}}$. Taking a finite intersection of such subgroups we find an open subgroup $\mathcal {A}_0$ of the diagonal group of $\mathbf {SL}_2(\mathbb {Z}_p)$, such that $\rho (\mathcal {A}_0 a).M=\rho (a).M$. Hence, $M$ is invariant under $\rho (\mathcal {A}_0)$.

In a similar fashion, let $k_b\geq 0$ such that $p^{k_b} b\in \mathcal {R}$ and define $\mathcal {N}(b)=\big (\begin{smallmatrix}1 & p^{k_b} \mathbb {Z}_p \\ 0 & 1 \end{smallmatrix}\big )$. Then, $\mathcal {N}(b)$ is an open subgroup of the upper-triangular unipotent group of $\mathbf {SL}_2(\mathbb {Z}_p)$ and $\rho (\mathcal {N}_b).\mathbb {1}_{b+\mathcal {R}}=\mathbb {1}_{b+\mathcal {R}}$. Taking a finite intersection of such subgroups we can find an open subgroup $\mathcal {N}_1'$ of the integral upper-triangular unipotent subgroup, such that $\rho (\mathcal {N}_1'a).M=\rho (a).M$. Set $\mathcal {N}_1=a^{-1} \mathcal {N}_1' a\cap \mathbf {SL}_2(\mathbb {Z}_p)$. Then, $\rho (\mathcal {N}_1).M=M$ and $\mathcal {N}_1$ is an open subgroup of the upper unipotent integral group. Replacing $M$ by $\rho (w).M$, we can find $\mathcal {N}_2$ such that $\rho (w^{-1} \mathcal {N_2} w).M=M$, and $w^{-1}\mathcal {N}_2 w$ is an open subgroup of the lower-triangular integral unipotent group.

Set now $U_M$ to be the group generated by $\mathcal {A}_0,\mathcal {N}_1,w^{-1}\mathcal {N}_2w$. Then, $U_M$ is an open subgroup of $\mathbf {SL}_2(\mathbb {Z}_p)$ and satisfies the requirements of the claim.

Assume now $B_p\simeq \operatorname {Mat}_{2\times 2}(\mathbb {Q}_p)$ is split. Maximal orders in $\operatorname {Mat}_{2\times 2}(\mathbb {Q}_p)$ are endomorphism rings of lattices in $\mathbb {Q}_p^2$ and they are in one-to-one correspondence with the vertices of the Bruhat–Tits tree of $\mathbf {SL}_2(\mathbb {Q}_p)$.

Definition 3.16 An Eichler order in $B_p$ of level $p^n$ is an intersection of two maximal orders corresponding to two vertices in the Bruhat–Tits tree with distance $n$ between them.

Lemma 3.17 Let $\mathcal {R}\subset B_p$ be an Eichler order of level $p^n$. Then, $\rho \big (U_0(p^n)\big ).\mathbb {1}_{\mathcal {R}}=\mathbb {1}_{\mathcal {R}}$.

Proof. The action of $B_p^\times$ on the vertices of the Bruhat–Tits tree is transitive on pairs of vertices of a fixed distance,Footnote ⁷ thus it acts transitively by conjugation on the set of Eichler orders of a fixed level $p^n$. Because the conjugation action commutes with the Weil representation, it is enough to consider a single Eichler order of the form

\[ \mathcal{R}=\begin{pmatrix} \mathbb{Z}_p & \mathbb{Z}_p \\ p^n \mathbb{Z}_p & \mathbb{Z}_p \end{pmatrix}. \]

Indeed, invariance of $\mathbb {1}_{\mathcal {R}}$ under upper-triangular integral matrices is immediate and it is enough to check invariance under the element $\left (\begin{smallmatrix} 1 & 0 \\ p^{n} & 1 \end{smallmatrix}\right ) =-w \left (\begin{smallmatrix} 1 & -p^{n}\\ 0 & 1 \end{smallmatrix}\right ) w$. Equivalently, we need to show $\rho \big (\!\left (\begin{smallmatrix} 1 & -p^{n}\\ 0 & 1 \end{smallmatrix}\right ) \!\big ).\mathcal {F}\mathbb {1}_{\mathcal {R}}=\mathcal {F}\mathbb {1}_{\mathcal {R}}$. We can compute the Fourier transform of $\mathbb {1}_{\mathcal {R}}$ explicitly and arrive at

\[ {\mathcal{F}}\mathbb{1}_{\begin{pmatrix} \mathbb{Z}_p & \mathbb{Z}_p \\ p^n \mathbb{Z}_p & \mathbb{Z}_p \end{pmatrix}}=p^{-n} \mathbb{1}_{\begin{pmatrix} \mathbb{Z}_p & p^{-n} \mathbb{Z}_p \\ \mathbb{Z}_p & \mathbb{Z}_p \end{pmatrix}}=p^{-n}\mathbb{1}_{\widehat{\mathcal{R}}}. \]

In particular, for all $x\in \operatorname {supp} \mathcal {F} \mathbb {1}_{\mathcal {R}}$, we have $\operatorname {Nr} x=\operatorname {det} x\in p^{-n}\mathbb {Z}_p$ and

\[ \bigg(\rho\bigg(\!\!\begin{pmatrix} 1 & -p^{n} \\ 0 & 1 \end{pmatrix}\!\!\bigg). \mathcal{F} \mathbb{1}_{\mathcal{R}}\bigg)(x)=\psi_v(-p^{n}\operatorname{Nr} x) \big(\mathcal{F} \mathbb{1}_{\mathcal{R}}\big)(x)=\big(\mathcal{F} \mathbb{1}_{\mathcal{R}}\big)(x) \]

as necessary.

Lemma 3.18 Let $\mathcal {R}=\mathcal {R}_1\cap \mathcal {R}_2$ be an Eichler order of level $p^n$, where $\mathcal {R}_1$ and $\mathcal {R}_2$ are maximal orders. There is an isomorphism of additive abelian groups $\jmath _v\colon {\widehat {\mathcal {R}}/ \mathcal {R}}\to \big ({\mathbb{Z}}/ {p^n}\mathbb{Z}\big )^{2}$ such that

\begin{align*} \jmath_v(\mathcal{R}_1)&=^{\mathbb{Z}}/p^n\mathbb{Z}\times \{0\},\\ \jmath_v(\mathcal{R}_2)&=\{0\}\times^{\mathbb{Z}}/{p^n}\mathbb{Z},\\ \forall x\in\widehat{\mathcal{R}}&\colon -p^n\operatorname{Nr} (x) \equiv \jmath_v(x)_1\cdot \jmath_v(x)_2 \mod p^n. \end{align*}

Moreover, the isomorphism $\jmath _v$ is unique up to post-composition with the map $(b,c)\mapsto (bu,cu^{-1})$ for $u\in ({\mathbb{Z}}/ {p^n}\mathbb{Z})^\times$.

Note that the isomorphism $\jmath _v$ depends not only on $\mathcal {R}$ but on an ordered choice of $\mathcal {R}_1$ and $\mathcal {R}_2$.

Proof. Because all local Eichler orders of fixed level are conjugate, it is enough to verify the claim for

\[ \mathcal{R}_1=\begin{pmatrix} \mathbb{Z}_p & \mathbb{Z}_p \\ \mathbb{Z}_p & \mathbb{Z}_p \end{pmatrix}, \quad \mathcal{R}_2=\begin{pmatrix} \mathbb{Z}_p & p^{-n} \mathbb{Z}_p \\ p^n \mathbb{Z}_p & \mathbb{Z}_p \end{pmatrix}. \]

In this case, we have

\[ \mathcal{R}=\begin{pmatrix} \mathbb{Z}_p & \mathbb{Z}_p \\ p^n \mathbb{Z}_p & \mathbb{Z}_p \end{pmatrix}, \quad \widehat{\mathcal{R}}=\begin{pmatrix} \mathbb{Z}_p & p^{-n} \mathbb{Z}_p \\ \mathbb{Z}_p & \mathbb{Z}_p \end{pmatrix}. \]

We define the abelian homomorphism $\jmath _v\colon \widehat {\mathcal {R}}\to \big ({\mathbb{Z}}/ {p^n}\mathbb{Z} \big )^{2}$ explicitly as

\[ \begin{pmatrix} a & b/p^n \\ c & d \end{pmatrix}\mapsto (c,b) \bmod p^n\mathbb{Z}_p. \]

A direct computation shows that this homomorphism has kernel $\mathcal {R}$ and that it satisfies the claimed properties.

This isomorphism is unique up to post-composition with a linear automorphism of the first and second coordinate of $\big ({\mathbb{Z}}/ {p^n}\mathbb{Z}\big )^{2}$, i.e. a transformation of the form $(b,c)\mapsto (b u_1, c u_2)$ for $u_1,u_2\in ({\mathbb{Z}}/ {p^n}\mathbb{Z})^{\times }$. The requirement that the quadratic form $-p^n\operatorname {Nr}$ descends to the product form $(b,c)\mapsto b\cdot c$ forces $u_2=u_1^{-1}$.

Remark 3.19 The previous lemma implies that given two maximal orders $\mathcal {R}_1$, $\mathcal {R}_2$ the map $x\mapsto (\operatorname {ord}_p \jmath _v(x)_1,\operatorname {ord}_p \jmath _v(x)_2)$ from $\widehat {\mathcal {R}}$ to $\{0,1,\ldots,n\}^{2}$ is uniquely defined.

Definition 3.20 Let $\mathcal {R}\subset B_p$ be an Eichler order of level $p^n$. For every $p^k\mid p^n$ define the lattice

\[ \widehat{\mathcal{R}}^{(p^k)}=\big\{x\in\widehat{\mathcal{R}} \mid \jmath_v(x)\equiv (0,0)\ {\rm mod}\ p^k \big\}. \]

The definition of $\widehat {\mathcal {R}}^{(p^k)}$ does not depend on the choice of $\jmath _v$. Note that $\widehat {\mathcal {R}}^{(1)}=\widehat {\mathcal {R}}$ and $\widehat {\mathcal {R}}^{(p^n)}=\mathcal {R}$.

Moreover, for $x\in \widehat {\mathcal {R}}^{(p^k)}$ defineFootnote ⁸

\[ \nu_{p^k}(x)\colon= -p^{-k}\jmath_v(x)_1 \cdot p^{-k}\jmath_v(x)_2\in ^{\mathbb{Z}}/{p^{n-k}}\mathbb{Z}. \]

Note that $\nu _{p^0}(x)\equiv p^{n} \operatorname {Nr} x \bmod p^{n}$.

Proposition 3.21 Let $\mathcal {R}\subset B_p$ be an Eichler order of level $p^n$. Then,

\begin{align*} \rho\big(\mathbf{SL}_2(\mathbb{Z}_p)\big).\mathbb{1}_{\mathcal{R}}&= \big\{\mathbb{1}_{\widehat{\mathcal{R}}}(x)\cdot p^{-n}\psi_v(-p t_0 \operatorname{Nr} x) \mid 0 < t_0 \leq p^{n-1} \big\}\\ &\quad \cup \bigg\{ \mathbb{1}_{\widehat{\mathcal{R}}^{\big(p^{\operatorname{ord}_p t}\big)}}(x)\cdot p^{-(n-\operatorname{ord}_p t)} \psi_v \bigg(\frac{\nu_{p^{\operatorname{ord}_p t}}(x)}{p^{n-2\operatorname{ord}_p t}t}\bigg) \mid 0< t \leq p^n \bigg\}. \end{align*}

Moreover, each of the functions above corresponds to a single coset of ${\mathbf {SL}_2(\mathbb {Z}_p)/ U_0(p^n)}$. For $t=p^n$ above, the phase is $1$, hence the representative for $t=p^n$ is simply $\mathbb {1}_{\mathcal {R}}$.

Remark 3.22 Because $\psi$ is unramified, we can rewrite the result above as

\begin{align*} \rho\big(\mathbf{SL}_2(\mathbb{Z}_p)\big).\mathbb{1}_{\mathcal{R}}&= \bigcup_{0< k \leq n} \bigg\{ \mathbb{1}_{\widehat{\mathcal{R}}^{(p^k)}}(x)\cdot p^{-(n-k)} \psi_v \bigg(\frac{u \cdot \nu_{p^k}(x)}{p^{n-k}}\bigg) \mid u\in\big(^{\mathbb{Z}}/{p^{n-k}}\mathbb{Z}\big)^\times \bigg\}\\ &\qquad \cup \Big\{ \mathbb{1}_{\widehat{\mathcal{R}}}(x)\cdot p^{-n} \psi_v\big(t \operatorname{Nr} x\big) \mid t\in^{\mathbb{Z}}/{p^n}\mathbb{Z} \Big\}. \end{align*}

The set on the second line is analogous to the $k=0$ case missing in the first line, but requires $t$ to traverse the whole congruence subgroup, not just the units.

Proof. Again, as all Eichler orders are conjugate, the claim reduces to the case of

and

\[ \jmath_v\bigg(\!\!\begin{pmatrix} a & b/p^n \\ c & d \end{pmatrix}\!\!\bigg) =(c,b) \mod p^n\mathbb{Z}_p. \]

Because of Lemma 3.17, it is enough to compute $\rho (s).\mathbb {1}_{\mathcal {R}}$ for each of the representatives of ${\mathbf {SL}_2(\mathbb {Z}_p)/ U_0(p^n)}$. To find these representatives, we first write representatives for ${\mathbf {SL}_2(\mathbb {Z}_p)/ U_0(p)}$

\[ \mathbf{SL}_2(\mathbb{Z}_p)=w U_0(p)\sqcup \bigsqcup_{0< t \leq p} \begin{pmatrix} 1 & 0 \\ t & 1 \end{pmatrix} U_0(p). \]

This decomposition follows from the fact that $U_0(p)$ is the stabilizer of an edge in the apartment of the diagonal torus in the Bruhat–Tits tree of $\mathbf {SL}_2(\mathbb {Q}_p)$ and the representatives above permute the $p+1$ neighbors of the vertex stabilized by $\mathbf {SL}_2(\mathbb {Z}_p)$.

Next, we find representatives for ${U_0(p)/ U_0(p^n)}$ using their definition as congruence subgroups

\[ U_0(p)=\bigsqcup_{0 < t_0 \leq p^{n-1}} \begin{pmatrix} 1 & 0 \\ p t_0 & 1 \end{pmatrix} U_0(p^n). \]

By combining, we arrive at

\[ \mathbf{SL}_2(\mathbb{Z}_p)=w \bigsqcup_{0 < t_0 \leq p^{n-1}} \begin{pmatrix} 1 & 0 \\ p t_0 & 1 \end{pmatrix} U_0(p^n) \sqcup \bigsqcup_{0 < t \leq p^n} \begin{pmatrix} 1 & 0 \\ t & 1 \end{pmatrix} U_0(p^n). \]

We now compute explicitly the action of all representatives. We need to decompose $\widehat {\mathcal {R}}$ into $\mathcal {R}$-cosets

\[ \widehat{\mathcal{R}}=\bigsqcup_{0\leq \alpha,\beta< p^n} \begin{pmatrix} 0 & \alpha/p^n\\ \beta & 0 \end{pmatrix} +\mathcal{R}. \]

To simplify notation, we denote $x_{\alpha,\beta }\colon= \big (\begin{smallmatrix} 0 & \alpha /p^n\\ \beta & 0 \end{smallmatrix}\big )$, with $\jmath _v(x_{\alpha,\beta })=(\beta,\alpha )$. The duality between $\widehat {\mathcal {R}}$ and $\mathcal {R}$ implies $\operatorname {Nr}(x_{\alpha,\beta }+\mathcal {R})=\operatorname {Nr} x_{\alpha,\beta } + \mathbb {Z}_p=-\alpha \beta /p^n+\mathbb {Z}_p$. Write $\left (\begin{smallmatrix} 1 & 0 \\ t & 1 \end{smallmatrix}\right )=-w \left (\begin{smallmatrix} 1 & -t \\ 0 & 1 \end{smallmatrix}\right ) w$ and $x=\big (\begin{smallmatrix} a & b/p^n \\ c & d \end{smallmatrix}\big )$. Then,

\begin{align*} \rho\bigg(\!\!\begin{pmatrix} 1 & 0 \\ t & 1 \end{pmatrix}\!\!\bigg).\mathbb{1}_{\mathcal{R}} (x) &=p^{-n}\rho(w) \rho\bigg(\!\!\begin{pmatrix} 1 & -t \\ 0 & 1 \end{pmatrix}\!\!\bigg). \mathbb{1}_{\widehat{\mathcal{R}}} (x) =p^{-n}\rho(w).\big( \psi_v(-t \operatorname{Nr} x) \mathbb{1}_{\widehat{\mathcal{R}}}(x)\big)\\ &= p^{-n}\rho(w).\bigg(\sum_{0\leq \alpha,\beta < p^n} \psi_v(t \alpha\beta/p^n) \mathbb{1}_{x_{\alpha,\beta}+ \mathcal{R}}(x) \bigg)\\ &= \mathbb{1}_{\widehat{\mathcal{R}}}(x) \cdot p^{-2n} \sum_{0\leq \alpha,\beta < p^n} \psi_v(t \alpha\beta/p^n+\langle x_{\alpha,\beta},x \rangle)\\ &= \mathbb{1}_{\widehat{\mathcal{R}}}(x) \cdot p^{-2n} \sum_{0\leq \alpha,\beta < p^n} \psi_v \bigg(\frac{-\alpha c- \beta b +t \alpha\beta}{p^n}\bigg). \end{align*}

Let $k=\operatorname {ord}_p t$. Then, summing first over $\alpha$ we deduce $p^k \mid c$ and summing first over $\beta$ we see that $p^k \mid b$. Using this input, we can sum first over $\alpha$ and then over $\beta$ to arrive at

\begin{align*} &\rho\bigg(\!\!\begin{pmatrix} 1 & 0 \\ t & 1 \end{pmatrix}\!\!\bigg).\mathbb{1}_{\mathcal{R}} (x)\\ &\quad=\mathbb{1}_{\widehat{\mathcal{R}}}(x) p^{-(n-k)} \begin{cases} \psi_v\bigg(-\dfrac{(c/p^k)(b/p^k)}{(t/p^k)p^{n-k}}\bigg)=\psi_v\bigg(\dfrac{\nu_{p^k}(x)}{tp^{n-2k}}\bigg), & \jmath_v(x)\equiv (0,0) \bmod p^k, \\ 0, & \textrm{otherwise.} \end{cases} \end{align*}

We need only deal now with representatives of the form $w\left (\begin{smallmatrix} 1 & 0 \\ p t_0 & 1\end{smallmatrix}\right )=\left (\begin{smallmatrix} 1 & -p t_0 \\ 0 & 1 \end{smallmatrix}\right ) w$ which are easier to compute

\[ \rho\bigg(w\begin{pmatrix} 1 & 0 \\ p t_0 & 1\end{pmatrix}\!\!\bigg).\mathbb{1}_{\mathcal{R}} (x)=p^{-n}\psi_v(-p t_0 \operatorname{Nr} x) \mathbb{1}_{\widehat{\mathcal{R}}}(x).\]

3.6 Local uniformity

As a preparation for the global theory, we will need the following proposition that uniformly controls the decay of test functions for the Weil representation.

Proposition 3.23 Let $M\in \Omega _v$, $s\in \mathbf {SL}_2(F)$ and $L\in \operatorname {O}(B_v,\operatorname {Nr})$. If $v=\infty$, then there is $\delta >0$ such that the inequality

\[ |(\rho(s)M)(Lx)| \ll (1+\|x\|)^{-4-\delta} \]

holds uniformly on compact sets in $\mathbf {SL}_2(F)\times \operatorname {O}(B_v,\operatorname {Nr})$. If $v<\infty$, then for every compact subset of $\mathcal {K}\subset \mathbf {SL}_2(F)\times \operatorname {O}(B_v,\operatorname {Nr})$ there is a compact subset $\mathcal {C}\subset B_v$ such that

\[ \forall (s,L)\in \mathcal{K} \colon |(\rho(s)M)(Lx)|\ll_{\mathcal{K}} \mathbb{1}_{\mathcal{C}}. \]

Proof. The claim for $v=\infty$ follows immediately from Corollary 3.6. Fix now $v<\infty$. We may cover the set $\mathcal {K}$ by the product of its projections onto $\mathbf {SL}_2(F)$ and $\operatorname {O}(B_v,\operatorname {Nr})$, which is compact. Hence, we may assume without loss of generality that $\mathcal {K}=\mathcal {K}_0 \times \mathcal {K}_1$ is a compact product set. Note that it is enough to show that there is some $\mathcal {C}_0\subset B_v$ such that $|\rho (s).{M}|\ll \mathbb {1}_{\mathcal {C}_0}$ for $s\in \mathcal {K}_0$. In particular, the compact set $\mathcal {C}=\bigcup _{L\in \mathcal {K}_1} L^{-1}\mathcal {C}_0$ will satisfy the claimed properties. Using the Iwasawa decomposition, we can find a compact subset $C_P$ of the group of upper triangular matrices such that $\mathcal {K}_0\subset C_P\mathbf {SL}_2(\mathbb {Z}_p)$. Recall from Lemma 3.15 that the $\rho \big (\mathbf {SL}_2(\mathbb {Z}_p)\big )$-orbit of $M$ is finite and a uniform bound on $\rho (\mathcal {K_0}).M$ will follow from a uniform bound on $\rho (C_P).M'$ for every $M'$ in $\rho \big (\mathbf {SL}_2(\mathbb {Z}_p)\big ).M$. A uniform bound on $|\rho (C_P).M'|$ follows immediately from the formulae for the Weil action of diagonal and upper unipotent matrices.

3.7 Global Weil representation and theta series

Fix an additive character $\psi \colon \mathbb {A} / \mathbb {Q}\to \mathbb {C}$ such that $\psi =\prod _{v} \psi _v$ and $\psi _v$ is unramified for all finite $v$. This is possible for the adèle ring of $\mathbb {Q}$ and the standard character with $\psi _\infty (a)=\exp (-2\pi i a)$ will do. We consider henceforth always the local Weil representations on $\Omega _v$ to be with respect to $\psi _v$.

Set

\[ \Omega\colon= {\bigotimes}' \Omega_v=\operatorname{Span}_{\mathbb{C}} \bigg\{\!\prod_v M_v \mid M_v\in\Omega_v, \forall \textrm{ a.e. } v\colon M_v= \mathbb{1}_{R_v} \!\bigg\}. \]

A pure tensor $M=\prod _v M_v \in \Omega$ is called a standard test function. The Weil representations for each $\Omega _v$ define in the standard way a representation of $\mathbf {SL}_2(\mathbb {A})$ on $\Omega$. To see that this action is well-defined we need to check that for almost every $v$ we have $\rho \big (\mathbf {SL}_2(\mathbb {Z}_v)\big ).\mathbb {1}_{R_v}=\mathbb {1}_{R_v}$, and this follows from Lemma 3.11. The complex vector space $\Omega$ also carries actions of $\mathbf {G}(\mathbb {A})$ by left and right multiplication because for every $v<\infty$ the function $\mathbb {1}_{R_v}$ is invariant under left and right multiplication by elements of $K_v$.

Definition 3.24 For $M\in \Omega$ define the theta kernel $\Theta _M\colon \mathbf {G}(\mathbb {A})\times \mathbf {G}(\mathbb {A})\times \mathbf {SL}_2(\mathbb {A})\to \mathbb {C}$ by

\[ \Theta_M(l,r;s)\colon= \sum_{\xi\in B} \big(\rho(s)M\big)(l^{-1} \xi r). \]

The uniform decay from Proposition 3.23 is sufficient for the theta series to converge absolutely and uniformly on compact sets in $\mathbf {G}(\mathbb {A})\times \mathbf {G}(\mathbb {A})\times \mathbf {SL}_2(\mathbb {A})$. In particular, the theta series is a well-defined continuous function on its domain.

The theta kernel is obviously $\mathbf {G}(\mathbb {Q})$-invariant on the left in the first two coordinates by virtue of its definition. Less obvious, yet well-known, is that it is also $\mathbf {SL}_2(\mathbb {Q})$ left-invariant in the third coordinate. A simple way to verify this is by first showing invariance under upper-triangular matrices by an elementary calculation and then establishing invariance under the involution $\left (\begin{smallmatrix} 0 & 1\\ -1 & 0 \end{smallmatrix}\right )$ using the Poisson summation formula. The decay conditions we have imposed on functions in $\Omega _\infty$ are sufficient for the Poisson summation formula to hold [Reference Stein and WeissSW71, p. 252, Corollary 2.6].

We now recall the Fourier–Whittaker decomposition of a continuous function $\varphi \colon [\mathbf {SL}_2(\mathbb {A})] \to \mathbb {C}$. For any $\alpha \in \mathbb {Q}$, define the Whittaker function

\[ W_{\varphi}(g,\alpha)=\int_{[\mathbf{N}(\mathbb{A})]} \varphi\bigg(\!\!\begin{pmatrix} 1 & n \\ 0 & 1 \end{pmatrix}g\bigg) \psi(-\alpha n)\,{d} n. \]

We have the following standard properties of the Whittaker function

\begin{align*} \forall n\in \mathbb{A}\colon W_{\varphi}\bigg(\!\!\begin{pmatrix} 1 & n \\ 0 & 1 \end{pmatrix}g,\alpha \bigg)&=\psi(\alpha n) W_{\varphi}(g,\alpha),\\ \forall \lambda\in\mathbb{Q}^\times\colon W_{\varphi}\bigg(\!\!\begin{pmatrix} \lambda & 0 \\ 0 & \lambda^{-1} \end{pmatrix}g,\alpha \bigg)&=W_{\varphi}(g,\lambda^2\alpha). \end{align*}

Because our function $\varphi$ is defined on $[\mathbf {SL}_2(\mathbb {A})]$ and not $[\mathbf {PGL}_2(\mathbb {A})]$, we cannot reduce the dependence on $\alpha$ to the two cases $0$ and $1$. Pontryagin duality for the compact abelian group $[\mathbf {N}(\mathbb {A})]$ implies that the following equality

(23)

\begin{equation} \varphi(g)=\sum_{\alpha\in\mathbb{Q}} W_{\varphi}(g,\alpha) \end{equation}

holds pointwise as long as the right-hand side is absolutely convergent.Footnote ⁹

Proposition 3.25 Fix $M\in \Omega$. Then, the Fourier–Whittaker coefficients of $\Theta _M(l,r;s)$ in the $s$-variable are

\[ W_{\Theta_M(l,r;\bullet)}(s,\alpha)=\sum_{\substack{\xi\in B \\ \operatorname{Nr} \xi=\alpha}} \big(\rho(s)M\big)(l^{-1} \xi r). \]

Because the theta series in Definition 3.24 converges absolutely, an immediate corollary is that the Fourier–Whittaker expansion (23) holds pointwise for $\Theta _M(l,r;\bullet )$.

Proof. Because the theta series converges absolutely, we may exchange summation and integration in the definition of $W_{\Theta _M(l,r;\bullet )}$ and write

\begin{align*} W_{\Theta_M(l,r;\bullet)}(s,\alpha)&= \sum_{\xi\in B }\int_{[\mathbf{N}(\mathbb{A})]} \bigg(\rho\bigg(\!\!\begin{pmatrix} 1 & n \\ 0 & 1 \end{pmatrix}s\bigg)M \bigg) (l^{-1}\xi r) \psi(-\alpha n) \,{d} n\\ &= \sum_{\xi\in B }\int_{[\mathbf{N}(\mathbb{A})]} \big(\rho\big(s\big)M \big) (l^{-1}\xi r) \psi(\operatorname{Nr} \xi\cdot n-\alpha n) \,{d} n\\ &=\sum_{\substack{\xi\in B \\ \operatorname{Nr} \xi=\alpha}} \big(\rho(s)M\big)(l^{-1} \xi r). \end{align*}

4. Theta kernels for Eichler orders

4.1 Weil action on Eichler orders

We first introduce the notation $D_B$ for the (reduced) discriminant of $B$, explicitly

\[ D_B=\prod_{p: B_p \textrm{ is ramified}} p. \]

Definition 4.1 An Eichler order $\mathcal {R}\subset B$ is an intersection of two maximal orders $\mathcal {R}_1$ and $\mathcal {R}_2$. The completion $\mathcal {R}_v$ of $\mathcal {R}$ at any finite place $v$ is a local Eichler order in $B_v$. We say that $\mathcal {R}$ is ramified at $v$ if $\mathcal {R}_v$ is non-maximal. If $B$ is ramified at $v$ then $\mathcal {R}_v$ is unramified at $v$ because $B_v$ has a unique maximal order.

For almost all places, the local orders $\mathcal {R}_{1,v}$ and $\mathcal {R}_{2,v}$ coincideFootnote ¹⁰ and $\mathcal {R}_v$ is a maximal order, i.e. the level of $\mathcal {R}_v$ at these places is $1$. We define the level of $\mathcal {R}$ as

\[ q = \prod_{v<\infty} \textrm{level of }\mathcal{R}_v. \]

The reader may recall Definition 3.16, where we defined the level of a local Eichler order.

The dual lattice to $\mathcal {R}$ is

\[ \widehat{\mathcal{R}}\colon=\big\{x\in B \mid \forall y\in \mathcal{R}\colon \operatorname{Tr} (x^\iota{y}{})\in\mathbb{Z} \big\}. \]

Dualization commutes with localization, i.e. $(\widehat {\mathcal {R}})_v=\widehat {\mathcal {R}_v}$. Using the decomposition

\[ \widehat{\mathcal{R}}/\mathcal{R}=\prod_{v<\infty} \widehat{\mathcal{R}}_v/ \mathcal{R}_v \]

and Lemmata 3.18 and 3.12, we see the existence of an isomorphism of abelian groups

\[ \jmath\colon \widehat{\mathcal{R}}/\mathcal{R}\to \left(^{\mathbb{Z}}/{q}\mathbb{Z} \right)^{2} \times \prod_{p \mid D_B} \mathbb{F}_{p^2} . \]

The map $\jmath$ fibers through the local maps $\jmath _v$ and satisfies the properties inherited from Lemma 3.18:

\begin{align*} \jmath(\mathcal{R}_1)&= ^{\mathbb{Z}}/{q}\mathbb{Z}\times \{0\}\times \prod_{p \mid D_B} \mathbb{F}_{p^2},\\ \jmath(\mathcal{R}_2)&= \{0\}\times^{\mathbb{Z}}/{q}\mathbb{Z} \times \prod_{p \mid D_B} \mathbb{F}_{p^2},\\ \forall x\in \widehat{\mathcal{R}} &\colon -q\operatorname{Nr} x \equiv \jmath(x)_1\cdot \jmath(x)_2 \mod q,\\ \forall x\in \widehat{\mathcal{R}}, p\mid D_B &\colon -p\operatorname{Nr} x \equiv \operatorname{Nr} \jmath(x)_3 \mod p. \end{align*}

For $m\in \mathbb {N}$ and $x\in \mathbb {F}_{p^2}$ define

\[ x\bmod m=\begin{cases} x, & p\mid m, \\ 0, & p\nmid m. \end{cases} \]

Similarly, set $x \bmod m= x \bmod \gcd (m,q)$ for all $x\in {\mathbb{Z}}/ {q}\mathbb{Z}$. We extend this definition element-wise to a map

\[ \left(^{\mathbb{Z}}/{q}\mathbb{Z}\right)^{2} \times \prod_{p \mid D_B} \mathbb{F}_{p^2}\xrightarrow{\bmod m} \Big(^{\mathbb{Z}}/{\gcd(m,q)}\mathbb{Z}\Big)^{2} \times \prod_{p\mid D_B} \mathbb{F}_{p^2}. \]

Definition 4.2 Let $\mathcal {R}=\mathcal {R}_1\cap \mathcal {R}_2 \subset B$ be an Eichler order of level $q$. For every $m\mid q D_B$ define

\[ \widehat{\mathcal{R}}^{(m)}=\Big\{x\in\widehat{\mathcal{R}}\mid \jmath(x)\equiv 0 \mod m \Big\}. \]

Note that $\widehat {\mathcal {R}}^{(1)}=\widehat {\mathcal {R}}$ and $\widehat {\mathcal {R}}^{(qD_B)}=\mathcal {R}$ and

\[ \big[\widehat{\mathcal{R}} \colon \widehat{\mathcal{R}}^{(m)}\big]=m^2. \]

Moreover, the definition of $\widehat {\mathcal {R}}^{(m)}$ does not depend on the choices involved in the definition of $\jmath$.

We also define for $x\in \widehat {\mathcal {R}}^{(m)}$

\[ \nu_m(x)\colon= \prod_{p \mid qD_B} \begin{cases} p^{1-\operatorname{ord}_p m}\operatorname{Nr}(x) \mod p^{\operatorname{ord}_p D_B - \operatorname{ord}_p m}, & p \mid D_B, \\ \nu_{p^{\operatorname{ord}_p m}} (x), & p \mid q, \end{cases}\in\Big(^{\mathbb{Z}}/{(qD_B/m)}\mathbb{Z}\Big). \]

The lattices $\widehat {\mathcal {R}}^{(m)}$ will feature prominently in the description of the action of the Weil representation. In classical terms, they will appear in the Fourier expansion of a theta series at different cusps. We will use the following notation for the completion of an integral lattice at all finite places.

Definition 4.3 If $L\subset B$ is a lattice, then define $\mathbb {1}_{L_f}\colon B_f\to \mathbb {C}$ to be $\mathbb {1}_{L_f}\colon= \prod _p \mathbb {1}_{L_p}$, where $L_p\subset B_p$ is the $p$-adic closure of $L$.

Our goal now is to describe the $\rho \big (\mathbf {SL}_2(\widehat {\mathbb {Z}})\big )$-action on $\mathbb {1}_{\mathcal {R}_f}$. The first step is to identify the stabilizer of $\mathbb {1}_{\mathcal {R}_f}$.

Definition 4.4 Define the compact-open subgroup $U_{\mathcal {R}}=\prod _{p<\infty }U_{p}<\mathbf {SL}_2(\mathbb {A}_f)$ by

\[ U_p=\begin{cases} U_0(p), & B \textrm{ is ramified at } p,\\ U_0(p^n), & \mathcal{R}_p \textrm{ has level } p^n,\\ \mathbf{SL}_2(\mathbb{Z}_p), & \textrm{otherwise.} \end{cases} \]

From Lemmata 3.11 and 3.17, we know that $\rho (U_{\mathcal {R}}).\mathbb {1}_{\mathcal {R}_f}=\mathbb {1}_{\mathcal {R}_f}$.

We define the arithmetic function $\rho (a \mid q D_B)$ as

\[ \rho (a \mid q D_B)\colon= \prod_{p\mid \gcd(q D_B/a, a)} (1-p^{-1}) . \]

Note that $(\log \log (10 q D_B))^{-1} \ll \rho (a \mid q D_B) \leq 1$.

Proposition 4.5 Let $\mathcal {R}\subset B$ be an Eichler order. Then,

\begin{align*} &\rho\big(\mathbf{SL}_2(\widehat{\mathbb{Z}})\big).\mathbb{1}_{\mathcal{R}_f}\\ &\quad=\Bigg\{\frac{a (-1)^{\omega(D_B/\gcd(a,D_B))}}{q D_B} \psi\bigg(\frac{ t \cdot \nu_a(x)}{qD_B/a}\bigg) \cdot \mathbb{1}_{\widehat{\mathcal{R}}^{(a)}_f} \,\Big| a \mid q D_B,\, t\in ^{\mathbb{Z}}/_{{\frac{q D_B}{a}}\mathbb{Z}},\, \gcd(t,a)=1 \Bigg\}. \end{align*}

Moreover, each function on the right-hand side corresponds to a single coset of ${\mathbf {SL}_2(\widehat {\mathbb {Z}})/ U_{\mathcal {R}}}$.

Remark 4.6 For every $a\mid q D_B$, the characteristic function $\mathbb {1}_{\widehat {\mathcal {R}}^{(a)}}$ appears above exactly $(q D_B/a) \rho (a\mid q D_B)$ times with different phase functions.

Proof. This follows from combining the local contributions as calculated in Propositions 3.13 and 3.21 and the remarks following these propositions.

4.2 Theta series for Eichler orders

We fix once and for all an Eichler order $\mathcal {R}=\mathcal {R}_1\cap \mathcal {R}_2\subset B$ of level $q$. In this section, we unwind the adelic definition of a theta series for the case of Eichler orders into classical terms.

Denote $K_{\mathcal {R}}=\prod _{v<\infty } \big (\mathbf {G}(\mathbb {Q}_v)\cap \mathcal {R}_v\big )$. Strong approximation for the simply connected group $\mathbf {G}$ implies that the double quotient

\[\mathbf{G}(\mathbb{Q})\backslash^{ \mathbf{G}(\mathbb{A})}/_{ K_{\mathcal{R}}} \]

is a single orbit of $\mathbf {G}(\mathbb {R})$. The stabilizer of the identity double coset in $\mathbf {G}(\mathbb {R})$ is

\[ \Gamma\colon= K_{\mathcal{R}}\cap \mathbf{G}(\mathbb{Q})=\big\{x\in\mathcal{R} \mid \operatorname{Nr} x = 1\big\}. \]

Hence, there is a canonical quotient map

\[ \pi_{\Gamma}\colon {\mathbf{G(\mathbb{Q})}\backslash ^{\mathbf{G}(\mathbb{A})}}\to {\Gamma \backslash^{ \mathbf{G}(\mathbb{R})}}. \]

Each fiber of this map is a torsor for $K_{\mathcal {R}}$. The quotient map $\pi _{\Gamma }$ induces a natural isomorphism

\[ \pi_{\Gamma}^*\colon\operatorname{Map}\Big({\Gamma\backslash^{ \mathbf{G}(\mathbb{R})}},\mathbb{C}\Big)\to \operatorname{Map}\big([\mathbf{G}(\mathbb{A})],\mathbb{C}\big)^{K_\mathcal{R}}, \]

where the right-hand side is the set of all $K_{\mathcal {R}}$-invariant maps.

Set $\Lambda =U_{\mathcal {R}}\cap \mathbf {SL}_2(\mathbb {Q})<\mathbf {SL}_2(\mathbb {R})$. Explicitly, $\Lambda =\Gamma _0(q D_B)$ where $D_B$ is the product of the primes ramified in $B$ and $q$ is the level of $\mathcal {R}$. Again, the double quotient

\[{\mathbf{SL}_2(\mathbb{Q})^{\backslash ^{\mathbf{SL}_2(\mathbb{A})}/_{U_{\mathcal{R}}}}} \]

is a single orbit of $\mathbf {SL}_2(\mathbb {R})$ and the stabilizer of the identity is $\Lambda$. This induces a map

(24)

\begin{equation} \pi_{\Lambda}\colon {\mathbf{SL}_2(\mathbb{Q})\backslash \mathbf{SL}_2(\mathbb{A})}\to {\Lambda \backslash \mathbf{SL}_2(\mathbb{R})} \end{equation}

and, furthermore, a natural isomorphism

\[ \pi_{\Lambda}^*\colon \operatorname{Map}\Big({\Lambda ^{\backslash \mathbf{SL}_2(\mathbb{R})}},\mathbb{C}\Big)\to \operatorname{Map}\big([\mathbf{SL}_2(\mathbb{A})],\mathbb{C}\big)^{U_\mathcal{R}}. \]

We can now write the adelic theta kernel in classical terms.

Proposition 4.7 Fix $M=M_\infty \cdot \prod _{v<\infty }M_v \in \Omega$ such that $M_v=\mathbb {1}_{\mathcal {R}_v}$ for all finite $v$. Let $l_\infty, r_\infty \in \mathbf {G}(\mathbb {R})$ and $s_\infty \in \mathbf {SL}_2(\mathbb {R})$. Then,

\[ \Theta_M\big(l_\infty K_{\mathcal{R}},r_\infty K_{\mathcal{R}}; s_\infty U_{\mathcal{R}}\big)= \sum_{\xi\in \mathcal{R}} (\rho(s_\infty)M_\infty)(l_\infty^{-1}\xi r_\infty). \]

Hence, $\Theta _M$ defines a classical theta kernel on $\Gamma \backslash \mathbf {G}(\mathbb {R})\times {\Gamma \backslash \mathbf {G}(\mathbb {R})} \times {\Lambda \backslash \mathbf {SL}_2(\mathbb {R})}$.

Proof. This follows from the discussion above, Lemma 3.17, and the local-to-global principle for lattices that implies

\[ \mathcal{R}=\bigcap_{v<\infty} \mathcal{R}_v, \]

where the intersection is taken in $B$.

Definition 4.8 Fix $M_\infty \in \Omega _\infty$ and define $\vartheta _{M_\infty }\colon {\Gamma \backslash \mathbf {G}(\mathbb {R})}\times {\Gamma \backslash \mathbf {G}(\mathbb {R})} \times {\Lambda \backslash \mathbf {SL}_2(\mathbb {R})}$ by

\[ \vartheta_{M_\infty}(l_\infty,r_\infty; s_\infty)=\Theta_M(l_\infty K_{\mathcal{R}},r_\infty K_{\mathcal{R}}; s_\infty U_{\mathcal{R}}).\]

4.3 The Weil $L^2$-norm of the theta kernel

Our method relies heavily on bounding the $L^2$-norm of $\Theta _M(l,r;s)$ in the $s$-variable. We use the Fourier–Whittaker decomposition from Proposition 3.25 to bound the $L^2$-norm by a second-moment count of rational matrices. Unfortunately, the classical representation above is not well adapted to this calculation because of the many cusps of ${\Lambda \backslash \mathbf {SL}_2(\mathbb {R})}$. Instead, we compute adelically the $L^2$-norm. This is easier to execute because the adelic quotient $[\mathbf {SL}_2(\mathbb {A})]={\mathbf {SL}_2(\mathbb {Q})\backslash \mathbf {SL}_2(\mathbb {A})}$ has a single cusp.

Proposition 4.9 Fix $M_\infty \in \Omega _\infty$. Then,

\begin{align*} &\frac{1}{\operatorname{covol}(\Lambda)}\int_{{\Lambda}\backslash{\mathbf{SL}_2(\mathbb{R})}} |\vartheta_{M_\infty}(l_\infty, r_\infty;s_\infty)|^2 \,{d} s_\infty\\ &\quad \leq (q D_B)^{-1} \sum_{a\mid q D_B} \sum_{\alpha\in\mathbb{Q}} \int_{\sqrt{3}/2}^\infty \int_{\mathbf{SO}_2(\mathbb{R})} \sum_{\substack{\xi\in\widehat{\mathcal{R}}^{(a)}\\\operatorname{Nr} \xi=\alpha}}\big| \big(\rho\big(\!\operatorname{diag}(y^{1/2},y^{-1/2}) k\big).M_\infty\big)(l_\infty^{-1} \xi r_\infty)\big|^2 \,{d} k \frac{{d} y}{y^2}, \end{align*}

where the measure on $\mathbf {SO}_2(\mathbb {R})$ is normalized to be a probability measure, and the left-hand side is independent of the measure normalization on $\mathbf {SL}_2(\mathbb {R})$.

Proof. Fix $M=M_\infty \cdot \prod _{v<\infty }M_v \in \Omega$ such that $M_v=\mathbb {1}_{\mathcal {R}_v}$ for all finite $v$. Then, Proposition 4.7 and the isomorphism (24) imply

\[ \frac{1}{\operatorname{covol}(\Lambda)}\int_{{\Lambda}\backslash{\mathbf{SL}_2(\mathbb{R})}} |\vartheta_{M_\infty}(l_\infty, r_\infty;s_\infty)|^2 \,{d} s_\infty= \int_{[\mathbf{SL}_2(\mathbb{A})]} |\Theta_M(l_\infty K_{\mathcal{R}}, r_\infty K_{\mathcal{R}};s)|^2 \,{d} s. \]

We proceed to bound the adelic integral by expanding the domain of integration to a Siegel set.

Denote by $\mathbf {N}<\mathbf {SL}_2$ the algebraic subgroup of upper triangular matrices. We have $\mathbf {N}\simeq \mathbb {G}_a$ and a fundamental domain for the action of $\mathbf {N}(\mathbb {Q})$ on $\mathbf {N}(\mathbb {A})$ is

\[ \mathcal{N}=\begin{pmatrix} 1 & [0,1) \\ 0 & 1 \end{pmatrix} \cdot \prod_{v<\infty} \begin{pmatrix} 1 & \mathbb{Z}_v \\ 0 & 1 \end{pmatrix}. \]

Set $A^>\colon= \big \{\!\operatorname {diag}(y^{1/2},y^{-1/2}) \colon y>\sqrt {3}/2\big \}\subset \mathbf {SL}_2(\mathbb {R})$. A Siegel set for the action of $\mathbf {SL}_2(\mathbb {Q})$ on $\mathbf {SL}_2(\mathbb {A})$ is given by

\[ \mathcal{S}=\mathcal{N}\cdot A^> \cdot \mathbf{SO}_2(\mathbb{R}) \mathbf{SL}_2(\hat{\mathbb{Z}}). \]

Denote $l=(l_\infty,e,e,\ldots )$ and similarly $r=(r_\infty,e,e,\ldots )$. Because the Siegel set contains a fundamental domain for the action of the lattice, we can write

\begin{align*} &\int_{[\mathbf{SL}_2(\mathbb{A})]} |\Theta_M(l_\infty K_{\mathcal{R}}, r_\infty K_{\mathcal{R}};s)|^2 \,{d} s\\ &\quad \leq \int_{\mathcal{S}} |\Theta_M(l_\infty K_{\mathcal{R}}, r_\infty K_{\mathcal{R}};s)|^2 \,{d} s\\ &\quad =\varrho_{\mathbf{SL}_2}^{-1}\int_{[\mathbf{N}(\mathbb{A})]} \int _{\sqrt{3}/2}^\infty \int_{\mathbf{SL}_2(\hat{\mathbb{Z}})} \int_{\mathbf{SO}_2(\mathbb{R})} \bigg|\sum_{\xi\in B} (\rho(n)\rho(\operatorname{diag}(y^{1/2},y^{-1/2}) k k_f).M)(l^{-1} \xi r) \bigg|^2 \,{d} n \frac{{d} y}{y^2} \,{d} k_f\,{d} k\\ &\quad=\varrho_{\mathbf{SL}_2}^{-1}\sum_{\alpha\in\mathbb{Q}} \int _{\sqrt{3}/2}^\infty \int_{\mathbf{SL}_2(\hat{\mathbb{Z}})} \int_{\mathbf{SO}_2(\mathbb{R})} \bigg|\sum_{\substack{\xi\in B \\ \operatorname{Nr} \xi=\alpha}} (\rho(\operatorname{diag}(y^{1/2},y^{-1/2}) k)\rho(k_f).M)(l^{-1} \xi r) \bigg|^2 \,{d} k_f \frac{{d} y}{y^2} \,{d} k\\ &\quad \leq\varrho_{\mathbf{SL}_2}^{-1}\sum_{\alpha\in\mathbb{Q}} \int _{\sqrt{3}/2}^\infty \int_{\mathbf{SL}_2(\hat{\mathbb{Z}})} \int_{\mathbf{SO}_2(\mathbb{R})} \bigg|\sum_{\substack{\xi\in B \\ \operatorname{Nr} \xi=\alpha}} \big|(\rho(\operatorname{diag}(y^{1/2},y^{-1/2}) k)\rho(k_f).M)(l^{-1} \xi r)\big| \bigg|^2 \,{d} k_f \frac{{d} y}{y^2}\, {d} k. \end{align*}

The last equality follows from the computation of the Fourier coefficients in the unipotent variable in Proposition 3.25 and the orthogonality of characters. We normalize the measure on $\mathbf {SL}_2(\mathbb {A}_f)$ so that $\mathbf {SL}_2(\widehat {\mathbb {Z}})$ has volume $1$. Then, the global measure normalization constant $\varrho _{\mathbf {SL}_2}$ is equal to the volume of ${\mathbf {SL}_2(\mathbb {Z})\backslash \mathbb {H}}$ with respect to the standard hyperbolic measure ${{d} x\, {d} y/y^2}$, i.e. $\varrho _{\mathbf {SL}_2}=\operatorname {covol}(\mathbf {SL}_2(\mathbb {Z}))= {\pi /3}$.

In the last line we have inserted the absolute value into the sum using the triangle inequality, and we continue to evaluate the integral along $\mathbf {SL}_2(\hat {\mathbb {Z}})$. From Lemmata 3.11 and 3.17, we know that the integrand is invariant under the finite index subgroup $U_\mathcal {R}<\mathbf {SL}_2(\hat {\mathbb {Z}})$. We decompose the integral into $\big [\mathbf {SL}_2(\hat {\mathbb {Z}})\colon U_{\mathcal {R}}\big ]$-integrals along the different cosets of $U_{\mathcal {R}}$ in $\mathbf {SL}_2(\hat {\mathbb {Z}})$. We have an exact expression for the integrand on each coset due to Proposition 4.5. The phases of the form $\psi \big ( {t \cdot \nu _a(x)/qD_B/a}\big )$ that appear in each element in the $\rho \big (\mathbf {SL}_2(\widehat {\mathbb {Z}})\big )$-orbit are discarded because of the innermost absolute value. Hence, each $U_{\mathcal {R}}$-coset reduces to a sum over elements in a lattice $\widehat {\mathcal {R}}^{(a)}$ for $a\mid q D_B$. From Remark 4.6, we deduce for any $a \mid q D_B$ that the weight of the sum over $\mathbb {1}_{\widehat {R}^{(a)}}$ is $\rho (a \mid q D_B) \varrho _{\mathbf {SL}_2}^{-1} \big [\mathbf {SL}_2(\widehat {\mathbb {Z}})\colon U_{\mathcal {R}}\big ]^{-1}$. Because $\Lambda =U_{\mathcal {R}}\cap \mathbf {SL}_2(\mathbb {A}_f)$ and $\mathbf {SL}_2(\mathbb {Z})=\mathbf {SL}_2(\widehat {\mathbb {Z}})\cap \mathbf {SL}_2(\mathbb {A}_f)$, the index satisfies $\varrho _{\mathbf {SL}_2}\big [\mathbf {SL}_2(\widehat {\mathbb {Z}})\colon U_{\mathcal {R}}\big ]=\operatorname {covol}(\Lambda )$. Because $\Lambda =\Gamma _0(qD_B)$, we see that the weight factor is equal to

\[ \operatorname{covol}(\Lambda)^{-1}\rho(a \mid q D_B)=\frac{\prod_{p\mid \gcd(q D_B/a,a)}(1-p^{-1})}{(\pi/3) q D_B\prod_{p\mid q D_B}(1+p^{-1})}\leq (q D_B)^{-1}.\]

5. The theta lift

In this section we discuss the pairing between a theta kernel and an automorphic form. This subject is well studied in the literature, we need to review and revisit several results because of the explicit form that we require.

5.1 Cuspidal theta series

Definition 5.1 We say that a test function $M\colon B_{\mathbb {A}}\to \mathbb {C}$ is cuspidal if $(\rho (s)M)(l^{-1}\xi r)=0$ for all $l,r\in \mathbf {G}(\mathbb {A})$, $s\in \mathbf {SL}_2(\mathbb {A})$, and $\xi \in B$ with $\operatorname {Nr} \xi =0$.

For example, $M$ is cuspidal if $M=\prod _v M_v$ and there is a place $v$ such that $\rho (s_v).M_v(x_v)=0$ for every $s_v\in \mathbf {SL}_2(\mathbb {Q}_v)$, and $x_v\in B_v$ with $\operatorname {Nr} x_v=0$. The importance of cuspidal test functions is that their theta series, when well-defined, is a cuspidal function of $s$ on $[\mathbf {SL}_2(\mathbb {A})]$. This follows from Proposition 3.25. Note that the cuspidality condition for $M$ is very restrictive if $\mathbf {G}$ is split. For example, if $\mathbf {G}=\mathbf {SL}_2$, then the test function $\exp (-D \operatorname {Tr}(x_\infty ^t{x}{_\infty }))P(x_\infty ) \prod _{v<\infty } \mathbb {1}_{\mathcal {R}_v}$, for a polynomial $P\colon \operatorname {Mat}_{2\times 2}(\mathbb {R})\to \mathbb {C}$ and $D>0$, is used in [Reference ShimizuShi72]. This test function is generally not cuspidal.

5.2 Unfolding

Lemma 5.2 If $\mathcal {R}_p$ is an Eichler order, then $\operatorname {Nr} \mathcal {R}_p^\times =\mathbb {Z}_p^\times$.

Proof. This is simple to verify if $B$ is split at $p$ by conjugating $\mathcal {R}_p$ to $\big (\begin{smallmatrix} \mathbb {Z}_p & \mathbb {Z}_p \\ p^n \mathbb {Z}_p & \mathbb {Z}_p \end{smallmatrix}\big )$. If $B$ is ramified at $p$, then $\mathcal {R}_p^\times =\mathcal {O}_{B_p}^\times$ is the unit group of the ring of algebraic integers in $B_p$. The algebra $B_p$ contains an unramified quadratic extension $E/\mathbb {Q}_p$, hence $\operatorname {Nr} \mathcal {R}_p^\times$ contains $\operatorname {Nr} \mathcal {O}_E^\times =\mathbb {Z}_p^\times$.

Lemma 5.3 Let $M_f=\prod _p \mathbb {1}_{p^{-k_p}\mathcal {R}_p}\colon B\otimes \mathbb {A}_f\to \mathbb {C}$, where $k_p\in \mathbb {Z}_{\geq 0}$ for all $p$ and $k_p=0$ for almost every prime $p$. Denote $N=\prod _p p^{2k_p}\in \mathbb {N}$ and fix $\xi \in B^\times$. Then,

\[ \int_{\mathbf{G}(\mathbb{A}_f)} M_f(l^{-1}\xi)\,{d} l \ll_\varepsilon \varrho_{\mathbf{G}}^{-1} N |\operatorname{Nr} \xi| q^{\varepsilon}, \]

where we recall that $q$ denotes the level of $\mathcal {R}$, and this integral vanishes unless $\operatorname {Nr}\xi \in N^{-1}\mathbb {Z}$.

Proof. The integral decomposes into a product of local integrals $\varrho _{\mathbf {G}}^{-1}\prod _p \int _{\mathbf {G}(\mathbb {Q}_p)} \mathbb {1}_{p^{-k_p}\mathcal {R}_p}(l_p^{-1}\xi ) \,{d} l_p$. All elements of $p^{-k_p}\mathcal {R}_p$ have norms in $p^{-2 k_p} \mathbb {Z}_p$. Hence, the local integral vanishes if $\operatorname {Nr}(l_p^{-1}\xi )=\operatorname {Nr}(\xi )\ \not \in p^{-2 k_p} \mathbb {Z}_p$. Because $\operatorname {Nr} \xi \in \mathbb {Q}$, the non-vanishing conditions at all primes $p$ imply that $\int _{\mathbf {G}(\mathbb {A}_f)}\cdots$ vanishes if $\operatorname {Nr} \xi \not \in N^{-1}\mathbb {Z}$.

Fix now $p$ and assume $\operatorname {Nr}\xi \in p^{-2k_p} \mathbb {Z}_p$. Then, the local integral is equal to $\int _{\mathbf {G}(\mathbb {Q}_p)} \mathbb {1}_{\mathcal {R_p}}(l_p^{-1}p^{k_p}\xi )\, {d} l_p$. The integrand is right-invariant under $K_p$. Denote by $\mathcal {R}_p(\alpha )$ the set of elements in $\mathcal {R}_p$ of norm $\alpha \in \mathbb {Q}_p^{\times }$. Of course, $\mathcal {R}_p(\alpha )=\emptyset$ if $\alpha \not \in \mathbb {Z}_p$. The set $\mathcal {R}_p(\alpha )$ is left-invariant under multiplication by $K_p \colon- \mathcal {R}_p(1)$, and

\[ \int_{\mathbf{G}(\mathbb{Q}_p)} \mathbb{1}_{\mathcal{R_p}}(l_p^{-1}p^{k_p}\xi) \,{d} l_p = m_{\mathbf{G}(\mathbb{Q}_p)}(K_p) \cdot\#\Big({K_p ^{\backslash ^{\mathcal{R}_p\big(p^{2k_p}\operatorname{Nr}\xi\big)}}} \Big). \]

We have $m_{\mathbf {G}(\mathbb {Q}_p)}(K_p)=1$ if $p\not \mid q$ and $m_{\mathbf {G}(\mathbb {Q}_p)}(K_p)=(p+1)^{-1}p^{-n+1}$ if $p^n\parallel q$ with $n>0$.

We now estimate $\#\big ({K_p \backslash \mathcal {R}_p(\alpha )} \big )$. Define

\[ \mathcal{R}_p(\alpha)^{\dagger}=\big\{x\in\mathcal{R}_p\colon |\operatorname{Nr} x|_p=|\alpha|_p \big\}, \]

evidently $\mathcal {R}_p(\alpha )\subset \mathcal {R}_p(\alpha )^{\dagger}$. The set $\mathcal {R}_p(\alpha )^{\dagger}$ is invariant under left multiplication by $\mathcal {R}_p^\times$. Because $\operatorname {Nr} \mathcal {R}_p^\times =\mathbb {Z}_p^\times$, each coset of ${\mathcal {R}_p^\times \backslash \mathcal {R}_p(\alpha )^{\dagger} }$ contains exactly one coset of ${K_p \backslash \mathcal {R}_p(\alpha )}$. Thus, $\#\big ({K_p \backslash \mathcal {R}_p(\alpha )} \big )=\#\big ({\mathcal {R}_p^\times \backslash \mathcal {R}_p(\alpha )^{\dagger} } \big )$.

If $B_p$ ramifies, then the fact that $\operatorname {ord}_p(\operatorname {Nr} \bullet )$ is a valuation on $B_p$ implies that $\mathcal {R}(\alpha )^{\dagger}$ is a single coset of $\mathcal {R}_p^\times$ if $\alpha \in \mathbb {Z}_p$. In the split case, we can assume $B_p=\operatorname {Mat}_{2\times 2}(\mathbb {Q}_p)$ and $\mathcal {R}_p=\big (\begin{smallmatrix}\mathbb {Z}_p & \mathbb {Z}_p \\ p^n \mathbb {Z}_p & \mathbb {Z}_p \end{smallmatrix}\big )$, where $p^n\parallel q$. Let $\widetilde {\mathcal {R}}_p^\times$ be the image of $\mathcal {R}_p^\times$ in $\mathbf {PGL}_2(\mathbb {Q}_p)$. Then, the map ${\mathcal {R}_p^\times \backslash \mathcal {R}_p(\alpha )^{\dagger} }\to {\widetilde {\mathcal {R}}_p^\times \backslash \mathbf {PGL}_2(\mathbb {Q}_p)}$ is injective because $Z_{\mathbf {GL}_2}(\mathbb {Q}_p)\cap \{g\in \mathbf {GL}_2(\mathbb {Q}_p)\colon |\operatorname {det} g|_p=1\}\subset \mathcal {R}_p^\times$. Hence, it is enough to find an upper bound for the number of $\widetilde {\mathcal {R}}_p^\times$ cosets in the image of $\mathcal {R}_p(\alpha )^{\dagger}$ in $\mathbf {PGL}_2(\mathbb {Q}_p)$.

The group $\mathbf {PGL}_2(\mathbb {Z}_p)$ is the stabilizer of a vertex $v_0$ in the Bruhat–Tits tree of $\mathbf {PGL}_2(\mathbb {Q}_p)$ and $\operatorname {diag} (p^{-n}, 1) \mathbf {PGL}_2(\mathbb {Z}_p) \operatorname {diag}(p^n,1)$ is a stabilizer of a vertex $v_n$, with $\operatorname {dist}(v_0,v_n)=n$. Hence, $\widetilde {\mathcal {R}}_p^\times$, which is the intersection of the two, is the stabilizer of the geodesic path of length $n$ connecting $v_0$ and $v_n$ in the tree. Because $\mathbf {PGL}_2(\mathbb {Q}_p)$ acts strongly transitively on its Bruhat–Tits tree, it acts transitively on the set of geodesic paths of length $n$. Hence, the map $g_p\mapsto (g_p^{-1}.v_0, g_p^{-1}.v_n)$ is a bijection between ${\widetilde {\mathcal {R}}_p^{\times }\backslash \mathbf {PGL}_2(\mathbb {Q}_p)}$ and the set of oriented geodesic paths of length $n$ in the tree. We need to find an upper bound on the number of paths that correspond to the image of $\mathcal {R}_p(\alpha )^{\dagger}$.

Denote $\operatorname {ord}_p(\alpha )=m$. If $g_p\in \mathcal {R}_p(\alpha )^{\dagger}$, then the existence of the Smith normal form for $g_p\in M_2(\mathbb {Z}_p)$ implies that $g_p\in \mathbf {GL}_2(\mathbb {Z}_p) \operatorname {diag}(p^{m_1}, p^{m_2})\mathbf {GL}_2(\mathbb {Z}_p)$ for some $m_2\geq m_1\geq 0$, with $m_2+m_1=m$. Then, $\operatorname {dist}(g_p^{-1}.v_0,v_0)=m_2-m_1\leq m$. Hence, the number of possibilities for the first vertex of the $n$-path is at most the number of vertices in a ball of radius $m$, that is $1+ ((p+1)/(p-1))(p^m-1)\ll p^m$. Because the length of the path is $n$, the number of possibilities for the final vertex, after the first vertex has been fixed, is at most $\lfloor (p+1)p^{n-1}\rfloor =m_{\mathbf {G}(\mathbb {Q}_p)}(K_p)^{-1}$. We conclude that if $B_p$ is split, then

\[ m_{\mathbf{G}(\mathbb{Q}_p)}(K_p) \cdot\#\Big({K_p ^{\backslash ^{\mathcal{R}_p(p^{2k_p}\operatorname{Nr}\xi)}}} \Big)\ll p^{2k_p+\operatorname{ord}_p(\operatorname{Nr}\xi)}\ll |N \operatorname{Nr} \xi|_p^{-1}. \]

Multiplying the contributions from all primes $p$ we arrive at the claimed bound.

Proposition 5.4 Let $M\colon B_{\mathbb {A}}\to \mathbb {C}$ be a finite linear combination of standard test functions such that the component at infinity satisfies the decay condition of Proposition 3.23. Assume that $M$ is cuspidal. Fix $\varphi,\varphi ' \in L^\infty ([\mathbf {G}(\mathbb {A})])$ and let $\xi _\alpha \in B$ be an arbitrary element of norm $\alpha \in \mathbb {Q}^\times$. Then,

\[ \int_{[\mathbf{G}(\mathbb{A})]} \int_{[\mathbf{G}(\mathbb{A})]} \sum_{\xi\in B} M(l^{-1}\xi r) \varphi(l) \varphi'(r) \,{d} l \,{d} r =\sum_{\alpha \in \mathbb{Q}^\times} \int_{[\mathbf{G}(\mathbb{A})]} \varphi'(r) \int_{\mathbf{G}(\mathbb{A})} M(l^{-1}\xi_\alpha r) \varphi(l) \,{d} l \,{d} r, \]

where we make the convention that if there is no $\xi _{\alpha } \in B$ of norm $\alpha \in \mathbb {Q}^{\times }$, then the summand pertaining such $\alpha$ is to be omitted.

Remark 5.5 The assumption that $M$ is cuspidal is crucial here. Otherwise, there will be an additional contribution from the norm-zero elements of $B$. This contribution may in general diverge.

Proof. The theta series $\Theta _M$ can be rewritten as a sum over $B^\times$ due to the vanishing condition for norm-zero elements. A priori, we do not even know that the left-hand side is integrable. Thus, we proceed with the following computation as formal operations which hold for positive-valued functions. We will then use the positive-valued case to show absolute convergence which will justify these operations in general.

Unfold first the integral along the $l$ variable to rewrite the left-hand side above as

\[ \int_{[\mathbf{G}(\mathbb{A})]}\varphi'(r) \sum_{\xi\in {\mathbf{G}(\mathbb{Q})\backslash B^\times}} \int_{\mathbf{G}(\mathbb{A})} M(l^{-1}\xi r) \varphi(l) \,{d} l \,{d} r. \]

Two elements in $B^\times$ are in the same left $\mathbf {G}(\mathbb {Q})$-orbit exactly when they have the same norm. Thus, the equality in question holds if we can establish absolute integrability. To show absolute integrability, we first bound

\[ \int_{[\mathbf{G}(\mathbb{A})]} \int_{[\mathbf{G}(\mathbb{A})]} \sum_{\xi\in B}\big| M(l^{-1}\xi r) \varphi(l) \varphi'(r)\big| \,{d} l \,{d} r \leq \|\varphi\|_\infty \|\varphi'\|_\infty \sum_{\alpha\in \mathbb{Q}^\times} \int_{[\mathbf{G}(\mathbb{A})]} \int_{\mathbf{G}(\mathbb{A})} \big| M(l^{-1}\xi_\alpha r) \big| \,{d} l \,{d} r . \]

By expanding the function $M$ into finite summands of standard test functions, we reduce to the case that $M=\prod _v M_v$. Furthermore, because we are only interested in upper bounds, we can further reduce to the case that in any finite place $v=p$ the function $M_v$ is a multiple of the characteristic function of $p^{-k_p} \mathcal {R}_p$, where $k_p=0$ for almost all $p$. Taking into account this reduction, the function $M$ is right- and left-invariant under $K_{\mathcal {R}}$ and we can apply Lemma 5.3 above. First we deduce that the integral over $l$ vanishes unless $\alpha \in N^{-1}\mathbb {Z}$ for some fixed integer $N$ depending only on $M$. In addition, by using the bound from Lemma 5.3, we can write

(25)

\begin{align} &\int_{[\mathbf{G}(\mathbb{A})]} \int_{[\mathbf{G}(\mathbb{A})]} \sum_{\xi\in B}\big| M(l^{-1}\xi r) \varphi(l) \varphi'(r)\big| \,{d} l \,{d} r\nonumber\\ &\quad \ll_{\,\mathbf{G},\varphi,\varphi', M,\varepsilon} Nq^{\varepsilon} \sum_{0\neq \alpha\in N^{-1}\mathbb{Z}} |\alpha| \int_{{\Lambda \backslash \mathbf{G}(\mathbb{R})}} \int_{\mathbf{G}(\mathbb{R})} \big|M_\infty(l^{-1}\xi_\alpha r)\big|\,{d} l \,{d} r\nonumber\\ &\quad = Nq^{\varepsilon}\sum_{0\neq \alpha\in N^{-1}\mathbb{Z}} |\alpha| \int_{{\Lambda \backslash \mathbf{G}(\mathbb{R})}} \int_{\mathbf{G}(\mathbb{R})} \big|M_\infty(l^{-1}\xi_\alpha)\big| \,{d} l \,{d} r\nonumber\\ &\quad \ll_{\,\mathbf{G},\Lambda} Nq^{\varepsilon}\sum_{0\neq \alpha\in N^{-1}\mathbb{Z}} |\alpha| \int_{\mathbf{G}(\mathbb{R})} \big|M_\infty(l^{-1}\xi_\alpha)\big|\,{d} l\nonumber\\ &\quad \ll Nq^{\varepsilon}\sum_{0\neq \alpha\in N^{-1}\mathbb{Z}} |\alpha| \int_{\mathbf{G}(\mathbb{R})} \big(1+\big\|l^{-1}\xi_\alpha \big\|\big)^{-4-\delta} \,{d} l. \end{align}

In the second line, we have made a change of variable $l\mapsto \xi _\alpha r \xi _\alpha ^{-1}l$. Note that we can take here $\xi _\alpha$ to be any real matrix of determinant $\alpha$, choose $\xi _\alpha =\operatorname {diag} \big (\sqrt {|\alpha |}, \operatorname {sign}(\alpha )\sqrt {|\alpha |}\big )$. The last integral in (25) can be computed using the formula for the Haar measure in Cartan coordinates

\begin{align*} &\int_{\mathbf{G}(\mathbb{R})} \big(1+\sqrt{|\alpha|}\big\|l^{-1}\operatorname{diag}(1,\operatorname{sign}(\alpha)) \big\|\big)^{-4-\delta} \, {d} l\\ &\quad \ll \int_0^{\infty}\big(1+\sqrt{2|\alpha|\cosh(t))}\big)^{-4-\delta}\sinh(t) \,{d} t \\ &\quad\ll |\alpha|^{-2-\delta/2}\int_0^\infty \frac{\sinh(t)}{\cosh(t)^{2+\delta/2}} \,{d} t\\ &\quad =|\alpha|^{-2-\delta/2}\bigg[\frac{-1}{\cosh(t)^{1+\delta/2}(1+\delta/2)}\bigg]_0^\infty\ll |\alpha|^{-2-\delta/2}. \end{align*}

Finally, we see that the expression in (25) is bounded from above by

\[ N^{2+\delta/2}q^{\varepsilon}\sum_{0\neq n \in \mathbb{Z}} \frac{1}{|n|^{1+\delta/2}}<\infty.\]

Proposition 5.6 Let $M=\prod _vM_v\in \Omega$ be cuspidal and assume $M_\infty \in V_{m,2\pi }$ for $m\in \mathbb {Z}$. Fix $\varphi,\varphi '\in L^\infty ([\mathbf {G}(\mathbb {A})])$. Denote

\[ F(s)=\int_{[\mathbf{G}(\mathbb{A})]} \int_{[\mathbf{G}(\mathbb{A})]} \Theta_M(l,r;s) \varphi(l) \varphi'(r) \,{d} l \,{d} r. \]

Then, $F(s)\in L^2([\mathbf {SL}_2(\mathbb {A})])$.

Proof. By Proposition 5.4, we know that $F(s)$ is well-defined pointwise and can be rewritten as

\[ F(s)=\sum_{\alpha \in \mathbb{Q}^\times} \int_{[\mathbf{G}(\mathbb{A})]} \varphi'(r) \int_{\mathbf{G}(\mathbb{A})} \big(\rho(s)M\big)(l^{-1}\xi_\alpha r) \varphi(l) \,{d} l \,{d} r. \]

To calculate $\int _{[\mathbf {SL}_2(\mathbb {A})]} |F(s)|^2\,{d} s$, we will bound the integral over $[\mathbf {SL}_2(\mathbb {A})]$ by an integral over a Siegel set $\mathcal {S}=\mathcal {N}\cdot A^> \cdot \mathbf {SO}_2(\mathbb {R}) \mathbf {SL}_2(\hat {\mathbb {Z}})$ as in the proof of Proposition 4.9. Because $M$ belongs to $\Omega$ and $M_\infty \in V_{m,2\pi }$, the function $M$ has a finite $\rho \big ( \mathbf {SL}_2(\hat {\mathbb {Z}})\big )$-orbit and $\rho \big (\mathbf {SO}_2(\mathbb {R})\big )$-isotypic. Hence, it is enough to bound $\int _{\mathcal {N}\cdot A^>} F_1(z) \overline {F_2(z)} \,{d} z$ where $F_1$, $F_2$ are defined in the same manner as $F$ but with $M$ replaced by test functions $M_1$, $M_2$ in the $\rho \big (\mathbf {SO}_2(\mathbb {R}) \mathbf {SL}_2(\hat {\mathbb {Z}})\big )$-orbit of $M$. Denote $a(y)=\operatorname {diag}(y^{1/2},y^{-1/2})\in \mathbf {SL}_2(\mathbb {R})$. Using the orthogonality relation of additive characters and the sup-norm bound on $\varphi$, $\varphi '$, we arrive at

(26)

\begin{align} &\bigg|\int_{\mathcal{N}\cdot A^>} F_1(z) \overline{F_2(z)} \,{d} z\bigg|\nonumber\\ &\quad \ll_{\,\varphi,\varphi'} \sum_{\alpha\in\mathbb{Q}^\times} \int_{[\mathbf{G}(\mathbb{A})]} \int_{\mathbf{G}(\mathbb{A})} \int_{[\mathbf{G}(\mathbb{A})]} \int_{\mathbf{G}(\mathbb{A})}\nonumber\\ &\qquad \int_{\sqrt{3}/2}^{\infty} \bigg|\rho_\infty(a(y)).M_1(l_1^{-1} \xi_\alpha r_1) \overline{\rho_\infty(a(y)). M_2(l_2^{-1} \xi_\alpha r_2)}\bigg| \frac{{d} y}{y^2} \,{d} l_2 \,{d} r_2 \,{d} l_1 \,{d} r_1\nonumber\\ &\quad =\sum_{\alpha\in\mathbb{Q}^\times} \int_{\mathbf{G}(\mathbb{A})} \int_{\mathbf{G}(\mathbb{A})} \int_{\sqrt{3}/2}^{\infty} \bigg|\rho_\infty(a(y)).M_1(l_1^{-1} \xi_\alpha) \overline{\rho_\infty(a(y)).M_2(l_2^{-1} \xi_\alpha )}\bigg|\frac{{d} y}{y^2} \,{d} l_2 \,{d} l_1. \end{align}

As in the proof of Proposition 5.4, we apply Lemma 5.3 to the integrals in the $l_1$ and $l_2$ variables. The integral vanishes unless $\alpha \in N^{-1}\mathbb {Z}$ for some integer $N>0$ depending only on $M$. For every $\varepsilon >0$, we can bound (26) from above by

\begin{align*} &\varrho_{\mathbf{G}}^{-2}\sum_{0\neq \alpha\in N^{-1}\mathbb{Z}} |N\alpha|^{2}q^{\varepsilon} \int_{\mathbf{G}(\mathbb{R})} \int_{\mathbf{G}(\mathbb{R})} \int_{\sqrt{3}/2}^{\infty} \Big|\rho_\infty(a(y)).M_1(l_1^{-1} \xi_\alpha) \overline{\rho_\infty(a(y)).M_2(l_2^{-1} \xi_\alpha )}\Big| \frac{{d} y}{y^2} \,{d} l_2 \,{d} l_1\\ &\quad=\varrho_{\mathbf{G}}^{-2} \sum_{0\neq \alpha\in N^{-1}\mathbb{Z}} |N\alpha|^{2}q^{\varepsilon} \int_{\mathbf{G}(\mathbb{R})} \int_{\mathbf{G}(\mathbb{R})} \int_{\sqrt{3}/2}^{\infty} \Big|M_1(\sqrt{y} l_1^{-1} \xi_\alpha) \overline{M_2(\sqrt{y} l_2^{-1} \xi_\alpha )}\Big|\, {{d} y} \,{d} l_2 \,{d} l_1\\ &\quad\ll_{\mathbf{G},M} \sum_{0\neq \alpha\in N^{-1}\mathbb{Z}} |N\alpha|^{2}q^{\varepsilon} \int_{\mathbf{G}(\mathbb{R})} \int_{\mathbf{G}(\mathbb{R})} \int_{\sqrt{3}/2}^{\infty} \|\sqrt{y} l_1^{-1} \xi_\alpha\|^{-4-\delta}\|\sqrt{y} l_2^{-1} \xi_\alpha\|^{-4-\delta} \,{{d} y} \,{d} l_2 \,{d} l_1\\ &\quad \ll \sum_{0\neq \alpha\in N^{-1}\mathbb{Z}} |N\alpha|^{2}q^{\varepsilon} \int_{\mathbf{G}(\mathbb{R})} \int_{\mathbf{G}(\mathbb{R})} \|l_1^{-1} \xi_\alpha\|^{-4-\delta}\|l_2^{-1} \xi_\alpha\|^{-4-\delta} \,{d} l_2 \,{d} l_1 . \end{align*}

Take $\xi _\alpha =\operatorname {diag}(\sqrt {|\alpha |},\operatorname {sign}(\alpha )\sqrt {|\alpha |})$ and bound the last integral from above in the same manner as in the proof of Proposition 5.4 by a multiple of $|\alpha |^{-4-\delta }$. It follows that

\[ \int_{[\mathbf{SL}_2(\mathbb{A})]} |F(s)|^2 \,{d} s \ll_{\varphi,\varphi',G, M, \varepsilon} N^{4+\delta}q^{\varepsilon}\sum_{0\neq n \in \mathbb{Z}} \frac{1}{|n|^{2+\delta}}<\infty.\]

5.3 The theta lift

Definition 5.7 Let $\varphi, \varphi ' \in L^\infty ([\mathbf {G}(\mathbb {A})])$ and fix $M\in \Omega$ cuspidal with $M_{\infty } \in V_{m, 2\pi }$ for some $m \in \mathbb {Z}$. Define $(\varphi \otimes \overline {\varphi '} )_M\colon [\mathbf {SL}_2(\mathbb {A})]\to \mathbb {C}$ by

\[ (\varphi \otimes \overline{\varphi'} )_M(s)\colon= \int_{[\mathbf{G}(\mathbb{A})]} \int_{[\mathbf{G}(\mathbb{A})]} \Theta_M(l,r;s) \varphi(l) \overline{\varphi'(r)} \,{d} r \,{d} l. \]

We call $(\varphi \otimes \overline {\varphi '} )_M$ the theta lift of $\varphi \otimes \overline {\varphi '}$.

In the case when $\varphi = \varphi '$, we simply let $\varphi _M \colon= (\varphi \otimes \overline {\varphi } )_M$ and call $\varphi _M$ the theta lift of $\varphi$.

For any $\alpha \in \mathbb {Q}^\times$, we also define

\[ T_\alpha^M\varphi(r)\colon= \begin{cases} \displaystyle\int_{\mathbf{G}(\mathbb{A})} M(l^{-1}\xi_\alpha r) \varphi(l) \,{d} l, & \alpha\in\operatorname{Nr} B^\times, \\ 0, & \textrm{otherwise}. \end{cases} \]

Assume $M=M_\infty M_f$ with $M_f=\prod _p M_p$. It would be useful to separate the finite and the archimedean parts in the integral above. This motivates the definition

\[ T_{\alpha}^{M_f}\varphi(r)\colon= \int_{\mathbf{G}(\mathbb{A}_f)} M_f(l_f^{-1}\xi_\alpha r_f) \varphi\bigg(\!\bigg(\frac{\xi_\alpha}{\sqrt{|\alpha|}}\bigg)_\infty r_\infty \epsilon_\infty^{(1-\operatorname{sign} \alpha)/2}\cdot l_f\bigg) \,{d} l_f, \]

where $\epsilon _\infty \in B\otimes \mathbb {R}$ normalizes $K_\infty$ and satisfiesFootnote ¹¹ $\operatorname {Nr}\epsilon _\infty =-1$, $\epsilon _\infty ^2=1$. Using the change of variable $\big ({\xi _\alpha / \sqrt {|\alpha |}}\big )^{-1}l_\infty \epsilon _\infty ^{(1-\operatorname {sign}(\alpha ))/2}\mapsto l_\infty$, we arrive at

(27)

\begin{align} T_{\alpha}^M \varphi(r)&= \int_{\mathbf{G}(\mathbb{R})} M_\infty\big(\sqrt{|\alpha|} \epsilon_\infty^{(1-\operatorname{sign} \alpha)/2} l_\infty^{-1} r_\infty\big) T_\alpha^{M_f}\varphi(l_\infty r_f) \,{d} l_\infty\nonumber\\ &=\big(T_\alpha^{M_f}\varphi \star_{\mathbf{G}(\mathbb{R})} M_\infty\big(\sqrt{|\alpha|} \epsilon_\infty^{(1-\operatorname{sign} \alpha)/2} \cdot \bullet\big)\big)(r) . \end{align}

Note that by Propositions 5.4 and 5.6 the theta lift $\varphi _M$ is well-defined pointwise and belongs to $L^2([\mathbf {SL}_2(\mathbb {A})])$. The proof of Proposition 5.6 implies that $T_\alpha ^M\varphi$ is a square-integrable function on $[\mathbf {G}(\mathbb {A})]$ and that

(28)

\begin{equation} \varphi_M(s)=\sum_{\alpha\in\mathbb{Q}^\times} \langle T^{\rho(s)M}_\alpha \varphi,\varphi\rangle \end{equation}

not only in an $L^2$-sense, but in fact pointwise.

Proposition 5.8 Let $\varphi$ and $M$ be as in Definition 5.7. Then, for all $\alpha \in \mathbb {Q}^\times$,

\[ W_{\varphi_M}(s;\alpha)=\langle T^{\rho(s)M}_\alpha \varphi,\varphi\rangle. \]

More generally, fix $\varphi,\varphi '\in L^\infty ([\mathbf {G}(\mathbb {A})])$. Then, $W_{(\varphi \otimes \overline {\varphi '} )_M}(s;\alpha )=\langle T^{\rho (s)M}_\alpha \varphi,\varphi '\rangle$, and

(29)

\begin{align} (\varphi \otimes \overline{\varphi'} )_M(s)=\sum_{\alpha\in\mathbb{Q}^\times} \langle T^{\rho(s)M}_\alpha \varphi,\varphi'\rangle. \end{align}

Proof. We only establish the second claim as it immediately implies the first. Proposition 5.6 implies $(\varphi \otimes \overline {\varphi '} )_M(s)\in L^2([\mathbf {SL}_2(\mathbb {A})])$. We then apply Propositions 5.4 to deduce (29).

We use the notation $u_n\colon= \left (\begin{smallmatrix} 1 & n \\ 0 & 1 \end{smallmatrix}\right )$. Fubini's theorem and the orthogonality of characters imply for all $\alpha,\beta \in \mathbb {Q}^\times$, $s \in \mathbf {SL}_2(\mathbb {A})$, and $x\in \mathbf {G}(\mathbb {A})$

\[ \int_{[\mathbf{N}(\mathbb{A})]} T^{\rho(u_n s)M}_\alpha \varphi(x) \psi(-\beta n)\,{d} n=\begin{cases} T^{\rho(s)M}_\alpha\varphi (x), & \alpha=\beta,\\ 0, & \alpha \neq \beta. \end{cases} \]

The claim follows from substituting this expression in the definition of the Whittaker function applied to (29).

5.3.1 Hecke operators

We would like to describe the relation between the Fourier–Whittaker expansion of $\varphi _M$ and the Hecke translates of $\varphi$. A minor difficulty is that the Hecke algebra of $\mathbf {G}(\mathbb {A})$ is not rich enough and we would prefer to work with the Hecke algebra of the adjoint group $\mathbf {G}^\mathrm {adj}(\mathbb {A})$. To that end, we lift a $K_{\mathcal {R}}$-invariant function on $[\mathbf {G}(\mathbb {A})]$ to $[\mathbf {G}^\mathrm {adj}(\mathbb {A})]$. An alternative more conceptual approach is to work with a $\mathbf {PGL}_2$ Weil representation, cf. [Reference WaldspurgerWal85, § I.3] and [Reference NelsonNel17, § 2.2.5].

Let us recall that the adjoint group is the affine algebraic group over $\mathbb {Q}$ representing the functor

\[ \mathbf{G}^\mathrm{adj}(L)\colon= {L^{\times \backslash ^{(B\otimes L)^\times}}} \]

for any $\mathbb {Q}$-algebra $L$, where $L^\times$ is embedded centrally in $(B\otimes L)^\times$. We will also use the algebraic group $\mathbf {B}^\times (L)=(B\otimes L)^\times$, i.e. $\mathbf {G}^\mathrm {adj}={Z_{\mathbf {B}^\times }\backslash \mathbf {B}^\times }$.

Definition 5.9 For each finite place $v$ denote by $\widetilde {K}_v$ the image of $\mathcal {R}_v^\times$ in $\mathbf {G}^\mathrm {adj}(\mathbb {Q}_v)= {\mathbb {Q}_v^\times \backslash B_v^\times }$. We further let $\widetilde {K}_{\mathcal {R}} \colon- \prod _{v < \infty } \widetilde {K}_v$.

Proposition 5.10 The natural map

\[ {\mathbf{G}(\mathbb{Q})}^{{\backslash^{\mathbf{G}(\mathbb{A})}}/_{K_{\mathcal{R}}}}\rightarrow {\mathbf{G}^\mathrm{adj}(\mathbb{Q})}^{\backslash^{\mathbf{G}^\mathrm{adj}(\mathbb{A})}/_{\widetilde{K}_{\mathcal{R}}}} \]

is a measure-preserving bijection. In particular, we have a Hilbert space isomorphism between $L^2([\mathbf {G}(\mathbb {A})])^{K_{\mathcal {R}}}$ and $L^2([\mathbf {G}^\mathrm {adj}(\mathbb {A})])^{K_{\mathcal {R}}}$.

Proof. Denote by $h \colon \mathbf {G}\to \mathbf {G}^{\mathrm {adj}}$ the standard isogeny. The image is a normal subgroup and the quotient is abelian. The kernel of the map $h$ is the center $\mathbf {Z}<\mathbf {G}$. The center is isomorphic to the group of second-order roots of unity $\mu _2$. The reduced norm map then completes the exact sequence

\[ 1\to\mu_2\to\mathbf{G}\overset{h}{\to}\mathbf{G}^{\mathrm{adj}}\xrightarrow{\operatorname{Nr}}{\mathbb{G}_m^{2}}\backslash ^{\mathbb{G}_m\to 1.} \]

For a local field or a number field $F$ the image of $\operatorname {Nr}((B\otimes F)^\times )$ in $F^\times$, is determined by the Hasse–Schilling–Maass theorem. In particular, $\operatorname {Nr}((B\otimes F)^\times )=F^\times$ if $F=\mathbb {Q}_p$, or $F=\mathbb {R}$ and $B$ is indefinite. If $B$ is definite, then $\operatorname {Nr}((B\otimes \mathbb {R})^\times )=\mathbb {R}_{>0}$. Finally, $\operatorname {Nr}((B\otimes \mathbb {Q})^\times )$ is $\mathbb {Q}^\times$ if $B$ is indefinite and $\mathbb {Q}_{>0}$ otherwise. It follows that $ {h(\mathbf {G}(\mathbb {A}))\backslash \mathbf {G}^\mathrm {adj}(\mathbb {A})}\xrightarrow [\operatorname {Nr}]{\sim }{\mathbb {A}^{\times 2}\backslash \mathbb {A}^\times }$ if $B$ is indefinite and

\[ \textstyle {h(\mathbf{G}(\mathbb{A}))\backslash \mathbf{G}^\mathrm{adj}(\mathbb{A})}\xrightarrow[\operatorname{Nr}]{\sim}{\mathbb{A}^{\times 2}\backslash \big(\mathbb{R}_{>0}\times\mathbb{A}_f^\times\big)}\simeq {\mathbb{A}_f^{\times 2}\backslash \mathbb{A}_f^\times} \]

if $B$ is definite.

Injectivity. Assume $h(g')=\tilde {\gamma } h(g) \tilde {k}$ for some $g,g'\in \mathbf {G}(\mathbb {A})$, $\tilde {\gamma }\in \mathbf {G}^{\mathrm {adj}}(\mathbb {Q})$, and $\tilde {k}\in \widetilde {K}_{\mathcal {R}}$. We need to show $[g]=[g']$ in ${\mathbf {G}(\mathbb {Q})}\backslash{\mathbf {G}(\mathbb {A})}/{K_{\mathcal {R}}}$. To show that $\tilde {\gamma }\in h(\mathbf {G}(\mathbb {Q}))$ we establish that $\operatorname {Nr}(\tilde {\gamma })$ is a square in $\mathbb {Q}^\times$, this can be checked locally at all places. Examining the archimedean component of the equality, we arrive at $\tilde {\gamma }=h(g'_\infty g_\infty ^{-1})$. Hence, $\operatorname {Nr} \tilde {\gamma }$ is positive. Similarly, at all finite places $p<\infty$ we can write $\tilde {\gamma }=h(g'_pg_p^{-1})\tilde {k}_p^{-1}$ and $|\operatorname {Nr}\tilde {\gamma }|_p=|h(g_p'g_p^{-1})|_p\in p^{2\mathbb {Z}}$. Thus, $\tilde {\gamma }=h(\gamma )$ for some $\gamma \in \mathbf {G}(\mathbb {Q})$. We can now write $h(g')=h(\gamma g)\tilde {k}$. Hence, $\operatorname {Nr} \tilde {k}=1$ as well, and $\tilde {k}=h(k)$ for some $k\in K_{\mathcal {R}}$. We deduce $h(g')=h(\gamma g k)$, and $[g']\in [\mathbf {Z}(\mathbb {A}) g ]$ in ${\mathbf {G}(\mathbb {Q})}\backslash{\mathbf {G}(\mathbb {A})}/{K_{\mathcal {R}}}$.

To conclude $[g']=[g]$ it is enough to show that ${\mathbf {Z}(\mathbb {Q})}\backslash{\mathbf {Z}(\mathbb {A})}/{\prod _{p}\big (K_p\cap \mathbf {Z}(\mathbb {Q}_p)\big )}$ is a trivial group. Because $K_p$ contains $\pm \operatorname {Id}$ for all $p$, this group is ${\pm 1}\backslash{\mathbb {A}^\times [2]}/{\prod _p \mathbb {Z}_p^\times [2]}\simeq 1$ as required.

Surjectivity. Using the norm map, it is enough to demonstrate that $ {\mathbb {A}^{\times 2}\mathbb {Q}^\times }\backslash{\mathbb {A}^\times }/{\prod _p \operatorname {Nr} \widetilde {K}_p}$ is trivial. Lemma 5.2 implies that $\operatorname {Nr} \widetilde {K}_p= {\mathbb {Z}_p^{\times 2}\backslash \mathbb {Z}_p^\times }$ for all $p$. Because $\mathbb {Q}$ has class number $1$, the double quotient is isomorphic to ${\mathbb {R}^{\times 2}}\backslash{\mathbb {R}^\times }/{\mathbb {Z}^\times }\simeq 1$.

Measure preservation. Strong approximation implies that the group $\mathbf {G}(\mathbb {R})$ acts transitively on the left-hand side in the claimed equality. Hence, it acts transitively on the right-hand side as well because the map is equivariant. The Haar measure on both spaces is a $\mathbf {G}(\mathbb {R})$-invariant probability measure on a locally compact homogeneous $\mathbf {G}(\mathbb {R})$-space. Uniqueness of Haar measure implies that the map is measure preserving.

Definition 5.11 Let $\varphi \colon [\mathbf {G}(\mathbb {A})]\to \mathbb {C}$ be $K_{\mathcal {R}}$-invariant. Denote by $\widetilde {\varphi }\colon [\mathbf {G}^{\mathrm {adj}}(\mathbb {A})]\to \mathbb {C}$ its unique lift to a $\widetilde {K}_\mathcal {R}$-invariant function on $[\mathbf {G}^{\mathrm {adj}}(\mathbb {A})]$.

We use the notation $\widetilde {\Gamma }=\mathbf {G}^\mathrm {adj}(\mathbb {Q})\cap \widetilde {K}_\mathcal {R}$, which is a lattice in $\mathbf {G}^\mathrm {adj}(\mathbb {R})$. Equivalently, $\widetilde {\Gamma }$ is the image of $\mathcal {R}^\times$ in $\mathbf {G}^\mathrm {adj}(\mathbb {R})$. Proposition 5.10 implies that $\mathbf {G}^{\mathrm {adj}}(\mathbb {R})$ acts transitively on ${\mathbf {G}^\mathrm {adj}(\mathbb {Q})\backslash\mathbf {G}^\mathrm {adj}(\mathbb {A})/ \widetilde {K}_{\mathcal {R}}}$ and ${\mathbf {G}^\mathrm {adj}(\mathbb {Q})\backslash\mathbf {G}^\mathrm {adj}(\mathbb {A})/\widetilde {K}_{\mathcal {R}}}\simeq {\widetilde {\Gamma }\backslash \mathbf {G}^{\mathrm {adj}}}(\mathbb {R})$. We introduce Hecke operators adapted to the order $\mathcal {R}$.

Definition 5.12 Let $\alpha \in \mathbb {Q}$ and $f\colon {\widetilde {\Gamma }\backslash \mathbf {G}^{\mathrm {adj}}(\mathbb {R})}\to \mathbb {C}$ continuous. Set

\[ \mathcal {R}(\alpha )=\{x\in \mathcal {R}\mid \operatorname {Nr} x =\alpha \}, \mathcal {R}(\alpha )^{\dagger} =\{x\in \mathcal {R}\mid |\operatorname {Nr} x|_\infty =|\alpha |_\infty \} \]

and define

\[ T_\alpha f(g)= \sum_{[\delta^+]\in {\mathcal{R}(1)\backslash \mathcal{R}(\alpha)}} f\big(\delta^+ g \epsilon_{\infty}^{(1-\operatorname{sign}\alpha)/2}\big) =\sum_{[\delta]\in {\mathcal{R}^\times \backslash \mathcal{R}(\alpha)^{\dagger}}} f\big(\delta g \epsilon_{\infty}^{(1-\operatorname{sign}\alpha)/2}\big) . \]

The two expressions are equal because $\mathcal {R}^\times$ contains an element of determinant $-1$ if $B$ is indefinite. These operators coincide with the classical Hecke operators for $\alpha >0$ co-prime to $q D_B$. Note that if $\alpha \not \in \operatorname {Nr}\mathcal {R}$ then $T_\alpha =0$.

Lemma 5.13 Let $\alpha \in \mathbb {Q}^\times$ and $f\colon [\mathbf {G}^{\mathrm {adj}}(\mathbb {A})]\to \mathbb {C}$ continuous and $\widetilde {K}_\mathcal {R}$-invariant. Set

\[ \mathcal{R}_f(\alpha)^{\dagger}=\prod_p \big\{ x_p\in\mathcal{R}_p \mid |\operatorname{Nr} x_p|_p = |\operatorname{Nr} \alpha|_p\big\} . \]

Then, for every $g\in \mathbf {G}^{\mathrm {adj}}(\mathbb {R})$,

\[ \big(T_\alpha f(\bullet \widetilde{K}_\mathcal{R})\big)(g)=\big( f\star \mathbb{1}_{\mathcal{R}_f(\alpha)^{\dagger}}\big)(g \epsilon_{\infty}^{(1-\operatorname{sign}\alpha)/2}), \]

where the convolution takes place in $\mathbf {B}^\times (\mathbb {A}_f)$ with the measure normalization $m_{\mathbf {B}^\times (\mathbb {A}_f)}(\mathcal {R}_f^\times )=1$.

Proof. The right $\widetilde {K}_{\mathcal {R}}$-invariance of $f$ and the left $\mathcal {R}_f^\times$-invariance of $\mathbb {1}_{\mathcal {R}_f(\alpha )^{\dagger} }$ imply

\[ \big( f\star \mathbb{1}_{\mathcal{R}_f(\alpha)^{\dagger}}\big)(g)=\int_{\mathcal{R}_f(\alpha)^{\dagger}} f(g h_f^{-1}) \,{d} h_f =\sum_{[a_f]\in {\mathcal{R}_f^\times\backslash \mathcal{R}_f(\alpha)^{\dagger}}} f(g a_f^{-1}). \]

There is a natural map ${\mathcal {R}^\times \backslash \mathcal {R}(\alpha )^{\dagger} }\to {\mathcal {R}_f^\times \backslash \mathcal {R}_f(\alpha )}^{\dagger}$. Strong approximation implies that this map is surjective. To show this map is also injective, we observe that if $\delta \equiv \delta ' \bmod \mathcal {R}_f^\times$ for $\delta,\delta '\in \mathcal {R}(\alpha )^{{\dagger} }$, then $\delta \delta '^{-1},\delta '\delta ^{-1}\in B \cap \mathcal {R}_f^\times \subset \mathcal {R}$, and $\delta \delta '^{-1}\in \mathcal {R}^\times$. By choosing a rational representative for each coset in ${\mathcal {R}_f^\times \backslash \mathcal {R}_f(\alpha )^{{\dagger} }}$ and using the left $\mathbf {G}^\mathrm {adj}(\mathbb {Q})$-invariance of $f$, we arrive at the claim.

Proposition 5.14 Let $0\neq \alpha \in \mathbb {Q}^\times$ and assume $M_f=\prod _p \mathbb {1}_{\mathcal {R}_p}$. Then, for $\varphi$ and $\widetilde {\varphi }$ as in Definition 5.11, we have

\[ \widetilde{T^{M_f}_\alpha \varphi}(r)=T_\alpha \widetilde{\varphi}\big(r\big). \]

Proof. Assume $\alpha \in \operatorname {Nr}\mathcal {R}$, otherwise the claim is trivial. Although we claim the equality for all $r\in [\mathbf {G}^\mathrm {adj}(\mathbb {A})]$, because of the uniqueness of the lift in Definition 5.11, it is enough to verify the claim for $r\in [{\mathbf {Z}\backslash \mathbf {G}}(\mathbb {A})]$. We apply Lemma 5.13 and evaluate the convolution by decomposing the Haar measure on $\mathbf {B}^\times (\mathbb {A}_f)$ into fibers over ${\mathbf {G}(\mathbb {A}_f)\backslash \mathbf {B}^\times (\mathbb {A}_f)}$, this is possible because $\mathbf {B}^\times (\mathbb {A}_f)$ and $\mathbf {G}(\mathbb {A}_f)$ are unimodular. For consistent measure normalization, we set $m_{\mathbf {B}^\times (\mathbb {A}_f)}(\mathcal {R}_f^\times )=m_{\mathbf {G}(\mathbb {A}_f)}(K_f)=m_{\operatorname {Nr} B_{\mathbb {A}_f}^\times }(\widehat {\mathbb {Z}}^\times )=1$:

\begin{align*} \widetilde{\varphi} \star \mathbb{1}_{\mathcal{R}_f(\alpha)^{\dagger}} (r)&= \int_{\mathbf{B}^\times(\mathbb{A}_f)} \mathbb{1}_{\mathcal{R}_f(\alpha)^{\dagger}}(l_f^{-1} r_f) \widetilde{\varphi}(r_\infty l_f) \,{d} l_f \\ &=\int_{{\mathbf{G}(\mathbb{A}_f)\backslash \mathbf{B}^\times(\mathbb{A}_f)}} \int_{\mathbf{G}(\mathbb{A}_f)} \mathbb{1}_{\mathcal{R}_f(\alpha)^{\dagger}}(\lambda^{-1} l_f^{-1} r_f) \widetilde{\varphi}(r_\infty l_f \lambda) \,{d} l_f \,{d}\lambda\\ &= \int_{{\mathbf{G}(\mathbb{A}_f)}\backslash{\mathbf{B}^\times(\mathbb{A}_f)/\mathcal{R}_f^\times}} \int_{\mathbf{G}(\mathbb{A}_f)} \mathbb{1}_{\mathcal{R}_f(\alpha)^{\dagger}}(\lambda^{-1} l_f^{-1} r_f) \widetilde{\varphi}(r_\infty l_f \lambda) \,{d} l_f \,{d}\lambda. \end{align*}

In the last line, we have used the fact that $\mathbb {1}_{\mathcal {R}_f(\alpha )^{\dagger} }(l_f^{-1} r_f)$ is left $\mathcal {R}_f^\times$-invariant and $\widetilde {\varphi }$ is right $\widetilde {K}_f$-invariant. Fix $\xi _\alpha \in B^\times$ with $\operatorname {Nr} \xi _\alpha =\alpha$. Because $\operatorname {Nr} l_f^{-1} r_f=1$ and $\operatorname {Nr} \mathcal {R}_f(\alpha )^{\dagger} =\hat {\mathbb {Z}}^\times \alpha$, the inner integral vanishes unless $\lambda \equiv \xi _\alpha ^{-1} \bmod {\mathbf {G}(\mathbb {A}_f)\backslash\mathbf {B}^\times (\mathbb {A}_f)}/{\mathcal {R}_f^\times }$. We conclude that

\begin{align*} \widetilde{\varphi} \star \mathbb{1}_{\mathcal{R}_f(\alpha)^{\dagger}} (r) &=\int_{\mathbf{G}(\mathbb{A}_f)} \mathbb{1}_{\mathcal{R}_f(\alpha)^{\dagger}}(\xi_\alpha l_f^{-1} r_f) \widetilde{\varphi}(r_\infty l_f (\xi_\alpha)_f^{-1}) \,{d} l_f\\ &=\int_{\mathbf{G}(\mathbb{A}_f)} \mathbb{1}_{\mathcal{R}_f(\alpha)^{\dagger}}(l_f^{-1} \xi_\alpha r_f) \widetilde{\varphi}(r_\infty(\xi_\alpha)_f^{-1} l_f) \,{d} l_f\\ &=\int_{\mathbf{G}(\mathbb{A}_f)} \mathbb{1}_{\mathcal{R}_f(\alpha)^{\dagger}}(l_f^{-1} \xi_\alpha r_f) \widetilde{\varphi}((\xi_\alpha)_\infty r_\infty l_f) \,{d} l_f\\ &=\int_{\mathbf{G}(\mathbb{A}_f)} \mathbb{1}_{\mathcal{R}_f(\alpha)^{\dagger}}(l_f^{-1} \xi_\alpha r_f) \widetilde{\varphi}\bigg(\!\bigg(\frac{\xi_\alpha}{\sqrt{|\alpha|}}\bigg)_\infty r_\infty l_f\bigg) \,{d} l_f\\ &= T_\alpha^{M_f} \varphi (r\epsilon_\infty^{(1-\operatorname{sign} \alpha)/2}), \end{align*}

where in the first line we have used the change of variables $\xi _\alpha l_f \xi _\alpha ^{-1}\mapsto l_f$ and in the second line we have applied the left $\mathbf {G}^{\mathrm {adj}}(\mathbb {Q})$-invariance of $\widetilde {\varphi }$.

Corollary 5.15 Let $\varphi,\varphi '\in L^{\infty }([\mathbf {G}(\mathbb {A})])^{K_\mathcal {R}}$ and $\alpha \in \mathbb {Q}^\times$. Assume $T_\alpha \varphi =\lambda (\alpha )\varphi$, $M_f=\prod _p \mathbb {1}_{\mathcal {R}_p}$, and $M_{\infty } \in V_{m,2\pi }$ for some $m \in \mathbb {Z}$. Then, the function

\[ F(s)= \int_{[\mathbf{G}(\mathbb{A})]} \int_{[\mathbf{G}(\mathbb{A})]} \Theta_M(l,r;s) \varphi(l) \overline{\varphi'(r)} \,{d} r \,{d} l\]

satisfies

\[ W_{F}(s_\infty U_{\mathcal{R}};\alpha)=\lambda(\alpha)\big\langle \varphi \star \big(\rho(s_\infty).M_\infty\big(\sqrt{|\alpha|} \epsilon_\infty^{(1-\operatorname{sign} \alpha)/2} \cdot \bullet\big)\big), \varphi' \big\rangle \]

for all $s_\infty \in \mathbf {SL}_2(\mathbb {R})$, where the convolution takes place in $\mathbf {G}(\mathbb {R})$ and where we recall $U_{\mathcal {R}}$ from Definition 4.4.

Proof. Recall $\rho (U_{\mathcal {R}}).\mathbb {1}_{\mathcal {R}_f}=\mathbb {1}_{\mathcal {R}_f}$ from the sentence immediately below Definition 4.4. Propositions 5.8, 5.14, and (27) then imply that

\begin{align*} W_{F}(s_\infty U_{\mathcal{R}};\alpha)&=\big\langle T_{\alpha}^{M_f}\varphi \star \big(\rho(s_\infty).M_\infty\big(\sqrt{|\alpha|} \epsilon_\infty^{(1-\operatorname{sign} \alpha)/2} \cdot \bullet\big)\big), \varphi' \big\rangle\\ &=\lambda(\alpha)\big\langle \varphi \star \big(\rho(s_\infty).M_\infty\big(\sqrt{|\alpha|} \epsilon_\infty^{ (1-\operatorname{sign} \alpha)/2} \cdot \bullet\big)\big), \varphi' \big\rangle. \end{align*}

6. The Bergman kernel

6.1 The Bergman Archimedean test function

From now on, we shall assume that $B$ is split over $\mathbb {R}$. Recall that we have fixed an isomorphism $B_\infty \simeq \operatorname {Mat}_{2\times 2}(\mathbb {R})$ and have used it to identify the two spaces. We construct a theta series whose Fourier–Whittaker coefficients coincide with the Bergman kernel. For this endeavor, we will use the following archimedean test function. We fix the global character $\psi$ as in § 3.4.

Definition 6.1 Fix a weight $m\geq 2$ and define

\[ M_{\infty}^{(m)}(x)=\exp(-2\pi \operatorname{Nr} x) \begin{cases} \displaystyle\frac{\operatorname{Nr}(x)^{m-1}}{\Big(\frac{(b-c)+i(a+d)}{2i}\Big)^m}, & \operatorname{Nr} x >0, \\ 0, & \operatorname{Nr} x \leq 0, \end{cases} \]

for $x=\left (\begin{smallmatrix} a & b \\ c & d \end{smallmatrix}\right )$. Note that $M_\infty ^{(m)}(^\iota{x}{})=\overline {M_\infty ^{(m)}}(x)$.

Set $\mu \colon \mathbf {PGL}_2(\mathbb {R})\to \mathbb {C}$

\[ \mu(x)= \begin{cases} \displaystyle \frac{2i\sqrt{\operatorname{Nr} x}}{(b-c)+i(a+d)}, & \operatorname{Nr} x >0,\\ 0, & \operatorname{Nr} x \le 0. \end{cases} \]

Then, we can write $M_\infty ^{(m)}(x)=\exp (-2 \pi \operatorname {Nr} x) \operatorname {Nr}(x)^{m/2-1}\mu (x)^m$.

Lemma 6.2 Let $k_{\theta }\colon= \left (\begin{smallmatrix} \cos \theta & \sin \theta \\ -\sin \theta & \cos \theta \end{smallmatrix}\right ) \in \mathbf {SO}_2(\mathbb {R})$. Then, for every $g\in \mathbf {PGL}_2(\mathbb {R})$,

(30)

\begin{equation} \mu(g k_{\theta})=\mu(g)e^{i\theta}. \end{equation}

Proof. We assume $\operatorname {Nr} g>0$ as the claim is trivial for non-positive determinants. We verify the claim by a direction computation. Let $g = \left (\begin{smallmatrix} a & b \\ c & d \end{smallmatrix}\right )$ and $gk_{\theta } = \left (\begin{smallmatrix} a' & b' \\ c' & d' \end{smallmatrix}\right )$. We have

\begin{align*} (b'-c')+i(a'+d') &= \big(a \sin \theta + b \cos \theta-c \cos \theta+d \sin \theta\big)+i\big(a \cos \theta - b \sin \theta + c \sin \theta+ d \cos \theta \big) \\ &=\big((b-c)+i(a+d)\big) \big(\cos \theta-i\sin \theta \big). \end{align*}

Hence, $\mu (gk_{\theta }) = \mu (g) e^{i \theta }$.

Corollary 6.3 For every weight $m\geq 2$ and $k_{\theta _1},k_{\theta _2}\in \mathbf {SO}_2(\mathbb {R})$,

\[ M_\infty^{(m)} (k_{\theta_1}x k_{\theta_2})=e^{i m (\theta_2+\theta_1)} M_\infty^{(m)}(x). \]

Proof. Apply the previous lemma to $M_\infty ^{(m)}(x)=\exp (-2\pi \operatorname {Nr} x) \operatorname {Nr}(x)^{m/2-1} \mu (x)^m$ and use the identity $M_\infty ^{(m)}(x^{\iota })=\overline {M_\infty ^{(m)}(x)}$.

Lemma 6.4 If $m\geq 2$, then

\[ |M_\infty^{(m)}(x)| \ll_m (1+\|x\|)^{-m}. \]

Proof. This is trivial if $\operatorname {Nr} x \le 0$, hence we assume the determinant is positive. Denote $r=\|x\|=\sqrt {\operatorname {Tr}(x^t{x}{})}$. A simple calculation shows that

\begin{align*} |\mu(x)|^{-2}&=\frac{r^2+2\operatorname{Nr}(x)}{4\operatorname{Nr}(x)}, \\ |M_\infty^{(m)}(x)|&=\exp(-2\pi \operatorname{Nr} x)\frac{2^m\operatorname{Nr}(x)^{m-1}}{(r^2+2\operatorname{Nr}(x))^{m/2}}. \end{align*}

If $r\leq 1$, then we write

\[ |M_\infty^{(m)}(x)|\ll_m \exp(-2\pi \operatorname{Nr} x) \operatorname{Nr}(x)^{m/2-1} \ll 1. \]

The last equality holds for all $\operatorname {Nr}(x)>0$ because we have assumed $m\geq 2$. Otherwise, if $r>1$, then

\[ |M_\infty^{(m)}(x)|\leq \exp(-2\pi \operatorname{Nr} x)2^m \operatorname{Nr}(x)^{m-1} r^{-m}\ll_m r^{-m}, \]

where we have used the fact that the real function $\exp (-2\pi t) t^{m-1}$ is bounded for $t \in [0,\infty )$ and $m\geq 1$.

Up until this point, we have established that $M_\infty ^{(m)}$ satisfies the decay condition in Definition 3.7 if $m>4$. We now turn to show that it also belongs to the space $V_{m,2\pi }$ by checking that it solves the quantum harmonic oscillator, see (18).

Lemma 6.5 If $m \ge 6$, then the function $M_\infty ^{(m)}$ from Definition 6.1 belongs to $V_{m,2\pi }$.

Proof. Lemma 6.4 above implies that $M_\infty ^{(m)}\in L^2(B_\infty )$ if $m>4$. To prove $M_\infty ^{(m)}\in V_{m,2\pi }$ we will show that $M_\infty ^{(m)} \perp V_{m',2\pi }$ for all $m'\neq m$. It is enough to establish $\langle M_\infty ^{(m)}, M' \rangle =0$ for any Schwartz solution $M'$ of (18) with quantum number $m'\neq m$ and $\omega =2\pi$.

Define $F(x)= \exp (-2\pi \operatorname {Nr}(x))N(x)$ and $N(x)=\operatorname {Nr}(x)^{m-1}(2i)^m\big ((b-c)+i(a+d)\big )^{-m}$. Then, $F(x)$ is a well-defined continuous function on the open subset $\mathcal {U}=B_\infty \setminus \big \{\!\left (\begin{smallmatrix} a & b \\ b & -a \end{smallmatrix}\right )\colon a,b\in \mathbb {R} \big \}$. Moreover, $M_\infty ^{(m)}=\mathbb {1}_{\operatorname {Nr}(x)>0}\cdot F$. Define $V=\{x\in B_\infty \mid \operatorname {Nr} x\geq 0 \}$ and $V_R=V\cap B(0,R) \setminus B(0,R^{-1})$ for $R>1$, where $B(0,r)$ is a closed ball of radius $r$ centered at the origin. Note that $V_R\subset \mathcal {U}$.

We claim that $F(x)$ solves the PDE (18) on $\mathcal {U}$ with $\omega =2\pi$ and quantum number $m$. The PDE (18) with $\omega =2\pi$, $\varpi =1$ for $F$ is equivalent to the following PDE for $N$:

(31)

\begin{equation} -\Delta N+2\pi \langle x, \nabla \rangle N=2\pi (m-2) N, \end{equation}

where $\nabla$ denotes the gradient operator and the bilinear form $\langle x_1, x_2\rangle$ is the twisted trace form $\operatorname {Tr}(x_1 ^\iota {x}{_2})$ as before. Using the definition of the Laplace operator as the Fourier multiplier with symbol $-4\pi ^2 \operatorname {Nr}$ and the definition of the gradient, we arrive at

\begin{align*} \Delta &= \frac{\partial^2}{\partial a \partial d}-\frac{\partial^2}{\partial b \partial c},\\ \langle x, \nabla \rangle & = a\frac{\partial}{\partial a} + d\frac{\partial}{\partial d} + b\frac{\partial}{\partial b} + c\frac{\partial}{\partial c}. \end{align*}

Substituting the definition $N(x)=(2i)^m({(ad-bc)^{m-1}}/{((b-c)+i(a+d))^m}$ into the formulae above we see that

\begin{align*} \Delta N &= 0,\\ \langle x, \nabla \rangle N &=(m-2) N. \end{align*}

These and (31) show that $F$ is a solution with quantum number $m$. Because the PDE (18) is local, this establishes thatFootnote ¹² $L_{2\pi }[M_\infty ^{(m)}]=2\pi m M_\infty ^{(m)}$.

Fix a Schwartz function $M'\colon B_\infty \to \mathbb {C}$. We want to show now that $\langle L_{2\pi }[M_\infty ^{(m)}],M'\rangle =\langle M_\infty ^{(m)}, L_{2\pi }[M']\rangle$. The equality $\langle \operatorname {Nr}(x) M_\infty ^{(m)},M' \rangle =\langle M_\infty ^{(m)}, \operatorname {Nr}(x) M' \rangle$ is obvious. We need only show

(32)

\begin{equation} \langle\Delta M_\infty^{(m)},M' \rangle =\langle M_\infty^{(m)}, \Delta M' \rangle. \end{equation}

Note that the integrals defining these individual inner products are absolutely convergent because $M'$ and $\Delta M'$ are Schwartz, and $M_\infty ^{(m)}$, $\operatorname {Nr}(x) M_\infty ^{(m)}$, $\Delta M_\infty ^{(m)}$ have at most polynomial growth. To establish (32) we use the equality

\[ \langle \Delta M_\infty^{(m)},M' \rangle= \lim_{R\to\infty} \int_{V_R} \Delta F(x) \overline{M'(x)}\,{d} x, \]

and the analogous formula for $\langle M_\infty ^{(m)},\Delta M' \rangle$. These follow from the dominated convergence theorem. Denote

\[ W=\begin{pmatrix} 0 & 1/2 & 0 & 0 \\ 1/2 & 0 & 0 & 0 \\ 0 & 0 & 0 & -1/2 \\ 0 & 0 & -1/2 & 0\end{pmatrix} \]

and write $\Delta =\nabla \cdot (W \nabla )$ with respect to the coordinates $(a,d,b,c)$. Using the symmetry of the matrix $W$ and the divergence theorem we arrive at

\begin{align*} \int_{V_R} \Delta F(x) \overline{M'(x)}\,{d} x&= \int_{V_R} \nabla \cdot \big(\overline{M'(x)}W \nabla F\big)\,{d} x-\int_{V_R} ( W\nabla F) \cdot (\nabla \overline{M'(x)})\, {d} x\\ &=\int_{V_R} \nabla \cdot \big(\overline{M'(x)}W \nabla F- F W \nabla \overline{M'(x)} \big)\,{d} x +\int_{V_R} F(x)\Delta \overline{M'(x)}\,{d} x\\ &= \int_{\partial V_R} \big(\overline{M'(x)}W \nabla F- F W \nabla \overline{M'(x)} \big) \cdot \hat{n} \,{d} A(x) +\int_{V_R} F(x)\Delta \overline{M'(x)}\,{d} x. \end{align*}

The conditions of the divergence theorem are satisfied because $F$ and $M$ are smooth in an open neighborhood of the closure of $V_R$, which is compact, and the boundary $\partial V_R$ is piecewise smooth. A direct computation, as in the proof of Lemma 6.4, shows that $F$ and $\nabla F$ vanish on the boundary of the cone $V$, except perhaps the origin where they remain bounded. It remains to consider the contributions from the surfaces $S_R=\partial V_R \cap B(0,R)$ and $s_R=\partial V_R\cap B(0,1/R)$. The area of $S_R$ is bounded from above by the area of a $3$-sphere of radius $R$, thus $\operatorname {Area}(S_R)\ll R^3$. On the other hand, because $M'$ is Schwartz and $F, \nabla F$ are bounded on $V$ we have that

\[ \bigg| \int_{S_R} \big(\overline{M'(x)}W \nabla F- F W \nabla \overline{M'(x)} \big) \cdot \hat{n} \,{d} A(x) \bigg| \ll_N R^{-N} R^3 \to_{R\to\infty} 0. \]

Similarly, $\operatorname {Area}(s_R)\ll R^{-3}$ and $\overline {M'(x)}W \nabla F- F W \nabla \overline {M'(x)}$ is bounded on $B(0,1)\cap V$, hence the surface integral over $s_R$ converges to $0$ as $R\to \infty$.

Now let $M'\in V_{m', 2\pi }$ be a Schwartz function and assume $m'\neq m$, then we have

\[ 2\pi m\langle M_\infty ^{(m)}, M' \rangle =\langle L_{2\pi }[M_\infty ^{(m)}], M' \rangle = \langle M_\infty ^{(m)}, L_{2\pi }[M']\rangle =2\pi m' \langle M_\infty ^{(m)}, M' \rangle \]

and we deduce $\langle M_\infty ^{(m)}, M' \rangle =0$ as necessary.

Corollary 6.6 The Bergman test function of weight $m \ge 6$ belongs to $\Omega _\infty$.

Proof. Lemma 6.4 implies that this test function satisfies the decay condition in the definition of $\Omega _\infty$ and Lemma 6.5 above implies that the Bergman test function transforms under $\rho (\mathbf {SO}_2(\mathbb {R}))$ by a character.

7. The spectral expansion

Fix a global Eichler order $\mathcal {R}=\mathcal {R}_1\cap \mathcal {R}_2\subset B$ and a weight $m>2$. We focus henceforth on the test function $M=M_\infty ^{(m)}\cdot \prod _p \mathbb {1}_{\mathcal {R}_p}\in \Omega$. This test function is cuspidal as $M_\infty ^{(m)}(x)=0$ if $\operatorname {Nr} x=0$ and we denote the classical theta series attached to the test function $M$ by Proposition 4.7 by

\[ \vartheta^{(m)}(l,r;s)=\sum_{\xi\in \mathcal{R}} \big(\rho(s)M_\infty^{(m)}\big)(l^{-1}\xi r) \]

for $s\in {\Lambda \backslash \mathbf {SL}_2(\mathbb {R})}$, $l,r\in {\Gamma \backslash \mathbf {G}(\mathbb {R})}$. In this section, we prove the main theorem about the spectral expansion of $\vartheta ^{(m)}$.

Definition 7.1 Denote by $S_m(\Gamma )$ the space of $\Gamma$-modular weight $m$ modular forms on $\mathbb {H}$. Write $S_m(\Gamma )=S_m^\mathrm {old}(\Gamma )\oplus S_m^\mathrm {new}(\Gamma )$ for the direct sum decomposition into new and old forms. The decomposition is orthogonal with respect to the Petersson inner product.

If $f\in S_m(\Gamma )$, we denote by $f^\sharp \colon {\Gamma \backslash \mathbf {G}(\mathbb {R})}\to \mathbb {C}$ the automorphic lift of $f$. Specifically, if $g=\left (\begin{smallmatrix} 1 & x \\ 0 & 1 \end{smallmatrix}\right ) \big (\begin{smallmatrix} y^{1/2} & 0 \\ 0 & y^{-1/2} \end{smallmatrix}\big ) k_\theta$, then $f^\sharp (g)=y^{m/2} e^{im\theta } f(x+iy)$. Following the discussion in § 4.2, we shall also consider $f^\sharp$ as a $K_\mathcal {R}$-invariant function on $[\mathbf {G}(\mathbb {A})]$.

Similarly, we decompose $S_m(\Lambda )=S_m^\mathrm {old}(\Lambda )\oplus S_m^\mathrm {new}(\Lambda )$, and denote by ${f^*}^\sharp \colon {\Lambda \backslash \mathbf {SL}_2(\mathbb {R})}\to \mathbb {C}$ the automorphic lift of $f^*\in S_m(\Lambda )$. Moreover, we shall also consider ${f^*}^\sharp$ as a $U_\mathcal {R}$-invariant function on $[\mathbf {SL}_2(\mathbb {A})]$.

Theorem 7.2 Fix an orthonormal basis $\mathcal {B}_m$ for $S_m(\Gamma )$ of Hecke eigenforms for all Hecke operators $T_n$ with $(n,qD_B)=1$. For two Hecke eigenforms $f,f' \in \mathcal {B}_m$, write $f \sim f'$ if and only if their Hecke eigenvalues agree for $(n,qD_B)=1$. Denote by $(f^\sharp \otimes \overline {f'^\sharp })_M$ the theta lift of $f^\sharp \otimes \overline {f'^\sharp }$. Then,

\[ \vartheta^{(m)}(l,r;s)= \frac{1}{\operatorname{covol}(\Gamma)} \frac{8\pi}{m-1} \sum_{\substack{f, f' \in \mathcal{B}_m \\ f \sim f' } } (f^\sharp \otimes \overline{f'^\sharp})_M(s) \cdot \overline{f^\sharp(l)}f^\sharp(r) \]

for all $s\in {\Lambda \backslash \mathbf {SL}_2(\mathbb {R})}$, $l,r\in {\Gamma \backslash \mathbf {G}(\mathbb {R})}$. Furthermore, $(f^\sharp \otimes \overline {f'^\sharp })_M$ is an automorphic lift of some cusp form $(f \otimes \overline {f'})_M \in S_m(\Lambda )$.

Suppose further that $f \in S_m^{\rm new}(\Gamma )$ is a newform. Let $\lambda _f(\alpha )$ be the $T_\alpha$-eigenvalue of $f^\sharp$, then $f_M^\sharp = (f^\sharp \otimes \overline {f^\sharp })_M$ is the automorphic lift of a cusp form $f_M\in S_m(\Lambda )$ with Fourier expansion

\[ f_M(\zeta)= \sum_{n>0} n^{m/2-1} \lambda_f(n) \exp(2\pi i n \zeta). \]

Remark 7.3 The operator $T_1$ acts as the identity on $K_\mathcal {R}$-invariant functions on $[\mathbf {G}(\mathbb {A})]$, i.e. functions on ${\Gamma \backslash \mathbf {G}(\mathbb {R})}$. Hence, $\lambda _f(1)=1$ and the theta lift $f_M$ is an arithmetically normalized cusp form.

The case where ${\Gamma \backslash \mathbf {G}(\mathbb {R})}= {\mathbf {SL}_2(\mathbb {Z})\backslash \mathbf {SL}_2(\mathbb {R})}$ is already contained in [Reference ZagierZag77, § 2, Proposition 1], see also (14). For the general case, the proof will bootstrap from the fact that the convolution operator $\star _{\mathbf {G}(\mathbb {R})} \mu ^m$ acting on ${\Gamma \backslash \mathbf {G}(\mathbb {R})}$ coincides with the Bergman kernel on ${\Gamma \backslash \mathbb {H}}$. This dates back at least to Selberg [Reference SelbergSel56], though geometric expressions for the Bergman kernel in terms of Poincaré series were already known to Petersson [Reference PeterssonPet40, Reference PeterssonPet41]. The particular expression for the Bergman kernel suitable for our needs may be found in [Reference ZagierZag76] and [Reference ZagierZag77, § 2, Proposition 1], [Reference SteinerSte16, Theorem 3], or [Reference Das and SenguptaDS15, § 2.3]. The first three references each contain the split case and the latter the non-split case. There does, however, appear to be an error in the constant in [Reference Das and SenguptaDS15]. Compare with the computation in [Reference ZagierZag76, Reference SteinerSte16], whose proofs also apply to co-compact lattices. The statement is as follows.

Proposition 7.4 Set

\[ k^{(m)}(l,r)\colon=\sum_{\gamma\in \Gamma} \mu^m(l^{-1}\gamma r). \]

The function $k^{(m)}$ is the kernel of the convolution operator $\star \mu ^m$ acting on $L^2({\Gamma \backslash \mathbf {G}(\mathbb {R})})$, where the convolution takes place in $\mathbf {G}(\mathbb {R})$. Fix an orthonormal basis $\mathcal {B}_m$ for $S_m(\Gamma )$. Then, for all $l,r\in \mathbf {G}(\mathbb {R})$,

\[ k^{(m)}(l,r)=\frac{1}{\operatorname{covol}(\Gamma)} \frac{8\pi}{m-1} \sum_{f \in \mathcal{B}_m} \overline{f^\sharp(l)}f^\sharp(r). \]

In particular, the operator $\star \mu ^{m}$ annihilates all the continuous, residual, and cuspidal spectrum, whose archimedean component is not discrete series.

Proof Proof of Theorem 7.2

Let $\zeta =\sigma +i\tau \in \mathbb {H}$ and fix $s=\left (\begin{smallmatrix} 1 & \sigma \\ 0 & 1 \end{smallmatrix}\right ) \big (\begin{smallmatrix} \tau ^{1/2} & 0 \\ 0 & \tau ^{-1/2} \end{smallmatrix}\big ) k_\theta \in {\mathbf {SL}_2(\mathbb {R})}$. The definition of the Weil action, Definition 6.1, and Lemmata 3.5 and 6.5 imply for $n>0$, $g\in \mathbf {G}(\mathbb {R})$

(33)

\begin{equation} \big(\rho(s)M_\infty^{(m)}\big)(\sqrt{n}g)= \tau^{m/2} n^{m/2-1} \exp(2\pi i n \zeta+i m \theta)\mu^{m}(g). \end{equation}

We will establish that $\vartheta ^{(m)}(l,r;s)$ coincides with the spectral expansion in the claim by showing equality in $L^2({\Gamma \backslash \mathbf {G}(\mathbb {R})}\times {\Gamma \backslash \mathbf {G}(\mathbb {R})})$. Pointwise identity then follows because both sides are continuous.

The Bergmann test function $M_\infty ^{(m)}$ vanishes on the null-cone $\{x\in B_\infty \colon \operatorname {Nr} x=0\}$, thus it follows from Corollary 3.6 that $M$ is cuspidal. For any $\varphi,\varphi '\in L^\infty ({\Gamma \backslash \mathbf {G}(\mathbb {R})}) \subset L^2({\Gamma \backslash \mathbf {G}(\mathbb {R})})$, we can use Proposition 5.8, (27), and (33) to write the Fourier expansion

(34)

\begin{align} (\varphi \otimes \overline{\varphi'})_M(s) &= \int_{[\mathbf{G}(\mathbb{A})]} \int_{[\mathbf{G}(\mathbb{A})]} \Theta_M(l,r;s) \varphi(l) \overline{\varphi'(r)} \,{d} l \,{d} r \nonumber\\ & = \sum_{n>0} \big\langle (T_n^{M_f}\varphi) \star \big(\big(\rho(s)M_\infty\big)(\sqrt{n}\cdot\bullet)\big), \varphi' \big\rangle \nonumber\\ &=\sum_{n>0} \tau^{m/2} n^{m/2-1} \exp(2\pi i n \zeta+i m \theta) \big\langle (T_n^{M_f}\varphi) \star \mu^m, \varphi' \big\rangle. \end{align}

Because $T_n^{M_f}$ is a convolution operator, the maps $\varphi \mapsto T_n^{M_f}\varphi$, $\varphi \mapsto (T_n^{M_f}\varphi ) \star \mu ^m$ preserve the continuous and the discrete spectra. Proposition 7.4 then implies that (34) vanishes whenever $\varphi$ or $\varphi '$ is a bounded function in the continuous spectrum. Using pseudo-Eisenstein series we can construct a dense set of bounded vectors in the continuous spectrum of $L^2([\mathbf {G}(\mathbb {A})])$, hence $\Theta _M(l,r;s)\in L^2_\mathrm {discrete}([\mathbf {G}(\mathbb {A})]\times [\mathbf {G}(\mathbb {A})])$. Moreover, $\Theta _M(l,r;s)$ is $K_{\mathcal {R}}\times K_{\mathcal {R}}$-invariant. There is an orthonormal basis of $L^2_\mathrm {discrete}([\mathbf {G}(\mathbb {A})])^{K_{\mathcal {R}}}$ consisting of bounded Hecke eigenforms for all Hecke operators $T_n$ with $(n,qD_B)=1$.

Furthermore, $\varphi \mapsto T_n^{M_f}\varphi$, $\varphi \mapsto (T_n^{M_f}\varphi ) \star \mu ^m$ preserve adelic automorphic representations. Hence, Proposition 7.4 implies that (34) vanishes unless both $\varphi$ and $\varphi '$ are lifts of weight $m$ modular forms with the same Hecke eigenvalues for $(n,qD_B)=1$. The claimed spectral expansion follows as the automorphic lifts of $\mathcal {B}_m(\Gamma )\times \mathcal {B}_m(\Gamma )$ can be completed to an orthogonal basis of $L^2_\mathrm {discrete}([\mathbf {G}(\mathbb {A})]\times [\mathbf {G}(\mathbb {A})])$ consisting of bounded Hecke eigenforms for all Hecke operators $T_n$ with $(n,qD_B)=1$. The Whittaker expansion (34) further shows that the Whittaker function of $(\varphi \otimes \overline {\varphi '})_M$ at infinity agrees with the Whittaker functions of a weight $m$ holomorphic form, is furthermore clear, following the discussion in § 4.2, that it is an automorphic lift of a holomorphic form in $S_m(\Lambda )$.

If $\varphi$ is a bounded Hecke newform with eigenvalues $\lambda (\bullet )$, we can use Corollary 5.15 to rewrite (34) as

(35)

\begin{equation} \int_{[\mathbf{G}(\mathbb{A})]} \int_{[\mathbf{G}(\mathbb{A})]} \Theta_M(l,r;s) \varphi(l) \overline{\varphi'(r)} \,{d} l \,{d} r=\sum_{n>0} \lambda(n) \tau^{m/2} n^{m/2-1} \exp(2\pi i n \zeta+i m \theta)\big\langle \varphi \star \mu^m, \varphi' \big\rangle . \end{equation}

The formula for the Fourier–Whittaker expansion of $f_M$ follows from (35) with $\varphi =\varphi '=f^\sharp$.

A careful local analysis, following Shimizu [Reference ShimizuShi72] shows that if $f$ is a newform, then $f_M(s)$ is the unique arithmetically normalized new modular form in the Jacquet–Langlands transfer of the automorphic representation generated by $\widetilde {f^\sharp }$ (to be defined momentarily). We will need only a weaker result. To discuss the Jacquet–Langlands transfer we need to lift functions from $[\mathbf {SL}_2(\mathbb {A})]$ to $[\mathbf {PGL}_2(\mathbb {A})]$. Define $\widetilde {U}_{\mathcal {R}}$ to be the image of $\big \{g \in \big (\begin{smallmatrix} \widehat {\mathbb {Z}} & \widehat {\mathbb {Z}} \\ q D_B\widehat {\mathbb {Z}} & \widehat {\mathbb {Z}} \end{smallmatrix}\big ) \colon \operatorname {det} g\in \widehat {\mathbb {Z}}^\times \big \}$ in $\mathbf {PGL}_2(\mathbb {A}_f)$. Then, $\widetilde {U}_{\mathcal {R}}$ is a compact open subgroup, and an argument identical to Proposition 5.10 proves that

\[ {\mathbf{SL}_2(\mathbb{Q})\backslash ^{\mathbf{SL}_2(\mathbb{A})}}/_{U_{\mathcal{R}}}\to {\mathbf{PGL}_2(\mathbb{Q})\backslash^{\mathbf{PGL}_2(\mathbb{A})}}/_{{\widetilde{U}_{\mathcal{R}}}} \]

is a measure-preserving bijection. Hence, we have a unique lift $\varphi \mapsto \widetilde {\varphi }$ from $L^2([\mathbf {SL}_2(\mathbb {A})])^{U_{\mathcal {R}}}$ to $L^2([\mathbf {PGL}_2(\mathbb {A})])^{\widetilde {U}_{\mathcal {R}}}$.

Definition 7.5 Let $f\in S_m(\Gamma )$ be a Hecke eigenform for all Hecke operators $T_n$ with $(n,qD_B)=1$. If $f$ is a newform denote by $f_\mathrm {JL}\in S_m^\mathrm {new}(\Lambda )$ the unique arithmetically normalized holomorphic newform such that $\widetilde {f_\mathrm {JL}^\sharp }$ belongs to the Jacquet–Langlands transfer of the automorphic representation generated by $\widetilde {f^\sharp }$. That such a vector exists and is unique follows from [Reference Jacquet and LanglandsJL70, Reference ShimizuShi72]. If $f$ is an oldform, then it factors through a newform with respect to a lattice arising from an Eichler order $\mathcal {R}' \supsetneq \mathcal {R}$ with level $q'\mid q$. In this case, we denote by $f_\mathrm {JL}$ the lift of the Jacquet–Langlands transfer, defined as above, from $S_m^\mathrm {new}(\Gamma _0(q'D_B))$ to $S_m^\mathrm {old}(\Lambda =\Gamma _0(q D_B))$.

In both cases, the modular form $f_{\mathrm {JL}}$ is an eigenform of all the classical Hecke operators corresponding to $n$ co-prime to $q D_B$, and its $n$-Fourier coefficient coincides with the $T_n$ Hecke eigenvalue of $f^\sharp$ if $\gcd (n, q D_B)=1$.

Lemma 7.6 Let $f\in S_m(\Gamma )$ be a Hecke eigenform for all Hecke operators $T_n$ with $(n,qD_B)=1$. If $f$ is a newform, then the orthogonal projection of $f_M$ onto $S_m^\mathrm {new}(\Lambda )$ is equal to $f_\mathrm {JL}$. If $f \sim f'$ are oldforms, then $(f \otimes \overline {f'})_M$ is an oldform as well.

Proof. Theorem 7.2 implies that the Fourier coefficients of $f_M$ and $f_{\mathrm {JL}}$ coincide for all $n$ co-prime to $q D_B$, which is the level of ${\Lambda \backslash \mathbb {H}}$. Theorem 1 of [Reference Atkin and LehnerAL70] then implies that $f_M-f_{\mathrm {JL}}$ is an oldform. Because $f_\mathrm {JL}$ is a newform if $f$ is, the claim holds for newforms. If $f \sim f'$ are oldforms, then so is $f_\mathrm {JL}=f'_{\mathrm {JL}}$. Hence, in this case $(f \otimes \overline {f'})_M$ is a sum of oldforms and is an oldform by itself.

Corollary 7.7 Let $l,r\in \mathbf {G}(\mathbb {R})$ and set $z_1=l.i$, $z_2=r.i\in \mathbb {H}$. Fix an orthonormal basis $\mathcal {B}_m^\mathrm {new}$ of Hecke eigenforms for $S_m^\mathrm {new}(\Gamma )$. Then,

\begin{align*} &\frac{1}{\operatorname{covol}(\Lambda)}\|\vartheta^{(m)}(l,r;\bullet)\|_{L^2({\Lambda \backslash \mathbf{SL}_2(\mathbb{R})})}^2\\ &\quad \geq \bigg(\frac{1}{\operatorname{covol}(\Gamma)}\frac{8\pi}{m-1}\bigg)^2 (\Im (z_1) \Im (z_2))^{m}\sum_{f \in \mathcal{B}_m^\mathrm{new}} \frac{\|f_{\mathrm{JL}}\|_2^2}{\operatorname{covol}(\Lambda)} |f(z_1)|^2 |f(z_2)|^2\\ &\quad \gg_\varepsilon \frac{1}{\operatorname{covol}(\Gamma)^2} (mqD_B)^{-\varepsilon }\frac{\Gamma(m)}{(4\pi)^m m^2} (\Im (z_1) \Im (z_2))^{m}\sum_{f \in \mathcal{B}_m^\mathrm{new}} |f(z_1)|^2 |f(z_2)|^2. \end{align*}

Proof. Define $\vartheta _\sharp (\zeta )=\tau ^{-m/2}\vartheta ^{(m)}(l,r;s)$, where $\zeta =\sigma +i\tau \in \mathbb {H}$ and $s=\left (\begin{smallmatrix} 1 & \sigma \\ 0 & 1 \end{smallmatrix}\right ) \big (\begin{smallmatrix} \tau ^{1/2} & 0 \\ 0 & \tau ^{-1/2} \end{smallmatrix}\big )$. Because $\vartheta ^{(m)}(l,r;\bullet )$ is $K_\infty$-isotypic we have ${\vartheta _\sharp }^\sharp (s)=\vartheta ^{(m)}(l,r;s)$. Theorem 7.2 and Lemma 7.6 above imply that the orthogonal project of $\vartheta _\sharp$ onto $S_m^\mathrm {new}(\Lambda )$ is equal to

(36)

\begin{equation} \vartheta_\sharp^\mathrm{new}(\zeta)=\frac{1}{\operatorname{covol}(\Gamma)}\frac{8\pi}{m-1} \sum_{f \in \mathcal{B}_m^\mathrm{new}} f_\mathrm{JL}(\zeta)\overline{f^\sharp(l)}f^\sharp(r). \end{equation}

Because oldforms are orthogonal to newforms $\|\vartheta ^{(m)}(l,r;\bullet )\|_{L^2( {\Lambda \backslash \mathbf {SL}_2(\mathbb {R})})}^2\geq \|\vartheta _\sharp ^\mathrm {new}\|_2^2$. The first claimed inequality follows from (36) and the orthogonality relations of Hecke eigenforms. The second inequality follows from the Hoffstein and Lockhart [Reference Hoffstein and LockhartHL94] bound on the $L^2$-norm of an arithmetically normalized holomorphic Hecke newform $f$ of level $N$

\[ \|f\|_2^2\gg_\varepsilon \frac{\Gamma(m)}{(4\pi)^m} (mN)^{-\varepsilon}. \]

This bound holds when the Petersson inner product is normalized with respect to the probability measure on ${\Lambda \backslash \mathbf {SL}_2(\mathbb {R})}$.

At this point, we shall note that we have also proven Theorem 2.1. Indeed, if we lift adelically $f \in S_m(\Gamma )$ to $f^{\sharp }$, then we find

\[ \Im(z)^{{m/2}} \Im(\zeta)^{{m/2}} \mathcal{F}_{f}(z;\zeta) = c\sum_{n>0} T_{n}^{M_f}f^{\sharp} \star \big(\rho(s_\infty).M_\infty(\sqrt{n} \cdot \bullet)\big)(r_{\infty}) = c\int_{[\mathbf{G}(\mathbb{A})]} \Theta_M(l,r;s) f^{\sharp}(l) \,{d} l \]

by (34) and Propositions 5.14 and 7.4, where $c=\operatorname {covol}(\Gamma )(m-1)/(8\pi )$ and $r_\infty =\left (\begin{smallmatrix} 1 & x \\ & 1 \end{smallmatrix}\right ) \big (\begin{smallmatrix} y^{1/2} & \\ & y^{-1/2}\end{smallmatrix}\big )$ and $r_p$ being the identity for all finite places, and similarly for $s$ (with respect to $\zeta$). Thus, $\mathcal {F}_{f}(z;\zeta )$ is the classical holomorphic modular form associated to $\int \Theta _M(l,r;s)f^{\sharp }(l) \,{d} l$, from which the theorem follows.

8. The geometric expansion

We have now established in Corollary 7.7 a lower bound on $\|\vartheta ^{(m)}(l,r;\bullet )\|_{L^2( {\Lambda \backslash \mathbf {SL}_2(\mathbb {R})})}^2$ in terms of a fourth moment of Hecke eigenforms of weight $m$. In this section, our goal is to establish an upper bound in terms of a count of quaternions by norm. In the next section, we will establish a sharp upper-bound for this count.

Definition 8.1 For $g\in \mathbf {PGL}_2(\mathbb {R})$, define

\[ u(g)=\frac{\operatorname{Tr} (g^t{g}{})-2|\operatorname{det} g|}{4|\operatorname{det} g|}. \]

Specifically, for $g=\left (\begin{smallmatrix} a & b \\ c & d \end{smallmatrix}\right )$

\[ 4 u(g)=\frac{a^2+b^2+c^2+d^2}{|ad-bc|}-2. \]

Using the fixed isomorphism $B\otimes \mathbb {R}\simeq \operatorname {Mat}_{2\times 2}(\mathbb {R})$, we extend the function $u$ to the group $(B\otimes \mathbb {R})^\times$.

Lemma 8.2 For all $g\in \mathbf {PGL}_2(\mathbb {R})$ with $\operatorname {det} g>0$,

\[ |\mu(g)|^2=(1+u(g))^{-1}. \]

Proof. Fix $g=\left (\begin{smallmatrix} a & b \\ c & d \end{smallmatrix}\right )$ with $\operatorname {det} g >0$. We deduce from Definition 6.1 that

\[ |\mu(g)|^{-2}=\frac{(b-c)^2+(a+d)^2}{4\operatorname{det} g} =\frac{a^2+b^2+c^2+d^2+2\operatorname{det} g}{4\operatorname{det} g}=1+u(g) .\]

Proposition 8.3 We have

\begin{align*} \frac{1}{\operatorname{covol}(\Lambda)}\|\vartheta^{(m)}(l,r;\bullet)\|_{L^2({\Lambda \backslash \mathbf{SL}_2(\mathbb{R})})}^2 &\ll_\varepsilon (qD_B)^{1+\varepsilon} \frac{\Gamma(m-1)}{(4\pi)^m}\sum_{n>0} \frac{1}{n}\bigg( \sum_{\substack{\xi\in \mathcal{R} \\ \operatorname{Nr} \xi =n}} \big(1+u(l^{-1}\xi r)\big)^{-m/2}\bigg)^2\\ &\quad \cdot\begin{cases} 1, & n\le(qD_B)^2 m,\\ \exp(-n/(qD_B)^2), & n>(q D_B)^2 m. \end{cases} \end{align*}

Proof. We first apply Proposition 4.9 to $\vartheta ^{(m)}$ and use the fact that our choice of $M_\infty =M_\infty ^{(m)}$ is $K$-isotypical and transforms simply under the diagonal group:

\begin{align*} &\frac{1}{\operatorname{covol}(\Lambda)}\int_{{\Lambda}\backslash{\mathbf{SL}_2(\mathbb{R})}} |\vartheta^{(m)}(l, r;s)|^2 \,{d} s\\ &\quad \leq (q D_B)^{-1}\sum_{a\mid q D_B} \sum_{\substack{\alpha\in\mathbb{Q}\\\alpha>0}} \int_{\sqrt{3}/2}^\infty \tau^{m}\alpha^{m-2}\exp(-4\pi \alpha \tau) \Bigg( \sum_{\substack{\xi\in\widehat{\mathcal{R}}^{(a)}\\\operatorname{Nr} \xi=\alpha}}\big| \mu^m(l^{-1} \xi r) \big| \Bigg)^2 \frac{{d} \tau}{\tau^2}. \end{align*}

We bound the integral over $\tau$, which is equivalent to the definition of the partial gamma function, in two ways. Write first

\[ \int_{\sqrt{3}/2}^\infty \tau^{m-2} \alpha^{m-2} \exp(-4\pi\alpha \tau) \,{d} \tau= \frac{1}{\alpha} \int_{\sqrt{3}/2 \cdot \alpha}^\infty x^{m-2} \exp(-4\pi x) \,{d} x. \]

For $\alpha \leq m$, we complete the integral to deduce

\[ \int_{\sqrt{3}/2 \cdot \alpha}^{\infty} x^{m-2} \exp(-4\pi x) \,{d} x \leq (4\pi)^{-(m-1)}\Gamma(m-1). \]

For $\alpha >m$, we argue

\begin{align*} \int_{\sqrt{3}/2 \cdot \alpha}^{\infty} x^{m-2} \exp(-4\pi x) \,{d} x &\leq \exp(-2\pi \sqrt{3}/2 \cdot \alpha) 2^{m-2} \int_{\sqrt{3}/2 \cdot \alpha}^{\infty} (x/2)^{m-2} \exp(-4\pi (x/2)) \,{d} x\\ &\leq \exp(-2\pi \sqrt{3}/2 \cdot \alpha) 2^{m-1} (4\pi)^{-(m-1)} \Gamma(m-1)\\ &\ll \exp\big(-\big(2\pi \sqrt{3}/2-\log 2\big) \cdot \alpha\big) (4\pi)^{-(m-1)} \Gamma(m-1)\\ &\leq \exp(-\alpha) (4\pi)^{-(m-1)} \Gamma(m-1). \end{align*}

We thus arrive at

\begin{align*} \int_{\sqrt{3}/2}^\infty \tau^{m}\alpha^{m-2}\exp(-4\pi \alpha \tau) \Bigg( \sum_{\substack{\xi\in\widehat{\mathcal{R}}^{(a)}\\\operatorname{Nr} \xi=\alpha}}\!\big| \mu^m(l^{-1} \xi r) \big| \Bigg)^2 \frac{{d} \tau}{\tau^2} &\ll\! (4\pi)^{-m}\Gamma(m\!-\!1) \frac{1}{\alpha} \Bigg(\sum_{\substack{\xi\in\widehat{\mathcal{R}}^{(a)}\\\operatorname{Nr} \xi=\alpha}}\!|\mu^{m} (l^{-1}\xi r)|\!\Bigg)^2\\ &\quad \cdot \begin{cases} 1, & \alpha\leq m,\\ \exp(-\alpha), & \alpha>m. \end{cases} \end{align*}

Note that $\widehat {\mathcal {R}}^{(a)}\subset (qD_B)^{-1}\mathcal {R}$ and

\[ \Bigg(\sum_{\substack{\xi\in\widehat{\mathcal{R}}^{(a)}\\\operatorname{Nr} \xi=\alpha}}|\mu^{m}(l^{-1}\xi r)|\Bigg)^2 \leq \Bigg(\sum_{\substack{\xi\in \mathcal{R}\\ \operatorname{Nr} \xi =(qD_B)^2\alpha}}|\mu^{m}(l^{-1}\xi r)|\Bigg)^2 . \]

The claimed bound follows from combining these inequalities with the divisor bound and Lemma 8.2 above.

Definition 8.4 For any $g\in \mathbf {G}(\mathbb {R})$, $n\in \mathbb {N}$, and $\delta >0$, set

\[ M(g,n;\delta)\colon= \big\{ \xi\in\mathcal{R} \mid \operatorname{Nr} \xi=n, u(g^{-1}\xi g) < \delta \big\}. \]

Corollary 8.5 If $m>2$, then

\begin{align*} &\frac{1}{\operatorname{covol}(\Lambda)}\|\vartheta^{(m)}(g,g;\bullet)\|_{L^2({\Lambda \backslash \mathbf{SL}_2(\mathbb{R})})}^2\\ &\quad\ll_\varepsilon (qD_B)^{1+\varepsilon} \frac{\Gamma(m-1)}{(4\pi)^m} \bigg\{\frac{m}{2}\int_0^\infty \bigg[\sum_{n=1}^{(qD_B)^2m} \frac{1}{n}M(g,n;\delta)^2\bigg]^{1/2}\\ &\qquad +\bigg[\sum_{n>(qD_B)^2m}^\infty \frac{\exp(-n/(qD_B)^2)}{n} M(g,n;\delta)^2 \bigg]^{1/2} \frac{{d}\delta}{(1+\delta)^{m/2+1}}\bigg\}^2. \end{align*}

Proof. Integration by parts for Riemann–Stieltjes integrals implies

(37)

\begin{align} \sum_{\substack{\xi\in \mathcal{R} \nonumber\\ \operatorname{Nr} \xi =n}} \big(1+u(g^{-1}\xi g)\big)^{-m/2} &=\int_{0}^{\infty} (1+\delta)^{-m/2}\,{d} M(g,n;\delta) \nonumber\\ &=\lim_{\delta\to\infty}M(g,n;\delta)(1+\delta)^{-m/2} +\frac{m}{2}\int_0^\infty (1+\delta)^{-m/2-1} M(g,n;\delta) \,{d} \delta. \end{align}

The left-hand side is exactly the expression we need to bound in Proposition 8.3. Iwaniec and Sarnak in [Reference Iwaniec and SarnakIS95, Lemma 1.3 & Appendix 1] establish the bound

(38)

\begin{equation} M(g,n;\delta)\ll_{\varepsilon,q,D_B,g} n^{\varepsilon}+(\delta+\delta^{1/4})n^{1+\varepsilon}. \end{equation}

Thus, the first term in (37) vanishes for $m>2$. Set $w_n=1/n$ if $n\leq (qD_B)^2m$ and $w_n=\exp (-n/(qD_B)^2)/n$ otherwise. We apply Minkowski's integral inequality to deduce

\[ \sum_{n=1}^{\infty} w_n \Bigg|\sum_{\substack{\xi\in\mathcal{R} \\ \operatorname{Nr} \xi=n}} (1+u(g^{-1}\xi g))^{-m/2}\Bigg|^2 \leq \Bigg(\frac{m}{2}\int_0^\infty \sqrt{\sum_{n=1}^\infty w_n M(g,n;\delta)^2} \frac{{d}\delta}{(1+\delta)^{m/2+1}}\Bigg)^2. \]

The claim follows by splitting the sum into two ranges: $1\leq n \leq (qD_B)^2m$ and $n>(qD_B)^2m$ and using the $l^2$–$l^1$ inequality.

9. Second-moment count of quaternions by norm

In this section, we prove our main results about the second-moment count of quaternions by norm in a small ball. This bound in combination with the results of the previous sections will lead to the proof of Theorem 1.1. To bound $\sum _{n=1}^N M(g,n;\delta )^2$ we can assume henceforth without loss of generality that $\mathcal {R}$ is a maximal order, otherwise we can replace the Eichler order $\mathcal {R}=\mathcal {R}_1\cap \mathcal {R}_2$ by $\mathcal {R}_1$ and the second-moment sum will only increase.

We shall deal separately with the split case $\mathbf {G}=\mathbf {SL}_2$ and the case of anisotropic $\mathbf {G}$. The proof in both cases is very similar except that we need to track the dependence on $g$ differently. While in the split case we shall work with the Iwasawa decomposition of $g$, in the anisotropic case we will use an adapted Cartan decomposition of $g$.

9.1 Second-moment bound for the split matrix algebra

In this section, we fix $\mathbf {G}=\mathbf {SL}_2$, i.e. $B=\operatorname {Mat}_{2\times 2}(\mathbb {Q})$ and $\mathcal {R} = \operatorname {Mat}_{2\times 2}(\mathbb {Z})$. If we write in coordinates $\xi =\left (\begin{smallmatrix} a & b \\c & d \end{smallmatrix}\right )$, then the inequalities $u(\xi )<\delta$, $0<\operatorname {det} \xi < N$ imply

(39)

\begin{align} a^2+b^2+c^2+d^2&< N(4\delta+2), \end{align}

(40)

\begin{align} (a-d)^2+(b+c)^2&< 4N \delta. \end{align}

For $g\in \mathbf {G}(\mathbb {R})$, write $g=n a k$ with $k\in \mathbf {SO}_2(\mathbb {R})$ and

\begin{align*} n&=\begin{pmatrix} 1 & x \\ 0 & 1 \end{pmatrix},\\ a&=\begin{pmatrix} y^{1/2} & 0 \\ 0 & y^{-1/2} \end{pmatrix}. \end{align*}

This is the standard Iwasawa decomposition of $g$.

Proposition 9.1 Let $g\in \mathbf {SL}_2(\mathbb {R})$ and write $g.i=x+iy$. Assume $|x| \le C$ and $y \ge A>0$. Then,

\[ \sum_{n=1}^N M(g,n;\delta)^2\ll_{A,C,\varepsilon} N^{3+\varepsilon}\delta^2+N^{1+\varepsilon} +N^{1/2+\varepsilon}\min(N^{1/2},(N\delta)^{1/2}+1)(y^2N\delta+1) . \]

Remark 9.2 In the end, we may restrict ourselves to $g$ in a fundamental domain for $\mathbf {SL}_2(\mathbb {Z})$ and, hence, the restrictions on $x,y$ will be satisfied.

Using the inequality of geometric and arithmetic means we can split the second-moment count into two cases. The first is when both matrices are upper triangular and the second is when neither is. We now prepare some preliminary results needed in the proof of Proposition 9.1.

Lemma 9.3 Denote by $\mathfrak {U}\subset B_\infty$ the subset of upper triangular matrices. Then,

\begin{align*} \#\Big\{(\xi_1,\xi_2) \in \big(\operatorname{Mat}_{2\times 2}(\mathbb{Z})\cap \mathfrak{U}\big)^{2}&\colon u(g^{-1}\xi_1g),u(g^{-1}\xi_2g)<\delta,\ 0<\operatorname{det} \xi_1=\operatorname{det}\xi_2 < N \Big\}\\ &\ll_\varepsilon N^{1/2+\varepsilon}\min(N^{1/2},(N\delta)^{1/2}+1)(y^2N\delta+1). \end{align*}

Proof. Write $\xi _i= \big (\begin{smallmatrix} a_i & b_i \\ 0 & d_i \end{smallmatrix}\big )$. For upper triangular matrices we rewrite (40) for $g^{-1}\xi _ig$ as

\[ (a_i-d_i)^2+\frac{\big(b_i+x(a_i-d_i)\big)^2}{y^2}<4 N \delta. \]

Hence, we have at most $\ll (N\delta )^{1/2}+1$ choices for $|a_1-d_1|$. In addition, the condition $0<4\operatorname {det} \xi _1=(a_1+d_1)^2-(a_1-d_1)^2<4N$ implies $0<|a_1+d_1|-|a_1-d_1|\ll N^{1/2}$. We deduce that there are at most $\ll ((N\delta )^{1/2}+1)N^{1/2}$ possibilities for $(|a_1+d_1|,|a_1-d_1|)$ and a similar statement holds for $(a_1,d_1)$.

On the other hand, $0< a_1 d_1< N$ and the divisor bound implies that the number of possible pairs $(a_1,d_1)$ is also bounded by $\ll _\varepsilon N^{1+\varepsilon }$. The number of possibilities for $b_1$ is now bounded above by $y(N\delta )^{1/2}+1$. Thus, there are at most $\ll _\varepsilon N^{1/2+\varepsilon }\min \big (N^{1/2},(N\delta )^{1/2}+1\big )(y(N\delta )^{1/2}+1)$ possibilities for $\xi _1$.

Once $\xi _1$ is fixed the condition $\operatorname {det}\xi _1=\operatorname {det}\xi _2>0$ fixes $a_2 d_2$ and the divisor bound restricts the number of possible pairs $(a_2,d_2)$ to $\ll _\varepsilon N^{\varepsilon }$. Finally, the number of possible $b_2$ after fixing $(a_2,d_2)$ is at most $\ll y(N\delta )^{1/2}+1$.

We continue to analyze the case when neither matrix is upper triangular. We will use the direct sum decomposition $B_\infty = \mathbb {R}\operatorname {Id} + B_\infty ^0$. This decomposition is preserved by the conjugation action. We denote by $\xi ^0=\xi -\frac {1}{2}\operatorname {Tr}(\xi )$ the traceless part of $\xi \in B_\infty$. In coordinates we write

\[ \xi^0=\begin{pmatrix} e & b \\ c & -e \end{pmatrix}, \]

where $e={(a-d)/{2}}$. If $\xi$ satisfies (39) and (40), then $\xi ^0$ satisfies $2 e^2+b^2+c^2< N(4\delta +2)$. This leads us to define

\begin{align*} B_\infty^0\supset X&\colon=\bigg\{\xi^0=\begin{pmatrix} e & b \\ c & -e \end{pmatrix} \colon 4e^2+(b+c)^2<4 N \delta,\; 2e^2+b^2+c^2< N(4\delta+2) \bigg\},\\ B_\infty^0\supset X^g&\colon- g X g^{-1}. \end{align*}

The set $X$ is invariant under conjugation by $K$ and using the Iwasawa decomposition we can write the conditions for $\left (\begin{smallmatrix} e & b \\ c & -e \end{smallmatrix}\right ) \in X^g$ explicitly as

(41)

\begin{align} 2(e-xc)^2+\frac{(b+2xe-x^2c)^2}{y^2}+y^2c^2&< N(4\delta+2), \end{align}

(42)

\begin{align} 4(e-xc)^2+\frac{\big(b+2x(e-xc)+(x^2+y^2)c\big)^2}{y^2}&<4N\delta. \end{align}

Lemma 9.4 Assume $y\geq A>0$ and $0<\delta \le 1$, then

\[ \#\big(\tfrac{1}{2}\operatorname{Mat}_{2\times 2}(\mathbb{Z})^0 \cap X^g\setminus\mathfrak{U}\big)\ll N^{3/2}\delta+N^{1/2}. \]

Proof. From (41), we learn that there are $\ll y^{-1}N^{1/2}(1+\delta )$ options for $c$ ($c\neq 0$ because the matrices are not upper triangular). For any fixed $c$, (42) describes an ellipse in the $e,b$ plane with radii $\ll \sqrt {N\delta }, y\sqrt {N\delta }$. Hence, the number of possibilities for $(e,b)\in \frac {1}{2}\mathbb {Z}\times \frac {1}{2}\mathbb {Z}$ is bounded from above by

\[ \ll y N\delta +(N\delta)^{1/2}\max(y,1)+1\ll_A y \big(N\delta + (N\delta)^{1/2}+1\big). \]

Multiplying this by the bound for possible values of $c$ and the inequality $N\delta ^{1/2}<\max (N^{3/2}\delta,N^{1/2})$ establish the claim.

Lemma 9.5 Assume $y \ge A>0$, $|x|< C$, and $0<\delta \le 1$. Then,

\[ \#\big\{(\xi_1^0,\xi_2^0)\in \big(\tfrac{1}{2}\operatorname{Mat}_{2\times 2}(\mathbb{Z})^0 \cap X^g\setminus\mathfrak{U}\big)^{2} \colon \operatorname{det} \xi_1^0=\operatorname{det} \xi_2^0 \big\}\ll_{A,C} N^{5/2}\delta^2+N^{1/2}. \]

Proof. Note that the number of possible elements $\xi _1^0$ is bounded by Lemma 9.4 above. We fix henceforth $\xi _1^0$ as in the claim and count the number of possible $\xi _2^0\not \in \mathfrak {U}$ with $\operatorname {det}\xi _2^0=\operatorname {det}\xi _1^0$. Denote

\[ g^{-1}\xi_i^0 g=\begin{pmatrix} \tilde{e}_i & \tilde{b}_i \\ \tilde{c}_i & -\tilde{e}_i \end{pmatrix}. \]

We now rewrite (40) for $\xi _i^0$ as

(43)

\begin{equation} 4(e_i-c_ix)^2+\big((\tilde{b}_i-\tilde{c}_i)+2yc_i\big)^2<4N\delta. \end{equation}

Then, $(\tilde {b}_1-\tilde {c}_1)$ is restricted to an interval of length $\ll \sqrt {N\delta }$. Equation (40) implies

\[ (\tilde{b}_i+\tilde{c}_i)^2 +4 \tilde{e_i}^2= (\tilde{b}_i-\tilde{c}_i)^2-4\operatorname{det}(\xi_i^0)=(\tilde{b}_i-\tilde{c}_i)^2-4\operatorname{det}(g^{-1}\xi_i^0g)\in[0,4N\delta] \]

and $\big |(\tilde {b_1}-\tilde {c}_1)^2-(\tilde {b_2}-\tilde {c}_2)^2\big |\ll N\delta$. We deduce that $\big ||\tilde {b_1}-\tilde {c}_1|-|\tilde {b_2}-\tilde {c}_2|\big |\ll \sqrt {N\delta }$. In particular, $(\tilde {b}_2-\tilde {c}_2)$ is restricted to two intervals of length $\ll \sqrt {N\delta }$.

Consider (43) for $\xi _2^0$ with $(\tilde {b}_2-\tilde {c}_2)$ as a varying quantity in the aforementioned intervals, thus ignoring their dependencies on $e_2,c_2$. It describes an ellipse in the variables $e_2,c_2$ with center $-{(\tilde {b}_2-\tilde {c}_2)/{2y}}\cdot (x,1)$. Because $\xi _1^0$ is fixed, the center of the ellipse is restricted to one of two intervals of length $\ll _{C} y^{-1} \sqrt {N\delta }$. The radii of the ellipse satisfy $\ll _{C} \sqrt {N\delta }, y^{-1}\sqrt {N\delta }$. We deduce that the number of possibilities for $(e_2,c_2)$ is $\ll _{A,C} y^{-1}N\delta +(N\delta )^{1/2}+1\ll _{A,C} N\delta +1$. Once $\xi _1^0$, $e_2$ and $c_2\neq 0$ are fixed, the value of $b_2$ is fixed by the equality $\operatorname {det} \xi _2^0=\operatorname {det}\xi _1^0$. Hence, the total number of pairs $(\xi _1^0,\xi _2^0)$ is bounded from above in this case by

\[ \ll_{A,C} (N^{3/2}\delta+N^{1/2})(N\delta+1)\ll_{A,C} N^{5/2}\delta^2+N^{3/2}\delta+N^{1/2} \ll_{A,C} N^{5/2}\delta^2+N^{1/2} .\]

Proof Proof of Proposition 9.1

Define

\begin{align*} M_\star(g,n;\delta)&\colon= \big\{ \xi\in\operatorname{Mat}_{2\times 2}(\mathbb{Z})\setminus\mathfrak{U} \mid \operatorname{det} \xi=n,\ u(g^{-1}\xi g) < \delta\big\},\\ M_u(g,n;\delta)&\colon= \big\{ \xi\in\operatorname{Mat}_{2\times 2}(\mathbb{Z})\cap\mathfrak{U} \mid \operatorname{det} \xi=n, u(g^{-1}\xi g) < \delta\big\}. \end{align*}

Then, the inequality of means imply

\[ \sum_{n=1}^N M(g,n;\delta)^2\leq 2\sum_{n=1}^N M_\star(g,n;\delta)^2+2\sum_{n=1}^N M_u(g,n;\delta)^2 \]

and we turn to bounding each term individually. The second term is controlled by Lemma 9.3 and is consistent with the claim.

To bound the first term we need to bound the number of pairs $(\xi _1,\xi _2)\in \big (\operatorname {Mat}_{2\times 2}(\mathbb {Z})\setminus \mathfrak {U}\big )^{2}$ such that $0<\operatorname {det}\xi _1=\operatorname {det}\xi _2< N$ and $u(g^{-1}\xi _i g)<\delta$ for $i=1,2$. Assume first $\delta >1$, we then argue as in [Reference Iwaniec and SarnakIS95] to show the stronger bound $M_\star (g,n;\delta )\ll _{A,C,\varepsilon } n^{1+\varepsilon }\delta$. Let $\operatorname {det}\xi =n$ and write $\xi =\left (\begin{smallmatrix} a & b \\ c & d \end{smallmatrix}\right )$ as usual. When $\delta >1$, we can replace the right-hand side in inequalities (39) and (41) by $6N\delta$. If either $a=0$ or $d=0$, then the equation $bc=n$ and the divisor bound imply that we have at most $\ll _\varepsilon n^{\varepsilon }$ possibilities for $(b,c)$. Moreover, (39) implies that there are at most $\ll (n\delta )^{1/2}$ options for $\operatorname {Tr}(\xi )$. Hence, the number of possible values of $\xi$ in these cases is $\ll _\varepsilon n^{1/2+\varepsilon }\delta ^{1/2}\ll n^{1+\varepsilon }\delta$. Assume next $a\neq 0$ and $d\neq 0$. Equation (41) implies that we have at most $\ll y^{-1}(n\delta )^{1/2}$ options for $c$. Likewise, we have $\ll 1+(n\delta )^{{1/2}}(x^2+y^2)y^{-1} \ll _{A,C} y(n \delta )^{{1/2}}$ choices for $b$. This may be seen by either repeating the computation for (41) using the Iwasawa decomposition with respect to the lower triangular unipotents or noting that $\left (\begin{smallmatrix} & 1 \\ -1 & \end{smallmatrix}\right ) g.i= (-x+iy)/(x^2+y^2)$ and $\left (\begin{smallmatrix} & 1 \\ -1 & \end{smallmatrix}\right ) \xi \left (\begin{smallmatrix} & 1 \\ -1 & \end{smallmatrix}\right )^{-1} = \left (\begin{smallmatrix} d & -c \\ -b & a \end{smallmatrix}\right )$. Now that $(b,c)$ is fixed, we use the equality $ad=bc+n$ and the divisor bound to see that there are at most $\ll _\varepsilon n^{\varepsilon }$ possibilities for $(a,d)$. This establishes the inequality for $\delta >1$.

Assume henceforth $\delta <1$. We will be using the simple identity

(44)

\begin{equation} \operatorname{det} \xi=\frac{\operatorname{Tr}(\xi)^2}{4}+\operatorname{det} \xi^0 \end{equation}

and argue in two different ways depending on whether the traces of $\xi _1$, $\xi _2$ are equal or not.

Case I: $|\operatorname {Tr}(\xi _1)|\neq |\operatorname {Tr}(\xi _2)|$. Lemma 9.4 implies that there are at most $\ll _A N^3\delta ^2+N$ options for $(\xi _1^0,\xi _2^0)$. After fixing the traceless parts, (44) fixed $\operatorname {Tr}(\xi _1)^2-\operatorname {Tr}(\xi _2)^2$. Because the traces are not equal in absolute value the divisor bound and the trivial bound $|\operatorname {Tr}(\xi )| \ll N$ imply there are at most $\ll _\varepsilon N^\varepsilon$ choices for the traces. This establishes the claim in this case.

Case II: $|\operatorname {Tr}(\xi _1)|=|\operatorname {Tr}(\xi _2)|$. In this case, we use the trivial bound $|\operatorname {Tr}(\xi _1)|=|\operatorname {Tr}(\xi _2)|\ll N^{1/2}$ from (39) to fix the traces and Lemma 9.5 to fix the traceless part. The final bound is consistent with the claim.

9.2 Second-moment bound for division algebras

In the section, we assume $\mathbf {G}$ is anisotropic, i.e. $B$ is a ramified quaternion algebra over $\mathbb {Q}$, and that $B$ is split over $\mathbb {R}$. Fix an imaginary quadratic field $E/\mathbb {Q}$ of discriminant $D_E$, such that every prime dividing $D_B$ is inert in $E$. Let us denote by $\mathcal {O}_E$ the ring of integers of $E$. By a theorem of Chinburg and Friedman [Reference Chinburg and FriedmanCF99], there is an optimal embedding $\mathcal {O}_E \hookrightarrow \mathcal {R}$. We henceforth identify $\mathcal {O}_E$ with its image in $\mathcal {R}$. Denote by $K_E<\mathbf {G}(\mathbb {R})$ the group of norm $1$ elements in $(E\otimes \mathbb {R})^\times$. Recall that we have a fixed isomorphism $B_\infty \simeq \operatorname {Mat}_{2\times 2}(\mathbb {R})$ that induces a group isomorphism $\mathbf {G}(\mathbb {R})\simeq \mathbf {SL}_2(\mathbb {R})$, which we use to identify the two groups. Moreover, $K_\infty$ was defined as the image of $\mathbf {SO}_2(\mathbb {R})$ in $\mathbf {G}(\mathbb {R})$, and we define similarly $A$ to be the image of the diagonal subgroup in $\mathbf {G}(\mathbb {R})$. The group $K_E$ is conjugate to $K_\infty$, and we can write $K_\infty =h K_E h^{-1}$.

Proposition 9.6 Assume $\mathcal {O}_E \hookrightarrow \mathcal {R}$ is an optimal embedding of the ring of integers $\mathcal {O}_E$ of an imaginary quadratic field $E$ in the maximal order $\mathcal {R}$. Let $h\in \mathbf {G}(\mathbb {R})$ be an element conjugating $K_E$ to $K_{\infty }<\mathbf {SL}_2(\mathbb {R})$. Then, for any $g\in \mathbf {G}(\mathbb {R})$, $1>\delta >0$,

\[ \sum_{n=1}^N M(g,n;\delta)^2 \ll |D_E|^{2+\varepsilon}N^{\varepsilon}\big[N^3\delta^2+(\lambda+\lambda^{-1})^{2+\varepsilon}(N^{5/2}\delta^{3/2}+N)\big] , \]

where we write $\sqrt {\lambda }\geq (\sqrt {\lambda })^{-1}>0$ for the eigenvalue of the diagonal part in the $K_\infty A K_\infty$ Cartan decomposition of $hg$. Moreover, if $\delta \geq 1$ the bound

\[ M(g,n;\delta)\ll_{\varepsilon}\big((\lambda+\lambda^{-1})n \delta |D_E|^{1/2}\big)^{1+\varepsilon} \]

holds for all $g\in \mathbf {G}(\mathbb {R})$ and $n\in \mathbb {N}$.

We now fix $\mathcal {R},E,h,g$ as in the proposition above and prepare some notation and lemmata that we will use in the course of the proof. The proof is very similar to the split case, except that we track the dependence on $g$ differently, not using its Iwasawa decomposition but rather its Cartan decomposition relative to the stabilizer of $E\hookrightarrow B$.

Because of our choice of $E$ as optimally embedded in $\mathcal {R}$, we can find an isomorphism $B\otimes E\simeq M_2(E)$ where $\mathcal {R}$ is mapped to

\[ \bigg\{\!\begin{pmatrix} a & D_B b \\ ^\sigma{b}{} & ^\sigma{a}{} \end{pmatrix} \colon a,b\in\widehat{\mathcal{O}_E}, a+b\in \mathcal{O}_E \bigg\} \]

and after fixing a field embedding $E\hookrightarrow \mathbb {C}$ the algebra $B_\infty$ coincides with $\big \{\big (\begin{smallmatrix} a & D_B b \\ ^\sigma{b}{} & ^\sigma{a}{} \end{smallmatrix}\big )\colon a,b\in \mathbb {C}\big \}$. We denote by $B_\infty ^0$ the subspace of traceless elements, equivalently pure quaternions. There is a direct sum decomposition $B_\infty = \mathbb {R}\operatorname {Id} + B_\infty ^0$. This decomposition is preserved by the conjugation action. In our new coordinate system, the space $B_\infty ^0$ is identified with $i\mathbb {R}\times \mathbb {C}$ and the projection map becomes $(a,b)\mapsto (a^0,b)$ where $a^0=(a-^\sigma{a}{})/2=i \Im a$ is the traceless part of $a\in \mathbb {C}$. The space $B_\infty ^0$ is equipped with an inner product constructed as the direct sum of the standard inner product on $\mathbb {R}$ and $\mathbb {C}$, i.e. $|(a^0,b)|^2=|a^0|^2+|b|^2$. Let $\mathcal {R}^0$ be the projection of $\mathcal {R}$ to $B_\infty ^0$. Then, $\mathcal {R}^0< B_\infty ^0$ is a lattice of co-volume $\asymp 1$.

In this new coordinate system, we have for a quaternion of positive normFootnote ¹³

\[ u\bigg(h\begin{pmatrix} a & D_B b \\ ^\sigma{b}{} & ^\sigma{a}{} \end{pmatrix}h^{-1}\bigg)=\frac{1}{2}\bigg[\frac{\operatorname{Nr} a+ D_B \operatorname{Nr} b}{\operatorname{Nr} a - D_B\operatorname{Nr} b}-1\bigg]= \frac{D_B \operatorname{Nr} b}{\operatorname{Nr} a - D_B\operatorname{Nr} b}. \]

Hence, if we write in coordinates

\[ (gh)^{-1} \xi (gh)=\begin{pmatrix} a & D_B b \\ ^\sigma{b}{} & ^\sigma{a}{} \end{pmatrix} \]

with $\operatorname {Nr} \xi >0$, then the conditions $u(g^{-1}\xi g)<\delta$, $\operatorname {Nr} \xi < N$ imply

(45)

\begin{align} \operatorname{Nr} a + D_B \operatorname{Nr} b&< N(2\delta+1), \end{align}

(46)

\begin{align} D_B \operatorname{Nr} b &< \delta N. \end{align}

The traceless part of $(gh)^{-1} \xi (gh)$ is

\[ (gh)^{-1} \xi^0 (gh)= \begin{pmatrix} a^0 & D_B b \\ ^\sigma{b}{} & ^\sigma{a}{^0} \end{pmatrix}, \]

where $a^0=i \Im a$ is the traceless part of $a$. Equation (45) implies that $\operatorname {Nr} a^0 \leq N(2\delta +1)$. Motivated by these inequalities, we denote

\begin{align*} B_\infty^0\supset X&\colon- \left\{x=\begin{pmatrix} a^0 & D_B b \\ ^\sigma{b}{} & ^\sigma{a}{^0} \end{pmatrix}\colon D_B \operatorname{Nr} b\leq \delta N, \operatorname{Nr} a^0\leq N(2\delta+1) \right\},\\ B_\infty^0\supset X^g&\colon- (gh)X(gh)^{-1}. \end{align*}

We decompose $gh$ according to a Cartan decomposition in $K_E A_E K_E$ where $A_E$ is the orthogonal group preserving the quadratic form $(\Im a)^{2}-D_B(\Im b)^2$. Equivalently, the Lie algebra of $A_E$ is

\[ \operatorname{Lie} A_E= \mathbb{R}\cdot \begin{pmatrix} 0 & D_B \\ 1 & 0 \end{pmatrix}< B_\infty^0. \]

Write $gh=k_2 a_E k_1$ with $k_1,k_2\in K_E$, $a_E\in A_E$, and denote by $\sqrt {\lambda }\geq (\sqrt {\lambda })^{-1}>0$ the eigenvalues of $a_E$. Then, $\sqrt {\lambda },(\sqrt {\lambda })^{-1}$ are also the eigenvalues of the diagonal part of the regular $K_\infty AK_\infty$ Cartan decomposition of $hg=h(gh)h^{-1}$, i.e. the singular values.

The set $X$ is invariant under conjugation by $K_E$, hence $X^g=(k_2 a_E)X (k_2 a_E)^{-1}$. We can write the equations defining the set $a_E X a_E^{-1}$ explicitly by decomposing the Lie algebra $B_\infty ^0$ into the weight spaces of $A_E$. The result of the computation is that every $x=\big (\begin{smallmatrix} a^0 & D_B b \\ ^\sigma{b}{} & ^\sigma{a}{^0} \end{smallmatrix}\big )\in a_E X a_E^{-1}$ satisfies

(47)

\begin{align} \bigg(\frac{\lambda+\lambda^{-1}}{2}\Im a^0+\sqrt{D_B}\frac{\lambda-\lambda^{-1}}{2}\Im b\bigg)^2&\leq N(2\delta+1), \end{align}

(48)

\begin{align} \bigg(\frac{\lambda-\lambda^{-1}}{2}\Im a^0+\sqrt{D_B}\frac{\lambda+\lambda^{-1}}{2}\Im b\bigg)^2 +D_B (\Re b)^2&\leq N\delta. \end{align}

The set $X^g=k_2 (a_E X a_E^{-1}) k_2^{-1}$ is a rotation of $a_E X a_E^{-1}$ around the $\Im a^0$ axis. Hence, the equations defining $X^g$ are derived from (47) and (48) by a rotation in the $b$-plane. Note that (47) and (48) imply that $|\Im a^0| \ll (\lambda +\lambda ^{-1})N^{1/2}(1+\delta ^{1/2})$. Because the axis $\Im a^0$ is invariant under conjugation by $k_2$ this inequality holds also for $X^g$.

Lemma 9.7 Assume $0<\delta \leq 1$. Then,

\[ \#\big(\mathcal{R}^0 \cap X^{g} \big) \ll |D_E|\big(N^{3/2}\delta+(\lambda+\lambda^{-1})(N\delta^{1/2}+N^{1/2})\big). \]

Proof. From $|\Im a^0| \ll (\lambda +\lambda ^{-1})N^{1/2}$ we deduce that there are $\ll (\lambda +\lambda ^{-1})(N|D_E|)^{1/2}$ possibilities for $a^0=i\Im a^0$. The second equation (48) implies that for any fixed $a^0=i\Im a^0$ the element $b$ belongs to an ellipse with radii $\sqrt {N\delta /D_B}$, $(2/(\lambda +\lambda ^{-1}))\sqrt {N\delta /D_B}$. Conjugation by $k_2$ amounts to rotating the set around the $\Im a^0$ axis. Hence, this observation remains valid for $X^g$. We deduce that for any fixed $\Im a^0$ we have

\[ \ll \frac{N\delta\sqrt{|D_E|}}{D_B(\lambda+\lambda^{-1})}+\bigg(\frac{N\delta|D_E|}{D_B}\bigg)^{1/2}+1 \]

possibilities for $b$. The claim follows by multiplying the number of possibilities for $a^0$ by the number of possible $b$ for each $a^0$.

Lemma 9.8 Assume $0<\delta \leq 1$. Then,

\[ \#\big\{ (\xi_1^0,\xi_2^0)\in\big(\mathcal{R}^0 \cap X^g\big)^{2} \colon \operatorname{Nr} \xi_1^0=\operatorname{Nr} \xi_2^0 \big\}\ll_\varepsilon |D_E|^{1+\varepsilon} (\lambda+\lambda^{-1})^{2+\varepsilon}N^\varepsilon \big(N^2\delta^{3/2}+N^{1/2}\big) . \]

Proof. Write

\[ \xi_{1,2}^0=\begin{pmatrix} a_{1,2}^0 & D_B b_{1,2} \\ ^\sigma{b}{_{1,2}} & ^\sigma{a}{_{1,2}^0} \end{pmatrix} \]

and assume $\xi _1^0,\xi _2^0\in X^g$ and $\operatorname {Nr}\xi _1^0=\operatorname {Nr} \xi _2^0$. Our goal is to count the number of possible pairs $(\xi _1^0,\xi _2^0)$.

For every $\xi \in B_\infty ^0$ let $\tilde {a}^0$ be the $a^0$ coordinate of $a_E^{-1}\xi a_E$. Then,

\[ \Im \tilde{a}^0=\frac{\lambda+\lambda^{-1}}{2}\Im a^0+\sqrt{D_B}\frac{\lambda-\lambda^{-1}}{2}\Im b. \]

Moreover, $\tilde {a}^0$ is also the $a^0$ coordinate of $(gh)^{-1}\xi (gh)$ because conjugation by $K_E$ acts trivially on the $a^0$-axis. By substitution, we can rewrite (48) as

(49)

\begin{equation} \bigg(\frac{\lambda-\lambda^{-1}}{\lambda+\lambda^{-1}}\Im \tilde{a}^0+\sqrt{D_B}\frac{2}{\lambda+\lambda^{-1}}\Im b\bigg)^2+D_B(\Re b)^2 \leq N\delta . \end{equation}

Assume $\xi \in X^g$. Because (45) implies that $|\Im \tilde {a}^0|\ll N^{1/2}$, we see that (49) restricts $b\in X^g$ to an ellipse with radii $(N\delta /D_B)^{1/2},(\lambda +\lambda ^{-1})(N\delta /D_B)^{1/2}$ and center in an interval of length $\ll N^{1/2}$. We deduce that there are at most

(50)

\begin{equation} \ll (\lambda+\lambda^{-1})N\delta^{1/2}\sqrt{\frac{|D_E|}{D_B}}+(\lambda+\lambda^{-1})N^{1/2}\sqrt{ \frac{|D_E|}{D_B}}+1 \end{equation}

choices for $b$ if $\xi \in X^g$. Moreover, we see that necessarily $\operatorname {Nr} b \ll (\lambda +\lambda ^{-1})N$

Case I: $|a_1^0|=|a_2^0|$. In this case, the condition $\operatorname {Nr}\xi _1^0=\operatorname {Nr} \xi _2^0$ implies that $\operatorname {Nr} b_1=\operatorname {Nr} b_2$. Because there are at most $\ll _\varepsilon (n|D_E|)^{\varepsilon }$ elements of norm $n$ in $\widehat {\mathcal {O}}_E$ and $\operatorname {Nr} b_2\ll (\lambda +\lambda ^{-1})N$ we see that for any fixed $\xi _1^0$ there are at most $\ll _\varepsilon ((\lambda +\lambda ^{-1})N|D_E|)^\varepsilon$ possibilities for $\xi _2^0$. We deduce from Lemma 9.7 that the number of possible pairs $(\xi _1,\xi _2)$ with $|a_1^0|=|a_2^0|$ satisfies

\[ \ll_\varepsilon ((\lambda+\lambda^{-1})N|D_E|)^\varepsilon|D_E| \big(N^{3/2}\delta+(\lambda+\lambda^{-1})(N\delta^{1/2}+N^{1/2})\big) \]

and this bound is compatible with the claim.

Case II: $|a_1^0|\neq |a_2^0|$. In this case, we will first count the number of possibilities for $(b_1,b_2)$. We bound the number of choices for $b_1$ using (50) above. If $\xi \in X^g$, then (46) implies

\[ (\Im \tilde{a}^0)^2-\operatorname{Nr} \xi=(\Im \tilde{a}^0)^2-\operatorname{Nr}((gh)^{-1}\xi(gh))\in[0,N\delta]. \]

Thus, we deduce for $\xi _{1,2}^0$ that $\big |(\Im \tilde {a}_1^0)^2-(\Im \tilde {a}_2^0)^2 \big |\leq 2N\delta$ and

(51)

\begin{equation} \big||\Im \tilde{a}_1^0|-|\Im \tilde{a}_2^0|\big|\ll \sqrt{N\delta}. \end{equation}

Once $b_1$ is fixed, (49) restricts $(( {\lambda -\lambda ^{-1})/(\lambda +\lambda ^{-1}}))\Im \tilde {a}_1^0$ to an interval of length $\ll \sqrt {N\delta }$. Equation (51) then restricts $((\lambda -\lambda ^{-1})/ (\lambda +\lambda ^{-1}))|\Im \tilde {a}_2^0|$ to an interval also of length $\ll \sqrt {N\delta }$.

This constraints the possibilities for the center of the ellipse in (49) for $b_2$ into two intervals of length $\ll \sqrt {N\delta }$. Hence, given $b_1$, there are at most

\[ \ll (\lambda+\lambda^{-1})N\delta\sqrt\frac{|D_E|}{|D_B|}+(\lambda+\lambda^{-1})(N\delta)^{1/2}\sqrt\frac{|D_E|}{|D_B|}+1 \]

options for the $b_2$.

After fixing $b_1,b_2$, we use the condition $\operatorname {Nr} \xi _1^0=\operatorname {Nr} \xi _2^0$ to fix $(\Im a_1^0)^2-(\Im a_2^0)^2$. The divisor bound and the condition $|\Im a_1^0|\neq |\Im a_2^0|$ now implies there are at most $\ll _\varepsilon ((\lambda +\lambda ^{-1})N|D_E|)^\varepsilon$ options for the pair $(a_1^0,a_2^0)$.

The total number of possible pairs $(\xi _1^0,\xi _2^0)$ in this case is thus bounded by

\begin{align*} &\ll_\varepsilon ((\lambda+\lambda^{-1})N|D_E|)^\varepsilon|D_E|\big( (\lambda+\lambda^{-1})N\delta^{1/2}+(\lambda+\lambda^{-1})N^{1/2}\big)\\ &\quad \cdot \big((\lambda+\lambda^{-1})N\delta+(\lambda+\lambda^{-1})(N\delta)^{1/2}+1\big). \end{align*}

This bound is also compatible with the claim.

Proof Proof of Proposition 9.6

Assume first $\delta >1$. Then, we follow [Reference Iwaniec and SarnakIS95] to establish the bound $M(g,n;\delta )\ll _{\varepsilon } \big ((\lambda +\lambda ^{-1})n \delta |D_E|^{1/2} \big )^{1+\varepsilon }$. We have the bounds $\operatorname {Tr}(a)=2\Re a\ll (n\delta )^{1/2}$ and $\Im a=\Im a^0\ll (\lambda +\lambda ^{-1})(n\delta )^{1/2}$. After fixing $a$ we can fix $b$ using the equality $n=\operatorname {det}\xi =\operatorname {Nr} a- D_B\operatorname {Nr} b$. The divisor bound and the inequality $\operatorname {Nr} b \ll (\lambda +\lambda ^{-1})^2n \delta$ imply we have at most $\ll _\varepsilon (\lambda +\lambda ^{-1})^\varepsilon n^\varepsilon \delta ^{\varepsilon } |D_E|^{\varepsilon }$ choices for $b$.

Assume next $\delta \leq 1$. Once again, an important role is reserved for the simple formula

(52)

\begin{equation} \operatorname{Nr} x=\frac{\operatorname{Tr}(x)^2}{4}+\operatorname{Nr} x^0 \end{equation}

that holds for all $x\in B_\infty$ with $x^0\in B_\infty ^0$ the traceless part of $x$. Our goal is to bound the number of pairs $(\xi _1,\xi _2)\in \mathcal {R}$ such that $0\leq \operatorname {Nr}\xi _1=\operatorname {Nr}\xi _2 \leq N$ and $u(g^{-1}\xi _1g)=u(g^{-1}\xi _2 g)<\delta$.

Case I: $|\operatorname {Tr}(\xi _1)|\neq |\operatorname {Tr}(\xi _2)|$. Lemma 9.7 implies that the number of possibilities for the pair $(\xi _1^0,\xi _2^0)$ is bounded by

\[ \ll |D_E|^2 \big(N^3\delta^2+(\lambda+\lambda^{-1})^2(N^{5/2}\delta^{3/2}+N)\big). \]

For any pair $(\xi _1^0,\xi _2^0)\in B_\infty ^0\times B_\infty ^0$, the lifts to $B_\infty \times B_\infty$ are determined by $\big (\operatorname {Tr}(\xi _1),\operatorname {Tr}(\xi _2)\big )$.

From the formula (52), we derive $\operatorname {Tr}(\xi _1)^2-\operatorname {Tr}(\xi _2)^2=4\big (\operatorname {Nr} \xi _2^0-\operatorname {Nr}\xi _1^0\big )$. The right-hand side is bounded in absolute value by $\ll N$. The divisor bound and the assumption $|\operatorname {Tr}(\xi _1)|\neq |\operatorname {Tr}(\xi _2)|$ imply that for every $(\xi _1^0,\xi _2^0)$ the number of possible pairs $\big (\operatorname {Tr}(\xi _1),\operatorname {Tr}(\xi _2)\big )$ is bounded by $\ll _\varepsilon N^\varepsilon$. The cumulative bound is consistent with the claim.

Case II: $|\operatorname {Tr}(\xi _1)|= |\operatorname {Tr}(\xi _2)|$. In this case, formula (52) implies that $\operatorname {Nr} \xi _1^0=\operatorname {Nr} \xi _2^0$ and we can bound the total number of pairs $(\xi _1^0,\xi _2^0)$ using Lemma 9.8. The number of pairs $\big (\operatorname {Tr}(\xi _1),\operatorname {Tr}(\xi _2)\big )$ is trivially bounded by $\ll \sqrt {N}$ because $|\operatorname {Tr}(\xi _1)|= |\operatorname {Tr}(\xi _2)|$. The resulting bound on the pairs $(\xi _1,\xi _2)$ is consistent with the claim.

10. Proof of the main theorem

This section is dedicated to establishing our main result, Theorem 1.1. Recall that $\mathcal {B}_m^{\mathrm {new}}$ is an orthonormal basis of Hecke newforms of weight $m>2$. We can combine Corollaries 7.7 and 8.5 to deduce

(53)

\begin{align} \sum_{f \in \mathcal{B}_m^\mathrm{new}} |f^\sharp(g)|^4 &\ll_{\varepsilon} (qD_B)^{3+\varepsilon} \frac{m^2}{m-2} \bigg\{\frac{m}{2}\int_0^\infty \bigg[\sum_{n=1}^{(qD_B)^2m} \frac{1}{n}M(g,n;\delta)^2\bigg]^{1/2}\nonumber\\ &\quad+\bigg[\sum_{n>(qD_B)^2m} \frac{\exp(-n/(qD_B)^2)}{n} M(g,n;\delta)^2 \bigg]^{1/2} \frac{{d}\delta}{(1+\delta)^{m/2+1}}\bigg\}^2. \end{align}

On the right-hand side, we have denoted by $M(g,n;\delta )$ the counting function associated to a maximal order containing $\mathcal {R}$.

10.1 Proof of the main theorem for the split matrix algebra

Let $\mathcal {F}$ be a fundamental domain for the action $\mathbf {SL}_2(\mathbb {Z})$ on $\mathbb {H}$. Recall that in this case $\Gamma =\Gamma _0(q)<\mathbf {SL}_2(\mathbb {Z})$. For $g\in \mathbf {G}(\mathbb {R})=\mathbf {SL}_2(\mathbb {R})$, we denote

\[ \mathrm{ht}_\Gamma(g)=\min\big\{ y\mid \exists \gamma\in\mathbf{SL}_2(\mathbb{Z})\colon (\gamma g).i=x+iy\in\mathcal{F}\big\}. \]

We first bound the sum $\sum _{n=1}^{(qD_B)^2m}$. Because $M(g,n;\delta )$ is the count associated to the maximal order $\operatorname {Mat}_{2\times 2}(\mathbb {Z})$, the sum is invariant under the operation of replacing $g$ by $\gamma g$ for any $\gamma \in \mathbf {SL}_2(\mathbb {Z})$. In particular, we can arrange $g.i=x+iy$ with $y=\mathrm {ht}_\Gamma (g)$. We need to convert the logarithmic sum $\sum ( {1}/{n})M(g,n;\delta )^2$ to an unweighted sum. We achieve this using the general, integration-by-parts, identity

\[ \sum_{n=1}^N \frac{1}{n}f(n)=\frac{1}{N}\sum_{n=1}^N f(n)+\int_1^N\frac{1}{t}\sum_{n=1}^t f(n) \frac{{d} t}{t}, \]

which holds for any $f\colon \mathbb {N}\to \mathbb {C}$. This identity and Proposition 9.1 imply

(54)

\begin{equation} \sum_{n=1}^{N} \frac{1}{n}M(g,n;\delta)^2\ll_{\varepsilon} N^{2+\varepsilon}\delta^2+N^\varepsilon+\mathrm{ht}_\Gamma(g)^2\begin{cases} N^{1+\varepsilon}\delta^{3/2}+N^{1/2+\varepsilon}\delta, & \delta<1,\\ N^{1+\varepsilon}\delta, & \delta\geq 1, \end{cases} \end{equation}

where $N=(qD_B)^2m$. We next need to compute the integral $\int _0^\infty \sqrt {\cdots }( {{d}\delta }/{(1+\delta )^{m/2+1}})$. We use the $l^2$–$l^1$ inequality to separate the terms in (54) under the square root. To compute the contribution of the term proportional to $\mathrm {ht}_\Gamma (g)$ in (54), we split the integral over $\delta$ into $\int _0^1+\int _1^\infty$. For the former integral, we have $(1+\delta ) \ge e^{{\delta /2}}$ for $\delta \in [0,1]$ as the function $(1+\delta )e^{- {\delta /2}}$ only increases in the interval $[0,1]$ and is equal to $1$ at $0$. Hence, for $\kappa >-1$, we haveFootnote ¹⁴

\[ \int_0^1 \frac{\delta^{\kappa}}{(1+\delta)^{m/2+1}}\,{d} \delta \le \int_0^{\infty} \delta^{\kappa}e^{-\delta(m/4+1/2)} \,{d} \delta = \frac{\Gamma(1+\kappa)}{(m/4+1/2)^{1+\kappa}} \ll_{\kappa} m^{-1-\kappa}. \]

On the other hand, we have for $\kappa < {m/2}$ that

\[ \int_1^{\infty} \frac{\delta^{\kappa}}{(1+\delta)^{m/2+1}}\,{d} \delta \le \int_1^{\infty} (1+\delta)^{\kappa-m/2-1} \,{d} \delta = \frac{2^{-(m/2-\kappa)}}{m/2-\kappa}. \]

Hence, for the first two terms in (54), we have

\[ \int_0^\infty \frac{N^{1+\varepsilon}\delta+N^{\varepsilon}}{(1+\delta)^{m/2+1}}\,{d} \delta \ll_{\kappa} \frac{N^{1+\varepsilon}m^{-1}+N^{\varepsilon}}{m-2}\ll \frac{(qD_B)^{2+\varepsilon}m^{\varepsilon}}{m-2}, \]

and for the last term, we have

\begin{align*} &\mathrm{ht}_\Gamma(g) \int_0^\infty\begin{cases} N^{1/2+\varepsilon}\delta^{3/4}+N^{1/4+\varepsilon}\delta^{1/2}, & \delta<1,\\ N^{1/2+\varepsilon}\delta^{1/2}, & \delta\geq~1, \end{cases}\cdot\frac{{d}\delta}{(1+\delta)^{m/2+1}} \\ &\quad \ll_{\kappa} \mathrm{ht}_\Gamma(g) N^\varepsilon\bigg (\frac{N^{1/2}}{m^{1+3/4}}+\frac{N^{1/4}}{m^{1+1/2}}+2^{-m/2}\frac{N^{1/2}}{m-2}\bigg)\\ &\quad \ll \mathrm{ht}_\Gamma(g) (qD_B)^{1+\varepsilon}\frac{m^{-1/4+\varepsilon}}{m-2}. \end{align*}

In conclusion,

\[ \int_0^\infty \bigg[ \sum_{n=1}^{(qD_B)^2m} \frac{1}{n}M(g,n;\delta)^2\bigg]^{1/2} \frac{{d} \delta}{(1+\delta)^{m/2+1}}\ll_\varepsilon (qD_B)^{2+\varepsilon}m^{\varepsilon}\frac{1+\mathrm{ht}_\Gamma(g)m^{-1/4}}{m-2}. \]

The computation of the bound for the integral

\[ \int_0^{\infty} \bigg[\sum_{n>(qD_B)^2m} \frac{\exp(-n/(qD_B)^2)}{n} M(g,n;\delta)^2 \bigg]^{1/2} \frac{{d}\delta}{(1+\delta)^{m/2+1}} \]

uses a very similar argument, except that we need to apply the integration-by-parts identity

\[ \sum_{n>A m} \frac{\exp(-n/A)}{n}f(n)=-\frac{\exp(-m)}{Am}\sum_{n=1}^{Am} f(n) +\frac{1}{A}\int_m^\infty \exp(-t) \sum_{n=1}^{At}f(n)\bigg(1+\frac{1}{t}\bigg)\frac{{d} t}{t}, \]

that holds for any function $f\colon \mathbb {N}\to \mathbb {C}$ satisfying $\log f(n)=o(n)$ and $A,m\geq 1$. The contributions of these terms is then easily seen to be negligible.

Combining these inequalities with (53), we arrive at

\[ \sum_{f \in \mathcal{B}_m^\mathrm{new}} |f^\sharp(g)|^4 \ll_{\varepsilon} (qD_B)^{7+\varepsilon}\frac{m^{4+\varepsilon}}{(m-2)^3}\big(1+\mathrm{ht}_\Gamma(g)^2m^{-1/2}\big). \]

This is consistent with the first claim in Theorem 1.1 for $m>2$. As mentioned in the introduction, the second claim requires the additional input of [Reference Blomer, Khan and YoungBKY13, Theorem 1.8], which says that most of the $L^4$-mass is concentrated on $\mathrm {ht}_{\Gamma }(g) \ll m^{{1/4}}$. Since the extension of said theorem to include a polynomial-level dependence follows their proof almost verbatim, we leave it to the reader.

10.2 Proof of the main theorem for division algebras

In this section, we use the notation of § 9.2. We follow the same arguments as for the split algebra replacing Proposition 9.1 by Proposition 9.6 to arrive at

\[ \sum_{f \in \mathcal{B}_m^\mathrm{new}} |f^\sharp(g)|^4 \ll_{\varepsilon} |D_E|^{2+\varepsilon}(\lambda+\lambda^{-1})^{2+\varepsilon}(qD_B)^{7+\varepsilon}\frac{m^{4+\varepsilon}}{(m-2)^3}. \]

Recall that $\mathcal {O}_E \hookrightarrow \mathcal {R}$ is any optimal embedding of the ring of integers of an imaginary quadratic field into the fixed maximal order. By [Reference Chinburg and FriedmanCF99], this is always possible if any prime dividing $D_B$ is inert in $E$. Using the Chinese remainder theorem, one deduces that such a discriminant $D_E$ exists satisfying $|D_E|\ll D_B$.

Lastly, we can replace $g$ by any $\gamma g$ for any $\gamma \in \Gamma$, hence $\lambda +\lambda ^{-1}$ is polynomially bounded by the volume of ${\Gamma \backslash \mathbf {G}(\mathbb {R})}$, as follows from [Reference Chu and LiCL16]. The latter has been recently improved by the second named author in [Reference SteinerSte23].

Acknowledgements

It is a pleasure to thank Paul Nelson for many enlightening discussions on this project and very useful comments on an earlier draft of this manuscript. We would like to deeply thank Peter Sarnak for his continuous encouragement and numerous fruitful discussions on the topic. We thank Jared Wunsch for his help with the analytic aspects of this work. We are also grateful to Valentin Blomer and Simon Marshall for their comments on a previous version of the manuscript. Last but not least, we would like to thank the referees for their thorough reading of the manuscript and their suggestions which lead to simplifications in various parts and improved the overall quality of the manuscript.

The majority of this work was conducted during a stay of the second named author at Northwestern University, and subsequently completed at the respective home institutions: Northwestern University and Institute for Advanced Study/ETH Zürich.

Conflicts of interest

None.

Financial support

The first named author has been supported by an AMS Centennial Fellowship and he would like to thank the AMS for its generosity. The second named author would like to thank Northwestern University for their hospitality, the Institute for Advanced Study, where he was supported by the National Science Foundation Grant No. DMS-1638352 and the Giorgio and Elena Petronio Fellowship Fund II, and the Institute for Mathematical Research (FIM) at ETH Zürich. During revisions for publication, the second named author was employed at the Huawei Research Center in Zürich.

Journal information

Compositio Mathematica is owned by the Foundation Compositio Mathematica and published by the London Mathematical Society in partnership with Cambridge University Press. All surplus income from the publication of Compositio Mathematica is returned to mathematics and higher education through the charitable activities of the Foundation, the London Mathematical Society and Cambridge University Press.

Footnotes

1 With respect to the invariant probability measure.

2 In the published version $\delta =1/32$ appears, though this has been corrected to $\delta =1/64$ in a recent revision on the arXiv [Reference Das and SenguptaDS20, Remark 4.1].

3 Here, $\Gamma <\mathbf {G}(\mathbb {Q})$ is a congruence lattice in an indefinite inner form $\mathbf {G}$ of $\mathbf {SL}_2$ and $\Lambda <\mathbf {SL}_2(\mathbb {Q})$ is a lattice in the split form $\mathbf {SL}_2$ that arises in Shimizu's explicit Jacquet–Langlands transfer of modular forms on ${\Gamma \backslash \mathbf {G}(\mathbb {R})}$.

4 That is, all orders everywhere locally conjugate to $R$ by a norm $1$ element, where two orders are equivalent if they are globally conjugate by a rational norm $1$ element.

5 The sum of four odd numbers $\sum _{j=1}^4 \epsilon _j (2n_j+1)$ is always even.

6 Note that $\rho (-1).M(x)=M(-x)$ and that our function is symmetric, so $-1$ acts trivially.

7 This follows from the facts that the action of $\mathbf {SL}_2(\mathbb {Q}_p)$ is strongly transitive, i.e. it is transitive on pairs $(\mathcal {C},\mathcal {A})$ where $\mathcal {C}$ is a chamber in the apartment $\mathcal {A}$, and that $\mathbf {PGL}_2(\mathbb {Q}_p)$ has an element which inverts the orientation of a single chamber.

8 This definition does not depend on the choice of $\jmath _v$.

9 For a fixed $g\in \mathbf {PGL}_2(\mathbb {A})$ this is the Fourier expansion of the function $n\mapsto \varphi (ng)$ evaluated at $n=e$. The function $n\mapsto \varphi (ng)$ is a continuous function on the compact Abelian group $[\mathbf {N}(\mathbb {A})]$. If the Fourier transform of a continuous functions converges absolutely, then it coincides with the function everywhere.

10 This happens for any two lattices in a rational vector space.

11 Such an element does not exist if $B$ is ramified at infinity, but then there are also no elements $\xi _\alpha \in B$ of negative norm.

12 The function $M_\infty ^{(m)}$ is not necessarily in $C^2(B_\infty )$ and we extend the definition of $L_{2\pi }$ to this function in the naive way, in particular this equality does not need to be well-defined on the cone $\operatorname {Nr} x =0$.

13 Note that the transpose operation with respect to $K_E$ is $\big (\begin{smallmatrix} \alpha & D_B \beta \\ ^\sigma{\beta }{} & ^\sigma{\alpha }{}\end{smallmatrix}\big )\mapsto \big (\begin{smallmatrix} ^\sigma{\alpha }{} & D_B \beta \\ ^\sigma{\beta }{} & \alpha \end{smallmatrix}\big )$.

14 We thank the referee for this simplification in estimating the integral.

References

Atkin, A. O. L. and Lehner, J., Hecke operators on

$\Gamma _{0}(m)$, Math. Ann. 185 (1970), 134–160.Google Scholar

Assing, E., On sup-norm bounds part I: ramified Maaß newforms over number fields, J. Eur. Math. Soc. (JEMS) 26 (2024), 1559–1609.Google Scholar

Blomer, V. and Holowinsky, R., Bounding sup-norms of cusp forms of large level, Invent. Math. 179 (2010), 645–681.CrossRef Google Scholar

Blomer, V., Harcos, G. and Milićević, D., Bounds for eigenforms on arithmetic hyperbolic 3-manifolds, Duke Math. J. 165 (2016), 625–659.CrossRef Google Scholar

Blomer, V., Harcos, G., Maga, P. and Milićević, D., The sup-norm problem for GL(2) over number fields, J. Eur. Math. Soc. (JEMS) 22 (2020), 1–53.Google Scholar

Blomer, V., Khan, R. and Young, M., Distribution of mass of holomorphic cusp forms, Duke Math. J. 162 (2013), 2609–2644.Google Scholar

Blomer, V. and Michel, P., Sup-norms of eigenfunctions on arithmetic ellipsoids, Int. Math. Res. Not. IMRN 2011 (2011), 4934–4966.Google Scholar

Blomer, V. and Michel, P., Hybrid bounds for automorphic forms on ellipsoids over number fields, J. Inst. Math. Jussieu 12 (2013), 727–758.Google Scholar

Blomer, V. and Maga, P., The sup-norm problem for PGL(4), Int. Math. Res. Not. IMRN 2015 (2015), 5311–5332.CrossRef Google Scholar

Blomer, V. and Maga, P., Subconvexity for sup-norms of cusp forms on

${\rm {PGL}}(n)$, Selecta Math. (N.S.) 22 (2016), 1269–1287.Google Scholar

Borel, A., Some finiteness properties of adele groups over number fields, Publ. Math. Inst. Hautes Études Sci. 16 (1963), 5–30.Google Scholar

Blomer, V. and Pohl, A., The sup-norm problem on the Siegel modular space of rank two, Amer. J. Math. 138 (2016), 999–1027.Google Scholar

Chinburg, T. and Friedman, E., An embedding theorem for quaternion algebras, J. Lond. Math. Soc. (2) 60 (1999), 33–44.Google Scholar

Chu, M. and Li, H., Small generators of cocompact arithmetic Fuchsian groups, Proc. Amer. Math. Soc. 144 (2016), 5121–5127.Google Scholar

Das, S. and Sengupta, J.,

$L^{\infty }$ norms of holomorphic modular forms in the case of compact quotient, Forum Math. 27 (2015), 1987–2001.CrossRef Google Scholar

Das, S. and Sengupta, J.,

$L^\infty$ norms of holomorphic modular forms in the case of compact quotient, Preprint (2020), arXiv:1301.3677v3.Google Scholar

Gross, B. H. and Zagier, D. B., Heegner points and derivatives of

$L$-series, Invent. Math. 84 (1986), 225–320.CrossRef Google Scholar

Hoffstein, J. and Lockhart, P., Coefficients of Maass forms and the Siegel zero, Ann. of Math. (2) 140 (1994), 161–181; with an appendix by D. Goldfeld, J. Hoffstein and D. Lieman.CrossRef Google Scholar

Hu, Y., Nelson, P. D. and Saha, A., Some analytic aspects of automorphic forms on

${\rm {GL}}(2)$ of minimal type, Comment. Math. Helv. 94 (2019), 767–801.Google Scholar

Hu, Y. and Saha, A., Sup-norms of eigenfunctions in the level aspect for compact arithmetic surfaces, II: newforms and subconvexity, Compos. Math. 156 (2020), 2368–2398.CrossRef Google Scholar

Harcos, G. and Templier, N., On the sup-norm of Maass cusp forms of large level: II, Int. Math. Res. Not. IMRN 2012 (2012), 4764–4774.CrossRef Google Scholar

Harcos, G. and Templier, N., On the sup-norm of Maass cusp forms of large level. III, Math. Ann. 356 (2013), 209–216.CrossRef Google Scholar

Ichino, A., Trilinear forms and the central values of triple product

$L$-functions, Duke Math. J. 145 (2008), 281–307.Google Scholar

Iwaniec, H. and Sarnak, P.,

$L^{\infty }$ norms of eigenfunctions of arithmetic surfaces, Ann. of Math. (2) 141 (1995), 301–320.CrossRef Google Scholar

Jacquet, H. and Langlands, R. P., Automorphic forms on GL(2), Lecture Notes in Mathematics, vol. 114 (Springer, Berlin–New York, 1970).CrossRef Google Scholar

Khan, R., On the fourth moment of holomorphic Hecke cusp forms, Ramanujan J. 34 (2014), 83–107.CrossRef Google Scholar

Kıral, E. M., Bounds on sup-norms of half-integral weight modular forms, Acta Arith. 165 (2014), 385–399.CrossRef Google Scholar

Khayutin, I., Nelson, P. D. and Steiner, R. S., Theta functions, fourth moments of eigenforms, and the sup-norm problem II, Forum Math. Pi 12 (2024), e11.Google Scholar

Marshall, S., Upper bounds for Maass forms on semisimple groups, Preprint (2014), arXiv:1405.7033.Google Scholar

Nelson, P. D., Equidistribution of cusp forms in the level aspect, Duke Math. J. 160 (2011), 467–501.CrossRef Google Scholar

Nelson, P. D., Evaluating modular forms on Shimura curves, Math. Comp. 84 (2015), 2471–2503.CrossRef Google Scholar

Nelson, P. D., Quantum variance on quaternion algebras, I, Preprint (2016), arXiv:1601.02526.Google Scholar

Nelson, P. D., Quantum variance on quaternion algebras, II, Preprint (2017), arXiv:1702.02669.Google Scholar

Nelson, P. D., Quantum variance on quaternion algebras, III, Preprint (2019), arXiv:1903.08686.Google Scholar

Nelson, P. D., Bounds for twisted symmetric square

$L$-functions via half-integral weight periods, Forum Math. Sigma 8 (2020), e44.CrossRef Google Scholar

Petersson, H., Über eine Metrisierung der automorphen Formen und die Theorie der Poincaréschen Reihen, Math. Ann. 117 (1940), 453–537.CrossRef Google Scholar

Petersson, H., Einheitliche Begründung der Vollständigkeitssätze für die Poincaréschen Reihen von reeller Dimension bei beliebigen Grenzkreisgruppen von erster Art, Abh. Math. Semin. Hansischen Univ. 14 (1941), 22–60.Google Scholar

Ramacher, P. and Wakatsuki, S., Subconvex bounds for Hecke-Maass forms on compact arithmetic quotients of semisimple Lie groups, Math. Z. 298 (2021), 1383–1424.CrossRef Google Scholar

Saha, A., Hybrid sup-norm bounds for Maass newforms of powerful level, Algebra Number Theory 11 (2017), 1009–1045.CrossRef Google Scholar

Saha, A., On sup-norms of cusp forms of powerful level, J. Eur. Math. Soc. (JEMS) 19 (2017), 3549–3573.Google Scholar

Saha, A., Sup-norms of eigenfunctions in the level aspect for compact arithmetic surfaces, Math. Ann. 376 (2020), 609–644.CrossRef Google Scholar

Sarnak, P., Spectra of hyperbolic surfaces, Bull. Amer. Math. Soc. (N.S.) 40 (2003), 441–478.CrossRef Google Scholar

Selberg, A., On the zeros of Riemann's zeta-function, Skr. Norske Vid.-Akad. Oslo I 1942 (1942), 59.Google Scholar

Selberg, A., Harmonic analysis and discontinuous groups in weakly symmetric Riemannian spaces with applications to Dirichlet series, J. Indian Math. Soc. (N.S.) 20 (1956), 47–87.Google Scholar

Shimizu, H., Theta series and automorphic forms on

${\rm GL}_{2}$, J. Math. Soc. Japan 24 (1972), 638–683.CrossRef Google Scholar

Sogge, C. D., Concerning the

$L^p$ norm of spectral clusters for second-order elliptic operators on compact manifolds, J. Funct. Anal. 77 (1988), 123–138.CrossRef Google Scholar

Steiner, R. S., Uniform bounds on sup-norms of holomorphic forms of real weight, Int. J. Number Theory 12 (2016), 1163–1185.CrossRef Google Scholar

Steiner, R. S., Supnorm of modular forms of half-integral weight in the weight aspect, Acta Arith. 177 (2017), 201–218.CrossRef Google Scholar

Steiner, R. S., Sup-norm of Hecke–Laplace eigenforms on

$S^3$, Math. Ann. 377 (2020), 543–553.CrossRef Google Scholar

Steiner, R. S., Small diameters and generators for arithmetic lattices in

$\mathrm {SL}_2(\mathbb {R})$ and certain Ramanujan graphs, Ramanujan J. 62 (2023), 953–966.CrossRef Google Scholar

Stein, E. M. and Weiss, G., Introduction to Fourier analysis on Euclidean spaces. Princeton Mathematical Series, No. 32 (Princeton University Press, Princeton, NJ, 1971).Google Scholar

Sun, H. and Ye, Y., Double first moment for

$L(\frac 12,{\rm Sym}^2f\times g)$ by applying Petersson's formula twice, J. Number Theory 202 (2019), 141–159.CrossRef Google Scholar

Templier, N., On the sup-norm of Maass cusp forms of large level, Selecta Math. (N.S.) 16 (2010), 501–531.CrossRef Google Scholar

Templier, N., Hybrid sup-norm bounds for Hecke–Maass cusp forms, J. Eur. Math. Soc. (JEMS) 17 (2015), 2069–2082.CrossRef Google Scholar

VanderKam, J. M.,

$L^\infty$ norms and quantum ergodicity on the sphere, Int. Math. Res. Not. IMRN 1997 (1997), 329–347.CrossRef Google Scholar

Vignéras, M.-F., Séries thêta des formes quadratiques indéfinies, in Séminaire Delange-Pisot-Poitou, 17e année (1975/76), Théorie des nombres: Fasc. 1, Exp. No. 20 (1977), 3, http://www.numdam.org/article/SDPP_1975-1976__17_1_A18_0.pdf.Google Scholar

Voight, J., Quaternion algebras, Graduate Texts in Mathematics, vol. 288 (Springer, Cham, 2021).CrossRef Google Scholar

Waldspurger, J.-L., Sur les valeurs de certaines fonctions

$L$ automorphes en leur centre de symétrie, Compos. Math. 54 (1985), 173–242.Google Scholar

Watson, T. C., Rankin triple products and quantum chaos, PhD dissertation, Princeton University (2008), arXiv:0810.0425.Google Scholar

Weil, A., Sur certains groupes d'opérateurs unitaires, Acta Math. 111 (1964), 143–211.CrossRef Google Scholar

Xia, H., On

$L^\infty$ norms of holomorphic cusp forms, J. Number Theory 124 (2007), 325–327.CrossRef Google Scholar

Zagier, D., Appendix: The Eichler–Selberg trace formula on

$\mathrm {SL}_2(\mathbb {Z})$, in Introduction to modular forms. Grundlehren der mathematischen Wissenschaften, vol. 222 (Springer, Berlin–New York, 1976), 44–55.Google Scholar

Zagier, D., Modular forms whose Fourier coefficients involve zeta-functions of quadratic fields, in Modular functions of one variable, VI (Proc. Second Internat. Conf., Univ. Bonn, Bonn, 1976), Lecture Notes in Mathematics, vol. 627 (Springer, 1977), 105–169.Google Scholar

Article contents

Theta functions, fourth moments of eigenforms, and the sup-norm problem I

Abstract

Keywords

MSC classification

1. Introduction

2. General method

3. The Weil representation and theta series

3.1 Inner forms of SL2

3.2 Normalization of Haar measures

3.3 Local Weil representation

3.4 The Archimedean Weil representation

3.5 The non-Archimedean Weil representation

3.6 Local uniformity

3.7 Global Weil representation and theta series

4. Theta kernels for Eichler orders

4.1 Weil action on Eichler orders

4.2 Theta series for Eichler orders

4.3 The Weil $L^2$-norm of the theta kernel

5. The theta lift

5.1 Cuspidal theta series

5.2 Unfolding

5.3 The theta lift

5.3.1 Hecke operators

6. The Bergman kernel

6.1 The Bergman Archimedean test function

7. The spectral expansion

8. The geometric expansion

9. Second-moment count of quaternions by norm

9.1 Second-moment bound for the split matrix algebra

9.2 Second-moment bound for division algebras

10. Proof of the main theorem

10.1 Proof of the main theorem for the split matrix algebra

10.2 Proof of the main theorem for division algebras

Acknowledgements

Conflicts of interest

Financial support

Journal information

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests