1 Introduction
This paper deals with the study of the possible limits of periodic orbits in homogeneous spaces. Before explaining what we mean by this, we start by motivating this study. In many instances arithmetic properties of an object are captured by periodicity of a corresponding orbit in some dynamical system. A simple instance of this phenomenon is that $\unicode[STIX]{x1D6FC}\in \mathbb{R}$ is rational if and only if its decimal expansion is eventually periodic. In dynamical terms this is expressed by the fact that the orbit of $\unicode[STIX]{x1D6FC}$ modulo 1 on the torus $\mathbb{R}/\mathbb{Z}$ under multiplication by 10 (modulo 1) is eventually periodic. Furthermore, from knowing distributional information regarding the periodic orbit one can draw meaningful arithmetical conclusions. In the above example this means that if the orbit is very close to being evenly distributed on the circle then the frequency of appearance of, say, the digit 3 in the period of the decimal expansion is roughly $\frac{1}{10}$ . This naive scheme has far-reaching analogous manifestations capturing deep arithmetic concepts in dynamical terms. More elaborate instances are, for example, the following.
– Similarly to the above example regarding decimal expansion, periodic geodesics on the modular surface correspond to continued fraction expansions of quadratic numbers, and distributional properties of the former imply statistical information regarding the latter (see [Reference Aka and ShapiraAS18] where this was used).
– Representing an integral quadratic form by another is related to periodic orbits of orthogonal groups (see [Reference Ellenberg and VenkateshEV08]).
– Class groups of number fields correspond to adelic torus orbits (see [Reference Einsiedler, Lindenstrauss, Michel and VenkateshELMV11]).
– Values of rational quadratic forms are governed by the volume of periodic orbits of orthogonal groups (see [Reference Einsiedler, Margulis and VenkateshEMV09, Theorem 1.1])
– Asymptotic formulas for counting integer and rational points on varieties are encoded by distributional properties of periodic orbits (see [Reference Duke, Rudnick and SarnakDRS93, Reference Eskin and McMullenEM93, Reference Eskin, Mozes and ShahEMS96, Reference Gorodnik, Maucourant and OhGMO08], for example).
In all the above examples the orbits that are considered are of finite volume. Recently in [Reference Kelmer and KontorovichKK18, Reference Oh and ShahOS14] this barrier was crossed and particular instances of the above principle were used for infinite-volume orbits in order to obtain asymptotic estimates for counting integral points on some varieties and weighted second moments of $\operatorname{GL}(2)$ automorphic $L$ -functions.
At this point let us make our terminology more precise. Let $X$ be a locally compact second countable Hausdorff space and let $H$ be a unimodular topological group acting continuously on $X$ . We say that an orbit $Hx$ is periodic if it supports an $H$ -invariant locally finite Borel measure. In such a case the orbit is necessarily closed and this measure is unique up to scaling and is obtained by restricting the Haar measure of $H$ to a fundamental domain of $\operatorname{Stab}_{H}(x)$ in $H$ which is identified with the orbit via $h\mapsto hx$ . We say that such an orbit is of finite volume if the total mass of the orbit is finite. It is then customary to normalize the total mass of the orbit to 1. We remark that in some texts the term ‘periodic orbit’ is reserved for finite-volume ones, but we wish to extend the terminology as above. If $Hx$ is a periodic orbit we denote by $\unicode[STIX]{x1D707}_{Hx}$ a choice of such a measure, which in the finite-volume case is assumed to be normalized to a probability measure.
Given a sequence of periodic orbits $Hx_{i}$ , it makes sense to ask if they converge in some sense to a limiting object. When the orbits are of finite volume the common definition is that of weak* convergence; each orbit is represented by the probability measure $\unicode[STIX]{x1D707}_{Hx_{i}}$ and one equips the space of probability measures ${\mathcal{P}}(X)$ with the weak* topology coming from identifying ${\mathcal{P}}(X)$ as a subset of the unit sphere in the dual of the Banach space of continuous functions on $X$ vanishing at infinity $C_{0}(X)$ . The starting point of this paper is to challenge this and propose a slight modification which will allow us to bring into the picture periodic orbits of infinite volume. For that we will shortly concern ourselves with topologizing the space of equivalence classes $[\unicode[STIX]{x1D707}]$ of locally finite measures $\unicode[STIX]{x1D707}$ on $X$ .
This approach has several advantages over the classical weak* convergence approach. As said above, it allows us to discuss limiting distributions of infinite-volume orbits, but also it allows us to detect in some cases information which is invisible for the weak* topology. In the classical discussion, it is common that a sequence of periodic probability measures $\unicode[STIX]{x1D707}_{Hx_{i}}$ converges to the zero measure (a phenomenon known as full escape of mass). Nevertheless it sometimes happens that the orbits themselves do converge to a limiting object but this information was lost because the measures along the sequence were not scaled properly. This phenomenon happens, for example, in [Reference ShapiraSha17], which inspired us to define the notion of convergence to be defined below.
Although the results we will prove are rather specialized, we wish to present the framework in which our discussion takes place in some generality. Let $G$ be a Lie groupFootnote 1 and let $\unicode[STIX]{x1D6E4}<G$ be a lattice.
Question 1.1. Let $X=G/\unicode[STIX]{x1D6E4}$ and let $H_{i}x_{i}$ be a sequence of periodic orbits. Under what conditions do the following statements hold?
(i) The sequence $[\unicode[STIX]{x1D707}_{H_{i}x_{i}}]$ has a converging subsequence.
(ii) The accumulation points of $[\unicode[STIX]{x1D707}_{H_{i}x_{i}}]$ are themselves (homothety classes of) periodic measures.
2 Basic definitions and results
2.1 Topologies
We now make our discussion above more rigorous. Let $X$ be a locally compact second countable Hausdorff space and ${\mathcal{M}}(X)$ the space of locally finite measures on $X$ . We say that two locally finite measures $\unicode[STIX]{x1D707}$ and $\unicode[STIX]{x1D708}$ in ${\mathcal{M}}(X)$ are equivalent if there exists a constant $\unicode[STIX]{x1D706}>0$ such that $\unicode[STIX]{x1D707}=\unicode[STIX]{x1D706}\unicode[STIX]{x1D708}$ . This forms an equivalence relation and we denote the equivalence class of $\unicode[STIX]{x1D707}$ by $[\unicode[STIX]{x1D707}]$ . We denote by $\mathbb{P}{\mathcal{M}}(X)$ the set of all equivalence classes of nonzero locally finite measures on $X$ .
We topologize ${\mathcal{M}}(X)$ and $\mathbb{P}{\mathcal{M}}(X)$ as follows. Let $C_{c}(X)$ be the space of compactly supported continuous functions on $X$ . For any $\unicode[STIX]{x1D70C}\in C_{c}(X)$ , define
by sending $d\unicode[STIX]{x1D707}\in {\mathcal{M}}(X)$ to $\unicode[STIX]{x1D70C}d\unicode[STIX]{x1D707}\in C_{0}(X)^{\ast }$ . Here $C_{0}(X)$ is the space of continuous functions on $X$ vanishing at infinity equipped with the supremum norm, and $C_{0}(X)^{\ast }$ denotes its dual space. The weak* topology on $C_{0}(X)^{\ast }$ then induces a topology $\unicode[STIX]{x1D70F}_{\unicode[STIX]{x1D70C}}$ on ${\mathcal{M}}(X)$ via the map $i_{\unicode[STIX]{x1D70C}}$ . We will denote by $\unicode[STIX]{x1D70F}_{X}$ the topology on ${\mathcal{M}}(X)$ generated by $({\mathcal{M}}(X),\unicode[STIX]{x1D70F}_{\unicode[STIX]{x1D70C}})$ $(\unicode[STIX]{x1D70C}\in C_{c}(X))$ . Equivalently, $\unicode[STIX]{x1D70F}_{X}$ is the smallest topology on ${\mathcal{M}}(X)$ such that for any $f\in C_{c}(X)$ the map
is a continuous map from ${\mathcal{M}}(X)$ to $\mathbb{R}$ .
Definition 2.1. Let $\unicode[STIX]{x1D70B}_{P}$ be the natural projection map from ${\mathcal{M}}(X)\setminus \{0\}$ to $\mathbb{P}{\mathcal{M}}(X)$ . We define $\unicode[STIX]{x1D70F}_{P}$ to be the quotient topology on $\mathbb{P}{\mathcal{M}}(X)$ induced by $\unicode[STIX]{x1D70F}_{X}$ via $\unicode[STIX]{x1D70B}_{P}$ . In other words, $U$ is an open subset in $\mathbb{P}{\mathcal{M}}(X)$ if and only if $\unicode[STIX]{x1D70B}_{P}^{-1}(U)$ is open in ${\mathcal{M}}(X)\setminus \{0\}$ . In this way, we obtain a topological space $(\mathbb{P}{\mathcal{M}}(X),\unicode[STIX]{x1D70F}_{P})$ .
2.2 Main results
Let $G=\operatorname{SL}(n,\mathbb{R})$ , $\unicode[STIX]{x1D6E4}=\operatorname{SL}(n,\mathbb{Z})$ and $X=G/\unicode[STIX]{x1D6E4}$ . Denote by $m_{X}$ the unique $G$ -invariant probability measure on $X$ and by $\operatorname{Ad}$ the adjoint representation of $G$ . We write
for the connected component of the full diagonal group in $G$ , and
for the upper triangular unipotent group. Let $K=\operatorname{SO}(n,\mathbb{R})$ . In this paper, we address Question 1.1 in the space $X=\operatorname{SL}(n,\mathbb{R})/\operatorname{SL}(n,\mathbb{Z})$ with certain periodic orbits $H_{i}x_{i}$ , and prove the convergence of $[\unicode[STIX]{x1D707}_{H_{i}x_{i}}]$ with respect to the topology $(\unicode[STIX]{x1D70F}_{P},\mathbb{P}{\mathcal{M}}(X))$ . As a simple exercise, and to motivate such a statement, the reader can show that if $[\unicode[STIX]{x1D707}_{H_{i}x_{i}}]\rightarrow [m_{X}]$ , for example, then the orbits $H_{i}x_{i}$ become dense in $X$ . In many cases our results imply that indeed the limit homothety class is the class of the uniform measure $m_{X}$ .
Before stating our theorems, we need some notation. For a Lie subgroup $H<G$ , let $H^{0}$ denote the connected component of identity of $H$ , and $\operatorname{Lie}(H)$ its Lie algebra. Denote by $C_{G}(H)$ (respectively, $C_{G}(\operatorname{Lie}(H))$ ) the centralizer of $H$ (respectively, $\operatorname{Lie}(H)$ ) in $G$ . We write $\mathfrak{g}=\operatorname{Lie}(G)=\mathfrak{sl}(n,\mathbb{R})$ , and
for the exponential map from $\mathfrak{g}$ to $G$ . We also write $\Vert \cdot \Vert _{\mathfrak{g}}$ for the norm on $\mathfrak{g}$ induced by the Euclidean norm on the space of $n\,\times \,n$ matrices. For any $g\in G$ and any measure $\unicode[STIX]{x1D707}$ on $X$ , define the measure $g_{\ast }\unicode[STIX]{x1D707}$ by
An $A$ -orbit $Ax$ in $X$ is called divergent if the map $a\mapsto ax$ from $A$ to $X$ is proper.
Definition 2.2. Let $\{{g_{k}\}}_{k\in \mathbb{N}}$ be a sequence in $G$ . For any subgroup $S\subset A$ , we define
This is a subalgebra in $\operatorname{Lie}(S)$ .
Remark 2.3. By Definition 2.2, $\{\operatorname{Ad}(g_{k})Y\}_{k\in \mathbb{N}}$ is unbounded for any $Y\in \operatorname{Lie}(S)\setminus {\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}})$ . Then one can find a subsequence $\{{g_{i_{k}}\}}_{k\in \mathbb{N}}$ such that for any $Y\in \operatorname{Lie}(S)\setminus {\mathcal{A}}(S,\{{g_{i_{k}}\}}_{k\in \mathbb{N}})$ , the sequence $\{\operatorname{Ad}(g_{i_{k}})Y\}_{k\in \mathbb{N}}$ diverges to infinity.
Indeed, suppose that, for an element $Y\in \operatorname{Lie}(S)\setminus {\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}})$ , $\{\operatorname{Ad}(g_{k})Y\}_{k\in \mathbb{N}}$ does not diverge. Then there is a subsequence ${\{g_{k}^{\prime }\}}_{k\in \mathbb{N}}$ such that $\{\operatorname{Ad}(g_{k}^{\prime })Y\}_{k\in \mathbb{N}}$ is bounded. This implies that ${\mathcal{A}}(S,{\{g_{k}^{\prime }\}}_{k\in \mathbb{N}})$ contains the linear span of $Y$ and ${\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}})$ . Because of this, one can keep on enlarging the set ${\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}})$ by passing to subsequences of $\{{g_{k}\}}_{k\in \mathbb{N}}$ . But due to the finite dimension of $\operatorname{Lie}(S)$ , this process would stop at some point. Then one can get a subsequence $\{{g_{i_{k}}\}}_{k\in \mathbb{N}}$ such that, for any vector $Y\in \operatorname{Lie}(S)\setminus {\mathcal{A}}(S,\{{g_{i_{k}}\}}_{k\in \mathbb{N}})$ , the sequence $\operatorname{Ad}(g_{i_{k}})Y\rightarrow \infty$ .
The following theorem answers Question 1.1 for translates of a divergent diagonal orbit in $G/\unicode[STIX]{x1D6E4}$ . Moreover, it gives a description of all accumulation points.
Theorem 2.4. Let $Ax$ be a divergent orbit in $X$ . Then, for any $\{{g_{k}\}}_{k\in \mathbb{N}}$ in $G$ , the sequence $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}]$ has a subsequence converging to an equivalence class of a periodic measure on $X$ .
Furthermore, by passing to a subsequence, we assume that, for any $Y\in \operatorname{Lie}(A)\setminus {\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})$ , the sequence $\{\operatorname{Ad}(g_{k})Y\}_{k\in \mathbb{N}}$ diverges (see Remark 2.3). Then we have the following description of the limit points of the sequence $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}]$ . The subgroup $\exp ({\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}}))$ is the connected component of the center of the reductive group $C_{G}({\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}}))$ , and any limit point of the sequence $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}]$ is a translate of the equivalence class $[\unicode[STIX]{x1D707}_{C_{G}({\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}}))^{0}x}]$ . In particular, if ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ , then $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}]$ converges to the equivalence class of the Haar measure $m_{X}$ on $X$ .
In fact, we deduce Theorem 2.4 as a corollary of the following theorem.
Theorem 2.5. Let $Ax$ be a divergent orbit in $X$ . Suppose that $\{{g_{k}\}}_{k\in \mathbb{N}}$ is a sequence in $N$ with
such that for each pair $(i,j)$ $(1\leqslant i<j\leqslant n)$ ,
Then the sequence $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}]$ converges to the equivalence class $[\unicode[STIX]{x1D707}_{C_{G}({\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}}))^{0}x}]$ .
We will also deduce the following theorem from Theorems 2.4 and 2.5, which answers Question 1.1 for translates of an orbit of a connected reductive group $H$ containing $A$ . We will see by Lemma 10.2 that for such a reductive group $H$ , and for $x\in X$ with $Ax$ divergent, $Hx$ is a closed orbit.
Theorem 2.6. Let $Ax$ be a divergent orbit in $X$ and let $H$ be a connected reductive group containing $A$ . Then, for any $\{{g_{k}\}}_{k\in \mathbb{N}}$ in $G$ , the sequence $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Hx}]$ has a subsequence converging to an equivalence class of a periodic measure on $X$ .
Furthermore, let $S$ be the connected component of the center of $H$ , and assume that, for any $Y\in \operatorname{Lie}(S)\setminus {\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}})$ , the sequence $\{\operatorname{Ad}(g_{k})Y\}_{k\in \mathbb{N}}$ diverges. Then we have the following description of the limit points of $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Hx}]$ . The subgroup $\exp ({\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}}))$ is the connected component of the center of the reductive group $C_{G}({\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}}))$ , and any limit point of the sequence $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Hx}]$ is a translate of the equivalence class $[\unicode[STIX]{x1D707}_{C_{G}({\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}}))^{0}x}]$ . In particular, if ${\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ , then $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Hx}]$ converges to the equivalence class of the Haar measure $m_{X}$ on $X$ .
Remark 2.7. The proof of Theorem 2.4 also gives a criterion on the convergence of $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}]$ . A similar criterion on the convergence of $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Hx}]$ for a connected reductive group $H$ containing $A$ could be obtained from the proof of Theorem 2.6.
We give several examples to illustrate Theorems 2.4–2.6.
(1) Let $G=\operatorname{SL}(3,\mathbb{R})$ and $\unicode[STIX]{x1D6E4}=\operatorname{SL}(3,\mathbb{Z})$ . Pick the initial point $x=\mathbb{Z}^{n}\in X$ and the sequence $g_{k}=\left(\!\begin{smallmatrix}1 & k & k^{2}/2\\ 0 & 1 & k\\ 0 & 0 & 1\end{smallmatrix}\!\right)$ . In this case one can show that the subalgebra ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ , and $\operatorname{Ad}(g_{k})Y$ diverges for any nonzero $Y\in \operatorname{Lie}(A)$ . We also have $C_{G}({\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}}))=\operatorname{SL}(3,\mathbb{R})$ . Theorem 2.4 then says that $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}]$ converges to $[\unicode[STIX]{x1D707}_{\operatorname{SL}(3,\mathbb{R})x}]=[m_{X}]$ .
(2) Fix $G,\unicode[STIX]{x1D6E4},x$ and $g_{k}$ as in example (1). Let $H$ be the connected component of the reductive subgroup
$$\begin{eqnarray}\left(\begin{array}{@{}ccc@{}}\ast & \ast & 0\\ \ast & \ast & 0\\ 0 & 0 & \ast \end{array}\right)\cap \operatorname{SL}(3,\mathbb{R}).\end{eqnarray}$$Then the center $S$ of $H$ is equal to $\{\operatorname{diag}(a,a,a^{-2}):a\neq 0\}$ , and one could check that the subalgebra ${\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ , and $\operatorname{Ad}(g_{k})Y$ diverges for any nonzero $Y\in \operatorname{Lie}(S)$ . Also $C_{G}({\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}}))=\operatorname{SL}(3,\mathbb{R})$ . Then Theorem 2.6 implies that the sequence $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Hx}]$ converges to $[\unicode[STIX]{x1D707}_{\operatorname{SL}(3,\mathbb{R})x}]=[m_{X}]$ .(3) Let $G=\operatorname{SL}(4,\mathbb{R})$ and $\unicode[STIX]{x1D6E4}=\operatorname{SL}(4,\mathbb{Z})$ . Pick the initial point $x=\mathbb{Z}^{n}\in X$ and the sequence $g_{k}=\left(\!\begin{smallmatrix}1 & k & 0 & 0\\ 0 & 1 & 0 & 0\\ 0 & 0 & 1 & k\\ 0 & 0 & 0 & 1\end{smallmatrix}\!\right)$ . In this case one can show that ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})=\{\operatorname{diag}(t,t,-t,-t):t\in \mathbb{R}\}$ and
$$\begin{eqnarray}C_{G}({\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}}))=\left(\begin{array}{@{}cccc@{}}\ast & \ast & 0 & 0\\ \ast & \ast & 0 & 0\\ 0 & 0 & \ast & \ast \\ 0 & 0 & \ast & \ast \end{array}\right)\cap \operatorname{SL}(4,\mathbb{R}).\end{eqnarray}$$Theorem 2.5 then says that the sequence $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}]$ converges to $[\unicode[STIX]{x1D707}_{C_{G}({\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}}))^{0}x}].$(4) Fix $G,\unicode[STIX]{x1D6E4}$ and $x$ as in example (3), and pick the sequence $g_{k}=\left(\!\begin{smallmatrix}1 & k & k^{2}/2 & 0\\ 0 & 1 & k & 0\\ 0 & 0 & 1 & 0\\ 0 & 0 & 0 & 1\end{smallmatrix}\!\right)$ . Let $H$ be the connected component of the reductive subgroup
$$\begin{eqnarray}\left(\begin{array}{@{}cccc@{}}\ast & \ast & 0 & 0\\ \ast & \ast & 0 & 0\\ 0 & 0 & \ast & 0\\ 0 & 0 & 0 & \ast \end{array}\right)\cap \operatorname{SL}(4,\mathbb{R}).\end{eqnarray}$$Then the center $S$ of $H$ is equal to $\{\operatorname{diag}(a,a,b,c):a^{2}bc=1\}$ , and one could check that ${\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}})=\{\operatorname{diag}(s,s,s,-3s):s\in \mathbb{R}\}$ and$$\begin{eqnarray}C_{G}({\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}}))=\left(\begin{array}{@{}cccc@{}}\ast & \ast & \ast & 0\\ \ast & \ast & \ast & 0\\ \ast & \ast & \ast & 0\\ 0 & 0 & 0 & \ast \end{array}\right)\cap \operatorname{SL}(4,\mathbb{R}).\end{eqnarray}$$In this case, Theorem 2.6 says that any limit point of the sequence $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Hx}]$ is a translate $[\unicode[STIX]{x1D707}_{C_{G}({\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}}))^{0}x}]$ , and the proof of Theorem 2.6 would imply that $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Hx}]$ actually converges to $[\unicode[STIX]{x1D707}_{C_{G}({\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}}))^{0}x}]$ .
By comparing examples (1) and (3) (respectively, (2) and (4)), one can see that the subalgebra ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})$ (respectively, ${\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}})$ ) plays an important role in indicating what kinds of limit points the sequence $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}]$ (respectively, $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Hx}]$ ) could converge to. In example (1), we have ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ . By pushing $Ax$ with $g_{k}$ , the orbit $g_{k}Ax$ starts snaking in the space $\operatorname{SL}(3,\mathbb{R})/\operatorname{SL}(3,\mathbb{Z})$ , and eventually fills up the entire space. In example (3), ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})$ is a one-dimensional subalgebra in $\operatorname{Lie}(A)$ which commutes with $g_{k}$ , and it corresponds to the part of the orbit $Ax$ which stays still and is not affected when we push $\unicode[STIX]{x1D707}_{Ax}$ by $g_{k}$ . This would result in the limit orbit having this part as the ‘central direction’, and the ‘orthogonal’ part in $Ax$ would be pushed by $g_{k}$ and fill up the sub-homogeneous space $\left(\!\begin{smallmatrix}\operatorname{SL}(2,\mathbb{R}) & 0\\ 0 & \operatorname{SL}(2,\mathbb{R})\end{smallmatrix}\!\right)x$ in $\operatorname{SL}(4,\mathbb{R})/\operatorname{SL}(4,\mathbb{Z})$ .
By the characterization of convergence given in Proposition 3.3, Theorems 2.4 and 2.6 can be restated in the following form.
Theorem 2.8. Let $Ax$ be a divergent orbit and $\{{g_{k}\}}_{k\in \mathbb{N}}$ be a sequence in $G$ such that $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}]$ converges to an equivalence class of a locally finite periodic measure $[\unicode[STIX]{x1D708}]$ as in Theorem 2.4. Then there exists a sequence $\unicode[STIX]{x1D706}_{k}>0$ such that
with respect to the topology $\unicode[STIX]{x1D70F}_{X}$ . In particular, for any $F_{1},F_{2}\in C_{c}(X)$ , we have
whenever $\int F_{1}\,d\unicode[STIX]{x1D708}\neq 0$ . The same results hold if $A$ is replaced by any connected reductive group $H$ containing $A$ .
Remark 2.9. From the proofs of Theorems 2.4 and 2.5, we will see that in the case ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ , the numbers $\unicode[STIX]{x1D706}_{k}$ in Theorem 2.8 are related to the volumes of convex polytopes of a special type in $\operatorname{Lie}(A)$ (see Definition 4.1 and Corollary 10.1). We remark here that in view of Theorem 2.8, the $\unicode[STIX]{x1D706}_{k}$ in this case can also be calculated by a function $F_{1}\in C_{c}(X)$ with its support being a large compact subset. This makes Theorem 2.8 practical in other problems.
2.3 Applications
As an application of our results, we give one example of a counting problem. More details about this counting problem can be found in [Reference Duke, Rudnick and SarnakDRS93, Reference Eskin and McMullenEM93, Reference Eskin, Mozes and ShahEMS96, Reference ShahSha00].
Let $M(n,\mathbb{R})$ be the space of $n\,\times \,n$ matrices with the norm
for $M=(x_{ij})_{1\leqslant i,j\leqslant n}\in M(n,\mathbb{R})$ . Denote by $B_{T}$ the ball of radius $T$ centered at 0 in $M(n,\mathbb{R})$ . Fix a monic polynomial $p_{0}(\unicode[STIX]{x1D706})$ in $\mathbb{Z}[\unicode[STIX]{x1D706}]$ which splits completely over $\mathbb{Q}$ . By Gauss’s lemma, the roots $\unicode[STIX]{x1D6FC}_{i}$ of $p(\unicode[STIX]{x1D706})$ are integers. We assume that the $\unicode[STIX]{x1D6FC}_{i}$ are distinct and nonzero. Let
For $M\in M(n,\mathbb{R})$ , denote by $p_{M}(\unicode[STIX]{x1D706})$ the characteristic polynomial of $M$ . We define by
the variety of matrices $M$ with characteristic polynomial $p_{M}(\unicode[STIX]{x1D706})$ equal to $p_{0}(\unicode[STIX]{x1D706})$ , and by
the integer points in the variety $V(\mathbb{R})$ .
The metric $\Vert \cdot \Vert _{\mathfrak{g}}$ on $\mathfrak{g}=\mathfrak{sl}_{n}(\mathbb{R})$ induces Haar measures on $A$ and $N$ . The $K$ -invariant probability measure on $K$ and the Haar measures on $A$ , $N$ then give a Haar measure on $G$ via the Iwasawa decomposition $G=KNA$ . We will denote by $c_{X}$ the volume of $X=G/\unicode[STIX]{x1D6E4}$ with respect to the Haar measure on $G$ .
There is a natural volume form on the variety $V(\mathbb{R})$ inherited from $G=\operatorname{SL}(n,\mathbb{R})$ . Specifically, the orbit map
defined by $g\mapsto \operatorname{Ad}(g)M_{\unicode[STIX]{x1D6FC}}$ gives an isomorphism between the quotient space $G/C_{G}(A)$ and the variety $V(\mathbb{R})$ , and the volume form is defined to be the $G$ -invariant measure on $G/C_{G}(A)$ . The existence of such a measure is well known, and the proof of it can be found, for example, in [Reference RaghunathanRag72]. With this volume form, one can compute (see Proposition 11.7) that for any $T$ , the volume of $V(\mathbb{R})\cap B_{T}$ equals $cT^{n(n-1)/2}$ for some constant $c>0$ . The following theorem concerns the asymptotic formula for the number of integer points in $V(\mathbb{Z})\cap B_{T}$ . We will see that the set $V(\mathbb{Z})\cap B_{T}$ behaves differently from $V(\mathbb{R})\cap B_{T}$ , with an extra log term.
By a well-known theorem of Borel and Harish-Chandra [Reference BorelBHC62], the subset $V(\mathbb{Z})$ is a finite disjoint union of $\operatorname{Ad}(\unicode[STIX]{x1D6E4})$ -orbits. One can write this disjoint union as
for some $h_{0}\in \mathbb{N}$ and $M_{i}\in V(\mathbb{Z})$ ( $1\leqslant i\leqslant h_{0}$ ). Note that for each $M_{i}$ , the stabilizer $\unicode[STIX]{x1D6E4}_{M_{i}}$ of $M_{i}$ is finite. Also the number of the orbits $h_{0}$ is equal to the number of equivalence classes of nonsingular ideals in the subring in $M(n,\mathbb{R})$ generated by $M_{\unicode[STIX]{x1D6FC}}$ , for which readers may refer to [Reference BorelBHC62, Reference Latimer and MacDuffeeLM33]. In the following theorem, for ease of notation, we write $\mathbf{t}$ for a vector $(t_{1},t_{2},\ldots ,t_{n})\in \mathbb{R}^{n}$ .
Theorem 2.10. We have
where $\operatorname{Vol}(B_{1})$ is the volume of the unit ball in $\mathbb{R}^{n(n-1)/2}$ and $c_{0}$ is the volume of the $(n-1)$ -convex polytope
with respect to the natural measure induced by the Lebesgue measure on $\mathbb{R}^{n}$ .
In the sequel, we will mainly focus on Theorem 2.5 as all the other theorems will be corollaries of it. In the course of the proof of Theorem 2.5, the case ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ plays an important role, and other cases can be proved by induction. Therefore, most of our arguments in this paper would work for the case ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ . We remark that our proof is inspired by [Reference Oh and ShahOS14], where Oh and Shah deal with the case $G=\operatorname{SL}(2,\mathbb{R})$ by applying exponential mixing and obtain an error estimate. This effective result was recently improved by Kelmer and Kontorovich [Reference Kelmer and KontorovichKK18].
When we showed an earlier draft of this paper to Shah, he pointed out to us that similar results to those appearing in this paper were established by him at the beginning of this century, but were never published.
The paper is organized as follows.
– We start our work in § 3 by studying the topology $\unicode[STIX]{x1D70F}_{P}$ on $\mathbb{P}{\mathcal{M}}(X)$ for a locally compact second countable Hausdorff space $X$ . In particular, a characterization of convergence in $\mathbb{P}{\mathcal{M}}(X)$ is given, and Theorem 2.8 is obtained as a natural corollary, if Theorems 2.4 and 2.6 are presumed.
– In § 4, a special type of convex polytope in $\operatorname{Lie}(A)$ is introduced. Such convex polytopes are related to nondivergence of the orbits $g_{k}Ax$ . In order to analyze these convex polytopes in the setting of Theorem 2.5, we define graphs associated to them and prove some auxiliary results concerning the graphs in § 5. With the assumption ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ , these auxiliary results imply some properties of the convex polytopes, which are proved in § 6.
– Keeping the assumption ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ in § 7, we prove a statement on the nondivergence of the sequence of $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}]$ and show that $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}]$ converges to $[\unicode[STIX]{x1D708}]$ for some probability measure $\unicode[STIX]{x1D708}$ invariant under a unipotent subgroup. Then we translate § 7 in terms of adjoint representation in § 8. The linearization technique and the measure classification theorem for unipotent actions on homogeneous spaces are discussed in § 9, enabling us to study the measure rigidity in our setting.
– We complete the proof of Theorem 2.5 in § 10. Then we prove Theorems 2.4 and 2.6. We give the proof of Theorem 2.10 in § 11.
3 Topology on $\mathbb{P}{\mathcal{M}}(X)$
In this section, we study the topology $\unicode[STIX]{x1D70F}_{P}$ on $\mathbb{P}{\mathcal{M}}(X)$ for any locally compact second countable Hausdorff space $X$ . We will give a description of the convergence of a sequence $[\unicode[STIX]{x1D707}_{k}]$ in $\mathbb{P}{\mathcal{M}}(X)$ (Proposition 3.3). This will help us study the convergence of the sequence $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}]$ in Theorems 2.4 and 2.5 (respectively, $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Hx}]$ in Theorem 2.6).
Before proving Proposition 3.3, we need some preparatory work.
Proposition 3.1. The topology $(\unicode[STIX]{x1D70F}_{P},\mathbb{P}{\mathcal{M}}(X))$ is Hausdorff. In particular, any convergent sequence in $\mathbb{P}{\mathcal{M}}(X)$ has a unique limit.
Proof. Let $[\unicode[STIX]{x1D707}]$ and $[\unicode[STIX]{x1D708}]$ be two distinct elements in $\mathbb{P}{\mathcal{M}}(X)$ . We choose $f\in C_{c}(X)$ and representatives $\unicode[STIX]{x1D707}$ and $\unicode[STIX]{x1D708}$ such that
Since $[\unicode[STIX]{x1D707}]\neq [\unicode[STIX]{x1D708}]$ , there exists a nonnegative function $g\in C_{c}(X)$ such that
We define neighborhoods of $\unicode[STIX]{x1D707}$ and $\unicode[STIX]{x1D708}$ in ${\mathcal{M}}(X)$ by
Since $\unicode[STIX]{x1D70B}_{P}:{\mathcal{M}}(X)\setminus \{0\}\rightarrow \mathbb{P}{\mathcal{M}}(X)$ is an open map, $\unicode[STIX]{x1D70B}_{P}(V(\unicode[STIX]{x1D707};f,g,\unicode[STIX]{x1D716}))$ and $\unicode[STIX]{x1D70B}_{P}(V(\unicode[STIX]{x1D708};f,g,\unicode[STIX]{x1D716}))$ are open neighborhoods of $[\unicode[STIX]{x1D707}]$ and $[\unicode[STIX]{x1D708}]$ in $\mathbb{P}{\mathcal{M}}(X)$ for any $\unicode[STIX]{x1D716}>0$ . Let $\unicode[STIX]{x1D705}=\int g\,d\unicode[STIX]{x1D707}$ . We prove that, for any $\unicode[STIX]{x1D716}<\min \{0.1,|\unicode[STIX]{x1D705}-1|/5\}$ ,
Suppose, to the contrary, that $[\unicode[STIX]{x1D706}]\in \unicode[STIX]{x1D70B}_{P}(V(\unicode[STIX]{x1D707};f,g,\unicode[STIX]{x1D716}))\cap \unicode[STIX]{x1D70B}_{P}(V(\unicode[STIX]{x1D708};f,g,\unicode[STIX]{x1D716}))$ . Then there exist constants $\unicode[STIX]{x1D6FC},\unicode[STIX]{x1D6FD}>0$ such that
This implies that
and
This is a contradiction, for $\unicode[STIX]{x1D716}<\min \{0.1,|\unicode[STIX]{x1D705}-1|/5\}$ .◻
Proposition 3.2. A sequence $[\unicode[STIX]{x1D707}_{k}]$ in $\mathbb{P}{\mathcal{M}}(X)$ converges to $[\unicode[STIX]{x1D708}]$ if and only if, for each $k\in \mathbb{N}$ , there exists a representative $\unicode[STIX]{x1D707}_{k}^{\prime }$ in $[\unicode[STIX]{x1D707}_{k}]$ and for $[\unicode[STIX]{x1D708}]$ a representative $\unicode[STIX]{x1D708}^{\prime }\in [\unicode[STIX]{x1D708}]$ such that $\unicode[STIX]{x1D707}_{k}^{\prime }$ converges to $\unicode[STIX]{x1D708}^{\prime }$ in ${\mathcal{M}}(X)$ .
Proof. Let $[\unicode[STIX]{x1D707}_{k}]$ be a sequence in $\mathbb{P}{\mathcal{M}}(X)$ converging to $[\unicode[STIX]{x1D708}]$ . We choose $f\in C_{c}(X)$ and representatives $\unicode[STIX]{x1D707}_{k}^{\prime }$ and $\unicode[STIX]{x1D708}^{\prime }$ of $[\unicode[STIX]{x1D707}_{k}]$ and $[\unicode[STIX]{x1D708}]$ such that
Suppose that $\unicode[STIX]{x1D707}_{k}^{\prime }\not \rightarrow \unicode[STIX]{x1D708}^{\prime }$ in ${\mathcal{M}}(X)$ . Then there exists a nonnegative function $g\in C_{c}(X)$ such that, after passing to a subsequence,
for some $\unicode[STIX]{x1D6FF}>0$ . Then by the same argument as in Proposition 3.1, we can find a neighborhood $\unicode[STIX]{x1D70B}_{P}(V(\unicode[STIX]{x1D708};f,g,\unicode[STIX]{x1D716}))$ of $[\unicode[STIX]{x1D708}]$ in $\mathbb{P}{\mathcal{M}}(X)$ for some $\unicode[STIX]{x1D716}<\min \{0.1,\unicode[STIX]{x1D6FF}/5\}$ such that
which contradicts the condition $[\unicode[STIX]{x1D707}_{k}]\rightarrow [\unicode[STIX]{x1D708}]$ . The other direction follows from Definition 2.1.◻
We now prove the following important proposition, which provides a characterization of the convergence of a sequence $[\unicode[STIX]{x1D707}_{k}]$ in $\mathbb{P}{\mathcal{M}}(X)$ . This will help us study the convergence of equivalence classes of locally finite measures on $\operatorname{SL}(n,\mathbb{R})/\operatorname{SL}(n,\mathbb{Z})$ in the rest of the paper.
Proposition 3.3.
(i) Let $\{{\unicode[STIX]{x1D707}_{k}\}}_{k\in \mathbb{N}}$ be a sequence in ${\mathcal{M}}(X)$ . Then $[\unicode[STIX]{x1D707}_{k}]$ converges to $[\unicode[STIX]{x1D708}]$ in $\mathbb{P}{\mathcal{M}}(X)$ if and only if there exists a sequence $\{\unicode[STIX]{x1D706}_{k}\}$ of positive numbers such that $\unicode[STIX]{x1D706}_{k}\unicode[STIX]{x1D707}_{k}$ converges to $\unicode[STIX]{x1D708}$ in ${\mathcal{M}}(X)$ . If there exists another sequence $\{\unicode[STIX]{x1D706}_{k}^{\prime }\}$ with $\unicode[STIX]{x1D706}_{k}^{\prime }\unicode[STIX]{x1D707}_{k}\rightarrow \unicode[STIX]{x1D708}^{\prime }\neq 0$ in ${\mathcal{M}}(X)$ , then
$$\begin{eqnarray}[\unicode[STIX]{x1D708}^{\prime }]=[\unicode[STIX]{x1D708}]\end{eqnarray}$$and $\lim _{k}\unicode[STIX]{x1D706}_{k}^{\prime }/\unicode[STIX]{x1D706}_{k}$ exists.(ii) The sequence $[\unicode[STIX]{x1D707}_{k}]$ converges to $[\unicode[STIX]{x1D708}]$ if and only if, for any $f,g\in C_{c}(X)$ with $\int g\,d\unicode[STIX]{x1D708}\neq 0$ , we have $\int g\,d\unicode[STIX]{x1D707}_{k}\neq 0$ for sufficiently large $k$ and
$$\begin{eqnarray}\frac{\int f\,d\unicode[STIX]{x1D707}_{k}}{\int g\,d\unicode[STIX]{x1D707}_{k}}\rightarrow \frac{\int f\,d\unicode[STIX]{x1D708}}{\int g\,d\unicode[STIX]{x1D708}}.\end{eqnarray}$$
Proof. The first statement follows from Propositions 3.1 and 3.2. For $\lim _{k}\unicode[STIX]{x1D706}_{k}^{\prime }/\unicode[STIX]{x1D706}_{k}$ , we choose $f\in C_{c}(X)$ with $\int f\,d\unicode[STIX]{x1D708}\neq 0$ , and we have
For the second statement, if $[\unicode[STIX]{x1D707}_{k}]\rightarrow [\unicode[STIX]{x1D708}]$ , then there exists a sequence $\unicode[STIX]{x1D706}_{k}>0$ such that $\unicode[STIX]{x1D706}_{k}\unicode[STIX]{x1D707}_{k}\rightarrow \unicode[STIX]{x1D708}\neq 0$ . For any $f,g\in C_{c}(X)$ with $\int g\,d\unicode[STIX]{x1D708}\neq 0$ , we have
for sufficiently large $k$ and
Conversely, let $g\in C_{c}(X)$ with $\int g\,d\unicode[STIX]{x1D708}\neq 0$ and
Then we have $\unicode[STIX]{x1D706}_{k}\unicode[STIX]{x1D707}_{k}\rightarrow \unicode[STIX]{x1D708}$ and $[\unicode[STIX]{x1D707}_{k}]\rightarrow [\unicode[STIX]{x1D708}]$ .◻
From the discussions in this section, we know that to prove Theorem 2.5 one needs to find a sequence of $\unicode[STIX]{x1D706}_{k}>0$ such that $\unicode[STIX]{x1D706}_{k}(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}$ converges to a locally finite measure $\unicode[STIX]{x1D708}$ , and then prove that $\unicode[STIX]{x1D708}$ is a periodic measure. From §§ 4 to 6, we will construct the sequence $\unicode[STIX]{x1D706}_{k}$ in an explicit way. In the rest of the paper, $X$ will denote the homogeneous space $G/\unicode[STIX]{x1D6E4}$ .
4 Convex polytopes
In this section we will construct a special type of convex polytope in $\operatorname{Lie}(A)$ . These convex polytopes will play an important role in the rest of the paper.
By [Reference Tomanov and WeissTW03, Theorem 1.4], $Ax$ is divergent in $X=G/\unicode[STIX]{x1D6E4}$ if and only if $x\in A\cdot \operatorname{SL}(n,\mathbb{Q})\unicode[STIX]{x1D6E4}$ . Note that, for any $q\in \operatorname{SL}(n,\mathbb{Q})$ , the lattice $q\unicode[STIX]{x1D6E4}q^{-1}$ is commensurable with $\unicode[STIX]{x1D6E4}$ , and all results in this paper will hold if $\unicode[STIX]{x1D6E4}$ is replaced by $q\unicode[STIX]{x1D6E4}q^{-1}$ . Therefore, without loss of generality, we may assume that the initial point $x=x_{e}=e\operatorname{SL}(n,\mathbb{Z})$ , where $e$ is the identity matrix in $G$ . We will denote by $m_{\operatorname{Lie}(A)}$ the natural measure on $\operatorname{Lie}(A)\subset \mathfrak{sl}(n,\mathbb{R})$ induced by the Lebesgue measure on the space of $n\,\times \,n$ matrices.
For ease of notation, we will write $\mathbf{t}$ for a vector $(t_{1},t_{2},\ldots ,t_{n})$ in an $n$ -dimensional space, and denote by $[n]$ the index set $\{1,2,\ldots ,n\}$ . We write ${\mathcal{I}}_{n}$ for the collection of all multi-index subsets of $[n]$ , and ${\mathcal{I}}_{n}^{l}$ for the collection of the index subsets of cardinality $l$ in ${\mathcal{I}}_{n}$ . Let $\{e_{1},e_{2},\ldots ,e_{n}\}$ be the standard basis of $\mathbb{R}^{n}$ . For any index subset $I=\{i_{1}<i_{2}\cdots <i_{l}\}\in {\mathcal{I}}_{n}$ , we denote by
the wedge product of the vectors $e_{i_{1}},\ldots ,e_{i_{l}}$ . We write $\unicode[STIX]{x1D714}_{I}(\mathbf{t})$ ( $\mathbf{t}=(t_{1},t_{2},\ldots ,t_{n})\in \mathbb{R}^{n}$ ) for the linear functional $\sum _{i\in I}t_{i}$ on $\mathbb{R}^{n}$ .
Let $g\in \operatorname{SL}(n,\mathbb{R})$ and $\unicode[STIX]{x1D6FF}>0$ . We define a region $\unicode[STIX]{x1D6FA}_{g,\unicode[STIX]{x1D6FF}}$ in $\operatorname{Lie}(A)$ as follows. Let $\mathbf{t}=(t_{1},t_{2},\ldots ,t_{n})\in \operatorname{Lie}(A)$ . For each $e_{i}\in \mathbb{R}^{n}$ , the vector
if and only if
Here $B_{\unicode[STIX]{x1D6FF}}$ denotes the ball of radius $\unicode[STIX]{x1D6FF}>0$ around 0 in $\mathbb{R}^{n}$ with the standard Euclidean norm $\Vert \cdot \Vert$ . We also consider the wedge product $e_{I}$ for any nonempty subset $I\in {\mathcal{I}}_{n}^{l}$ $(1\leqslant l\leqslant n)$ , and
if and only if
Here, in an abuse of notation, $\Vert \cdot \Vert$ is the norm on $\wedge ^{l}\mathbb{R}^{n}$ induced by the Euclidean norm on $\mathbb{R}^{n}$ , and $B_{\unicode[STIX]{x1D6FF}}$ is the ball of radius $\unicode[STIX]{x1D6FF}>0$ around 0 in $\wedge ^{l}\mathbb{R}^{n}$ . This leads to the following definition.
Definition 4.1. For any $g\in G$ and $\unicode[STIX]{x1D6FF}>0$ , we define
Remark 4.2. By the construction above, for any $\mathbf{t}\in \operatorname{Lie}(A)\setminus \unicode[STIX]{x1D6FA}_{g,\unicode[STIX]{x1D6FF}}$ , the lattice $g\exp (\mathbf{t})\mathbb{Z}^{n}$ has a short nonzero vector with length depending on $\unicode[STIX]{x1D6FF}>0$ . Hence, by Mahler’s compactness criterion, the point $g\exp (\mathbf{t})\unicode[STIX]{x1D6E4}\in gA\unicode[STIX]{x1D6E4}$ is close to infinity. Due to this reason, we will mainly study the part $\{g\exp (\mathbf{t})\unicode[STIX]{x1D6E4}:\mathbf{t}\in \unicode[STIX]{x1D6FA}_{g,\unicode[STIX]{x1D6FF}}\}$ of the orbit $gA\unicode[STIX]{x1D6E4}$ .
Lemma 4.3. The region $\unicode[STIX]{x1D6FA}_{g,\unicode[STIX]{x1D6FF}}$ is a bounded convex polytope in $\operatorname{Lie}(A)$ for any $g\in G$ and $\unicode[STIX]{x1D6FF}>0$ .
Proof. Since the region $\unicode[STIX]{x1D6FA}_{g,\unicode[STIX]{x1D6FF}}$ is defined by various linear functionals on $\operatorname{Lie}(A)$ , $\unicode[STIX]{x1D6FA}_{g,\unicode[STIX]{x1D6FF}}$ is a convex polytope. Now by definition, $\unicode[STIX]{x1D6FA}_{g,\unicode[STIX]{x1D6FF}}$ is contained in the region
which is bounded. The boundedness of $\unicode[STIX]{x1D6FA}_{g,\unicode[STIX]{x1D6FF}}$ then follows.◻
In § 6 we will closely study the convex polytope $\unicode[STIX]{x1D6FA}_{g,\unicode[STIX]{x1D6FF}}$ . We list here some properties of convex polytopes which will be used later. For a bounded convex subset $\unicode[STIX]{x1D6FA}$ in a Euclidean space $E$ , we denote by $\operatorname{Vol}(\unicode[STIX]{x1D6FA})$ the volume of $\unicode[STIX]{x1D6FA}$ with respect to the Lebesgue measure on $E$ , and by $\operatorname{Area}(\unicode[STIX]{x2202}\unicode[STIX]{x1D6FA})$ the surface area of the boundary $\unicode[STIX]{x2202}\unicode[STIX]{x1D6FA}$ of $\unicode[STIX]{x1D6FA}$ induced by the Lebesgue measure.
The following lemma is well known. We learnt it from Roy Meshulam.
Lemma 4.4. Let $\unicode[STIX]{x1D6FA}$ be a bounded convex subset in $\mathbb{R}^{d}$ . Suppose that $\unicode[STIX]{x1D6FA}$ contains a ball of radius $r>0$ . Then we have
Proof. Let $B_{r}(0)$ denote the ball of radius $r$ centered at $0$ in $\mathbb{R}^{d}$ and we may assume, without loss of generality, that $B_{r}(0)\subset \unicode[STIX]{x1D6FA}$ . We have
This completes the proof of the lemma. ◻
Lemma 4.5. Let $R\subset \unicode[STIX]{x1D6FA}$ be two bounded $d$ -dimensional convex polytopes in $\mathbb{R}^{d}$ . Suppose that $\unicode[STIX]{x1D6FA}$ contains a ball of radius $r>0$ and
for some constant $c>0$ . Then $R$ contains a ball of radius $rc/d$ .
Proof. Let $\unicode[STIX]{x1D70C}$ be the largest number such that $R$ contains a ball of radius $\unicode[STIX]{x1D70C}$ . It suffices to show that $\unicode[STIX]{x1D70C}\geqslant rc/d$ . Let $\{f_{i}\}$ be the collection of the facets of $R$ , and denote by $P_{i}$ the hyperplane determined by $f_{i}$ . First, we prove two claims.
Claim 1. Let $p$ be a point in $R$ , and let $f_{i_{0}}$ be a facet of $R$ such that the hyperplane $P_{i_{0}}$ is closest to $p$ among all the hyperplanes $P_{i}$ . Then the orthogonal projection of $p$ in $P_{i_{0}}$ is in the facet $f_{i_{0}}$ .
Proof of Claim 1.
Let $p_{i_{0}}$ be the orthogonal projection of $p$ in $P_{i_{0}}$ , and denote by $\overline{p_{i_{0}}p}$ the line segment connecting $p$ and $p_{i_{0}}$ . Suppose that $p_{i_{0}}$ is outside the facet $f_{i_{0}}$ . Then $\overline{p_{i_{0}}p}$ intersects another facet of $R$ , say, $f_{j_{0}}$ . This implies that the distance between $p$ and the hyperplane $P_{j_{0}}$ is smaller than the length of $\overline{p_{i_{0}}p}$ , which contradicts the choice of $P_{i_{0}}$ .◻
Claim 2. $\operatorname{Vol}(R)\leqslant \unicode[STIX]{x1D70C}\operatorname{Area}(\unicode[STIX]{x2202}R).$
Proof of Claim 2.
For each facet $f_{i}$ of $R$ , let $B_{i}$ be the unique cylinder with the following properties.
(i) The base of $B_{i}$ is $f_{i}$ , and the height of $B_{i}$ is equal to $\unicode[STIX]{x1D70C}$ .
(ii) $B_{i}$ and $R$ lie in the same half-space determined by $P_{i}$ .
The maximality of $\unicode[STIX]{x1D70C}$ then implies
otherwise, by Claim 1, one would find a point $x\in R\setminus \bigcup _{i}B_{i}$ such that, for each $f_{i}$ , the distance between $x$ and $f_{i}$ is strictly larger than $\unicode[STIX]{x1D70C}$ . Now we have
and Claim 2 follows. ◻
Now we can finish the proof of the lemma. By Claim 2 and Lemma 4.4, we have
Here we use the fact that $\operatorname{Area}(\unicode[STIX]{x2202}R)\leqslant \operatorname{Area}(\unicode[STIX]{x2202}\unicode[STIX]{x1D6FA})$ for any two convex polytopes $R\subset \unicode[STIX]{x1D6FA}$ .◻
For a bounded convex polytope $\unicode[STIX]{x1D6FA}$ in $\mathbb{R}^{d}$ and $\unicode[STIX]{x1D716}>0$ , its $\unicode[STIX]{x1D716}$ -neighborhood is defined by
Here $\Vert \cdot \Vert$ is the Euclidean norm on $\mathbb{R}^{d}$ .
Lemma 4.6. Let $\unicode[STIX]{x1D6FA}$ be a bounded convex subset in $\mathbb{R}^{d}$ which contains a ball of radius $r>0$ . Let $\unicode[STIX]{x1D6FA}_{\unicode[STIX]{x1D716}}$ be the $\unicode[STIX]{x1D716}$ -neighborhood of $\unicode[STIX]{x1D6FA}$ for $\unicode[STIX]{x1D716}>0$ . Then we have
Proof. The proof is similar to Lemma 4.4. Assume that $\unicode[STIX]{x1D6FA}$ contains the ball $B_{r}(0)$ of radius $r$ around $0$ . We have
This completes the proof of the lemma. ◻
5 Auxiliary results in graph theory
In this section we will study a special class of graphs and prove some properties of these graphs (Proposition 5.5 and Lemma 5.8), which will be crucial in our study of convex polytopes in § 6. From now until § 10, we assume that $\{{g_{k}\}}_{k\in \mathbb{N}}$ satisfies the condition in Theorem 2.5; that is, $\{{g_{k}\}}_{k\in \mathbb{N}}$ is a sequence in $N$ with
such that, for each $(i,j)$ $(1\leqslant i<j\leqslant n)$ , either $u_{ij}(k)=0\text{ for any }k$ , or $u_{ij}(k)\neq 0$ and diverges to infinity as $k\rightarrow \infty$ .
In order to prove Proposition 5.5, we will need some lemmas involving complex calculations which will guarantee the validity of the proof of Proposition 5.5. Here we introduce the following notation. For any $g\in \operatorname{SL}(n,\mathbb{R})$ and any $1\leqslant l\leqslant n$ , denote by $(g)_{l\,\times \,l}$ the $l\,\times \,l$ submatrix in the upper left corner of $g$ . Note that if $g,h\in \operatorname{SL}(n,\mathbb{R})$ are upper triangular, then $(gh)_{l\,\times \,l}=(g)_{l\,\times \,l}(h)_{l\,\times \,l}$ .
Lemma 5.1. For any $a\in A$ and any $1\leqslant l\leqslant n$ , we have either $(g_{k})_{l\,\times \,l}=(a^{-1}g_{k}a)_{l\,\times \,l}$ for all $k$ or $(g_{k})_{l\,\times \,l}\neq (a^{-1}g_{k}a)_{l\,\times \,l}$ for all $k$ .
Proof. Write $a=(a_{1},a_{2},\ldots ,a_{n})\in A$ . By definition, we have
and
The equation $(g_{k})_{l\,\times \,l}=(a^{-1}g_{k}a)_{l\,\times \,l}$ then yields
Now the lemma follows from the dichotomy assumption on the entries of $g_{k}$ ( $k\in \mathbb{N}$ ).◻
Lemma 5.2. Let $a\in A$ . Suppose that the sequence ${\{g_{k}ag_{k}^{-1}\}}_{k\in \mathbb{N}}$ is bounded in $\operatorname{SL}(n,\mathbb{R})$ . Then $g_{k}$ commutes with $a$ for any $k$ .
Proof. Suppose not. Then by Lemma 5.1 with $l=n$ we have
In this case, we would like to find a contradiction.
Let $l_{0}$ be the minimum of the integers $0\leqslant l\leqslant n-1$ with the property
for any $k$ . By Lemma 5.1, $l_{0}$ is also the maximum of $0\leqslant l\leqslant n-1$ such that $(g_{k})_{l\,\times \,l}$ commutes with $(a)_{l\,\times \,l}$ for all $k$ .
We write $a=\operatorname{diag}(a_{1},a_{2},\ldots ,a_{n})\in A$ . Then, for any $1\leqslant l\leqslant n$ ,
We also write
where $\mathbf{v}_{k}$ is the $l_{0}$ -dimensional column vector next to $(g_{k})_{l_{0}\,\times \,l_{0}}$ in $g_{k}$ . Since $(g_{k})_{l_{0}\,\times \,l_{0}}$ commutes with $(a)_{l_{0}\,\times \,l_{0}}$ , one can compute
where
As $(g_{k})_{(l_{0}+1)\,\times \,(l_{0}+1)}$ does not commute with $(a)_{(l_{0}+1)\,\times \,(l_{0}+1)}$ , we have
From this and the dichotomy assumption on the entries of $g_{k}$ $(k\in \mathbb{N})$ , one can then deduce that $\mathbf{v}_{k}\neq \mathbf{0}$ , $\mathbf{v}_{k}\rightarrow \infty$ and
as $k\rightarrow \infty$ . Here $\operatorname{I}_{l_{0}}$ is the $l_{0}\,\times \,l_{0}$ identity matrix.
Now we can compute
Since $\mathbf{w}_{k}-\mathbf{v}_{k}\rightarrow \infty$ as $k\rightarrow \infty$ , the equation above implies that ${\{a^{-1}g_{k}ag_{k}^{-1}\}}_{k\in \mathbb{N}}$ diverges, which contradicts the boundedness of ${\{g_{k}ag_{k}^{-1}\}}_{k\in \mathbb{N}}$ . This completes the proof of the lemma.◻
Corollary 5.3. Let $S\subset A$ be a subgroup in $A$ . Then, for any $\mathbf{t}\in \operatorname{Lie}(S)$ , either $\operatorname{Ad}(g_{k})\mathbf{t}\rightarrow \infty$ as $k\rightarrow \infty$ or $\operatorname{Ad}(g_{k})\mathbf{t}=\mathbf{t}$ for all $k$ .
Definition 5.4. We define a graph $G(\{{g_{k}\}}_{k\in \mathbb{N}})=(V,E)$ associated to $\{{g_{k}\}}_{k\in \mathbb{N}}$ as follows. The set of vertices $V$ is the index set $[n]=\{1,2,\ldots ,n\}$ . Two vertices $i<j$ are connected by an edge in the edge set $E$ , which we denote by $i\sim j$ , if $u_{ij}(k)\rightarrow \infty$ as $k\rightarrow \infty$ .
We can now prove our first result in this section.
Proposition 5.5. The subalgebra ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})$ of $\operatorname{Lie}(A)$ (as defined in Definition 2.2) is trivial if and only if the graph $G(\{{g_{k}\}}_{k\in \mathbb{N}})$ associated to $\{{g_{k}\}}_{k\in \mathbb{N}}$ is connected.
Proof. Suppose that the graph $G(\{{g_{k}\}}_{k\in \mathbb{N}})$ associated to $\{{g_{k}\}}_{k\in \mathbb{N}}$ is not connected. Let $G_{l}=(V_{l},E_{l})$ ( $1\leqslant l\leqslant m$ ) be the connected components of $G(\{{g_{k}\}}_{k\in \mathbb{N}})$ . We pick $x_{l}\in \mathbb{R}\setminus \{0\}$ such that $\sum _{l=1}^{m}|V_{l}|x_{l}=0$ . Now if a vertex $i\in V_{l}\subset [n]$ , we set $t_{i}=x_{l}$ . In this way we obtain an element $\mathbf{t}=(t_{i})_{1\leqslant i\leqslant n}\in \operatorname{Lie}(A)\setminus \{0\}$ . Note that $\mathbf{t}$ is invertible. We show that
Indeed, since $\mathbf{t}$ is invertible, we compute
For $u_{ij}(k)\neq 0$ , by the definition of the graph $G(\{{g_{k}\}}_{k\in \mathbb{N}})$ , the vertices $i$ and $j$ are in the same connected component. Hence we have $t_{i}=t_{j}$ and
as desired. This implies that $\operatorname{Ad}(g_{k})$ fixes $\mathbf{t}$ , and by definition $\mathbf{t}\in {\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})\neq \{0\}$ .
Now assume that the graph $G(\{{g_{k}\}}_{k\in \mathbb{N}})$ is connected. Suppose that ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})$ is not zero. Then there exists an element $\mathbf{t}\in \operatorname{Lie}A\setminus \{0\}$ such that $\operatorname{Ad}(g_{k})\mathbf{t}$ is bounded as $k\rightarrow \infty$ . Let $a=\exp \mathbf{t}\in A\setminus \{e\}$ . Then $\{g_{k}ag_{k}^{-1}\}$ is bounded in $\operatorname{SL}(n,\mathbb{R})$ . By Lemma 5.2, $g_{k}$ commutes with $a$ . If we write $a=\operatorname{diag}(a_{1},a_{2},\ldots ,a_{n})$ , then the equation $g_{k}=ag_{k}a^{-1}$ yields
and hence $a_{i}=a_{j}$ whenever $u_{ij}(k)\neq 0$ . The connectedness of the graph $G(\{{g_{k}\}}_{k\in \mathbb{N}})$ then implies that all the $a_{i}$ are equal and $a=e$ , which contradicts $a\in A\setminus \{e\}$ . This completes the proof of the proposition.◻
Definition 5.6. Let $G(V,E)$ be a graph consisting of the set of vertices $V$ and the set of edges $E$ . Here we assume $V=\{v_{1},v_{2},\ldots ,v_{n}\}$ is an ordered set with the ordering $\prec$ , and we write $v_{i}\sim v_{j}$ if $v_{i}$ and $v_{j}$ are connected by an edge in $E$ . A subset $S\subset V$ is called UDS (uniquely determined by successors) if it satisfies the following property: for any $v_{i}\in V$ ,
For our purpose, we will consider UDS subsets of $[n]$ in the graph $G(\{{g_{k}\}}_{k\in \mathbb{N}})$ associated to $\{{g_{k}\}}_{k\in \mathbb{N}}$ . The ordering of $[n]$ inherits the natural ordering on $\mathbb{N}$ . The following proposition will be needed in our computations later.
Proposition 5.7. For any $1\leqslant l\leqslant n$ and any nonempty $I\in {\mathcal{I}}_{n}^{l}$ , the sequence $\{g_{k}{e_{I}\}}_{k\in \mathbb{N}}\subset \wedge ^{l}\mathbb{R}^{n}$ is bounded if and only if $I$ is UDS in the vertex set $[n]$ of $G(\{{g_{k}\}}_{k\in \mathbb{N}})$ . If this case happens, then we have $g_{k}e_{I}=e_{I}$ for any $k\in \mathbb{N}$ .
Proof. Let $I=\{i_{1}<i_{2}<\cdots <i_{l}\}$ . Suppose that $\{g_{k}{e_{I}\}}_{k\in \mathbb{N}}$ is bounded. We show that $I$ is UDS in $[n]$ . If not, let $i_{0}$ be the minimum in $I=\{i_{1},\ldots ,i_{l}\}$ such that property (1) in Definition 5.6 does not hold for $i_{0}$ . Then there is $j_{0}<i_{0}$ with $j_{0}\sim i_{0}$ but $j_{0}\notin I$ . By the minimality of $i_{0}$ , for any $i\in I=\{i_{1},i_{2},\ldots ,i_{l}\}$ with $j_{0}<i<i_{0}$ , we have $j_{0}\not \sim i$ ; otherwise $j_{0}\in I$ . This implies that $u_{j_{0},i}(k)=0$ for all $i\in \{i_{1},i_{2},\ldots ,i_{l}\}$ with $i<i_{0}$ . Note that $u_{j_{0},i_{0}}(k)\rightarrow \infty$ as $k\rightarrow \infty$ by our assumption on the entries of $g_{k}$ $(k\in \mathbb{N})$ .
Now we compute $g_{k}e_{I}$ . In particular, by expanding $g_{k}e_{I}$ in terms of the standard basis $\{e_{J}:J\in {\mathcal{I}}_{n}^{l}\}$ in $\wedge ^{l}\mathbb{R}^{n}$ , we are interested in the coefficient in the $e_{J_{0}}$ -coordinate, where $J_{0}=\{i\in I:i\neq i_{0}\}\cup \{j_{0}\}$ . As $u_{j_{0},i}(k)=0$ for all $i\in \{i_{1},i_{2},\ldots ,i_{l}\}$ with $i<i_{0}$ , one can compute
for some $c_{J}\in \mathbb{R}$ $(J\neq J_{0})$ . The divergence of $u_{j_{0},i_{0}}(k)$ then contradicts the boundedness of $g_{k}e_{I}$ . This proves that $I$ is UDS.
Conversely, suppose that $I$ is a UDS subset in $[n]$ . In this case, we will show inductively that, for any $1\leqslant j\leqslant l$ ,
and hence obtain that $g_{k}e_{I}=g_{k}(e_{i_{1}}\wedge e_{i_{2}}\wedge \cdots \wedge e_{i_{l}})$ remains fixed. For $j=1$ , since $\{i_{1},\ldots ,i_{l}\}$ is UDS, this implies that $u_{i,i_{1}}=0$ for all $i<i_{1}$ and $g_{k}e_{i_{1}}=e_{i_{1}}$ . Now assume that the formula holds for $j$ . For $j+1$ , we know that
and hence
This completes the proof of the proposition. ◻
Finally, we will show the following lemma, which will be crucial in our study of convex polytopes in § 6.
Lemma 5.8. Let $G(V,E)$ be a connected graph, where $V=\{v_{1},v_{2},\ldots ,v_{n}\}$ is an ordered set with the ordering $\prec$ . Then we can assign values $x_{1},x_{2},\ldots ,x_{n}$ to the vertices $v_{1},v_{2},\ldots ,v_{n}$ such that:
(i) $\sum _{v_{i}\in V}x_{i}=0$ ;
(ii) for any proper UDS subset $S\subset V$ , $\sum _{v_{i}\in S}x_{i}>0.$
Proof. We use induction on the number of vertices in $G(V,E)$ . There is nothing to prove for $n=1$ . Now suppose that we have $n+1$ vertices. Assume without loss of generality that $v_{1}$ is the smallest according to the ordering $\prec$ on $V$ . We remove the vertex $v_{1}$ and all the edges adjacent to $v_{1}$ from the graph $G$ . This yields a new graph $G^{\prime }$ with $m$ connected components $G_{1}^{\prime }=(V_{1}^{\prime },E_{1}^{\prime }),\ldots ,G_{m}^{\prime }=(V_{m}^{\prime },E_{m}^{\prime })$ for some $m\in \mathbb{N}$ . Since $|V_{j}^{\prime }|\leqslant n$ $(1\leqslant j\leqslant m)$ and $V_{j}^{\prime }$ inherits the ordering from $V$ , we can apply the induction hypothesis on each $G_{j}^{\prime }=(V_{j}^{\prime },E_{j}^{\prime })$ . In particular, we obtain a vector $(x_{2}^{\prime },\ldots ,x_{n+1}^{\prime })\in \mathbb{R}^{n}$ such that the value assignment
satisfies conditions (1) and (2) for each of the graphs $G_{j}^{\prime }$ $(1\leqslant j\leqslant m)$ .
Now we pick a sufficiently small positive number $\unicode[STIX]{x1D716}>0$ such that the new value assignment $x_{i}=x_{i}^{\prime }-\unicode[STIX]{x1D716}$ $(2\leqslant i\leqslant n+1)$ still satisfies condition (2) for each $G_{j}^{\prime }=(V_{j}^{\prime },E_{j}^{\prime })$ , and let $x_{1}=n\unicode[STIX]{x1D716}$ . We show that this value assignment
meets our requirements for $G(V,E)$ . The sum of $x_{i}$ is zero by induction hypothesis. For a proper UDS subset $S\subset V$ , if $v_{1}\notin S$ , then
where $S_{j}^{\prime }$ is a subset in $G_{j}^{\prime }=(V_{j}^{\prime },E_{j}^{\prime })$ $(1\leqslant j\leqslant m)$ , and either $S_{j}^{\prime }$ is a proper UDS subset in $G_{j}^{\prime }=(V_{j}^{\prime },E_{j}^{\prime })$ or $S_{j}^{\prime }=V_{j}^{\prime }$ . Since $v_{1}\notin S$ , by the connectedness of $G(V,E)$ and the UDS property of $S$ , there is some $j$ with $S_{j}^{\prime }\neq V_{j}^{\prime }$ and hence, by taking $\unicode[STIX]{x1D716}$ sufficiently small,
If $S=\{v_{1}\}$ , then condition (2) holds automatically. If $v_{1}\in S$ and $S\neq \{v_{1}\}$ , then
where $S_{j}^{\prime }$ is a subset in $G_{j}^{\prime }=(V_{j}^{\prime },E_{j}^{\prime })$ $(1\leqslant j\leqslant m)$ , and either $S_{j}^{\prime }$ is a proper UDS subset in $G_{j}^{\prime }=(V_{j}^{\prime },E_{j}^{\prime })$ or $S_{j}^{\prime }=V_{j}^{\prime }$ . Since $S$ is proper in $V$ , there is some $j$ with $S_{j}^{\prime }\neq V_{j}^{\prime }$ , and hence we have
This completes the proof of the lemma. ◻
6 Convex polytopes revisited
In this section we will study the convex polytopes $\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}}$ , where $\{{g_{k}\}}_{k\in \mathbb{N}}$ is a sequence in $G$ satisfying the condition in Theorem 2.5. Our aim in this section is Proposition 6.3, which shows a crucial property of $\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}}$ concerning its surface area and volume. This property will play an important role at various points in the paper.
In the proof of Theorem 2.5, the case of ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ plays a central role, and other cases can be deduced from this case. We remark here that in view of Corollary 5.3, ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ if and only if the limit points of $\{\operatorname{Ad}(g_{k})\operatorname{Lie}(A)\}_{k\in \mathbb{N}}$ in the Grassmanian manifold of $\mathfrak{g}$ are subalgebras consisting of nilpotent matrices. So from this section to § 9, we will make additional assumptions on $\{{g_{k}\}}_{k\in \mathbb{N}}$ that ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ , and by passing to a subsequence, $\operatorname{Ad}(g_{k})\operatorname{Lie}(A)$ converges to a subalgebra consisting of nilpotent matrices in the Grassmanian manifold of $\mathfrak{g}$ . We write $\lim _{k\rightarrow \infty }\operatorname{Ad}(g_{k})\operatorname{Lie}(A)$ for the limiting subalgebra and $\lim _{k\rightarrow \infty }\operatorname{Ad}(g_{k})A$ for the corresponding limiting unipotent subgroup.
Lemma 6.1. For any $0<\unicode[STIX]{x1D6FF}<1$ , the region
is a convex subset in $\operatorname{Lie}(A)$ which contains an unbound open cone.
Proof. It suffices to prove the lemma for the region
By our assumptions on $\{{g_{k}\}}_{k\in \mathbb{N}}$ and Proposition 5.5, the graph $G(\{{g_{k}\}}_{k\in \mathbb{N}})$ associated to $\{{g_{k}\}}_{k\in \mathbb{N}}$ is connected. Now by applying Lemma 5.8 with the graph $G(\{{g_{k}\}}_{k\in \mathbb{N}})$ , one can find $\mathbf{x}=(x_{1},x_{2},\ldots ,x_{n})\in \operatorname{Lie}(A)$ such that
Then by linearity, for any $\unicode[STIX]{x1D706}>0$ ,
This implies that there exists an unbounded open cone around the axis $\{\unicode[STIX]{x1D706}\mathbf{x},\unicode[STIX]{x1D706}>0\}$ , which is contained in
This completes the proof of the lemma. ◻
Lemma 6.2. Let $0<\unicode[STIX]{x1D6FF}<1$ . For every $k\in \mathbb{N}$ , the region $\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}}$ contains a ball $B_{k}$ of radius $r_{k}$ , and $r_{k}\rightarrow \infty$ as $k\rightarrow \infty$ .
Proof. By definition, we know that
Note that the origin belongs to $\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}}$ , because each $g_{k}$ is in the upper triangular unipotent subgroup and $\Vert g_{k}e_{I}\Vert \geqslant 1$ for nonempty $I\in {\mathcal{I}}_{n}$ . Now we can write
where we use $g_{k}e_{I}=e_{I}$ for any UDS set $I$ by Proposition 5.7. For a non-UDS set $I$ , we have $g_{k}e_{I}\rightarrow \infty$ as $k\rightarrow \infty$ .
Since $g_{k}e_{I}\rightarrow \infty$ for any non-UDS set $I$ , the region
contains a large ball $S_{k}$ around the origin for sufficiently large $k$ . By Lemma 6.1, the region
contains an unbounded cone $C$ (which does not depend on $k$ ) with cusp at the origin. This implies that
and $\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}}$ contains a large ball $B_{k}$ of radius $r_{k}$ with $r_{k}\rightarrow \infty$ as $k\rightarrow \infty$ .◻
Proposition 6.3. For any $0<\unicode[STIX]{x1D6FF}<1$ , we have
Actually, we will apply the following variant of Proposition 6.3 later.
Corollary 6.4. Let $0<\unicode[STIX]{x1D6FF}_{1}<\unicode[STIX]{x1D6FF}_{2}<1$ . Then
Proof. By definition, we know that $\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}_{2}}\subset \unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}_{1}}$ . Let $\{f_{i}\}$ be the collection of the facets of $\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}_{1}}$ , and denote by $P_{i}$ the hyperplane determined by $f_{i}$ . For each $f_{i}$ , let $B_{i}$ be the unique cylinder with the following properties.
(i) The base of $B_{i}$ is $f_{i}$ , and the height of $B_{i}$ is equal to $\ln \unicode[STIX]{x1D6FF}_{2}-\ln \unicode[STIX]{x1D6FF}_{1}$ .
(ii) $B_{i}$ and $\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}_{1}}$ lie in the same half-space determined by $P_{i}$ .
Then we have
and
Now the corollary follows from Proposition 6.3. ◻
From now on, we will fix a $\unicode[STIX]{x1D6FF}>0$ for any $g\in G$ in the notation $\unicode[STIX]{x1D6FA}_{g,\unicode[STIX]{x1D6FF}}$ unless otherwise specified. For each $k\in \mathbb{N}$ , we choose the representative
in $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax_{e}}]$ . We will show in the following section that these representatives converge to a locally finite measure $\unicode[STIX]{x1D708}$ . We will denote by
the restriction of $\unicode[STIX]{x1D707}_{Ax_{e}}$ on $\exp (\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}})x_{e}$ .
7 Nondivergence
In this section we will study the nondivergence of the sequence
The study relies on a growth property of a special class of functions studied by Eskin, Mozes and Shah [Reference Eskin, Mozes and ShahEMS97], and a nondivergence theorem proved by Kleinbock and Margulis [Reference Kleinbock and MargulisKM98, Reference KleinbockKle10]. As a corollary we will deduce that these measures actually converge to a probability measure, which is invariant under a unipotent subgroup. This is where Ratner’s theorem will come into play in § 9 and help us prove the measure rigidity. The goal in this section is to prove Proposition 7.7.
First, we need the following definition of a class of functions, which is introduced in [Reference Eskin, Mozes and ShahEMS97].
Definition 7.1 [Reference Eskin, Mozes and ShahEMS97, Definition 2.1].
Let $d\in \mathbb{N}$ and $\unicode[STIX]{x1D706}>0$ be given. Define by $E(d,\unicode[STIX]{x1D706})$ the set of functions $f:\mathbb{R}\rightarrow \mathbb{C}$ of the form
where $a_{i}\in \mathbb{C}$ and $\unicode[STIX]{x1D706}_{i}\in \mathbb{C}$ with $|\unicode[STIX]{x1D706}_{i}|\leqslant \unicode[STIX]{x1D706}$ .
The following proposition describes the growth property of functions in $E(d,\unicode[STIX]{x1D706})$ . We denote by $m_{\mathbb{R}}$ the Lebesgue measure on $\mathbb{R}$ .
Proposition 7.2 [Reference Eskin, Mozes and ShahEMS97, Corollary 2.10].
For any $d\in \mathbb{N}$ and $\unicode[STIX]{x1D706}>0$ , there exists a constant $\unicode[STIX]{x1D6FF}_{0}=\unicode[STIX]{x1D6FF}_{0}(d,\unicode[STIX]{x1D706})$ satisfying the following condition: for any $\unicode[STIX]{x1D716}>0$ , there exists $M>0$ such that, for any $f\in E(d,\unicode[STIX]{x1D706})$ and any interval $\unicode[STIX]{x1D6EF}$ of length at most $\unicode[STIX]{x1D6FF}_{0}$ ,
For any nonzero discrete subgroup $\unicode[STIX]{x1D6EC}$ in $\mathbb{R}^{n}$ , one could define its covolume as follows. Let $\{v_{1},v_{2},\ldots ,v_{l}\}$ be a $\mathbb{Z}$ -basis of $\unicode[STIX]{x1D6EC}$ , where $l$ is the rank of $\unicode[STIX]{x1D6EC}$ . Then the covolume of $\unicode[STIX]{x1D6EC}$ is defined to be the length of the wedge product $v_{1}\wedge \cdots \wedge v_{l}$ in $\wedge ^{l}\mathbb{R}^{n}$ , where the norm in $\wedge ^{l}\mathbb{R}^{n}$ is induced by the Euclidean norm on $\mathbb{R}^{n}$ . In an abuse of notation, we will write $\Vert \unicode[STIX]{x1D6EC}\Vert$ for the covolume of $\unicode[STIX]{x1D6EC}$ . One can check that this notion of covolume is well defined.
The following theorem is essentially proved in [Reference KleinbockKle10, Reference Kleinbock and MargulisKM98].
Theorem 7.3 (Cf. [Reference KleinbockKle10, Theorem 3.4], [Reference Kleinbock and MargulisKM98, Theorem 5.2]).
Let $d\in \mathbb{N}$ and $\unicode[STIX]{x1D706}>0$ . Let $\unicode[STIX]{x1D6FF}_{0}=\unicode[STIX]{x1D6FF}_{0}(d,\unicode[STIX]{x1D706})$ be as in Proposition 7.2. Suppose that an interval $\unicode[STIX]{x1D6EF}\subset \mathbb{R}$ of length at most $\unicode[STIX]{x1D6FF}_{0}$ , $0<\unicode[STIX]{x1D70C}<1$ and a continuous map $h:\unicode[STIX]{x1D6EF}\rightarrow \operatorname{SL}(n,\mathbb{R})$ are given. Assume that for any nonzero discrete subgroup $\unicode[STIX]{x1D6E5}$ in $\mathbb{Z}^{n}$ we have that
(i) the function $x\rightarrow \Vert h(x)\unicode[STIX]{x1D6E5}\Vert ^{2}$ on $\unicode[STIX]{x1D6EF}$ belongs to $E(d,\unicode[STIX]{x1D706})$ ; and
(ii) $\sup _{x\in \unicode[STIX]{x1D6EF}}\Vert h(x)\unicode[STIX]{x1D6E5}\Vert \geqslant \unicode[STIX]{x1D70C}$ .
Then, for any $\unicode[STIX]{x1D716}<\unicode[STIX]{x1D70C}$ , there exists a constant $\unicode[STIX]{x1D6FF}(\unicode[STIX]{x1D716})>0$ depending only on $d$ and $\unicode[STIX]{x1D706}$ such that
Proof. The proof is the same as in [Reference Kleinbock and MargulisKM98, Theorem 5.2], but inequality (2) is used instead of the $(C,\unicode[STIX]{x1D6FC})$ -good property.◻
Lemma 7.4. Let $E$ be a normed vector space, and let $\unicode[STIX]{x1D6FC}_{i}$ $(1\leqslant i\leqslant m)$ be different linear functionals on $E$ . Then, for any $r>0$ , we can find $m$ vectors $x_{1},x_{2},\ldots ,x_{m}\in B_{r}(0)$ such that
Here $B_{r}(0)$ is the ball of radius $r$ around $0$ in $E$ .
Proof. We can find a line $L$ through the origin such that $\unicode[STIX]{x1D6FC}_{i}|L$ are different functionals defined on $L$ . This could be achieved by picking a line which avoids all the kernels of $\unicode[STIX]{x1D6FC}_{i}-\unicode[STIX]{x1D6FC}_{j}$ . Hence, it suffices to prove the lemma for $\dim E=1$ .
Let $E=\mathbb{R}$ and $\unicode[STIX]{x1D6FC}_{i}(x)=\unicode[STIX]{x1D706}_{i}x$ for different $\unicode[STIX]{x1D706}_{i}$ . We will show inductively that for any $r>0$ there exist $x_{1},x_{2},\ldots ,x_{m}\in (-r,r)$ such that
It is easy to verify for $m=1$ . Now, for $m+1$ different $\unicode[STIX]{x1D706}_{i}$ , we compute
where $A_{m+1}=\det ((e^{\unicode[STIX]{x1D706}_{i}x_{j}})_{1\leqslant i,j\leqslant m})$ . By the induction hypothesis, we can find $x_{1},x_{2},\ldots ,x_{m}\in (-r,r)$ such that $A_{m+1}\neq 0$ . By the fact that $e^{\unicode[STIX]{x1D706}_{i}x}$ $(1\leqslant i\leqslant m)$ are linearly independent functions, and by the choice of $x_{1},x_{2},\ldots ,x_{m}$ , the function $\det ((e^{\unicode[STIX]{x1D706}_{i}x_{j}})_{1\leqslant i,j\leqslant m+1})$ is a nonzero analytic function in $x_{m+1}$ . Since zeros of any analytic function are isolated, this implies that there exists $x_{m+1}\in (-r,r)$ such that $\det ((e^{\unicode[STIX]{x1D706}_{i}x_{j}})_{1\leqslant i,j\leqslant m+1})\neq 0$ .◻
The following proposition describes the supremum of a special function. We will need this proposition to verify assumption (ii) in Theorem 7.3.
Proposition 7.5. Let $E$ and $V$ be normed vector spaces, and $v_{i}\in V$ $(1\leqslant i\leqslant m)$ . Let $f$ be a map from $E$ to $V$ defined by
where the $\unicode[STIX]{x1D6FC}_{i}$ $(1\leqslant i\leqslant m)$ are different linear functionals on $E$ . Suppose that on an open ball $R\subset E$ of radius $r>0$ we have
for some $M>0$ . Then there exists a constant $c>0$ which only depends on the $\unicode[STIX]{x1D6FC}_{i}$ and $r$ such that
Proof. Let $x_{0}$ be the center of $R$ and $B_{r}(0)$ the ball of radius $r$ around $0$ in $E$ . Then $R=x_{0}+B_{r}(0)$ . By Lemma 7.4, we can find $y_{j}\in B_{r}(0)$ $(1\leqslant j\leqslant m)$ such that
We fix this choice of $y_{j}$ which only depends on the $\unicode[STIX]{x1D6FC}_{i}$ and $r$ . Let $x_{j}=x_{0}+y_{j}\in R$ $(1\leqslant j\leqslant m)$ . We have
Let $C$ be the matrix norm of $(e^{\unicode[STIX]{x1D6FC}_{i}(y_{j})})_{1\leqslant j,i\leqslant m}^{-1}$ . Since
this implies that one of $\Vert f(x_{j})\Vert$ $(1\leqslant j\leqslant m)$ is at least $M/mC$ . Hence $\sup _{x\in R}\Vert f(x)\Vert \geqslant cM$ with $c=1/mC$ .◻
For any $g\in G$ , $x_{0}\in \operatorname{Lie}(A)$ , a unit vector $\vec{v}\in \operatorname{Lie}(A)$ and $w=\sum _{I\in {\mathcal{I}}_{n}^{l}}w_{I}e_{I}\in \wedge ^{l}\mathbb{R}^{n}$ ( $w_{I}\in \mathbb{R}$ ), the function
belongs to $E(d,\unicode[STIX]{x1D706})$ , where $d=n^{2l}$ , $\unicode[STIX]{x1D706}=2l$ and $\Vert \cdot \Vert$ is the norm on $\wedge ^{l}\mathbb{R}^{n}$ induced by the Euclidean norm on $\mathbb{R}^{n}$ . Indeed,
is a vector in $\wedge ^{l}\mathbb{R}^{n}$ with coordinates being exponential functions of $t$ . Hence, $g\exp (x_{0}+t\vec{v})\cdot w$ is a vector whose coordinates are sums of exponential functions of $t$ . By a simple calculation, one could get that the function $\Vert g\exp (x_{0}+t\vec{v})\cdot w\Vert ^{2}$ belongs to $E(d,\unicode[STIX]{x1D706})$ with $d=n^{2l}$ and $\unicode[STIX]{x1D706}=2l$ . In what follows, we will study functions of this kind.
With the help of Theorem 7.3 and Proposition 7.5, we can now study the nondivergence of the sequence $(1/\operatorname{Vol}(\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}}))(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}$ . We write
By Mahler’s compactness criterion, this is a compact subset in $G/\unicode[STIX]{x1D6E4}$ . The following proposition is crucial in the proof of Proposition 7.7.
Proposition 7.6. For any $\unicode[STIX]{x1D716}>0$ , there exists a constant $\unicode[STIX]{x1D6FF}(\unicode[STIX]{x1D716})>0$ such that. for sufficiently large $k\in \mathbb{N}$ .
Proof. Fix a unit vector $\vec{v}\in \operatorname{Lie}(A)$ such that the values in
are all different. Let $d=n^{2n}$ and $\unicode[STIX]{x1D706}=2n$ such that for any $x_{0}\in \operatorname{Lie}(A)$ , $l\in \mathbb{N}$ , $w\in \wedge ^{l}\mathbb{R}^{n}$ and $k\in \mathbb{N}$ , the function
belongs to $E(d,\unicode[STIX]{x1D706})$ as defined in Definition 7.1. We will write $\unicode[STIX]{x1D6FF}_{0}$ for the constant $\unicode[STIX]{x1D6FF}_{0}(d,\unicode[STIX]{x1D706})$ defined in Proposition 7.2.
We can find a cover of $\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}}$ by countably many disjoint small boxes of diameter at most $\unicode[STIX]{x1D6FF}_{0}$ such that each box is of the form
where $S$ is the base of $B$ perpendicular to $\vec{v}$ and $\unicode[STIX]{x1D6EF}=[0,\unicode[STIX]{x1D6FF}_{0}]$ . We denote by ${\mathcal{F}}$ the collection of these boxes. Let ${\mathcal{F}}={\mathcal{F}}_{1}\cup {\mathcal{F}}_{2}$ where ${\mathcal{F}}_{1}$ is the collection of the boxes in ${\mathcal{F}}$ which intersect $\unicode[STIX]{x2202}\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}}$ and ${\mathcal{F}}_{2}={\mathcal{F}}\setminus {\mathcal{F}}_{1}$ . Then for any box $B\in {\mathcal{F}}_{2}$ , $B$ is contained in $\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}}$ .
Since the diameter of each box in ${\mathcal{F}}$ is at most $\unicode[STIX]{x1D6FF}_{0}$ , in view of Lemmas 4.6 and 6.2 and Proposition 6.3, for any $\unicode[STIX]{x1D716}>0$ , we have
for sufficiently large $k$ . In order to prove the proposition, it suffices to show that for any $\unicode[STIX]{x1D716}>0$ , there exists $\unicode[STIX]{x1D6FF}(\unicode[STIX]{x1D716})>0$ such that for each box $B\in {\mathcal{F}}_{2}$ , we have
Now fix a box $B\in {\mathcal{F}}_{2}$ with
where $S$ is the base of $B$ and $\unicode[STIX]{x1D6EF}=[0,\unicode[STIX]{x1D6FF}_{0}]$ . We will apply Theorem 7.3. Let $\unicode[STIX]{x1D6E5}$ be a nonzero discrete subgroup of rank $l$ in $\mathbb{Z}^{n}$ with a $\mathbb{Z}$ -basis $\{v_{1},v_{2},\ldots ,v_{l}\}\subset \mathbb{Z}^{n}.$ The wedge product $v_{1}\wedge \cdots \wedge v_{l}\in \wedge ^{l}\mathbb{R}^{n}$ can be written as
where $a_{I}\in \mathbb{Z}$ . We define a map from $B$ to $\wedge ^{l}\mathbb{R}^{n}$ by
For each $x_{0}\in S$ , we consider the map
from $\unicode[STIX]{x1D6EF}=[0,\unicode[STIX]{x1D6FF}_{0}]$ to $\wedge ^{l}\mathbb{R}^{n}$ . Since $B\subset \unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}}$ , by our construction of $\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}}$ , we have
By Proposition 7.5, we have
Note that by Proposition 7.5, this inequality holds with a uniform constant $c>0$ depending only on $\unicode[STIX]{x1D714}_{I}(\mathbf{t})$ $(I\in {\mathcal{I}}_{n})$ and $\unicode[STIX]{x1D6FF}_{0}$ for any nonzero $\unicode[STIX]{x1D6E5}\subset \mathbb{Z}^{n}$ . Since $\Vert f_{\unicode[STIX]{x1D6E5}}(x_{0}+t\vec{v})\Vert$ is the covolume of $g_{k}(\exp (x_{0}+t\vec{v}))\unicode[STIX]{x1D6E5}$ and $\Vert f_{\unicode[STIX]{x1D6E5}}(x_{0}+t\vec{v})\Vert ^{2}$ is a function in $E(d,\unicode[STIX]{x1D706})$ , we can apply Theorem 7.3 and obtain that
for some constant $\unicode[STIX]{x1D6FF}(\unicode[STIX]{x1D716})>0$ and for any $x_{0}\in S$ . Now by integrating the inequality above over the region $x_{0}\in S$ , we have
The proposition now follows. ◻
We can now prove the main result in this section.
Proposition 7.7. By passing to a subsequence, the sequence $(1/\operatorname{Vol}(\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}}))(g_{k})_{\ast }(\unicode[STIX]{x1D707}_{Ax_{e}}|_{\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}}})$ converges to a probability measure $\unicode[STIX]{x1D708}$ . Furthermore, we have
and hence the sequence $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax_{e}}]$ converges to $[\unicode[STIX]{x1D708}]$ . Here the probability measure $\unicode[STIX]{x1D708}$ is invariant under the action of the unipotent subgroup $\lim _{n\rightarrow \infty }\operatorname{Ad}(g_{k})A$ .
Proof. Suppose that the sequence of probability measures
weakly converges to a measure $\unicode[STIX]{x1D708}$ after passing to a subsequence. We show that $\unicode[STIX]{x1D708}$ is a probability measure. It is obvious that $\unicode[STIX]{x1D708}(X)\leqslant 1$ .
Now, for any $\unicode[STIX]{x1D716}>0$ , let ${\mathcal{K}}_{\unicode[STIX]{x1D6FF}(\unicode[STIX]{x1D716})}$ be the compact subset in $G/\unicode[STIX]{x1D6E4}$ as in Proposition 7.6. Let $f_{\unicode[STIX]{x1D716}}$ be a nonnegative continuous function with compact support on $G/\unicode[STIX]{x1D6E4}$ such that $0\leqslant f_{\unicode[STIX]{x1D716}}\leqslant 1$ and $f_{\unicode[STIX]{x1D716}}=1$ on ${\mathcal{K}}_{\unicode[STIX]{x1D6FF}(\unicode[STIX]{x1D716})}$ . Then we have
where the last inequality follows from Proposition 7.6. By taking $\unicode[STIX]{x1D716}\rightarrow 0$ , we conclude that $\unicode[STIX]{x1D708}$ is a probability measure.
For the second claim, we will show that
Let $f\in C_{c}(X)$ . Since $f$ has compact support, there exists a small number $\unicode[STIX]{x1D6FF}^{\prime }<\unicode[STIX]{x1D6FF}$ such that
Here $d\mathbf{t}=dm_{\operatorname{Lie}(A)}(\mathbf{t})$ is the natural measure on $\operatorname{Lie}(A)$ . By Corollary 6.4, we have
Here $\Vert f\Vert _{\infty }$ is the supremum of $f$ . Since $(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}$ is invariant under the action of $\operatorname{Ad}(g_{k})A$ , the probability measure $\unicode[STIX]{x1D708}$ is invariant under the action of $\lim _{k\rightarrow \infty }\operatorname{Ad}(g_{k})A$ , which is a unipotent subgroup by our assumption on $\{{g_{k}\}}_{k\in \mathbb{N}}$ .◻
8 Nondivergence in terms of adjoint representations
In this section we rewrite § 7 in terms of adjoint representations. The reason of doing this is that we can then apply Ratner’s theorem for unipotent actions on homogeneous spaces.
Let $\operatorname{Ad}:G\rightarrow \operatorname{SL}(\mathfrak{g})$ be the adjoint representation of $G=\operatorname{SL}(n,\mathbb{R})$ . The Lie algebra $\mathfrak{g}=\mathfrak{sl}(n,\mathbb{R})$ has a $\mathbb{Q}$ -basis
where $E_{ij}$ $(i\neq j)$ is the matrix with only nonzero entry $1$ in the $i$ th row and the $j$ th column, and $E_{ii}$ $(1\leqslant i\leqslant n-1)$ is the diagonal matrix with $1$ as $(i,i)$ th entry and $-1$ as $(i+1,i+1)$ th entry. We will also consider the representations $\wedge ^{l}\operatorname{Ad}:G\rightarrow \operatorname{SL}(\wedge ^{l}\mathfrak{g})$ for $1\leqslant l\leqslant \dim \mathfrak{g}-1$ . The set of all $l$ th wedge products of vectors in ${\mathcal{B}}$ is then a $\mathbb{Q}$ -basis of $\wedge ^{l}\mathfrak{g}$ , which we denote by ${\mathcal{B}}_{l}$ .
Let $1\leqslant l\leqslant \dim \mathfrak{g}-1$ . For $\wedge ^{l}\mathfrak{g}$ , its decomposition with respect to the action of $\wedge ^{l}\operatorname{Ad}A$ is given by
where each $\unicode[STIX]{x1D712}$ is a linear functional on $\operatorname{Lie}(A)$ such that, for any $\mathbf{t}\in \operatorname{Lie}(A)$ and $v\in \mathfrak{g}_{\unicode[STIX]{x1D712}}$ ,
We denote by ${\mathcal{W}}_{l}(\mathfrak{g})$ the collection of all such linear functionals $\unicode[STIX]{x1D712}$ , and let
We know that each $\mathfrak{g}_{\unicode[STIX]{x1D712}}$ $(\unicode[STIX]{x1D712}\in {\mathcal{W}}_{l}(\mathfrak{g}))$ has a $\mathbb{Q}$ -basis from ${\mathcal{B}}_{l}$ , and we denote by $\mathfrak{g}_{\unicode[STIX]{x1D712}}(\mathbb{Z})$ the subset of integer vectors with respect to this basis.
Now let $g\in G$ . We define for $gA\unicode[STIX]{x1D6E4}$ another convex polytope in $\operatorname{Lie}(A)$ in terms of adjoint representations, which is similar to the convex polytope $\unicode[STIX]{x1D6FA}_{g,\unicode[STIX]{x1D6FF}}$ in § 4. Let $1\leqslant l\leqslant \dim \mathfrak{g}-1$ and $\unicode[STIX]{x1D712}\in {\mathcal{W}}_{l}(\mathfrak{g})$ . Let $v\in \mathfrak{g}_{\unicode[STIX]{x1D712}}(\mathbb{Z})\setminus \{0\}$ . Then for $\mathbf{t}\in \operatorname{Lie}(A)$ , the vector
if and only if
Here $B_{\unicode[STIX]{x1D6FF}}$ denotes the ball of radius $\unicode[STIX]{x1D6FF}>0$ around 0 with the norm $\Vert \cdot \Vert$ on $\wedge ^{l}\mathfrak{g}$ induced by the norm $\Vert \cdot \Vert _{\mathfrak{g}}$ on $\mathfrak{g}$ . We now give the following definition.
Definition 8.1. For any $g\in G$ and $\unicode[STIX]{x1D6FF}>0$ , we denote by $R_{g,\unicode[STIX]{x1D6FF}}$ the subset of points $\mathbf{t}\in \operatorname{Lie}(A)$ satisfying
for any $v\in \mathfrak{g}_{\unicode[STIX]{x1D712}}(\mathbb{Z})\setminus \{0\}$ , $\unicode[STIX]{x1D712}\in {\mathcal{W}}_{l}(\mathfrak{g})$ and $1\leqslant l\leqslant \dim \mathfrak{g}-1.$
The proof of the following proposition is similar to that of Lemma 4.3.
Proposition 8.2. The subset $R_{g,\unicode[STIX]{x1D6FF}}$ is a bounded convex polytope in $\operatorname{Lie}(A)$ for any $g\in G$ and $\unicode[STIX]{x1D6FF}>0$ .
Here we list some properties about the convex polytopes $R_{g_{k},\unicode[STIX]{x1D6FF}}$ $(k\in \mathbb{N})$ , which are parallel to those in §§ 6 and 7.
Proposition 8.3. Let $\unicode[STIX]{x1D6FF}>0$ . We have the following statements.
(i) For any $\unicode[STIX]{x1D716}>0$ there exists $\unicode[STIX]{x1D6FF}(\unicode[STIX]{x1D716})>0$ such that, for sufficiently large $k>0$ ,
$$\begin{eqnarray}m_{\operatorname{Lie}(A)}(R_{g_{k},\unicode[STIX]{x1D6FF}(\unicode[STIX]{x1D716})}\cap \unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}})\geqslant (1-\unicode[STIX]{x1D716})m_{\operatorname{Lie}(A)}(\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}}).\end{eqnarray}$$(ii) For sufficiently large $k$ , $R_{g_{k},\unicode[STIX]{x1D6FF}}$ contains a ball of radius $r_{k}>0$ , and $r_{k}\rightarrow \infty$ as $k\rightarrow \infty$ .
Proof. For any $\unicode[STIX]{x1D716}>0$ , let $\unicode[STIX]{x1D6FF}(\unicode[STIX]{x1D716})$ be as in Proposition 7.6. By applying Mahler’s compactness criterion on the space of unimodular lattices in $\wedge ^{l}\mathfrak{g}$ $(1\leqslant l\leqslant \dim \mathfrak{g}-1)$ , we can find a $\unicode[STIX]{x1D6FF}^{\prime }(\unicode[STIX]{x1D716})>0$ such that
Now the first part of the proposition follows from Proposition 7.6.
For the second part, we fix $\unicode[STIX]{x1D716}>0$ . By Lemmas 4.5 and 6.2 and the first claim of the proposition, for sufficiently large $k\in \mathbb{N}$ , the convex polytope $R_{g_{k},\unicode[STIX]{x1D6FF}(\unicode[STIX]{x1D716})}\supset R_{g_{k},\unicode[STIX]{x1D6FF}(\unicode[STIX]{x1D716})}\cap \unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}}$ contains a ball of radius $r_{k}$ , and $r_{k}\rightarrow \infty$ as $k\rightarrow \infty$ . By definition, the same holds for $R_{g_{k},\unicode[STIX]{x1D6FF}}$ for any $\unicode[STIX]{x1D6FF}>0$ .◻
Proposition 8.4. For any $\unicode[STIX]{x1D6FF}>0$ , we have
Proof. The proof is identical to that of Proposition 6.3. ◻
Proposition 8.5. Let $\unicode[STIX]{x1D6FF}>0$ . For any $\unicode[STIX]{x1D716}>0$ , there exists a constant $\unicode[STIX]{x1D6FF}(\unicode[STIX]{x1D716})>0$ such that, for sufficiently large $k$ ,
Proof. The proof is similar to that of Proposition 7.6, except that we replace the linear functionals $\unicode[STIX]{x1D714}_{I}(\mathbf{t})$ by $\unicode[STIX]{x1D712}$ in ${\mathcal{W}}_{l}(\mathfrak{g})$ $(1\leqslant l\leqslant \dim \mathfrak{g}-1)$ .◻
Proposition 8.6. Let $\unicode[STIX]{x1D6FF}>0$ . By passing to a subsequence, the sequence $(1/\operatorname{Vol}(R_{g_{k},\unicode[STIX]{x1D6FF}}))(g_{k})_{\ast }(\unicode[STIX]{x1D707}_{Ax}|_{R_{g_{k},\unicode[STIX]{x1D6FF}}})$ converges to a probability measure $\unicode[STIX]{x1D708}$ . We also have
and hence the sequence $[g_{k}\unicode[STIX]{x1D707}_{Ax}]$ converges to $[\unicode[STIX]{x1D708}]$ . Furthermore, the probability measure $\unicode[STIX]{x1D708}$ is invariant under the action of the unipotent subgroup $\lim _{n\rightarrow \infty }\operatorname{Ad}(g_{k})A$ .
Proof. The proof is identical to Proposition 7.7 with $\unicode[STIX]{x1D6FA}_{g_{k},\unicode[STIX]{x1D6FF}}$ replaced by $R_{g_{k},\unicode[STIX]{x1D6FF}}$ .◻
The following is an immediate corollary of Propositions 3.3, 7.7 and 8.6.
Corollary 8.7. For any $\unicode[STIX]{x1D6FF}>0$ , we have
In the rest of the paper we will fix a $\unicode[STIX]{x1D6FF}>0$ for $R_{g_{k},\unicode[STIX]{x1D6FF}}$ $(k\in \mathbb{N})$ unless otherwise specified.
9 Ratner’s theorem and linearization
Because of Proposition 8.6, we can apply the measure classification theorem for unipotent actions on homogeneous spaces. This theorem was first conjectured by Raghunathan and Dani [Reference DaniDan81], and later a breakthrough was made by Margulis in his celebrated proof of the Oppenheim conjecture [Reference MargulisMar89]. Afterwards, the measure classification theorem was proved by Ratner in her seminal work [Reference RatnerRat90a, Reference RatnerRat90b, Reference RatnerRat91]. One could also consult the paper by Margulis and Tomanov [Reference Margulis and TomanovMT94] for a different proof. In this section, for convenience, we borrow the framework and presentation of [Reference Mozes and ShahMS95]. Readers may refer to [Reference Dani and MargulisDM93] and [Reference ShahSha91] for related discussions. This section is the final step of preparation for the proof of Theorem 2.5, and is devoted to proving Proposition 9.6.
9.1 Prerequisites
We start by recalling some well-known results (see [Reference Mozes and ShahMS95] for more details). Let ${\mathcal{H}}$ be the countable collection of all closed connected subgroups $H$ of $G$ such that $H\cap \unicode[STIX]{x1D6E4}$ is a lattice in $H$ and the group generated by one-parameter unipotent subgroups in $H$ acts ergodically on $H\unicode[STIX]{x1D6E4}/\unicode[STIX]{x1D6E4}$ with respect to the $H$ -invariant probability measure.
Let $W=\lim _{k\rightarrow \infty }\operatorname{Ad}(g_{k})A$ . By our assumptions on $\{{g_{k}\}}_{k\in \mathbb{N}}$ , $W$ is a connected unipotent subgroup of $G$ . Let $\unicode[STIX]{x1D70B}:G\rightarrow G/\unicode[STIX]{x1D6E4}$ be the natural projection map. For $H\in {\mathcal{H}}$ , define
For any $H_{1},H_{2}\in {\mathcal{H}}$ , $T_{H_{1}}(W)$ and $T_{H_{2}}(W)$ intersect if and only if $T_{H_{1}}(W)=T_{H_{2}}(W)$ .
Theorem 9.1 [Reference RatnerRat91], [Reference Mozes and ShahMS95, Theorem 2.2].
Let $\unicode[STIX]{x1D707}$ be a $W$ -invariant probability measure on $X$ . For any $H\in {\mathcal{H}}$ , let $\unicode[STIX]{x1D707}_{H,W}$ be the restriction of $\unicode[STIX]{x1D707}$ on $T_{H}(W)$ .
(i) We have $\unicode[STIX]{x1D707}=\sum _{H\in {\mathcal{H}}^{\ast }}\unicode[STIX]{x1D707}_{H,W}$ . Here ${\mathcal{H}}^{\ast }$ is a set of representatives of $\unicode[STIX]{x1D6E4}$ -conjugacy classes in ${\mathcal{H}}$ .
(ii) For each $H\in {\mathcal{H}}^{\ast }$ , $\unicode[STIX]{x1D707}_{H,W}$ is $W$ -invariant. Any $W$ -invariant ergodic component of $\unicode[STIX]{x1D707}_{H,W}$ is the invariant probability measure on $gH\unicode[STIX]{x1D6E4}/\unicode[STIX]{x1D6E4}$ for some $g\in N(H,W)$ .
In the following, we will fix a subgroup $H\in {\mathcal{H}}$ $(H\not =G)$ . Let $d_{H}=\dim \operatorname{Lie}(H)$ and $V_{H}=\wedge ^{d_{H}}\mathfrak{g}$ . Then $G$ acts on $V_{H}$ via the wedge product representation $\wedge ^{d_{H}}\operatorname{Ad}$ . Since $H$ is a $\mathbb{Q}$ -group, we can find an integral point $p_{H}\in \wedge ^{d_{H}}\operatorname{Lie}(H)\setminus \{0\}$ . We will fix this $p_{H}$ . Let $N(H)$ be the normalizer of $H$ in $G$ , and $\unicode[STIX]{x1D6E4}_{H}=N(H)\cap \unicode[STIX]{x1D6E4}$ . Then $\unicode[STIX]{x1D6E4}_{H}\cdot p_{H}\subset \{p_{H},-p_{H}\}$ . Define $\overline{V}_{H}=V_{H}/\{1,-1\}$ if $\unicode[STIX]{x1D6E4}_{H}\cdot p_{H}=\{p_{H},-p_{H}\}$ , and $\overline{V}_{H}=V_{H}$ if $\unicode[STIX]{x1D6E4}_{H}\cdot p_{H}=p_{H}$ . The action of $G$ on $V_{H}$ induces an action on $\overline{V}_{H}$ , and we define by
the orbit map $\overline{\unicode[STIX]{x1D702}}_{H}:G\rightarrow \overline{V}_{H}$ , where $\overline{p}_{H}$ is the image of $p_{H}$ in $\overline{V}_{H}$ . Since $\bar{p}_{H}$ is an integral point, the orbit $\unicode[STIX]{x1D6E4}\cdot \overline{p}_{H}$ is discrete in $\overline{V}_{H}$ . Let $L_{H}$ be the Zariski closure of $\overline{\unicode[STIX]{x1D702}}_{H}(N(H,W))$ in $\overline{V}_{H}$ . By [Reference Dani and MargulisDM93, Proposition 3.2], we have $\overline{\unicode[STIX]{x1D702}}_{H}^{-1}(L_{H})=N(H,W)$ .
Proposition 9.2 [Reference Mozes and ShahMS95, Proposition 3.2].
Let $D$ be a compact subset of $L_{H}$ . Let
Then $S(D)\subset S(H,W)$ and $\unicode[STIX]{x1D70B}(S(D))$ is closed in $X$ . Moreover, for any compact subset ${\mathcal{K}}\subset X\setminus \unicode[STIX]{x1D70B}(S(D))$ , there exists a neighborhood $\unicode[STIX]{x1D6F7}$ of $D$ in $\overline{V}_{H}$ such that, for any $y\in \unicode[STIX]{x1D70B}(\overline{\unicode[STIX]{x1D702}}_{H}^{-1}(\unicode[STIX]{x1D6F7}))\cap {\mathcal{K}}$ , the set $\overline{\unicode[STIX]{x1D702}}_{H}(\unicode[STIX]{x1D70B}^{-1}(y))\cap \unicode[STIX]{x1D6F7}$ is a singleton.
9.2 Proof of Proposition 9.6
Now we begin to prove Proposition 9.6. Let $\{f_{1},f_{2},\ldots ,f_{m}\}$ be a set of polynomials defining $L_{H}$ in $\overline{V}_{H}$ . In the rest of the section we will fix a unit vector $\vec{v}\in \operatorname{Lie}(A)$ such that all the linear functionals $\unicode[STIX]{x1D712}\in {\mathcal{W}}(\mathfrak{g})$ are different on $\vec{v}$ . One can find $d\in \mathbb{N}$ and $\unicode[STIX]{x1D706}>0$ such that for any $x_{0}\in \operatorname{Lie}(A)$ , the functions of $t\in \mathbb{R}$ ,
belong to $E(d,\unicode[STIX]{x1D706})$ as defined in Definition 7.1. Here the norm $\Vert \cdot \Vert$ on $\overline{V}_{H}$ is induced by the norm $\Vert \cdot \Vert _{\mathfrak{g}}$ on $\mathfrak{g}$ . We write $\unicode[STIX]{x1D6FF}_{0}$ for the constant $\unicode[STIX]{x1D6FF}_{0}(d,\unicode[STIX]{x1D706})$ defined in Proposition 7.2.
Proposition 9.3 (Cf. [Reference Dani and MargulisDM93, Proposition 4.2]).
Let $C$ be a compact subset in $L_{H}$ and $\unicode[STIX]{x1D716}>0$ . Then there exists a compact subset $D$ in $L_{H}$ with $C\subset D$ such, that for any neighborhood $\unicode[STIX]{x1D6F7}$ of $D$ in $\overline{V}_{H}$ , there exists a neighborhood $\unicode[STIX]{x1D6F9}$ of $C$ in $\overline{V}_{H}$ with the following property. For $x_{0}\in \operatorname{Lie}(A)$ , $w\in \overline{V}_{H}$ , $\unicode[STIX]{x1D6EF}\subset [0,\unicode[STIX]{x1D6FF}_{0}]$ and $k\in \mathbb{N}$ , if $\{g_{k}\exp (x_{0}+t\vec{v})\cdot w:t\in \unicode[STIX]{x1D6EF}\}\not \subset \unicode[STIX]{x1D6F7}$ , then we have
Proof. Let $d$ and $\unicode[STIX]{x1D706}$ be defined as above. We choose a ball $B_{0}(r)$ of radius $r>0$ centered at 0 in $\overline{V}_{H}$ such that the closure $\overline{C}\subset B_{0}(r)$ . Now for $\unicode[STIX]{x1D716}>0$ , let $M>0$ be the constant as in Proposition 7.2. Denote by $B_{0}(M^{1/2}r)$ the ball of radius $M^{1/2}r>0$ centered at 0. Then we take
and we will prove the proposition for this $D$ .
Indeed, for any neighborhood $\unicode[STIX]{x1D6F7}$ of $D$ in $\overline{V}_{H}$ , one can find $\unicode[STIX]{x1D6FC}>0$ such that
Define
which is a neighborhood of $C$ in $\overline{V}_{H}$ , and contained in $\unicode[STIX]{x1D6F7}$ . We show that $\unicode[STIX]{x1D6F7}$ and $\unicode[STIX]{x1D6F9}$ satisfy the desired property.
Suppose
for $x_{0}\in \operatorname{Lie}(A)$ , $w\in \overline{V}_{H}$ and $\unicode[STIX]{x1D6EF}\subset [0,\unicode[STIX]{x1D6FF}_{0}]$ . Denote by $\mathfrak{I}$ the closed subset
One can write $\mathfrak{I}$ as a disjoint union of the connected components $I_{i}$ of $\mathfrak{I}$ ,
On each $I_{i}$ , we have either
or
for some $1\leqslant j\leqslant m.$ Since $\Vert g_{k}\exp (x_{0}+t\vec{v})\cdot w\Vert ^{2}$ and $f_{j}(g_{k}\exp (x_{0}+t\vec{v})\cdot w)\;(1\leqslant j\leqslant m)$ belong to $E(d,\unicode[STIX]{x1D706})$ , by Proposition 7.2 and the definition of $\unicode[STIX]{x1D6F9}$ , we obtain
Now we compute
This completes the proof of the proposition. ◻
For our purpose, we define a convex polytope in $R_{g_{k},\unicode[STIX]{x1D6FF}}$ as follows. By Proposition 8.4, we know that
Therefore, for each $k\in \mathbb{N}$ , we can find a constant $d_{k}>0$ such that
Then we denote by $R_{g_{k},\unicode[STIX]{x1D6FF}}^{\prime }$ the subset of points $\mathbf{t}\in \operatorname{Lie}(A)$ satisfying
for any $v\in \mathfrak{g}_{\unicode[STIX]{x1D712}}(\mathbb{Z})\setminus \{0\},\unicode[STIX]{x1D712}\in {\mathcal{W}}_{l}(\mathfrak{g}),1\leqslant l\leqslant \dim \mathfrak{g}-1.$ This is a convex polytope inside $R_{g_{k},\unicode[STIX]{x1D6FF}}$ .
In the following lemma we list some properties concerning $R_{g_{k},\unicode[STIX]{x1D6FF}}^{\prime }$ $(k\in \mathbb{N})$ .
Lemma 9.4. Let $d_{k}$ and $R_{g_{k}\unicode[STIX]{x1D6FF}}^{\prime }$ be as defined above.
(i) We have
$$\begin{eqnarray}\lim _{k\rightarrow \infty }\frac{\operatorname{Vol}(R_{g_{k},\unicode[STIX]{x1D6FF}}^{\prime })}{\operatorname{Vol}(R_{g_{k},\unicode[STIX]{x1D6FF}})}=1.\end{eqnarray}$$(ii) For $1\leqslant l\leqslant \dim \mathfrak{g}-1$ , a functional $\unicode[STIX]{x1D712}\in {\mathcal{W}}_{l}(\mathfrak{g})$ , a nonzero $v\in \mathfrak{g}_{\unicode[STIX]{x1D712}}(\mathbb{Z})$ and $k\in \mathbb{N}$ , we have
$$\begin{eqnarray}\Vert e^{\unicode[STIX]{x1D712}(\mathbf{t})}(\wedge ^{l}\operatorname{Ad}(g_{k})v)\Vert \geqslant \unicode[STIX]{x1D6FF}e^{d_{k}}\quad (\forall \mathbf{t}\in R_{g_{k},\unicode[STIX]{x1D6FF}}^{\prime }).\end{eqnarray}$$(iii) For any $x_{0}$ in the $\unicode[STIX]{x1D6FF}_{0}$ -neighborhood $R_{g_{k},\unicode[STIX]{x1D6FF}}^{\prime }$ and for the interval $\unicode[STIX]{x1D6EF}=[0,\unicode[STIX]{x1D6FF}_{0}]$ , there exists a constant $c>0$ which depends only on the linear functionals in ${\mathcal{W}}(\mathfrak{g})$ and $\unicode[STIX]{x1D6FF}_{0}$ , such that for any nonzero integer vector $w\in \wedge ^{l}\mathfrak{g}$ $(1\leqslant l\leqslant \dim \mathfrak{g}-1)$ , we have
$$\begin{eqnarray}\sup _{t\in \unicode[STIX]{x1D6EF}}\Vert \wedge ^{l}(\operatorname{Ad}(g_{k}\exp (x_{0}+t\vec{v})))\cdot w\Vert \geqslant c\unicode[STIX]{x1D6FF}e^{d_{k}}.\end{eqnarray}$$
Proof. The proof of the first claim is similar to that of Corollary 6.4. Indeed, let $\{f_{i}\}$ be the collection of the facets of $R_{g_{k},\unicode[STIX]{x1D6FF}}$ , and denote by $P_{i}$ the hyperplane determined by $f_{i}$ . For each $f_{i}$ , let $B_{i}$ be the unique cylinder with the following properties.
(a) The base of $B_{i}$ is $f_{i}$ , and the height of $B_{i}$ is equal to $d_{k}$ .
(b) $B_{i}$ and $R_{g_{k},\unicode[STIX]{x1D6FF}}$ lie in the same half-space determined by $P_{i}$ .
Then we have
and
Now the first claim follows from our choice of $d_{k}$ .
The second claim follows from the definition of $R_{g_{k},\unicode[STIX]{x1D6FF}}^{\prime }$ . To prove the last statement, we write, for any nonzero integer vector $w\in \wedge ^{l}\mathfrak{g}$ ,
where $w_{\unicode[STIX]{x1D712}}\in \mathfrak{g}_{\unicode[STIX]{x1D712}}(\mathbb{Z})$ . One can compute
Now the last claim follows from the second claim of the lemma and Proposition 7.5. ◻
The following proposition is an important step toward Proposition 9.6.
Proposition 9.5 (Cf. [Reference Mozes and ShahMS95, Proposition 3.4]).
Let $C$ be a compact subset in $L_{H}$ and $0<\unicode[STIX]{x1D716}<1$ . Then there exists a closed subset ${\mathcal{S}}$ in $\unicode[STIX]{x1D70B}(S(H,W))$ with the following property. For any compact set ${\mathcal{K}}\subset X\setminus {\mathcal{S}}$ , there exists a neighborhood $\unicode[STIX]{x1D6F9}$ of $C$ in $\overline{V}_{H}$ such that, for sufficiently large $k$ , for any $x_{0}$ in the $\unicode[STIX]{x1D6FF}_{0}$ -neighborhood of $R_{g_{k},\unicode[STIX]{x1D6FF}}^{\prime }$ and $\unicode[STIX]{x1D6EF}=[0,\unicode[STIX]{x1D6FF}_{0}]$ , we have
Proof. Let $D\subset L_{H}$ be a compact set as in Proposition 9.3 for $C$ and $\unicode[STIX]{x1D716}$ . Then we get a closed subset ${\mathcal{S}}=\unicode[STIX]{x1D70B}(S(D))$ as in Proposition 9.2. Now for a compact subset ${\mathcal{K}}$ in $X\setminus {\mathcal{S}}$ , let $\unicode[STIX]{x1D6F7}$ be an open neighborhood of $D$ in $\overline{V}_{H}$ as in Proposition 9.2. Then we have a neighborhood $\unicode[STIX]{x1D6F9}$ of $C$ in $\overline{V}_{H}$ as in Proposition 9.3.
By the choice of $x_{0}$ and Lemma 9.4, for any nonzero integer vector $w\in \wedge ^{d_{H}}\mathfrak{g}$ , we have
for some $c>0$ depending only on ${\mathcal{W}}(\mathfrak{g})$ and $\unicode[STIX]{x1D6FF}_{0}$ . Hence,
for sufficiently large $k$ .
Now, for any $s\in \unicode[STIX]{x1D6EF}$ with
by Proposition 9.2, there is a unique element $w_{s}$ in $\overline{\unicode[STIX]{x1D702}}_{H}(\unicode[STIX]{x1D6E4})$ such that
Let $I_{s}=[a_{s},b_{s}]$ be the largest closed interval in $\unicode[STIX]{x1D6EF}$ containing $s$ such that:
(i) for any $t\in I_{s}$ , we have $g_{k}\exp (x_{0}+t\vec{v})\cdot w_{s}\in \overline{\unicode[STIX]{x1D6F7}};$
(ii) either $g_{k}\exp (x_{0}+a_{s}\vec{v})\cdot w_{s}\text{ or }g_{k}\exp (x_{0}+b_{s}\vec{v})\cdot w_{s}\in \overline{\unicode[STIX]{x1D6F7}}\setminus \unicode[STIX]{x1D6F7}$ .
We denote by ${\mathcal{F}}$ the collection of all these intervals $I_{s}$ as $s$ runs over $\unicode[STIX]{x1D6EF}$ with
By Proposition 9.2, we know that the intervals in ${\mathcal{F}}$ cover $\unicode[STIX]{x1D6EF}$ at most twice. By Proposition 9.3, we have
This completes the proof of the proposition. ◻
Proposition 9.6. Let $C$ be a compact set in $L_{H}$ and $0<\unicode[STIX]{x1D716}<1$ . Then there exists a closed subset ${\mathcal{S}}$ in $\unicode[STIX]{x1D70B}(S(H,W))$ with the following property. For any compact set ${\mathcal{K}}\subset X\setminus {\mathcal{S}}$ , there exists a neighborhood $\unicode[STIX]{x1D6F9}$ of $C$ in $\overline{V}_{H}$ such that, for sufficiently large $k>0$ , we have
Proof. By Lemma 9.4, let $k$ be sufficiently large such that
We can find a cover of the region $R_{g_{k},\unicode[STIX]{x1D6FF}}^{\prime }$ by countably many disjoint small boxes of diameter at most $\unicode[STIX]{x1D6FF}_{0}$ such that each box is of the form
where $S$ is the base of $B$ perpendicular to $\vec{v}$ , and $\unicode[STIX]{x1D6EF}=[0,\unicode[STIX]{x1D6FF}_{0}]$ . Denote by ${\mathcal{F}}$ the collection of these boxes.
For any $B\in {\mathcal{F}}$ , and for any $x_{0}$ in the base $S$ of $B$ , $x_{0}$ is in the $\unicode[STIX]{x1D6FF}_{0}$ -neighborhood of $R_{g_{k},\unicode[STIX]{x1D6FF}}^{\prime }$ . By Proposition 9.5 we obtain that
for sufficiently large $k$ . By integrating the inequality above over the base $S$ , we have
By the choice of $d_{k}$ and ${\mathcal{F}}$ , for sufficiently large $k$ , we have
Now we compute
The proposition now follows. ◻
10 Proofs of Theorems 2.4, 2.5 and 2.6
Proof of Theorem 2.5.
We will prove the theorem by induction. Let $g_{k}=(u_{ij}(k))_{1\leqslant i,j\leqslant n}$ $(k\in \mathbb{N})$ be a sequence in the upper triangular unipotent subgroup $N$ of $\operatorname{SL}(n,\mathbb{R})$ , and for each pair $i<j$ , either $u_{ij}(k)$ is zero for all $k$ or $u_{ij}(k)\neq 0$ and diverges to infinity.
Suppose for a start that ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ . By passing to a subsequence, we may further assume that $\operatorname{Ad}g_{k}(\operatorname{Lie}A)$ converges to a subalgebra consisting of nilpotent elements in $\mathfrak{g}$ , in the space of the Grassmanian of $\mathfrak{g}$ . Then, by Proposition 8.6, after passing to a subsequence, $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax_{e}}]$ converges to $[\unicode[STIX]{x1D708}]$ for a probability measure $\unicode[STIX]{x1D708}$ . Furthermore, we have
and $\unicode[STIX]{x1D708}$ is invariant under the unipotent subgroup $W=\lim _{k\rightarrow \infty }\operatorname{Ad}(g_{k})A$ .
We will apply Ratner’s theorem and the technique of linearization to prove that $\unicode[STIX]{x1D708}$ is the Haar measure on $\operatorname{SL}(n,\mathbb{R})/\operatorname{SL}(n,\mathbb{Z})$ . According to Theorem 9.1, suppose by way of contradiction that for some $H\in {\mathcal{H}}^{\ast }$ $(H\neq G)$ we have $\unicode[STIX]{x1D708}(T_{H}(W))>0$ . Then we can find a compact subset $C\subset T_{H}(W)$ such that
Now let $0<\unicode[STIX]{x1D716}<\unicode[STIX]{x1D6FC}$ , $C_{1}=\overline{\unicode[STIX]{x1D702}}_{H}(C)$ and ${\mathcal{S}}$ the closed subset of $X$ as in Proposition 9.6. Since $C\cap {\mathcal{S}}=\emptyset$ , we can pick a compact neighborhood ${\mathcal{K}}\subset X\setminus S$ of $C$ . Then by Proposition 9.6, there exists a neighborhood $\unicode[STIX]{x1D6F9}$ of $C_{1}$ in $\overline{V}_{H}$ such that, for sufficiently large $k>0$ ,
and
This implies that
which contradicts the equation $\unicode[STIX]{x1D708}(C)=\unicode[STIX]{x1D6FC}$ . Hence $\unicode[STIX]{x1D708}$ is the Haar measure on $\operatorname{SL}(n,\mathbb{R})/\operatorname{SL}(n,\mathbb{Z})$ .
Now suppose that ${\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})\neq \{0\}$ . Then, by Corollary 5.3, the subgroup
is connected and nontrivial, and $\operatorname{Lie}(S)={\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})$ . This implies that all elements in $A$ and $\{{g_{k}\}}_{k\in \mathbb{N}}$ belong to the reductive group $C_{G}(S)^{0}$ . Moreover, by the definition of $S$ , $S$ is also the connected component of the center of $C_{G}(S)^{0}$ . So we have
where $H$ is the semisimple component of $C_{G}(S)^{0}$ and $H$ is isomorphic to the product of various $\operatorname{SL}(n_{i},\mathbb{R})$ with $n_{i}<n$ , that is,
Let $A_{i}=A\cap \operatorname{SL}(n_{i},\mathbb{R})$ be the connected component of the full diagonal subgroup in $\operatorname{SL}(n_{i},\mathbb{R})$ , and we have
Since $g_{k}\in N$ is unipotent $(\forall k\in \mathbb{N})$ , one has $g_{k}\in H$ . Then we can write $g_{k}=\prod g_{i,k}\in \prod \operatorname{SL}(n_{i},\mathbb{R})$ . Note that by the definition of $S$ and Corollary 5.3, ${\mathcal{A}}(A_{i},\{{g_{i,k}\}}_{k\in \mathbb{N}})=\{0\}$ for all $i$ .
The above discussions tell us that our problem can now be reduced to the following setting (recall that $x_{e}=e\operatorname{SL}(n,\mathbb{Z})$ ).
(i) The measure $\unicode[STIX]{x1D707}_{Ax_{e}}$ is supported in the homogeneous space $C_{G}(S)^{0}/(\unicode[STIX]{x1D6E4}\cap C_{G}(S)^{0})$ , where one has
$$\begin{eqnarray}\displaystyle C_{G}(S)^{0}/(\unicode[STIX]{x1D6E4}\cap C_{G}(S)^{0}) & = & \displaystyle S/(\unicode[STIX]{x1D6E4}\cap S)\,\times \,H/(\unicode[STIX]{x1D6E4}\cap H)\nonumber\\ \displaystyle & = & \displaystyle S\,\times \,\prod (\operatorname{SL}(n_{i},\mathbb{R})/\operatorname{SL}(n_{i},\mathbb{Z})).\nonumber\end{eqnarray}$$(ii) The measure $\unicode[STIX]{x1D707}_{Ax_{e}}$ can be decomposed, according to the decomposition of $C_{G}(S)^{0}/(\unicode[STIX]{x1D6E4}\cap C_{G}(S)^{0})$ , as
$$\begin{eqnarray}\unicode[STIX]{x1D707}_{Ax_{e}}=\unicode[STIX]{x1D707}_{S}\,\times \,\prod \unicode[STIX]{x1D707}_{A_{i}x_{i}}.\end{eqnarray}$$Here $\unicode[STIX]{x1D707}_{S}$ denotes the $S$ -invariant measure on $S$ . For each $i$ , $x_{i}=e\operatorname{SL}(n_{i},\mathbb{Z})$ is the identity coset in $\operatorname{SL}(n_{i},\mathbb{R})/\operatorname{SL}(n_{i},\mathbb{Z})$ , and $\unicode[STIX]{x1D707}_{A_{i}x_{i}}$ denotes the $A_{i}$ -invariant measure on $A_{i}x_{i}$ in $\operatorname{SL}(n_{i},\mathbb{R})/\operatorname{SL}(n_{i},\mathbb{Z})$ .(iii) The measure $\unicode[STIX]{x1D707}_{Ax_{e}}$ is pushed by the sequence $\{{g_{k}\}}_{k\in \mathbb{N}}$ in the space $C_{G}(S)^{0}/(\unicode[STIX]{x1D6E4}\cap C_{G}(S)^{0})$ in the following manner:
$$\begin{eqnarray}(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax_{e}}=\unicode[STIX]{x1D707}_{S}\,\times \,\prod (g_{i,k})_{\ast }\unicode[STIX]{x1D707}_{A_{i}x_{i}}.\end{eqnarray}$$(iv) For each $A_{i}x_{i}$ in $\operatorname{SL}(n_{i},\mathbb{R})/\operatorname{SL}(n_{i},\mathbb{Z})$ , we have ${\mathcal{A}}(A_{i},\{{g_{i,k}\}}_{k\in \mathbb{N}})=\{0\}$ .
Since $n_{i}<n$ , we can now apply the induction hypothesis to the sequence $(g_{i,k})_{\ast }\unicode[STIX]{x1D707}_{A_{i}x_{i}}$ , and obtain that $[g_{i,k}\unicode[STIX]{x1D707}_{A_{i}x_{i}}]$ converges to the equivalence class of the Haar measure $m_{\operatorname{SL}(n_{i},\mathbb{R})/\operatorname{SL}(n_{i},\mathbb{Z})}$ on $\operatorname{SL}(n_{i},\mathbb{R})/\operatorname{SL}(n_{i},\mathbb{Z})$ . Now, by putting all the measures $m_{\operatorname{SL}(n_{i},\mathbb{R})/\operatorname{SL}(n_{i},\mathbb{Z})}$ and $\unicode[STIX]{x1D707}_{S}$ back together in the space $\operatorname{SL}(n,\mathbb{R})/\operatorname{SL}(n,\mathbb{Z})$ , we conclude that $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax_{e}}]$ converges to $[\unicode[STIX]{x1D707}_{C_{G}(S)^{0}x_{e}}]$ . This completes the proof of Theorem 2.5.◻
Proof of Theorem 2.4.
We first prove the following claim.
Claim. Let $\{{u_{k}\}}_{k\in \mathbb{N}}$ be a sequence in the upper triangular unipotent group $N$ of $G=\operatorname{SL}(n,\mathbb{R})$ . Then there is a subsequence $\{{u_{i_{k}}\}}_{k\in \mathbb{N}}$ of $\{{u_{k}\}}_{k\in \mathbb{N}}$ such that
for a bounded sequence $\{{b_{k}\}}_{k\in \mathbb{N}}$ in $N$ , and a sequence $\{{v_{k}\}}_{k\in \mathbb{N}}$ in $N$ with $v_{k}=(v_{ij}(k))_{1\leqslant i,j\leqslant n}$ satisfying the following condition: for each pair $(i,j)$ $(1\leqslant i<j\leqslant n)$ ,
Proof of the claim.
We proceed by induction on $n$ . For $n=2$ and $G=\operatorname{SL}(2,\mathbb{R})$ , $\{{u_{k}\}}_{k\in \mathbb{N}}$ is a sequence in the $2\,\times \,2$ upper triangular unipotent group. Write $u_{k}=(u_{ij}(k))_{1\leqslant i,j\leqslant 2}$ . By passing to a subsequence, we may assume that $\{u_{12}(k)\}_{k\in \mathbb{N}}$ is bounded, or diverges to infinity. If $\{u_{12}(k)\}_{k\in \mathbb{N}}$ is bounded, then set $b_{k}=u_{k}$ , and $v_{k}=e$ the identity matrix. If $\{u_{12}(k)\}_{k\in \mathbb{N}}$ diverges to infinity, then set $b_{k}=e$ and $v_{k}=u_{k}$ . Either way we have $u_{k}=b_{k}v_{k}$ , and the claim holds in this case.
Suppose that the claim holds for $\operatorname{SL}(n-1,\mathbb{R})$ $(n\geqslant 3)$ . Now let $G=\operatorname{SL}(n,\mathbb{R})$ and $\{{u_{k}\}}_{k\in \mathbb{N}}$ a sequence in the $n\,\times \,n$ upper triangular unipotent group $N$ . We will use the notation in § 5: for any $g\in G$ , we will denote by $(g)_{l\,\times \,l}$ the $l\,\times \,l$ submatrix in the upper left corner of $g$ .
Now write $u_{k}=(u_{ij}(k))_{1\leqslant i,j\leqslant n}$ . Then $(u_{k})_{(n-1)\,\times \,(n-1)}=(u_{ij}(k))_{1\leqslant i,j\leqslant n-1}$ . By applying the induction hypothesis on $(u_{k})_{(n-1)\,\times \,(n-1)}$ , after passing to a subsequence, one can find a bounded sequence $\{{w_{k}\}}_{k\in \mathbb{N}}$ in $N$ and a sequence $\{{x_{k}\}}_{k\in \mathbb{N}}$ in $N$ with $x_{k}=(x_{ij}(k))_{1\leqslant i,j\leqslant n}$ such that
and for each pair $(i,j)$ $(1\leqslant i<j\leqslant n-1)$ ,
Now, by passing to a subsequence, one can assume that for $1\leqslant i\leqslant n-1$ ,
By Gaussian elimination, there exist a bounded sequence $y_{k}\in N$ and a sequence $v_{k}\in N$ such that
and the following condition holds for $v_{k}=(v_{ij}(k))_{1\leqslant i,j\leqslant n}$ : for any $1\leqslant i<j\leqslant n$ ,
Now we complete the proof of the claim by setting $v_{k}$ as above and $b_{k}=w_{k}y_{k}$ .◻
We now prove Theorem 2.4. By the Iwasawa decomposition, for each element $g_{k}$ in the sequence $\{{g_{k}\}}_{k\in \mathbb{N}}$ , we can write
where $s_{k}\in K=\operatorname{SO}(n,\mathbb{R})$ , $u_{k}\in N$ and $a_{k}\in A$ . By the claim above, we can assume that, after passing to a subsequence, we can write
for a bounded sequence $b_{k}\in N$ and a sequence $\tilde{u} _{k}=(\tilde{u} _{ij}(k))_{1\leqslant i,j\leqslant n}$ in $N$ such that, for each pair $1\leqslant i<j\leqslant n$ ,
Since $\unicode[STIX]{x1D707}_{Ax}$ is $A$ -invariant, we have
Now the first paragraph of Theorem 2.4 follows by applying Theorem 2.5 to $(\tilde{u} _{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}$ and the boundedness of $\{{b_{k}\}}_{k\in \mathbb{N}}$ and $\{{s_{k}\}}_{k\in \mathbb{N}}$ .
We now prove the second paragraph of Theorem 2.4. Assume that, for any $Y\in \operatorname{Lie}(A)\setminus {\mathcal{A}}(A,\{{g_{k}\}}_{k\in \mathbb{N}})$ , $\{\operatorname{Ad}(g_{k})Y\}_{k\in \mathbb{N}}$ diverges. Let $[\unicode[STIX]{x1D708}]$ be a limit point of $\{[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Ax}]\}_{k\in \mathbb{N}}$ . Then there is a subsequence $\{{g_{i_{k}}\}}_{k\in \mathbb{N}}$ such that $[(g_{i_{k}})_{\ast }\unicode[STIX]{x1D707}_{Ax}]$ converges to $[\unicode[STIX]{x1D708}]$ . By the same argument as above, after passing to a subsequence of $\{g_{i_{k}}\}$ , one can find $s_{k}\in K$ , $\tilde{u} _{k}=(\tilde{u} _{ij}(k))_{1\leqslant i,j\leqslant n}\in N$ , $a_{k}\in A$ , and a bounded sequence $b_{k}\in N$ such that
and, for any $1\leqslant i<j\leqslant n$ ,
Since $\unicode[STIX]{x1D707}_{Ax}$ is $A$ -invariant, we have
Note that, by the boundedness of $\{{b_{k}\}}_{k\in \mathbb{N}}$ and $\{{s_{k}\}}_{k\in \mathbb{N}}$ ,
Now the second paragraph of Theorem 2.4 follows from Theorem 2.5 and the boundedness of $\{{b_{k}\}}_{k\in \mathbb{N}}$ and $\{{s_{k}\}}_{k\in \mathbb{N}}$ .◻
The following is an immediate corollary from the proof of Theorem 2.4, which gives an example of $\unicode[STIX]{x1D706}_{k}$ in a special case of Theorem 2.8. This also generalizes the result in [Reference Oh and ShahOS14]. We will apply this special case of Theorem 2.8 in the counting problem in § 11.
Corollary 10.1 (Cf. Theorem 2.8).
Let $\{{g_{k}\}}_{k\in \mathbb{N}}$ be a sequence in $KN$ such that, for any nonzero $Y\in \operatorname{Lie}(A)$ , the sequence $\{\operatorname{Ad}(g_{k})Y\}_{k\in \mathbb{N}}$ diverges to infinity. Then we have
where $m_{X}$ is the $G$ -invariant probability measure on $X$ .
In the rest of this section we will prove Theorem 2.6. Let $H$ be a connected reductive group containing $A$ . It is known that, up to conjugation by an element in the Weyl group of $G$ , $H$ consists of diagonal blocks, with each block isomorphic to $\operatorname{GL}(m,\mathbb{R})$ with $m<n$ . For convenience, we will assume that $H$ has the form of diagonal blocks, since conjugations by Weyl elements do not affect the theorem.
The following lemma clarifies an assumption in Theorem 2.6.
Lemma 10.2. Let $Ax$ be a divergent orbit in $X$ and let $H$ be a connected reductive group containing $A$ . Then $Hx$ is closed in $X$ .
Proof. By the classification of divergent $A$ -orbits of Margulis which appears in the appendix of [Reference Tomanov and WeissTW03], we may assume without loss of generality that $x$ is commensurable to $\mathbb{Z}^{n}$ . Thus, it is enough to prove the lemma for $x=\mathbb{Z}^{n}$ . Then the lemma follows easily for any reductive group $H$ under consideration.◻
By reasoning in the same way as at the beginning of § 4, there is no harm in assuming $x=x_{e}=e\operatorname{SL}(n,\mathbb{Z})$ in the proof of Theorem 2.6.
Let $P$ be the standard $\mathbb{Q}$ -parabolic subgroup in $G$ having $H$ as (the connected component of) a Levi component. Let $U\subset N$ be the unipotent radical of $P$ . We write
where $S$ is the connected component of the center of $H$ , and $H_{ss}$ is the semisimple component of $H$ . We will denote by $A_{ss}$ the connected component of the full diagonal group in $H_{ss}$ . Note that we have
By Theorem 2.5, we can find a sequence of upper triangular unipotent matrices $h_{k}\in H$ satisfying the dichotomy condition in Theorem 2.5 such that
We will fix such a sequence $\{{h_{k}\}}_{k\in \mathbb{N}}$ .
As $G=KUH$ where $K=\operatorname{SO}(n,\mathbb{R})$ , for every $g_{k}$ in the sequence $\{{g_{k}\}}_{k\in \mathbb{N}}$ , we can write
where $s_{k}\in K$ , $u_{k}\in U$ and $l_{k}\in H$ . We have
Following the same strategy as in the proof of Theorem 2.4, to prove Theorem 2.6, we may assume that $g_{k}\in U$ . Now let $g_{k}=(u_{ij}(k))_{1\leqslant i,j\leqslant k}\in U$ . By Gaussian elimination as explained in the proof of Theorem 2.4, we may further assume that, for each pair $i<j$ , either $u_{ij}(k)$ equals $0$ for all $k$ or $u_{ij}(k)\neq 0$ diverges to infinity.
Proposition 10.3. If ${\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ , then, for any subsequence $\{{g_{m_{k}}\}}_{k\in \mathbb{N}}$ of $\{{g_{k}\}}_{k\in \mathbb{N}}$ and any subsequence $\{{h_{n_{k}}\}}_{k\in \mathbb{N}}$ of $\{{h_{k}\}}_{k\in \mathbb{N}}$ , we have $\operatorname{Ad}(g_{m_{k}}h_{n_{k}})Y\rightarrow \infty$ as $k\rightarrow \infty$ for any nonzero $Y\in \operatorname{Lie}(A)$ .
Proof. Let $Y=Y_{1}+Y_{2}\neq 0$ , where $Y_{1}\in \operatorname{Lie}(S)$ and $Y_{2}\in \operatorname{Lie}(A_{ss})$ . If $Y_{2}=0$ , then
diverges to $\infty$ by the condition ${\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ and Corollary 5.3. If $Y_{2}\neq 0$ , then we have
Since $H$ normalizes $U$ , we know that
Also $\operatorname{Ad}(h_{n_{k}})Y_{2}\in \operatorname{Lie}(H)$ and $\operatorname{Ad}(h_{n_{k}})Y_{2}\rightarrow \infty$ by our choice of $\{{h_{k}\}}_{k\in \mathbb{N}}$ and Corollary 5.3. Hence, $\operatorname{Ad}(g_{m_{k}}h_{n_{k}})Y$ diverges to $\infty$ .◻
We will fix a nonnegative function $f_{0}\in C_{c}(X)$ such that $\operatorname{supp}(f_{0})$ contains the compact orbit $N\mathbb{Z}^{n}$ in $X$ . This implies that, for any $g\in N$ , we have
Proposition 10.4. Suppose that the subalgebra ${\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ . Let $f\in C_{c}(X)$ . Then, for any $\unicode[STIX]{x1D716}>0$ , there exists $M>0$ such that, for any $m,n>M$ ,
Proof. Suppose that there exists $\unicode[STIX]{x1D716}>0$ such that, for any $l>0$ , there are $m_{l},n_{l}>l$ satisfying
By Proposition 10.3, we know that $\operatorname{Ad}(g_{m_{l}}h_{n_{l}})Y\rightarrow \infty$ as $l\rightarrow \infty$ for any nonzero $Y\in \operatorname{Lie}(A)$ . Hence, by Theorem 2.4, we have
which contradicts the inequality above. This completes the proof of the proposition. ◻
Proof of Theorem 2.6.
We will prove the theorem by induction. Let $\{{g_{k}\}}_{k\in \mathbb{N}}$ be a sequence in $G$ and, by the discussions above, we may assume that every $g_{k}=(u_{ij}(k))_{1\leqslant i,j\leqslant n}$ is in $U\subset N$ , and for $1\leqslant i<j\leqslant n$ , $u_{ij}(k)$ either equals 0 for all $k$ or diverges to infinity as $k\rightarrow \infty$ .
Suppose that ${\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}})=\{0\}$ . Let $f\in C_{c}(X)$ . By Proposition 10.4, for any $\unicode[STIX]{x1D716}>0$ , there exists $M>0$ such that, for any $m,n>M$ ,
We now fix $m$ , let $n\rightarrow \infty$ and obtain
This implies that $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Hx_{e}}]\rightarrow [m_{X}]$ .
Now suppose that ${\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}})\neq \{0\}$ . The proof in this case would be similar to that of Theorem 2.5. By Corollary 5.3, the subgroup
is connected and nontrivial, and $\operatorname{Lie}(S^{\prime })={\mathcal{A}}(S,\{{g_{k}\}}_{k\in \mathbb{N}})$ . This implies that all elements of $H$ and $\{g_{k}\}$ belong to $C_{G}(S^{\prime })^{0}$ . Moreover, we have
where $H^{\prime }$ is the semisimple component of $C_{G}(S^{\prime })^{0}$ and $H^{\prime }$ is isomorphic to the product of various $\operatorname{SL}(n_{i},\mathbb{R})$ with $n_{i}<n$ ,
Let $H_{i}$ be the reductive subgroup $H\cap \operatorname{SL}(n_{i},\mathbb{R})$ in $\operatorname{SL}(n_{i},\mathbb{R})$ , and we have
Since $g_{k}\in U$ is unipotent $(\forall k\in \mathbb{N})$ , we have $g_{k}\in H^{\prime }$ . Then we can write $g_{k}=\prod _{i}g_{i,k}$ ( $g_{i,k}\in \operatorname{SL}(n_{i},\mathbb{R})$ ).
Similarly to the proof of Theorem 2.5, the above discussions imply that the problem is reduced to the following setting.
(i) The measure $\unicode[STIX]{x1D707}_{Hx_{e}}$ is supported in the homogeneous space $C_{G}(S^{\prime })^{0}/(\unicode[STIX]{x1D6E4}\cap C_{G}(S^{\prime })^{0})$ , where we have
$$\begin{eqnarray}\displaystyle C_{G}(S^{\prime })^{0}/(\unicode[STIX]{x1D6E4}\cap C_{G}(S^{\prime })^{0}) & = & \displaystyle S^{\prime }/(\unicode[STIX]{x1D6E4}\cap S^{\prime })\,\times \,H^{\prime }/(\unicode[STIX]{x1D6E4}\cap H^{\prime })\nonumber\\ \displaystyle & = & \displaystyle S^{\prime }\,\times \,\prod (\operatorname{SL}(n_{i},\mathbb{R})/\operatorname{SL}(n_{i},\mathbb{Z})).\nonumber\end{eqnarray}$$(ii) The measure $\unicode[STIX]{x1D707}_{Hx_{e}}$ can be decomposed, according to the decomposition of $C_{G}(S^{\prime })^{0}/(\unicode[STIX]{x1D6E4}\cap C_{G}(S^{\prime })^{0})$ , as
$$\begin{eqnarray}\unicode[STIX]{x1D707}_{Hx_{e}}=\unicode[STIX]{x1D707}_{S^{\prime }}\,\times \,\prod \unicode[STIX]{x1D707}_{H_{i}x_{i}}.\end{eqnarray}$$Here $\unicode[STIX]{x1D707}_{S^{\prime }}$ denotes the $S^{\prime }$ -invariant measure on $S^{\prime }$ . For each $i$ , $x_{i}=e\operatorname{SL}(n_{i},\mathbb{Z})$ is the identity coset in $\operatorname{SL}(n_{i},\mathbb{R})/\operatorname{SL}(n_{i},\mathbb{Z})$ , and $\unicode[STIX]{x1D707}_{H_{i}x_{i}}$ denotes the $H_{i}$ -invariant measure on $H_{i}x_{i}$ in $\operatorname{SL}(n_{i},\mathbb{R})/\operatorname{SL}(n_{i},\mathbb{Z})$ .(iii) The measure $\unicode[STIX]{x1D707}_{Hx_{e}}$ is pushed by the sequence $\{g_{k}\}$ in the space $C_{G}(S^{\prime })^{0}/(\unicode[STIX]{x1D6E4}\cap C_{G}(S^{\prime })^{0})$ in the following way:
$$\begin{eqnarray}(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Hx_{e}}=\unicode[STIX]{x1D707}_{S^{\prime }}\,\times \,\prod (g_{i,k})_{\ast }\unicode[STIX]{x1D707}_{H_{i}x_{i}}.\end{eqnarray}$$(iv) If $S_{i}$ is the connected component of the center of $H_{i}$ , then we have ${\mathcal{A}}(S_{i},\{{g_{i,k}\}}_{k\in \mathbb{N}})=\{0\}$ .
Since $n_{i}<n$ , we can now apply the induction hypothesis to the sequence $(g_{i,k})_{\ast }\unicode[STIX]{x1D707}_{H_{i}x_{i}}$ , and obtain that $[(g_{i,k})_{\ast }\unicode[STIX]{x1D707}_{H_{i}x_{i}}]$ converges to the equivalence class of the Haar measure $m_{\operatorname{SL}(n_{i},\mathbb{R})/\operatorname{SL}(n_{i},\mathbb{Z})}$ on $\operatorname{SL}(n_{i},\mathbb{R})/\operatorname{SL}(n_{i},\mathbb{Z})$ . Now, by putting all the measures $[m_{\operatorname{SL}(n_{i},\mathbb{R})/\operatorname{SL}(n_{i},\mathbb{Z})}]$ and $\unicode[STIX]{x1D707}_{S^{\prime }}$ back together in the space $\operatorname{SL}(n,\mathbb{R})/\operatorname{SL}(n,\mathbb{Z})$ , we have $[(g_{k})_{\ast }\unicode[STIX]{x1D707}_{Hx_{e}}]\rightarrow [\unicode[STIX]{x1D707}_{C_{G}({\mathcal{A}}(S,\{g_{k}\}))^{0}x_{e}}]$ .◻
11 An application to a counting problem
In this section we will prove Theorem 2.10. Let $p_{0}(\unicode[STIX]{x1D706})$ be a monic polynomial in $\mathbb{Z}[x]$ such that $p_{0}(\unicode[STIX]{x1D706})$ splits completely in $\mathbb{Q}$ . Then, by Gauss’s lemma, we have $p_{0}(\unicode[STIX]{x1D706})=(\unicode[STIX]{x1D706}-\unicode[STIX]{x1D6FC}_{1})(\unicode[STIX]{x1D706}-\unicode[STIX]{x1D6FC}_{2})\cdots (\unicode[STIX]{x1D706}-\unicode[STIX]{x1D6FC}_{n})$ for $\unicode[STIX]{x1D6FC}_{i}\in \mathbb{Z}$ . We assume that the $\unicode[STIX]{x1D6FC}_{i}$ are distinct and nonzero. Let $M(n,\mathbb{R})$ be the space of $n\,\times \,n$ matrices with the norm
for $M=(x_{ij})_{1\leqslant i,j\leqslant n}$ . Note that this norm is $\operatorname{Ad}(K)$ -invariant. We will denote by $B_{T}$ the ball of radius $T$ centered at 0 in $M(n,\mathbb{R})$ . We write
For $M\in M(n,\mathbb{R})$ , we denote by $p_{M}(\unicode[STIX]{x1D706})$ the characteristic polynomial of $M$ . We consider
and its subset of integral points
We would like to get an asymptotic formula for
We begin with the following proposition which is a corollary of [Reference BorelBHC62, Reference Latimer and MacDuffeeLM33].
Proposition 11.1. We have
and there are finitely many $\operatorname{SL}(n,\mathbb{Z})$ -orbits in $V(\mathbb{Z})$ . The number of $\operatorname{SL}(n,\mathbb{Z})$ -orbits in $V(\mathbb{Z})$ is equal to the number of classes of nonsingular ideals in the ring $\mathbb{Z}[M_{\unicode[STIX]{x1D6FC}}]$ .
By Proposition 11.1, it suffices to compute the integral points of an $\operatorname{SL}(n,\mathbb{Z})$ -orbit. In what follows, we will consider the $\operatorname{SL}(n,\mathbb{Z})$ -orbit of $M_{\unicode[STIX]{x1D6FC}}$ . We will apply Theorem 2.8 (more precisely, Corollary 10.1) with initial point $x=e\unicode[STIX]{x1D6E4}$ to compute
For any other $\operatorname{SL}(n,\mathbb{Z})$ -orbit of $M^{\prime }\in V(\mathbb{Z})$ , there exists $M_{q}\in \operatorname{SL}(n,\mathbb{Q})$ such that
and the treatment for $\operatorname{Ad}(\operatorname{SL}(n,\mathbb{Z}))M^{\prime }$ would be similar, just with a change of initial point from $e\unicode[STIX]{x1D6E4}$ to $x_{q}=M_{q}\unicode[STIX]{x1D6E4}$ . See also the beginning of § 4.
As explained in § 2, the metric $\Vert \cdot \Vert _{\mathfrak{g}}$ on $\mathfrak{g}$ defines a Haar measure $\unicode[STIX]{x1D707}_{A}$ on $A$ and a Haar measure $\unicode[STIX]{x1D707}_{N}$ on $N$ . Let $\unicode[STIX]{x1D707}_{K}$ be the $K$ -invariant probability measure on $K$ . Then we define a Haar measure $\unicode[STIX]{x1D707}_{G}$ on $G$ by Iwasawa decomposition $G=KNA$ . Let $c_{X}$ be the volume of $X=G/\unicode[STIX]{x1D6E4}$ with respect to $\unicode[STIX]{x1D707}_{G}$ .
Now let $h=(u_{ij})_{1\leqslant i,j\leqslant n}\in N$ and write
where $x_{ii}=\unicode[STIX]{x1D6FC}_{i}$ and $u_{ij}=0$ $(i>j)$ . We have
and
Let $q_{i}(x)=\prod _{k=1}^{i}(x-\unicode[STIX]{x1D6FC}_{k}).$ The next two lemmas describe the relation between $u_{ij}$ and $x_{ij}$ .
Lemma 11.2. For $j>i$ , we have
where $f_{ij}$ is a polynomial in variables $x_{pq}$ with $0<q-p<j-i$ , and $f_{ij}=0$ for $j-i=1$ . In particular, we have the change of coordinates of the Haar measure $\unicode[STIX]{x1D707}_{N}$ on $N$ ,
Proof. It is easy to see that $u_{ij}=x_{ij}=0$ $(i>j)$ and $u_{ii}=1$ . We prove the lemma by induction on $j-i$ . For $j-i=1$ , we have
Now we have
where $j-k<j-i$ . We complete the proof by applying the induction hypothesis on $u_{kj}$ .◻
Lemma 11.3. For $j>i$ , we have
where $f_{ij}(x)$ is a polynomial in variables $x_{pq}\;(p<q)$ of degree less than $j-i$ .
Proof. We prove the lemma by induction on $j-i$ . For $j-i=1$ , we have
Now we have
where $j-k<j-i$ . By applying the induction hypothesis on $u_{kj}$ we have
Here we omit terms of degree less than $j-i$ . This completes the proof of the lemma.◻
Lemma 11.4. For any $1\leqslant l\leqslant n$ and $1\leqslant i_{1}<i_{2}<\cdots <i_{l}\leqslant n$ , we have
Proof. By algebraic manipulations, we can rewrite the determinant above as
Since $\deg q_{i}=i$ , row reductions yield
◻
Proposition 11.5. For any $h\in N$ (recall $\operatorname{Ad}(h)M_{\unicode[STIX]{x1D6FC}}=(x_{ij})_{1\leqslant i,j\leqslant n}$ ), we have
Here $c(i_{1},i_{2},\ldots ,i_{l})$ is the number in Lemma 11.4 and we omit the terms of polynomials in variables $x_{pq}\;(p<q)$ of degree smaller than $\sum _{j=1}^{l}(i_{j}-j)$ .
Proof. By Lemma 11.3, we know that $u_{ij}$ is a polynomial of degree $j-i$ . This implies that the term in $h(e_{i_{1}}\wedge e_{i_{2}}\wedge \cdots \wedge e_{i_{l}})$ corresponding to the $e_{j_{1}}\wedge e_{j_{2}}\wedge \cdots \wedge e_{j_{l}}$ -coordinate has degree at most $i_{1}+i_{2}+\cdots +i_{l}-j_{1}-j_{2}-\cdots -j_{l}$ . To prove the proposition, it suffices to prove that the term corresponding to $e_{1}\wedge e_{2}\wedge \cdots \wedge e_{l}$ is a polynomial with its leading term
of degree $i_{1}+i_{2}+\cdots +i_{l}-1-2-\cdots -l$ .
We know that the coefficient of $e_{1}\wedge e_{2}\wedge \cdots \wedge e_{l}$ is equal to
and by Lemma 11.3 we know that the leading term of this coefficient is equal to
The expansion formula of determinant then gives
where $\unicode[STIX]{x1D70E}$ runs over all the permutations in the symmetric group $S_{l}$ . Note that we have
This implies that
where $c(i_{1},i_{2},\ldots ,i_{l})$ is the number as in Lemma 11.4.◻
Now we define
Lemma 11.6. We have
Here $\operatorname{Vol}(B_{1})$ is the volume of the unit ball in $\mathbb{R}^{n(n-1)/2}$ .
Proof. This follows immediately from Lemma 11.2. ◻
Let $\unicode[STIX]{x1D707}_{G/A}$ be the $G$ -invariant measure on $G/A$ . In the following, we compute the volume of $V(\mathbb{R})\cap B_{T}$ with respect to a volume form $\unicode[STIX]{x1D707}_{V(\mathbb{R})}$ on $V(\mathbb{R})$ induced by a $G$ -invariant measure on $G/C_{G}(A)$ . We may assume that the natural projection map $G/A\rightarrow G/C_{G}(A)$ sends $\unicode[STIX]{x1D707}_{G/A}$ to $\unicode[STIX]{x1D707}_{V(\mathbb{R})}$ . By Iwasawa decomposition, we have $G/A\cong KN$ , and it is well known that, for any $f\in C_{c}(G/A)$ ,
via this isomorphism.
Proposition 11.7. The volume of $V(\mathbb{R})\cap B_{T}$ with respect to the volume form $\unicode[STIX]{x1D707}_{V(\mathbb{R})}$ equals
Here $\operatorname{Vol}(B_{1})$ is as in Lemma 11.6.
Proof. Note that by the discussion above, we have
By Lemma 11.6 and the $\operatorname{Ad}(K)$ -invariance of the norm on $M(n,\mathbb{R})$ , we compute
This completes the proof of the proposition. ◻
Proposition 11.8. For any $k\in K$ and $h\in N(T)$ , we have
where the implicit constant depends only on $\unicode[STIX]{x1D6FF}$ and $M_{\unicode[STIX]{x1D6FC}}$ . Furthermore, for $h\in N(\unicode[STIX]{x1D716},T)$ , we have
where the implicit constant depends on $\unicode[STIX]{x1D716},\unicode[STIX]{x1D6FF},M_{\unicode[STIX]{x1D6FC}}$ , and $c_{0}$ equals the volume of
Proof. From the definition of $\unicode[STIX]{x1D6FA}_{kh,\unicode[STIX]{x1D6FF}}$ , we know that
Since $k\in \operatorname{SO}(n,\mathbb{R})$ , by Proposition 11.5, for any $i_{1}<i_{2}<\cdots <i_{l}$ we have
where the implicit constant depends only on $\unicode[STIX]{x1D6FF}$ and $M_{\unicode[STIX]{x1D6FC}}$ . Moreover, if $h\in N(\unicode[STIX]{x1D716},T)$ then we have
where the implicit constant depends only on $\unicode[STIX]{x1D716}$ , $\unicode[STIX]{x1D6FF}$ and $M_{\unicode[STIX]{x1D6FC}}$ . The proposition now follows from these equations.◻
Define
where $\unicode[STIX]{x1D712}_{T}$ is the characteristic function of $B_{T}$ in $M(n,\mathbb{R})$ and $\unicode[STIX]{x1D6E4}_{M_{\unicode[STIX]{x1D6FC}}}$ is the stabilizer of $M_{\unicode[STIX]{x1D6FC}}$ in $\unicode[STIX]{x1D6E4}$ . This defines a function on $G/\unicode[STIX]{x1D6E4}$ . Note that $\unicode[STIX]{x1D712}_{T}$ is $\operatorname{Ad}(K)$ -invariant and $\unicode[STIX]{x1D6E4}_{M_{\unicode[STIX]{x1D6FC}}}$ is finite. In the following proposition, we will write
for any two functions $f,\unicode[STIX]{x1D719}$ on $G/\unicode[STIX]{x1D6E4}$ , whenever this integral is valid.
Proposition 11.9. For any $\unicode[STIX]{x1D713}\in C_{c}(G/\unicode[STIX]{x1D6E4})$ , we have
Here
where $c_{0}$ is the number as in Proposition 11.8 and $\operatorname{Vol}(B_{1})$ is as in Lemma 11.6.
Proof. We have
Now fix $\unicode[STIX]{x1D716}>0$ . By Corollary 10.1, we can continue the calculation and we have
Note that, since $\unicode[STIX]{x1D713}\in C_{c}(G/\unicode[STIX]{x1D6E4})$ , we can find $\unicode[STIX]{x1D6FF}_{\unicode[STIX]{x1D713}}>0$ such that
So by Lemma 11.6 and Proposition 11.8, we can continue the calculation and we have
This implies that
We complete the proof by letting $\unicode[STIX]{x1D716}\rightarrow 0$ .◻
Proof of Theorem 2.10.
Following the same proofs as in [Reference Duke, Rudnick and SarnakDRS93, Reference Eskin, Mozes and ShahEMS96], and combining Lemma 11.6 and Proposition 11.9, we conclude that
Now Theorem 2.10 follows from this equation and Proposition 11.1. ◻
Acknowledgements
We would like to express our gratitude to Nimish Shah for insightful comments and support on this work. We are grateful to Barak Weiss for valuable communications and Roy Meshulam for teaching us Lemma 4.4. We thank Ofir David, Asaf Katz, Rene Rühr, Oliver Sargent, Lei Yang, Pengyu Yang and Runlin Zhang for helpful discussions and support. We also thank the anonymous referees for carefully reading our manuscript and for their valuable comments.